MongoDB with pyMongo I - Installation
List of MongoDB with PyMongo
- MongoDB with PyMongo I - Installing MongoDB
- MongoDB with PyMongo II - Connecting and accessing MongoDB
- MongoDB with pyMongo III - Range Querying MongoDB
- MongoDB RESTful API with Flask
- Flexible schema - supports hierarchical data structure.
- Oriented toward programmers - it supports associative arrays such as php arrays, python dictionaries, JSON objects, Ruby hash etc.
- Lots of MongoDB Drivers and Client Libraries
Drivers in MongoDB are used for connectivity between client applications and the database. For example, if we have a Python program and we want to connect to MongoDB, then we need to download and integrate the Python driver so that the program can work with the MongoDB database. PyMongo is the driver for Python.
Picture source : MongoDB Introduction; Schema Design
- Flexible deployment.
- Designed for BigData.
- Aggregation Framework.
This tutorial is largely based on PyMongo and Install MongoDB on Ubuntu.
We need to make sure that the PyMongo distribution installed. If so, in the Python shell, the following should run without raising an exception:
>>> import pymongo
If not, on Ubuntu 14, install it like this:
$ sudo apt-get install python-setuptools $ sudo easy_install pymongo
Or just use "pip"
$ pip install pymongo
PyMongo is just a driver. How about the MongoDB?
Reference: Install MongoDB on Ubuntu.
The Ubuntu package management tool (i.e. dpkg and apt) ensure package consistency and authenticity by requiring that distributors sign packages with GPG keys. Issue the following command to import the MongoDB public GPG Key:
$ sudo apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv EA312927
Create a /etc/apt/sources.list.d/mongodb.list file using the following command.
$ echo "deb http://repo.mongodb.org/apt/ubuntu trusty/mongodb-org/3.2 multiverse" | sudo tee /etc/apt/sources.list.d/mongodb-org-3.2.list
Now issue the following command to reload your repository:
$ sudo apt-get update
The following command installs the latest stable version of MongoDB:
$ sudo apt-get install -y mongodb-org
When this command completes, you have successfully installed MongoDB! Continue for configuration and start-up suggestions.
Ref : Run MongoDB Community Edition
"The MongoDB instance stores its data files in /var/lib/mongo and its log files in /var/log/mongo, and runs using the mongod user account. If you change the user that runs the MongoDB process, you must modify the access control rights to the /var/lib/mongo and /var/log/mongo directories."
Start / Stop MongoDB
$ sudo service mongodb start start: Job is already running: mongodb $ sudo service mongodb stop mongodb stop/waiting $ sudo service mongodb start mongodb start/running, process 12648
Restart MongoDB
You may restart the mongod process by issuing the following command:
sudo service mongodb restart
mongod is the primary daemon process for the MongoDB system. It handles data requests, manages data format, and performs background management operations.
We can start the mongod process by issuing the following command:
$ sudo mongod start
We may get dbpath error because unless specified, the mongod will look for data files in the default /data/db directory, but it's not there or the directory does not have proper permission. So, we should do this:
$ sudo mkdir -p /data/db $ sudo chmod -R 755 /data/db
# m.py def get_db(): from pymongo import MongoClient client = MongoClient('localhost:27017') db = client.myFirstMB return db def add_country(db): db.countries.insert({"name" : "Canada"}) def get_country(db): return db.countries.find_one() if __name__ == "__main__": db = get_db() add_country(db) print get_country(db)
Run:
$ python m.py /usr/lib/python2.7/dist-packages/pkg_resources.py:1031: UserWarning: /home/k/.python-eggs is writable by group/others and vulnerable to attack when used with get_resource_filename. Consider a more secure location (set with .set_extraction_path or the PYTHON_EGG_CACHE environment variable). warnings.warn(msg, UserWarning) {u'_id': ObjectId('53c181ce8ce4b12ad763c1dd'), u'name': u'Canada'}
We got a successful run with a warning related to the permission for group/others, but we can easily remove the waring:
$ chmod g-wx,o-wx ~/.python-eggs $ python m.py {u'_id': ObjectId('53c181ce8ce4b12ad763c1dd'), u'name': u'Canada'}
Our task for successful run has been accomplished. We just wanted to see how pymongo works and how easy it is to start using it.
Note that the code was run against a MongoDB instance (mongod) that we have provided.
$ ps -ef|grep mongod mongodb 10407 1 1 10:18 ? 00:01:57 /usr/bin/mongod --config /etc/mongodb.conf
For more details about the inner workings of the code, please visit my next tutorial: Connecting and accessing MongoDB.
List of MongoDB with PyMongo
- MongoDB with PyMongo I - Installing MongoDB
- MongoDB with PyMongo II - Connecting and accessing MongoDB
- MongoDB with pyMongo III - Range Querying MongoDB
- MongoDB RESTful API with Flask
Python tutorial
Python Home
Introduction
Running Python Programs (os, sys, import)
Modules and IDLE (Import, Reload, exec)
Object Types - Numbers, Strings, and None
Strings - Escape Sequence, Raw String, and Slicing
Strings - Methods
Formatting Strings - expressions and method calls
Files and os.path
Traversing directories recursively
Subprocess Module
Regular Expressions with Python
Regular Expressions Cheat Sheet
Object Types - Lists
Object Types - Dictionaries and Tuples
Functions def, *args, **kargs
Functions lambda
Built-in Functions
map, filter, and reduce
Decorators
List Comprehension
Sets (union/intersection) and itertools - Jaccard coefficient and shingling to check plagiarism
Hashing (Hash tables and hashlib)
Dictionary Comprehension with zip
The yield keyword
Generator Functions and Expressions
generator.send() method
Iterators
Classes and Instances (__init__, __call__, etc.)
if__name__ == '__main__'
argparse
Exceptions
@static method vs class method
Private attributes and private methods
bits, bytes, bitstring, and constBitStream
json.dump(s) and json.load(s)
Python Object Serialization - pickle and json
Python Object Serialization - yaml and json
Priority queue and heap queue data structure
Graph data structure
Dijkstra's shortest path algorithm
Prim's spanning tree algorithm
Closure
Functional programming in Python
Remote running a local file using ssh
SQLite 3 - A. Connecting to DB, create/drop table, and insert data into a table
SQLite 3 - B. Selecting, updating and deleting data
MongoDB with PyMongo I - Installing MongoDB ...
Python HTTP Web Services - urllib, httplib2
Web scraping with Selenium for checking domain availability
REST API : Http Requests for Humans with Flask
Blog app with Tornado
Multithreading ...
Python Network Programming I - Basic Server / Client : A Basics
Python Network Programming I - Basic Server / Client : B File Transfer
Python Network Programming II - Chat Server / Client
Python Network Programming III - Echo Server using socketserver network framework
Python Network Programming IV - Asynchronous Request Handling : ThreadingMixIn and ForkingMixIn
Python Coding Questions I
Python Coding Questions II
Python Coding Questions III
Python Coding Questions IV
Python Coding Questions V
Python Coding Questions VI
Python Coding Questions VII
Python Coding Questions VIII
Python Coding Questions IX
Python Coding Questions X
Image processing with Python image library Pillow
Python and C++ with SIP
PyDev with Eclipse
Matplotlib
Redis with Python
NumPy array basics A
NumPy Matrix and Linear Algebra
Pandas with NumPy and Matplotlib
Celluar Automata
Batch gradient descent algorithm
Longest Common Substring Algorithm
Python Unit Test - TDD using unittest.TestCase class
Simple tool - Google page ranking by keywords
Google App Hello World
Google App webapp2 and WSGI
Uploading Google App Hello World
Python 2 vs Python 3
virtualenv and virtualenvwrapper
Uploading a big file to AWS S3 using boto module
Scheduled stopping and starting an AWS instance
Cloudera CDH5 - Scheduled stopping and starting services
Removing Cloud Files - Rackspace API with curl and subprocess
Checking if a process is running/hanging and stop/run a scheduled task on Windows
Apache Spark 1.3 with PySpark (Spark Python API) Shell
Apache Spark 1.2 Streaming
bottle 0.12.7 - Fast and simple WSGI-micro framework for small web-applications ...
Flask app with Apache WSGI on Ubuntu14/CentOS7 ...
Fabric - streamlining the use of SSH for application deployment
Ansible Quick Preview - Setting up web servers with Nginx, configure enviroments, and deploy an App
Neural Networks with backpropagation for XOR using one hidden layer
NLP - NLTK (Natural Language Toolkit) ...
RabbitMQ(Message broker server) and Celery(Task queue) ...
OpenCV3 and Matplotlib ...
Simple tool - Concatenating slides using FFmpeg ...
iPython - Signal Processing with NumPy
iPython and Jupyter - Install Jupyter, iPython Notebook, drawing with Matplotlib, and publishing it to Github
iPython and Jupyter Notebook with Embedded D3.js
Downloading YouTube videos using youtube-dl embedded with Python
Machine Learning : scikit-learn ...
Django 1.6/1.8 Web Framework ...
Ph.D. / Golden Gate Ave, San Francisco / Seoul National Univ / Carnegie Mellon / UC Berkeley / DevOps / Deep Learning / Visualization