Skip to main content

Indexing in MongoDB

As we know index is technique to arrange data in database which support the efficient resolution of queries. If we run a query to select data, MongoDB do scan every document of collection. Scanning of all document may affect performance of MongoDB when it process the large volume of data.
The index stores the value of a specific field or set of fields, ordered by the value of the field as specified in index.

The ensureIndex() Method

In MongoDB to create an index we need to use ensureIndex() method.
Syntax:
 Basic syntax of ensureIndex() method is given below:


>db.COLLECTION_NAME.ensureIndex({KEY:1})


In MongoDB if we want to create index in ascending order the we will use 1 or descending order to use -1. 

Example


>db.testCol.ensureIndex({"title":1})



In ensureIndex() method we can pass multiple fields, to create index on multiple fields.

>db.testCol.ensureIndex({"title":1,"description":-1})


ensureIndex() method also accepts list of options (which are optional), whose list is given below:

Parameter

Type

Description

Background
Boolean
Builds the index in the background so that building an index does not block other database activities. Specify true to build in the background. The default value is false.
Unique
Boolean
Creates a unique index so that the collection will not accept insertion of documents where the index key or keys match an existing value in the index. Specify true to create a unique index. The default value is false.
Name
String
The name of the index. If unspecified, MongoDB generates an index name by concatenating the names of the indexed fields and the sort order.

dropDups
Boolean
Creates a unique index on a field that may have duplicates. MongoDB indexes only the first occurrence of a key and removes all documents from the collection that contain subsequent occurrences of that key. Specify true to create unique index. The default value is false.
Sparse
Boolean
If true, the index only references documents with the specified field. These indexes use less space but behave differently in some situations (particularly sorts). The default value is false.
expireAfterSeconds
Integer
Specifies a value, in seconds, as a TTL to control how long MongoDB retains documents in this collection.
V
index version
The index version number. The default index version depends on the version of mongod running when creating the index.

Weights
Document

The weight is a number ranging from 1 to 99,999 and denotes the significance of the field relative to the other indexed fields in terms of the score.
default_language
String
For a text index, the language that determines the list of stop words and the rules for the stemmer and tokenizer. The default value is english.
language_override
String
For a text index, specify the name of the field in the document that contains, the language to override the default language.

Popular posts from this blog

Query to find stored procedures by nested stored procedure name

Problem: Suppose we have a stored procedure which has been used in several stored procedure, I mean stored procedure usp_proc1 is nested in many stored procedures like below
BEGIN DECLARE@ResultTABLE ( IDINT, NAMEVARCHAR(50), [ADDRESS]VARCHAR(255) ) INSERTINTO@Result EXECusp_proc1@Name='codefari' END
So I want to find all those queries who containing usp_proc1
Solution: There is a lot of solutions, I'm giving some of them below. If you want to get the only name of the stored procedures then use the following query. Using join query on system tables syscomments and sysobjects we can get the stored procedures name which containing the particular table, nested procs or any other string.
SELECTDISTINCTo.name FROMsyscommentss INNERJOINsysobjectsoONs.id=o.id

Check for changes to an SQL Server table?

Problem: Suppose your team is working on the under-development project so it might be possible continuous work on the database and perform changes in Table, Stored procedure as per requirement, and daily you have to update the testing server database as per changes are done in developing server database then how it is possible to trace those changes. There are a lot of solutions for this problem which is listed below Solution 1: For SQL Server 2000, 2005 and above use the CHECKSUM command SELECTCHECKSUM_AGG(BINARY_CHECKSUM(*))FROMYour_Table_NameWITH (NOLOCK); That will return the same number each time its run as long as the table contents haven't changed. Unfortunately CHECKSUM does not work always properly to detect changes. It is only a primitive checksum and no CRC calculation. Therefore you can't use it to detect all changes, e. g. symmetrical changes result in the same CHECKSUM! Solution 2: 1.Run the following query. Before executing query replace DB_Name with your database name…

Merge and Merge join transformation in SSIS

MERGE TRANSFORMATION
Using Merge Transformation we can combine two sorted data-set into single data-set basically Merge Transformation used to combines rows from two sorted data flows into one sorted data flow. Following tasks you may perform using Merge Transformation: 1.Suppose we have a scenario like, we need to merge data from a database table and excel means we want to merge data from two different data sources. For such type of scenario, you can use Merge Transformation. 2.If we want to merge data from two same structured tables but exists two different servers. 3.Sometimes we get an error due to data in a row, after correcting errors in the data we can re-merge rows easily. See below explanations may help you to understand Merge Transformation: I do evaluate here, you already know about the data source, data conversion, data flow, task flow, control flow etc. Note:Before Merge transformation, we need to sort the data using Sort Transformation. After sorting data add data path to Merge…