Skip to main content

XML Index in SQL server

XML Index
An XML index is designed to handle indexing XML type column and has been available since SQL Server 2005. Many organizations store and transmit their data in XML format. We know XML is very convenient and popular to consume and provide the data that is the reason SQL server team worked on this and increased the performance of XML query using XML Indexing. When these indexes are used properly, they can dramatically reduce execution time in queries made against XML columns.

Why XML Index
When an index does not exist, an XML field must be 'shred' when the query is executed. This means that XML data is separated from XML tags, and is arranged in a relational format. An XML index works before time, represents XML data in the already-shredded version, allowing easy filtering.

You must know some XML Index Ground Rules
The XML index can only be applied to XML data type columns. To use the XML index on a specific table, there should be a cluster index on the primary key column in the table
Note: a primary key constraint includes a clustered index upon creation by default.
There are two types of XML indexes, first is Primary and second is Secondary.
A primary XML index essentially contains one row of information for each node in the XML column. This information is made up of XML node values, types, and paths. A primary index is a ‘pre-shredded’ representation of the XML blob – an easy-to-filter copy of the XML data.
Secondary XML indexes are dependent on primary XML indexes – you cannot create any secondary indexes without first having created a primary index.
Secondary XML indexes come in 3 types: PATH, VALUE, and PROPERTY indexes. Secondary indexes are designed to provide additional optimization for certain types of XML queries.
1-     PATH secondary indexes often help to optimize queries that make use of a path. An exist() method in the query usually indicates that a PATH index may improve the query execution.
2-     PROPERTY secondary indexes are designed to be helpful with queries where the primary key field is used, and where multiple values are retrieved from a single XML instance.
3-    VALUE secondary indexes are good for queries that use wildcards in a path to find known values – if the full path is not known, but the value being filtered IS known, a VALUE index will probably optimize the query execution.

How to create XML Index
As we know, primary key column and clustered index is mandatory to create an XML index so we have to create clustered index first. Using the following index we can create the XML index.

CREATE TABLE Product.Inventory (ID INT PRIMARY KEY, XmlDetails XML); 
-- Create primary index. 
CREATE PRIMARY XML INDEX PIdx_Inventory_XmlDetails  
ON T(XmlCol); 
-- Create secondary indexes (PATH, VALUE, PROPERTY). 
CREATE XML INDEX PIdx_Inventory_XmlDetails_PATH ON T(XmlDetails) 
USING XML INDEX PIdx_Inventory_XmlDetails 
CREATE XML INDEX PIdx_T_XmlDetails_VALUE ON T(XmlDetails) 
USING XML INDEX PIdx_Inventory_XmlDetails 
CREATE XML INDEX PIdx_Inventory_XmlDetails_PROPERTY ON T(XmlDetails) 
USING XML INDEX PIdx_Inventory_XmlDetails 

In above example, suppose we have a table Inventory which contains all information if your product and XML columns which has all information like Product details, SKU, ISBN etc. in XML format if we want to increase the XML query performance then we need indexing of this column using above script.

Popular posts from this blog

Merge and Merge join transformation in SSIS

Using Merge Transformation we can combine two sorted data-set into single data-set basically Merge Transformation used to combines rows from two sorted data flows into one sorted data flow. Following tasks you may perform using Merge Transformation: 1.Suppose we have a scenario like, we need to merge data from a database table and excel means we want to merge data from two different data sources. For such type of scenario, you can use Merge Transformation. 2.If we want to merge data from two same structured tables but exists two different servers. 3.Sometimes we get an error due to data in a row, after correcting errors in the data we can re-merge rows easily. See below explanations may help you to understand Merge Transformation: I do evaluate here, you already know about the data source, data conversion, data flow, task flow, control flow etc. Note:Before Merge transformation, we need to sort the data using Sort Transformation. After sorting data add data path to Merge…

Add day to ISODate in MongoDB

We can use $add operator to add days in ISODate in mongodb, $add is the Arithmetic Aggregation Operator which adds number and date in mongodb.

{ $add: [ <expression1>, <expression2>, ... ] }

Note:  If one of the argument is date $add operator treats to other arguments as milliseconds to add to the date.
Example: Suppose we have a Test collection as below.

{"Title" : "Add day to ISODate in MongoBD","CreatedDate" : ISODate("2016-07-07T08:00:00.000Z")}

Query to add 2 days in CreatedDate

db.Test.aggregate([      { $project: { Title: 1, AddedDate: { $add: [ "$CreatedDate", 2*24*60*60000 ] } } }    ])


{ "_id" : ObjectId("579a1567ac1b3f3732483de0"), "Title" : "Add day to ISODate in MongoBD", "AddedDate" : ISODate("2016-07-09T08:00:00.000Z") }

Note: As mentioned in above note we have to convert days in millisecond because $add operator treat to other arg…

What is difference between UNION and UNION ALL in SQL Server

We use UNION and UNION ALL operator to combine multiple results set into one result set.
UNION operator is used to combining multiple results set into one result set but removes any duplicate rows. Basically, UNION is used to performing a DISTINCT operation across all columns in the result set. UNION operator has the extra overhead of removing duplicate rows and sorting result.
UNION ALL operator use to combine multiple results set into one result set but it does not remove any duplicate result. Actually, this does not remove duplicate rows so it is faster than the UNION operator. If you want to combine multiple results and without duplicate records then use UNION otherwise UNION ALL is better.
Following some rules for using UNION/UNION ALL operator
1.The number of the column should be the same in the query's when you want to combine them. 2.The column should be of the same data type. 3.ORDER BY clause can be applied to the overall result set not within each result set.
4.Column name of …