Skip to main content

Merge and Merge join transformation in SSIS

Using Merge Transformation we can combine two sorted data-set into single data-set basically Merge Transformation used to combines rows from two sorted data flows into one sorted data flow.
Following tasks you may perform using Merge Transformation:
1.        Suppose we have a scenario like, we need to merge data from a database table and excel means we want to merge data from two different data sources. For such type of scenario, you can use Merge Transformation.
2.        If we want to merge data from two same structured tables but exists two different servers.
3.        Sometimes we get an error due to data in a row, after correcting errors in the data we can re-merge rows easily.
See below explanations may help you to understand Merge Transformation:
I do evaluate here, you already know about the data source, data conversion, data flow, task flow, control flow etc.
Note:  Before Merge transformation, we need to sort the data using Sort Transformation.
After sorting data add data path to Merge Transformation, a dialog box will be open See below:

Now choose one option from Input, suppose we chose Merge Input 1 same as we add data path from the second source.
Note: If Merge input 1 and Merge input 2 column name are same then no issue otherwise we have to map column, see below.

If we don’t want to add data from input you can ignore here, means we don’t want to add cAddress column from Merge Input 2 so we will select <ignore> same you can do with Merge Input 1.
See below full data flow of merge transformation.

Merge join transformation is the popular tool which is used by most BI developers, The Merge Join Combine to sorted data into one output using the FULL, LEFT or INNER JOIN.
You can configure the Merge Join transformation in the following ways:
1.    Specify the join is a FULL, LEFT, or INNER join.
2.    Specify the columns the join uses.
3.    Specify whether the transformation handles null values as equal to other nulls.
For example, you can use a LEFT join to join a table that includes customer information with a table that lists the phone of the customer. The result is a table that lists all customers and their phone numbers.

The implementation is same as Merge transformation except need to select Join Type in Merge join transformation editor dialog box, see below:
Here we need to select option of Join Type and (Full, Left or Inner join), suppose we select option inner join then select map the column from two shorted data sources (in above pic. cId mapped to cust_Id because cId Primary ke in customers table and foreign key in custPhone table ).

To see the result, we can enable data viewer after sorting and Merge Join transformation.
Result will be look like below: Customers data set after sorting.

 Customer- phone data set after sorting.

Merge join transformation output.

Popular posts from this blog

Remove special character from string in MongoDB

Problem: Suppose wehave a collection and one field is type string contains some special character (like !@#$%) and we don’t want these special character.
Solution: We can easily remove the special character from field using script “replace(/[^a-zA-Z 0-9 ]/g, '')” in our query.  How can we remove special character from string using this script please see following example.
Example: Suppose we have a collection “EduSurvey “where we are collecting information from institutions.

{Name:"JB institute”, About:"This is good one collage for MBA", Information:"This $%%institute ##has good faculty etc$$"}
{Name:"MK institute”, About:"This is good one collage for MCA", Information:"This$$%# is the dummy text12"}
{Name:"MG institute”, About:"This is good one collage for B,Tech", Information:"This# institute@ has&* good infrastructure"}

Did you notice Information fields contains some special character so we…

What is difference between UNION and UNION ALL in SQL Server

We use UNION and UNION ALL operator to combine multiple results set into one result set.
UNION operator is used to combining multiple results set into one result set but removes any duplicate rows. Basically, UNION is used to performing a DISTINCT operation across all columns in the result set. UNION operator has the extra overhead of removing duplicate rows and sorting result.
UNION ALL operator use to combine multiple results set into one result set but it does not remove any duplicate result. Actually, this does not remove duplicate rows so it is faster than the UNION operator. If you want to combine multiple results and without duplicate records then use UNION otherwise UNION ALL is better.
Following some rules for using UNION/UNION ALL operator
1.The number of the column should be the same in the query's when you want to combine them. 2.The column should be of the same data type. 3.ORDER BY clause can be applied to the overall result set not within each result set.
4.Column name of …