Monthly Archives: July 2013

What is big data and how big it is?

Yesterday, I was discussing with my younger brother who works as a petrophysicist. He suddenly paused me, “Hey, wait. I am hearing much about this big data. What makes this big data and how is it different from the data we deal?” I face this question quite often by clients as well. As I started my 5 minutes lecture to him on big data, I decided to compose a post with some nice collections to give a beginner a head start to big data. And here I am…

Continue reading

A sample to integrate data from Hadoop (HDInsight) using SSIS

Hadoop mostly deals with unstructured data. And all your structured data lives in relational databases. After you made necessary processing it on the Hadoop cluster you may need to bring your analysis to your data warehouse or to your RDBMS tables for further analysis so that unstructured data could compliment to structured database.

As I was playing around with HDInsight (Microsoft’s implementation of Apache Hadoop) on Azure I thought it will be useful to compile a step by step guide to integrate data from Hadoop cluster (HDInsight) using SQL Server Integration service.

Continue reading