Create HDInsight Cluster in Azure Portal

Creating an HDInsight cluster from the Azure portal is very easy. However, sometimes you want all the choices and best practices explained as well as the “how to”. I have created a series of slides with audio recordings to walk you through the process and choices. They are available as sessions 1-8 of “Create HDInsightContinue reading “Create HDInsight Cluster in Azure Portal”

Taking Flight a.k.a. The Data Dragon’s Life After Microsoft

Cross-posted (with slightly worse formatting) from http://befriendingdragons.com/2014/07/23/taking-flight-a-k-a-the-data-dragons-life-after-microsoft/ Life is a journey – we can choose to fly through it with our wings spread to catch and channel the winds, or we can let the winds pummel us to the ground. I choose to take flight, enjoy the journey, and land on my feet. Then take off again. Even whenContinue reading “Taking Flight a.k.a. The Data Dragon’s Life After Microsoft”

Use Additional Storage Accounts with HDInsight Hive

When you create an HDInsight Hadoop cluster you pass in one or more storage accounts and their associated keys. This allows you to access the files on all associated storage accounts from the cluster. If you want to use public storage that isn’t passed in at create time that’s easy – simply supply the storageContinue reading “Use Additional Storage Accounts with HDInsight Hive”

HDInsight: Jiving about Hadoop and Hive with CAT

Tomorrow I will be talking about Hive as part of Pragmatic Work’s Women in Technology (WIT) month of webcasts. I am proud to be part of this lineup with all these stellar WITs! I encourage my fellow WITs to get more involved in your data community and if you don’t already do so start tweeting,Continue reading “HDInsight: Jiving about Hadoop and Hive with CAT”

HDInsight: Hive Internal and External Tables Intro

Small Bites of Big Data Cindy Gross, SQLCAT PM HDInsight is Microsoft’s distribution, in partnership with Hortonworks, of Hadoop. Hive is the component of the Hadoop ecosystem that imposes structure on Hadoop data in a way that makes it usable from BI tools that expect rows and columns with defined data types. Hive tables canContinue reading “HDInsight: Hive Internal and External Tables Intro”

Hurricane Sandy Mash-Up: Hive, SQL Server, PowerPivot, Power View

Small Bites of Big Data Authors: Cindy Gross Microsoft SQLCAT PM, Ed Katibah Microsoft SQLCAT PM Tech Reviewers: Bob Beauchemin Developer Skills Partner at SQLSkills, Jeannine Nelson-Takaki Microsoft Technical Writer, John Sirmon Microsoft SQLCAT PM, Lara Rubbelke Microsoft Technical Architect, Murshed Zaman Microsoft SQLCAT PM For my #SQLPASS Summit 2012 talk SQLCAT: Big Data –Continue reading “Hurricane Sandy Mash-Up: Hive, SQL Server, PowerPivot, Power View”

Big Data – All Abuzz About Hive at #SQLPASS Summit 2012

Big Data – All Abuzz About Hive Small Bites of Big Data Cindy Gross, SQLCAT PM I hope to see you at the #SQLPASS Summit 2012 this week! There are many reasons people come to the PASS Summit – SQL friends, SQL family, networking, great content in 190 sessions, the SQL clinic, the product team,Continue reading “Big Data – All Abuzz About Hive at #SQLPASS Summit 2012”

Load SQL Server BCP Data to Hive

Load SQL Server BCP Data to Hive Small Bites of Big Data Cindy Gross, SQLCAT PM As you start learning more about Hadoop you may want to take a look at how the same data and queries work for SQL Server and for Hadoop. There are various ways to do this. For now I’ll showContinue reading “Load SQL Server BCP Data to Hive”

Hadoop Hive Error: Could not connect client socket, timed_out

Small Bites of Big Data Cindy Gross, SQLCAT PM With the Hadoop on Azure CTP, when you create a Hadoop cluster it expires after a few days to free up the resources for other CTP users. Therefore each time I do a demo or test I am likely to create a new Hadoop cluster. ThereContinue reading “Hadoop Hive Error: Could not connect client socket, timed_out”

Open Ports for HadoopOnAzure CTP – Small Bites of Big Data

Open Ports for HadoopOnAzure CTP Small Bites of Big Data Cindy Gross, SQLCAT PM UPDATED Jun 2013: HadoopOnAzure CTP has been replaced by HDInsight Preview. See Troubleshooting ODBC connectivity to HDInsight http://social.msdn.microsoft.com/Forums/en-US/hdinsight/thread/b4ca52ea-f7cf-420c-959d-53e09f801f7d.       Once you have created your Hadoop on Azure cluster you will likely be moving data in and out of the system. That meansContinue reading “Open Ports for HadoopOnAzure CTP – Small Bites of Big Data”

%d bloggers like this: