Category: Small Bites of Big Data
-
Befriending Dragons | #Words4Justice
Today is my last official day at Microsoft. I no longer feel safe, comfortable, or valued working in tech. Going forward I’ll be working to actively disrupt tech culture and systems to reduce harassment and discrimination. Keep an eye on #Words4Justice. 😊 Be kind. Be brave. Go beyond ally to accomplice to actively…
-
Windows Hyper-V Dragon
After all these years soaring through the data world, from SQL Server 1.11 all the way through today’s modern Big Data technologies, I am making a flight adjustment. My next adventure will be in the land of the Windows Hypervisor: Hyper-V. Last week I started working with my new team and…
-
Big Data for the SQL Eye
SQL Server is a great technology – I’ve been using it since 1993 when the user interface consisted of a query window with the options to save and execute and not much else. With every release there’s something new and exciting and there’s always something to learn about even the…
-
Hadoop Likes Big Files
One of the frequently overlooked yet essential best practices for Hadoop is to prefer fewer, bigger files over more, smaller files. How small is too small and how many is too many? How do you stitch together all those small Internet of Things files into files “big enough” for Hadoop…
-
Azure Data Factory: Hub Not Found
You can use the new Azure portal to create or edit Azure Data Factory components. Once you are done you may automate the process of creating future Data Factory components from PowerShell. In that case you can use the JSON files you edited in the portal GUI as configuration files…
-
Create HDInsight Cluster in Azure Portal
Creating an HDInsight cluster from the Azure portal is very easy. However, sometimes you want all the choices and best practices explained as well as the “how to”. I have created a series of slides with audio recordings to walk you through the process and choices. They are available as…
-
Master Choosing the Right Project for Hadoop
Hadoop is the hot buzzword of the Big Data world, and many IT people are being told “go create a Hadoop cluster and do some magic”. It’s hard to know where to start or which projects are a good fit. The information available online is sparse, often conflicting, and usually…
-
AzureCopy to the Rescue for an S3 to Azure Blob Copy!
This week I helped a client move files from AWS S3 to Azure Storage blobs. Sounds simple, right? Here’s the tricky part… While there are both Azure and AWS cmdlets for PowerShell, they don’t cooperate. Neither has a cmdlet that accepts credentials from the other and neither accepts arbitrary URLs…
-
PowerShell works for Amazon AWS S3 too!
Technorati Tags: azure blob store,azure storage,aws,s3,neal analytics,windows azure,powershell More and more we have to work with data in many different locations. This week I got to work with S3 files that were moving to Azure blob storage. I was surprised to find that Amazon has published AWS cmdlets for PowerShell.…
-
Understanding WASB and Hadoop Storage in Azure
Yesterday we learned Why WASB Makes Hadoop on Azure So Very Cool. Now let’s dive deeper into Windows Azure storage and WASB. I’ll answer some of the common questions I get when people first try to understand how WASB is the same as and different from HDFS. What is HDFS?…