Getting Started Using Hadoop, Part 2: Building a Cluster

In Part 1 of this series, I discussed some of the basic concepts around Hadoop, specifically when it's appropriate to use Hadoop to solve your data engineering problems and the terminology of the Hadoop eco-system. This post will cover how to install … [Continue reading]

Getting Started Using Hadoop, Part 1: Intro

For the last couple of days I've been at the eMetrics conference in San Francisco. There were several panels that discussed big data, both from an engineering standpoint as well as how to adopt newer technologies from a business … [Continue reading]

Instructions for Installing & Using R on Amazon EC2

If you're an R user, you've surely heard all the hype around 'big data' and how R is commonly used to analyze these volumes of data. One thing that's often missing from the discussion is HOW to work around issues using big data and R, specifically … [Continue reading]