Using Hadoop Like a Boss

Once you’re doing real development, get into the habit of working with small test datasets on your local machine and running your code iteratively: first in Local JobRunner Mode (which lets you test and debug your Map and Reduce code locally), then in Pseudo-Distributed Mode (which more closely mimics the production environment), and finally in Fully-Distributed Mode (your real production cluster). Developing iteratively lets you work out bugs on small subsets of the data, so that when you run against the full dataset with real production resources, the kinks are already worked out and your job doesn’t crash three-quarters of the way in.
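To make the "small dataset first" habit concrete, here is a minimal sketch in plain Python of checking map and reduce logic in a single process before anything touches a cluster. It is not Hadoop's local runner itself. The word-count job, the function names (mapper, reducer, run_local), and the sample lines are all assumptions chosen for illustration.

```python
from itertools import groupby
from operator import itemgetter


def mapper(line):
    """Emit a (word, 1) pair for every whitespace-separated word in a line."""
    for word in line.lower().split():
        yield word, 1


def reducer(key, values):
    """Sum the counts collected for a single key."""
    return key, sum(values)


def run_local(lines):
    """Run map, sort (a stand-in for the shuffle phase), and reduce in one process."""
    mapped = [pair for line in lines for pair in mapper(line)]
    # Sorting by key groups identical keys together, as the shuffle would.
    mapped.sort(key=itemgetter(0))
    return dict(
        reducer(key, (count for _, count in group))
        for key, group in groupby(mapped, key=itemgetter(0))
    )


# A tiny, hypothetical sample standing in for a subset of the real dataset.
sample = ["Hadoop runs jobs", "jobs run on Hadoop"]
counts = run_local(sample)
print(counts)
```

Once the logic behaves on a handful of lines like this, the same map and reduce code can be exercised against progressively larger inputs and more realistic modes.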

Read more

Looking Forward to AFCEA’s West13 in San Diego

In the last two years, I’ve been to my share of IT trade shows and conferences. Las Vegas. San Francisco. Dallas. DC. San Diego. Of all the cities I’ve traveled to for work, I enjoyed San Diego the most, so the fact that my next conference is AFCEA’s West13 at the San Diego Convention Center is a great way to start off my 2013 business travels.

Read more

What Is the Correct ESXi 5.1 Upgrade Process?

Going by the book, the upgrade process is precisely defined and should be followed in the specified order and manner whenever possible. In real life, other options and challenges always exist that might come into play in your organization. If you want to upgrade to vSphere 5.1 with the fewest possible headaches, you should perform the following steps in their precise order:

Read more