This project has moved. For the latest updates, please go here.

Any Hadoop Implementations? Production or Experimental?

Dec 12, 2011 at 11:52 PM

Is anyone using Hadoop with openPDC?

Are you using it in Production or just Experimentation?

If you are, can you share some of your experiences, procedures, or references that might help kick start other researchers that may be interested in using Hadoop?


Dec 18, 2011 at 10:58 PM


TVA uses Hadoop in production with both HDFS for cheap scalable storage and MapReduce for processing large amounts of time series data. Some materials you may find helpful:


Original Whitepapers:

OSCON-data presentation (good TVA story here):

High Level articles/coverage of project:

Engineering Literature:


General time series processing with Hadoop (along with another source code example):


Hope that helps,



Dec 20, 2011 at 12:26 AM
Edited Dec 20, 2011 at 1:08 AM

That was by far the most excellent answer I have ever received!



PS:  I added your references to the Hadoop section of the FAQ