Monthly Archives: May 2011

Hadoop Orchestration

Most data processing tasks with Hadoop require multiple Hadoop jobs with dependencies between them. The dependency arises out of the need for one job to use the output for another job. The dependency between Hadoop jobs can be expressed as … Continue reading

Posted in Hadoop and Map Reduce, Java, Workflow | Tagged , , | 8 Comments