Category Archives: Cluster Computation

Big Data System Design with Bayesian Optimization

Designing a complex Big Data system with a myriad of parameters and design choices is a daunting task. It’s almost a black art. Typically we stay with the default parameter settings unless they fail to meet our requirements, which forces us to venture out … Continue reading

Posted in Big Data, Cluster Computation, Data Science, Optimization | 1 Comment

Bring some Spark into your life

Hadoop is a great cluster computing framework, but sometimes it may not be a great fit for your particular problem at hand. Or you may have Hadoop fatigue and want to explore other options. There are certain problems where … Continue reading

Posted in Big Data, Cluster Computation, Scala, Spark | 4 Comments