Monthly Archives: August 2010

Log Analysis and Incident Reporting with Hadoop

What do we have We have a very successful eCommerce site. We have lot of traffic  in our eCommerce site, which is good.  Our  visitors are busy buying our cool products, which is even better. What do we want We … Continue reading

Posted in Hadoop and Map Reduce, Web | Tagged , , | 9 Comments

Programming language bloat

The other day I was reading a  blog about programing language complexity and bloat and how this new language is going to usher a new era.  There is no panacea in programing language landscape.  Most languages start  out lean and … Continue reading

Posted in Programing Language | Tagged , | Leave a comment

Folding, Cross Validation with Map Reduce

In my current web log data mining project using Hadoop, I am trying to build a predictive model for predicting certain attributes of the web site visitor.  I have number of ETL (Extract, Transform and Load)tasks to prepare the data … Continue reading

Posted in Hadoop and Map Reduce | Tagged , , , | 3 Comments

Cassandra and Hadoop

I was always interested in mining patterns and knowledge from data. While working on a data mining project some time ago, I ran into a road block when dealing with very large data set. Most data mining algorithms are monolithic … Continue reading

Posted in Cassandra, Hadoop and Map Reduce, NOSQL | Tagged , | 1 Comment