Tag Archives: nearest neighbor

Handling Rare Events and Class Imbalance in Predictive Modeling for Machine Failure

Most supervised machine learning algorithms struggle when the training data is class-imbalanced, i.e., when the data belonging to one class heavily outnumbers the data belonging to the other. However, there are many real-life problems where we encounter this situation, e.g., … Continue reading

Posted in Big Data, Data Science, ETL, Hadoop and Map Reduce

Warm Starting a Recommender with Hadoop

In my earlier post, I discussed a solution for cold starting a recommender. Cold starting refers to the situation where no user interaction data is available. You may have a newly registered user on your website. The user may … Continue reading

Posted in Big Data, Data Mining, Hadoop and Map Reduce, Recommendation Engine