Category Archives: Data Mining

Mining Seasonal Products from Sales Data

The other day someone asked me how to include products with seasonal demand in recommendations based on collaborative filtering or some other technique. The solution to the problem involves two steps. The first step is to identify products with seasonal … Continue reading

Posted in Big Data, Data Mining, Data Science, eCommerce, Map Reduce, Recommendation Engine

Customer Segmentation Based on Online Behavior using ScikitLearn

Customer segmentation or clustering is useful in various ways. It could be used for targeted marketing. Sometimes when building a predictive model, it’s more effective to cluster the data and build a separate predictive model for each cluster. In this post, … Continue reading

Posted in Data Mining, Data Science, Machine Learning | 4 Comments

Association Mining with Improved Apriori Algorithm

Association mining solves many real-life problems, e.g., finding items frequently bought together or songs frequently listened to together in one session. Apriori is a popular algorithm for mining frequent item sets. In this post, we will go over a Hadoop based … Continue reading

Posted in Association Mining, Big Data, Data Mining, Data Science, Hadoop and Map Reduce, Marketing Analytic, Rule Mining

Nearest Flunking Neighbors

Adoption of eLearning or Learning Management Systems (LMS) has increased significantly within the academic and business worlds. In some cases, depending on the content and the eLearning system being used, high dropout rates have been reported as a serious problem. … Continue reading

Posted in Big Data, Data Mining, Data Science, Hadoop and Map Reduce, Predictive Analytic | 1 Comment

Novelty in Personalization

We have all had the unfortunate experience of being pigeonholed by Personalization and Recommendation engines. When recommendations are based only on our past behavior, there is very little opportunity to explore. But our past actions are not always good predictors for … Continue reading

Posted in Big Data, Data Mining, Data Science, Hadoop and Map Reduce, Personalization, Recommendation Engine | 1 Comment

Making Recommendations in Real Time

Making recommendations based on a user’s current behavior in a small time window is a powerful feature that has been added to sifarish recently. In this post I will go over the details of this feature. The real time feature … Continue reading

Posted in Big Data, Collaborative Filtering, Data Mining, Data Science, Hadoop and Map Reduce, Real Time Processing, Recommendation Engine, Redis, Storm | 2 Comments

Using Mutual Information to Find Critical Factors in Hospital Readmission

Nobody likes hospital readmission soon after discharge, whether it’s the patient or the insurance company. Predictive analytic techniques have been used to predict the likelihood of hospital readmission, using various medical, personal, and demographic input attributes (features). However, … Continue reading

Posted in Big Data, Correlation, Data Mining, Data Science, Hadoop and Map Reduce, Healthcare Analytic, Predictive Analytic | 5 Comments

Retarget Campaign for Abandoned Shopping Carts with Decision Tree

Research has shown that customers who have abandoned shopping carts, when subjected to a retargeting email campaign, often come back and in many cases end up buying more than what was originally in the shopping cart. There are many attributes of such … Continue reading

Posted in Big Data, Data Mining, Data Science, eCommerce, Hadoop and Map Reduce, Marketing Analytic

Real Time Fraud Detection with Sequence Mining

Real time fraud detection is one of the use cases where multiple components of the Big Data ecosystem come into play in a significant way: Hadoop batch processing for building the predictive model and Storm for predicting fraud from … Continue reading

Posted in Big Data, Data Mining, Data Science, Fraud Detection, Hadoop and Map Reduce, Redis, Storm | 27 Comments

Predicting Customer Loyalty Trajectory

Customer loyalty is the strength of the relationship a customer has with a business, as manifested by the customer purchasing more and at a higher frequency. There are various signals or events related to a customer’s engagement with a business. Some examples … Continue reading

Posted in Big Data, Data Mining, Data Science, Hadoop and Map Reduce, Marketing Analytic