Category Archives: Web Analytic

Detecting Incidents with Context from Log Data

Analyzing vast amount of machine generated unstructured or semi structured data is Hadoop’s forte. Many of us have gone through the exercise of searching log files, most likely with grep,  for some pattern and then looking at surrounding log lines … Continue reading

Posted in Big Data, Hadoop and Map Reduce, Log Analysis, Uncategorized, Web Analytic | Tagged , , , | Leave a comment

Tracking Web Site Bounce Rate in Real Time

Bounce rate for a page  in a web site, is the  proportion of sessions with only that page in the session. This post will show how to calculate bounce rate in real time with Storm using web log data. We … Continue reading

Posted in Big Data, Optimization, Real Time Processing, Reinforcement Learning, Storm, Web Analytic | Tagged , | 2 Comments

From Explicit User Engagement to Implicit Product Rating

The basic input for sifarish or any other collaborative filtering  based recommendation engine is user rating of items. However explicit  rating by users is not always available. Even when it’s available, it’s been known that generally only users with extreme … Continue reading

Posted in Big Data, Data Science, eCommerce, Hadoop and Map Reduce, Recommendation Engine, Web Analytic | Tagged , , | 15 Comments

Big Web Checkout Abandonment

The topic for this post, is of interest to any online retailer. Shopping cart abandonment is dreaded by online stores. It’s more common in online stores than brick and mortar stores. In this post I will be discussing Hadoop based checkout abandonment analysis based … Continue reading

Posted in Big Data, Data Science, Hadoop and Map Reduce, Web Analytic | Tagged , , , | Leave a comment

Big Web Analytic

I had started on a Hadoop based web analytic open source project some time ago. Recently I did some work on it and decided blog about the development I did on the the project. The project is  called visitante and … Continue reading

Posted in Big Data, ETL, Hadoop and Map Reduce, Hive, Web Analytic | Tagged , , | 9 Comments