Tag Archives: big data
It’s a lonely life for outliers
In this post, I am back to outliers and fraud analytics. In this earlier post, I gave an overview of the outlier detection techniques that are being implemented with Hadoop in my open source project beymani. In this earlier post, I … Continue reading
Posted in Big Data, Fraud Detection, Hadoop and Map Reduce, Predictive Analytic
Tagged big data, fraud, Hadoop, outlier
1 Comment
Warm Starting a Recommender with Hadoop
In an earlier post, I discussed a solution for cold starting a recommender. Cold starting refers to the situation where no user interaction data is available. You may have a newly registered user on your website. The user may … Continue reading
Posted in Big Data, Data Mining, Hadoop and Map Reduce, Recommendation Engine
Tagged big data, nearest neighbor
1 Comment