Monthly Archives: June 2013

A Learning but Greedy Gambler

In multi-armed bandit (MAB) ┬áproblem, a gambler must decide which arm of K slot machines to pull in sequence of N rounds of pulls to maximize the overall return. Many real life optimization and decision making problems can be modelled … Continue reading

Posted in Big Data, Data Science, Hadoop and Map Reduce, Marketing Analytic, Optimization, Reinforcement Learning, Storm | Tagged | 4 Comments