Monthly Archives: June 2013

A Learning but Greedy Gambler

Posted on June 29, 2013 by Pranab

In multi-armed bandit (MAB) problem, a gambler must decide which arm of K slot machines to pull in sequence of N rounds of pulls to maximize the overall return. Many real life optimization and decision making problems can be modelled … Continue reading →

Posted in Big Data, Data Science, Hadoop and Map Reduce, Marketing Analytic, Optimization, Reinforcement Learning, Storm | Tagged muti armed bandit | 6 Comments

Monthly Archives: June 2013

A Learning but Greedy Gambler

Recent Posts

Top Posts

Archives

Categories

Meta

About me

My Recent Tweets