Monthly Archives: August 2013

Bandits Know the Best Product Price

In an earlier post, I surveyed a class of reinforcement learning algorithms known as multi-armed bandits (MAB). Essentially, these algorithms make decisions and learn from the rewards they receive from the environment. You can also think of them as … Continue reading

Posted in Big Data, Data Science, Hadoop and Map Reduce, Marketing Analytic, Optimization, Reinforcement Learning | 2 Comments