Category Archives: Optimization

Bandits Know the Best Product Price

In an earlier post, I surveyed a class of reinforcement learning algorithms known as Multi-Armed Bandits (MAB). Essentially, these algorithms make decisions and learn from the rewards they receive from the environment. You can also think of them as … Continue reading

Posted in Big Data, Data Science, Hadoop and Map Reduce, Marketing Analytic, Optimization, Reinforcement Learning | 4 Comments