RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising, 2018.

Regret bounds and regimes of optimality for user-user and item-item collaborative filtering, 2018 Information Theory and Applications Workshop, 2018.

A fast bandit algorithm for recommendations to users with heterogeneous tastes, Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, ser. AAAI'13, pp.1135-1141, 2013.

Bandits and recommender systems, Revised Selected Papers of the First International Workshop on Machine Learning, vol.9432, pp.325-336, 2015.

URL : https://hal.archives-ouvertes.fr/hal-01256033

Adaptive ε-greedy exploration in reinforcement learning based on value differences, Proceedings of the 33rd Annual German Conference on Advances in Artificial Intelligence, ser. KI'10, pp.203-210, 2010.

Finite-time analysis of the multiarmed bandit problem, Machine Learning, 2002.

The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, 2003.

On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems, 2008.

URL : https://hal.archives-ouvertes.fr/hal-00281392

OpenAI Gym, 2016.

Multi-armed bandit, dynamic environments and meta-bandits, Environments, pp.1-14, 2006.

URL : https://hal.archives-ouvertes.fr/hal-00113668

Comparing accuracy of cosine-based similarity and correlation-based similarity algorithms in tourism recommender systems, 4th IEEE International Conference on Management of Innovation and Technology, pp.469-474, 2008.

Mining of massive datasets, 2014.

Recommender systems, 2016.