Method for cold start of a multi-armed bandit in a recommender system

Smriti Bhagat, Stéphane Caron. US Patent Application US-20150012345-A1.

Abstract

A method performed by a recommender system to recommend items to a new user includes calculating reward estimates from multiple multi-armed bandit models of a user and her social network friends. The new user's social network friends have multi-armed bandit models that are well established. The mixed multi-armed bandit estimates are processed to select the arm that maximizes the estimated reward to the new user. The multi-armed bandit arm of the greatest reward estimate is played and the new user responds by providing feedback so that the new user's multi-armed bandit model is updated as time progresses.

BibTeX

@misc{bhagat2015patent,
  title = {Method for cold start of a multi-armed bandit in a recommender system},
  author = {Smriti Bhagat and St{\'e}phane Caron},
  year = {2015},
  month = {January},
  note = {US Patent App. 14/308,044},
  url = {https://www.google.com/patents/US20150012345}
}
Content on this website is under the CC-BY 4.0 license.