Strategic Linear Contextual Bandits Apple Machine Learning Research
Motivated by the phenomenon of strategic agents gaming a recommendation system to maximize the number of times they are recommended to users, we study a strategic variant of the linear contextual bandit problem, where the arms strategically misreport privately observed contexts to the learner. %… Read More »Strategic Linear Contextual Bandits Apple Machine Learning Research