Information Complexity in Bandit Subset Selection

Emilie Kaufmann; Shivaram Kalyanakrishnan

Communication Dans Un Congrès Année : 2013

Information Complexity in Bandit Subset Selection

(1, 2) ,

1
2

Emilie Kaufmann

Fonction : Auteur
PersonId : 10422
IdHAL : emilie-kaufmann
ORCID : 0000-0002-5496-824X
IdRef : 197040810

Signal, Statistique et Apprentissage

Département Traitement du Signal et des Images

Shivaram Kalyanakrishnan

Fonction : Auteur

Résumé

We consider the problem of efficiently exploring the arms of a stochastic bandit to identify the best subset of a specified size. Under the PAC and the fixed-budget formulations, we derive improved bounds by using KL-divergence-based confidence intervals. Whereas the application of a similar idea in the regret setting has yielded bounds in terms of the KL-divergence between the arms, our bounds in the pure-exploration setting involve the ``Chernoff information'' between the arms. In addition to introducing this novel quantity to the bandits literature, we contribute a comparison between strategies based on uniform and adaptive sampling for pure-exploration problems, finding evidence in favor of the latter.

Mots clés

Stochastic multi-armed bandits subset selection KL-divergence.

Domaines

Machine Learning [stat.ML] Théorie [stat.TH]

TelecomParis HAL : Connectez-vous pour contacter le contributeur

https://telecom-paris.hal.science/hal-02288406

Soumis le : samedi 14 septembre 2019-18:46:53

Dernière modification le : lundi 9 octobre 2023-12:49:39

Dates et versions

hal-02288406 , version 1 (14-09-2019)

Identifiants

HAL Id : hal-02288406 , version 1

Citer

Emilie Kaufmann, Shivaram Kalyanakrishnan. Information Complexity in Bandit Subset Selection. Conference On Learning Theory, Jun 2013, Princeton, United States. ⟨hal-02288406⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI IDS S2A

51 Consultations

0 Téléchargements

Information Complexity in Bandit Subset Selection

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager