35:[["$","audio",null,{"id":"tts"}],["$","$L3a",null,{"paperID":"2001.08655","publisher":"arxiv","paperJSON":{"title":"Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting","paperID":"2001.08655","avgLineHeight":11.96,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"We design and analyze C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI, an algorithm for finding the best set of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"items, also called an arm, within the framework of cascading bandits. An upper bound on the time complexity of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI is derived by overcoming a crucial analytical challenge, namely, that of probabilistically estimating the amount of available feedback at each step. To do so, we define a new class of random variables (r.v.’s) which we term as left-sided sub-Gaussian r.v.’s; these are r.v.’s whose cumulant generating functions (CGFs) can be bounded by a quadratic only for non-positive arguments of the CGFs. This enables the application of a sufficiently tight Bernstein-type concentration inequality. We show, through the derivation of a lower bound on the time complexity, that the performance of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI is optimal in some practical regimes. Finally, extensive numerical simulations corroborate the efficacy of C","element":"span"},{"text":"AS","element":"span"},{"text":"- ","element":"span"},{"text":"CADE","element":"span"},{"text":"BAI as well as the tightness of our upper bound on its time complexity.","element":"span"}]]},{"heading":"1. Introduction","paragraphs":[[{"text":"Online recommender systems seek to recommend a small list of items (such as movies or hotels) to users based on a larger ground set ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"text":"] := ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , L","element":"span"},{"style":{"fontStyle":"italic"},"text":"} ","element":"span"},{"text":"of items. In this paper, we consider the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"cascading bandits ","element":"span"},{"text":"model (","element":"span"},{"href":"#id-0","referenceIndex":10,"text":"Craswell ","element":"a"},{"href":"#id-0","referenceIndex":10,"text":"et al.","element":"a"},{"href":"#id-0","referenceIndex":10,"text":", ","element":"a"},{"href":"#id-0","referenceIndex":10,"text":"2008","element":"a"},{"text":"; ","element":"span"},{"href":"#id-1","referenceIndex":20,"text":"Kveton et al.","element":"a"},{"href":"#id-1","referenceIndex":20,"text":", ","element":"a"},{"href":"#id-1","referenceIndex":20,"text":"2015a","element":"a"},{"text":"), which is widely used in information retrieval and online advertising. Upon seeing the chosen list, the user looks at the items sequentially. She ","element":"span"},{"style":{"fontStyle":"italic"},"text":"clicks ","element":"span"},{"text":"on an item if she is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"attracted ","element":"span"},{"text":"by it and skips to the ","element":"span"},{"text":"next one otherwise. This process stops when she clicks on one item in the list or if no item is clicked, it is deemed that she is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not attracted ","element":"span"},{"text":"by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"any ","element":"span"},{"text":"of the items. The items that are in the ground set but not in the chosen list and those in the list that come after the attractive one are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"unobserved","element":"span"},{"text":".","element":"span"}],[{"text":"Each item ","element":"span"},{"style":{"height":16},"width":115.16,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-0.png","element":"img","alt":" i ∈ [L]","inline":true},{"text":", with a certain ","element":"span"},{"style":{"fontStyle":"italic"},"text":"click probability ","element":"span"},{"style":{"height":16},"width":114.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-1.png","element":"img","alt":" w(i) ∈","inline":true,"padRight":true},{"text":"[0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1] ","element":"span"},{"text":"which is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"unknown ","element":"span"},{"text":"to the learning agent, attracts the user independently of other items. Under this assumption, the optimal solution is the list of items with largest ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":"’s. Based on the chosen lists and obtained feedback in previous steps, the agent tries to learn the click probabilities (explore the combinatorial space) in order to find the optimal list with high probability in as few time steps as possible.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Main Contributions. ","element":"span"},{"text":"Given ","element":"span"},{"style":{"height":12.4},"width":94.79,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-2.png","element":"img","alt":" δ > 0","inline":true},{"text":", a learning agent aims to find a list of optimal items of size ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"with probability at least ","element":"span"},{"style":{"height":11.6},"width":89.46,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-3.png","element":"img","alt":" 1 − δ","inline":true,"padRight":true},{"text":"in minimal time steps. To achieve a greater generality, we provide results for identifying a list of nearoptimal items (","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"Even-Dar et al.","element":"a"},{"href":"#id-2","referenceIndex":11,"text":", ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-3","referenceIndex":25,"text":"Mannor & Tsitsiklis","element":"a"},{"href":"#id-3","referenceIndex":25,"text":", ","element":"a"},{"href":"#id-3","referenceIndex":25,"text":"2004","element":"a"},{"text":"; ","element":"span"},{"href":"#id-4","referenceIndex":17,"text":"Kalyanakrishnan et al.","element":"a"},{"href":"#id-4","referenceIndex":17,"text":", ","element":"a"},{"href":"#id-4","referenceIndex":17,"text":"2012","element":"a"},{"text":"), where the notion of near-optimality is precisely defined in Section ","element":"span"},{"text":"2","element":"span"},{"text":". First, we design C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":104.62,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-4.png","element":"img","alt":"ϵ, δ, K","inline":true},{"text":") and derive an upper bound on its time complexity. Second, we establish a lower bound on the time complexity of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"any ","element":"span"},{"text":"best arm identification (BAI) algorithm in cascading bandits, which implies that the performance of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":104.62,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-5.png","element":"img","alt":"ϵ, δ, K","inline":true},{"text":") is optimal in some regimes. Finally, our extensive numerical results corroborate the efficacy of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":104.62,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/0-6.png","element":"img","alt":"ϵ, δ, K","inline":true},{"text":") and the tightness of our upper bound on its time complexity.","element":"span"}],[{"text":"Different from combinatorial semi-bandit settings, the amount of feedback in cascading bandits is, in general, random. The analysis of cascading bandits involves the unique challenge in adapting to the variation of the amount of feedback across time. To this end, we define a random variable (r.v.) that describes the feedback from the user at a step and bound its expectation. We define a novel class of r.v.’s, known as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"left-sided sub-Gaussian ","element":"span"},{"text":"(LSG) r.v.’s, and apply a concentration inequality to quantify the variation of the amount of feedback.","element":"span"}],[{"text":"Bernstein-type concentration inequalities are applied in many stochastic bandit problems and indicate that subGaussian (SG) distributions possess light tails (","element":"span"},{"href":"#id-5","referenceIndex":3,"text":"Audibert ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"& Bubeck","element":"a"},{"href":"#id-5","referenceIndex":3,"text":", ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"2010","element":"a"},{"text":"). Since it turns out that we only need to analyze a one-sided tail in this work, it suffices to consider a one-sided SG condition, which motivates the definition of LSG. We also provide a general estimate of a certain corresponding parameter in Theorem ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":", which is crucial for the utilization of the inequality. This may be of independent interest. Summary and future work are deferred to Appendix ","element":"span"},{"text":"7","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Literature review. ","element":"span"},{"text":"In a stochastic combinatorial bandit (SCB) model, an arm corresponds to a list of items in the ground set, and each item is associated with an r.v. at each time step. The corresponding reward depends on the constituent items’ realizations. We first review the related works on the BAI problem, in which a learning agent aims to identify an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"optimal arm","element":"span"},{"text":", i.e., a list of optimal items. (i) Given ","element":"span"},{"style":{"height":12.4},"width":92.36,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-0.png","element":"img","alt":"δ > 0","inline":true},{"text":", a learning agent aims to identify an optimal arm with probability ","element":"span"},{"style":{"height":11.6},"width":79.94,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-1.png","element":"img","alt":" 1−δ","inline":true,"padRight":true},{"text":"in minimal time steps (","element":"span"},{"href":"#id-7","referenceIndex":14,"text":"Jamieson & Nowak","element":"a"},{"href":"#id-7","referenceIndex":14,"text":", ","element":"a"},{"href":"#id-7","referenceIndex":14,"text":"2014","element":"a"},{"text":"; ","element":"span"},{"href":"#id-4","referenceIndex":17,"text":"Kalyanakrishnan et al.","element":"a"},{"href":"#id-4","referenceIndex":17,"text":", ","element":"a"},{"href":"#id-4","referenceIndex":17,"text":"2012","element":"a"},{"text":"). (ii) Given ","element":"span"},{"style":{"fontStyle":"italic"},"text":"B > ","element":"span"},{"text":"0","element":"span"},{"text":", an agent aims to maximize the probability of identifying an optimal arm in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"B ","element":"span"},{"text":"steps (","element":"span"},{"href":"#id-8","referenceIndex":4,"text":"Auer et al.","element":"a"},{"href":"#id-8","referenceIndex":4,"text":", ","element":"a"},{"href":"#id-8","referenceIndex":4,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-5","referenceIndex":3,"text":"Audibert & ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"Bubeck","element":"a"},{"href":"#id-5","referenceIndex":3,"text":", ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"2010","element":"a"},{"text":"; ","element":"span"},{"href":"#id-9","referenceIndex":5,"text":"Carpentier & Locatelli","element":"a"},{"href":"#id-9","referenceIndex":5,"text":", ","element":"a"},{"href":"#id-9","referenceIndex":5,"text":"2016","element":"a"},{"text":"). These two settings are known as the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"fixed-confidence ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"fixed-budget ","element":"span"},{"text":"setting respectively. Under the fixed-confidence setting, the early works aim to identify only one optimal item (","element":"span"},{"href":"#id-5","referenceIndex":3,"text":"Audibert ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"& Bubeck","element":"a"},{"href":"#id-5","referenceIndex":3,"text":", ","element":"a"},{"href":"#id-5","referenceIndex":3,"text":"2010","element":"a"},{"text":") and the later ones aim to find an optimal arm (","element":"span"},{"href":"#id-10","referenceIndex":6,"text":"Chen et al.","element":"a"},{"href":"#id-10","referenceIndex":6,"text":", ","element":"a"},{"href":"#id-10","referenceIndex":6,"text":"2014","element":"a"},{"text":"; ","element":"span"},{"href":"#id-11","referenceIndex":28,"text":"Rejwan & Mansour","element":"a"},{"href":"#id-11","referenceIndex":28,"text":", ","element":"a"},{"href":"#id-11","referenceIndex":28,"text":"2019","element":"a"},{"text":"). Besides, ","element":"span"},{"href":"#id-3","referenceIndex":25,"text":"Mannor & Tsitsiklis ","element":"a"},{"href":"#id-3","referenceIndex":25,"text":"(","element":"a"},{"href":"#id-3","referenceIndex":25,"text":"2004","element":"a"},{"text":"); ","element":"span"},{"href":"#id-12","referenceIndex":18,"text":"Kaufmann et al. ","element":"a"},{"href":"#id-12","referenceIndex":18,"text":"(","element":"a"},{"href":"#id-12","referenceIndex":18,"text":"2016","element":"a"},{"text":"); ","element":"span"},{"href":"#id-13","referenceIndex":1,"text":"Agar- ","element":"a"},{"href":"#id-13","referenceIndex":1,"text":"wal et al. ","element":"a"},{"href":"#id-13","referenceIndex":1,"text":"(","element":"a"},{"href":"#id-13","referenceIndex":1,"text":"2017","element":"a"},{"text":") provide problem-dependent lower bounds on the time complexity when ","element":"span"},{"href":"#id-4","referenceIndex":17,"text":"Kalyanakrishnan et al. ","element":"a"},{"href":"#id-4","referenceIndex":17,"text":"(","element":"a"},{"href":"#id-4","referenceIndex":17,"text":"2012","element":"a"},{"text":") establishes a problem-independent one. All these existing works are under the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"semi-bandit feedback ","element":"span"},{"text":"setting, where an agent observes realizations of all pulled items.","element":"span"}],[{"text":"Secondly, we review the relevant works on the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"regret minimization ","element":"span"},{"text":"(RM) problem, in which an agent aims to maximize his overall reward, or equivalently to minimize the so-called ","element":"span"},{"style":{"fontStyle":"italic"},"text":"cumulative regret","element":"span"},{"text":". Under the semi-bandit feedback setting, this problem has been extensively studied by ","element":"span"},{"href":"#id-14","referenceIndex":22,"text":"Lai & Robbins ","element":"a"},{"href":"#id-14","referenceIndex":22,"text":"(","element":"a"},{"href":"#id-14","referenceIndex":22,"text":"1985","element":"a"},{"text":"); ","element":"span"},{"href":"#id-15","referenceIndex":2,"text":"Anantharam et al. ","element":"a"},{"href":"#id-15","referenceIndex":2,"text":"(","element":"a"},{"href":"#id-15","referenceIndex":2,"text":"1987","element":"a"},{"text":"); ","element":"span"},{"href":"#id-16","referenceIndex":19,"text":"Kveton et al. ","element":"a"},{"href":"#id-16","referenceIndex":19,"text":"(","element":"a"},{"href":"#id-16","referenceIndex":19,"text":"2014","element":"a"},{"text":"); ","element":"span"},{"href":"#id-17","referenceIndex":23,"text":"Li ","element":"a"},{"href":"#id-17","referenceIndex":23,"text":"et al. ","element":"a"},{"href":"#id-17","referenceIndex":23,"text":"(","element":"a"},{"href":"#id-17","referenceIndex":23,"text":"2010","element":"a"},{"text":"); ","element":"span"},{"href":"#id-18","referenceIndex":27,"text":"Qin et al. ","element":"a"},{"href":"#id-18","referenceIndex":27,"text":"(","element":"a"},{"href":"#id-18","referenceIndex":27,"text":"2014","element":"a"},{"text":"). Moreover, motivated by numerous applications in clinical analysis and online advertisement, some researchers consider SCB models with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"partial feedback","element":"span"},{"text":", where an agent observes realizations of only a portion of pulled items. One prime model that incorporates the partial feedback is cascading bandits (","element":"span"},{"href":"#id-0","referenceIndex":10,"text":"Craswell ","element":"a"},{"href":"#id-0","referenceIndex":10,"text":"et al.","element":"a"},{"href":"#id-0","referenceIndex":10,"text":", ","element":"a"},{"href":"#id-0","referenceIndex":10,"text":"2008","element":"a"},{"text":"; ","element":"span"},{"href":"#id-1","referenceIndex":20,"text":"Kveton et al.","element":"a"},{"href":"#id-1","referenceIndex":20,"text":", ","element":"a"},{"href":"#id-1","referenceIndex":20,"text":"2015a","element":"a"},{"text":"). Recently, ","element":"span"},{"href":"#id-19","referenceIndex":21,"text":"Kveton et al. ","element":"a"},{"href":"#id-19","referenceIndex":21,"text":"(","element":"a"},{"href":"#id-19","referenceIndex":21,"text":"2015b","element":"a"},{"text":"); ","element":"span"},{"href":"#id-20","referenceIndex":24,"text":"Li et al. ","element":"a"},{"href":"#id-20","referenceIndex":24,"text":"(","element":"a"},{"href":"#id-20","referenceIndex":24,"text":"2016","element":"a"},{"text":"); ","element":"span"},{"href":"#id-21","referenceIndex":33,"text":"Zong et al. ","element":"a"},{"href":"#id-21","referenceIndex":33,"text":"(","element":"a"},{"href":"#id-21","referenceIndex":33,"text":"2016","element":"a"},{"text":"); ","element":"span"},{"href":"#id-22","referenceIndex":32,"text":"Wang & Chen ","element":"a"},{"href":"#id-22","referenceIndex":32,"text":"(","element":"a"},{"href":"#id-22","referenceIndex":32,"text":"2017","element":"a"},{"text":"); ","element":"span"},{"href":"#id-23","referenceIndex":8,"text":"Cheung et al. ","element":"a"},{"href":"#id-23","referenceIndex":8,"text":"(","element":"a"},{"href":"#id-23","referenceIndex":8,"text":"2019","element":"a"},{"text":") studied this model and derived various regret bounds.","element":"span"}],[{"text":"When the RM problem is studied with both semi-bandit and partial feedback, the BAI problem has only been studied in the semi-bandit feedback setting thus far. Despite existing works, analysis of the BAI problem in the more challenging case of partial feedback is yet to be done. Our work fills in this gap in the literature by studying the fixed-confidence ","element":"span"},{"text":"setting in cascading bandits, and our analysis provides tools for handling the statistical dependence between the amount of feedback and that of time steps in the cascading bandit setting.","element":"span"}]]},{"heading":"2. Problem Setup","paragraphs":[[{"text":"For brevity, we denote the set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , n","element":"span"},{"style":{"fontStyle":"italic"},"text":"} ","element":"span"},{"text":"by ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"] ","element":"span"},{"text":"for any ","element":"span"},{"style":{"height":11.6},"width":113.27,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-2.png","element":"img","alt":"n ∈ N","inline":true},{"text":", and the set of all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"-permutations of ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"]","element":"span"},{"text":", i.e., all ordered ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"-subsets of ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"]","element":"span"},{"text":", by ","element":"span"},{"style":{"height":18.18},"width":98.8,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-3.png","element":"img","alt":" [n](m)","inline":true,"padRight":true},{"text":"for any ","element":"span"},{"style":{"height":12.8},"width":122.03,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-4.png","element":"img","alt":" m ≤ n","inline":true},{"text":". Let there be ","element":"span"},{"style":{"height":11.6},"width":105.74,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-5.png","element":"img","alt":" L ∈ N","inline":true,"padRight":true},{"text":"ground items, contained in ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"text":"]","element":"span"},{"text":". Each item ","element":"span"},{"style":{"height":16},"width":111.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-6.png","element":"img","alt":"i ∈ [L]","inline":true,"padRight":true},{"text":"is associated with a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"weight ","element":"span"},{"style":{"height":16},"width":202.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-7.png","element":"img","alt":" w(i) ∈ [0, 1]","inline":true},{"text":", signifying the item’s click probability. We define an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arm ","element":"span"},{"text":"as a list of ","element":"span"},{"style":{"height":13.2},"width":132.42,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-8.png","element":"img","alt":"K ≤ L","inline":true,"padRight":true},{"text":"items in ","element":"span"},{"style":{"height":18.19},"width":102.44,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-9.png","element":"img","alt":" [L](K)","inline":true},{"text":". At each time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", the agent pulls an arm ","element":"span"},{"style":{"height":18.57},"width":464.3,"height":46.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-10.png","element":"img","alt":" St := (it1, . . . , itK) ∈ [L](K)","inline":true},{"text":". Then the user ","element":"span"},{"text":"examines the items from ","element":"span"},{"style":{"height":16.94},"width":29.73,"height":42.35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-11.png","element":"img","alt":" it1","inline":true,"padRight":true},{"text":"to ","element":"span"},{"style":{"height":17.37},"width":41.73,"height":43.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-12.png","element":"img","alt":" itK","inline":true,"padRight":true},{"text":"one at a time, until one ","element":"span"},{"text":"item is clicked or all items are examined. For each item ","element":"span"},{"style":{"height":11.2},"width":51.8,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-13.png","element":"img","alt":" i ∈","inline":true},{"style":{"height":16},"width":406,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-14.png","element":"img","alt":"[L], Wt(i) ∼ Bern(w(i))","inline":true,"padRight":true},{"text":"are i.i.d. across time. The agent observes ","element":"span"},{"style":{"height":16},"width":172.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-15.png","element":"img","alt":" Wt(i) = 1","inline":true,"padRight":true},{"text":"iff the user clicks on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":". The ","element":"span"},{"style":{"fontStyle":"italic"},"text":"feedback ","element":"span"},{"style":{"height":13.19},"width":45.34,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-16.png","element":"img","alt":"Ot","inline":true,"padRight":true},{"text":"from the user is defined as a vector in ","element":"span"},{"style":{"height":17.39},"width":280.72,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-17.png","element":"img","alt":" {0, 1, ⋆}K, where","inline":true},{"style":{"height":14},"width":95.3,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-18.png","element":"img","alt":"0, 1, ⋆","inline":true,"padRight":true},{"text":"represents observing no click, observing a click and no observation respectively. For example, if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"= 4 ","element":"span"},{"text":"and the user clicks on the third item at time step ","element":"span"},{"text":"2","element":"span"},{"text":", we have ","element":"span"},{"style":{"height":16},"width":281.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-19.png","element":"img","alt":"O2 = {0, 0, 1, ⋆}","inline":true},{"text":". Clearly, there is a one-to-one mapping from ","element":"span"},{"style":{"height":13.19},"width":45.34,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-20.png","element":"img","alt":" Ot","inline":true,"padRight":true},{"text":"to the integer","element":"span"}],[{"style":{"width":"66%"},"width":619,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-21.png","element":"img"}],[{"text":"where we assume ","element":"span"},{"style":{"height":13.6},"width":191.73,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-22.png","element":"img","alt":" min ∅ = ∞","inline":true},{"text":". If ","element":"span"},{"style":{"height":17.4},"width":133.52,"height":43.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-23.png","element":"img","alt":"˜kt < ∞","inline":true,"padRight":true},{"text":"(i.e., ","element":"span"},{"style":{"height":13.19},"width":45.34,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-24.png","element":"img","alt":" Ot","inline":true,"padRight":true},{"text":"is not the all-zero vector), the agent observes ","element":"span"},{"style":{"height":17.5},"width":206.94,"height":43.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-25.png","element":"img","alt":" Wt(itk) = 0","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":17.41},"width":215.37,"height":43.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-26.png","element":"img","alt":"1 ≤ k < ˜kt","inline":true},{"text":", and also observes ","element":"span"},{"style":{"height":21.79},"width":218.93,"height":54.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-27.png","element":"img","alt":" Wt(it˜kt) = 1","inline":true},{"text":", but does ","element":"span"},{"text":"not observe ","element":"span"},{"style":{"height":17.5},"width":116.5,"height":43.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-28.png","element":"img","alt":" Wt(itk)","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":17.4},"width":111.6,"height":43.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-29.png","element":"img","alt":" k > ˜kt","inline":true},{"text":". Otherwise, we have ","element":"span"},{"style":{"height":17.4},"width":78.71,"height":43.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-30.png","element":"img","alt":" ˜kt =","inline":true},{"style":{"height":7.2},"width":40,"height":18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-31.png","element":"img","alt":"∞","inline":true,"padRight":true},{"text":"(i.e., ","element":"span"},{"style":{"height":13.19},"width":45.34,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-32.png","element":"img","alt":" Ot","inline":true,"padRight":true},{"text":"is the all-zero vector), then the agent observes ","element":"span"},{"style":{"height":17.5},"width":191.56,"height":43.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-33.png","element":"img","alt":"Wt(itk) = 0","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":13.2},"width":188.99,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-34.png","element":"img","alt":" 1 ≤ k ≤ K","inline":true},{"text":". We denote ","element":"span"},{"style":{"height":16},"width":274.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-35.png","element":"img","alt":" ¯w(i) = 1 − w(i)","inline":true},{"text":", ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w ","element":"span"},{"text":"= (","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(1)","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"text":"))","element":"span"},{"text":", and the probability law (resp. the expectation) of the process ","element":"span"},{"style":{"height":16.79},"width":505.9,"height":41.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-36.png","element":"img","alt":" ({Wt(i)}i,t) by Pw (resp. Ew).","inline":true}],[{"text":"Without loss of generality, we assume ","element":"span"},{"style":{"height":16},"width":271.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-37.png","element":"img","alt":" w∗ := w(1) ≥","inline":true},{"style":{"height":16},"width":482.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-38.png","element":"img","alt":"w(2) ≥ . . . ≥ w(L) := w′","inline":true},{"text":". We say item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"optimal ","element":"span"},{"text":"if ","element":"span"},{"style":{"height":16},"width":241.59,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-39.png","element":"img","alt":" w(i) ≥ w(K)","inline":true},{"text":". We assume ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"+1) ","element":"span"},{"text":"to ensure there are exactly ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"optimal items. ","element":"span"},{"text":"Next, we say item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"is ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-40.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":"-optimal ","element":"span"},{"text":"(","element":"span"},{"style":{"height":13.2},"width":101.29,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-41.png","element":"img","alt":"ϵ ≥ 0","inline":true},{"text":") if ","element":"span"},{"style":{"height":16},"width":310.23,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-42.png","element":"img","alt":" w(i) ≥ w(K) − ϵ","inline":true,"padRight":true},{"text":"and set ","element":"span"},{"style":{"height":16},"width":935.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-43.png","element":"img","alt":"K′ϵ := max{i ∈ [L] : w(i) ≥ w(K) − ϵ}. Then [K′ϵ] is the","inline":true,"padRight":true},{"text":"set of all ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-44.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal items, ","element":"span"},{"style":{"height":18.19},"width":112.02,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-45.png","element":"img","alt":" [K](K)","inline":true,"padRight":true},{"text":"is the set of all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"optimal arms ","element":"span"},{"style":{"height":10.99},"width":42.73,"height":27.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-46.png","element":"img","alt":" S∗","inline":true,"padRight":true},{"text":"(up to permutation), and ","element":"span"},{"style":{"height":18.19},"width":124.44,"height":45.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-47.png","element":"img","alt":" [K′ϵ](K)","inline":true,"padRight":true},{"text":"is the set of all ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-48.png","element":"img","alt":"ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":"-optimal arms","element":"span"},{"text":".","element":"span"}],[{"text":"To identify an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-49.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal arm, an agent uses an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"algorithm ","element":"span"},{"style":{"height":6.8},"width":23,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-50.png","element":"img","alt":"π","inline":true,"padRight":true},{"text":"that decides which arms to pull, when to stop pulling, and which arm ","element":"span"},{"style":{"height":14.83},"width":45.74,"height":37.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-51.png","element":"img","alt":"ˆSπ","inline":true,"padRight":true},{"text":"to choose eventually. A deterministic and non-anticipatory online algorithm consists in a triple ","element":"span"},{"style":{"height":16},"width":494.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-52.png","element":"img","alt":"π := ((πt)t, T π, φπ) in which:","inline":true}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"sampling rule ","element":"span"},{"style":{"height":9.19},"width":34.71,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-53.png","element":"img","alt":" πt","inline":true,"padRight":true},{"text":"determines, based on the observation history, the arm ","element":"span"},{"style":{"height":14.74},"width":45.74,"height":36.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-54.png","element":"img","alt":" Sπt ","inline":true,"padRight":true},{"text":"to pull at time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":"; in other words, ","element":"span"},{"style":{"height":14.74},"width":45.73,"height":36.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-55.png","element":"img","alt":" Sπt","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":13.59},"width":81.59,"height":33.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-56.png","element":"img","alt":" Ft−1","inline":true},{"text":"-measurable, with ","element":"span"},{"style":{"height":16},"width":506.26,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/1-57.png","element":"img","alt":" Ft := σ(Sπ1 , Oπ1 , . . . , Sπt , Oπt );","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"stopping rule ","element":"span"},{"text":"determines the termination of the algo-","element":"span"}],[{"text":"rithm, which leads to a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"stopping time ","element":"span"},{"style":{"height":12.8},"width":50.84,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-0.png","element":"img","alt":" T π","inline":true,"padRight":true},{"text":"with respect to ","element":"span"},{"style":{"height":16},"width":127.18,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-1.png","element":"img","alt":"(Ft)t∈N","inline":true,"padRight":true},{"text":"satisfying ","element":"span"},{"style":{"height":16},"width":316.85,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-2.png","element":"img","alt":" P(T π < +∞) = 1;","inline":true}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"recommendation rule ","element":"span"},{"style":{"height":14},"width":42.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-3.png","element":"img","alt":" φπ ","inline":true,"padRight":true},{"text":"chooses an arm ","element":"span"},{"style":{"height":17.23},"width":199.08,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-4.png","element":"img","alt":"ˆSπ, which is","inline":true},{"style":{"height":14.39},"width":71.06,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-5.png","element":"img","alt":"FT π","inline":true},{"text":"-measurable.","element":"span"}],[{"text":"We define the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"time complexity ","element":"span"},{"text":"of ","element":"span"},{"style":{"height":12.8},"width":128.06,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-6.png","element":"img","alt":" π as T π","inline":true},{"text":". Under the fixed-confidence setting, a risk parameter (failure probability) ","element":"span"},{"style":{"height":12.4},"width":57.29,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-7.png","element":"img","alt":" δ ∈","inline":true,"padRight":true},{"text":"(0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1) ","element":"span"},{"text":"is fixed. We say an algorithm ","element":"span"},{"style":{"height":16},"width":203.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-8.png","element":"img","alt":" π is (ϵ, δ, K)","inline":true},{"style":{"fontStyle":"italic"},"text":"-PAC (probably approximately correct) ","element":"span"},{"text":"if ","element":"span"},{"style":{"height":18.83},"width":460.25,"height":47.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-9.png","element":"img","alt":" Pw( ˆSπ ⊂[K′ϵ]) ≥ 1−δ. The","inline":true,"padRight":true},{"text":"goal is to obtain an ","element":"span"},{"style":{"height":16},"width":136.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-10.png","element":"img","alt":" (ϵ, δ, K)","inline":true},{"text":"-PAC algorithm ","element":"span"},{"style":{"height":6.8},"width":23,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-11.png","element":"img","alt":" π","inline":true,"padRight":true},{"text":"such that ","element":"span"},{"style":{"height":13.99},"width":106.52,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-12.png","element":"img","alt":"EwT π","inline":true,"padRight":true},{"text":"is small and ","element":"span"},{"style":{"height":12.8},"width":50.83,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-13.png","element":"img","alt":" T π","inline":true,"padRight":true},{"text":"is small with high probability. We also define the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"optimal expected time complexity ","element":"span"},{"text":"over all ","element":"span"},{"style":{"height":16},"width":136.79,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-14.png","element":"img","alt":"(ϵ, δ, K)","inline":true},{"text":"-PAC algorithms as","element":"span"}],[{"style":{"width":"88%"},"width":828,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-15.png","element":"img"}],[{"text":"This measures the hardness of the problem. We abbreviate ","element":"span"},{"style":{"height":16},"width":945.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-16.png","element":"img","alt":"(0, δ, K)-PAC as (δ, K)-PAC, Ew as E, Pw as P, K′ϵ as K′,","inline":true},{"style":{"height":16},"width":487.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-17.png","element":"img","alt":"T π as T , T∗(w, ϵ, δ, K) as T∗ ","inline":true,"padRight":true},{"text":"when there is no ambiguity.","element":"span"}]]},{"heading":"3. Algorithm","paragraphs":[[{"id":"id-24","style":{"width":"100%"},"width":939,"height":1497,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-18.png","element":"img"}],[{"text":"Our algorithm C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":104.62,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-19.png","element":"img","alt":"ϵ, δ, K","inline":true},{"text":") is presented in Algorithm ","element":"span"},{"href":"#id-24","text":"1","element":"a"},{"text":". Intuitively, to identify an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-20.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal arm, an agent ","element":"span"},{"text":"needs to learn the true weights ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"text":"of a number of items in ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"text":"] ","element":"span"},{"text":"by exploring the combinatorial arm space.","element":"span"}],[{"text":"At each step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", we classify an item as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"surviving","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"accepted ","element":"span"},{"text":"or ","element":"span"},{"style":{"fontStyle":"italic"},"text":"rejected","element":"span"},{"text":". Initially, all items are surviving and belong to the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"survival set ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-21.png","element":"img","alt":" Dt","inline":true},{"text":". Over time, an item may be eliminated from ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-22.png","element":"img","alt":" Dt","inline":true},{"text":", in which case we say that it is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"identified","element":"span"},{"text":". Once an item is identified, it can be moved to either the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"accept set ","element":"span"},{"style":{"height":13.99},"width":41.89,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-23.png","element":"img","alt":" At","inline":true,"padRight":true},{"text":"if it is deemed to be ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-24.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal, or the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"reject set ","element":"span"},{"style":{"height":13.19},"width":42.26,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-25.png","element":"img","alt":" Rt","inline":true,"padRight":true},{"text":"otherwise. (i) At step ","element":"span"},{"text":"1","element":"span"},{"text":", all items are in ","element":"span"},{"style":{"height":13.19},"width":49,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-26.png","element":"img","alt":" D1","inline":true},{"text":". (ii) At each step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", the agent selects ","element":"span"},{"style":{"height":16},"width":227.71,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-27.png","element":"img","alt":" min{K, |Dt|}","inline":true,"padRight":true},{"text":"surviving items with the least number of previous observations, ","element":"span"},{"style":{"height":16},"width":82.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-28.png","element":"img","alt":" Tt(i)","inline":true},{"text":"’s, pulls them in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ascending ","element":"span"},{"text":"order of the ","element":"span"},{"style":{"height":16},"width":82.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-29.png","element":"img","alt":" Tt(i)","inline":true},{"text":", and gets cascading feedback from the user in the form of the ","element":"span"},{"style":{"height":17.4},"width":32.75,"height":43.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-30.png","element":"img","alt":"˜kt","inline":true},{"text":"’s. Similarly to a Racing algorithm (","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"Even-Dar et al.","element":"a"},{"href":"#id-2","referenceIndex":11,"text":", ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-25","referenceIndex":26,"text":"Maron & Moore","element":"a"},{"href":"#id-25","referenceIndex":26,"text":", ","element":"a"},{"href":"#id-25","referenceIndex":26,"text":"1994","element":"a"},{"text":"; ","element":"span"},{"href":"#id-26","referenceIndex":13,"text":"Heidrich-Meisner & Igel","element":"a"},{"href":"#id-26","referenceIndex":13,"text":", ","element":"a"},{"href":"#id-26","referenceIndex":13,"text":"2009","element":"a"},{"text":"; ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al.","element":"a"},{"href":"#id-27","referenceIndex":16,"text":", ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":"), this design of ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-31.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"increases the ","element":"span"},{"style":{"height":16},"width":82.55,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-32.png","element":"img","alt":" Tt(i)","inline":true},{"text":"’s of all surviving items almost uniformly and avoids the wastage of time steps. (iii) Next, we maintain upper and lower confidence bounds (UCB, LCB) across time to facilitate the identification of items as in Lines 13–17. The confidence radius is defined as follows:","element":"span"}],[{"style":{"width":"97%"},"width":910,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-33.png","element":"img"}],[{"text":"We set ","element":"span"},{"style":{"height":16},"width":259.26,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-34.png","element":"img","alt":" Ct(i, δ) = +∞","inline":true,"padRight":true},{"text":"when ","element":"span"},{"style":{"height":16},"width":166.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-35.png","element":"img","alt":" Tt(i) = 0","inline":true},{"text":". (iv) Lastly, the algorithm stops once ","element":"span"},{"style":{"height":16.8},"width":591.1,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-36.png","element":"img","alt":" Dt = ∅, |At| ≥ K or |Rt| ≥ L − K.","inline":true}]]},{"heading":"4. Main results","paragraphs":[[{"text":"We develop an upper bound on the time complexity of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":104.62,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-37.png","element":"img","alt":"ϵ, δ, K","inline":true},{"text":") and a lower bound on the expected time complexity of any ","element":"span"},{"style":{"height":16},"width":102.91,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-38.png","element":"img","alt":" (δ, K)","inline":true},{"text":"-PAC algorithm. We also discuss the gap between the bounds. We use ","element":"span"},{"style":{"height":10},"width":152.12,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-39.png","element":"img","alt":" c1, c2, . . .","inline":true,"padRight":true},{"text":"to denote finite and positive universal constants whose values may vary from line to line. The proofs are sketched in Section ","element":"span"},{"href":"#id-28","text":"5 ","element":"a"},{"text":"and more details are provided in Appendix ","element":"span"},{"text":"D","element":"span"},{"text":".","element":"span"}],[{"id":"id-34","style":{"fontWeight":"bold"},"text":"4.1. Upper bound","element":"span"}],[{"text":"The gaps between the click probabilities determine the hardness to identify the items. The gaps are defined as","element":"span"}],[{"style":{"width":"75%"},"width":712,"height":256,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-40.png","element":"img"}],[{"text":"Here ","element":"span"},{"style":{"height":16.02},"width":44.21,"height":40.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-41.png","element":"img","alt":"¯∆i","inline":true,"padRight":true},{"text":"is a slight variation of ","element":"span"},{"style":{"height":13.99},"width":44.21,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-42.png","element":"img","alt":" ∆i","inline":true,"padRight":true},{"text":"that takes into account the ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-43.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimality of items. Moreover, to correctly identify item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"with probability at least ","element":"span"},{"style":{"height":16},"width":124.23,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-44.png","element":"img","alt":" 1 − δ/2","inline":true},{"text":", our algorithm needs to observe it at least","element":"span"}],[{"style":{"width":"83%"},"width":784,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/2-45.png","element":"img"}],[{"id":"id-61","style":{"width":"58%"},"width":545,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-0.png","element":"img"}],[{"text":"times. Similarly to existing works (","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"Even-Dar et al.","element":"a"},{"href":"#id-2","referenceIndex":11,"text":", ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-3","referenceIndex":25,"text":"Mannor & Tsitsiklis","element":"a"},{"href":"#id-3","referenceIndex":25,"text":", ","element":"a"},{"href":"#id-3","referenceIndex":25,"text":"2004","element":"a"},{"text":"), we derive the upper bound with ","element":"span"},{"style":{"height":18.42},"width":203.38,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-1.png","element":"img","alt":"¯∆i’s and ¯Ti,δ","inline":true},{"text":"’s. A larger ","element":"span"},{"style":{"height":16.02},"width":44.21,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-2.png","element":"img","alt":" ¯∆i","inline":true,"padRight":true},{"text":"leads to a smaller ","element":"span"},{"style":{"height":18.42},"width":221.02,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-3.png","element":"img","alt":" ¯Ti,δ, implying","inline":true,"padRight":true},{"text":"that it requires fewer observations to identify item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"correctly. The permutation ","element":"span"},{"style":{"height":6.8},"width":23,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-4.png","element":"img","alt":" σ","inline":true,"padRight":true},{"text":"defines the ordering of ","element":"span"},{"style":{"height":13.63},"width":33,"height":34.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-5.png","element":"img","alt":"¯∆","inline":true},{"text":": ","element":"span"},{"style":{"height":19.31},"width":146.55,"height":48.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-6.png","element":"img","alt":"¯∆σ(1) ≥","inline":true},{"style":{"height":20.69},"width":513.06,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-7.png","element":"img","alt":". . . ≥ ¯∆σ(L). At step t, we set ˆkt","inline":true,"padRight":true},{"text":"as the number of surviving items in ","element":"span"},{"style":{"height":18.4},"width":205.19,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-8.png","element":"img","alt":" St, and Xˆkt;t","inline":true,"padRight":true},{"text":"as the number of observations of them. Note that ","element":"span"},{"style":{"height":17.4},"width":32.75,"height":43.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-9.png","element":"img","alt":"ˆkt","inline":true,"padRight":true},{"text":"is an r.v. We lower bound ","element":"span"},{"style":{"height":15.99},"width":181.14,"height":39.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-10.png","element":"img","alt":" EXk;t with","inline":true}],[{"style":{"width":"78%"},"width":740,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-11.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"µ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"k, w","element":"span"},{"text":"):=","element":"span"}],[{"style":{"width":"80%"},"width":751,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-12.png","element":"img"}],[{"text":"and upper bound ","element":"span"},{"style":{"height":20.3},"width":98.22,"height":50.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-13.png","element":"img","alt":" EX2k;t","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":17.99},"width":450.49,"height":44.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-14.png","element":"img","alt":" v(k, w) := min{k,√2/w′}","inline":true},{"text":". We abbreviate ","element":"span"},{"style":{"height":18.42},"width":59.01,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-15.png","element":"img","alt":"¯Ti,δ","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":17.63},"width":128.09,"height":44.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-16.png","element":"img","alt":"¯Ti, ρ(δ)","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":16},"width":165.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-17.png","element":"img","alt":" ρ, µ(k, w)","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":16},"width":185.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-18.png","element":"img","alt":" µk, v(k, w)","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":9.19},"width":36.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-19.png","element":"img","alt":" vk","inline":true,"padRight":true},{"text":"when there is no ambiguity. In anticipation of Theorem ","element":"span"},{"href":"#id-29","text":"4.1","element":"a"},{"text":", we define three more notations:","element":"span"}],[{"style":{"width":"80%"},"width":759,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-20.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"style":{"height":36.51},"width":907.1,"height":91.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-21.png","element":"img","alt":"2 := max{K′ − K, 1}, Mk := K + 1 − kµK+1−k − K − kµK−k.","inline":true}],[{"id":"id-29","style":{"fontWeight":"bold"},"text":"Theorem 4.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":11.6},"width":228.55,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-22.png","element":"img","alt":" K′ < 2K − 1","inline":true},{"style":{"fontStyle":"italic"},"text":". With probability at least ","element":"span"},{"style":{"height":11.6},"width":87.47,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-23.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-24.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":"-optimal arm after at most ","element":"span"},{"style":{"height":16},"width":383.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-25.png","element":"img","alt":" (c1N1 + c2N2 + c3N3)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"steps, where","element":"span"}],[{"style":{"width":"99%"},"width":933,"height":668,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-26.png","element":"img"}],[{"id":"id-32","text":"When ","element":"span"},{"style":{"height":10.8},"width":97.24,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-27.png","element":"img","alt":" ϵ = 0","inline":true},{"text":", ","element":"span"},{"style":{"height":16.02},"width":151.76,"height":40.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-28.png","element":"img","alt":"¯∆i = ∆i","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"height":16},"width":119.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-29.png","element":"img","alt":" i ∈ [L]","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":10.8},"width":144.94,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-30.png","element":"img","alt":" K′ = K","inline":true},{"text":". We note that it is a waste to pull identified items. This occurs only when ","element":"span"},{"style":{"height":11.6},"width":221.08,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-31.png","element":"img","alt":" K′ < 2K − 1","inline":true,"padRight":true},{"text":"(see Lemma ","element":"span"},{"href":"#id-30","text":"5.9","element":"a"},{"text":") and this scenario is more complicated to analyze. The scenario ","element":"span"},{"style":{"height":13.2},"width":223.17,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-32.png","element":"img","alt":" K′ ≥ 2K − 1","inline":true,"padRight":true},{"text":"is relatively easier to analyze and the result is deferred to Proposition ","element":"span"},{"href":"#id-31","text":"C.1 ","element":"a"},{"text":"(see Appendix ","element":"span"},{"text":"C","element":"span"},{"text":").","element":"span"}],[{"id":"id-35","style":{"fontWeight":"bold"},"text":"Interpretation of the bound. ","element":"span"},{"text":"The first term ","element":"span"},{"style":{"height":13.19},"width":48.02,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-33.png","element":"img","alt":" N1","inline":true,"padRight":true},{"text":"in the bound is unique to the cascading model, which results from the gap between ","element":"span"},{"style":{"height":18.4},"width":84.2,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-34.png","element":"img","alt":" Xˆkt;t","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":18.4},"width":110.77,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-35.png","element":"img","alt":" EXˆkt;t","inline":true},{"text":". We can bound ","element":"span"},{"style":{"height":13.19},"width":48.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-36.png","element":"img","alt":" N1","inline":true,"padRight":true},{"text":"in terms of the maximum and minimum weights, ","element":"span"},{"style":{"height":11.6},"width":176.11,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-37.png","element":"img","alt":" w∗ and w′.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":13.6},"width":443,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-38.png","element":"img","alt":" 0 < w′ < w∗ ≤ 1. We have","inline":true}],[{"style":{"width":"82%"},"width":773,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-39.png","element":"img"}],[{"text":"Next, recall that we say that an item is identified by time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"if it is put into ","element":"span"},{"style":{"height":13.99},"width":46.89,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-40.png","element":"img","alt":" Aτ","inline":true,"padRight":true},{"text":"or ","element":"span"},{"style":{"height":13.19},"width":47.26,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-41.png","element":"img","alt":" Rτ","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":12.8},"width":97.14,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-42.png","element":"img","alt":" τ ≤ t","inline":true},{"text":". In the worstcase scenario, the agent identifies items in descending order of ","element":"span"},{"style":{"height":16.02},"width":44.22,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-43.png","element":"img","alt":"¯∆i","inline":true},{"text":"’s. With probability at least ","element":"span"},{"style":{"height":11.6},"width":95.32,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-44.png","element":"img","alt":" 1 − δ","inline":true},{"text":", it costs at most ","element":"span"},{"style":{"height":13.19},"width":83.14,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-45.png","element":"img","alt":" c2N2","inline":true,"padRight":true},{"text":"steps to identify items ","element":"span"},{"style":{"height":16},"width":398.5,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-46.png","element":"img","alt":" σ(1), . . . , σ(L − K) and","inline":true},{"style":{"height":13.19},"width":83.14,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-47.png","element":"img","alt":"c3N3","inline":true,"padRight":true},{"text":"is for identifying the remaining ones. ","element":"span"},{"text":"More precisely, after item ","element":"span"},{"style":{"height":16},"width":278.95,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-48.png","element":"img","alt":" σ(L−K −k−1)","inline":true,"padRight":true},{"text":"is identified, the number of steps required for identifying item ","element":"span"},{"style":{"height":16},"width":216.89,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-49.png","element":"img","alt":" σ(L−K −k)","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":19.31},"width":944.11,"height":48.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-50.png","element":"img","alt":"(c3/µK−k+1)·(K − k + 1)[ ¯Tσ(L−K+k) − ¯Tσ(L−K+k−1)];","inline":true,"padRight":true},{"text":"we sum these steps up to obtain (","element":"span"},{"href":"#id-32","text":"4.1","element":"a"},{"text":"). Since the results in many existing works (","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"Even-Dar et al.","element":"a"},{"href":"#id-2","referenceIndex":11,"text":", ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al.","element":"a"},{"href":"#id-27","referenceIndex":16,"text":", ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":") mainly involve ","element":"span"},{"style":{"height":16.02},"width":34.28,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-51.png","element":"img","alt":"¯Ti","inline":true},{"text":"’s, we show the dependence of ","element":"span"},{"style":{"height":16.02},"width":170.9,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-52.png","element":"img","alt":" N3 on ¯Ti’s","inline":true,"padRight":true},{"text":"more concretely in (","element":"span"},{"href":"#id-32","text":"4.2","element":"a"},{"text":").","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Technique. ","element":"span"},{"text":"The crucial analytical challenge to derive our bound, especially to establish ","element":"span"},{"style":{"height":14},"width":165.98,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-53.png","element":"img","alt":" µk, vk, N1","inline":true},{"text":", is to quantify the impact of partial feedback that results from the cascading model. Firstly, we bound ","element":"span"},{"style":{"height":18.4},"width":110.76,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-54.png","element":"img","alt":" EXˆkt;t","inline":true,"padRight":true},{"text":"by exploiting some properties of the cascading feedback. Next, to bound the gap between ","element":"span"},{"style":{"height":20.42},"width":187.28,"height":51.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-55.png","element":"img","alt":"�nt=1 Xˆkt;t","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":20.42},"width":213.86,"height":51.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-56.png","element":"img","alt":" �nt=1 EXˆkt;t","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":11.6},"width":115.79,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-57.png","element":"img","alt":" n ∈ N","inline":true},{"text":", ","element":"span"},{"text":"we propose a novel class of r.v.’s, known as LSG r.v.’s, provide an estimate of a certain LSG parameter, and utilize a Berstein-type concentration inequality to bound the tail probability of a certain LSG r.v.. Details are in Section ","element":"span"},{"href":"#id-33","text":"5.1","element":"a"},{"text":".","element":"span"}],[{"text":"To facilitate the remaining discussion in Section ","element":"span"},{"href":"#id-34","text":"4.1","element":"a"},{"text":", we specialize our analysis and results henceforth to the case of ","element":"span"},{"style":{"height":16.03},"width":374.57,"height":40.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-58.png","element":"img","alt":"ϵ=0, in which ¯∆i =∆i","inline":true,"padRight":true},{"text":"and the agent aims to find ","element":"span"},{"style":{"height":11.6},"width":129.55,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-59.png","element":"img","alt":" S∗. The","inline":true,"padRight":true},{"text":"remaining results in Section ","element":"span"},{"href":"#id-34","text":"4.1 ","element":"a"},{"text":"can be directly generalized to the scenario of ","element":"span"},{"style":{"height":11.6},"width":76.02,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-60.png","element":"img","alt":" ϵ>0","inline":true,"padRight":true},{"text":"by replacing ","element":"span"},{"style":{"height":16.02},"width":246.89,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-61.png","element":"img","alt":" ∆i’s with ¯∆i’s.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Comparison to the semi-bandit problem. ","element":"span"},{"text":"A related algorithm in the setting of semi-bandit feedback and ","element":"span"},{"style":{"height":10.8},"width":95.46,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-62.png","element":"img","alt":" ϵ = 0","inline":true,"padRight":true},{"text":"is the B","element":"span"},{"text":"ATCH","element":"span"},{"text":"R","element":"span"},{"text":"ACING ","element":"span"},{"text":"Algorithm, which was proposed by ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"et al. ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"(","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":"). This algorithm has three paramters ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"r ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"b ","element":"span"},{"text":"which respectively represent the number of optimal items, the maximum number of pulls of one item at one step and the size of a pulled arm. When ","element":"span"},{"style":{"fontStyle":"italic"},"text":"r ","element":"span"},{"text":"= 1 ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"b ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":", we denote it as B","element":"span"},{"text":"AT","element":"span"},{"text":"R","element":"span"},{"text":"AC","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"). The fact that our algorithm observes between ","element":"span"},{"text":"1 ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"items per step motivates a comparison among C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":640.84,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-63.png","element":"img","alt":"0, δ, K), BATRAC(K) and BATRAC(1).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Corollary 4.3. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(i) If all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.22,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-64.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-65.png","element":"img","alt":" S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"50%"},"width":478,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/3-66.png","element":"img"}],[{"style":{"width":"80%"},"width":757,"height":242,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps; (ii) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.63,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-1.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-2.png","element":"img","alt":" S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"99%"},"width":934,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-3.png","element":"img"}],[{"id":"id-38","style":{"fontStyle":"italic"},"text":"steps.","element":"span"}],[{"text":"The results of Corollary ","element":"span"},{"href":"#id-35","text":"4.3 ","element":"a"},{"text":"are intuitive: (i) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":"’s are close to ","element":"span"},{"text":"0 ","element":"span"},{"text":"(i.e., at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"text":"), the bound on the time complexity of C","element":"span"},{"text":"ASCADE","element":"span"},{"text":"BAI(","element":"span"},{"style":{"height":14.8},"width":108.37,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-4.png","element":"img","alt":"0, δ, K","inline":true},{"text":") is of the same order as that of B","element":"span"},{"text":"AT","element":"span"},{"text":"R","element":"span"},{"text":"AC","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"text":"); (ii) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":"’s are close to ","element":"span"},{"text":"1 ","element":"span"},{"text":"(i.e., at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"text":"), the bound corresponds with that of B","element":"span"},{"text":"AT","element":"span"},{"text":"R","element":"span"},{"text":"AC","element":"span"},{"text":"(","element":"span"},{"text":"1","element":"span"},{"text":") (","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al.","element":"a"},{"href":"#id-27","referenceIndex":16,"text":", ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":"). We further upper bound the expected time complexity of our algorithm (denoted by ","element":"span"},{"style":{"height":9.19},"width":38.72,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-5.png","element":"img","alt":" π1","inline":true},{"text":") in these cases.","element":"span"}],[{"id":"id-36","style":{"fontWeight":"bold"},"text":"Proposition 4.4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(i) If all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"width":"100%"},"width":938,"height":243,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"(ii) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"width":"92%"},"width":869,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-7.png","element":"img"}],[{"text":"According to the definition of ","element":"span"},{"style":{"height":14},"width":453.83,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-8.png","element":"img","alt":" T∗ in Section 2, T∗ ≤ ET π1","inline":true,"padRight":true},{"text":"and hence also satisfies the above bounds. Corollary ","element":"span"},{"href":"#id-35","text":"4.3 ","element":"a"},{"text":"and Proposition ","element":"span"},{"href":"#id-36","text":"4.4 ","element":"a"},{"text":"indicate that the high probability upper bound on ","element":"span"},{"style":{"height":12.8},"width":64.47,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-9.png","element":"img","alt":" T π1 ","inline":true,"padRight":true},{"text":"and the upper bound on ","element":"span"},{"style":{"height":12.8},"width":91.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-10.png","element":"img","alt":" ET π1 ","inline":true,"padRight":true},{"text":"are of the same order in the sense that (i) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":"’s are at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"text":", both upper bounds are ","element":"span"},{"style":{"height":24.24},"width":662.83,"height":60.61,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-11.png","element":"img","alt":"˜O�(1/K)·�L−Ki=1 ∆−2σ(i) +∆−2σ(L−1)�; (ii)","inline":true,"padRight":true},{"text":"if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":"’s are at least ","element":"span"},{"style":{"height":24.24},"width":508.02,"height":60.61,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-12.png","element":"img","alt":" 1/2, both are ˜O� �L−1i=1 ∆−2σ(i)�.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Specialization to the case of two click probabilities","element":"span"},{"text":". We consider a simplified scenario with the following assumption; this allows us to present the upper bound on the time complexity with greater clarity.","element":"span"}],[{"id":"id-37","style":{"fontWeight":"bold"},"text":"Assumption 4.5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"With ","element":"span"},{"style":{"height":13.39},"width":301.68,"height":33.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-13.png","element":"img","alt":" 0 < w′ < w∗ ≤ 1","inline":true},{"style":{"fontStyle":"italic"},"text":", the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"style":{"fontStyle":"italic"},"text":"optimal and ","element":"span"},{"style":{"height":10.8},"width":115.26,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-14.png","element":"img","alt":" L − K","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"suboptimal items have click probabilities ","element":"span"},{"style":{"height":10.98},"width":45.6,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-15.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":6.8},"width":43.6,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-16.png","element":"img","alt":" w′ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"respectively.","element":"span"}],[{"id":"id-39","style":{"fontWeight":"bold"},"text":"Proposition 4.6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under Assumption ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"4.5","element":"a"},{"style":{"fontStyle":"italic"},"text":", (i) if ","element":"span"},{"style":{"height":13.39},"width":172.69,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-17.png","element":"img","alt":" 0 < w∗ ≤","inline":true,"padRight":true},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":89.32,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-18.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-19.png","element":"img","alt":"S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"84%"},"width":795,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-20.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps; (ii) if ","element":"span"},{"style":{"height":16},"width":250.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-21.png","element":"img","alt":" 1/K < w∗ ≤ 1","inline":true},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":14},"width":98.77,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-22.png","element":"img","alt":" 1 − δ,","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-23.png","element":"img","alt":" S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"99%"},"width":936,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-24.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps.","element":"span"}],[{"text":"In the second case, if ","element":"span"},{"style":{"height":17.38},"width":389.17,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-25.png","element":"img","alt":" L ≥ w∗(w∗ − w′)2/w′2","inline":true},{"text":", the first term dominates the bound. For instance, ","element":"span"},{"style":{"height":18.3},"width":201.18,"height":45.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-26.png","element":"img","alt":" w′ ≥ 1/√L","inline":true,"padRight":true},{"text":"satisfies this condition.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.7. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under Assumption ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"4.5","element":"a"},{"style":{"fontStyle":"italic"},"text":", (i) if ","element":"span"},{"style":{"height":16},"width":215.15,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/4-27.png","element":"img","alt":" 0 w′ ≥ 1/2,","inline":true}],[{"style":{"width":"80%"},"width":757,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"The upper bounds above are achieved by Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1","element":"a"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"id":"id-55","text":"In the first case, the gap between the upper and lower bounds ","element":"span"},{"text":"is manifested in the terms ","element":"span"},{"style":{"height":17.39},"width":207.8,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-7.png","element":"img","alt":" 1/K and w′2","inline":true},{"text":". In the second case, the gap is manifested in ","element":"span"},{"style":{"height":16},"width":323.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-8.png","element":"img","alt":" w∗ and w′(1 − w∗).","inline":true}]]},{"heading":"5. Proof sketch","paragraphs":[[{"id":"id-33","style":{"fontWeight":"bold"},"text":"5.1. Analysis of partial feedback for cascading bandits","element":"span"}],[{"text":"At a high level, the time complexity ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"can be established by analyzing ","element":"span"},{"style":{"height":23.62},"width":187.29,"height":59.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-9.png","element":"img","alt":"�Tt=1 Xˆkt;t","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":18.4},"width":84.2,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-10.png","element":"img","alt":" Xˆkt;t","inline":true},{"text":". The first term is de- ","element":"span"},{"text":"termined by ","element":"span"},{"style":{"height":18.42},"width":59.01,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-11.png","element":"img","alt":"¯Ti,δ","inline":true},{"text":"’s, the number of observations that guarantees the correct identification of items with high probability. These ","element":"span"},{"style":{"height":18.42},"width":59.02,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-12.png","element":"img","alt":"¯Ti,δ","inline":true},{"text":"’s are invariant to the scenario whether the agent receives semi-bandit or partial feedback from the user. The second term ","element":"span"},{"style":{"height":22.61},"width":281.38,"height":56.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-13.png","element":"img","alt":" Xˆkt;t equals to ˆkt","inline":true,"padRight":true},{"text":"in the semi-bandit feedback setting while it is an r.v. in the partial feedback setting. Since ","element":"span"},{"style":{"height":18.42},"width":59.02,"height":46.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-14.png","element":"img","alt":"¯Ti,δ","inline":true},{"text":"’s have already been studied by a number of works on the semi-bandit feedback (","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"Even-Dar et al.","element":"a"},{"href":"#id-2","referenceIndex":11,"text":", ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"2002","element":"a"},{"text":"; ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al.","element":"a"},{"href":"#id-27","referenceIndex":16,"text":", ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":"), the crucial challenge of analyzing cascading bandits is to estimate ","element":"span"},{"style":{"height":18.4},"width":84.2,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-15.png","element":"img","alt":" Xˆkt;t","inline":true,"padRight":true},{"text":"probabilitistically.","element":"span"}],[{"text":"According to Algorithm ","element":"span"},{"href":"#id-24","text":"1","element":"a"},{"text":", ","element":"span"},{"style":{"height":19.01},"width":515.81,"height":47.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-16.png","element":"img","alt":"ˆkt = min{K, |Dt|}. When ˆkt =","inline":true},{"style":{"height":16},"width":158.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-17.png","element":"img","alt":"K ≤ |Dt|","inline":true},{"text":", the agent pulls ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"surviving (i.e., not identified) items. Otherwise, the agent pulls all surviving items first and then complements ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-18.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"with some identified items. In the cascading bandit setting, the agent observes only one item when the first item ","element":"span"},{"style":{"height":16.94},"width":29.73,"height":42.35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-19.png","element":"img","alt":" it1 ","inline":true,"padRight":true},{"text":"is clicked, and the corresponding ","element":"span"},{"text":"probability is ","element":"span"},{"style":{"height":16.99},"width":92.7,"height":42.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-20.png","element":"img","alt":" w(it1)","inline":true},{"text":"; the agent observes two items when ","element":"span"},{"style":{"height":16.94},"width":29.73,"height":42.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-21.png","element":"img","alt":"it1","inline":true,"padRight":true},{"text":"is not clicked but ","element":"span"},{"style":{"height":16.94},"width":29.73,"height":42.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-22.png","element":"img","alt":" it2","inline":true,"padRight":true},{"text":"is clicked, and the probability is ","element":"span"},{"style":{"height":16.98},"width":275.68,"height":42.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-23.png","element":"img","alt":"[1 − w(it1)]w(it2)","inline":true},{"text":"; and so on. Therefore,","element":"span"}],[{"style":{"width":"89%"},"width":843,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-24.png","element":"img"}],[{"id":"id-62","text":"Since ","element":"span"},{"style":{"height":18.4},"width":110.77,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-25.png","element":"img","alt":" EXˆkt;t","inline":true,"padRight":true},{"text":"depends only on ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-26.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"(the pulled arm at step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") and ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-27.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"is learnt online, it is difficult to estimate ","element":"span"},{"style":{"height":18.4},"width":110.77,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-28.png","element":"img","alt":" EXˆkt;t","inline":true,"padRight":true},{"text":"for each step separately. Therefore, the second best thing one can do is to bound ","element":"span"},{"style":{"height":18.4},"width":110.77,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-29.png","element":"img","alt":" EXˆkt;t","inline":true,"padRight":true},{"text":"as a function of ","element":"span"},{"style":{"height":17.4},"width":32.75,"height":43.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-30.png","element":"img","alt":"ˆkt","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w","element":"span"},{"text":". We now present some properties of ","element":"span"},{"style":{"height":18.4},"width":122.8,"height":46.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-31.png","element":"img","alt":" EXˆkt;t.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Theorem 5.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider a set of items with weights ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"u ","element":"span"},{"text":"= ","element":"span"},{"id":"id-40","style":{"height":16},"width":203.16,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-32.png","element":"img","alt":"(u1, . . . , uk)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that ","element":"span"},{"style":{"height":12.8},"width":235.78,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-33.png","element":"img","alt":" u1 ≥ . . . ≥ uk","inline":true},{"style":{"fontStyle":"italic"},"text":", and let ","element":"span"},{"style":{"height":16},"width":140.61,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-34.png","element":"img","alt":" µk(u, I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the expected number of observations when items are placed with order ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I","element":"span"},{"style":{"fontStyle":"italic"},"text":". Let ","element":"span"},{"style":{"height":16},"width":567.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-35.png","element":"img","alt":" Idec = (1, . . . , k), Iinc = (k, . . . , 1)","inline":true},{"style":{"fontStyle":"italic"},"text":", and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be any order, then","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"(i) boundedness: ","element":"span"},{"style":{"height":16},"width":603.49,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-36.png","element":"img","alt":" µk(u, Idec)≤ µk(u, I) ≤ µk(u, Iinc);","inline":true}],[{"id":"id-58","style":{"fontStyle":"italic"},"text":"(ii) monotonicity: let ","element":"span"},{"style":{"height":16},"width":260.09,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-37.png","element":"img","alt":" v=(v1, . . . , vk)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be another vector of weights, then ","element":"span"},{"style":{"height":16},"width":689.34,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-38.png","element":"img","alt":" µk(u, I)≥µk(v, I) if ui ≤vi for all i∈[k].","inline":true}],[{"text":"Theorem ","element":"span"},{"href":"#id-40","text":"5.1 ","element":"a"},{"text":"implies that when ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"w ","element":"span"},{"text":"is fixed, ","element":"span"},{"style":{"height":15.59},"width":98.22,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-39.png","element":"img","alt":" EXk;t","inline":true,"padRight":true},{"text":"attains its minimum when the agent pulls items ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , k ","element":"span"},{"text":"in this order and attains its maximum when the agent pulls ","element":"span"},{"style":{"height":14},"width":111.71,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-40.png","element":"img","alt":" L, L −","inline":true},{"style":{"height":14},"width":296.52,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-41.png","element":"img","alt":"1, . . . , L − K + 1","inline":true,"padRight":true},{"text":"in this order. Moreover, if ","element":"span"},{"style":{"height":16},"width":181.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-42.png","element":"img","alt":" w(i) = w∗","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"height":16.79},"width":243.2,"height":41.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-43.png","element":"img","alt":" i ∈ [k], EXk;t","inline":true,"padRight":true},{"text":"is even smaller; if ","element":"span"},{"style":{"height":16},"width":189.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-44.png","element":"img","alt":" w(j) = w′","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"height":16.79},"width":537.88,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-45.png","element":"img","alt":" j ∈ {L − k + 1, . . . , L}, EXk;t","inline":true,"padRight":true},{"text":"is even larger. This observation inspires Lemma ","element":"span"},{"href":"#id-41","text":"5.2","element":"a"},{"text":".","element":"span"}],[{"id":"id-41","style":{"fontWeight":"bold"},"text":"Lemma 5.2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k, t","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"width":"93%"},"width":877,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-46.png","element":"img"}],[{"id":"id-28","text":"Next, since ","element":"span"},{"style":{"height":15.59},"width":71.65,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-47.png","element":"img","alt":" Xk;t","inline":true},{"text":", instead of ","element":"span"},{"style":{"height":15.59},"width":98.22,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-48.png","element":"img","alt":" EXk;t","inline":true},{"text":", affects the dynamics, we examine the gap between ","element":"span"},{"style":{"height":17.6},"width":174.73,"height":44.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-49.png","element":"img","alt":"�nt=1 Xk;t","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":17.6},"width":201.31,"height":44.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-50.png","element":"img","alt":" �nt=1 EXk;t","inline":true},{"text":". ","element":"span"},{"text":"Clearly, a tight concentration inequality is essential to estimate this gap well. Since ","element":"span"},{"style":{"height":15.59},"width":71.66,"height":38.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-51.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"is a bounded r.v., there are some applicable Bernstein-type inequalities. For instance, we can apply Azuma’s inequality to analyze SG r.v.’s. However, (i) it is challenging to find an SG parameter of ","element":"span"},{"style":{"height":15.59},"width":71.65,"height":38.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-52.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"that is good enough for our purpose (a more detailed explanation is provided after Lemma ","element":"span"},{"href":"#id-42","text":"5.6","element":"a"},{"text":"), and (ii) we only require a one-sided concentration inequality. Hence, we resort to defining a new class of r.v.’s — known as LSG r.v.’s — and provide an estimate of the relevant LSG parameter.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 5.3 ","element":"span"},{"text":"(LSG)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"An r.v. ","element":"span"},{"style":{"height":14},"width":361.87,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-53.png","element":"img","alt":" X is v-LSG (v ≥ 0) if","inline":true}],[{"style":{"width":"82%"},"width":773,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-54.png","element":"img"}],[{"id":"id-6","style":{"fontWeight":"bold"},"text":"Theorem 5.4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be an almost surely bounded nonnegative r.v.. If ","element":"span"},{"style":{"height":15.79},"width":468.37,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-55.png","element":"img","alt":" EX2 ≤ v2, then X is v-LSG.","inline":true}],[{"text":"Furthermore, we bound ","element":"span"},{"style":{"height":20.3},"width":98.22,"height":50.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-56.png","element":"img","alt":" EX2k;t","inline":true,"padRight":true},{"text":"(Lemma ","element":"span"},{"href":"#id-43","text":"5.5","element":"a"},{"text":") and adapt a ","element":"span"},{"text":"variation of Azuma’s inequality as in Theorem ","element":"span"},{"href":"#id-44","text":"B.1 ","element":"a"},{"text":"(","element":"span"},{"href":"#id-45","referenceIndex":29,"text":"Shamir","element":"a"},{"href":"#id-45","referenceIndex":29,"text":", ","element":"a"},{"href":"#id-45","referenceIndex":29,"text":"2011","element":"a"},{"text":") to evaluate the dependence between the number of observations and the number of time steps.","element":"span"}],[{"id":"id-43","style":{"fontWeight":"bold"},"text":"Lemma 5.5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":20.3},"width":597.56,"height":50.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-57.png","element":"img","alt":" k, t, EX2k;t ≤ v2k = min{k2, 2/w′2 }.","inline":true}],[{"id":"id-42","style":{"fontWeight":"bold"},"text":"Lemma 5.6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":14.8},"width":229.22,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-58.png","element":"img","alt":" k, t, δ > 0, set","inline":true}],[{"style":{"width":"82%"},"width":774,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/5-59.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"then ","element":"span"},{"style":{"height":16},"width":195.78,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-0.png","element":"img","alt":" Pr(E∗) ≤ δ","inline":true},{"style":{"fontStyle":"italic"},"text":". Further when ","element":"span"},{"style":{"height":10.8},"width":40.6,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-1.png","element":"img","alt":" E∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"holds, for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T > ","element":"span"},{"text":"0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"style":{"height":17.61},"width":245.62,"height":44.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-2.png","element":"img","alt":"�nt=1 Xk;t ≤T","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"implies that ","element":"span"},{"style":{"height":17.9},"width":492.05,"height":44.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-3.png","element":"img","alt":" n≤2T/µk+2 log(1/δ)v2k/µ2k.","inline":true}],[{"text":"Lemma ","element":"span"},{"href":"#id-42","text":"5.6 ","element":"a"},{"text":"implies that with high probability, we can lower bound the amount of observations on the surviving items over the whole horizon. Subsequently, with probability at least ","element":"span"},{"style":{"height":11.6},"width":85.19,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-4.png","element":"img","alt":" 1 − δ","inline":true},{"text":", the agent would have received sufficiently many observations on the surviving items to return an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-5.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal arm after at most ","element":"span"},{"style":{"height":16},"width":389.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-6.png","element":"img","alt":" (c1N1 + c2N2 + c3N3)","inline":true,"padRight":true},{"text":"time steps (see Theorem ","element":"span"},{"href":"#id-29","text":"4.1","element":"a"},{"text":"). The lemma also indicates that a smaller LSG/SG parameter of ","element":"span"},{"style":{"height":15.59},"width":71.65,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-7.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"leads to a smaller upper bound on the number of time steps. Since we can show ","element":"span"},{"style":{"height":15.59},"width":71.66,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-8.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":9.19},"width":36.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-9.png","element":"img","alt":"vk","inline":true},{"text":"-LSG but cannot show it is ","element":"span"},{"style":{"height":9.19},"width":36.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-10.png","element":"img","alt":" vk","inline":true},{"text":"-SG (a detailed discussion is deferred to Appendix ","element":"span"},{"href":"#id-46","text":"D.9","element":"a"},{"text":"), it is beneficial to consider the class of LSG distributions for our problem. The class of LSG r.v.’s and the general estimate of the LSG parameter, which is crucial for the utilization of the concentration inequality, may be of independent interest.","element":"span"}],[{"id":"id-64","style":{"fontWeight":"bold"},"text":"5.2. Proof sketch of Theorem ","element":"span"},{"href":"#id-29","style":{"fontWeight":"bold"},"text":"4.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Concentration. ","element":"span"},{"text":"As the algorithm proceeds, the agent moves items from ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-11.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"to ","element":"span"},{"style":{"height":13.99},"width":41.89,"height":34.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-12.png","element":"img","alt":" At","inline":true,"padRight":true},{"text":"or ","element":"span"},{"style":{"height":13.19},"width":42.26,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-13.png","element":"img","alt":" Rt","inline":true,"padRight":true},{"text":"according to the con-fidence bounds of all surviving items in ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-14.png","element":"img","alt":" Dt","inline":true},{"text":". This motivates us to define a “nice event”","element":"span"}],[{"style":{"width":"83%"},"width":784,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-15.png","element":"img"}],[{"text":"To show that ","element":"span"},{"style":{"height":20.8},"width":200.22,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-16.png","element":"img","alt":"�Li=1 E(i, δ)","inline":true,"padRight":true},{"text":"holds with high probability, we ","element":"span"},{"text":"utilize Theorem ","element":"span"},{"href":"#id-47","text":"B.2 ","element":"a"},{"text":"(","element":"span"},{"href":"#id-48","referenceIndex":15,"text":"Jamieson et al.","element":"a"},{"href":"#id-48","referenceIndex":15,"text":", ","element":"a"},{"href":"#id-48","referenceIndex":15,"text":"2014","element":"a"},{"text":"; ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al.","element":"a"},{"href":"#id-27","referenceIndex":16,"text":", ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":") and the SG property of ","element":"span"},{"style":{"height":16},"width":96.9,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-17.png","element":"img","alt":" Wt(i)","inline":true,"padRight":true},{"text":"(the r.v. that reflects whether item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"is clicked at time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":").","element":"span"}],[{"id":"id-50","style":{"fontWeight":"bold"},"text":"Lemma 5.7. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":21.5},"width":595.18,"height":53.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-18.png","element":"img","alt":" δ∈[0, 1], P� �Li=1 E(i, δ)�≥ 1−δ/2.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Sufficient observations. ","element":"span"},{"text":"Next, we assume ","element":"span"},{"style":{"height":20.8},"width":200.22,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-19.png","element":"img","alt":"�Li=1 E(i, δ)","inline":true,"padRight":true},{"text":"holds and find the number of observations that guarantees the correct identification of an item. To facilitate the analysis of the expected time complexity (Proposition ","element":"span"},{"href":"#id-36","text":"4.4","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","text":"4.7","element":"a"},{"text":"), we assume ","element":"span"},{"style":{"height":20.8},"width":211.4,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-20.png","element":"img","alt":"�Li=1 E(i, δ′)","inline":true,"padRight":true},{"text":"holds for a fixed ","element":"span"},{"style":{"height":16},"width":178.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-21.png","element":"img","alt":" δ′ ∈ (0, δ]","inline":true,"padRight":true},{"text":"in ","element":"span"},{"text":"Lemma ","element":"span"},{"href":"#id-49","text":"5.8","element":"a"},{"text":", which generalizes ","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al. ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"(","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":", Lemma 2). ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Lemma 5.8. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix any ","element":"span"},{"style":{"height":20.8},"width":604.51,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-22.png","element":"img","alt":" 0<δ′ ≤δ, assume �Li=1 E(i, δ′) holds.","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Set ","element":"span"},{"style":{"height":16},"width":333.83,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-23.png","element":"img","alt":" T ′t := mini∈Dt Tt(i)","inline":true},{"style":{"fontStyle":"italic"},"text":", then for any time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"height":18.42},"width":920.68,"height":46.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-24.png","element":"img","alt":"∀i≤K′, T ′(t)≥ ¯Ti,δ′ ⇒ Lt(i, δ)>Ut(j∗, δ)−ϵ ⇒ i∈At,","inline":true},{"style":{"height":18.42},"width":913.91,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/6-25.png","element":"img","alt":"∀i>K′, T ′(t)≥ ¯Ti,δ′ ⇒ Ut(i, δ) 0; i.e. EeλXi ≤ eλ2σ22 . Let ω ∈ (0,�1/6). Then,","inline":true}],[{"style":{"width":"50%"},"width":987,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-7.png","element":"img"}]]},{"heading":"C. Inﬂuence of ϵ","paragraphs":[[{"text":"In general, a larger ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-8.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"indicates a smaller time complexity. Here are two explanations. (i) When ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-9.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"grows, ","element":"span"},{"style":{"height":14.74},"width":50.7,"height":36.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-10.png","element":"img","alt":" K′ϵ","inline":true},{"text":", the number ","element":"span"},{"text":"of ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-11.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal items also grows. Then it should be easier to identify an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-12.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal arm. (ii) If ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-13.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"is sufficiently large such that ","element":"span"},{"style":{"height":14.74},"width":227.52,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-14.png","element":"img","alt":" K′ϵ ≥ 2K − 1","inline":true},{"text":", then there are at least ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"items left in the survival set ","element":"span"},{"style":{"height":13.19},"width":45,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-15.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"before the algorithm stops. Otherwise, when ","element":"span"},{"style":{"height":16},"width":158.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-16.png","element":"img","alt":"|Dt| < K","inline":true},{"text":", the agent pulls ","element":"span"},{"style":{"height":16},"width":158.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/12-17.png","element":"img","alt":" |Dt| < K","inline":true,"padRight":true},{"text":"surviving items at some steps and this results in a wastage in the number of time steps.","element":"span"}],[{"id":"id-31","style":{"fontWeight":"bold"},"text":"Proposition C.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":13.2},"width":229.17,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-0.png","element":"img","alt":" K′ ≥ 2K − 1","inline":true},{"style":{"fontStyle":"italic"},"text":". With probability at least ","element":"span"},{"style":{"height":11.6},"width":88.44,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-1.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-2.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":"-optimal arm after at most ","element":"span"},{"style":{"height":16},"width":250.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-3.png","element":"img","alt":" (c1N ′1 + c2N ′2)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"steps where","element":"span"}],[{"style":{"width":"99%"},"width":1945,"height":419,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-4.png","element":"img"}]]},{"heading":"D. Proofs of main results","paragraphs":[[{"text":"In this Section, we provide proofs of Proposition ","element":"span"},{"href":"#id-61","text":"4.2","element":"a"},{"text":", Corollary ","element":"span"},{"href":"#id-35","text":"4.3","element":"a"},{"text":", Propositions ","element":"span"},{"href":"#id-36","text":"4.4","element":"a"},{"text":", ","element":"span"},{"href":"#id-39","text":"4.6","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","text":"4.7","element":"a"},{"text":", Corollary ","element":"span"},{"href":"#id-62","text":"4.9","element":"a"},{"text":", Theorem ","element":"span"},{"href":"#id-40","text":"5.1","element":"a"},{"text":", ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":", Lemmas ","element":"span"},{"href":"#id-41","text":"5.2 ","element":"a"},{"text":"– ","element":"span"},{"href":"#id-63","text":"5.11","element":"a"},{"text":", and complete the proof of Theorems ","element":"span"},{"href":"#id-29","text":"4.1","element":"a"},{"text":", ","element":"span"},{"href":"#id-51","text":"4.8","element":"a"},{"text":", Proposition ","element":"span"},{"href":"#id-31","text":"C.1 ","element":"a"},{"text":"in this order.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"D.1. Proof of Proposition ","element":"span"},{"href":"#id-61","style":{"fontWeight":"bold"},"text":"4.2","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":13.6},"width":443,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-5.png","element":"img","alt":" 0 < w′ < w∗ ≤ 1. We have","inline":true}],[{"style":{"width":"39%"},"width":368,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"According to Lemma ","element":"span"},{"href":"#id-41","text":"5.2 ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"40%"},"width":381,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-7.png","element":"img"}],[{"text":"We upper bound ","element":"span"},{"style":{"height":16},"width":262.87,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-8.png","element":"img","alt":" vk/µk and k/µk","inline":true,"padRight":true},{"text":"in two cases:","element":"span"}],[{"text":"(i): ","element":"span"},{"style":{"height":16},"width":941.18,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-9.png","element":"img","alt":" 0 < w∗ ≤ 1/K: 0 < w′ < w∗ ≤ 1/K, vk = k, µk ≥ k/2.","inline":true}],[{"style":{"width":"66%"},"width":624,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-10.png","element":"img"}],[{"text":"(ii): ","element":"span"},{"style":{"height":17.99},"width":745.37,"height":44.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-11.png","element":"img","alt":" 1/K < w∗ ≤ 1: vk ≤√2/w′, µk ≥ 1/(2w∗).","inline":true}],[{"style":{"width":"83%"},"width":786,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.2. Proof of Corollary ","element":"span"},{"href":"#id-35","style":{"fontWeight":"bold"},"text":"4.3","element":"a"}],[{"style":{"width":"99%"},"width":937,"height":422,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/13-13.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps; (ii) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.64,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-0.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-1.png","element":"img","alt":" S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"48%"},"width":935,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-2.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"According to Lemma ","element":"span"},{"href":"#id-41","text":"5.2 ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"40%"},"width":381,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-3.png","element":"img"}],[{"text":"We first upper bound ","element":"span"},{"style":{"height":16},"width":262.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-4.png","element":"img","alt":" vk/µk and k/µk","inline":true,"padRight":true},{"text":"in two cases:","element":"span"}],[{"text":"(i): ","element":"span"},{"style":{"height":16},"width":941.18,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-5.png","element":"img","alt":" 0 < w∗ ≤ 1/K: 0 < w′ < w∗ ≤ 1/K, vk = k, µk ≥ k/2.","inline":true}],[{"style":{"width":"26%"},"width":245,"height":92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-6.png","element":"img"}],[{"text":"(ii): ","element":"span"},{"style":{"height":17.99},"width":745.37,"height":44.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-7.png","element":"img","alt":" 1/K < w∗ ≤ 1: vk ≤√2/w′, µk ≥ 1/(2w∗).","inline":true}],[{"text":"Next, we separate the upper bound in Theorem ","element":"span"},{"href":"#id-29","text":"4.1 ","element":"a"},{"text":"into two parts and bound them separately:","element":"span"}],[{"style":{"width":"97%"},"width":1891,"height":364,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Case 1: All click probabilities ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"style":{"fontWeight":"bold"},"text":"are at most ","element":"span"},{"style":{"height":16},"width":769.57,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-9.png","element":"img","alt":" 1/K. 1/w∗ ≥ K and vk/µk ≤ 2, K1 = K − 1.","inline":true}],[{"style":{"width":"77%"},"width":1516,"height":235,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Case 2: All click probabilities ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"style":{"fontWeight":"bold"},"text":"are at least ","element":"span"},{"style":{"height":17.99},"width":780.35,"height":44.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-11.png","element":"img","alt":" 1/2. 1/w∗ ≤ K implies vk/µk ≤ 4√2, K1 ≥ 1.","inline":true}],[{"style":{"width":"73%"},"width":1429,"height":656,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/14-12.png","element":"img"}],[{"style":{"width":"61%"},"width":1201,"height":260,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-0.png","element":"img"}],[{"text":"Recall that when ","element":"span"},{"style":{"height":13.2},"width":99.24,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-1.png","element":"img","alt":" ϵ = 0,","inline":true}],[{"style":{"width":"29%"},"width":279,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-2.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":16},"width":111.62,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-3.png","element":"img","alt":" i ∈ [L]","inline":true},{"text":". Altogether, we complete the proof.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"D.3. Proof of Proposition ","element":"span"},{"href":"#id-36","style":{"fontWeight":"bold"},"text":"4.4","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(i) If all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"width":"47%"},"width":448,"height":215,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"(ii) if all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":"’s are at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(i) Consider all click probabilities ","element":"span"},{"style":{"height":16},"width":88.33,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-5.png","element":"img","alt":" w(i)′","inline":true},{"style":{"fontWeight":"bold"},"text":"’s are at most ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/K","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"For any ","element":"span"},{"style":{"height":14},"width":179.44,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-6.png","element":"img","alt":" 0 < δ′ ≤ δ","inline":true},{"text":", revisit the proof and result of Theorem ","element":"span"},{"href":"#id-29","text":"4.1","element":"a"},{"text":". First, Lemma ","element":"span"},{"href":"#id-50","text":"5.7 ","element":"a"},{"text":"implies that ","element":"span"},{"style":{"height":28.8},"width":494.31,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-7.png","element":"img","alt":" P��Li=1 E(ϵ, δ′)�≥ 1 − δ′/2","inline":true},{"text":". Assume ","element":"span"},{"style":{"height":20.8},"width":213.85,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-8.png","element":"img","alt":"�Li=1 E(ϵ, δ′)","inline":true,"padRight":true},{"text":"holds from now on. ","element":"span"},{"text":"Secondly, Lemma ","element":"span"},{"href":"#id-49","text":"5.8 ","element":"a"},{"text":"indicates that Algorithm ","element":"span"},{"href":"#id-24","text":"1 ","element":"a"},{"text":"can correctly identify item ","element":"span"},{"style":{"height":18.42},"width":164.84,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-9.png","element":"img","alt":" i after ¯Ti,δ","inline":true,"padRight":true},{"text":"observations. Thirdly, similar to the discussion in Section ","element":"span"},{"href":"#id-64","text":"5.2","element":"a"},{"text":", we set ","element":"span"},{"style":{"height":20.4},"width":281,"height":50.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-10.png","element":"img","alt":"�K−1k=1 δk ≤ δ′/2","inline":true},{"text":". Additionally applying the analysis in Proposition ","element":"span"},{"href":"#id-61","text":"4.2 ","element":"a"},{"text":"and Corollary ","element":"span"},{"href":"#id-35","text":"4.3","element":"a"},{"text":", ","element":"span"},{"text":"with probability at least ","element":"span"},{"style":{"height":11.6},"width":101.85,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-11.png","element":"img","alt":" 1 − δ′","inline":true},{"text":", we can bound the time complexity by","element":"span"}],[{"style":{"width":"97%"},"width":1889,"height":563,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-12.png","element":"img"}],[{"text":"In short, set","element":"span"}],[{"style":{"width":"74%"},"width":702,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-13.png","element":"img"}],[{"text":"then ","element":"span"},{"style":{"height":16},"width":717.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/15-14.png","element":"img","alt":" Pr(T > −A log δ′) < δ′ for any 0 ≤ δ′ ≤ δ.","inline":true}],[{"text":"Meanwhile, Tonelli’s Theorem implies that","element":"span"}],[{"text":"Since ","element":"span"},{"style":{"height":21.77},"width":1248.37,"height":54.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-0.png","element":"img","alt":" x = −A log δ implies δ = e−x/A and� +∞0 e−x/A dx = Ae−x/A|0x=+∞ = A,","inline":true}],[{"style":{"width":"83%"},"width":1615,"height":245,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-1.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"(ii) Consider all click probabilities ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"style":{"fontWeight":"bold"},"text":"’s are at least ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"text":"The analysis is similar to that in Case (i). With the analysis in Theorem ","element":"span"},{"href":"#id-29","text":"4.1 ","element":"a"},{"text":"and results in Proposition ","element":"span"},{"href":"#id-61","text":"4.2 ","element":"a"},{"text":"and Corollary ","element":"span"},{"href":"#id-35","text":"4.3","element":"a"},{"text":", for any ","element":"span"},{"style":{"height":14},"width":178.3,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-2.png","element":"img","alt":" 0 < δ′ ≤ δ","inline":true},{"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":102.39,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-3.png","element":"img","alt":" 1 − δ′","inline":true},{"text":", the time complexity is at most","element":"span"}],[{"style":{"width":"99%"},"width":1941,"height":921,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-4.png","element":"img"}],[{"id":"id-65","style":{"fontWeight":"bold"},"text":"D.4. Proof of Proposition ","element":"span"},{"href":"#id-39","style":{"fontWeight":"bold"},"text":"4.6","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under Assumption ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"4.5","element":"a"},{"style":{"fontStyle":"italic"},"text":", (i) if ","element":"span"},{"style":{"height":16},"width":249.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-5.png","element":"img","alt":" 0 < w∗ ≤ 1/K","inline":true},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":86.52,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-6.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":14.18},"width":130.45,"height":35.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-7.png","element":"img","alt":" S∗ after","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"at most","element":"span"}],[{"style":{"width":"40%"},"width":795,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-8.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps; (ii) if ","element":"span"},{"style":{"height":16},"width":250.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-9.png","element":"img","alt":" 1/K < w∗ ≤ 1","inline":true},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.63,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-10.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs ","element":"span"},{"style":{"height":10.98},"width":42.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-11.png","element":"img","alt":" S∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"after at most","element":"span"}],[{"style":{"width":"49%"},"width":954,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"style":{"fontWeight":"bold"},"text":"We first remind ourselves how the algorithm proceeds. ","element":"span"},{"text":"In this instance, ","element":"span"},{"style":{"height":10.8},"width":97.22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-13.png","element":"img","alt":" ϵ = 0","inline":true,"padRight":true},{"text":"yields ","element":"span"},{"style":{"height":10.99},"width":136.06,"height":27.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-14.png","element":"img","alt":" K∗ = 1","inline":true},{"text":". For any item ","element":"span"},{"style":{"height":17.63},"width":467.91,"height":44.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-15.png","element":"img","alt":"i ∈ [L], ¯∆i = ∆i = w∗ − w′","inline":true},{"text":". And according to Lemma ","element":"span"},{"href":"#id-49","text":"5.8","element":"a"},{"text":", item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"will be correctly classified with high probability after ","element":"span"},{"style":{"height":16.02},"width":34.28,"height":40.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-16.png","element":"img","alt":"¯Ti","inline":true,"padRight":true},{"text":"observations where ","element":"span"},{"style":{"height":16},"width":220.85,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-17.png","element":"img","alt":" ρ = δ/(12L),","inline":true}],[{"style":{"width":"61%"},"width":1206,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/16-18.png","element":"img"}],[{"style":{"width":"43%"},"width":848,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-0.png","element":"img"}],[{"text":"This implies that each item requires the same number of observations to be correctly identified. According to the design of algorithm, ","element":"span"},{"style":{"height":16},"width":499.65,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-1.png","element":"img","alt":" Tt(j) − 1 ≤ Tt(i) ≤ Tt(j) + 1","inline":true,"padRight":true},{"text":"for any remaining items ","element":"span"},{"style":{"height":15.2},"width":83.86,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-2.png","element":"img","alt":" i ̸= j","inline":true},{"text":". Therefore, the worst case is as follows:","element":"span"}],[{"text":"• the agent observes one item for ","element":"span"},{"style":{"height":19.31},"width":71.55,"height":48.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-3.png","element":"img","alt":"¯T(w)","inline":true,"padRight":true},{"text":"times and the others for ","element":"span"},{"style":{"height":19.31},"width":142.74,"height":48.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-4.png","element":"img","alt":"¯T(w) − 1","inline":true,"padRight":true},{"text":"times after ","element":"span"},{"style":{"height":10},"width":28.39,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-5.png","element":"img","alt":" t′ ","inline":true,"padRight":true},{"text":"steps, and identifies one item per step for the subsequent ","element":"span"},{"style":{"height":14.4},"width":195.42,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-6.png","element":"img","alt":" L − 2 steps.","inline":true}],[{"text":"Therefore, we now turn to upper bounding the number of steps required to eliminate an item for the first time. According to Lemma ","element":"span"},{"href":"#id-42","text":"5.6","element":"a"},{"text":", we set ","element":"span"},{"style":{"height":19.2},"width":875.55,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-7.png","element":"img","alt":" δ0 = δ/2, k = K, n = t′, ω′K = −�−2t′v2K log(δ/2)","inline":true},{"text":". Then the total number of observations during ","element":"span"},{"style":{"height":10},"width":28.39,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-8.png","element":"img","alt":" t′","inline":true,"padRight":true},{"text":"steps should be larger than ","element":"span"},{"style":{"height":14.39},"width":181.69,"height":35.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-9.png","element":"img","alt":" t′µK + ω′K ","inline":true,"padRight":true},{"text":"with probability at least ","element":"span"},{"style":{"height":16},"width":127.65,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-10.png","element":"img","alt":" 1 − δ/2","inline":true},{"text":". The number of observations can be upper bounded ","element":"span"},{"text":"as follows:","element":"span"}],[{"style":{"width":"50%"},"width":987,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-11.png","element":"img"}],[{"text":"Then with Lemma ","element":"span"},{"href":"#id-50","text":"5.7 ","element":"a"},{"text":"and its ensuing discussion in Section ","element":"span"},{"href":"#id-64","text":"5.2","element":"a"},{"text":", with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.54,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-12.png","element":"img","alt":" 1 − δ","inline":true},{"text":", the time complexity is upper bounded by","element":"span"}],[{"style":{"width":"32%"},"width":627,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-13.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Next, we consider how the values of ","element":"span"},{"style":{"height":11.78},"width":175.69,"height":29.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-14.png","element":"img","alt":" w∗ and w′ ","inline":true,"padRight":true},{"style":{"fontWeight":"bold"},"text":"affect the bound. ","element":"span"},{"text":"According to Lemma ","element":"span"},{"href":"#id-41","text":"5.2 ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"43%"},"width":838,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-15.png","element":"img"}],[{"text":"We discuss two cases separately:","element":"span"}],[{"text":"Case 1: ","element":"span"},{"style":{"height":16},"width":980.66,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-16.png","element":"img","alt":" 0 < w∗ ≤ 1/K: 0 < w′ < w∗ ≤ 1/K, vK = K, µK ≥ K/2","inline":true},{"text":". The upper bound becomes:","element":"span"}],[{"style":{"width":"77%"},"width":1513,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-17.png","element":"img"}],[{"text":"Case 2: ","element":"span"},{"style":{"height":17.99},"width":735.87,"height":44.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-18.png","element":"img","alt":" 1/K < w∗ ≤ 1: vk ≤√2/w′, µk ≥ 1/(2w∗)","inline":true},{"text":". The bound becomes","element":"span"}],[{"style":{"width":"99%"},"width":1930,"height":208,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-19.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.5. Proof of Proposition ","element":"span"},{"href":"#id-38","style":{"fontWeight":"bold"},"text":"4.7","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4.7. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under Assumption ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"4.5","element":"a"},{"style":{"fontStyle":"italic"},"text":", (i) if ","element":"span"},{"style":{"height":16},"width":215.15,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/17-20.png","element":"img","alt":" 0 −A log δ′) < δ′","inline":true},{"text":". Meanwhile, Tonelli’s Theorem implies that","element":"span"}],[{"style":{"width":"81%"},"width":1595,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-5.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":21.77},"width":1248.37,"height":54.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-6.png","element":"img","alt":" x = −A log δ implies δ = e−x/A and� +∞0 e−x/A dx = Ae−x/A|0x=+∞ = A,","inline":true}],[{"style":{"width":"82%"},"width":1611,"height":218,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-7.png","element":"img"}],[{"text":"Case 2: ","element":"span"},{"style":{"height":16},"width":461.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-8.png","element":"img","alt":" 1/2 ≤ w′ < 1 or w∗/w′ ≤ 2","inline":true},{"text":": with a similar analysis, for any ","element":"span"},{"style":{"height":14},"width":266.66,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-9.png","element":"img","alt":" 0 < δ ≤ δ′, with","inline":true}],[{"style":{"width":"81%"},"width":1586,"height":216,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-10.png","element":"img"}],[{"style":{"height":16},"width":527.62,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-11.png","element":"img","alt":"Pr(T > −A log δ′) < δ′. Lastly,","inline":true}],[{"style":{"width":"46%"},"width":432,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.6. Proof of Corollary ","element":"span"},{"href":"#id-62","style":{"fontWeight":"bold"},"text":"4.9","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Corollary 4.9. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under Assumption ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"4.5","element":"a"},{"style":{"fontStyle":"italic"},"text":", we have","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"First, by setting ","element":"span"},{"style":{"height":16},"width":1005.58,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-13.png","element":"img","alt":" w(i) = w∗ for all 1 ≤ i ≤ K and w(j) = w′ for all k < j ≤ L","inline":true},{"text":", the result in Theorem ","element":"span"},{"href":"#id-51","text":"4.8 ","element":"a"},{"text":"becomes","element":"span"}],[{"style":{"width":"78%"},"width":1535,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-14.png","element":"img"}],[{"text":"Next, according to Pinsker’s and reverse Pinsker’s inequality for any two distributions ","element":"span"},{"style":{"fontStyle":"italic"},"text":"P ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Q ","element":"span"},{"text":"defined in the same finite space ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"we have","element":"span"}],[{"style":{"width":"33%"},"width":644,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/18-15.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":20.03},"width":1189.8,"height":50.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-0.png","element":"img","alt":" δ(P, Q) = sup{|P(A)−Q(A)|��A ⊂ X} and αQ = minx∈X:Q(x)>0 Q(x)","inline":true},{"text":". In our case, set ","element":"span"},{"style":{"height":17.38},"width":387.43,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-1.png","element":"img","alt":" δ(w∗, w′) = (w∗ −w′)2","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":1037.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-2.png","element":"img","alt":" α = min{w′, w∗, 1 − w∗, 1 − w′} = min{w′, 1 − w∗}, we have","inline":true}],[{"style":{"width":"65%"},"width":1273,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-3.png","element":"img"}],[{"text":"Further since ","element":"span"},{"style":{"height":16},"width":191.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-4.png","element":"img","alt":" ˜µK ≤ 1/w′ ","inline":true,"padRight":true},{"text":"as stated by Lemma ","element":"span"},{"href":"#id-41","text":"5.2","element":"a"},{"text":", the lower bound becomes","element":"span"}],[{"style":{"width":"69%"},"width":1346,"height":156,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.7. Proof of Theorem ","element":"span"},{"href":"#id-40","style":{"fontWeight":"bold"},"text":"5.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 5.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider a set of items with weights ","element":"span"},{"style":{"height":16},"width":287.39,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-6.png","element":"img","alt":" u = (u1, . . . , uk)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that ","element":"span"},{"style":{"height":12.8},"width":241.16,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-7.png","element":"img","alt":" u1 ≥ . . . ≥ uk","inline":true},{"style":{"fontStyle":"italic"},"text":", and let ","element":"span"},{"style":{"height":16},"width":140.61,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-8.png","element":"img","alt":" µk(u, I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the expected number of observations when items are placed with order ","element":"span"},{"style":{"height":16},"width":886.62,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-9.png","element":"img","alt":" I. Let Idec = (1, . . . , k), Iinc = (k, . . . , 1), and I be any","inline":true}],[{"style":{"fontStyle":"italic"},"text":"order, then","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"(i) Consider any ordered set ","element":"span"},{"style":{"height":17.9},"width":1019.49,"height":44.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-10.png","element":"img","alt":" I = (iI1, . . . , iIk). To show µk(u, Idec) ≤ µk(u, I) ≤ µk(u, Iinc)","inline":true},{"text":", it is sufficient to show ","element":"span"},{"text":"the following:","element":"span"}],[{"style":{"height":16},"width":61.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-11.png","element":"img","alt":"(∗):","inline":true,"padRight":true},{"text":"If there exists ","element":"span"},{"style":{"height":13.2},"width":202.78,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-12.png","element":"img","alt":" 1 ≤ m < k","inline":true,"padRight":true},{"text":"such that ","element":"span"},{"style":{"height":17.84},"width":219.01,"height":44.59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-13.png","element":"img","alt":" uiIm < uiIm+1","inline":true},{"text":", we can change their positions to get ","element":"span"},{"style":{"height":10.8},"width":34.64,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-14.png","element":"img","alt":" I′","inline":true,"padRight":true},{"text":"and have ","element":"span"},{"style":{"height":16},"width":187.33,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-15.png","element":"img","alt":" µk(u, I) >","inline":true},{"style":{"height":16},"width":161.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-16.png","element":"img","alt":"µk(u, I′).","inline":true}],[{"text":"The proof of ","element":"span"},{"style":{"height":16},"width":51.42,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-17.png","element":"img","alt":" (∗)","inline":true,"padRight":true},{"text":"is as follows:","element":"span"}],[{"style":{"width":"51%"},"width":481,"height":82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-18.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"µ","element":"span"},{"style":{"height":16},"width":424.07,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-19.png","element":"img","alt":"k(u, I) − µk(u, I′) = m ·","inline":true}],[{"style":{"width":"94%"},"width":886,"height":274,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-20.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"µ","element":"span"},{"style":{"height":16},"width":424.07,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-21.png","element":"img","alt":"k(u, I) − µk(u, I′) = m ·","inline":true}],[{"text":"(ii) To show the monotonicity, it is sufficient to show the following:","element":"span"}],[{"text":"(#)","element":"span"},{"text":": Set two sets of click probabilities ","element":"span"},{"style":{"fontStyle":"italic"},"text":"u","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"such that ","element":"span"},{"style":{"height":15.13},"width":171.26,"height":37.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-22.png","element":"img","alt":" viIm > uiIm","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":13.2},"width":182.18,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-23.png","element":"img","alt":" 1 ≤ m ≤ k","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":15.84},"width":148.81,"height":39.59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-24.png","element":"img","alt":" viIj = uiIj","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":15.2},"width":106.82,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-25.png","element":"img","alt":" j ̸= m","inline":true},{"text":". Then we ","element":"span"},{"text":"have ","element":"span"},{"style":{"height":16},"width":340.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-26.png","element":"img","alt":" µk(u, I) ≥ µk(v, I).","inline":true}],[{"text":"Here is the proof of ","element":"span"},{"text":"(#)","element":"span"},{"text":". If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":", then obviously we have ","element":"span"},{"style":{"height":16},"width":570.89,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-27.png","element":"img","alt":" µk(u, I) = µk(v, I). If 1 ≤ m < k","inline":true},{"text":", we exchange positions of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"-th and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"+ 1)","element":"span"},{"text":"-th item to get a new ordered set ","element":"span"},{"style":{"height":13.6},"width":124.02,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-28.png","element":"img","alt":" I1, then","inline":true}],[{"style":{"width":"93%"},"width":1824,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/19-29.png","element":"img"}],[{"text":"Hence","element":"span"}],[{"text":"If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"+ 1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"< k","element":"span"},{"text":", note that the only difference between ","element":"span"},{"style":{"height":16},"width":297.65,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-0.png","element":"img","alt":" (u, I1) and (v, I1)","inline":true,"padRight":true},{"text":"now lies in the click probability of the ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"+ 1)","element":"span"},{"text":"-th item. In detail,","element":"span"}],[{"style":{"width":"49%"},"width":957,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-1.png","element":"img"}],[{"text":"We exchange positions of the ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"+ 1)","element":"span"},{"text":"-th and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"+ 2)","element":"span"},{"text":"-th item in ","element":"span"},{"style":{"height":13.19},"width":33.52,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-2.png","element":"img","alt":" I1","inline":true,"padRight":true},{"text":"to get a new ordered set ","element":"span"},{"style":{"height":13.19},"width":33.52,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-3.png","element":"img","alt":" I2","inline":true},{"text":". Similarly we have","element":"span"}],[{"style":{"width":"82%"},"width":1612,"height":380,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-4.png","element":"img"}],[{"text":"We can continue this operation for ","element":"span"},{"style":{"height":10.8},"width":182.74,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-5.png","element":"img","alt":" n = k − m","inline":true,"padRight":true},{"text":"times and get ","element":"span"},{"style":{"height":13.19},"width":37.52,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-6.png","element":"img","alt":" In","inline":true},{"text":". Iteratively, we have ","element":"span"},{"style":{"height":16},"width":577.46,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-7.png","element":"img","alt":" µk(u, I) − µk(v, I) ≥ µk(u, In) −","inline":true},{"style":{"height":16},"width":156.1,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-8.png","element":"img","alt":"µk(v, In)","inline":true},{"text":". Besides, the only difference between ","element":"span"},{"style":{"height":16},"width":305,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-9.png","element":"img","alt":" (u, In) and (v, In)","inline":true,"padRight":true},{"text":"now lies in the click probability of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"-th item:","element":"span"}],[{"style":{"width":"42%"},"width":831,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-10.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"fontStyle":"italic"},"text":"µ","element":"span"},{"style":{"height":16},"width":703.58,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-11.png","element":"img","alt":"k(u, In) = µk(v, In), µk(u, I) ≥ µk(v, I).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"D.8. Proof of Lemma ","element":"span"},{"href":"#id-41","style":{"fontWeight":"bold"},"text":"5.2","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 5.2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k, t","element":"span"},{"style":{"fontStyle":"italic"},"text":",","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Lower bound. ","element":"span"},{"text":"According to Lemma ","element":"span"},{"href":"#id-40","text":"5.1","element":"a"},{"text":", the expectation of observations attains its minimum when we pull an ordered set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , k","element":"span"},{"style":{"fontStyle":"italic"},"text":"}","element":"span"},{"text":", and attains its maximum when we pull an ordered set ","element":"span"},{"style":{"height":16},"width":506.34,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-12.png","element":"img","alt":" {L − k + 1, L − k + 2, . . . , L}","inline":true},{"text":". In other words, depending on the instance, the expectation of observations can be lower bounded as follows:","element":"span"}],[{"style":{"width":"53%"},"width":1045,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-13.png","element":"img"}],[{"text":"Moreover, since the lower bound ","element":"span"},{"style":{"height":10},"width":41.02,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-14.png","element":"img","alt":" µk","inline":true,"padRight":true},{"text":"is larger than the expectation of observations when ","element":"span"},{"style":{"height":16},"width":551.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-15.png","element":"img","alt":" w(i) = w∗ for all 1 ≤ i ≤ k or we","inline":true,"padRight":true},{"text":"pull item ","element":"span"},{"text":"1 ","element":"span"},{"text":"for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"times (note that this is not allowed in Algorithm ","element":"span"},{"href":"#id-24","text":"1","element":"a"},{"text":"), we can utilize only ","element":"span"},{"style":{"height":10.99},"width":45.6,"height":27.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-16.png","element":"img","alt":" w∗ ","inline":true,"padRight":true},{"text":"to lower bound the expectation:","element":"span"}],[{"style":{"width":"45%"},"width":892,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-17.png","element":"img"}],[{"text":"then","element":"span"}],[{"style":{"width":"76%"},"width":718,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/20-18.png","element":"img"}],[{"style":{"width":"93%"},"width":1813,"height":392,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-0.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":17.79},"width":304.67,"height":44.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-1.png","element":"img","alt":" w∗ = k−β ∈ [0, 1]","inline":true},{"text":", then ","element":"span"},{"style":{"height":14.4},"width":101.22,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-2.png","element":"img","alt":" β ≥ 0","inline":true},{"text":". Since ","element":"span"},{"style":{"height":16},"width":181.63,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-3.png","element":"img","alt":" (1 − 1/x)x","inline":true,"padRight":true},{"text":"is a nondecreasing function of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":16},"width":438.07,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-4.png","element":"img","alt":" limx→∞(1 − 1/x)x = 1/e","inline":true},{"text":", ","element":"span"},{"style":{"height":16.19},"width":167.35,"height":40.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-5.png","element":"img","alt":"k1−β ≥ 0,","inline":true}],[{"style":{"width":"61%"},"width":1189,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-6.png","element":"img"}],[{"text":"If ","element":"span"},{"style":{"height":18.19},"width":1083.25,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-7.png","element":"img","alt":" β ≥ 1, let f(x) = e−x, then f (n)(x) = (−1)n · e−x. For any x ≥ 0","inline":true},{"text":", there exists ","element":"span"},{"style":{"height":16},"width":305.89,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-8.png","element":"img","alt":" y ∈ [0, x] such that","inline":true}],[{"style":{"width":"78%"},"width":1533,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-9.png","element":"img"}],[{"text":"This leads to ","element":"span"},{"style":{"height":16},"width":437.9,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-10.png","element":"img","alt":" 1 − e−x ≥ x(1 − x/2) and","inline":true}],[{"style":{"width":"44%"},"width":413,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-11.png","element":"img"}],[{"text":"Otherwise, ","element":"span"},{"style":{"height":14.4},"width":281.93,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-12.png","element":"img","alt":" 0 ≤ β < 1. Since","inline":true}],[{"style":{"width":"72%"},"width":677,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-13.png","element":"img"}],[{"style":{"height":16.6},"width":180.86,"height":41.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-14.png","element":"img","alt":"1 − e−k1−β ","inline":true,"padRight":true},{"text":"decreases as ","element":"span"},{"style":{"height":14.4},"width":23,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-15.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"increases. Then,","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Upper bound. ","element":"span"},{"text":"Similarly we can see that the expectation of observations attains its maximum when we pull an ordered set ","element":"span"},{"style":{"height":16},"width":435.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-16.png","element":"img","alt":"{L, L − 1, . . . , L − k + 1}","inline":true},{"text":", and therefore upper bounded by","element":"span"}],[{"style":{"width":"76%"},"width":1479,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-17.png","element":"img"}],[{"text":"Furthermore, the upper bound ","element":"span"},{"style":{"height":14},"width":41.02,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-18.png","element":"img","alt":" ˜µk","inline":true,"padRight":true},{"text":"is smaller than the expectation of observations when ","element":"span"},{"style":{"height":16},"width":592.37,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-19.png","element":"img","alt":" w(j) = w′ for all L − k + 1 ≤ j ≤ L","inline":true,"padRight":true},{"text":"or we pull item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"L ","element":"span"},{"text":"for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"times (again note that this is not allowed in Algorithm ","element":"span"},{"href":"#id-24","text":"1","element":"a"},{"text":"):","element":"span"}],[{"style":{"width":"74%"},"width":1455,"height":209,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-20.png","element":"img"}],[{"id":"id-46","style":{"fontWeight":"bold"},"text":"D.9. Proof of Theorem ","element":"span"},{"href":"#id-6","style":{"fontWeight":"bold"},"text":"5.4","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 5.4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be an almost surely bounded nonnegative r.v.. If ","element":"span"},{"style":{"height":15.78},"width":468.38,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-21.png","element":"img","alt":" EX2 ≤ v2, then X is v-LSG.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Set ","element":"span"},{"style":{"height":14},"width":141.75,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-22.png","element":"img","alt":" EX = µ","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":13.2},"width":208.14,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-23.png","element":"img","alt":" 0 ≤ X ≤ M","inline":true,"padRight":true},{"text":"a.s., then ","element":"span"},{"style":{"height":13.2},"width":118.04,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-24.png","element":"img","alt":" M ≥ 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":14},"width":196.01,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-25.png","element":"img","alt":" 0 ≤ µ ≤ M","inline":true},{"text":". It is equivalent to show that for any ","element":"span"},{"style":{"height":15.78},"width":154.5,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-26.png","element":"img","alt":" v ≥ EX2","inline":true},{"text":", ","element":"span"},{"style":{"height":13.2},"width":106.31,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-27.png","element":"img","alt":"λ ≤ 0,","inline":true}],[{"style":{"width":"29%"},"width":579,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/21-28.png","element":"img"}],[{"text":"Set","element":"span"}],[{"text":"it is further equivalent to show ","element":"span"},{"style":{"height":16},"width":151.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-0.png","element":"img","alt":" f(λ) ≥ 0","inline":true},{"text":". Then since ","element":"span"},{"style":{"height":13.2},"width":204.34,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-1.png","element":"img","alt":" 0 ≤ X ≤ M","inline":true,"padRight":true},{"text":"a.s., for any ","element":"span"},{"style":{"height":13.2},"width":96.38,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-2.png","element":"img","alt":" λ ≤ 0","inline":true},{"text":", by Bounded Convergence Theorem,","element":"span"}],[{"style":{"width":"94%"},"width":1847,"height":543,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-3.png","element":"img"}],[{"text":"Therefore,","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":10},"width":24,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-4.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"be the probability measure of ","element":"span"},{"style":{"height":16},"width":832.3,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-5.png","element":"img","alt":" X on R. Since 0 ≤ X ≤ M a.s., µ([0, M]) = 1 and","inline":true}],[{"style":{"width":"68%"},"width":1329,"height":617,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-6.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":17.39},"width":516.31,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-7.png","element":"img","alt":" E[exp(λX)])2 > 0, g′(λ) ≤ 0","inline":true},{"text":". Hence ","element":"span"},{"style":{"height":16},"width":75.18,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-8.png","element":"img","alt":" g(λ)","inline":true,"padRight":true},{"text":"is monotonically decreasing on ","element":"span"},{"text":"R","element":"span"},{"text":". Further, for any ","element":"span"},{"style":{"height":13.2},"width":109.55,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-9.png","element":"img","alt":" λ ≤ 0","inline":true},{"text":", since ","element":"span"},{"style":{"height":15.78},"width":170.47,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-10.png","element":"img","alt":"v2 ≥ EX2","inline":true}],[{"style":{"width":"93%"},"width":1812,"height":209,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/22-11.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Given ","element":"span"},{"style":{"height":15.78},"width":174.46,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-0.png","element":"img","alt":" v2 ≥ EX2","inline":true},{"style":{"fontWeight":"bold"},"text":", it is more challenging to show ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"style":{"fontWeight":"bold"},"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"style":{"fontWeight":"bold"},"text":"-SG than to show ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"style":{"fontWeight":"bold"},"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"style":{"fontWeight":"bold"},"text":"-LSG. ","element":"span"},{"text":"By revisiting the proof above, we see that given ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"-LSG, to show ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"-SG suffices to show ","element":"span"},{"style":{"height":16},"width":380.69,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-1.png","element":"img","alt":" f(λ) ≥ 0 for any λ ≥ 0","inline":true},{"text":". Since it is hard to directly tell whether the inequality above holds for any ","element":"span"},{"style":{"height":13.2},"width":96.38,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-2.png","element":"img","alt":" λ ≥ 0","inline":true},{"text":", it is natural to look at how ","element":"span"},{"style":{"height":16},"width":274.12,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-3.png","element":"img","alt":" f(λ) grows in R.","inline":true}],[{"text":"Fix any ","element":"span"},{"style":{"height":16},"width":471.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-4.png","element":"img","alt":" M0 > 0. For any λ ∈ [0, M0]","inline":true},{"text":", again, applying the Bounded Convergence Theorem, we have","element":"span"}],[{"style":{"width":"70%"},"width":1381,"height":319,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-5.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":16},"width":262.89,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-6.png","element":"img","alt":" f(0) = 0 and f ′ ","inline":true,"padRight":true},{"text":"is differentiable on ","element":"span"},{"text":"R","element":"span"},{"text":", it requires at least ","element":"span"},{"style":{"height":16},"width":700.51,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-7.png","element":"img","alt":" r > 0 such that f ′(λ) ≥ 0 for any λ ∈ [0, r]","inline":true},{"text":". Furthermore, since ","element":"span"},{"style":{"height":16},"width":159.07,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-8.png","element":"img","alt":" f ′(0) = 0","inline":true},{"text":", one may consider showing that ","element":"span"},{"style":{"height":16},"width":320.1,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-9.png","element":"img","alt":" f ′′(λ) ≥ 0 on [0, r].","inline":true}],[{"text":"In the proof above, we define a function ","element":"span"},{"style":{"fontStyle":"italic"},"text":"g ","element":"span"},{"text":"to show that ","element":"span"},{"style":{"height":16},"width":316.83,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-10.png","element":"img","alt":" f ′′(λ) ≥ g(λ) ≥ 0","inline":true,"padRight":true},{"text":"on ","element":"span"},{"style":{"height":16},"width":134.98,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-11.png","element":"img","alt":" (−∞, 0]","inline":true},{"text":". However, since ","element":"span"},{"style":{"height":16},"width":156.55,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-12.png","element":"img","alt":" g(λ) ≤ 0","inline":true,"padRight":true},{"text":"on ","element":"span"},{"style":{"height":16},"width":135.55,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-13.png","element":"img","alt":"[0, +∞)","inline":true},{"text":", this cannot help to show ","element":"span"},{"style":{"height":16},"width":320.11,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-14.png","element":"img","alt":" f ′′(λ) ≥ 0 on [0, r].","inline":true}],[{"text":"The discussion above indicates that showing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"-SG is more challenging than showing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"-LSG.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"D.10. Proof of Lemma ","element":"span"},{"href":"#id-43","style":{"fontWeight":"bold"},"text":"5.5","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 5.5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":20.3},"width":599.8,"height":50.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-15.png","element":"img","alt":" k, t, EX2k;t ≤ v2k = min{k2, 2/w′2 }.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Recall ","element":"span"},{"style":{"height":6.8},"width":43.6,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-16.png","element":"img","alt":" w′","inline":true,"padRight":true},{"text":"is the minimum click probability. We abbreviate ","element":"span"},{"style":{"height":15.59},"width":71.66,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-17.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". Firstly, since ","element":"span"},{"style":{"height":17.38},"width":358.42,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-18.png","element":"img","alt":" X ∈ [1, k], EX2 ≤ k2","inline":true},{"text":". Next, note that ","element":"span"},{"style":{"height":13.38},"width":78.7,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-19.png","element":"img","alt":" EX2","inline":true,"padRight":true},{"text":"increases when the click probabilities decrease or ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"increases. Set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Y ","element":"span"},{"text":"as a random variable drawn from a geometric distribution with parameter ","element":"span"},{"style":{"height":17.39},"width":1074.68,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-20.png","element":"img","alt":" w′, then EX2 ≤ EY 2. Since EY 2 = 2/w′2 − 1/w, EX2 ≤ 2/w′2.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"D.11. Proof of Lemma ","element":"span"},{"href":"#id-42","style":{"fontWeight":"bold"},"text":"5.6","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 5.6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":14.8},"width":229.22,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-21.png","element":"img","alt":" k, t, δ > 0, set","inline":true}],[{"style":{"fontStyle":"italic"},"text":"then ","element":"span"},{"style":{"height":16},"width":188.78,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-22.png","element":"img","alt":" Pr(E∗) ≤ δ","inline":true},{"style":{"fontStyle":"italic"},"text":". Further when ","element":"span"},{"style":{"height":10.8},"width":40.6,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-23.png","element":"img","alt":" E∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"holds, for any ","element":"span"},{"style":{"height":17.6},"width":354.14,"height":44.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-24.png","element":"img","alt":" T >0, �nt=1 Xk;t ≤T","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"implies that ","element":"span"},{"style":{"height":17.9},"width":492.06,"height":44.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-25.png","element":"img","alt":" n≤2T/µk+2 log(1/δ)v2k/µ2k.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"We abbreviate ","element":"span"},{"style":{"height":15.59},"width":171.72,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-26.png","element":"img","alt":" Xk;t as Xt","inline":true,"padRight":true},{"text":"(the number of observations of surviving items at step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"when pulling ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"surviving items), and set ","element":"span"},{"style":{"height":13.6},"width":330.2,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-27.png","element":"img","alt":" Dt = Xt − EXt, Ft","inline":true,"padRight":true},{"text":"denote the decisions and observations up to step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Besides, let ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-28.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"be the set to pull at step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", then ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-29.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"is determined by ","element":"span"},{"style":{"height":13.6},"width":215.91,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-30.png","element":"img","alt":" Ft−1, and Xt","inline":true,"padRight":true},{"text":"depends on ","element":"span"},{"style":{"height":13.19},"width":149.64,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-31.png","element":"img","alt":" St. Since","inline":true}],[{"style":{"width":"39%"},"width":776,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-32.png","element":"img"}],[{"style":{"height":14},"width":184.42,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-33.png","element":"img","alt":"D1, . . . , Dt","inline":true,"padRight":true},{"text":"i s a martingale difference sequence adapted to ","element":"span"},{"style":{"height":16},"width":171.47,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-34.png","element":"img","alt":" F = (Ft)t","inline":true},{"text":". Besides, according to Theorem ","element":"span"},{"href":"#id-6","text":"5.4","element":"a"},{"text":", for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", any ","element":"span"},{"style":{"height":20.71},"width":796.94,"height":51.77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-35.png","element":"img","alt":"λ ≤ 0, v2k ≥ EX2 yields E[eλDt|Ft−1] ≤ eλ2v2/2","inline":true},{"text":". Then for any ","element":"span"},{"style":{"height":13.2},"width":109.3,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-36.png","element":"img","alt":" ω > 0,","inline":true}],[{"style":{"width":"61%"},"width":1205,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-37.png","element":"img"}],[{"text":"Let the probability bound in the right hand side be ","element":"span"},{"style":{"height":14},"width":107.84,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-38.png","element":"img","alt":" δ, then","inline":true}],[{"style":{"width":"37%"},"width":355,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/23-39.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":14.4},"width":328.06,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-0.png","element":"img","alt":" EXt ≥ µk for any t,","inline":true}],[{"style":{"width":"62%"},"width":586,"height":262,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-1.png","element":"img"}],[{"text":"Next, for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T > ","element":"span"},{"text":"0","element":"span"},{"text":", consider","element":"span"}],[{"style":{"width":"22%"},"width":207,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-2.png","element":"img"}],[{"text":"Set","element":"span"}],[{"text":"then ","element":"span"},{"style":{"height":15.79},"width":481.2,"height":39.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-3.png","element":"img","alt":" x ≥ 0 and x2 − a0x − b0 ≤ 0","inline":true},{"text":". Note that ","element":"span"},{"style":{"height":17.39},"width":374.63,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-4.png","element":"img","alt":" (p + q)2 ≤ 2(p2 + q2),","inline":true}],[{"style":{"width":"77%"},"width":1510,"height":324,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.12. Proof of Lemma ","element":"span"},{"href":"#id-50","style":{"fontWeight":"bold"},"text":"5.7","element":"a"}],[{"id":"id-66","style":{"fontStyle":"italic"},"text":"Proof of Remark ","element":"span"},{"href":"#id-66","style":{"fontStyle":"italic"},"text":"D.1","element":"a"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"Any non-negative random variable bounded in ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b","element":"span"},{"text":"] ","element":"span"},{"text":"a.s. is sub-Gaussian with parameter ","element":"span"},{"style":{"height":16},"width":167.76,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-6.png","element":"img","alt":" (b − a)/2.","inline":true,"padRight":true},{"text":"Meanwhile, ","element":"span"},{"style":{"height":16},"width":224.73,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-7.png","element":"img","alt":" Wt(i) ∈ [0, 1]","inline":true,"padRight":true},{"text":"yields that ","element":"span"},{"style":{"height":16},"width":888.58,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-8.png","element":"img","alt":" ηt(i) ∈ [w(i) − 1, w(i)]. [w(i) − (w(i) − 1)]/2 = 1/2.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"For all ","element":"span"},{"style":{"height":16},"width":867.98,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-9.png","element":"img","alt":" i ∈ [L], E(i, δ) = {∀t ≥ 1, | ˆwt(i) − w(i)| ≤ Ct(i, δ)}","inline":true},{"text":". Recall that","element":"span"}],[{"style":{"width":"67%"},"width":1311,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-10.png","element":"img"}],[{"text":"then according to Theorem ","element":"span"},{"href":"#id-47","text":"B.2","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"74%"},"width":702,"height":222,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-11.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.13. Proof of Lemma ","element":"span"},{"href":"#id-49","style":{"fontWeight":"bold"},"text":"5.8","element":"a"}],[{"style":{"width":"100%"},"width":938,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/24-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Preliminary","element":"span"},{"text":". Since we use the UCB of the empirical top-","element":"span"},{"style":{"height":16},"width":134.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-0.png","element":"img","alt":"(kt + 1)","inline":true,"padRight":true},{"text":"item to accept ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-1.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal items, hopefully it should be close to the true click probability of item ","element":"span"},{"style":{"height":16},"width":134.9,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-2.png","element":"img","alt":" (kt + 1)","inline":true},{"text":"; likewise, the LCB of the empirical top-","element":"span"},{"style":{"height":16},"width":66.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-3.png","element":"img","alt":"(kt)","inline":true,"padRight":true},{"text":"item should be close to the true click probability of item ","element":"span"},{"style":{"height":16},"width":66.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-4.png","element":"img","alt":" (kt)","inline":true},{"text":". This is stated in Lemma ","element":"span"},{"href":"#id-67","text":"D.2","element":"a"},{"text":".","element":"span"}],[{"id":"id-67","style":{"fontWeight":"bold"},"text":"Lemma D.2 ","element":"span"},{"text":"(","element":"span"},{"href":"#id-27","referenceIndex":16,"text":"Jun et al. ","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"(","element":"a"},{"href":"#id-27","referenceIndex":16,"text":"2016","element":"a"},{"text":", Lemma 3))","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Denote by ","element":"span"},{"style":{"height":14.45},"width":20,"height":36.13,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-5.png","element":"img","alt":"ˆi","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the index of the item with empirical mean is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":"-th largest: i.e., ","element":"span"},{"style":{"height":18.83},"width":332,"height":47.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-6.png","element":"img","alt":"ˆw(ˆ1) ≥ . . . ≥ ˆw(ˆL).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Assume that the empirical means of the arms are controlled by ","element":"span"},{"style":{"height":16},"width":571.71,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-7.png","element":"img","alt":" ϵ : i.e., ∀i, | ˆw(i) − w(i)| < ϵ. Then,","inline":true}],[{"style":{"width":"26%"},"width":521,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-8.png","element":"img"}],[{"text":"After that, Lemma ","element":"span"},{"href":"#id-49","text":"5.8 ","element":"a"},{"text":"shows that the agent will correctly classify the items after a sufficient number of observations, and also show what is the sufficient number of observations for each item.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Recall","element":"span"}],[{"text":"And We use ","element":"span"},{"style":{"height":14.4},"width":132.68,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-9.png","element":"img","alt":" ρ and ρ′ ","inline":true,"padRight":true},{"text":"as abbreviations for ","element":"span"},{"style":{"height":16},"width":230.79,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-10.png","element":"img","alt":" ρ(δ) and ρ(δ′)","inline":true,"padRight":true},{"text":"respectively.","element":"span"}],[{"text":"It suffices to show for the case where ","element":"span"},{"style":{"height":13.99},"width":161.89,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-11.png","element":"img","alt":" At and Rt","inline":true,"padRight":true},{"text":"are empty since otherwise the problem is equivalent to removing rejected or accepted arms from consideration and starting a new problem with ","element":"span"},{"style":{"height":16},"width":385.46,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-12.png","element":"img","alt":" Lnew = L − |At| − |Rt|","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":286.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-13.png","element":"img","alt":" Knew = K − |At|","inline":true,"padRight":true},{"text":"while maintaining the observations so far. Note that when ","element":"span"},{"style":{"height":15.2},"width":341.99,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-14.png","element":"img","alt":" At is empty, kt = K.","inline":true}],[{"id":"id-68","text":"First of all, ","element":"span"},{"style":{"height":16},"width":178,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-15.png","element":"img","alt":" Tt(i) ≥ T ′t ","inline":true,"padRight":true},{"text":"implies that","element":"span"}],[{"text":"Then since ","element":"span"},{"style":{"height":20.8},"width":1226.89,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-16.png","element":"img","alt":"�Li=1 E(i, δ′) holds, | ˆwt(i)−w(i)| ≤ ˜C (Tt(i), ρ′) ≤ ˜C (T ′t, ρ′) for all i ∈ Dt","inline":true},{"text":". Combining this with Lemma ","element":"span"},{"href":"#id-67","text":"D.2","element":"a"},{"text":", ","element":"span"},{"text":"we have","element":"span"}],[{"id":"id-69","style":{"width":"73%"},"width":1427,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-17.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"We first prove that for any ","element":"span"},{"style":{"height":14},"width":124.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-18.png","element":"img","alt":" i ≤ K′,","inline":true}],[{"text":"For clarity, we write ","element":"span"},{"style":{"height":14.18},"width":195.58,"height":35.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-19.png","element":"img","alt":" j∗ = �K + 1","inline":true},{"text":", which is the item with ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"+ 1)","element":"span"},{"text":"-th largest empirical mean at the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":"-th step. We assume the contrary: ","element":"span"},{"style":{"height":16},"width":459.14,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-20.png","element":"img","alt":" Lt(i, δ) ≤ Ut( �K + 1, δ) − ϵ","inline":true},{"text":". We can apply (","element":"span"},{"href":"#id-68","text":"D.1","element":"a"},{"text":") and (","element":"span"},{"href":"#id-69","text":"D.2","element":"a"},{"text":") to obtain","element":"span"}],[{"style":{"width":"73%"},"width":1430,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-21.png","element":"img"}],[{"text":"Next,","element":"span"}],[{"text":"Part (a) of the second line above follows from: (i) if ","element":"span"},{"style":{"height":16},"width":676.99,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-22.png","element":"img","alt":" i ≤ K, w(i)−w(K +1)+ϵ = ∆i +ϵ > 0","inline":true},{"text":"; (ii) else, ","element":"span"},{"style":{"height":13.2},"width":303.23,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-23.png","element":"img","alt":" K < i ≤ K′, since","inline":true},{"style":{"height":16},"width":1951.21,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-24.png","element":"img","alt":"w(i) ≥ w(K) − ϵ, we have w(i) − w(K + 1) + ϵ = w(i) − w(K) + w(K) − w(K + 1) + ϵ = ∆K − ∆i + ϵ ≥ ∆K > 0.","inline":true,"padRight":true},{"text":"Then invert to the third line using","element":"span"}],[{"style":{"width":"43%"},"width":842,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/25-25.png","element":"img"}],[{"style":{"width":"88%"},"width":1715,"height":299,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-0.png","element":"img"}],[{"text":"Therefore, ","element":"span"},{"style":{"height":18.42},"width":162.66,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-1.png","element":"img","alt":"¯T ′t ≥ ¯Ti,δ′","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":23.76},"width":1096.35,"height":59.41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-2.png","element":"img","alt":" Lt(i, δ) > Ut(j∗, δ) − ϵ where j∗ = arg max(kt+1)j∈Dt ˆwt. Then i ∈ At","inline":true,"padRight":true},{"text":"is accepted.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Subsequently, we prove that for any ","element":"span"},{"style":{"height":14},"width":124.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-3.png","element":"img","alt":" i > K′,","inline":true}],[{"text":"Again for brevity, we write ","element":"span"},{"style":{"height":18.03},"width":124.14,"height":45.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-4.png","element":"img","alt":"ˆK = j′","inline":true},{"text":", the item with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"text":"-th largest empirical mean at the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":"-th step. We assume the contrary: ","element":"span"},{"style":{"height":18.83},"width":390.51,"height":47.07,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-5.png","element":"img","alt":"Ut(i, δ) ≥ Lt( ˆK, δ) − ϵ","inline":true},{"text":". Again applying (","element":"span"},{"href":"#id-68","text":"D.1","element":"a"},{"text":") and (","element":"span"},{"href":"#id-69","text":"D.2","element":"a"},{"text":"), we have","element":"span"}],[{"style":{"width":"64%"},"width":1260,"height":115,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-6.png","element":"img"}],[{"text":"Next,","element":"span"}],[{"style":{"width":"79%"},"width":741,"height":154,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-7.png","element":"img"}],[{"text":"Similar to the first case, with","element":"span"}],[{"style":{"width":"100%"},"width":939,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.14. Proof of Lemma ","element":"span"},{"href":"#id-30","style":{"fontWeight":"bold"},"text":"5.9","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 5.9. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":20.8},"width":200.22,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-9.png","element":"img","alt":"�Li=1 E(i, δ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"holds. Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"stops after identifying at most ","element":"span"},{"style":{"height":16},"width":462.63,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-10.png","element":"img","alt":" L − max{K′ − K, 1} items.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Assume ","element":"span"},{"style":{"height":20.8},"width":306.03,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-11.png","element":"img","alt":"�Li=1 E(i, δ) holds.","inline":true,"padRight":true},{"style":{"fontWeight":"bold"},"text":"Case (i)","element":"span"},{"text":": ","element":"span"},{"style":{"height":10.8},"width":137,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-12.png","element":"img","alt":" K′ = K","inline":true},{"text":". In the worst case, the algorithm does not terminate before identifying the ","element":"span"},{"style":{"height":16},"width":127.06,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-13.png","element":"img","alt":" (L − 1)","inline":true},{"text":"-th one. In this case,","element":"span"}],[{"text":"after identifying the ","element":"span"},{"style":{"height":16},"width":127.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-14.png","element":"img","alt":" (L − 1)","inline":true},{"text":"-th one with sufficient observations, either the accept set or the reject set is full, i.e., ","element":"span"},{"style":{"height":16},"width":155.23,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-15.png","element":"img","alt":" |At| = K","inline":true,"padRight":true},{"text":"or ","element":"span"},{"style":{"height":16},"width":231.39,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-16.png","element":"img","alt":" |Rt| = L − K","inline":true},{"text":", the the agent can just place the remaining item in the unfilled set.","element":"span"}],[{"text":"Hence, the algorithm terminates after sufficiently observing and identifying at most ","element":"span"},{"style":{"height":16},"width":613.66,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-17.png","element":"img","alt":" L − 1 = L − max{K′ + K, 1} items.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Case (ii)","element":"span"},{"text":": ","element":"span"},{"style":{"height":11.6},"width":137.02,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-18.png","element":"img","alt":" K′ > K","inline":true},{"text":". The algorithm classify all items correctly according to Lemma ","element":"span"},{"href":"#id-49","text":"5.8","element":"a"},{"text":". since the number of ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-19.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal items","element":"span"}],[{"text":"is ","element":"span"},{"style":{"height":16},"width":640.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-20.png","element":"img","alt":" K′ = max{i : w(i) ≥ w(K) − ϵ} ≥ K","inline":true,"padRight":true},{"text":", the number of suboptimal items is ","element":"span"},{"href":"#id-49","style":{"height":16},"width":687.09,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-21.png","element":"img","alt":" L − K′ ≤ L − K. Hence, |Rt| ≤ L − K′.","inline":true,"padRight":true},{"text":"Besides, ","element":"span"},{"style":{"height":16},"width":155.19,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-22.png","element":"img","alt":" |At| ≤ K","inline":true,"padRight":true},{"text":"according to the design of the algorithm. Therefore,","element":"span"}],[{"style":{"width":"22%"},"width":445,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-23.png","element":"img"}],[{"text":"In other words, the algorithm terminates after sufficiently observing and identifying at most ","element":"span"},{"style":{"height":16},"width":513.3,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/26-24.png","element":"img","alt":" L − K′ + K = L − max{K′ +","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"K, ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":"} ","element":"span"},{"text":"items.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"D.15. Proof of Lemma ","element":"span"},{"href":"#id-63","style":{"fontWeight":"bold"},"text":"5.11","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 5.11. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":13.2},"width":180.92,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-0.png","element":"img","alt":" 1 ≤ ℓ ≤ L,","inline":true}],[{"text":"To manifest the difference between instance ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-1.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"and other instances, with ","element":"span"},{"style":{"height":18.18},"width":621.06,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-2.png","element":"img","alt":" w(0)(i) = w(i) for all i ∈ [L] we write","inline":true}],[{"text":"• ","element":"span"},{"style":{"height":18.18},"width":523.31,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-3.png","element":"img","alt":" {w(0)(1), w(0)(2), . . . , w(0)(L)}","inline":true,"padRight":true},{"text":"under instance ","element":"span"},{"text":"0","element":"span"},{"text":";","element":"span"}],[{"text":"• ","element":"span"},{"style":{"height":18.18},"width":284.76,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-4.png","element":"img","alt":" {w(0)(1), w(0)(2)","inline":true},{"style":{"fontStyle":"italic"},"text":", . . . , w","element":"span"},{"style":{"height":18.18},"width":501,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-5.png","element":"img","alt":"(0)(ℓ − 1), w(ℓ)(ℓ), w(0)(ℓ + 1)","inline":true},{"style":{"fontStyle":"italic"},"text":", . . . , w","element":"span"},{"style":{"height":18.18},"width":120.9,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-6.png","element":"img","alt":"(0)(L)}","inline":true,"padRight":true},{"text":"under instance ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ℓ","element":"span"},{"text":".","element":"span"}],[{"text":"We combine Lemma ","element":"span"},{"href":"#id-70","text":"5.10 ","element":"a"},{"text":"and a result from ","element":"span"},{"href":"#id-12","referenceIndex":18,"text":"Kaufmann et al. ","element":"a"},{"href":"#id-12","referenceIndex":18,"text":"(","element":"a"},{"href":"#id-12","referenceIndex":18,"text":"2016","element":"a"},{"text":") to relate the time complexity and KL divergence together. ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Lemma D.3 ","element":"span"},{"text":"((","element":"span"},{"href":"#id-12","referenceIndex":18,"text":"Kaufmann et al.","element":"a"},{"href":"#id-12","referenceIndex":18,"text":", ","element":"a"},{"href":"#id-12","referenceIndex":18,"text":"2016","element":"a"},{"text":", Lemma 19))","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be any almost surely finite stopping time with respect to ","element":"span"},{"style":{"height":13.59},"width":121.6,"height":33.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-7.png","element":"img","alt":" Ft. For","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"every event ","element":"span"},{"style":{"height":14.39},"width":128.95,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-8.png","element":"img","alt":" E ∈ FT","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":", instance ","element":"span"},{"style":{"height":13.2},"width":180.92,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-9.png","element":"img","alt":" 1 ≤ ℓ ≤ L,","inline":true}],[{"style":{"width":"52%"},"width":1016,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Notations. ","element":"span"},{"text":"Before presenting the proof, we remind the reader of the definition of the KL divergence (","element":"span"},{"href":"#id-71","referenceIndex":9,"text":"Cover & Thomas","element":"a"},{"href":"#id-71","referenceIndex":9,"text":", ","element":"a"},{"href":"#id-71","referenceIndex":9,"text":"2012","element":"a"},{"text":"). For two discrete random variables ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Y ","element":"span"},{"text":"with common support ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":",","element":"span"}],[{"style":{"width":"30%"},"width":584,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-11.png","element":"img"}],[{"text":"denotes the KL divergence between probability mass functions of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Y ","element":"span"},{"text":". Next, we also use ","element":"span"},{"style":{"height":16},"width":216.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-12.png","element":"img","alt":" KL(PX∥PY )","inline":true,"padRight":true},{"text":"to also signify this KL divergence. Lastly, when ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"b ","element":"span"},{"text":"are two real numbers between ","element":"span"},{"text":"0 ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":16},"width":604.97,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-13.png","element":"img","alt":" 1, KL(a, b) = KL (Bern(a)∥Bern(b))","inline":true},{"text":", i.e., ","element":"span"},{"text":"KL(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b","element":"span"},{"text":") ","element":"span"},{"text":"denotes the KL divergence between Bern","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"and Bern","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":")","element":"span"},{"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"For any certain ","element":"span"},{"style":{"height":9.19},"width":30.68,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-14.png","element":"img","alt":" st","inline":true},{"text":", we can observe that the KL divergence ","element":"span"},{"style":{"height":29.53},"width":696.85,"height":73.82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-15.png","element":"img","alt":" KL�POπ,0t |Sπ,0t (· | st)�� POπ,ℓt |Sπ,ℓt (· | st)�","inline":true},{"text":"should grow with the KL divergence of observed items. ","element":"span"},{"text":"Further, for each observed item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":", there is a KL divergence of ","element":"span"},{"style":{"height":20.75},"width":852.95,"height":51.87,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-16.png","element":"img","alt":"KL�w(0)(i), w(ℓ)(i)�. Whenever Sπ,0t = st, we have","inline":true}],[{"style":{"width":"87%"},"width":1700,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-17.png","element":"img"}],[{"text":"Then according to Lemma ","element":"span"},{"href":"#id-70","text":"5.10","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"77%"},"width":725,"height":754,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/27-18.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.16. Proof of Theorem ","element":"span"},{"href":"#id-29","style":{"fontWeight":"bold"},"text":"4.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Preliminary. ","element":"span"},{"text":"Recall that ","element":"span"},{"style":{"height":19.31},"width":495.81,"height":48.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-0.png","element":"img","alt":"¯∆σ(1) ≥ ¯∆σ(2) ≥ . . . ≥ ¯∆σ(L)","inline":true},{"text":", and ","element":"span"},{"style":{"height":16},"width":82.55,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-1.png","element":"img","alt":" Tt(i)","inline":true,"padRight":true},{"text":"counts the number of observations of item ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"up to the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":"-th step. The worst case is that the algorithm eliminates ","element":"span"},{"style":{"height":16},"width":232.12,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-2.png","element":"img","alt":" σ(1), σ(2), . . .","inline":true,"padRight":true},{"text":"in order, and the algorithm eliminates at most ","element":"span"},{"text":"1 ","element":"span"},{"text":"item at one time step. Besides, the design of Algorithm ","element":"span"},{"href":"#id-24","text":"1 ","element":"a"},{"text":"implies that","element":"span"}],[{"id":"id-72","style":{"width":"68%"},"width":1340,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-3.png","element":"img"}],[{"text":"In the following discussion,we assume ","element":"span"},{"style":{"height":20.8},"width":586.51,"height":51.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-4.png","element":"img","alt":"�Li=1 E(i, δ) holds and K′ < 2K −1","inline":true,"padRight":true},{"text":"(discussion on ","element":"span"},{"style":{"height":13.2},"width":218.9,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-5.png","element":"img","alt":" K′ ≥ 2K −1","inline":true,"padRight":true},{"text":"is in Appendix ","element":"span"},{"text":"C","element":"span"},{"text":"). ","element":"span"},{"text":"Note that Lemma ","element":"span"},{"href":"#id-50","text":"5.7 ","element":"a"},{"text":"implies that ","element":"span"},{"style":{"height":28.8},"width":459.16,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-6.png","element":"img","alt":" P��Li=1 E(i, δ)�≥ 1 − δ/2","inline":true},{"text":". Besides, we write ","element":"span"},{"style":{"height":18.42},"width":634.42,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-7.png","element":"img","alt":" µ(k, w) as µk, v(k, w) as vk, ¯Ti,δ as ¯Ti,","inline":true},{"style":{"height":16},"width":144.94,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-8.png","element":"img","alt":"ρ(δ) as ρ","inline":true,"padRight":true},{"text":"for simplicity.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Bound the number of observations per phrase. ","element":"span"},{"text":"Observe that there are less than ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"surviving items remaining in the survival set ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-9.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"at some steps before the algorithm terminates, we separate the steps into several phrases:","element":"span"}],[{"text":"(i) During the first phrase, the agent eliminates ","element":"span"},{"style":{"height":12},"width":181.09,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-10.png","element":"img","alt":" L − K + 1","inline":true,"padRight":true},{"text":"items within ","element":"span"},{"style":{"height":12.39},"width":30.39,"height":30.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-11.png","element":"img","alt":" t1","inline":true,"padRight":true},{"text":"steps. According to Lemma ","element":"span"},{"href":"#id-49","text":"5.8 ","element":"a"},{"text":"and Line (","element":"span"},{"href":"#id-72","text":"D.3","element":"a"},{"text":"),","element":"span"}],[{"style":{"width":"47%"},"width":915,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-12.png","element":"img"}],[{"text":"Then the total number of observations of surviving items in ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-13.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"within this phrase can be bounded as follows:","element":"span"}],[{"style":{"width":"69%"},"width":1348,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-14.png","element":"img"}],[{"text":"(ii) During the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"-th phrase for any ","element":"span"},{"style":{"height":16},"width":516.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-15.png","element":"img","alt":" 2 ≤ k ≤ K − max{K′ − K, 1}","inline":true},{"text":", the agent eliminates the ","element":"span"},{"style":{"height":12},"width":182.14,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-16.png","element":"img","alt":" L − K + k","inline":true},{"text":"-th item within ","element":"span"},{"style":{"height":12.39},"width":31.4,"height":30.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-17.png","element":"img","alt":" tk","inline":true,"padRight":true},{"text":"steps. Again apply Lemma ","element":"span"},{"href":"#id-49","text":"5.8 ","element":"a"},{"text":"and Line (","element":"span"},{"href":"#id-72","text":"D.3","element":"a"},{"text":"):","element":"span"}],[{"style":{"width":"57%"},"width":1119,"height":207,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-18.png","element":"img"}],[{"text":"Then the total number of observations of surviving items in ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-19.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"within this phrase can also be bounded:","element":"span"}],[{"style":{"width":"58%"},"width":1137,"height":192,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-20.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Bound the number of time steps per phrase. ","element":"span"},{"text":"Recall that the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"-th (","element":"span"},{"style":{"height":16},"width":516.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-21.png","element":"img","alt":"1 ≤ k ≤ K − max{K′ − K, 1}","inline":true},{"text":") phrase consist of ","element":"span"},{"style":{"height":12.39},"width":31.4,"height":30.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-22.png","element":"img","alt":" tk","inline":true,"padRight":true},{"text":"time steps. Let ","element":"span"},{"style":{"height":13.19},"width":44.2,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-23.png","element":"img","alt":" Zk","inline":true,"padRight":true},{"text":"be the total number of observations within the ","element":"span"},{"style":{"height":12.39},"width":31.39,"height":30.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-24.png","element":"img","alt":" tk","inline":true,"padRight":true},{"text":"steps. Lemma ","element":"span"},{"href":"#id-42","text":"5.6 ","element":"a"},{"text":"indicates that","element":"span"}],[{"style":{"width":"71%"},"width":1385,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-25.png","element":"img"}],[{"text":"Then according to Lemma ","element":"span"},{"href":"#id-42","text":"5.6","element":"a"},{"text":", for any ","element":"span"},{"style":{"height":16},"width":579.77,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-26.png","element":"img","alt":" k (1 ≤ k ≤ K − max{K′ − K, 1})","inline":true},{"text":", with probability at least ","element":"span"},{"style":{"height":14},"width":115.95,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-27.png","element":"img","alt":" 1 − δk,","inline":true}],[{"style":{"width":"29%"},"width":577,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-28.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Bound the time complexity. ","element":"span"},{"text":"Altogether, we would have ","element":"span"},{"style":{"height":21.49},"width":343.97,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-29.png","element":"img","alt":"�K−max{K′−K,1}k=1 tk","inline":true,"padRight":true},{"text":"as the time complexity. Besides, we bound the total error incurred by partial observation by ","element":"span"},{"style":{"height":16},"width":59.14,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-30.png","element":"img","alt":" δ/2","inline":true},{"text":". In other words,","element":"span"}],[{"style":{"width":"84%"},"width":1645,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-31.png","element":"img"}],[{"text":"Depending on the value of ","element":"span"},{"style":{"height":10.8},"width":132.58,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/28-32.png","element":"img","alt":" K′ − K","inline":true},{"text":", there are two cases:","element":"span"}],[{"style":{"width":"52%"},"width":1024,"height":116,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-0.png","element":"img"}],[{"text":"For brevity, we only go through the remaining analysis for the first case, the analysis for the second one is similar.","element":"span"}],[{"text":"Since the second term in the bound on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"merely depends on the problem, we turn to analyze the first term. Since the first term holds for any values of ","element":"span"},{"style":{"height":13.99},"width":34.71,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-1.png","element":"img","alt":" δk","inline":true},{"text":"’s such that ","element":"span"},{"style":{"height":20.4},"width":311.1,"height":50.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-2.png","element":"img","alt":"�2K−K′k=1 δk ≤ δ/2","inline":true},{"text":", we minimize the first term with the method of Lagrange","element":"span"}],[{"style":{"width":"76%"},"width":1488,"height":210,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-3.png","element":"img"}],[{"text":"Let","element":"span"}],[{"style":{"width":"65%"},"width":611,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-4.png","element":"img"}],[{"text":"then for all ","element":"span"},{"style":{"height":13.2},"width":311.4,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-5.png","element":"img","alt":" 1 ≤ k ≤ 2K − K′,","inline":true}],[{"style":{"height":16},"width":60.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-6.png","element":"img","alt":"(▲)","inline":true,"padRight":true},{"text":"attains its maximum when ","element":"span"},{"style":{"height":16.12},"width":669.99,"height":40.29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-7.png","element":"img","alt":" δk = δ∗k for all 1 ≤ k ≤ 2K − K′. Hence","inline":true}],[{"style":{"width":"71%"},"width":1390,"height":493,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-8.png","element":"img"}],[{"text":"Now we bound ","element":"span"},{"style":{"height":16},"width":226.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-9.png","element":"img","alt":" (♠), (♥), (♣)","inline":true,"padRight":true},{"text":"individually.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Bounding ","element":"span"},{"style":{"height":16},"width":62.49,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-10.png","element":"img","alt":" (♠)","inline":true,"padRight":true},{"text":": note that ","element":"span"},{"style":{"height":31.28},"width":1101.84,"height":78.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-11.png","element":"img","alt":" µK+1−k ≥ 2 for all 1 ≤ k ≤ 2K − K′, K′ ≥ K and ck =2v2K−k+1µ2K−k+1 ,","inline":true}],[{"style":{"width":"70%"},"width":1376,"height":145,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Bounding ","element":"span"},{"style":{"height":20.21},"width":1768.97,"height":50.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-13.png","element":"img","alt":" (♥) : Let g(x) = log xx for x > 0, then g′(x) = 1−log xx2 . Since g′(x) > 0 when x ∈ (0, e), g′(e) = 0, g′(x) < 0","inline":true,"padRight":true},{"text":"when ","element":"span"},{"style":{"height":16},"width":311.3,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-14.png","element":"img","alt":" x ∈ (e, +∞), g(x)","inline":true,"padRight":true},{"text":"is increasing on ","element":"span"},{"text":"(0","element":"span"},{"style":{"fontStyle":"italic"},"text":", e","element":"span"},{"text":")","element":"span"},{"text":", is decreasing on ","element":"span"},{"style":{"height":16},"width":138.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-15.png","element":"img","alt":" (e, +∞)","inline":true,"padRight":true},{"text":"and attains its global maximum ","element":"span"},{"style":{"height":19.37},"width":149.04,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-16.png","element":"img","alt":" g(e) = 1e","inline":true,"padRight":true},{"text":"at ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"e","element":"span"},{"text":". Hence,","element":"span"}],[{"style":{"width":"34%"},"width":672,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-17.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Bounding ","element":"span"},{"style":{"height":16},"width":62.49,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-18.png","element":"img","alt":" (♣)","inline":true,"padRight":true},{"text":": We first rewrite this term according to the definition of ","element":"span"},{"style":{"height":17.22},"width":80.48,"height":43.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-19.png","element":"img","alt":"˜Tk’s:","inline":true}],[{"style":{"width":"76%"},"width":722,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/29-20.png","element":"img"}],[{"style":{"width":"91%"},"width":1777,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-0.png","element":"img"}],[{"text":"Next, since ","element":"span"},{"style":{"height":16},"width":421.35,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-1.png","element":"img","alt":" µk ≥ min{k/2, 1/(2w∗)}","inline":true,"padRight":true},{"text":"as shown in Lemma ","element":"span"},{"href":"#id-41","text":"5.2","element":"a"},{"text":", when ","element":"span"},{"style":{"height":16},"width":326.94,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-2.png","element":"img","alt":" K − k + 1 ≤ 1/w∗,","inline":true}],[{"style":{"width":"82%"},"width":1605,"height":628,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-3.png","element":"img"}],[{"text":"and","element":"span"}],[{"style":{"width":"75%"},"width":712,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-4.png","element":"img"}],[{"text":"Further,","element":"span"}],[{"style":{"width":"99%"},"width":929,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Summation of ","element":"span"},{"style":{"height":16},"width":638.34,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-6.png","element":"img","alt":" (♠), (♥), (♣). Recall ρ = δ/(12L) and","inline":true}],[{"style":{"width":"31%"},"width":300,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-7.png","element":"img"}],[{"text":"The time complexity is upper bounded by","element":"span"}],[{"style":{"width":"99%"},"width":937,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-8.png","element":"img"}],[{"text":"where","element":"span"}],[{"style":{"width":"99%"},"width":933,"height":263,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/30-9.png","element":"img"}],[{"style":{"width":"83%"},"width":1616,"height":173,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-0.png","element":"img"}],[{"style":{"height":9.19},"width":75.31,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-1.png","element":"img","alt":"= c4","inline":true}],[{"style":{"width":"94%"},"width":888,"height":167,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-2.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"D.17. Proof of Theorem ","element":"span"},{"href":"#id-51","style":{"fontWeight":"bold"},"text":"4.8","element":"a"}],[{"text":"Recall that ","element":"span"},{"style":{"height":14.74},"width":53.62,"height":36.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-3.png","element":"img","alt":" Oπt","inline":true,"padRight":true},{"text":"is a vector in ","element":"span"},{"style":{"height":17.38},"width":163.04,"height":43.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-4.png","element":"img","alt":" {0, 1, ⋆}K","inline":true},{"text":", where ","element":"span"},{"style":{"height":14},"width":95.3,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-5.png","element":"img","alt":" 0, 1, ⋆","inline":true,"padRight":true},{"text":"represents observing no click, observing a click and no observation ","element":"span"},{"text":"respectively. For example, when ","element":"span"},{"style":{"height":16},"width":267.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-6.png","element":"img","alt":" Sπt = (2, 1, 5, 4)","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":275.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-7.png","element":"img","alt":" Oπt = (0, 0, 1, ⋆)","inline":true},{"text":", items ","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"5","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"4 ","element":"span"},{"text":"are listed in the displayed order; ","element":"span"},{"text":"items ","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1 ","element":"span"},{"text":"are not clicked, item 5 is clicked, and the response to item 4 is not observed. By the definition of the cascading model, the outcome ","element":"span"},{"style":{"height":16},"width":273.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-8.png","element":"img","alt":" Oπt = (0, 0, 1, ⋆)","inline":true,"padRight":true},{"text":"is in general a (possibly emtpy) string of ","element":"span"},{"text":"0","element":"span"},{"text":"s, followed by a ","element":"span"},{"text":"1 ","element":"span"},{"text":"(if the realized reward is ","element":"span"},{"text":"1","element":"span"},{"text":"), and then followed by a possibly empty string of ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-9.png","element":"img","alt":" ⋆","inline":true},{"text":"s. Clearly, ","element":"span"},{"style":{"height":16.38},"width":168.56,"height":40.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-10.png","element":"img","alt":" Sπ,ℓt , Oπ,ℓt","inline":true,"padRight":true},{"text":"are random variables with distribution depending on ","element":"span"},{"style":{"height":14.18},"width":67.39,"height":35.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-11.png","element":"img","alt":" w(ℓ)","inline":true,"padRight":true},{"text":"(hence these random variables distribute differently for different ","element":"span"},{"style":{"height":16},"width":32.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-12.png","element":"img","alt":" ℓ)","inline":true},{"text":", albeit a possibly complicated dependence on ","element":"span"},{"style":{"height":14.58},"width":79.84,"height":36.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-13.png","element":"img","alt":"w(ℓ).","inline":true}],[{"text":"With the analysis in Section ","element":"span"},{"href":"#id-73","text":"5.3","element":"a"},{"text":", according to Lemma ","element":"span"},{"href":"#id-63","text":"5.11 ","element":"a"},{"text":"and the definition of the instance ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-14.png","element":"img","alt":" ℓ","inline":true},{"text":", one obtains for ","element":"span"},{"style":{"height":16},"width":247.53,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-15.png","element":"img","alt":" i ∈ {1, . . . , K}","inline":true,"padRight":true},{"text":"or ","element":"span"},{"style":{"height":16},"width":328.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-16.png","element":"img","alt":" j ∈ {K + 1, . . . , L}","inline":true,"padRight":true},{"text":"respectively,","element":"span"}],[{"style":{"width":"64%"},"width":1253,"height":95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-17.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":13.19},"width":35.14,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-18.png","element":"img","alt":" Yt","inline":true,"padRight":true},{"text":"denote the number of observations of items at time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". By revisiting the definition of ","element":"span"},{"style":{"height":15.59},"width":71.65,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-19.png","element":"img","alt":" Xk;t","inline":true,"padRight":true},{"text":"in Section ","element":"span"},{"href":"#id-34","text":"4.1","element":"a"},{"text":", we see that ","element":"span"},{"style":{"height":15.59},"width":82.77,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-20.png","element":"img","alt":" XK;t","inline":true,"padRight":true},{"text":"actually counts the observation of all pulled items at time step ","element":"span"},{"style":{"height":15.59},"width":581.33,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-21.png","element":"img","alt":" t. Hence, Yt ≤ XK;t. Setting α → 0","inline":true,"padRight":true},{"text":"and summing over the items yields a bound on the expected number of total observations ","element":"span"},{"style":{"height":28.8},"width":510.61,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-22.png","element":"img","alt":" E��Tt=1 Yt�= �Li=1 E[TT (i)]","inline":true},{"text":". Meanwhile, an ","element":"span"},{"text":"upper bound of ","element":"span"},{"style":{"height":15.59},"width":109.34,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-23.png","element":"img","alt":" EXK;t","inline":true,"padRight":true},{"text":"as stated in Lemma ","element":"span"},{"href":"#id-41","text":"5.2 ","element":"a"},{"text":"and tower property indicates that","element":"span"}],[{"style":{"width":"84%"},"width":1641,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-24.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":16},"width":752.35,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-25.png","element":"img","alt":" KL(x, 1 − x) ≥ log(1/2.4x) for any x ∈ [0, 1]","inline":true},{"text":", we complete the proof of Theorem ","element":"span"},{"href":"#id-51","text":"4.8","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"D.18. Proof of Proposition ","element":"span"},{"href":"#id-31","style":{"fontWeight":"bold"},"text":"C.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proposition C.1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":13.2},"width":229.17,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-26.png","element":"img","alt":" K′ ≥ 2K − 1","inline":true},{"style":{"fontStyle":"italic"},"text":". With probability at least ","element":"span"},{"style":{"height":11.6},"width":88.44,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-27.png","element":"img","alt":" 1 − δ","inline":true},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-24","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"outputs an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-28.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":"-optimal arm after at most ","element":"span"},{"style":{"height":16},"width":250.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-29.png","element":"img","alt":" (c1N ′1 + c2N ′2)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"steps where","element":"span"}],[{"style":{"width":"99%"},"width":1945,"height":419,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-30.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Consider ","element":"span"},{"style":{"height":13.2},"width":591.56,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-31.png","element":"img","alt":" K′ ≥ 2K − 1, i.e, K′ − K ≥ K − 1","inline":true},{"text":". According to Lemma ","element":"span"},{"href":"#id-30","text":"5.9","element":"a"},{"text":", there are at least ","element":"span"},{"style":{"height":13.2},"width":383.53,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-32.png","element":"img","alt":" K′ − K + 1 ≥ K items","inline":true,"padRight":true},{"text":"in the survival set ","element":"span"},{"style":{"height":13.19},"width":44.99,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-33.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"before the algorithm terminates, so the algorithm pulls ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"items from the surviving set ","element":"span"},{"style":{"height":13.19},"width":45,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-34.png","element":"img","alt":" Dt","inline":true,"padRight":true},{"text":"at each time step. And for simplicity, we again write ","element":"span"},{"style":{"height":18.42},"width":799.66,"height":46.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-35.png","element":"img","alt":" µ(k, w) as µk, v(k, w) as vk, ¯Ti,δ as ¯Ti, ρ(δ) as ρ.","inline":true}],[{"text":"Recall Lemma ","element":"span"},{"href":"#id-42","text":"5.6","element":"a"},{"text":", we set ","element":"span"},{"style":{"height":19.2},"width":870.4,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-36.png","element":"img","alt":" δ0 = δ/2, k = K, n = t′0, ρ′ = −�−2t′0v2K log(δ/2)","inline":true},{"text":". Then the total number of observations ","element":"span"},{"text":"during ","element":"span"},{"style":{"height":13.96},"width":30.39,"height":34.89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-37.png","element":"img","alt":" t′0 ","inline":true,"padRight":true},{"text":"steps should be larger than ","element":"span"},{"style":{"height":13.96},"width":169.88,"height":34.89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-38.png","element":"img","alt":" t′0µK + ρ′","inline":true,"padRight":true},{"text":"with probability at least ","element":"span"},{"style":{"height":16},"width":127.35,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/31-39.png","element":"img","alt":" 1 − δ/2","inline":true},{"text":". And since the number of observations can","element":"span"}],[{"text":"be upper bounded, we consider","element":"span"}],[{"text":"Lastly, with Lemma ","element":"span"},{"href":"#id-42","text":"5.6","element":"a"},{"text":", ","element":"span"},{"href":"#id-50","text":"5.7 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-49","text":"5.8","element":"a"},{"text":", we obtain that with probability at least ","element":"span"},{"style":{"height":11.6},"width":87.63,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/32-0.png","element":"img","alt":" 1 − δ","inline":true},{"text":", Algorithm ","element":"span"},{"href":"#id-24","text":"1 ","element":"a"},{"text":"stops after at most","element":"span"}],[{"style":{"width":"28%"},"width":545,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/32-1.png","element":"img"}],[{"text":"steps.","element":"span"}]]},{"heading":"E. Additional numerical results","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"E.1. Order of pulled items","element":"span"}],[{"style":{"width":"87%"},"width":817,"height":2406,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/33-0.png","element":"img"}],[{"style":{"width":"89%"},"width":1750,"height":2441,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/34-0.png","element":"img"}],[{"style":{"width":"89%"},"width":1750,"height":2441,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/35-0.png","element":"img"}],[{"id":"id-74","style":{"width":"89%"},"width":1750,"height":2441,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/36-0.png","element":"img"}],[{"text":"Figure E.1: Average time complexity incurred by different sorting order of ","element":"figcaption","subtype":"caption"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/36-1.png","element":"img","alt":" St","inline":true},{"text":": ascending order of ","element":"figcaption","subtype":"caption"},{"style":{"height":16},"width":82.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/36-2.png","element":"img","alt":" Ti(t)","inline":true,"padRight":true},{"text":"(Algorithm ","element":"figcaption","subtype":"caption"},{"href":"#id-24","text":"1","element":"a","subtype":"caption"},{"text":"), ascending/descending order of ","element":"figcaption","subtype":"caption"},{"style":{"height":16},"width":294.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/36-3.png","element":"img","alt":" ˆµt(i)/Ut(i)/Lt(i)","inline":true,"padRight":true},{"text":"in the cascading bandits.","element":"figcaption","subtype":"caption"}],[{"text":"After a large amount of observations, it is likely that the empirical mean ","element":"span"},{"style":{"height":16},"width":87.79,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-0.png","element":"img","alt":" ˆwt(i)","inline":true,"padRight":true},{"text":"approaches the true weight ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":")","element":"span"},{"text":", and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"text":"lies between the confidence bounds ","element":"span"},{"style":{"height":16},"width":321.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-1.png","element":"img","alt":" Ut(i, δ) and Lt(i, δ)","inline":true,"padRight":true},{"text":"with high probability. Therefore, one may consider to sort ","element":"span"},{"style":{"height":13.19},"width":135.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-2.png","element":"img","alt":" St in the","inline":true,"padRight":true},{"text":"descending or ascending order of ","element":"span"},{"style":{"height":16},"width":457.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-3.png","element":"img","alt":" ˆwt(i)’s, Ut(i, δ)’s or Lt(i, δ)","inline":true},{"text":"’s (the difference to Algorithm ","element":"span"},{"href":"#id-24","text":"1 ","element":"a"},{"text":"reveals in Line 5–9). Diving into the numerical results, we found an algorithm always manages to find an ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-4.png","element":"img","alt":" ϵ","inline":true},{"text":"-optimal arm provided that it is not terminated by the limit of ","element":"span"},{"style":{"height":13.78},"width":55.85,"height":34.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-5.png","element":"img","alt":" 107 ","inline":true,"padRight":true},{"text":"steps. Hence, we focus on the comparison of averaged stopping time.","element":"span"}],[{"text":"In Figure ","element":"span"},{"href":"#id-74","text":"E.1","element":"a"},{"text":", we can see that sorting ","element":"span"},{"style":{"height":13.19},"width":36.44,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-6.png","element":"img","alt":" St","inline":true,"padRight":true},{"text":"in the ascending order of ","element":"span"},{"style":{"height":16},"width":83.26,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-7.png","element":"img","alt":" ˆµt(i)","inline":true,"padRight":true},{"text":"or ","element":"span"},{"style":{"height":16},"width":86.47,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-8.png","element":"img","alt":" Ut(i)","inline":true},{"text":", especially the latter one, incurs an apparently larger averaged stopping time than other methods in most cases. Next, the descending order of ","element":"span"},{"style":{"height":16},"width":83.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-9.png","element":"img","alt":" ˆµt(i)","inline":true,"padRight":true},{"text":"does not work well in some cases. Thirdly, the ascending order of ","element":"span"},{"style":{"height":16},"width":86.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-10.png","element":"img","alt":" Lt(i)","inline":true,"padRight":true},{"text":"performs almost the same as our algorithm in most cases but there are several cases where it performs much worse and does not terminate even after ","element":"span"},{"style":{"height":13.78},"width":55.85,"height":34.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-11.png","element":"img","alt":" 107 ","inline":true,"padRight":true},{"text":"iterations. Lastly, the descending order of ","element":"span"},{"style":{"height":16},"width":86.47,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-12.png","element":"img","alt":" Ut(i)","inline":true,"padRight":true},{"text":"works almost as well as Algorithm ","element":"span"},{"href":"#id-24","text":"1 ","element":"a"},{"text":"empirically but is in lack of theoretical guarantee on time complexity. Meanwhile, the standard deviation of the stopping time of our algorithm is negligible comparing to the average value. For instance, in the left-most case of Figure ","element":"span"},{"href":"#id-53","text":"6.1","element":"a"},{"text":", the standard deviation is about ","element":"span"},{"text":"22318","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"54 ","element":"span"},{"text":"when the average is about ","element":"span"},{"text":"754140","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"65","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"E.2. Further empirical evidence","element":"span"}],[{"id":"id-75","text":"Table E.1: Fitted results of upper bounds on the stopping time ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"T ","element":"figcaption","subtype":"caption"},{"text":"of Algorithm ","element":"figcaption","subtype":"caption"},{"href":"#id-24","text":"1 ","element":"a","subtype":"caption"},{"text":"with ","element":"figcaption","subtype":"caption"},{"style":{"height":10.8},"width":89.33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-13.png","element":"img","alt":" ϵ = 0","inline":true,"padRight":true},{"text":"(Proposition ","element":"figcaption","subtype":"caption"},{"href":"#id-39","text":"4.6","element":"a","subtype":"caption"},{"text":").","element":"figcaption","subtype":"caption"}],[{"style":{"width":"89%"},"width":1742,"height":387,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-14.png","element":"img"}],[{"text":"As shown in Table ","element":"span"},{"href":"#id-75","text":"E.1","element":"a"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value is the probability that we reject the assumption of our fitting model versus a constant model (","element":"span"},{"href":"#id-57","referenceIndex":12,"text":"Glantz et al.","element":"a"},{"href":"#id-57","referenceIndex":12,"text":", ","element":"a"},{"href":"#id-57","referenceIndex":12,"text":"1990","element":"a"},{"text":"). Hence, the small ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values indicates that our fitting models are reasonable. Next, all ","element":"span"},{"style":{"height":9.19},"width":33.25,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/37-15.png","element":"img","alt":" c1","inline":true},{"text":"’s are positive, implying all averaged stopping time grows with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"text":", which corroborates our theoretical results.","element":"span"}],[{"style":{"width":"90%"},"width":1759,"height":2313,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/38-0.png","element":"img"}],[{"text":"Figure E.2: Fit the averaged stopping time with functions of ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"K ","element":"figcaption","subtype":"caption"},{"text":"for each case in order. Fix ","element":"figcaption","subtype":"caption"},{"style":{"height":14},"width":392.81,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.08655/images/38-1.png","element":"img","alt":" L = 128, δ = 0.1, ϵ = 0","inline":true},{"text":". Blue dots are the averaged stopping time, red line is the fitted curve, and cyan dashed lines show the ","element":"figcaption","subtype":"caption"},{"text":"95% ","element":"figcaption","subtype":"caption"},{"text":"confidence interval.","element":"figcaption","subtype":"caption"}]]}],"_version":"3.3.4"},"paperNode":"$28:props:children:props:children:0:props:product"}]]