36:[["$","audio",null,{"id":"tts"}],["$","$L3b",null,{"paperID":"1711.02198","publisher":"arxiv","paperJSON":{"title":"Regret Bounds and Regimes of Optimality for User-User and Item-Item Collaborative Filtering","paperID":"1711.02198","avgLineHeight":16.32,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"We consider an online model for recommendation systems, with each user being recommended an item at each time-step and providing ‘like’ or ‘dislike’ feedback. Each user may be recommended a given item at most once. A latent variable model specifies the user preferences: both users and items are clustered into ","element":"span"},{"text":"types","element":"span"},{"text":"$3c","element":"span"}]]},{"heading":"1 Introduction","paragraphs":[[{"text":"Options are good, but if there are too many options, we need help. It is increasingly the case that our interaction with content is mediated by recommendation systems. There are two main approaches taken in recommendation systems: ","element":"span"},{"text":"content filtering ","element":"span"},{"text":"and ","element":"span"},{"text":"collaborative filtering","element":"span"},{"text":". Content filtering makes use of features associated with items and users (e.g., age, location, gender of users and genre, actors, director of movies). In contrast, collaborative filtering is based on observed user preferences. Thus, two users are thought of as similar if they have revealed similar preferences, irrespective of their profile. Likewise, two items are thought of as similar if most users have similar preferences for them. More generally, collaborative filtering (CF) makes use of structure in the matrix of preferences, as in low-rank matrix formulations ","element":"span"},{"text":"[1, ","element":"span"},{"href":"#id-0","referenceIndex":1,"text":"2, ","element":"a"},{"href":"#id-1","referenceIndex":2,"text":"3, ","element":"a"},{"href":"#id-2","referenceIndex":3,"text":"4, ","element":"a"},{"href":"#id-3","referenceIndex":5,"text":"5, ","element":"a"},{"href":"#id-3","referenceIndex":5,"text":"6, ","element":"a"},{"href":"#id-4","referenceIndex":7,"text":"7, ","element":"a"},{"href":"#id-4","referenceIndex":7,"text":"8]","element":"a"},{"text":". In this paper, since our model has no item and user features, all algorithms must do collaborative filtering.","element":"span"}],[{"text":"An important aspect of most recommendation systems is that each recommendation influences what is learned about the users and items, which in turn determines the possible accuracy of future recommendations. This introduces a tension between exploring to obtain information and exploiting existing knowledge to make good recommendations. The tension between exploring and exploiting is exactly the phenomenon of interest in the substantial literature on the multi-armed bandit (MAB) problem and its variants ","element":"span"},{"href":"#id-5","referenceIndex":9,"text":"[9, ","element":"a"},{"href":"#id-5","referenceIndex":9,"text":"10, ","element":"a"},{"href":"#id-6","referenceIndex":10,"text":"11]","element":"a"},{"text":". ","element":"span"},{"text":"In the multi-armed bandit setup, optimal algorithms necessarily converge to repeated play of the same arm; in contrast, a recommendation system that repeatedly recommends the same movie, even if it is a very good movie, is surely problematic! For this reason we will allow each item to be recommended at most once to each user (as done in ","element":"span"},{"href":"#id-7","referenceIndex":11,"text":"[12, ","element":"a"},{"href":"#id-8","referenceIndex":12,"text":"13]","element":"a"},{"text":").","element":"span"}],[{"text":"It is common to think of recommendation systems as a matrix completion problem. Given a subset of observed entries, the matrix completion problem is to estimate the rest of matrix, where it is assumed that the matrix satisfies some properties. This criterion does not capture the experience of users in a recommendation system: a more appropriate measure of performance is the proportion of good recommendations made by the algorithm.","element":"span"}],[{"text":"With the aforementioned issues in mind, we work within a mathematical framework for evaluating the performance of various recommendation system algorithms, related to the models studied in ","element":"span"},{"href":"#id-7","referenceIndex":11,"text":"[12, ","element":"a"},{"href":"#id-8","referenceIndex":12,"text":"13]","element":"a"},{"text":". The framework is detailed in Section ","element":"span"},{"text":"2, ","element":"span"},{"text":"but in brief, at each time-step each user in the system is given a recommendation and then provides binary feedback in the form of ’like’ or ’dislike’. The user preferences are described by a latent variable model in which each user is associated with a user type and each item is associated with an item type. Users who belong to the same user type have identical preferences for all items and items belonging to the same type have identical ratings from all users.","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/1-0.png","element":"img","alt":"1 ","inline":true,"padRight":true},{"text":"The basic measure of performance is ","element":"span"},{"text":"expected regret","element":"span"},{"text":", defined as the expected number of bad recommendations made per user over a time horizon of interest. A second performance criterion is the ","element":"span"},{"text":"cold start time","element":"span"},{"text":", the first time at which recommendations become nontrivial in quality. Our goal is to understand the dependence of these quantities on system parameters and we will therefore seek bounds accurate only to within constant or logarithmic factors.","element":"span"}],[{"text":"In the literature there are two categories of collaborative filtering (CF) algorithms. ","element":"span"},{"text":"Useruser ","element":"span"},{"text":"algorithms ","element":"span"},{"href":"#id-9","referenceIndex":16,"text":"[16, ","element":"a"},{"href":"#id-7","referenceIndex":11,"text":"12, ","element":"a"},{"href":"#id-10","referenceIndex":17,"text":"17] ","element":"a"},{"text":"use structure in the user space to predict user preferences. Here, the preference of user ","element":"span"},{"text":"u ","element":"span"},{"text":"for item ","element":"span"},{"text":"i ","element":"span"},{"text":"is estimated from the preference of other users ","element":"span"},{"style":{"height":8.4},"width":40.96,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/1-1.png","element":"img","alt":" u′ ","inline":true,"padRight":true},{"text":"believed to be similar to ","element":"span"},{"text":"u ","element":"span"},{"text":"based on their previous ratings. Alternatively, ","element":"span"},{"text":"item-item ","element":"span"},{"text":"algorithms ","element":"span"},{"href":"#id-8","referenceIndex":12,"text":"[13, ","element":"a"},{"href":"#id-11","referenceIndex":18,"text":"18, ","element":"a"},{"href":"#id-11","referenceIndex":18,"text":"19] ","element":"a"},{"text":"use structure in the item space. This time, the preference of user ","element":"span"},{"text":"u ","element":"span"},{"text":"for item ","element":"span"},{"text":"i ","element":"span"},{"text":"is estimated from the preference of the same user ","element":"span"},{"text":"u ","element":"span"},{"text":"for other items ","element":"span"},{"style":{"height":12.4},"width":30.88,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/1-2.png","element":"img","alt":" i′ ","inline":true,"padRight":true},{"text":"believed to be similar to ","element":"span"},{"text":"i ","element":"span"},{"text":"based on previous ratings from users that have rated both ","element":"span"},{"style":{"height":12.8},"width":144.16,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/2-0.png","element":"img","alt":" i and i′","inline":true},{"text":". In Sections ","element":"span"},{"text":"4 ","element":"span"},{"text":"and ","element":"span"},{"text":"6 ","element":"span"},{"text":"we develop versions of user-user and item-item CF algorithms tailored to our online recommendation system model and prove performance guarantees. In order to achieve good performance, these algorithms must carefully explore the ","element":"span"},{"text":"a priori ","element":"span"},{"text":"unknown relationships between users and items. One of the unexpected insights that emerge from the analysis is that the item-item algorithm must limit the exploration to only a subset of the items types, where the size of this subset depends on the system parameters and time-horizon. The straightforward approach to Item-Item CF algorithms is to learn the whole preference matrix, and as described in Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"this results in a qualitatively suboptimal cold-start time that can be arbitrarily worse than the one obtained by our algorithm.","element":"span"}],[{"text":"In order to focus on the information structure of the recommendation problem, and the associated exploration-exploitation tradeoff, the majority of the paper assumes that user feedback is noiseless. We generalize our user-user algorithm to handle noisy feedback and also describe how one would similarly accommodate noisy feedback in the item-item algorithm. In essence, estimation of similarity between users (or items) requires some redundancy in the information collected in order to average out the noise.","element":"span"}],[{"text":"We prove nearly tight lower bounds on regret for two parameter regimes of interest, identifying settings in which the proposed algorithms cannot be significantly improved. In the ","element":"span"},{"text":"user structure only ","element":"span"},{"text":"scenario, the model parameters are such that there is no structure in the item space. Analogously, in the ","element":"span"},{"text":"item structure only ","element":"span"},{"text":"scenario, the parameters are such that there is no structure in the user space. We prove information-theoretic lower bounds for the performance of any algorithm in the user-structure only and item-structure only models, which match to within a logarithmic factor the performance obtained by our proposed user-user and item-item CF algorithms. These results are outlined in Section ","element":"span"},{"text":"3.","element":"span"}],[{"text":"One of this paper’s main contributions is the development of techniques for proving lower bounds on the performance of online recommendation algorithms. Our lower bounds depend crucially on the inability to repeatedly recommend the same item to a given user, and for this reason are completely different from lower bounds for multi-armed bandit problems ","element":"span"},{"href":"#id-5","referenceIndex":9,"text":"[10, ","element":"a"},{"href":"#id-5","referenceIndex":9,"text":"9]","element":"a"},{"text":". At a high level, however, the basic challenge is the same as when proving lower bounds for bandits: one must connect the information obtained by the algorithm to the regret incurred. This allows to reason that subsequent recommendations will have low regret only if prior recommendations yielded significant information, which in turn necessitated exploratory recommendations with correspondingly substantial regret. Thus, regret is a conserved quantity and cannot be avoided by employing complicated adaptive algorithms.","element":"span"}],[{"text":"The methods used for the lower bounds are elementary in nature. For example, in the user structure only model, the arguments in Section ","element":"span"},{"text":"7 ","element":"span"},{"text":"are based on two observations. First, one cannot be confident in recommending any item to user ","element":"span"},{"text":"u ","element":"span"},{"text":"at time ","element":"span"},{"text":"t ","element":"span"},{"text":"if there is no user ","element":"span"},{"style":{"height":8.4},"width":40.96,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/2-1.png","element":"img","alt":" u′ ","inline":true,"padRight":true},{"text":"that has rated enough items in common, and in agreement, with user ","element":"span"},{"style":{"height":16.4},"width":324,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/2-2.png","element":"img","alt":" u by time t − 1.","inline":true,"padRight":true},{"text":"In this situation, the similarity of ","element":"span"},{"text":"u ","element":"span"},{"text":"to any other user is uncertain and so too is the outcome of any recommendation. Second, the outcome of recommending item ","element":"span"},{"text":"i ","element":"span"},{"text":"to user ","element":"span"},{"text":"u ","element":"span"},{"text":"at time ","element":"span"},{"text":"t ","element":"span"},{"text":"is also uncertain if none of the users that actually are similar to ","element":"span"},{"text":"u ","element":"span"},{"text":"have rated item ","element":"span"},{"style":{"height":16.4},"width":264.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/2-3.png","element":"img","alt":" i by time t −","inline":true,"padRight":true},{"text":"1. These observations imply a lower bound on the necessary number of exploratory recommendations before it is possible to recommend with much better likelihood of success than chance. Similar reasoning leads to lower bounds in Section ","element":"span"},{"href":"#id-12","text":"5 ","element":"a"},{"text":"for the model with only item-structure.","element":"span"}],[{"text":"A few papers including ","element":"span"},{"href":"#id-7","referenceIndex":11,"text":"[12, ","element":"a"},{"href":"#id-8","referenceIndex":12,"text":"13, ","element":"a"},{"href":"#id-13","referenceIndex":20,"text":"20] ","element":"a"},{"text":"have theoretical analyses for online collaborative filtering. The paper ","element":"span"},{"href":"#id-7","referenceIndex":11,"text":"[12] ","element":"a"},{"text":"analyzes a user-user CF algorithm in a similar setting to ours and ","element":"span"},{"href":"#id-8","referenceIndex":12,"text":"[13] ","element":"a"},{"text":"analyzes an item-item CF algorithm in a somewhat different and more flexible model. Relative to these, our main distinction is obtaining nearly matching lower bounds showing optimality of our algorithms and analysis. ","element":"span"},{"text":"The model studied by Dabeer and coauthors ","element":"span"},{"href":"#id-13","referenceIndex":20,"text":"[20, ","element":"a"},{"text":"1, ","element":"span"},{"href":"#id-14","referenceIndex":21,"text":"22] ","element":"a"},{"text":"is also quite similar to our setup, but their objective is different: they seek an algorithm that ","element":"span"},{"text":"exploits ","element":"span"},{"text":"in a provably optimal fashion asymptotically in time, but their approach does not reveal how to explore. In a different direction, Kerenidis and Prakash ","element":"span"},{"href":"#id-13","referenceIndex":20,"text":"[21] ","element":"a"},{"text":"seek to achieve low computational complexity for recommendation in a similar setup as ours. What they show is that reconstructing the preference matrix only partially, which is what our item-item CF algorithm does, is useful also with regards to computation.","element":"span"}],[{"text":"Hybrid algorithms exploiting both structure in user space and item space have been studied before in ","element":"span"},{"href":"#id-15","referenceIndex":22,"text":"[23, ","element":"a"},{"href":"#id-16","referenceIndex":24,"text":"24, ","element":"a"},{"href":"#id-17","referenceIndex":25,"text":"25]","element":"a"},{"text":". Both Song et al. ","element":"span"},{"href":"#id-15","referenceIndex":22,"text":"[23] ","element":"a"},{"text":"and Borgs et al. ","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"[26] ","element":"a"},{"text":"study a more flexible latent variable model in the offline (matrix completion style) setting and propose collaborative filtering algorithms using both item and user space. In a forthcoming paper we analyze a hybrid algorithm within the same framework studied here.","element":"span"}],[{"text":"1.1 ","element":"span"},{"text":"Outline","element":"span"}],[{"text":"The model and performance metric are described in Section ","element":"span"},{"text":"2. ","element":"span"},{"text":"Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"overviews the main results of this paper and includes numerical simulations to complement the theoretical analyses. Our version of user-user CF is introduced and analyzed in Section ","element":"span"},{"text":"4. ","element":"span"},{"text":"In Section ","element":"span"},{"href":"#id-12","text":"5 ","element":"a"},{"text":"we prove that the proposed algorithm is almost information-theoretically optimal in the setup with user structure only. Our version of item-item CF is described and analyzed in Section ","element":"span"},{"text":"6, ","element":"span"},{"text":"and the corresponding lower bound in the setting with item structure only is given in Section ","element":"span"},{"text":"7. ","element":"span"},{"text":"Appendix ","element":"span"},{"text":"A ","element":"span"},{"text":"contains a few basic probabilistic lemmas, and Appendix ","element":"span"},{"text":"B ","element":"span"},{"text":"relates so-called anytime regret (unknown time horizon) to known time horizon.","element":"span"}],[{"text":"1.2 ","element":"span"},{"text":"Notation","element":"span"}],[{"text":"For an integer ","element":"span"},{"text":"a ","element":"span"},{"text":"we write [","element":"span"},{"style":{"height":17.6},"width":286,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/3-0.png","element":"img","alt":"a] = {1, · · · , a}","inline":true,"padRight":true},{"text":"and for real-valued ","element":"span"},{"style":{"height":17.6},"width":430.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/3-1.png","element":"img","alt":" x let (x)+ = max{x, 0}","inline":true},{"text":". All logarithms are to the base of 2. The set of natural numbers (positive integers) is denoted by ","element":"span"},{"text":"N","element":"span"},{"text":". We note here that variables or parameters in Figure ","element":"span"},{"href":"#id-19","text":"1 ","element":"a"},{"text":"have the same meaning throughout the paper, but any others may take different values in each section. For real-valued ","element":"span"},{"style":{"height":17.6},"width":114.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/3-2.png","element":"img","alt":" x, ⌊x⌋","inline":true,"padRight":true},{"text":"denotes the greatest integer less than or equal to ","element":"span"},{"style":{"height":17.6},"width":188,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/3-3.png","element":"img","alt":" x and ⌈x⌉","inline":true,"padRight":true},{"text":"denotes the smallest integer greater than or equal to ","element":"span"},{"text":"x","element":"span"},{"text":". Numerical constants (","element":"span"},{"style":{"height":11.2},"width":131.24,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/3-4.png","element":"img","alt":"c, c1, c2","inline":true,"padRight":true},{"text":"and so forth) may take different values in different theorem statements unless explicitly stated otherwise.","element":"span"}]]},{"heading":"2 Model","paragraphs":[[{"text":"2.1 ","element":"span"},{"text":"Problem setup","element":"span"}],[{"text":"There is a fixed set of users ","element":"span"},{"text":"{","element":"span"},{"text":"1","element":"span"},{"text":", . . . , N","element":"span"},{"text":"}","element":"span"},{"text":". At each time ","element":"span"},{"text":"t ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", ","element":"span"},{"text":"3","element":"span"},{"text":", . . . ","element":"span"},{"text":"the algorithm recommends an item ","element":"span"},{"style":{"height":17.09},"width":156.32,"height":42.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-0.png","element":"img","alt":" au,t ∈ N","inline":true,"padRight":true},{"text":"to each user ","element":"span"},{"text":"u ","element":"span"},{"text":"and receives feedback ","element":"span"},{"style":{"height":19.42},"width":353.2,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-1.png","element":"img","alt":" Lu,au,s ∈ {+1, −1}","inline":true,"padRight":true},{"text":"(‘like’ or ‘dislike’). For the reasons stated in the introduction, we impose the condition that each item may be recommended at most once to each user. In order that the algorithm never run out of items to recommend, we suppose there are infinitely many items to draw from and identify them with the natural numbers.","element":"span"}],[{"text":"The history ","element":"span"},{"style":{"height":19.42},"width":739.6,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-2.png","element":"img","alt":" Ht = {au,s, Lu,au,s, for u ∈ [N], s ∈ [t]}","inline":true,"padRight":true},{"text":"is the collection of actions and feedback up to time ","element":"span"},{"text":"t","element":"span"},{"text":". We are interested in online learning algorithms, in which the action ","element":"span"},{"style":{"height":13.09},"width":64.32,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-3.png","element":"img","alt":" au,t","inline":true,"padRight":true},{"text":"is a (possibly random) function of the history up through the end of the previous time-step ","element":"span"},{"style":{"height":14.69},"width":92.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-4.png","element":"img","alt":" Ht−1","inline":true},{"text":". This additional randomness is encoded in a random variable ","element":"span"},{"style":{"height":17.49},"width":60.48,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-5.png","element":"img","alt":" ζu,t","inline":true},{"text":", assumed to be independent of all other variables. In this way, ","element":"span"},{"style":{"height":18.29},"width":380.16,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-6.png","element":"img","alt":" au,t = fu,t(Ht−1, ζu,t","inline":true},{"text":"), for some deterministic function ","element":"span"},{"style":{"height":17.49},"width":76.8,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-7.png","element":"img","alt":" fu,t.","inline":true}],[{"text":"Algorithm performance will be evaluated after some arbitrary number of time-steps ","element":"span"},{"text":"T","element":"span"},{"text":". The performance metric we use is expected regret (simply called regret in what follows), defined as the expected number of disliked items recommended per user:","element":"span"}],[{"style":{"width":"71%"},"width":1330,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-8.png","element":"img"}],[{"text":"Here the expectation is with respect to the randomness in both the model and the algorithm. The algorithms we describe depend on knowing the time-horizon ","element":"span"},{"text":"T","element":"span"},{"text":", but by a standard doubling trick (explained in Appendix ","element":"span"},{"text":"B) ","element":"span"},{"text":"it is possible to convert these to algorithms achieving the same (up to constant factors) regret without this knowledge (see, e.g., ","element":"span"},{"href":"#id-20","referenceIndex":27,"text":"[27]","element":"a"},{"text":"). This latter notion of regret, where the algorithm does not know the time-horizon of interest and must achieve good performance across all time-scales, is called ","element":"span"},{"text":"anytime regret ","element":"span"},{"text":"in the literature.","element":"span"}],[{"text":"The time at which point recommendations become nontrivial in quality is another important performance criterion, because until that point users invest effort but get little in return. ","element":"span"},{"text":"In the recommendation systems literature the notion of cold start describes the difficulty of providing useful recommendations when insufficient information is available about user preferences. We define the ","element":"span"},{"text":"cold start time ","element":"span"},{"text":"to be the first time at which the slope of regret as a function of ","element":"span"},{"text":"T ","element":"span"},{"text":"is bounded by some value ","element":"span"},{"style":{"height":11.6},"width":36.96,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-9.png","element":"img","alt":" γ:","inline":true}],[{"style":{"width":"42%"},"width":791,"height":87,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-10.png","element":"img"}],[{"id":"id-21","text":"This is similar to (but somewhat simpler than) the definition in ","element":"span"},{"href":"#id-8","referenceIndex":12,"text":"[13]","element":"a"},{"text":".","element":"span"}],[{"text":"2.2 ","element":"span"},{"text":"User preferences","element":"span"}],[{"text":"We study a latent-variable model for the preferences (‘like’ or ‘dislike’) of the users for the items, based on the idea that there are relatively few ","element":"span"},{"text":"types of users ","element":"span"},{"text":"and/or few ","element":"span"},{"text":"types of items","element":"span"},{"text":". Each user ","element":"span"},{"style":{"height":17.6},"width":135.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-11.png","element":"img","alt":" u ∈ [N","inline":true},{"text":"] has a user type ","element":"span"},{"style":{"height":17.6},"width":88.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-12.png","element":"img","alt":" τU(u","inline":true},{"text":") i.i.d. uniform on [","element":"span"},{"style":{"height":18},"width":268.88,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/4-13.png","element":"img","alt":"qU], where qU","inline":true,"padRight":true},{"text":"is the number of user types.","element":"span"}],[{"style":{"width":"51%"},"width":970,"height":546,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-0.png","element":"img"}],[{"id":"id-19","text":"Figure 1: Notation for the recommendation system model.","element":"figcaption","subtype":"caption"}],[{"text":"We assume that ","element":"span"},{"style":{"height":16.8},"width":154.68,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-1.png","element":"img","alt":" qU ≤ N","inline":true},{"text":", because if ","element":"span"},{"style":{"height":16.8},"width":154.68,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-2.png","element":"img","alt":" qU > N","inline":true,"padRight":true},{"text":"then most users have their own type and all of the results remain unchanged upon replacing ","element":"span"},{"style":{"height":17.01},"width":166.2,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-3.png","element":"img","alt":" qU by N","inline":true},{"text":". Similarly, each item ","element":"span"},{"style":{"height":12.8},"width":102.08,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-4.png","element":"img","alt":" i ∈ N","inline":true,"padRight":true},{"text":"has a random item type ","element":"span"},{"style":{"height":17.6},"width":70.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-5.png","element":"img","alt":" τI(i","inline":true},{"text":") i.i.d. uniform on [","element":"span"},{"style":{"height":18},"width":255.08,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-6.png","element":"img","alt":"qI], where qI","inline":true,"padRight":true},{"text":"is the number of item types","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-7.png","element":"img","alt":"2","inline":true},{"text":". The random variables ","element":"span"},{"style":{"height":18.29},"width":556.32,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-8.png","element":"img","alt":"{τU(u)}1≤u≤N and {τI(i)}1≤i","inline":true,"padRight":true},{"text":"are assumed to be jointly independent.","element":"span"}],[{"text":"All users of a given type have identical preferences for all the items, and similarly all items of a given type are rated in the same way by any particular user. The entire collection of user preferences (","element":"span"},{"style":{"height":18.29},"width":130.56,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-9.png","element":"img","alt":"Lu,i)u,i","inline":true,"padRight":true},{"text":"is therefore encoded into a much smaller ","element":"span"},{"style":{"height":18.48},"width":897.6,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-10.png","element":"img","alt":" preference matrix Ξ = (ξk,j) ∈ {−1, +1}qU×qI,","inline":true,"padRight":true},{"text":"which specifies the preference of each user type for each item type. The preference ","element":"span"},{"style":{"height":17.49},"width":221.96,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-11.png","element":"img","alt":" Lu,i of user","inline":true},{"style":{"height":17.6},"width":439.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-12.png","element":"img","alt":"u ∈ [N] for item i ∈ N","inline":true,"padRight":true},{"text":"is the preference ","element":"span"},{"style":{"height":19.25},"width":183.4,"height":48.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-13.png","element":"img","alt":" ξτU(u),τI(i)","inline":true,"padRight":true},{"text":"of the associated user type ","element":"span"},{"style":{"height":17.6},"width":88.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-14.png","element":"img","alt":" τU(u","inline":true},{"text":") for the item type ","element":"span"},{"style":{"height":17.6},"width":69.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-15.png","element":"img","alt":" τI(i","inline":true},{"text":") in the matrix Ξ, ","element":"span"},{"text":"i.e.","element":"span"},{"text":",","element":"span"}],[{"style":{"width":"17%"},"width":331,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-16.png","element":"img"}],[{"text":"We assume that the entries of Ξ are i.i.d., ","element":"span"},{"style":{"height":18.48},"width":774.64,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-17.png","element":"img","alt":" ξk,j = +1 w.p. 1/2 and ξk,j = −1 w.p. 1/","inline":true},{"text":"2. Generalizing our results to i.i.d. entries with bias ","element":"span"},{"text":"p ","element":"span"},{"text":"is straightforward. However, the independence assumption is quite strong and an important future research direction is to obtain results for more realistic preference matrices. We also consider a noisy model with ","element":"span"},{"style":{"height":19.25},"width":603.84,"height":48.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-18.png","element":"img","alt":" Lu,i = ξτU(u),τI(i) · zu,i where zu,i","inline":true,"padRight":true},{"text":"are i.i.d. random variables with ","element":"span"},{"style":{"height":18.29},"width":808.32,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-19.png","element":"img","alt":" P[zu,i = +1] = 1 − γ and P[zu,i = −1] = γ.","inline":true}],[{"text":"2.3 ","element":"span"},{"text":"Two regimes of interest","element":"span"}],[{"text":"Two specific parameter regimes play a central role in this paper, capturing settings with structure only in user space or only in item space. As described in Section ","element":"span"},{"text":"3, ","element":"span"},{"text":"each of user-user or item-item CF is almost optimal in the corresponding regime.","element":"span"}],[{"id":"id-28","text":"Definition 2.1 ","element":"span"},{"text":"(User structure only (","element":"span"},{"style":{"height":17.81},"width":711.44,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-20.png","element":"img","alt":"qI = 2qU )). The user structure model","inline":true,"padRight":true},{"text":"refers to the case that there is no structure in the item space. To simplify matters, we assume that the preference matrix Ξ ","element":"span"},{"style":{"height":19.74},"width":340.24,"height":49.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-21.png","element":"img","alt":" ∈ {−1, +1}qU×2qU ","inline":true,"padRight":true},{"text":"is deterministic and has columns consisting of all sequences in ","element":"span"},{"style":{"height":17.6},"width":230.4,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-22.png","element":"img","alt":" {−1, +1}qU .","inline":true,"padRight":true},{"text":"Essentially the same preference matrix would arise (with high probability) if ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/5-23.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"is much larger than","element":"span"}],[{"text":"2","element":"span"},{"style":{"height":9.25},"width":39.28,"height":23.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-0.png","element":"img","alt":"qU ","inline":true,"padRight":true},{"text":"(when the entries are i.i.d. as specified above in Subsection ","element":"span"},{"href":"#id-21","text":"2.2)","element":"a"},{"text":".","element":"span"}],[{"id":"id-31","text":"Definition 2.2 ","element":"span"},{"text":"(Item structure only (","element":"span"},{"style":{"height":17.81},"width":741.21,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-1.png","element":"img","alt":"qU = N)). The item structure model","inline":true,"padRight":true},{"text":"refers to the case that there is no structure in the user space. This happens when ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-2.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"is much larger than ","element":"span"},{"text":"N","element":"span"},{"text":", since then most user types have no more than one user. For the purpose of proving near-optimality of item-item CF, it suffices to take ","element":"span"},{"style":{"height":16.61},"width":148.44,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-3.png","element":"img","alt":" qU = N","inline":true,"padRight":true},{"text":"(and we do so).","element":"span"}]]},{"heading":"3 Main results","paragraphs":[[{"text":"We will analyze a version of each of user-user and item-item CF within the general setup described in Section ","element":"span"},{"text":"2. ","element":"span"},{"text":"The resulting regret bounds appear in Theorems ","element":"span"},{"href":"#id-22","text":"4.1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-23","text":"6.1. ","element":"a"},{"text":"These theorems are complemented by information theoretic-lower bounds, Theorems ","element":"span"},{"href":"#id-24","text":"5.1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-25","text":"7.1, ","element":"a"},{"text":"showing that no other algorithm can achieve much better regret (up to multiplicative logarithmic factors) in the specific extreme parameter regimes with user-structure only and item-structure only. ","element":"span"},{"text":"The simplified versions of these theorems appear in this section. Towards the end of this section we present simulation results supporting the theorems.","element":"span"}],[{"text":"3.1 ","element":"span"},{"text":"User-user collaborative filtering","element":"span"}],[{"text":"User-user CF exploits structure in the user space: the basic idea is to recommend items to a user that are liked by similar users. ","element":"span"},{"text":"We analyze an instance of user-user CF described in detail in Section ","element":"span"},{"href":"#id-26","text":"4.1, ","element":"a"},{"text":"obtaining the regret bound given in Theorem ","element":"span"},{"href":"#id-22","text":"4.1 ","element":"a"},{"text":"below. Essentially, the algorithm clusters users according to type by recommending random items for an initial phase, and then uses this knowledge to efficiently explore the preferences of each user type (as opposed to each user individually). The subsequent savings is due to the fact that the cost of exploration can be shared amongst users of the same type.","element":"span"}],[{"text":"The random recommendations made during the initial phase incur regret with slope 1","element":"span"},{"text":"/","element":"span"},{"text":"2, because a random recommendation is disliked with probability half. Afterward, the users are clustered according to type. Recommending an item to ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-4.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"users, one from each type, gives us the preferences of all ","element":"span"},{"text":"N ","element":"span"},{"text":"users for the item, and each such recommendation is disliked with probability 1","element":"span"},{"text":"/","element":"span"},{"text":"2. This results in a slope of ","element":"span"},{"style":{"height":17.81},"width":133.08,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-5.png","element":"img","alt":" qU/2N","inline":true,"padRight":true},{"text":"for regret in the second phase of the algorithm.","element":"span"}],[{"text":"Theorem ","element":"span"},{"href":"#id-22","text":"4.1 ","element":"a"},{"text":"(Regret upper bound in user-user CF, simplified version)","element":"span"},{"text":". ","element":"span"},{"text":"Consider the recommendation system model described in Section ","element":"span"},{"text":"2 ","element":"span"},{"text":"with ","element":"span"},{"style":{"height":16.8},"width":232.88,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-6.png","element":"img","alt":" N users, qU","inline":true,"padRight":true},{"text":"user types, and ","element":"span"},{"style":{"height":17.2},"width":382.16,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-7.png","element":"img","alt":" qI > 126 log N item","inline":true,"padRight":true},{"text":"types. There exists numerical constants ","element":"span"},{"text":"c, C ","element":"span"},{"text":"so that Algorithm ","element":"span"},{"href":"#id-27","text":"1 ","element":"a"},{"text":"achieves regret","element":"span"}],[{"style":{"width":"56%"},"width":1058,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-8.png","element":"img"}],[{"text":"The cold-start time, the time until the slope of the regret drops below ","element":"span"},{"style":{"height":11.6},"width":24,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-9.png","element":"img","alt":" γ","inline":true},{"text":", is evidently Θ(log ","element":"span"},{"text":"N","element":"span"},{"text":") for any ","element":"span"},{"style":{"height":23.97},"width":213.8,"height":59.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/6-10.png","element":"img","alt":" γ ∈ (CqUN , 12","inline":true},{"text":"). It follows from the next theorem that if there is no structure in the item space ","element":"span"},{"text":"and the number of user types is ","element":"span"},{"style":{"height":17.01},"width":540.4,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-0.png","element":"img","alt":" qU = N α for fixed 0 < α <","inline":true,"padRight":true},{"text":"1, then the user-user CF algorithm achieves both regret and cold start time that are optimal up to multiplicative constants.","element":"span"}],[{"text":"Theorem ","element":"span"},{"href":"#id-24","text":"5.1 ","element":"a"},{"text":"(Regret lower bound with user structure only, simplified version)","element":"span"},{"text":". ","element":"span"},{"text":"There exist a numerical constant ","element":"span"},{"text":"c ","element":"span"},{"text":"such that in the user structure model (Defn ","element":"span"},{"href":"#id-28","text":"2.1) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":19.34},"width":402.64,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-1.png","element":"img","alt":" qU > (log N)1.1 user","inline":true,"padRight":true},{"text":"types and ","element":"span"},{"style":{"height":14.69},"width":149.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-2.png","element":"img","alt":" N > N0","inline":true,"padRight":true},{"text":"users, any recommendation algorithm must incur regret","element":"span"}],[{"style":{"width":"49%"},"width":926,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-3.png","element":"img"}],[{"text":"The reasoning for the first part of the lower bound is as follows. If a user has been recommended fewer than log ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-4.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"items, then its similarity with respect to other users cannot be determined. This implies that any recommendation made to this user has uncertain outcome. The second part of the lower bound is obtained by showing that when an item is recommended for the first time to a user from a given user type the outcome of that recommendation is uncertain, and lower bounding the number of such recommendations. This is where we use the condition that each item is recommended at most once to each user.","element":"span"}],[{"text":"The lower bound shows that the poor initial performance of user-user CF, as bad as simply recommending random items, is unavoidable in the setting with only user structure and that its duration depends on the number of user types. In ","element":"span"},{"href":"#id-8","referenceIndex":12,"text":"[13] ","element":"a"},{"text":"it was shown that a version of item-item CF obtains much smaller cold start time than user-user CF in a model with item structure only. Our results on item-item CF, described next, corroborate this.","element":"span"}],[{"text":"3.2 ","element":"span"},{"text":"Item-item collaborative filtering","element":"span"}],[{"text":"Item-item CF exploits structure in the item space: users are recommended items similar to those they have liked. We analyze an instance of item-item CF in Section ","element":"span"},{"href":"#id-29","text":"6.1, ","element":"a"},{"text":"obtaining the regret bound given in Theorem ","element":"span"},{"href":"#id-23","text":"6.1 ","element":"a"},{"text":"below. The algorithm creates several clusters of items, as well as a set of unclustered items. Similarity of two items is estimated by having random users rate both items. Users then explore a single item from each cluster and liked clusters are subsequently recommended. The effort of clustering is shared amongst all the users, and the savings is due to liked explorations yielding an entire cluster of items to recommend.","element":"span"}],[{"text":"Crucially, this version of item-item CF has the feature that only a ","element":"span"},{"text":"subset ","element":"span"},{"text":"of the item space is explored (","element":"span"},{"text":"i.e.","element":"span"},{"text":", only a subset of the item types are clustered, with the others cast aside)","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/7-5.png","element":"img","alt":"3","inline":true},{"text":". To the best of our knowledge, the benefit of limiting the scope of item exploration has not been made explicit before; this only became evident to us in seeking to match the lower bound. The total number of items compared and the number of clusters are chosen depending on the system parameters to give the best regret bound.","element":"span"}],[{"text":"Theorem ","element":"span"},{"href":"#id-23","text":"6.1 ","element":"a"},{"text":"(Regret upper bound in item-item CF, simplified version)","element":"span"},{"text":". ","element":"span"},{"text":"Consider the recommendation system model described in Section ","element":"span"},{"text":"2 ","element":"span"},{"text":"with ","element":"span"},{"style":{"height":17.2},"width":534.84,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-0.png","element":"img","alt":" N > 5 users, qI > 13 log N","inline":true,"padRight":true},{"text":"item types, and ","element":"span"},{"style":{"height":17.81},"width":332.36,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-1.png","element":"img","alt":"qU > 16 log(NqI)","inline":true,"padRight":true},{"text":"user types. There are numerical constants ","element":"span"},{"style":{"height":15.6},"width":281,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-2.png","element":"img","alt":" C, c1, c2 and c3","inline":true,"padRight":true},{"text":"such that Algorithm ","element":"span"},{"href":"#id-30","text":"3 ","element":"a"},{"text":"obtains regret per user at time ","element":"span"},{"text":"T ","element":"span"},{"text":"upper bounded as","element":"span"}],[{"style":{"width":"90%"},"width":1695,"height":267,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-3.png","element":"img"}],[{"text":"If there is no structure in the user space and the number of item types is ","element":"span"},{"style":{"height":20.14},"width":349.92,"height":50.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-4.png","element":"img","alt":" qI = N β for fixed","inline":true},{"style":{"height":16.4},"width":72.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-5.png","element":"img","alt":"β >","inline":true,"padRight":true},{"text":"0, then the item-item CF algorithm is optimal up to a logarithmic factor.","element":"span"}],[{"text":"Theorem ","element":"span"},{"href":"#id-25","text":"7.1 ","element":"a"},{"text":"(Regret lower bound for item structure only, simplified version)","element":"span"},{"text":". ","element":"span"},{"text":"In the item-structure model (Defn. ","element":"span"},{"href":"#id-31","text":"2.2) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":19.53},"width":305.48,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-6.png","element":"img","alt":" qI > 25 (log N)5 ","inline":true,"padRight":true},{"text":"item types and ","element":"span"},{"text":"N > ","element":"span"},{"text":"32 ","element":"span"},{"text":"users, there exist numerical constants ","element":"span"},{"style":{"height":15.6},"width":351.56,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-7.png","element":"img","alt":" C, c1, c2, c3, and c4","inline":true,"padRight":true},{"text":"such that any recommendation algorithm must incur regret","element":"span"}],[{"style":{"width":"67%"},"width":1266,"height":419,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-8.png","element":"img"}],[{"text":"It follows that the cold start time ","element":"span"},{"style":{"height":17.6},"width":449.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-9.png","element":"img","alt":" coldstart(γ) with γ = C","inline":true,"padRight":true},{"text":"in the item-structure only regime is lower bounded as ","element":"span"},{"style":{"height":19.63},"width":185.4,"height":49.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-10.png","element":"img","alt":"�Ω(√qI/N","inline":true},{"text":"), while the upper bound based on our proposed algorithm is ","element":"span"},{"style":{"height":17.81},"width":181.44,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-11.png","element":"img","alt":"�O(qI/N).","inline":true,"padRight":true},{"text":"Note that the cold start time with ","element":"span"},{"style":{"height":18},"width":473.4,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-12.png","element":"img","alt":" γ = 1/ log qI is �Θ(qI/N","inline":true},{"text":"). The gap in the upper and lower bounds on cold start time is a consequence of the existence of the second regime in the lower bound given above, and appears to be an artifact of our proof.","element":"span"}],[{"text":"The proof of the lower bound is based on two main observations. First, if an item has been recommended to fewer than log ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/8-13.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"users, then its similarity with respect to other items cannot be determined; this implies that recommending this item to any user has uncertain outcome. Second, when a user is recommended an item from a given item type for the first time, the outcome of that recommendation is uncertain since this reveals a new variable in the preference matrix. Lower bounding the number of such uncertain recommendations gives the lower bound for regret.","element":"span"}],[{"text":"If there is structure in the item space it is possible to avoid the long cold-start time of algorithms using only user structure: even for a very short time horizon, they can guarantee nontrivial bounds on regret. In particular, the near-optimal algorithm proposed here suffers from a constant value of regret for an initial period. Note that as ","element":"span"},{"text":"N ","element":"span"},{"text":"increases, the regret upper bound (given in Theorem ","element":"span"},{"href":"#id-23","text":"6.1) ","element":"a"},{"text":"in the initial phase (constant ","element":"span"},{"style":{"height":10.69},"width":35.72,"height":26.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-0.png","element":"img","alt":" c1","inline":true},{"text":") does not change, but the length of the initial phase increases. Thus increasing ","element":"span"},{"text":"N ","element":"span"},{"text":"makes it easier to make meaningful recommendations. The same phenomenon is true more generally: the upper bound on regret at any time ","element":"span"},{"text":"T ","element":"span"},{"text":"is a decreasing function of ","element":"span"},{"text":"N","element":"span"},{"text":".","element":"span"}],[{"text":"3.3 ","element":"span"},{"text":"Numerical Simulations","element":"span"}],[{"text":"We simulated our versions of ","element":"span"},{"text":"User-User ","element":"span"},{"text":"and ","element":"span"},{"text":"Item-Item ","element":"span"},{"text":"Algorithms (As described in Sections ","element":"span"},{"text":"4 ","element":"span"},{"text":"and ","element":"span"},{"text":"6)","element":"span"},{"text":". In Figure ","element":"span"},{"href":"#id-32","text":"2, ","element":"a"},{"text":"we plot the regret as a function of time for the ","element":"span"},{"text":"User-User ","element":"span"},{"text":"Algorithm (Alg. ","element":"span"},{"href":"#id-27","text":"1 ","element":"a"},{"text":"in Section ","element":"span"},{"text":"4)","element":"span"},{"text":". We observe that the slope of regret in the asymptotic regime increases by increasing ","element":"span"},{"style":{"height":17.01},"width":285.24,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-1.png","element":"img","alt":"qU for fixed N","inline":true},{"text":". We also observe that increasing ","element":"span"},{"text":"N ","element":"span"},{"text":"decreases the asymptotic slope but does not decrease the cold start time of the algorithm.","element":"span"}],[{"style":{"width":"97%"},"width":1828,"height":719,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-2.png","element":"img"}],[{"text":"Figure 2: Simulated performance for Algorithm ","element":"figcaption","subtype":"caption"},{"text":"User-User","element":"figcaption","subtype":"caption"},{"text":". System parameters are (a) ","element":"figcaption","subtype":"caption"},{"text":"N ","element":"figcaption","subtype":"caption"},{"text":"= 400 and ","element":"figcaption","subtype":"caption"},{"id":"id-32","style":{"height":18},"width":767.04,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-3.png","element":"img","alt":" qI = 100 and (b) qU = 80 and qI = 100.","inline":true}],[{"text":"In Figure ","element":"span"},{"href":"#id-33","text":"3, ","element":"a"},{"text":"we plot the regret as a function of time for the ","element":"span"},{"text":"Item-Item ","element":"span"},{"text":"Algorithm (Alg. ","element":"span"},{"href":"#id-30","text":"3 ","element":"a"},{"text":"in Section ","element":"span"},{"text":"6)","element":"span"},{"text":". We observe that with fixed ","element":"span"},{"text":"N","element":"span"},{"text":", increasing ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-4.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"increases the cold-start time. But with fixed ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-5.png","element":"img","alt":" qI","inline":true},{"text":", the cold-start time shrinks linearly in ","element":"span"},{"text":"N","element":"span"},{"text":". We also observe that the slope of regret after the cold start time increases with increasing ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/9-6.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"and decreasing ","element":"span"},{"text":"N","element":"span"},{"text":", consistent with the statement of Theorem ","element":"span"},{"href":"#id-23","text":"6.1.","element":"a"}]]},{"heading":"4 User-user algorithm and analysis","paragraphs":[[{"text":"In this section, we describe a version of user-user CF and then analyze it within the latent variable model introduced in Section ","element":"span"},{"text":"2.","element":"span"}],[{"style":{"width":"97%"},"width":1828,"height":711,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-0.png","element":"img"}],[{"text":"Figure 3: Simulated performance for Algorithm ","element":"figcaption","subtype":"caption"},{"text":"Item-Item","element":"figcaption","subtype":"caption"},{"text":". System parameters are (a) ","element":"figcaption","subtype":"caption"},{"text":"N ","element":"figcaption","subtype":"caption"},{"text":"= 600 and ","element":"figcaption","subtype":"caption"},{"id":"id-33","style":{"height":18},"width":776.16,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-1.png","element":"img","alt":" qU = 100 and (b) qI = 60 and qU = 100.","inline":true}],[{"id":"id-26","text":"4.1 ","element":"span"},{"text":"Algorithm","element":"span"}],[{"text":"Pseudocode for algorithm ","element":"span"},{"text":"User-User ","element":"span"},{"text":"appears as Algorithm ","element":"span"},{"href":"#id-27","text":"1. ","element":"a"},{"text":"In Step 1, random items are recommended to all of the users. ","element":"span"},{"text":"The ratings of these items are used to construct a partition ","element":"span"},{"style":{"height":17.6},"width":112.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-2.png","element":"img","alt":"{Pk}k","inline":true,"padRight":true},{"text":"of users that recovers the user types correctly with high probability. In Step 2, users are recommended new random items (","element":"span"},{"text":"exploration","element":"span"},{"text":") until an item is liked. If the user is in group ","element":"span"},{"style":{"height":15.28},"width":103.36,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-3.png","element":"img","alt":" Pk of","inline":true,"padRight":true},{"text":"the partition, the item is added to a set ","element":"span"},{"style":{"height":15.28},"width":44.4,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-4.png","element":"img","alt":" Sk","inline":true,"padRight":true},{"text":"of items to be recommended to all other users in the same partition (","element":"span"},{"text":"exploitation","element":"span"},{"text":"). Step 2 (find and recommend items) is repeated indefinitely.","element":"span"}],[{"id":"id-35","text":"Remark 4.1. ","element":"span"},{"text":"Our model assumes that users of the same type have identical ratings. ","element":"span"},{"text":"Hence, users of the same type are always in the same group after partitioning. However, due to random sampling of the items in exploration, users from different types can have identical ratings for the items recommended in Step 1, in which case they will end up in the same partition. It follows that the total number of groups in the user partition is at most ","element":"span"},{"style":{"height":12.21},"width":64.36,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-5.png","element":"img","alt":" qU.","inline":true}],[{"text":"We make a few additional remarks regarding the algorithm:","element":"span"}],[{"text":"• ","element":"span"},{"text":"The labeling of user groups in the partitioning step is arbitrary (and may be different from the similarly arbitrary labeling of user types).","element":"span"}],[{"text":"• ","element":"span"},{"text":"In Step 2, the sets of items ","element":"span"},{"style":{"height":17.6},"width":87.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-6.png","element":"img","alt":" {Sk}","inline":true,"padRight":true},{"text":"at each time contain the items exploitable by users in the ","element":"span"},{"text":"k","element":"span"},{"text":"-th group in the partition. The algorithm predicts that all users in the ","element":"span"},{"text":"k","element":"span"},{"text":"-th group like items in ","element":"span"},{"style":{"height":15.28},"width":56.16,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-7.png","element":"img","alt":" Sk.","inline":true}],[{"text":"• ","element":"span"},{"text":"The algorithm takes ","element":"span"},{"style":{"height":17.2},"width":272.76,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-8.png","element":"img","alt":" T, qU, and N","inline":true,"padRight":true},{"text":"as input. As mentioned in Section ","element":"span"},{"text":"2, ","element":"span"},{"text":"a doubling trick described in Appendix ","element":"span"},{"text":"B ","element":"span"},{"text":"converts the algorithm to one oblivious to ","element":"span"},{"text":"T","element":"span"},{"text":". ","element":"span"},{"text":"It is also fairly straightforward to modify the algorithm to be adaptive to ","element":"span"},{"style":{"height":12.4},"width":48.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/10-9.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"The adaptive algorithm initializes with a trivial partition placing all users in one group. The algorithm subsequently refines","element":"span"}],[{"id":"id-27","style":{"width":"100%"},"width":1873,"height":1127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-0.png","element":"img"}],[{"text":"the partition whenever a user’s feedback indicates that they have been grouped incorrectly. We chose not to do so since it complicates the analysis.","element":"span"}],[{"id":"id-22","text":"Theorem 4.1. ","element":"span"},{"text":"Consider the model introduced in Section ","element":"span"},{"text":"2 ","element":"span"},{"text":"with ","element":"span"},{"style":{"height":16.8},"width":238.16,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-1.png","element":"img","alt":" N users, qU","inline":true,"padRight":true},{"text":"user types and ","element":"span"},{"style":{"height":12.4},"width":40.04,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-2.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"item types. Let ","element":"span"},{"style":{"height":19.92},"width":920.76,"height":49.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-3.png","element":"img","alt":" r = ⌈2 log(Nq2U)⌉. If qI > 18r, then User-User","inline":true,"padRight":true},{"text":"achieves regret","element":"span"}],[{"style":{"width":"0%"},"width":8,"height":4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-4.png","element":"img"}],[{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-5.png","element":"img","alt":"1","inline":true},{"style":{"height":18.54},"width":564.12,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-6.png","element":"img","alt":"2T , if T ≤ r","inline":true}],[{"style":{"width":"15%"},"width":282,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-7.png","element":"img"}],[{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-8.png","element":"img","alt":"1","inline":true},{"style":{"height":23.57},"width":584.16,"height":58.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-9.png","element":"img","alt":"2r + 2qU+2N T + 2 , if T > r .","inline":true}],[{"text":"The assumption ","element":"span"},{"style":{"height":16.21},"width":158.52,"height":40.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-10.png","element":"img","alt":" qI > 18r","inline":true,"padRight":true},{"text":"ensures that with probability 1 ","element":"span"},{"style":{"height":21.26},"width":115.44,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-11.png","element":"img","alt":" − o( 1N ","inline":true,"padRight":true},{"text":") for each user type, there is at ","element":"span"},{"text":"least one item type that is liked. This assumption also ensures that with probability 1 ","element":"span"},{"style":{"height":21.26},"width":216.68,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-12.png","element":"img","alt":" − o( 1N ), for","inline":true,"padRight":true},{"text":"any pair of user types, there is at least one item type which is rated differently by them. If there is no such item type, then the two user types rate everything similarly and are indistinguishable.","element":"span"}],[{"text":"The theorem indicates that up until time ","element":"span"},{"text":"r","element":"span"},{"text":", the algorithm is making meaningless (randomly chosen independent of feedback) recommendations. Random recommendations have probability half of being liked, hence incur regret with slope 1","element":"span"},{"text":"/","element":"span"},{"text":"2. ","element":"span"},{"text":"After that, the algorithm achieves the asymptotic slope indicating that on average ","element":"span"},{"style":{"height":12.21},"width":48.56,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-13.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"recommendations out of ","element":"span"},{"text":"N ","element":"span"},{"text":"are random. ","element":"span"},{"text":"The simplified version of this theorem in Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"is obtained using 2 log ","element":"span"},{"text":"N < ","element":"span"},{"text":"r ","element":"span"},{"text":"< ","element":"span"},{"text":"7 log ","element":"span"},{"text":"N ","element":"span"},{"text":"(since ","element":"span"},{"style":{"height":16.61},"width":148.44,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-14.png","element":"img","alt":"qU ≤ N","inline":true},{"text":"). We also pick the constant ","element":"span"},{"text":"C ","element":"span"},{"text":"large enough so that ","element":"span"},{"style":{"height":23.38},"width":716.36,"height":58.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/11-15.png","element":"img","alt":"r2 + 2qU+2N T + 2 ≤ C(log N + qUN T) for","inline":true,"padRight":true},{"text":"T > ","element":"span"},{"text":"r","element":"span"},{"text":".","element":"span"}],[{"text":"4.2 ","element":"span"},{"text":"Proof of Theorem ","element":"span"},{"href":"#id-22","text":"4.1","element":"a"}],[{"text":"We first bound the probability that the partition created by the algorithm is correct in Lemma ","element":"span"},{"href":"#id-34","text":"4.2. ","element":"a"},{"text":"Next, to prove the theorem we will show that conditioned on the partition being correct, the number of exploratory recommendations (and hence the regret) is upper bounded.","element":"span"}],[{"id":"id-34","style":{"height":20.05},"width":1148.08,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-0.png","element":"img","alt":"Lemma 4.2. Let Buv = {1{�τU (u)=�τU (v)} = 1{τU (u)=τU (v)}}","inline":true,"padRight":true},{"text":"be the event that users ","element":"span"},{"text":"u ","element":"span"},{"text":"and ","element":"span"},{"text":"v ","element":"span"},{"text":"are partitioned correctly with respect to each other in Step 1 of ","element":"span"},{"style":{"height":12.8},"width":470.52,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-1.png","element":"img","alt":" User-User. Let ǫ and r","inline":true,"padRight":true},{"text":"be as defined there. If ","element":"span"},{"style":{"height":27.02},"width":508.24,"height":67.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-2.png","element":"img","alt":" qI > 4r, then P[Bcuv] ≤ 2ǫq2U ","inline":true,"padRight":true},{"text":". It follows that if ","element":"span"},{"style":{"height":17.6},"width":214.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-3.png","element":"img","alt":" B = � Buv","inline":true,"padRight":true},{"text":"is the event that all users are ","element":"span"},{"text":"partitioned correctly, then ","element":"span"},{"style":{"height":17.6},"width":250.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-4.png","element":"img","alt":" P[B] > 1 − ǫ.","inline":true}],[{"text":"Proof. ","element":"span"},{"text":"As observed in Remark ","element":"span"},{"href":"#id-35","text":"4.1, ","element":"a"},{"text":"users from the same partition rate items identically. Therefore the only way an error in partitioning occurs is if users of different types are grouped together. This happens when two users rate all exploratory items identically in Step 1. In Step 1, the first ","element":"span"},{"text":"r ","element":"span"},{"text":"items recommended to all users are chosen uniformly at random independent of feedback, so the types of these items are uniformly distributed on [","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-5.png","element":"img","alt":"qI","inline":true},{"text":"]. Let s be the number of items with distinct item types among the ","element":"span"},{"text":"r ","element":"span"},{"text":"exploratory items from Step 1. This is a balls and bins scenario with ","element":"span"},{"text":"r ","element":"span"},{"text":"balls into ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-6.png","element":"img","alt":"qI","inline":true,"padRight":true},{"text":"bins, and Lemma ","element":"span"},{"href":"#id-36","text":"A.3 ","element":"a"},{"text":"states that if ","element":"span"},{"style":{"height":19.92},"width":844.88,"height":49.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-7.png","element":"img","alt":" qI > 4r, then P[s < r/2] ≤ exp(−r/2) ≤ ǫ/q2U","inline":true},{"text":". By symmetry, ","element":"span"},{"text":"each of the types of the s items with distinct types is uniformly distributed on [","element":"span"},{"style":{"height":18},"width":66.24,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-8.png","element":"img","alt":"qI].","inline":true}],[{"text":"Since all users rate the same items and users of the same type have identical preferences, as far as the lemma is concerned we only consider how the user types themselves rate items in Step 1. Two user types ","element":"span"},{"style":{"height":16.8},"width":123.04,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-9.png","element":"img","alt":" k ̸= k′ ","inline":true,"padRight":true},{"text":"rate s independently chosen items of distinct types in the same way with probability 2","element":"span"},{"style":{"height":5.6},"width":39.4,"height":14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-10.png","element":"img","alt":"−s","inline":true},{"text":". On the event s ","element":"span"},{"style":{"height":17.6},"width":83.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-11.png","element":"img","alt":" ≥ r/","inline":true},{"text":"2, we have 2","element":"span"},{"style":{"height":21.12},"width":357.12,"height":52.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-12.png","element":"img","alt":"−s < 2−r/2 ≤ ǫ/q2U.","inline":true}],[{"text":"The above two statements show that for users ","element":"span"},{"style":{"height":17.6},"width":750.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-13.png","element":"img","alt":" u and v with τU(u) = k and τU(v) = k′,","inline":true}],[{"style":{"width":"54%"},"width":1019,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-14.png","element":"img"}],[{"text":"The second statement in the lemma follows by union bounding over","element":"span"},{"style":{"height":21.15},"width":227.92,"height":52.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-15.png","element":"img","alt":"�qU2�≤ q2U/","inline":true},{"text":"2 pairs of user ","element":"span"},{"text":"types.","element":"span"}],[{"href":"#id-22","style":{"height":16.4},"width":619.8,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-16.png","element":"img","alt":"Proof of Theorem 4.1. For t ≤ r","inline":true},{"text":", the algorithm recommends random items chosen independently of feedback to all users. So at these times ","element":"span"},{"style":{"height":19.43},"width":397.36,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-17.png","element":"img","alt":" P[Lu,au,t = −1] = 1/","inline":true},{"text":"2 for all users ","element":"span"},{"style":{"height":17.6},"width":135.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-18.png","element":"img","alt":" u ∈ [N","inline":true},{"text":"]. It follows that, for ","element":"span"},{"style":{"height":15.2},"width":117.12,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-19.png","element":"img","alt":" T ≤ r,","inline":true}],[{"id":"id-44","style":{"width":"82%"},"width":1545,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-20.png","element":"img"}],[{"text":"Now consider the case ","element":"span"},{"text":"T > ","element":"span"},{"text":"r","element":"span"},{"text":". At ","element":"span"},{"text":"t ","element":"span"},{"text":"= ","element":"span"},{"text":"r","element":"span"},{"text":", ","element":"span"},{"text":"by Lemma ","element":"span"},{"href":"#id-34","text":"4.2, ","element":"a"},{"text":"the partitioning step recovers the user types correctly with probability at least 1 ","element":"span"},{"style":{"height":17.6},"width":655.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-21.png","element":"img","alt":" − ǫ, i.e., P[B] > 1 − ǫ. On event B","inline":true,"padRight":true},{"text":"all users in a partition have the same type, so by construction of the sets ","element":"span"},{"style":{"height":19.25},"width":111.4,"height":48.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-22.png","element":"img","alt":" S�τU(u)","inline":true,"padRight":true},{"text":"in Line ","element":"span"},{"href":"#id-27","text":"15 ","element":"a"},{"text":"of ","element":"span"},{"style":{"height":19.25},"width":611.84,"height":48.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/12-23.png","element":"img","alt":" User-User, items in S�τU (u) are","inline":true}],[{"text":"liked by at least one user of the same type as ","element":"span"},{"text":"u ","element":"span"},{"text":"and therefore also by ","element":"span"},{"text":"u","element":"span"},{"text":", and","element":"span"}],[{"id":"id-47","style":{"width":"76%"},"width":1437,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-0.png","element":"img"}],[{"text":"Because there are ","element":"span"},{"text":"TN ","element":"span"},{"text":"terms in the sum and ","element":"span"},{"style":{"height":17.6},"width":179.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-1.png","element":"img","alt":" P[Bc] ≤ ǫ","inline":true},{"text":", it follows that","element":"span"}],[{"id":"id-42","style":{"width":"79%"},"width":1491,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-2.png","element":"img"}],[{"text":"Now, we need to find an upper bound for the expected number of disliked exploration recommendations in Step 2 of the algorithm, ","element":"span"},{"style":{"height":32.4},"width":952.8,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-3.png","element":"img","alt":" E��Tt=r+1�u∈[N] 1�Lu,au,t = −1, au,t /∈ S�τU (u)��.","inline":true}],[{"text":"It will be useful to relate the expected number of liked and disliked explorations. To this end, we consider the event that every user type likes at least 1","element":"span"},{"text":"/","element":"span"},{"text":"3 of the item types: define the event","element":"span"}],[{"style":{"width":"49%"},"width":921,"height":116,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-4.png","element":"img"}],[{"text":"A Chernoff bound (Lemma ","element":"span"},{"href":"#id-37","text":"A.1) ","element":"a"},{"text":"applied to the i.i.d. ","element":"span"},{"style":{"height":16.4},"width":20,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-5.png","element":"img","alt":" ξ","inline":true,"padRight":true},{"text":"variables gives ","element":"span"},{"style":{"height":18},"width":513.04,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-6.png","element":"img","alt":" P[Cc] ≤ qU exp(−qI/36) ≤","inline":true,"padRight":true},{"text":"1","element":"span"},{"text":"/N","element":"span"},{"text":", where the last inequality due to ","element":"span"},{"style":{"height":16.21},"width":158.04,"height":40.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-7.png","element":"img","alt":" qI > 18r","inline":true},{"text":". Conditioning on event ","element":"span"},{"text":"C","element":"span"},{"text":", we get","element":"span"}],[{"id":"id-39","style":{"width":"89%"},"width":1674,"height":317,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-8.png","element":"img"}],[{"text":"To obtain an upper bound for the first term, in Claim ","element":"span"},{"href":"#id-38","text":"4.3 ","element":"a"},{"text":"below we will upper bound the expected number of exploration recommendations that were ","element":"span"},{"text":"liked","element":"span"},{"text":", and on event ","element":"span"},{"text":"C ","element":"span"},{"text":"this will provide also an upper bound for the expected number of exploration recommendations that were disliked. The number of liked explorations is easier to deal with, because of a self-limiting effect: these result in items added to sets ","element":"span"},{"style":{"height":17.6},"width":90.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-9.png","element":"img","alt":" {Sk}","inline":true,"padRight":true},{"text":"for exploitation, and exploration only happens when there are not enough items to be exploited.","element":"span"}],[{"text":"We now relate the expected number of liked and disliked explorations. At ","element":"span"},{"style":{"height":20.05},"width":387.84,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-10.png","element":"img","alt":" t > r if au,t /∈ S�τU (u),","inline":true,"padRight":true},{"text":"then it means the item is an exploratory recommendation and thus ","element":"span"},{"style":{"height":13.09},"width":64.32,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-11.png","element":"img","alt":" au,t","inline":true,"padRight":true},{"text":"is an independent new random item with uniformly random type ","element":"span"},{"style":{"height":18.29},"width":244.04,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-12.png","element":"img","alt":" τI(au,t) ∈ [qI","inline":true},{"text":"]. Hence, using the definition of event ","element":"span"},{"text":"C","element":"span"},{"text":",","element":"span"}],[{"id":"id-43","style":{"width":"71%"},"width":1347,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-13.png","element":"img"}],[{"text":"and 1 ","element":"span"},{"style":{"height":15.2},"width":169.44,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-14.png","element":"img","alt":" − p ≤ 2p","inline":true},{"text":". It follows that","element":"span"}],[{"style":{"width":"85%"},"width":1595,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/13-15.png","element":"img"}],[{"text":"This means that to bound the first term in ","element":"span"},{"href":"#id-39","text":"(5) ","element":"a"},{"text":"it suffices to bound the contribution from the sum with ","element":"span"},{"style":{"height":18.62},"width":889.44,"height":46.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-0.png","element":"img","alt":" Lu,au,t = +1, as derived in the following claim.","inline":true}],[{"id":"id-38","text":"Claim 4.3. ","element":"span"},{"text":"On event ","element":"span"},{"text":"C","element":"span"},{"text":", the number of liked ‘explore’ recommendations (line 13 of Algorithm ","element":"span"},{"href":"#id-27","text":"1) ","element":"a"},{"text":"by time ","element":"span"},{"text":"T ","element":"span"},{"text":"can be bounded as","element":"span"}],[{"style":{"width":"55%"},"width":1031,"height":141,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-1.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"For user partition ","element":"span"},{"style":{"height":19.71},"width":411.56,"height":49.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-2.png","element":"img","alt":" k and time t, define Stk ","inline":true,"padRight":true},{"text":"to be the set of items denoted by ","element":"span"},{"style":{"height":15.28},"width":43.4,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-3.png","element":"img","alt":" Sk","inline":true,"padRight":true},{"text":"in the algorithm ","element":"span"},{"text":"at time ","element":"span"},{"text":"t","element":"span"},{"text":", after making the time-step ","element":"span"},{"text":"t ","element":"span"},{"text":"recommendations. Item ","element":"span"},{"style":{"height":13.09},"width":64.32,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-4.png","element":"img","alt":" au,t","inline":true,"padRight":true},{"text":"is added to ","element":"span"},{"style":{"height":15.28},"width":43.4,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-5.png","element":"img","alt":" Sk","inline":true,"padRight":true},{"text":"precisely on the event ","element":"span"},{"style":{"height":21.92},"width":815.44,"height":54.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-6.png","element":"img","alt":" {t > r, �τU(u) = k, au,t /∈ St−1k , Lu,au,t = +1}","inline":true},{"text":". Therefore, dropping ","element":"span"},{"text":"C ","element":"span"},{"text":"from the indicator,","element":"span"}],[{"id":"id-40","style":{"width":"99%"},"width":1867,"height":428,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-7.png","element":"img"}],[{"text":"If ","element":"span"},{"style":{"height":21.46},"width":955.6,"height":53.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-8.png","element":"img","alt":" |St−1k | ≥ t, then St−1k \\ {au,1, · · · , · · · , au,t−1} ̸= ∅","inline":true},{"text":". Meanwhile, at time ","element":"span"},{"text":"t","element":"span"},{"text":", the exploration event (recommending ","element":"span"},{"style":{"height":25.42},"width":231.4,"height":63.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-9.png","element":"img","alt":" au,t /∈ St−1�τU(u) ","inline":true,"padRight":true},{"text":"in line 13) happens only if there are no items left in ","element":"span"},{"style":{"height":25.42},"width":312.52,"height":63.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-10.png","element":"img","alt":" St−1τU(u) for user u","inline":true,"padRight":true},{"text":"to exploit, i.e., ","element":"span"},{"style":{"height":25.23},"width":580.24,"height":63.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-11.png","element":"img","alt":" St−1τU(u) \\ {au,1, · · · , au,t−1} = ∅","inline":true},{"text":". In this way, ","element":"span"},{"style":{"height":21.26},"width":188.8,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-12.png","element":"img","alt":" |St−1k | ≥ t","inline":true,"padRight":true},{"text":"guarantees that there is an exploitable item at time ","element":"span"},{"text":"t ","element":"span"},{"text":"for each user in ","element":"span"},{"style":{"height":14.88},"width":48.24,"height":37.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-13.png","element":"img","alt":" Pk","inline":true},{"text":". Consequently,","element":"span"}],[{"id":"id-41","style":{"width":"75%"},"width":1420,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-14.png","element":"img"}],[{"text":"The bound ","element":"span"},{"style":{"height":17.6},"width":72.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-15.png","element":"img","alt":" |Pk|","inline":true,"padRight":true},{"text":"is due to the sum having ","element":"span"},{"style":{"height":17.6},"width":72.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-16.png","element":"img","alt":" |Pk|","inline":true,"padRight":true},{"text":"terms, each upper bounded by 1.","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":21.26},"width":658.96,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-17.png","element":"img","alt":" t∗ = max{t : r ≤ t < T, |St−1k | < t}","inline":true,"padRight":true},{"text":"be the last time for which we are ","element":"span"},{"text":"not ","element":"span"},{"text":"guaranteed (based on the reasoning before the last displayed eqn.) to have an exploitable item. Note that the set over which we take the maximum is nonempty if ","element":"span"},{"style":{"height":18.58},"width":679.88,"height":46.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-18.png","element":"img","alt":" T > r since |Srk| = 0. It follows that","inline":true}],[{"style":{"width":"79%"},"width":1485,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-19.png","element":"img"}],[{"text":"Since for ","element":"span"},{"style":{"height":21.46},"width":572.32,"height":53.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-20.png","element":"img","alt":" t∗ < t < T we have |St−1k | ≥ t","inline":true},{"text":", by ","element":"span"},{"href":"#id-40","text":"(9) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-41","text":"(10) ","element":"a"},{"text":"for these times we have ","element":"span"},{"style":{"height":21.46},"width":316.72,"height":53.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-21.png","element":"img","alt":" |St+1k | − |Stk| = 0","inline":true,"padRight":true},{"text":". This gives (a) in the above display. By definition, ","element":"span"},{"style":{"height":21.87},"width":337.72,"height":54.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-22.png","element":"img","alt":" |St∗−1k | < t∗ < T","inline":true},{"text":". Inequality (b) uses ","element":"span"},{"href":"#id-40","text":"(9) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-41","text":"(10) ","element":"a"},{"text":"to bound ","element":"span"},{"style":{"height":21.87},"width":279.36,"height":54.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-23.png","element":"img","alt":" |St∗k | − |St∗−1k |.","inline":true}],[{"text":"Note that ","element":"span"},{"style":{"height":22.03},"width":324.6,"height":55.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-24.png","element":"img","alt":"�k∈[qU ] |Pk| = N","inline":true},{"text":". Using ","element":"span"},{"href":"#id-40","text":"(8) ","element":"a"},{"text":"and summing the last displayed inequality over the (at ","element":"span"},{"text":"most) ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/14-25.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"partition indices proves the claim.","element":"span"}],[{"text":"We can now complete the proof of Theorem ","element":"span"},{"href":"#id-22","text":"4.1. ","element":"a"},{"text":"By the preceding claim and Equations ","element":"span"},{"href":"#id-42","text":"(4)","element":"a"},{"text":", ","element":"span"},{"href":"#id-39","text":"(5)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-43","text":"(7) ","element":"a"},{"text":"we get","element":"span"}],[{"style":{"width":"80%"},"width":1500,"height":404,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-0.png","element":"img"}],[{"text":"For ","element":"span"},{"text":"T > ","element":"span"},{"text":"r ","element":"span"},{"text":", ","element":"span"},{"text":"we can now bound the regret by combining Equation ","element":"span"},{"href":"#id-44","text":"(2) ","element":"a"},{"text":"with the previous display:","element":"span"}],[{"style":{"width":"85%"},"width":1605,"height":245,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-1.png","element":"img"}],[{"text":"4.3 ","element":"span"},{"text":"User-User Algorithm with Noisy Preferences","element":"span"}],[{"text":"We generalize the result to the scenario in which the feedback to the recommendation system is noisy. In this case, the preference of user ","element":"span"},{"text":"u ","element":"span"},{"text":"for item ","element":"span"},{"text":"i ","element":"span"},{"text":"is","element":"span"}],[{"id":"id-48","style":{"width":"61%"},"width":1145,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-2.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":13.09},"width":61.44,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-3.png","element":"img","alt":" zu,i","inline":true,"padRight":true},{"text":"are i.i.d. random variables with ","element":"span"},{"style":{"height":18.29},"width":808.8,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-4.png","element":"img","alt":" P[zu,i = +1] = 1 − γ and P[zu,i = −1] = γ","inline":true,"padRight":true},{"text":"(we assume 0 ","element":"span"},{"style":{"height":17.6},"width":190.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-5.png","element":"img","alt":" < γ < 1/","inline":true},{"text":"2). With probability ","element":"span"},{"style":{"height":11.6},"width":24,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-6.png","element":"img","alt":" γ","inline":true},{"text":", the preference of user ","element":"span"},{"text":"u ","element":"span"},{"text":"for item ","element":"span"},{"text":"i ","element":"span"},{"text":"is flipped relative to the preference of user type ","element":"span"},{"style":{"height":17.6},"width":89.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-7.png","element":"img","alt":" τU(u","inline":true},{"text":") for item type ","element":"span"},{"style":{"height":17.6},"width":69.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-8.png","element":"img","alt":" τI(i","inline":true},{"text":") in the preference matrix Ξ.","element":"span"}],[{"text":"To accommodate the noisy feedback, we modify the partitioning subroutine in Step 1 of ","element":"span"},{"text":"UserUser ","element":"span"},{"text":"algorithm with ","element":"span"},{"text":"NoisyUserPartition ","element":"span"},{"text":"given in Algorithm ","element":"span"},{"href":"#id-45","text":"2. ","element":"a"},{"text":"The main modification is that in Lines ","element":"span"},{"href":"#id-27","text":"5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-27","text":"6 ","element":"a"},{"text":"of ","element":"span"},{"text":"User-User ","element":"span"},{"text":"algorithm, users are placed in the same partition if they rate all of the first ","element":"span"},{"text":"r ","element":"span"},{"text":"items similarly. Instead, users are now placed in the same partition if they rate the majority of the first ","element":"span"},{"text":"r ","element":"span"},{"text":"items similarly. The parameters ","element":"span"},{"style":{"height":12.8},"width":144.6,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-9.png","element":"img","alt":" λ and r","inline":true,"padRight":true},{"text":"are chosen to guarantee that the partitioning over users is consistent with their type with probability greater than 1 ","element":"span"},{"style":{"height":8},"width":73.44,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-10.png","element":"img","alt":" − ǫ.","inline":true}],[{"text":"Remark 4.2. ","element":"span"},{"text":"In Line ","element":"span"},{"href":"#id-45","text":"8, ","element":"a"},{"text":"the algorithm checks whether there is a partitioning over the users consistent with variables ","element":"span"},{"style":{"height":13.09},"width":66.92,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-11.png","element":"img","alt":" gu,v","inline":true},{"text":". This is true precisely when the graph with edge set ","element":"span"},{"style":{"height":13.09},"width":66.92,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/15-12.png","element":"img","alt":" gu,v","inline":true,"padRight":true},{"text":"is a disjoint union of cliques.","element":"span"}],[{"text":"Remark 4.3. ","element":"span"},{"text":"The noisy feedback decreases performance in two ways: partitioning users correctly requires more exploration recommendations, resulting in a larger cold-start time. Additionally, in Step 2, even good exploitation recommendations can be disliked due to noise. The next theorem quantifies these observations.","element":"span"}],[{"id":"id-45","style":{"width":"100%"},"width":1873,"height":822,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-0.png","element":"img"}],[{"text":"Theorem 4.4. ","element":"span"},{"text":"Consider the model introduced in Section ","element":"span"},{"text":"2 ","element":"span"},{"text":"with ","element":"span"},{"style":{"height":16.61},"width":238.16,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-1.png","element":"img","alt":" N users, qU","inline":true,"padRight":true},{"text":"user types and ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-2.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"item types. Let ","element":"span"},{"style":{"height":24.85},"width":1072.44,"height":62.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-3.png","element":"img","alt":" r =� 12(1−2γ)2 log N�. If qI > 432 log N, then User-User","inline":true,"padRight":true},{"text":"achieves regret","element":"span"}],[{"style":{"width":"0%"},"width":8,"height":4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-4.png","element":"img"}],[{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-5.png","element":"img","alt":"1","inline":true},{"style":{"height":18.54},"width":678.84,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-6.png","element":"img","alt":"2T , if T ≤ r","inline":true}],[{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-7.png","element":"img","alt":"1","inline":true},{"style":{"height":23.57},"width":698.4,"height":58.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-8.png","element":"img","alt":"2r +� 5qU+2N + γ�T + 5 , if T > r .","inline":true}],[{"text":"Proof. ","element":"span"},{"text":"The proof of this theorem is very similar to the proof of Theorem ","element":"span"},{"href":"#id-22","text":"4.1. ","element":"a"},{"text":"Lemma ","element":"span"},{"href":"#id-46","text":"4.5 ","element":"a"},{"text":"replaces Lemma ","element":"span"},{"href":"#id-34","text":"4.2 ","element":"a"},{"text":"to show that with the given choice of parameters in Algorithm ","element":"span"},{"href":"#id-45","text":"2, ","element":"a"},{"text":"the partitioning ","element":"span"},{"style":{"height":14.88},"width":48.24,"height":37.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-9.png","element":"img","alt":" Pk","inline":true,"padRight":true},{"text":"is the same as partitioning over the users by their types with probability greater than 1 ","element":"span"},{"style":{"height":17.6},"width":138.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-10.png","element":"img","alt":" − 1/N.","inline":true,"padRight":true},{"text":"Additionally, Equation ","element":"span"},{"href":"#id-47","text":"(3) ","element":"a"},{"text":"in the proof of Theorem ","element":"span"},{"href":"#id-22","text":"4.1 ","element":"a"},{"text":"changes as follows to be consistent as a result of noisy feedback modeled in ","element":"span"},{"href":"#id-48","text":"(11)","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"57%"},"width":1083,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-11.png","element":"img"}],[{"text":"Equation ","element":"span"},{"href":"#id-43","text":"(6) ","element":"a"},{"text":"is replaced with","element":"span"}],[{"style":{"width":"46%"},"width":868,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-12.png","element":"img"}],[{"text":"and since ","element":"span"},{"style":{"height":17.6},"width":475.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-13.png","element":"img","alt":" γ < 1/2, then 1 − p ≤ 5p","inline":true},{"text":". It follows that","element":"span"}],[{"style":{"width":"85%"},"width":1594,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/16-14.png","element":"img"}],[{"text":"Claim ","element":"span"},{"href":"#id-38","text":"4.3 ","element":"a"},{"text":"bounds the right-hand side. Plugging in these equations in the subsequent part of the proof of Theorem ","element":"span"},{"href":"#id-22","text":"4.1 ","element":"a"},{"text":"gives the statement of the theorem.","element":"span"}],[{"id":"id-46","text":"Lemma 4.5. ","element":"span"},{"text":"Consider the user similarities computed in Step 7 of ","element":"span"},{"text":"NoisyUserPartition ","element":"span"},{"text":"(Algorithm ","element":"span"},{"href":"#id-45","text":"2)","element":"a"},{"text":". Define the event ","element":"span"},{"style":{"height":19.85},"width":571.12,"height":49.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-0.png","element":"img","alt":" Buv = {gu,v = 1{τU(u)=τU (v)}}","inline":true,"padRight":true},{"text":"that these similarities coincide with the underlying user types. If ","element":"span"},{"style":{"height":19.54},"width":782.6,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-1.png","element":"img","alt":" qI > 144 log(N 2/ǫ), then P[Bcuv] ≤ 2ǫ/N 2","inline":true},{"text":". It follows that if ","element":"span"},{"style":{"height":17.6},"width":256.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-2.png","element":"img","alt":" B = � Buv is","inline":true,"padRight":true},{"text":"the event that all users are partitioned correctly, then ","element":"span"},{"style":{"height":17.6},"width":266.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-3.png","element":"img","alt":" P[Bc] > 1 − ǫ.","inline":true}],[{"id":"id-12","text":"The proof of this lemma is similar to the proof of Lemma ","element":"span"},{"href":"#id-34","text":"4.2 ","element":"a"},{"text":"and is deferred to Appendix ","element":"span"},{"text":"C.","element":"span"}]]},{"heading":"5 User structure only: lower bound","paragraphs":[[{"text":"In this section we prove a lower bound on the regret of any online recommendation system in the regime with user structure only where ","element":"span"},{"style":{"height":16.74},"width":161.2,"height":41.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-4.png","element":"img","alt":" qI = 2qU ","inline":true,"padRight":true},{"text":"as described in Definition ","element":"span"},{"href":"#id-28","text":"2.1.","element":"a"}],[{"id":"id-24","style":{"height":21.67},"width":1310.72,"height":54.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-5.png","element":"img","alt":"Theorem 5.1. Let δ > 0 and r = ⌊log qU − log�16 (log qU) log Nδ�⌋","inline":true},{"text":". In the user structure model with ","element":"span"},{"style":{"height":17.01},"width":302.48,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-6.png","element":"img","alt":" N users and qU","inline":true,"padRight":true},{"text":"user types, any recommendation algorithm must incur regret","element":"span"}],[{"style":{"width":"56%"},"width":1054,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-7.png","element":"img"}],[{"text":"Remark 5.1. ","element":"span"},{"text":"The lower bound depends on a parameter ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-8.png","element":"img","alt":" δ","inline":true,"padRight":true},{"text":"that has two effects: (1) the slope of the regret curve during the cold start grows (approaching ","element":"span"},{"text":"1","element":"span"},{"text":"/","element":"span"},{"text":"2","element":"span"},{"text":") as the chosen parameter ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-9.png","element":"img","alt":" δ","inline":true,"padRight":true},{"text":"shrinks to ","element":"span"},{"text":"0","element":"span"},{"text":"; (2) the cold start time ","element":"span"},{"text":"r ","element":"span"},{"text":"is upper bounded as ","element":"span"},{"style":{"height":17.81},"width":472.36,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-10.png","element":"img","alt":" r < log qU − log log(1/δ).","inline":true}],[{"text":"Additionally, if the number of user types satisfies ","element":"span"},{"style":{"height":17.81},"width":204.4,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-11.png","element":"img","alt":" qU/N → 0","inline":true},{"text":", the slope of regret after the cold start time (the asymptotic rate of regret) approaches ","element":"span"},{"style":{"height":20.77},"width":46.8,"height":51.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-12.png","element":"img","alt":"qU2N ","inline":true,"padRight":true},{"text":". This is expected since each item can be ","element":"span"},{"text":"recommended at most once to each user. Hence, even if the structure in the user space is known, the algorithm should explore new items. On average, ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-13.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"explorations (about half of which are disliked) are necessary for every ","element":"span"},{"text":"N ","element":"span"},{"text":"recommendations.","element":"span"}],[{"text":"The simplified version of this theorem in Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"is obtained using ","element":"span"},{"style":{"height":19.35},"width":435.88,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-14.png","element":"img","alt":" N ≥ qU ≥ (log N)1.1,","inline":true},{"style":{"height":17.6},"width":438.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-15.png","element":"img","alt":"δ = 1/100 and N > N0","inline":true,"padRight":true},{"text":"for a constant ","element":"span"},{"style":{"height":14.69},"width":67.24,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-16.png","element":"img","alt":" N0.","inline":true}],[{"text":"5.1 ","element":"span"},{"text":"Proof strategy","element":"span"}],[{"text":"At a high level, the lower bound is based on two observations:","element":"span"}],[{"text":"• ","element":"span"},{"text":"A good estimate of user types is necessary to make meaningful recommendations. Notably, estimating similarity between users requires approximately ","element":"span"},{"text":"log ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-17.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"items rated in common.","element":"span"}],[{"text":"Suppose that the preference matrix Ξ (with elements ","element":"span"},{"style":{"height":17.68},"width":62.52,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-18.png","element":"img","alt":" ξk,j","inline":true},{"text":", the preference of user types for item types) is known (which is the case in user-structure only model). Also, suppose that we have obtained feedback from some user ","element":"span"},{"text":"u ","element":"span"},{"text":"for ","element":"span"},{"text":"t ","element":"span"},{"text":"items. Relative to the total number of types, user ","element":"span"},{"text":"u ","element":"span"},{"text":"must belong to a restricted set of user types consistent with this feedback. If ","element":"span"},{"text":"t ","element":"span"},{"text":"is small, the set of consistent types is large (for instance, if a user has rated only one item, there are roughly ","element":"span"},{"style":{"height":17.81},"width":73.36,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/17-19.png","element":"img","alt":" qU/","inline":true},{"text":"2 candidate user types for this user). At this point, user ","element":"span"},{"text":"u ","element":"span"},{"text":"likes some item ","element":"span"},{"text":"i ","element":"span"},{"text":"with probability proportional to the number of consistent types liking the item. Control of this","element":"span"}],[{"text":"count amounts to a property of the matrix we call (","element":"span"},{"style":{"height":14.4},"width":53.04,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-0.png","element":"img","alt":"t, ǫ","inline":true},{"text":")-column regularity in Definition ","element":"span"},{"href":"#id-49","text":"5.1, ","element":"a"},{"text":"which holds with high probability.","element":"span"}],[{"text":"• ","element":"span"},{"text":"Even if we know the user types (i.e., clustering of users), the first time a given item is recommended to a user from a given type, the outcome is uniformly random.","element":"span"}],[{"text":"Since there is no structure in item space (","element":"span"},{"style":{"height":16.54},"width":168.4,"height":41.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-1.png","element":"img","alt":"qI = 2qU","inline":true},{"text":"), learning the preference of a user type for an item is only achieved by recommending the item to one user from the user type. This is for the reason that the random variable ","element":"span"},{"style":{"height":19.05},"width":183.4,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-2.png","element":"img","alt":" ξτU(u),τI(i)","inline":true,"padRight":true},{"text":"in the preference matrix is independent of all previous history in the situation described.","element":"span"}],[{"text":"5.2 ","element":"span"},{"text":"Proof of Theorem ","element":"span"},{"href":"#id-24","text":"5.1","element":"a"}],[{"text":"We separately prove the two lower bounds in the statement of the theorem, starting with the first. The following regularity property in submatrices of the preference matrix allows us to control the posterior probability for an item being liked.","element":"span"}],[{"id":"id-49","style":{"height":17.6},"width":413.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-3.png","element":"img","alt":"Definition 5.1 ((r, ǫ","inline":true},{"text":")-column regularity)","element":"span"},{"style":{"height":17.6},"width":468.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-4.png","element":"img","alt":". Let A ∈ {−1, +1}m×n","inline":true},{"text":". For ordered tuple of distinct (column) indices ","element":"span"},{"style":{"height":17.6},"width":1097.88,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-5.png","element":"img","alt":" w = (i1, . . . , ir) ∈ [n]r, let M = (A·i)i∈w ∈ {−1, +1}m×r ","inline":true,"padRight":true},{"text":"be the matrix formed from the columns of ","element":"span"},{"text":"A ","element":"span"},{"text":"indexed by ","element":"span"},{"text":"w","element":"span"},{"text":". For given row vector ","element":"span"},{"style":{"height":18.48},"width":709.28,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-6.png","element":"img","alt":" b ∈ {−1, +1}r, let Kb,w(A) ⊆ [m] be","inline":true,"padRight":true},{"text":"the set of rows in ","element":"span"},{"text":"M ","element":"span"},{"text":"that are identical to ","element":"span"},{"text":"b ","element":"span"},{"text":"and denote its cardinality by ","element":"span"},{"style":{"height":18.48},"width":123.72,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-7.png","element":"img","alt":" kb,w(A","inline":true},{"text":"). The matrix ","element":"span"},{"text":"A ","element":"span"},{"text":"is said to be (","element":"span"},{"style":{"height":17.6},"width":415.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-8.png","element":"img","alt":"r, ǫ)-column regular if","inline":true}],[{"style":{"width":"78%"},"width":1472,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-9.png","element":"img"}],[{"id":"id-50","style":{"height":17.6},"width":964.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-10.png","element":"img","alt":"Claim 5.2. If matrix A ∈ {−1, +1}m×n is (r, ǫ","inline":true},{"text":")-column regular","element":"span"},{"style":{"height":17.6},"width":582.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-11.png","element":"img","alt":", then it is also (s, ǫ)-column","inline":true,"padRight":true},{"text":"regular ","element":"span"},{"text":"for all ","element":"span"},{"text":"s < r.","element":"span"}],[{"text":"Proof. ","element":"span"},{"text":"Suppose that ","element":"span"},{"style":{"height":17.6},"width":160.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-12.png","element":"img","alt":" A is (r, ǫ","inline":true},{"text":")-column regular. By induction it suffices to show that ","element":"span"},{"style":{"height":17.6},"width":264.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-13.png","element":"img","alt":" A is (r − 1, ǫ)-","inline":true,"padRight":true},{"text":"column regular. We will check that (1","element":"span"},{"style":{"height":19.53},"width":600.12,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-14.png","element":"img","alt":"−ǫ) m2r−1 ≤ kb,w(A) ≤ (1+ǫ) m2r−1","inline":true,"padRight":true},{"text":"for all size ","element":"span"},{"style":{"height":16},"width":336.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-15.png","element":"img","alt":" r−1 tuples w and","inline":true,"padRight":true},{"text":"vectors ","element":"span"},{"text":"b","element":"span"},{"text":". For any given ","element":"span"},{"style":{"height":19.14},"width":1144.44,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-16.png","element":"img","alt":" w ∈ [n]r−1 and b ∈ {−1, +1}r−1, let b+ = [b 1] ∈ {−1, +1}r ","inline":true,"padRight":true},{"text":"be obtained from ","element":"span"},{"text":"b ","element":"span"},{"text":"by appending +1. Similarly ","element":"span"},{"style":{"height":12.8},"width":44.72,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-17.png","element":"img","alt":" b− ","inline":true,"padRight":true},{"text":"is obtained from ","element":"span"},{"text":"b ","element":"span"},{"text":"by appending ","element":"span"},{"style":{"height":17.6},"width":517.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-18.png","element":"img","alt":" −1. If w′ = (w, i) ∈ [n]r for","inline":true,"padRight":true},{"text":"any ","element":"span"},{"style":{"height":19.44},"width":1792.04,"height":48.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-19.png","element":"img","alt":" i /∈ w, then Kb,w = Kb+,w′ ∪Kb−,w′ and Kb+,w′ ∩Kb−,w′ = ∅, so kb,w = kb+,w′ +kb−,w. Since A is","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":11.2},"width":55.44,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-20.png","element":"img","alt":"r, ǫ","inline":true},{"text":")-column regular, (1","element":"span"},{"style":{"height":19.25},"width":614.17,"height":48.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-21.png","element":"img","alt":"−ǫ) m2r ≤ kb+,w′, kb−,w′ ≤ (1+ǫ) m2r","inline":true,"padRight":true},{"text":", hence (1","element":"span"},{"style":{"height":19.54},"width":543.84,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-22.png","element":"img","alt":"−ǫ) m2r−1 ≤ kb,w ≤ (1+ǫ) m2r−1.","inline":true}],[{"id":"id-51","text":"Lemma 5.3. ","element":"span"},{"text":"Let matrix ","element":"span"},{"style":{"height":17.6},"width":351.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-23.png","element":"img","alt":" A ∈ {−1, +1}m×n ","inline":true,"padRight":true},{"text":"have i.i.d. ","element":"span"},{"text":"Bern(1","element":"span"},{"text":"/","element":"span"},{"text":"2) ","element":"span"},{"text":"entries. If ","element":"span"},{"style":{"height":16},"width":331.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-24.png","element":"img","alt":" ǫ < 1, then A is","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":17.6},"width":72.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-25.png","element":"img","alt":"r, ǫ)","inline":true},{"text":"-column regular (i.e., ","element":"span"},{"style":{"height":17.89},"width":157.6,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-26.png","element":"img","alt":" A ∈ Ωr,ǫ","inline":true},{"text":") with probability at least","element":"span"}],[{"style":{"width":"26%"},"width":492,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/18-27.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"For given column tuple ","element":"span"},{"text":"w ","element":"span"},{"text":"and row vector ","element":"span"},{"text":"b","element":"span"},{"text":", the expected number of times the row vector ","element":"span"},{"text":"b ","element":"span"},{"text":"appears is ","element":"span"},{"style":{"height":18.46},"width":31.92,"height":46.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-0.png","element":"img","alt":"m2r","inline":true,"padRight":true},{"text":". A Chernoff bound (Lemma ","element":"span"},{"href":"#id-37","text":"A.1) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":15.6},"width":203.72,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-1.png","element":"img","alt":" ǫ < 1 gives","inline":true}],[{"style":{"width":"45%"},"width":845,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-2.png","element":"img"}],[{"text":"There are no more than ","element":"span"},{"style":{"height":12.34},"width":41.4,"height":30.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-3.png","element":"img","alt":" nr ","inline":true,"padRight":true},{"text":"possible choices of column tuple ","element":"span"},{"style":{"height":15.6},"width":176.28,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-4.png","element":"img","alt":" w, and 2r ","inline":true,"padRight":true},{"text":"possible choices of row vector ","element":"span"},{"text":"b","element":"span"},{"text":"; the union bound yields the proof.","element":"span"}],[{"text":"Proposition 5.4. ","element":"span"},{"text":"Consider the user structure only model in Definition ","element":"span"},{"href":"#id-28","text":"2.1. ","element":"a"},{"text":"Let ","element":"span"},{"style":{"height":13.2},"width":282.64,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-5.png","element":"img","alt":" δ > 0 and r =","inline":true},{"style":{"height":21.66},"width":931.8,"height":54.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-6.png","element":"img","alt":"⌊log qU − log�16 (log qU) log Nδ�⌋. For any T ≤ r","inline":true},{"text":", the regret is lower bounded by","element":"span"}],[{"style":{"width":"27%"},"width":517,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-7.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"We will show that for preference matrices Ξ satisfying column regularity, at any time ","element":"span"},{"style":{"height":14.4},"width":101.28,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-8.png","element":"img","alt":" t ≤ r,","inline":true,"padRight":true},{"text":"most users have probability roughly half of liking any particular item given the feedback obtained thus far, even if the preference matrix is known. (Recall that the preference matrix contains the preference of each user type for each item type; there is still uncertainty in the actual type of each user or item).","element":"span"}],[{"text":"At time ","element":"span"},{"text":"t","element":"span"},{"text":", suppose that ","element":"span"},{"text":"n ","element":"span"},{"text":"items in total have been recommended by the algorithm (","element":"span"},{"style":{"height":14.4},"width":244.64,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-9.png","element":"img","alt":"n ≤ Nt since","inline":true,"padRight":true},{"text":"each of the ","element":"span"},{"text":"N ","element":"span"},{"text":"users rates one item per time-step). We label the set of items by [","element":"span"},{"text":"n","element":"span"},{"text":"] = ","element":"span"},{"text":"{","element":"span"},{"text":"1","element":"span"},{"text":", . . . , n","element":"span"},{"text":"}","element":"span"},{"text":". Let ","element":"span"},{"style":{"height":17.41},"width":324.08,"height":43.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-10.png","element":"img","alt":" A be the qU × n","inline":true,"padRight":true},{"text":"matrix indicating the preference of each user type for these ","element":"span"},{"text":"n ","element":"span"},{"text":"items. Each item ","element":"span"},{"style":{"height":17.6},"width":539.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-11.png","element":"img","alt":" i has type τI(i) ∼ Unif([2qU","inline":true},{"text":"]) and because the set of columns of the preference matrix Ξ is precisely ","element":"span"},{"style":{"height":17.6},"width":213.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-12.png","element":"img","alt":" {−1, +1}qU","inline":true},{"text":", the columns of ","element":"span"},{"text":"A ","element":"span"},{"text":"are independent and uniformly distributed in ","element":"span"},{"style":{"height":17.6},"width":213.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-13.png","element":"img","alt":" {−1, +1}qU","inline":true,"padRight":true},{"text":"according to this model.","element":"span"}],[{"text":"We now focus on a particular user ","element":"span"},{"style":{"height":20.05},"width":453.48,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-14.png","element":"img","alt":" u. Let w = {au,s}s∈[t−1]","inline":true,"padRight":true},{"text":"be the items recommended to user ","element":"span"},{"text":"u ","element":"span"},{"text":"up to time ","element":"span"},{"style":{"height":11.6},"width":60.88,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-15.png","element":"img","alt":" t −","inline":true,"padRight":true},{"text":"1, and let ","element":"span"},{"style":{"height":21.39},"width":643.4,"height":53.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-16.png","element":"img","alt":" b = (Lu,au,s)s∈[t−1] ∈ {−1, +1}t−1 ","inline":true,"padRight":true},{"text":"be the vector of feedback for these items. We claim that conditional on the matrix ","element":"span"},{"style":{"height":17.6},"width":925.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-17.png","element":"img","alt":" A, vectors b and w, the type τU(u) of user u at","inline":true,"padRight":true},{"text":"the end of time instant ","element":"span"},{"style":{"height":11.6},"width":58.96,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-18.png","element":"img","alt":" t −","inline":true,"padRight":true},{"text":"1 is uniformly distributed over the set of user types ","element":"span"},{"style":{"height":18.48},"width":138.12,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-19.png","element":"img","alt":" Kb,w(A","inline":true},{"text":") consistent with this data (","element":"span"},{"style":{"height":18.48},"width":138.12,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-20.png","element":"img","alt":"Kb,w(A","inline":true},{"text":") is defined in Definition ","element":"span"},{"href":"#id-49","text":"5.1)","element":"a"},{"text":".","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":18.73},"width":431.04,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-21.png","element":"img","alt":" b+ = [b 1] ∈ {−1, +1}t ","inline":true,"padRight":true},{"text":"be obtained from ","element":"span"},{"text":"b ","element":"span"},{"text":"by appending +1. Then ","element":"span"},{"style":{"height":18.63},"width":415.16,"height":46.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-22.png","element":"img","alt":" Lu,au,t = +1 precisely","inline":true,"padRight":true},{"text":"when ","element":"span"},{"style":{"height":20.39},"width":423.24,"height":50.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-23.png","element":"img","alt":" τU(u) ∈ Kb+,{w,au,t}(A","inline":true},{"text":"), which in words reads “user ","element":"span"},{"text":"u ","element":"span"},{"text":"is among those types that are consistent with the first ","element":"span"},{"style":{"height":11.6},"width":58.48,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-24.png","element":"img","alt":" t −","inline":true,"padRight":true},{"text":"1 ratings of ","element":"span"},{"text":"u ","element":"span"},{"text":"and have preference vector with ’+1’ for the item recommended to ","element":"span"},{"text":"u ","element":"span"},{"text":"at time ","element":"span"},{"text":"t","element":"span"},{"text":"”. It follows that for any matrix ","element":"span"},{"text":"A ","element":"span"},{"text":"corresponding to items [","element":"span"},{"text":"n","element":"span"},{"text":"],","element":"span"}],[{"style":{"width":"80%"},"width":1499,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-25.png","element":"img"}],[{"text":"The second equality is due to: i) ","element":"span"},{"style":{"height":17.49},"width":255.36,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-26.png","element":"img","alt":" w, b, and au,t","inline":true,"padRight":true},{"text":"are functions of ","element":"span"},{"style":{"height":14.69},"width":92.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-27.png","element":"img","alt":" Ht−1","inline":true},{"text":"; ii) for fixed ","element":"span"},{"text":"w ","element":"span"},{"text":"and ","element":"span"},{"text":"b ","element":"span"},{"text":"the set ","element":"span"},{"style":{"height":18.48},"width":138.12,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-28.png","element":"img","alt":"Kb,w(A","inline":true},{"text":") is determined by ","element":"span"},{"style":{"height":17.6},"width":216.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-29.png","element":"img","alt":" A; iii) τU(u","inline":true},{"text":") is uniformly distributed on ","element":"span"},{"style":{"height":18.48},"width":137.64,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-30.png","element":"img","alt":" Kb,w(A","inline":true},{"text":") conditional on ","element":"span"},{"text":"A, b, w","element":"span"},{"text":". Recall that Ω","element":"span"},{"style":{"height":10.4},"width":37.6,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-31.png","element":"img","alt":"t,ǫ","inline":true,"padRight":true},{"text":"was defined as the set of (","element":"span"},{"style":{"height":14.4},"width":53.04,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/19-32.png","element":"img","alt":"t, ǫ","inline":true},{"text":")- column regular matrices. It now follows by the","element":"span"}],[{"text":"tower property of conditional expectation that","element":"span"}],[{"style":{"width":"76%"},"width":1428,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-0.png","element":"img"}],[{"text":"The last two equalities are justified as follows: if ","element":"span"},{"style":{"height":17.89},"width":169.6,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-1.png","element":"img","alt":" A ∈ Ωt,ǫ","inline":true,"padRight":true},{"text":"then by Claim ","element":"span"},{"href":"#id-50","text":"5.2, ","element":"a"},{"style":{"height":17.89},"width":311,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-2.png","element":"img","alt":" A ∈ Ωt−1,ǫ. By","inline":true,"padRight":true},{"text":"Definition ","element":"span"},{"href":"#id-49","text":"5.1, ","element":"a"},{"text":"this means that ","element":"span"},{"style":{"height":21.59},"width":1266.68,"height":53.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-3.png","element":"img","alt":" kb,w(A) ≥ (1 − ǫ)m/2t−1 and kb+,{w,i}(A) ≤ (1 + ǫ)m/2t . We pick","inline":true},{"style":{"height":17.6},"width":119.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-4.png","element":"img","alt":"ǫ < 1/","inline":true},{"text":"2 to get the last inequality.","element":"span"}],[{"text":"Fix ","element":"span"},{"style":{"height":13.2},"width":67.12,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-5.png","element":"img","alt":" δ >","inline":true,"padRight":true},{"text":"0 and define","element":"span"}],[{"style":{"width":"29%"},"width":543,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-6.png","element":"img"}],[{"text":"Lemma ","element":"span"},{"href":"#id-51","text":"5.3 ","element":"a"},{"text":"shows that at time ","element":"span"},{"style":{"height":17.2},"width":260.24,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-7.png","element":"img","alt":" t ≤ r < log qU","inline":true},{"text":", for this choice of ","element":"span"},{"style":{"height":17.89},"width":376.76,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-8.png","element":"img","alt":" ǫt, we have A ∈ Ωt,ǫt","inline":true,"padRight":true},{"text":"with probability 1 ","element":"span"},{"style":{"height":12.8},"width":63.68,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-9.png","element":"img","alt":" − δ","inline":true},{"text":". We get the bound","element":"span"}],[{"style":{"width":"76%"},"width":1428,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-10.png","element":"img"}],[{"text":"It follows from the above display and the definition of ","element":"span"},{"style":{"height":15.6},"width":324.48,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-11.png","element":"img","alt":" ǫt that for T ≤ r,","inline":true}],[{"style":{"width":"88%"},"width":1650,"height":297,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-12.png","element":"img"}],[{"text":"where (a) uses the definition of ","element":"span"},{"style":{"height":10.29},"width":29.76,"height":25.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-13.png","element":"img","alt":" ǫt","inline":true},{"text":". (b) uses the summation of a Geometric series and ","element":"span"},{"style":{"height":14.4},"width":228.4,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-14.png","element":"img","alt":" t ≤ T < r <","inline":true,"padRight":true},{"text":"log ","element":"span"},{"style":{"height":12.21},"width":48.56,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-15.png","element":"img","alt":" qU","inline":true},{"text":". (c) uses the definition of ","element":"span"},{"style":{"height":23.78},"width":1062.24,"height":59.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-16.png","element":"img","alt":" r and T ≤ r < log qU − log�12 log qU�− log log 2N log qUδ .","inline":true}],[{"text":"We now proceed to the proof of the second lower bound in Theorem ","element":"span"},{"href":"#id-24","text":"5.1.","element":"a"}],[{"id":"id-52","style":{"width":"69%"},"width":1309,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-17.png","element":"img"}],[{"text":"The main ingredient in the proof of the proposition is definition of an event that implies that the outcome of the associated recommendation is uniformly random. Let ","element":"span"},{"style":{"height":23.49},"width":135.84,"height":58.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-18.png","element":"img","alt":" BtτU (u),i ","inline":true,"padRight":true},{"text":"be the event that ","element":"span"},{"text":"some user of same type ","element":"span"},{"style":{"height":17.6},"width":199.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-19.png","element":"img","alt":" τU(u) as u","inline":true,"padRight":true},{"text":"has rated item ","element":"span"},{"style":{"height":16.4},"width":292.32,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-20.png","element":"img","alt":" i by time t − 1:","inline":true}],[{"style":{"width":"81%"},"width":1531,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-21.png","element":"img"}],[{"id":"id-53","text":"Claim 5.6. ","element":"span"},{"text":"If no user with the same type as ","element":"span"},{"text":"u ","element":"span"},{"text":"has rated item ","element":"span"},{"style":{"height":16.4},"width":252.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-22.png","element":"img","alt":" i by time t−1","inline":true},{"text":", the probability that user ","element":"span"},{"text":"u ","element":"span"},{"text":"likes item ","element":"span"},{"text":"i ","element":"span"},{"text":"conditional on any history consistent with this is ","element":"span"},{"style":{"height":24.47},"width":648.53,"height":61.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-23.png","element":"img","alt":" P[Lu,i = −1|(BtτU (u),i)c, Ht−1] = 12.","inline":true}],[{"text":"Proof. ","element":"span"},{"text":"According to Definition ","element":"span"},{"href":"#id-28","text":"2.1, ","element":"a"},{"text":"in the user-structure only model, the matrix Ξ is deterministic and has columns consisting of all sequences in ","element":"span"},{"style":{"height":17.6},"width":230.4,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-24.png","element":"img","alt":" {−1, +1}qU .","inline":true,"padRight":true},{"text":"We will show that ","element":"span"},{"style":{"height":18.29},"width":168.4,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/20-25.png","element":"img","alt":" P[Lu,i =","inline":true}],[{"style":{"width":"76%"},"width":1437,"height":63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-0.png","element":"img"}],[{"text":"A priori ","element":"span"},{"style":{"height":17.6},"width":70.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-1.png","element":"img","alt":" τI(i","inline":true},{"text":") is uniform on [","element":"span"},{"style":{"height":11.6},"width":36.68,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-2.png","element":"img","alt":"qI","inline":true},{"text":"]. Given the sequence ","element":"span"},{"style":{"height":17.6},"width":76.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-3.png","element":"img","alt":" τU(·","inline":true},{"text":"), the matrix Ξ and the feedback ","element":"span"},{"style":{"height":14.69},"width":92.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-4.png","element":"img","alt":" Ht−1","inline":true,"padRight":true},{"text":"up to time ","element":"span"},{"style":{"height":11.6},"width":57.04,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-5.png","element":"img","alt":" t −","inline":true,"padRight":true},{"text":"1, the posterior distribution of ","element":"span"},{"style":{"height":17.6},"width":69.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-6.png","element":"img","alt":" τI(i","inline":true},{"text":") is uniform over the set of all item types ","element":"span"},{"text":"j ","element":"span"},{"text":"which are consistent with the outcome of recommending ","element":"span"},{"text":"i ","element":"span"},{"text":"to users of various types. We call this set ","element":"span"},{"style":{"height":17.6},"width":101.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-7.png","element":"img","alt":" St(i).","inline":true}],[{"text":"Since on the event (","element":"span"},{"style":{"height":23.49},"width":169.08,"height":58.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-8.png","element":"img","alt":"BtτU (u),i)c","inline":true},{"text":", no user with the same type as ","element":"span"},{"style":{"height":16.4},"width":620.16,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-9.png","element":"img","alt":" u has rated i by time t − 1, and","inline":true,"padRight":true},{"text":"since the matrix Ξ has columns consisting of all sequences in ","element":"span"},{"style":{"height":17.6},"width":213.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-10.png","element":"img","alt":" {−1, +1}qU ","inline":true,"padRight":true},{"text":", half of the item types in set ","element":"span"},{"style":{"height":17.6},"width":72.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-11.png","element":"img","alt":" St(i","inline":true},{"text":") are liked by user ","element":"span"},{"text":"u ","element":"span"},{"text":"and half of them are disliked.","element":"span"}],[{"text":"The final ingredient in the proof of Proposition ","element":"span"},{"href":"#id-52","text":"5.5 ","element":"a"},{"text":"is a lower bound on the number of items recommended for which Claim ","element":"span"},{"href":"#id-53","text":"5.6 ","element":"a"},{"text":"applies.","element":"span"}],[{"text":"Claim 5.7. ","element":"span"},{"text":"The expected number of times a new item is recommended to a user type by time ","element":"span"},{"text":"T ","element":"span"},{"text":"is lower bounded as","element":"span"}],[{"style":{"width":"52%"},"width":987,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-12.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"At the end of time-step ","element":"span"},{"text":"T ","element":"span"},{"text":"each user has been recommended ","element":"span"},{"text":"T ","element":"span"},{"text":"items, hence each ","element":"span"},{"text":"user type ","element":"span"},{"text":"has been recommended at least ","element":"span"},{"text":"T ","element":"span"},{"text":"items. Let ","element":"span"},{"style":{"height":12.21},"width":49.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-13.png","element":"img","alt":" �qU","inline":true,"padRight":true},{"text":"be the number of user types in which there is at least one user. The total number of times an item is recommended to a user type for the first time is at least ","element":"span"},{"style":{"height":16.61},"width":82.36,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-14.png","element":"img","alt":" �qUT","inline":true},{"text":". Applying Lemma ","element":"span"},{"href":"#id-54","text":"A.5 ","element":"a"},{"text":"shows that ","element":"span"},{"style":{"height":21.73},"width":906.24,"height":54.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-15.png","element":"img","alt":" E [�qU] ≥ qU(1−(1−1/qU)N) ≥ qI�1−e(−N/qU )�.","inline":true}],[{"text":"We now complete the proof of Proposition ","element":"span"},{"href":"#id-52","text":"5.5.","element":"a"}],[{"text":"Proof of Prop ","element":"span"},{"href":"#id-52","text":"5.5. ","element":"a"},{"text":"Partitioning recommendations according to ","element":"span"},{"style":{"height":24.02},"width":290.12,"height":60.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-16.png","element":"img","alt":" BtτU (u),au,t gives","inline":true}],[{"style":{"width":"94%"},"width":1774,"height":467,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-17.png","element":"img"}],[{"text":"where all the summations are over ","element":"span"},{"style":{"height":17.6},"width":387,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-18.png","element":"img","alt":" t ∈ [T] and u ∈ [N","inline":true},{"text":"]. Rearranging shows that ","element":"span"},{"style":{"height":17.6},"width":255.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-19.png","element":"img","alt":" regret(T) ≥","inline":true},{"style":{"height":21.89},"width":546.24,"height":54.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/21-20.png","element":"img","alt":"�1 − e(−N/qU)� qU2N T for all T.","inline":true}]]},{"heading":"6 Item-item algorithm and analysis","paragraphs":[[{"text":"This section describes a version of item-item CF with explicit exploration steps and analyzes its performance within the setup specified in Section ","element":"span"},{"text":"2. ","element":"span"},{"text":"The algorithm is quite different from the user-user algorithm. This is due to the inherent asymmetry between users and items: multiple users can rate a given item simultaneously but each user can rate only one item at each time-step.","element":"span"}],[{"id":"id-29","text":"6.1 ","element":"span"},{"text":"Algorithm","element":"span"}],[{"text":"Algorithm ","element":"span"},{"text":"Item-Item ","element":"span"},{"text":"performs the following steps (see Algorithm ","element":"span"},{"href":"#id-30","text":"3)","element":"a"},{"text":". First, items are partitioned according to type; next, each user’s preference for each item type is determined; finally, items from liked partitions are recommended. These steps are now described in more detail.","element":"span"}],[{"text":"Two sets ","element":"span"},{"style":{"height":15.94},"width":251.72,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-0.png","element":"img","alt":" M1 and M2","inline":true},{"text":", each containing ","element":"span"},{"text":"M ","element":"span"},{"text":"random items, are selected. ","element":"span"},{"text":"In the exploration step, each item is recommended to ","element":"span"},{"style":{"height":17.81},"width":334.88,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-1.png","element":"img","alt":" r = ⌈2 log(qI/ǫ)⌉","inline":true,"padRight":true},{"text":"random users. ","element":"span"},{"text":"The feedback from these recommendations later helps to partition the items according to type. The parameter ","element":"span"},{"text":"r ","element":"span"},{"text":"is chosen large enough to guarantee small probability of error in partitioning (","element":"span"},{"style":{"height":13.6},"width":64.08,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-2.png","element":"img","alt":"≤ ǫ","inline":true,"padRight":true},{"text":"for each item). The use of two sets ","element":"span"},{"style":{"height":15.94},"width":242.6,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-3.png","element":"img","alt":" M1 and M2","inline":true},{"text":", as opposed to just one, is to simplify the analysis; as described next, item type representatives are selected from ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-4.png","element":"img","alt":" M2","inline":true},{"text":", and are used to represent clusters of items from ","element":"span"},{"style":{"height":15.94},"width":83.52,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-5.png","element":"img","alt":" M1.","inline":true}],[{"text":"An item from each of ","element":"span"},{"style":{"height":16.4},"width":290.2,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-6.png","element":"img","alt":" ℓ explored types","inline":true,"padRight":true},{"text":"is recommended to all users, where ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-7.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"is a parameter determined by the algorithm. It turns out that it is often beneficial (depending on system parameters) to learn user preferences for only a subset of the types, in which case ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-8.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"is strictly less than ","element":"span"},{"style":{"height":17.2},"width":166.08,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-9.png","element":"img","alt":" qI. Each","inline":true,"padRight":true},{"text":"of the ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-10.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"items chosen from ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-11.png","element":"img","alt":" M2 ","inline":true,"padRight":true},{"text":"is thought of as a ","element":"span"},{"text":"representative ","element":"span"},{"text":"of its type. For each ","element":"span"},{"style":{"height":16.4},"width":228.96,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-12.png","element":"img","alt":" j = 1, . . . , ℓ,","inline":true,"padRight":true},{"text":"all items in ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-13.png","element":"img","alt":" M2 ","inline":true,"padRight":true},{"text":"that appear to be of the same type as the representative item i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-14.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":"are stored in a set ","element":"span"},{"style":{"height":21.94},"width":46.76,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-15.png","element":"img","alt":" S2j ","inline":true,"padRight":true},{"text":"and then removed from ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-16.png","element":"img","alt":" M2","inline":true},{"text":". This guarantees that at each time ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-17.png","element":"img","alt":" M2","inline":true,"padRight":true},{"text":"does not contain items ","element":"span"},{"text":"with the same type as any of the previously selected representative items. For each ","element":"span"},{"style":{"height":16.4},"width":234.24,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-18.png","element":"img","alt":" j = 1, . . . , ℓ,","inline":true,"padRight":true},{"text":"all items in ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-19.png","element":"img","alt":" M1 ","inline":true,"padRight":true},{"text":"that appear to be of the same type as i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-20.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":"are stored in a set ","element":"span"},{"style":{"height":21.94},"width":46.76,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-21.png","element":"img","alt":" S1j ","inline":true,"padRight":true},{"text":"and then removed ","element":"span"},{"text":"from ","element":"span"},{"style":{"height":15.94},"width":83.52,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-22.png","element":"img","alt":" M1.","inline":true}],[{"text":"For each user ","element":"span"},{"text":"u","element":"span"},{"text":", we add the items in the groups ","element":"span"},{"style":{"height":21.94},"width":46.76,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-23.png","element":"img","alt":" S1j ","inline":true,"padRight":true},{"text":"whose representative i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-24.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":"were liked by ","element":"span"},{"text":"u ","element":"span"},{"text":"to ","element":"span"},{"text":"the set of exploitable items ","element":"span"},{"style":{"height":16.4},"width":164.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-25.png","element":"img","alt":" Ru by u","inline":true},{"text":". Finally, in the exploitation phase, each user is recommended items from ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-26.png","element":"img","alt":" Ru","inline":true},{"text":". We choose the number ","element":"span"},{"text":"M ","element":"span"},{"text":"of items in each of ","element":"span"},{"style":{"height":15.94},"width":249.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-27.png","element":"img","alt":" M1 and M2 ","inline":true,"padRight":true},{"text":"as a function of ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-28.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"to ensure that there are enough exploitable items in ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-29.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"for all users ","element":"span"},{"text":"u ","element":"span"},{"text":"for the entire length-","element":"span"},{"text":"T ","element":"span"},{"text":"time-horizon. Then, ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-30.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"is chosen to minimize regret.","element":"span"}],[{"text":"The algorithm description uses the following notation. For an item ","element":"span"},{"text":"i ","element":"span"},{"text":"and time ","element":"span"},{"text":"t > ","element":"span"},{"text":"0,","element":"span"}],[{"style":{"width":"45%"},"width":859,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-31.png","element":"img"}],[{"text":"is the set of users that have rated item ","element":"span"},{"text":"i ","element":"span"},{"text":"before time ","element":"span"},{"text":"t","element":"span"},{"text":". The time ","element":"span"},{"text":"t ","element":"span"},{"text":"is implicit in the algorithm description, with ","element":"span"},{"text":"rated","element":"span"},{"text":"(","element":"span"},{"text":"i","element":"span"},{"text":") used to represent ","element":"span"},{"style":{"height":17.6},"width":139.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-32.png","element":"img","alt":" ratedt(i","inline":true},{"text":") at the time of its appearance.","element":"span"}],[{"id":"id-30","style":{"width":"100%"},"width":1873,"height":590,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/22-33.png","element":"img"}],[{"id":"id-55","style":{"width":"100%"},"width":1873,"height":1417,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-0.png","element":"img"}],[{"text":"Remark 6.1. ","element":"span"},{"text":"The set of items ","element":"span"},{"style":{"height":15.94},"width":240.2,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-1.png","element":"img","alt":" M1 and M2 ","inline":true,"padRight":true},{"text":"are updated throughout algorithm ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":". In the proof, we use the notation ","element":"span"},{"style":{"height":19.34},"width":236.36,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-2.png","element":"img","alt":" M10 and M20 ","inline":true,"padRight":true},{"text":"to refer to the set of items ","element":"span"},{"style":{"height":15.94},"width":236.36,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-3.png","element":"img","alt":" M1 and M2","inline":true,"padRight":true},{"text":"at the beginning ","element":"span"},{"text":"of the algorithm ","element":"span"},{"text":"Item-Item","element":"span"},{"text":".","element":"span"}],[{"id":"id-57","text":"Remark 6.2. ","element":"span"},{"text":"Assignment of users to items for Line ","element":"span"},{"href":"#id-55","text":"3 ","element":"a"},{"text":"of ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":": This is done over ","element":"span"},{"style":{"height":19.54},"width":661.36,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-4.png","element":"img","alt":"⌈(|M10| + |M20|)r/N⌉ ≤ 2Mr/N +1","inline":true,"padRight":true},{"text":"time-steps, with additional recommendations being random new ","element":"span"},{"text":"items. The main requirement in assigning users to items is to make Claim ","element":"span"},{"href":"#id-56","text":"6.3 ","element":"a"},{"text":"(which bounds the probability of mis-classification of each item) hold. Since the claim addresses each item separately, we may introduce dependencies between sets of users for different items. What is important is that the set of users assigned to each specific item is uniform at random among all sets of users (or at least contains a random subset with size ","element":"span"},{"text":"r","element":"span"},{"text":"). For example, one can choose a random permutation over the users, repeat this list ","element":"span"},{"style":{"height":19.34},"width":664.72,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-5.png","element":"img","alt":" ⌈(|M10| + |M20|)r/N⌉ ≤ 2Mr/N + 1","inline":true,"padRight":true},{"text":"times, and then assign each item ","element":"span"},{"text":"to a block of ","element":"span"},{"text":"r ","element":"span"},{"text":"users.","element":"span"}],[{"id":"id-23","style":{"height":19.92},"width":1872.84,"height":49.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-6.png","element":"img","alt":"Theorem 6.1. Let r = ⌈2 log(2Nq2I)⌉. Suppose qI > 13 log N and qU > 4r. Then Item-Item","inline":true,"padRight":true},{"text":"(Algorithm ","element":"span"},{"href":"#id-30","text":"3) ","element":"a"},{"text":"obtains regret per user at time ","element":"span"},{"text":"T ","element":"span"},{"text":"upper bounded as","element":"span"}],[{"style":{"width":"30%"},"width":569,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/23-7.png","element":"img"}],[{"style":{"width":"99%"},"width":1864,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-0.png","element":"img"}],[{"text":"The simplified version of this theorem in Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"is obtained 2 log(","element":"span"},{"style":{"height":18},"width":515.52,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-1.png","element":"img","alt":"NqI) < r < 5 log(NqI) and","inline":true,"padRight":true},{"text":"N > ","element":"span"},{"text":"5.","element":"span"}],[{"text":"Remark 6.3. ","element":"span"},{"text":"The regret bound we get for the algorithm is actually ","element":"span"},{"style":{"height":17.6},"width":558.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-2.png","element":"img","alt":" regret(T) ≤ Y (T). One can","inline":true,"padRight":true},{"text":"obtain ","element":"span"},{"text":"T/","element":"span"},{"text":"2 ","element":"span"},{"text":"by a trivial algorithm which recommends random items independent of feedback to all users. This trivial algorithm improves on our bound for the parameter range in which ","element":"span"},{"text":"T/","element":"span"},{"text":"2 ","element":"span"},{"text":"< Y ","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":")","element":"span"},{"text":". Thus in our analysis we focus on the parameter range where ","element":"span"},{"style":{"height":17.6},"width":232.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-3.png","element":"img","alt":" Y (T) ≤ T/2","inline":true},{"text":"; one consequence is that ","element":"span"},{"style":{"height":13.2},"width":107.32,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-4.png","element":"img","alt":"ℓ < T","inline":true,"padRight":true},{"text":"as chosen in Algorithm ","element":"span"},{"href":"#id-30","text":"3.","element":"a"}],[{"text":"6.2 ","element":"span"},{"text":"Proof of Theorem ","element":"span"},{"href":"#id-23","text":"6.1","element":"a"}],[{"text":"We prove Theorem ","element":"span"},{"href":"#id-23","text":"6.1, ","element":"a"},{"text":"deferring several lemmas and claims to the next subsection.","element":"span"}],[{"text":"The basic error event is misclassification of an item. In ","element":"span"},{"href":"#id-55","style":{"height":21.94},"width":719.72,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-5.png","element":"img","alt":" ItemExplore (Alg. 4), S1j is the set","inline":true,"padRight":true},{"text":"of items in ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-6.png","element":"img","alt":" M1 ","inline":true,"padRight":true},{"text":"that the algorithm posits are of the same type as the ","element":"span"},{"text":"j","element":"span"},{"text":"-th representative i","element":"span"},{"style":{"height":19.82},"width":166.76,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-7.png","element":"img","alt":"j . Let E1i","inline":true,"padRight":true},{"text":"be the event that item ","element":"span"},{"text":"i ","element":"span"},{"text":"was mis-classified,","element":"span"}],[{"id":"id-63","style":{"width":"66%"},"width":1240,"height":57,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-8.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":14.69},"width":42.44,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-9.png","element":"img","alt":" T0","inline":true,"padRight":true},{"text":"be the number of time-steps spent making recommendations in ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":". ","element":"span"},{"text":"Recall that ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-10.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"is the set of items to be recommended to user ","element":"span"},{"text":"u ","element":"span"},{"text":"in the exploit phase. We partition the recommendations made by Algorithm ","element":"span"},{"text":"Item-Item ","element":"span"},{"text":"to decompose the regret as follows:","element":"span"}],[{"id":"id-60","style":{"width":"83%"},"width":1562,"height":725,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-11.png","element":"img"}],[{"text":"The first term, ","element":"span"},{"text":"A1","element":"span"},{"text":", is the regret from early time-steps up to ","element":"span"},{"style":{"height":14.69},"width":42.44,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-12.png","element":"img","alt":" T0","inline":true},{"text":". The second term, ","element":"span"},{"text":"A2","element":"span"},{"text":", is the regret due to not having enough items available for the exploitation phase, which is proved to be small with high probability for sufficiently large ","element":"span"},{"text":"M","element":"span"},{"text":". The third term, ","element":"span"},{"text":"A3","element":"span"},{"text":", is the regret due to exploiting the misclassified items. It is small since few items are misclassified with the proper choice of ","element":"span"},{"style":{"height":12.8},"width":101.76,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/24-13.png","element":"img","alt":" ǫ and","inline":true,"padRight":true},{"text":"r","element":"span"},{"text":". The fourth term, ","element":"span"},{"text":"A4","element":"span"},{"text":", is the regret due to exploiting the correctly classified items. It is intuitively clear and will be checked later that ","element":"span"},{"text":"A4 ","element":"span"},{"text":"= 0 ","element":"span"},{"text":".","element":"span"}],[{"text":"Bounding ","element":"span"},{"text":"A1","element":"span"},{"text":". ","element":"span"},{"text":"Line ","element":"span"},{"href":"#id-55","text":"3 ","element":"a"},{"text":"of ","element":"span"},{"text":"ItemExplore ","element":"span"},{"text":"takes at most","element":"span"},{"style":{"height":21.66},"width":116.04,"height":54.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-0.png","element":"img","alt":"�2MrN �","inline":true},{"text":"units of time to rate every item in ","element":"span"},{"style":{"height":19.34},"width":331.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-1.png","element":"img","alt":"M10 and M20 by r","inline":true,"padRight":true},{"text":"users each, since ","element":"span"},{"text":"N ","element":"span"},{"text":"users provide feedback at each time-step. Remark ","element":"span"},{"href":"#id-57","text":"6.2 ","element":"a"},{"text":"above ","element":"span"},{"text":"discusses the assignment of users to items in this phase. After this, ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-2.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"representative items are rated by every user in the for loop (lines 4 through 10 of ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":"), which takes ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-3.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"time-steps. This gives","element":"span"}],[{"style":{"width":"33%"},"width":633,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-4.png","element":"img"}],[{"text":"For ","element":"span"},{"style":{"height":15.2},"width":137.76,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-5.png","element":"img","alt":" t ≤ T0 ,","inline":true,"padRight":true},{"text":"from the perspective of any user the items recommended to it are of random type and hence","element":"span"}],[{"id":"id-64","style":{"width":"80%"},"width":1500,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-6.png","element":"img"}],[{"style":{"height":16.4},"width":700.52,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-7.png","element":"img","alt":"Bounding A2. Time-steps t > T0","inline":true,"padRight":true},{"text":"are devoted to exploitation as described in ","element":"span"},{"text":"ItemExploit","element":"span"},{"text":". During this phase, a random item ","element":"span"},{"style":{"height":18.29},"width":188.96,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-8.png","element":"img","alt":" au,t /∈ Ru","inline":true,"padRight":true},{"text":"is recommended to user ","element":"span"},{"text":"u ","element":"span"},{"text":"only when there are no items to exploit because all items in ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-9.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"have already been recommended to ","element":"span"},{"text":"u","element":"span"},{"text":". So, the total number of times an item ","element":"span"},{"style":{"height":18.29},"width":176.96,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-10.png","element":"img","alt":" au,t /∈ Ru","inline":true,"padRight":true},{"text":"is recommended in the time interval ","element":"span"},{"style":{"height":14.69},"width":207.16,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-11.png","element":"img","alt":" T0 < t ≤ T","inline":true,"padRight":true},{"text":"is at most (","element":"span"},{"style":{"height":17.6},"width":221.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-12.png","element":"img","alt":"T − |Ru|)+,","inline":true,"padRight":true},{"text":"and","element":"span"}],[{"id":"id-65","style":{"width":"89%"},"width":1666,"height":270,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-13.png","element":"img"}],[{"text":"where the last inequality is from Lemma ","element":"span"},{"href":"#id-58","text":"6.4 ","element":"a"},{"text":"in Section ","element":"span"},{"href":"#id-59","text":"6.3.","element":"a"}],[{"text":"Bounding ","element":"span"},{"text":"A3","element":"span"},{"text":". ","element":"span"},{"text":"Term ","element":"span"},{"text":"A3 ","element":"span"},{"text":"in ","element":"span"},{"href":"#id-60","text":"(15) ","element":"a"},{"text":"is the expected number of mistakes made by the Algorithm ","element":"span"},{"text":"ItemExploit ","element":"span"},{"text":"as a result of misclassification. Claim ","element":"span"},{"href":"#id-61","text":"6.2 ","element":"a"},{"text":"below upper bounds the expected number of “potential misclassifications” (defined in Equation ","element":"span"},{"href":"#id-62","text":"(19)","element":"a"},{"text":") in the algorithm to provide an upper bound for this quantity:","element":"span"}],[{"id":"id-66","style":{"width":"84%"},"width":1576,"height":293,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-14.png","element":"img"}],[{"text":"Bounding ","element":"span"},{"text":"A4","element":"span"},{"text":". ","element":"span"},{"text":"By definition of the mis-classification event, ","element":"span"},{"style":{"height":19.54},"width":43.88,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-15.png","element":"img","alt":" E1i ","inline":true,"padRight":true},{"text":", given in Equation ","element":"span"},{"href":"#id-63","text":"(14)","element":"a"},{"text":", if an item ","element":"span"},{"style":{"height":22.13},"width":118.28,"height":55.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-16.png","element":"img","alt":"i ∈ S1j ","inline":true,"padRight":true},{"text":"is correctly classified, then ","element":"span"},{"style":{"height":20.05},"width":711.4,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/25-17.png","element":"img","alt":" τI(i) = τI(ij). Since Lu,i = ξτU(u),τI(i)","inline":true},{"text":", all users rate ","element":"span"},{"text":"i ","element":"span"},{"text":"the same","element":"span"}],[{"text":"as i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-0.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":". By construction of the sets ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-1.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"in Line ","element":"span"},{"href":"#id-55","text":"11 ","element":"a"},{"text":"of ","element":"span"},{"text":"ItemExploit","element":"span"},{"text":", for any item ","element":"span"},{"style":{"height":15.6},"width":306.92,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-2.png","element":"img","alt":" i ∈ Ru, there is","inline":true,"padRight":true},{"text":"some ","element":"span"},{"style":{"height":18},"width":118.76,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-3.png","element":"img","alt":" j ∈ [qI","inline":true},{"text":"] such that ","element":"span"},{"style":{"height":22.13},"width":241.48,"height":55.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-4.png","element":"img","alt":" i ∈ S1j and u","inline":true,"padRight":true},{"text":"likes item i","element":"span"},{"style":{"height":17.09},"width":178.08,"height":42.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-5.png","element":"img","alt":"j . Hence,","inline":true}],[{"style":{"width":"78%"},"width":1461,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-6.png","element":"img"}],[{"text":"It follows that ","element":"span"},{"text":"A4 ","element":"span"},{"text":"= 0.","element":"span"}],[{"text":"Combining all the bounds. ","element":"span"},{"text":"Plugging in Equations ","element":"span"},{"href":"#id-64","text":"(16)","element":"a"},{"text":", ","element":"span"},{"href":"#id-65","text":"(17)","element":"a"},{"text":", ","element":"span"},{"href":"#id-66","text":"(18) ","element":"a"},{"text":"and ","element":"span"},{"text":"A4 ","element":"span"},{"text":"= 0 into Equation ","element":"span"},{"href":"#id-60","text":"(15) ","element":"a"},{"text":"gives","element":"span"}],[{"style":{"width":"65%"},"width":1232,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-7.png","element":"img"}],[{"text":"Setting ","element":"span"},{"style":{"height":23.97},"width":313.16,"height":59.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-8.png","element":"img","alt":" M = 64TqIℓ gives","inline":true}],[{"style":{"width":"90%"},"width":1688,"height":157,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-9.png","element":"img"}],[{"text":"where (a) holds for ","element":"span"},{"style":{"height":14.8},"width":64.24,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-10.png","element":"img","alt":" ℓ ≥","inline":true,"padRight":true},{"text":"13 (imposed by the algorithm for ","element":"span"},{"style":{"height":14.4},"width":77.68,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-11.png","element":"img","alt":" T ≥","inline":true,"padRight":true},{"text":"2) which gives ","element":"span"},{"style":{"height":18},"width":412.72,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-12.png","element":"img","alt":" M/qI ≤ 5T and r > 6","inline":true,"padRight":true},{"text":"(which holds if ","element":"span"},{"style":{"height":17.81},"width":770.04,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-13.png","element":"img","alt":" qI, N ≥ 3). If ℓ = qI, since qI > 13 log N","inline":true},{"text":", we have 3","element":"span"},{"style":{"height":17.6},"width":609.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-14.png","element":"img","alt":"T exp(−ℓ/13) ≤ 3T/N ≤ r T/N.","inline":true,"padRight":true},{"text":"(b) is obtained by choosing the parameter ","element":"span"},{"style":{"height":12.8},"width":131.84,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-15.png","element":"img","alt":" ℓ to be","inline":true}],[{"style":{"width":"99%"},"width":1870,"height":447,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-16.png","element":"img"}],[{"id":"id-59","text":"6.3 ","element":"span"},{"text":"Lemmas used in the proof","element":"span"}],[{"text":"In the remainder of this section we state and prove the lemmas used in the analysis above.","element":"span"}],[{"id":"id-61","text":"Claim 6.2. ","element":"span"},{"text":"Suppose that ","element":"span"},{"style":{"height":16.8},"width":162.84,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-17.png","element":"img","alt":" qU > 4r","inline":true},{"text":". The expected total number of times a misclassified item is recommended in the exploitation step is upper bounded as","element":"span"}],[{"style":{"width":"50%"},"width":941,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-18.png","element":"img"}],[{"style":{"height":19.73},"width":303.08,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-19.png","element":"img","alt":"Proof. Event E1i ","inline":true,"padRight":true},{"text":", defined in ","element":"span"},{"href":"#id-63","text":"(14) ","element":"a"},{"text":"to be the misclassification of item ","element":"span"},{"style":{"height":15.94},"width":137.96,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/26-20.png","element":"img","alt":" i ∈ M1","inline":true},{"text":", occurs if the preferences ","element":"span"},{"text":"of random users classifying item ","element":"span"},{"text":"i ","element":"span"},{"text":"is the same as a previous representative item with a different type. Given matrix Ξ, this event is a function of the order of choosing the representative items and the choice of random users in Line ","element":"span"},{"href":"#id-55","text":"3 ","element":"a"},{"text":"of ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":". Instead of directly analyzing ","element":"span"},{"style":{"height":19.73},"width":248.48,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-0.png","element":"img","alt":" E1i , we define","inline":true,"padRight":true},{"text":"an event called “potential error event”","element":"span"},{"style":{"height":17.49},"width":82.04,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-1.png","element":"img","alt":"Ei,Ui","inline":true},{"text":", which will be shown to satisfy ","element":"span"},{"style":{"height":19.83},"width":398.6,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-2.png","element":"img","alt":" E1i ⊆Ei,Ui; an upper","inline":true,"padRight":true},{"text":"bound for ","element":"span"},{"style":{"height":18.29},"width":120.92,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-3.png","element":"img","alt":" P[Ei,Ui","inline":true},{"text":"] is given in Claim ","element":"span"},{"href":"#id-56","text":"6.3.","element":"a"}],[{"text":"For item ","element":"span"},{"text":"i ","element":"span"},{"text":"and subset of users ","element":"span"},{"style":{"height":17.6},"width":303.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-4.png","element":"img","alt":" U ⊆ [N] , define","inline":true}],[{"id":"id-62","style":{"width":"74%"},"width":1400,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-5.png","element":"img"}],[{"text":"to be the event that the ratings of users in ","element":"span"},{"text":"U ","element":"span"},{"text":"for item ","element":"span"},{"text":"i ","element":"span"},{"text":"agree with some other item type. For item ","element":"span"},{"style":{"height":21.93},"width":207.52,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-6.png","element":"img","alt":"i ∈ S1j , if t","inline":true,"padRight":true},{"text":"is the time ","element":"span"},{"text":"i ","element":"span"},{"text":"is added to ","element":"span"},{"style":{"height":21.93},"width":46.76,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-7.png","element":"img","alt":" S1j ","inline":true,"padRight":true},{"text":"in the exploration phase, let ","element":"span"},{"style":{"height":18.29},"width":541.76,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-8.png","element":"img","alt":" Ui = ratedt(i) ∩ ratedt(ij) be","inline":true,"padRight":true},{"text":"the set of witness users whose ratings were used to conclude that ","element":"span"},{"style":{"height":17.49},"width":146.48,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-9.png","element":"img","alt":" i and ij","inline":true,"padRight":true},{"text":"are of the same type. Item ","element":"span"},{"text":"i ","element":"span"},{"text":"is added to ","element":"span"},{"style":{"height":22.13},"width":46.76,"height":55.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-10.png","element":"img","alt":" S1j ","inline":true,"padRight":true},{"text":"only if all users in ","element":"span"},{"style":{"height":17.09},"width":378.8,"height":42.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-11.png","element":"img","alt":" Ui agree on i vs. ij","inline":true,"padRight":true},{"text":", so misclassification ","element":"span"},{"style":{"height":19.73},"width":268.8,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-12.png","element":"img","alt":" E1i (defined in","inline":true,"padRight":true},{"text":"Equation ","element":"span"},{"href":"#id-63","text":"(14)","element":"a"},{"text":") implies","element":"span"},{"style":{"height":17.49},"width":82.04,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-13.png","element":"img","alt":"Ei,Ui","inline":true},{"text":". We can now deduce the inequalities, justified below:","element":"span"}],[{"style":{"width":"86%"},"width":1620,"height":293,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-14.png","element":"img"}],[{"text":"Inequality (a) holds because according to Line ","element":"span"},{"href":"#id-55","text":"11 ","element":"a"},{"text":"of ","element":"span"},{"text":"ItemExplore ","element":"span"},{"text":"for every user ","element":"span"},{"style":{"height":15.6},"width":301.16,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-15.png","element":"img","alt":" u , the set Ru is","inline":true,"padRight":true},{"text":"a subset of ","element":"span"},{"style":{"height":19.34},"width":69.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-16.png","element":"img","alt":" M10","inline":true},{"text":", and also the containment ","element":"span"},{"style":{"height":17.49},"width":176.6,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-17.png","element":"img","alt":" Ei ⊆Ei,Ui","inline":true},{"text":"; (b) follows since each item ","element":"span"},{"text":"i ","element":"span"},{"text":"is recommended at most ","element":"span"},{"text":"N ","element":"span"},{"text":"times; (c) uses ","element":"span"},{"style":{"height":19.35},"width":199.6,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-18.png","element":"img","alt":" |M10| = M","inline":true,"padRight":true},{"text":"together with Claim ","element":"span"},{"href":"#id-56","text":"6.3 ","element":"a"},{"text":"which shows that ","element":"span"},{"style":{"height":18.29},"width":245.76,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-19.png","element":"img","alt":" P[Ei,Ui] ≤ 2ǫ.","inline":true}],[{"id":"id-56","text":"Claim 6.3. ","element":"span"},{"text":"Suppose that ","element":"span"},{"style":{"height":16.61},"width":147.48,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-20.png","element":"img","alt":" qU > 4r","inline":true},{"text":". Consider the “potential error” event","element":"span"},{"href":"#id-62","style":{"height":18.29},"width":480.48,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-21.png","element":"img","alt":"Ei,Ui defined in (19) with","inline":true,"padRight":true},{"text":"the set of users ","element":"span"},{"style":{"height":14.69},"width":41.76,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-22.png","element":"img","alt":" Ui","inline":true,"padRight":true},{"text":"defined immediately after. Then ","element":"span"},{"style":{"height":19.83},"width":534.28,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-23.png","element":"img","alt":" P[Ei,Ui] ≤ 2ǫ for all i ∈ M10.","inline":true}],[{"text":"Proof. ","element":"span"},{"text":"Each representative item i","element":"span"},{"style":{"height":17.49},"width":498.76,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-24.png","element":"img","alt":"j chosen in ItemExplore","inline":true,"padRight":true},{"text":"is rated by all of the users. Other items in ","element":"span"},{"style":{"height":19.34},"width":243.08,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-25.png","element":"img","alt":" M10 and M20 ","inline":true,"padRight":true},{"text":"are rated by at least ","element":"span"},{"text":"r ","element":"span"},{"text":"users in the exploration phase. Remark ","element":"span"},{"href":"#id-57","text":"6.2 ","element":"a"},{"text":"describes how ","element":"span"},{"text":"Line ","element":"span"},{"href":"#id-55","text":"3 ","element":"a"},{"text":"in ","element":"span"},{"text":"ItemExplore ","element":"span"},{"text":"is performed to guarantee that the set of ","element":"span"},{"text":"r ","element":"span"},{"text":"users assigned to each specific item is uniformly at random among all subset of users of size ","element":"span"},{"text":"r","element":"span"},{"text":". By Lemma ","element":"span"},{"href":"#id-36","text":"A.3, ","element":"a"},{"text":"if ","element":"span"},{"style":{"height":17.01},"width":263.04,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-26.png","element":"img","alt":" qU > 4r, then","inline":true,"padRight":true},{"text":"with probability at least 1 ","element":"span"},{"style":{"height":18},"width":684.4,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-27.png","element":"img","alt":" − exp(−r/2) ≥ 1 − ǫ/qI there are r/","inline":true},{"text":"2 users with distinct user types in a specific ","element":"span"},{"style":{"height":14.69},"width":41.76,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-28.png","element":"img","alt":" Ui","inline":true},{"text":". It follows that the ","element":"span"},{"text":"r","element":"span"},{"text":"/","element":"span"},{"text":"2 users with distinct types (chosen independently of the feedback) that rate item ","element":"span"},{"style":{"height":17.6},"width":243.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-29.png","element":"img","alt":" i of type τI(i","inline":true},{"text":") also have the same ratings for type ","element":"span"},{"style":{"height":17.6},"width":148.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-30.png","element":"img","alt":" j ̸= τI(i","inline":true},{"text":") with probability at most 2","element":"span"},{"style":{"height":12.8},"width":72.2,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-31.png","element":"img","alt":"−r/2","inline":true},{"text":". (Any two item types ","element":"span"},{"style":{"height":16.8},"width":114.88,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-32.png","element":"img","alt":" j ̸= j′ ","inline":true,"padRight":true},{"text":"have jointly independent columns in the preference matrix.) The choice ","element":"span"},{"style":{"height":18},"width":258,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-33.png","element":"img","alt":" r ≥ 2 log(qI/ǫ","inline":true},{"text":") and a union bound over item types ","element":"span"},{"text":"j ","element":"span"},{"text":"completes the proof.","element":"span"}],[{"id":"id-58","style":{"height":17.6},"width":731.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-34.png","element":"img","alt":"Lemma 6.4. For user u ∈ [N], let Ru","inline":true,"padRight":true},{"text":"be defined as in Line 10 of algorithm ","element":"span"},{"text":"ItemExplore","element":"span"},{"text":". Then","element":"span"}],[{"style":{"width":"67%"},"width":1269,"height":241,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/27-35.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"We begin with a sketch. ","element":"span"},{"text":"Any user ","element":"span"},{"text":"u ","element":"span"},{"text":"likes roughly half of the ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-0.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"item types that have representatives ","element":"span"},{"style":{"height":20.59},"width":132.2,"height":51.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-1.png","element":"img","alt":" {ij}ℓj=1","inline":true},{"text":". The total number of items in ","element":"span"},{"style":{"height":23.57},"width":245.08,"height":58.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-2.png","element":"img","alt":" M10 is 64 qIℓ T","inline":true},{"text":", so there are about ","element":"span"},{"style":{"height":21.26},"width":186.92,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-3.png","element":"img","alt":" 64ℓ T items","inline":true,"padRight":true},{"text":"from each of the item types in ","element":"span"},{"style":{"height":19.35},"width":69.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-4.png","element":"img","alt":" M10","inline":true},{"text":". Adding over the ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-5.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"types, the set ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-6.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"of items in ","element":"span"},{"style":{"height":19.35},"width":258.92,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-7.png","element":"img","alt":" M10 that user","inline":true,"padRight":true},{"text":"u ","element":"span"},{"text":"likes will typically have size at least ","element":"span"},{"text":"T","element":"span"},{"text":".","element":"span"}],[{"text":"Making this argument rigorous requires some care for the following reasons: 1) ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-8.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"is the union of the ","element":"span"},{"style":{"height":22.13},"width":561.32,"height":55.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-9.png","element":"img","alt":" S1j ’s with Lu,ij = +1, but S1j ","inline":true,"padRight":true},{"text":"can be missing items due to misclassification (there may be ","element":"span"},{"text":"items of the same type as i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-10.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":"that have been classified as being of the same type as some i","element":"span"},{"style":{"height":17.68},"width":98.6,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-11.png","element":"img","alt":"j′ for","inline":true,"padRight":true},{"text":"which ","element":"span"},{"style":{"height":20.34},"width":196.24,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-12.png","element":"img","alt":" Lu,ij′ = −","inline":true},{"text":"1). 2) The distribution of the type of each representative item depends on the number of remaining items of each type in ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-13.png","element":"img","alt":" M2 ","inline":true,"padRight":true},{"text":"when the representative is chosen. Again, due to misclassification, this can be different from the actual number of items of each type initially present in ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-14.png","element":"img","alt":" M2","inline":true},{"text":". Moreover, because misclassification of an item depends on the ratings of users for the item, the choice of next item type to be represented is therefore dependent on ratings of users for other types. The effect of this dependence is addressed in Claim ","element":"span"},{"href":"#id-67","text":"6.5 ","element":"a"},{"text":"below.","element":"span"}],[{"text":"We now proceed with the proof, bounding the size of ","element":"span"},{"style":{"height":14.69},"width":56.96,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-15.png","element":"img","alt":" Ru","inline":true,"padRight":true},{"text":"by introducing a different set. For","element":"span"}],[{"id":"id-69","style":{"width":"99%"},"width":1866,"height":168,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-16.png","element":"img"}],[{"text":"be the items in ","element":"span"},{"style":{"height":19.34},"width":69.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-17.png","element":"img","alt":" M10 ","inline":true,"padRight":true},{"text":"whose types are the same as one of the representatives i","element":"span"},{"style":{"height":10.8},"width":14,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-18.png","element":"img","alt":"j","inline":true,"padRight":true},{"text":"that are liked by ","element":"span"},{"text":"u","element":"span"},{"text":". ","element":"span"},{"text":"Note that if an item ","element":"span"},{"style":{"height":19.15},"width":128.48,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-19.png","element":"img","alt":" i ∈ �R1u ","inline":true,"padRight":true},{"text":"is correctly classified by the algorithm (i.e. (","element":"span"},{"style":{"height":19.73},"width":77.88,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-20.png","element":"img","alt":"E1i )c","inline":true,"padRight":true},{"text":"occurs, where ","element":"span"},{"style":{"height":19.73},"width":90.44,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-21.png","element":"img","alt":" E1i is","inline":true,"padRight":true},{"text":"defined in Equation ","element":"span"},{"href":"#id-63","text":"(14)","element":"a"},{"text":"), then ","element":"span"},{"style":{"height":19.83},"width":1271.36,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-22.png","element":"img","alt":" i ∈ Ru. Hence, i ∈ �R1u \\Ru implies E1i and since E1i ⊆Ei,Ui we have","inline":true}],[{"style":{"width":"74%"},"width":1386,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-23.png","element":"img"}],[{"text":"It follows that","element":"span"}],[{"id":"id-68","style":{"width":"96%"},"width":1809,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-24.png","element":"img"}],[{"text":"To bound the second term we use Claim ","element":"span"},{"href":"#id-56","text":"6.3 ","element":"a"},{"text":"above, which gives","element":"span"}],[{"style":{"width":"68%"},"width":1280,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-25.png","element":"img"}],[{"text":"and by Markov’s Inequality","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-26.png","element":"img","alt":"4","inline":true}],[{"id":"id-70","style":{"width":"99%"},"width":1868,"height":154,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/28-27.png","element":"img"}],[{"text":"We now bound the first term on the right-hand side of ","element":"span"},{"href":"#id-68","text":"(22)","element":"a"},{"text":". To this end, let","element":"span"}],[{"id":"id-73","style":{"width":"66%"},"width":1246,"height":60,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-0.png","element":"img"}],[{"text":"be the type of item representatives that are liked by user ","element":"span"},{"text":"u","element":"span"},{"text":". By definition of ","element":"span"},{"href":"#id-69","style":{"height":19.15},"width":213.6,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-1.png","element":"img","alt":"�R1u in (20),","inline":true}],[{"style":{"width":"27%"},"width":507,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-2.png","element":"img"}],[{"text":"Now, we claim that the set ","element":"span"},{"style":{"height":21.12},"width":70.12,"height":52.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-3.png","element":"img","alt":"�L(ℓ)u","inline":true,"padRight":true},{"text":"is independent of all types and preferences for items in ","element":"span"},{"style":{"height":19.54},"width":221.12,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-4.png","element":"img","alt":" M10. To see","inline":true,"padRight":true},{"text":"this, note that ","element":"span"},{"style":{"height":20.93},"width":70.12,"height":52.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-5.png","element":"img","alt":"�L(ℓ)u ","inline":true,"padRight":true},{"text":"is determined by row ","element":"span"},{"text":"u ","element":"span"},{"text":"of the type matrix Ξ and the items in ","element":"span"},{"style":{"height":19.34},"width":69.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-6.png","element":"img","alt":" M20","inline":true},{"text":", their types, and ","element":"span"},{"text":"randomness in the algorithm, which determines the choice of i","element":"span"},{"style":{"height":18.29},"width":189.68,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-7.png","element":"img","alt":"j and τI(ij","inline":true},{"text":"); these together determine ","element":"span"},{"style":{"height":20.58},"width":533.84,"height":51.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-8.png","element":"img","alt":"Lu,ij = ξτU(u),τI(ij) and τI(ij","inline":true},{"text":"). So, conditioning on ","element":"span"},{"style":{"height":20.93},"width":70.12,"height":52.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-9.png","element":"img","alt":"�L(ℓ)u","inline":true,"padRight":true},{"text":"having cardinality ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":19.13},"width":153.6,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-10.png","element":"img","alt":"ℓu, | �R1u|","inline":true,"padRight":true},{"text":"is the sum of ","element":"span"},{"style":{"height":20.77},"width":280.6,"height":51.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-11.png","element":"img","alt":"|M10| = 64qIℓ T","inline":true,"padRight":true},{"text":"i.i.d. Bernoulli variables with parameter ","element":"span"},{"style":{"height":28.56},"width":767.52,"height":71.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-12.png","element":"img","alt":"˜ℓuqI and hence | �R1u| ∼ Binom(64qIℓ T,˜ℓuqI ).","inline":true,"padRight":true},{"text":"Conditioning on ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":18.35},"width":136.52,"height":45.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-13.png","element":"img","alt":"ℓu ≥ ℓ20","inline":true},{"text":", by a Chernoff bound (Lemma ","element":"span"},{"href":"#id-37","text":"A.1) ","element":"a"},{"text":"and stochastic domination of binomials ","element":"span"},{"text":"with increasing number of trials we obtain","element":"span"}],[{"style":{"width":"38%"},"width":721,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-14.png","element":"img"}],[{"text":"In Lemma ","element":"span"},{"href":"#id-67","text":"6.5 ","element":"a"},{"text":"below, we will lower bound the probability that ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":17.6},"width":153.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-15.png","element":"img","alt":"ℓu ≥ ℓ/","inline":true},{"text":"20. Combining the last displayed inequality with Lemma ","element":"span"},{"href":"#id-67","text":"6.5, ","element":"a"},{"text":"we get","element":"span"}],[{"style":{"width":"62%"},"width":1165,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-16.png","element":"img"}],[{"text":"Plugging this and ","element":"span"},{"href":"#id-70","text":"(24) ","element":"a"},{"text":"into ","element":"span"},{"href":"#id-68","text":"(22) ","element":"a"},{"text":"gives","element":"span"}],[{"style":{"width":"68%"},"width":1281,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-17.png","element":"img"}],[{"text":"and since ","element":"span"},{"style":{"height":19.54},"width":661.16,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-18.png","element":"img","alt":" ℓ ≤ T, qI5 and ℓ ≥ 13 (for T ≥ 13)","inline":true}],[{"style":{"width":"34%"},"width":641,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-19.png","element":"img"}],[{"text":"The bound on ","element":"span"},{"style":{"height":17.6},"width":276.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-20.png","element":"img","alt":" E [(T − |Ru|)+","inline":true},{"text":"] is an immediate consequence.","element":"span"}],[{"id":"id-67","style":{"width":"96%"},"width":1804,"height":180,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-21.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"For a given user ","element":"span"},{"style":{"height":19.41},"width":89.6,"height":48.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-22.png","element":"img","alt":" u, ˜ℓu","inline":true,"padRight":true},{"text":"is the number of item type of representatives liked by user ","element":"span"},{"style":{"height":15.09},"width":184.16,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/29-23.png","element":"img","alt":" u. Let Lu","inline":true}],[{"text":"be the item types in [","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-0.png","element":"img","alt":"qI","inline":true},{"text":"] that are liked by user ","element":"span"},{"text":"u","element":"span"},{"text":":","element":"span"}],[{"style":{"width":"65%"},"width":1222,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-1.png","element":"img"}],[{"text":"The variables ","element":"span"},{"style":{"height":20.3},"width":265.8,"height":50.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-2.png","element":"img","alt":" {ξτU (u),j}j∈[qI]","inline":true,"padRight":true},{"text":"are i.i.d. Bern(1","element":"span"},{"text":"/","element":"span"},{"text":"2), so a Chernoff bound (Lemma ","element":"span"},{"href":"#id-37","text":"A.1) ","element":"a"},{"text":"gives","element":"span"}],[{"id":"id-71","style":{"width":"65%"},"width":1228,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-3.png","element":"img"}],[{"text":"We will show that","element":"span"}],[{"id":"id-74","style":{"width":"75%"},"width":1404,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-4.png","element":"img"}],[{"text":"which will prove the lemma by combining with ","element":"span"},{"href":"#id-71","text":"(27)","element":"a"},{"text":".","element":"span"}],[{"text":"We now work towards defining a certain error event in Equation ","element":"span"},{"href":"#id-72","text":"(31) ","element":"a"},{"text":"below. Let the sequence of random variables ","element":"span"},{"style":{"height":17.6},"width":742.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-5.png","element":"img","alt":" X1 = τI(i1), X2 = τI(i2), . . . , Xℓ = τI(iℓ","inline":true},{"text":") denote the types of the item representatives chosen by the algorithm, so that ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":27.2},"width":778.12,"height":68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-6.png","element":"img","alt":"ℓu = | �L(ℓ)u | = �j∈[ℓ] 1[Xj ∈ Lu] (with �L(ℓ)u","inline":true,"padRight":true},{"text":"defined in ","element":"span"},{"href":"#id-73","text":"(25)","element":"a"},{"text":"). Let ","element":"span"},{"text":"¯","element":"span"},{"style":{"height":19.13},"width":90.48,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-7.png","element":"img","alt":"R2(j","inline":true},{"text":") be the set of items in ","element":"span"},{"style":{"height":19.34},"width":251.76,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-8.png","element":"img","alt":" M20 of type j","inline":true}],[{"style":{"width":"64%"},"width":1211,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-9.png","element":"img"}],[{"text":"Later we will use the notation ","element":"span"},{"text":"¯","element":"span"},{"style":{"height":22.79},"width":384.2,"height":56.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-10.png","element":"img","alt":"R2(·) = { ¯R2(j)}qIj=1 ","inline":true,"padRight":true},{"text":"for the collection of these sets. Now let the ","element":"span"},{"text":"event ","element":"span"},{"style":{"height":19.54},"width":43.88,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-11.png","element":"img","alt":" E2i ","inline":true,"padRight":true},{"text":"denote misclassification of an item ","element":"span"},{"style":{"height":19.34},"width":137.48,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-12.png","element":"img","alt":" i ∈ M20 ","inline":true,"padRight":true},{"text":"(similar to event ","element":"span"},{"href":"#id-63","style":{"height":19.54},"width":438.72,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-13.png","element":"img","alt":" E1i for i ∈ M10 in (14)),","inline":true}],[{"id":"id-76","style":{"width":"66%"},"width":1240,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-14.png","element":"img"}],[{"text":"Let ","element":"span"},{"text":"Err ","element":"span"},{"text":"be the event that for some item type ","element":"span"},{"text":"j","element":"span"},{"text":", more than a fraction 1","element":"span"},{"text":"/","element":"span"},{"text":"10 of the items in ","element":"span"},{"style":{"height":19.34},"width":125.44,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-15.png","element":"img","alt":" M20 of","inline":true,"padRight":true},{"text":"type ","element":"span"},{"text":"j ","element":"span"},{"text":"are misclassified:","element":"span"}],[{"id":"id-72","style":{"width":"75%"},"width":1406,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-16.png","element":"img"}],[{"text":"Conditioning on ","element":"span"},{"text":"Err ","element":"span"},{"text":"in the left-hand side of ","element":"span"},{"href":"#id-74","text":"(28) ","element":"a"},{"text":"gives","element":"span"}],[{"id":"id-75","style":{"width":"87%"},"width":1637,"height":284,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-17.png","element":"img"}],[{"text":"Bound on second term of ","element":"span"},{"href":"#id-75","text":"(32) ","element":"a"},{"text":"We will use Markov’s inequality to bound","element":"span"}],[{"style":{"width":"60%"},"width":1132,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-18.png","element":"img"}],[{"text":"We need to consider the effect of matrix Ξ (and in particular its ","element":"span"},{"style":{"height":17.6},"width":88.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-19.png","element":"img","alt":" τU(u","inline":true},{"text":")th row, which determines ","element":"span"},{"style":{"height":17.6},"width":68.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/30-20.png","element":"img","alt":" Lu)","inline":true,"padRight":true},{"text":"on the probability of error in categorizing items. Notably, if users rate two item types similarly, the probability of misclassifying items of these types is higher. ","element":"span"},{"text":"As a result, ","element":"span"},{"style":{"height":15.09},"width":50.24,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-0.png","element":"img","alt":" Lu","inline":true,"padRight":true},{"text":"contains some information about discriminability of distinct item types and we need to control this dependence.","element":"span"}],[{"text":"Claim ","element":"span"},{"href":"#id-61","text":"6.2 ","element":"a"},{"text":"shows that for ","element":"span"},{"style":{"height":19.34},"width":137.48,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-1.png","element":"img","alt":" i ∈ M10","inline":true},{"text":", the probability of ","element":"span"},{"style":{"height":19.54},"width":43.88,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-2.png","element":"img","alt":" E1i ","inline":true,"padRight":true},{"text":"(the event that item ","element":"span"},{"text":"i ","element":"span"},{"text":"is miscategorized) ","element":"span"},{"text":"is ","element":"span"},{"style":{"height":19.73},"width":204.72,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-3.png","element":"img","alt":" P[E1i ] ≤ 2ǫ","inline":true},{"text":". The same proof gives ","element":"span"},{"style":{"height":19.73},"width":385.64,"height":49.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-4.png","element":"img","alt":" P[E2i ] ≤ 2ǫ (with E2i ","inline":true,"padRight":true},{"text":"defined in ","element":"span"},{"href":"#id-76","text":"(30) ","element":"a"},{"text":"for ","element":"span"},{"style":{"height":19.54},"width":368.96,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-5.png","element":"img","alt":" i ∈ M20). To show","inline":true,"padRight":true},{"text":"that, let the potential error event","element":"span"},{"style":{"height":17.49},"width":82.03,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-6.png","element":"img","alt":"Ei,Ui","inline":true},{"text":", be the event that there exists an item type ","element":"span"},{"style":{"height":17.6},"width":260.16,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-7.png","element":"img","alt":" j ̸= τI(i) such","inline":true,"padRight":true},{"text":"that for all the users ","element":"span"},{"style":{"height":22.13},"width":1453.16,"height":55.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-8.png","element":"img","alt":" u in Ui, we have Lu,i = ξτU(u),j. For i ∈ S2j , let Ui = rated(i) ∩ rated(ij) at","inline":true,"padRight":true},{"text":"the time the algorithm added ","element":"span"},{"style":{"height":21.93},"width":132.68,"height":54.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-9.png","element":"img","alt":" i to S2j ","inline":true,"padRight":true},{"text":". Then the containment ","element":"span"},{"style":{"height":19.82},"width":191.48,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-10.png","element":"img","alt":" E2i ⊆Ei,Ui","inline":true,"padRight":true},{"text":"holds. We will later use the inequality ","element":"span"},{"style":{"height":19.9},"width":858.56,"height":49.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-11.png","element":"img","alt":" P[E2i |τI(i), ¯R2(·), Lu] ≤ P[Ei,Ui|τI(i), ¯R2(·), Lu","inline":true},{"text":"] and focus on the set of users ","element":"span"},{"style":{"height":17.6},"width":157.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-12.png","element":"img","alt":" Ui \\ {u},","inline":true,"padRight":true},{"text":"using","element":"span"},{"style":{"height":19.05},"width":307.68,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-13.png","element":"img","alt":"Ei,Ui ⊆Ei,Ui\\{u}.","inline":true}],[{"text":"As specified in Algorithm ","element":"span"},{"style":{"height":19.54},"width":939.36,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-14.png","element":"img","alt":" ItemExplore, for each i ∈ M10 ∪ M20 the set Ui","inline":true,"padRight":true},{"text":"of at least ","element":"span"},{"text":"r ","element":"span"},{"text":"users, ","element":"span"},{"text":"chosen independently of feedback, rate item ","element":"span"},{"text":"i","element":"span"},{"text":". By Lemma ","element":"span"},{"href":"#id-36","text":"A.3, ","element":"a"},{"text":"if ","element":"span"},{"style":{"height":16.61},"width":147.96,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-15.png","element":"img","alt":" qU > 4r","inline":true},{"text":", then there are users of at least ","element":"span"},{"text":"r","element":"span"},{"text":"/","element":"span"},{"text":"2 distinct user types in ","element":"span"},{"style":{"height":14.69},"width":39.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-16.png","element":"img","alt":" Ui","inline":true,"padRight":true},{"text":"with probability at least 1","element":"span"},{"style":{"height":18},"width":458.4,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-17.png","element":"img","alt":"−exp(−r/2) ≥ 1−ǫ/qI .","inline":true,"padRight":true},{"text":"Conditional on ","element":"span"},{"style":{"height":14.69},"width":39.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-18.png","element":"img","alt":" Ui","inline":true,"padRight":true},{"text":"having at least ","element":"span"},{"text":"r","element":"span"},{"text":"/","element":"span"},{"text":"2 distinct user types, there are at least ","element":"span"},{"style":{"height":17.6},"width":104.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-19.png","element":"img","alt":" r/2 −","inline":true,"padRight":true},{"text":"1 users of distinct types in ","element":"span"},{"style":{"height":17.6},"width":144.4,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-20.png","element":"img","alt":"Ui \\{u}","inline":true},{"text":"—also distinct from type of ","element":"span"},{"text":"u","element":"span"},{"text":"—whose preferences for item type ","element":"span"},{"style":{"height":17.6},"width":70.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-21.png","element":"img","alt":" τI(i","inline":true},{"text":") are independent of ","element":"span"},{"style":{"height":15.09},"width":50.24,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-22.png","element":"img","alt":" Lu","inline":true,"padRight":true},{"text":"and ","element":"span"},{"text":"¯","element":"span"},{"style":{"height":19.14},"width":113.28,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-23.png","element":"img","alt":"R2(·).","inline":true}],[{"text":"Conditional on ","element":"span"},{"style":{"height":19.22},"width":277.76,"height":48.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-24.png","element":"img","alt":" τI(i), ¯R2(·), Lu","inline":true},{"text":", any two item types ","element":"span"},{"style":{"height":16.8},"width":110.08,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-25.png","element":"img","alt":" j ̸= j′ ","inline":true,"padRight":true},{"text":"have jointly independent user preferences by ","element":"span"},{"style":{"height":17.6},"width":107.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-26.png","element":"img","alt":" r/2 −","inline":true,"padRight":true},{"text":"1 users with distinct types in ","element":"span"},{"style":{"height":17.6},"width":160.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-27.png","element":"img","alt":" Ui \\ {u}","inline":true},{"text":"; they are rated in the same way by these users with probability at most 2","element":"span"},{"style":{"height":12.8},"width":141.16,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-28.png","element":"img","alt":"−(r/2−1)","inline":true},{"text":". A union bound over item types ","element":"span"},{"style":{"height":17.6},"width":294.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-29.png","element":"img","alt":" j′ ̸= τI(i) gives","inline":true},{"style":{"height":22.59},"width":770.44,"height":56.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-30.png","element":"img","alt":"P[Ei,Ui\\{u}|τI(i), ¯R2(·), Lu] ≤ qI2−(r/2−1)","inline":true},{"text":". The choice ","element":"span"},{"style":{"height":24.91},"width":562.8,"height":62.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-31.png","element":"img","alt":" r > 2 log(qI/ǫ) and ǫ = 12qIN ","inline":true,"padRight":true},{"text":"and the con- ","element":"span"},{"text":"tainment ","element":"span"},{"style":{"height":19.83},"width":243.76,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-32.png","element":"img","alt":" E2i ⊆Ei,Ui ⊆","inline":true}],[{"style":{"height":24.91},"width":1098.92,"height":62.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-33.png","element":"img","alt":"Ei,Ui\\{u} gives P[E2i |τI(i), ¯R2(·), Lu] ≤ 1qIN for all i ∈ M20","inline":true},{"text":". Knowing ¯","element":"span"},{"style":{"height":19.14},"width":101.48,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-34.png","element":"img","alt":"R2(·)","inline":true,"padRight":true},{"text":"determines ","element":"span"},{"style":{"height":24.91},"width":890.64,"height":62.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-35.png","element":"img","alt":" τI(i) for i ∈ M20. Hence, P[E2i | ¯R2(·), Lu] ≤ 1qIN ","inline":true,"padRight":true},{"text":"and Markov’s inequality gives","element":"span"}],[{"style":{"width":"46%"},"width":877,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-36.png","element":"img"}],[{"text":"Union bounding over ","element":"span"},{"style":{"height":17.81},"width":118.76,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-37.png","element":"img","alt":" j ∈ [qI","inline":true},{"text":"] and tower property gives the desired bound.","element":"span"}],[{"text":"Bound on first term of ","element":"span"},{"href":"#id-75","text":"(32)","element":"a"},{"text":". ","element":"span"},{"text":"We start by showing that conditioned on the event ","element":"span"},{"style":{"height":12.7},"width":71.64,"height":31.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-38.png","element":"img","alt":" Errc ","inline":true,"padRight":true},{"text":"conditional on ","element":"span"},{"style":{"height":15.09},"width":50.24,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-39.png","element":"img","alt":" Lu","inline":true,"padRight":true},{"text":"and variables ","element":"span"},{"style":{"height":19.22},"width":803.72,"height":48.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-40.png","element":"img","alt":" X1, · · · , Xm−1 and ¯R2(X1), · · · , ¯R2(Xm−1","inline":true},{"text":"), the type of the ","element":"span"},{"text":"m","element":"span"},{"text":"-th representative item, ","element":"span"},{"style":{"height":14.69},"width":66,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-41.png","element":"img","alt":" Xm","inline":true},{"text":", is almost uniform over all the item types not learned yet, [","element":"span"},{"style":{"height":17.81},"width":424.8,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-42.png","element":"img","alt":"qI] \\ {X1, · · · , Xm−1}.","inline":true,"padRight":true},{"text":"Concretely, we will find upper and lower bounds on","element":"span"}],[{"id":"id-77","style":{"width":"73%"},"width":1382,"height":64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-43.png","element":"img"}],[{"text":"for any ","element":"span"},{"style":{"height":18},"width":488.56,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-44.png","element":"img","alt":" j ∈ [qI] \\ {X1, · · · , Xm−1}","inline":true},{"text":". Later, we will focus on ","element":"span"},{"style":{"height":18},"width":490.56,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-45.png","element":"img","alt":" Lu such that |Lu| ≥ qI/4.","inline":true}],[{"text":"Lower bound for ","element":"span"},{"href":"#id-77","text":"(33)","element":"a"},{"text":". ","element":"span"},{"text":"Let ","element":"span"},{"text":"t ","element":"span"},{"text":"be the time the ","element":"span"},{"text":"m","element":"span"},{"text":"-th item type representative is chosen by the algorithm. For ","element":"span"},{"style":{"height":20.69},"width":366.44,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-46.png","element":"img","alt":" j ∈ [qI] \\ {Xn}m−1n=1 ","inline":true,"padRight":true},{"text":"the probability of choosing representative of type ","element":"span"},{"style":{"height":16},"width":198.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-47.png","element":"img","alt":" Xm = j is","inline":true,"padRight":true},{"text":"equal to the proportion of items of type ","element":"span"},{"text":"j ","element":"span"},{"text":"in the remaining items in ","element":"span"},{"style":{"height":15.93},"width":340.8,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-48.png","element":"img","alt":" M2 at time t − 1 .","inline":true,"padRight":true},{"text":"The number of items of type ","element":"span"},{"style":{"height":18.73},"width":376.72,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-49.png","element":"img","alt":" j in M2 at time t −","inline":true,"padRight":true},{"text":"1 is at least ","element":"span"},{"style":{"height":23.2},"width":461.28,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-50.png","element":"img","alt":" | ¯R2(j)| − �i∈ ¯R2(j) 1[E2i ].","inline":true}],[{"text":"The number of items removed from ","element":"span"},{"style":{"height":18.73},"width":273.28,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-51.png","element":"img","alt":" M2 by time t","inline":true,"padRight":true},{"text":"is at least ","element":"span"},{"style":{"height":21.65},"width":272.16,"height":54.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/31-52.png","element":"img","alt":"�m−1l=1 | ¯R(Xl)|","inline":true,"padRight":true},{"text":"(there could be","element":"span"}],[{"text":"additional items with types not in ","element":"span"},{"style":{"height":20.69},"width":175.4,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-0.png","element":"img","alt":" {Xn}m−1n=1 ","inline":true,"padRight":true},{"text":"that were removed from ","element":"span"},{"style":{"height":15.94},"width":69.32,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-1.png","element":"img","alt":" M2","inline":true,"padRight":true},{"text":"due to misclassification). ","element":"span"},{"text":"Hence, the total number of remaining items in ","element":"span"},{"style":{"height":18.74},"width":260.8,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-2.png","element":"img","alt":" M2 by time t","inline":true,"padRight":true},{"text":"is at most ","element":"span"},{"style":{"height":21.65},"width":495.08,"height":54.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-3.png","element":"img","alt":" M − �m−1n=1 | ¯R2(Xn)|. Let","inline":true},{"style":{"height":15.84},"width":65.4,"height":39.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-4.png","element":"img","alt":"ZE2","inline":true,"padRight":true},{"text":"be the collection of indicator variables ","element":"span"},{"style":{"height":24.34},"width":916.8,"height":60.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-5.png","element":"img","alt":" ZE2 = {1[E2i ]}i∈M20. For any j ∈ [qI] \\ {Xn}m−1n=1 :","inline":true}],[{"style":{"width":"73%"},"width":1377,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-6.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":12.7},"width":71.64,"height":31.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-7.png","element":"img","alt":" Errc ","inline":true,"padRight":true},{"text":"(defined in ","element":"span"},{"href":"#id-72","text":"(31)","element":"a"},{"text":"), ","element":"span"},{"style":{"height":24.08},"width":1187.64,"height":60.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-8.png","element":"img","alt":"�i∈ ¯R2(j) 1[E2i ] ≤ | ¯R2(j)|/10 for any j ∈ [qI] \\ {Xn}m−1n=1 . There-","inline":true,"padRight":true},{"text":"fore,","element":"span"}],[{"id":"id-78","style":{"width":"87%"},"width":1630,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-9.png","element":"img"}],[{"text":"Upper bound for ","element":"span"},{"href":"#id-77","text":"(33)","element":"a"},{"text":". ","element":"span"},{"text":"This time we lower bound the number of items remaining in ","element":"span"},{"style":{"height":18.74},"width":213.92,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-10.png","element":"img","alt":" M2 by the","inline":true,"padRight":true},{"text":"number of items in types [","element":"span"},{"style":{"height":20.69},"width":274.76,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-11.png","element":"img","alt":"qI] \\ {Xn}m−1n=1 ","inline":true,"padRight":true},{"text":"minus the number of possible mistakes which removed ","element":"span"},{"text":"some items from them. This gives","element":"span"}],[{"style":{"width":"93%"},"width":1755,"height":187,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-12.png","element":"img"}],[{"text":"By definition of ","element":"span"},{"text":"Err ","element":"span"},{"text":"(in ","element":"span"},{"href":"#id-72","text":"(31)","element":"a"},{"text":"), we have","element":"span"}],[{"id":"id-79","style":{"width":"86%"},"width":1626,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-13.png","element":"img"}],[{"text":"Combining lower and upper bounds. ","element":"span"},{"text":"The expected value of ","element":"span"},{"style":{"height":19.22},"width":127.2,"height":48.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-14.png","element":"img","alt":" | ¯R2(j)|","inline":true,"padRight":true},{"text":"conditional on variables ","element":"span"},{"style":{"height":20.69},"width":189.6,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-15.png","element":"img","alt":" {Xn}m−1n=1 ,","inline":true},{"style":{"height":20.88},"width":428,"height":52.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-16.png","element":"img","alt":"{ ¯R2(Xn)}m−1n=1 , and Lu","inline":true,"padRight":true},{"text":"is invariant to choice of ","element":"span"},{"style":{"height":20.88},"width":346.76,"height":52.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-17.png","element":"img","alt":" j ∈ [qI] \\ {Xn}m−1n=1 ","inline":true,"padRight":true},{"text":". So, the conditional expectation","element":"span"}],[{"style":{"width":"67%"},"width":1263,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-18.png","element":"img"}],[{"text":"is independent of ","element":"span"},{"style":{"height":20.69},"width":418.28,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-19.png","element":"img","alt":" j for j ∈ [qI]\\{Xn}m−1n=1 ","inline":true,"padRight":true},{"text":". Note that in the above display, the conditional expectation ","element":"span"},{"text":"is with respect to ","element":"span"},{"text":"¯","element":"span"},{"style":{"height":19.14},"width":514,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-20.png","element":"img","alt":"R2(Xn) for n = 1, · · · , m−","inline":true},{"text":"1. Hence, using tower property of expectation on ","element":"span"},{"href":"#id-78","text":"(34) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-79","text":"(35) ","element":"a"},{"text":"to remove the conditioning on ","element":"span"},{"style":{"height":24.8},"width":318.84,"height":62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-21.png","element":"img","alt":" { ¯R(j)}j /∈{Xn}m−1n=1 ","inline":true,"padRight":true},{"text":"along with the definition of ","element":"span"},{"text":"C ","element":"span"},{"text":"above gives","element":"span"}],[{"style":{"width":"63%"},"width":1182,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-22.png","element":"img"}],[{"text":"Since there are ","element":"span"},{"style":{"height":20.69},"width":726.44,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-23.png","element":"img","alt":" qI − (m − 1) types j ∈ [qI] \\ {Xn}m−1n=1 ","inline":true,"padRight":true},{"text":", summing over ","element":"span"},{"text":"j ","element":"span"},{"text":"in the second inequality of ","element":"span"},{"text":"the last display gives ","element":"span"},{"style":{"height":25.1},"width":309.64,"height":62.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/32-24.png","element":"img","alt":" C ≥ 910 1qI−(m−1)","inline":true},{"text":". Plugging this into the first inequality of the last display","element":"span"}],[{"text":"gives for all ","element":"span"},{"style":{"height":20.69},"width":363.84,"height":51.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-0.png","element":"img","alt":" j ∈ [qI] \\ {Xn}m−1n=1 ,","inline":true}],[{"style":{"width":"64%"},"width":1203,"height":101,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-1.png","element":"img"}],[{"text":"Conditional on ","element":"span"},{"style":{"height":17.81},"width":1292.96,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-2.png","element":"img","alt":" Lu such that |Lu| ≥ qI/4, for s ≤ m ≤ ℓ ≤ qI and s ≤ ℓ/20 we have","inline":true}],[{"style":{"width":"96%"},"width":1809,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-3.png","element":"img"}],[{"text":"So, on the event ","element":"span"},{"style":{"height":19.34},"width":430.76,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-4.png","element":"img","alt":"�ℓn=1 1[Xn ∈ Lu] ≤ ℓ20","inline":true},{"text":", the random variable ","element":"span"},{"style":{"height":18.38},"width":320.48,"height":45.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-5.png","element":"img","alt":" �ℓn=1 1[Xn ∈ Lu","inline":true},{"text":"] conditional on ","element":"span"},{"style":{"height":15.09},"width":50.24,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-6.png","element":"img","alt":" Lu","inline":true,"padRight":true},{"text":"and events ","element":"span"},{"style":{"height":17.6},"width":380.28,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-7.png","element":"img","alt":" |Lu| ≥ qI/4 and Errc","inline":true},{"text":", stochastically dominates a Binomial random variable with mean ","element":"span"},{"style":{"height":17.6},"width":40.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-8.png","element":"img","alt":"ℓ/","inline":true},{"text":"12. Hence, by a Chernoff bound,","element":"span"}],[{"style":{"width":"81%"},"width":1517,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-9.png","element":"img"}]]},{"heading":"7 Item structure only: lower bound","paragraphs":[[{"text":"In this section we prove a lower bound on the regret of any online recommendation system in the regime with item structure only where ","element":"span"},{"style":{"height":16.8},"width":159,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-10.png","element":"img","alt":" qU = N","inline":true,"padRight":true},{"text":"as described in Definition ","element":"span"},{"href":"#id-31","text":"2.2. ","element":"a"},{"text":"Throughout this section, we will assume ","element":"span"},{"text":"N > ","element":"span"},{"text":"32.","element":"span"}],[{"id":"id-25","style":{"height":17.81},"width":1224.6,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-11.png","element":"img","alt":"Theorem 7.1. Let r = ⌊.8 log qI − 4 log log N⌋ and η = 1/ log N","inline":true},{"text":". In the item structure model with ","element":"span"},{"style":{"height":17.2},"width":293.48,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-12.png","element":"img","alt":"N users and qI","inline":true,"padRight":true},{"text":"item types, any recommendation algorithm must incur regret","element":"span"}],[{"id":"id-102","style":{"width":"99%"},"width":1864,"height":668,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-13.png","element":"img"}],[{"text":"The assumption ","element":"span"},{"text":"N > ","element":"span"},{"text":"32 implies ","element":"span"},{"style":{"height":17.6},"width":130.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-14.png","element":"img","alt":" η < 1/","inline":true},{"text":"5. Note that the function ","element":"span"},{"text":"Z","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":") is continuous up to a multiplicative constant factor","element":"span"},{"style":{"height":14.74},"width":31.2,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/33-15.png","element":"img","alt":"6.","inline":true}],[{"text":"To get the simplified version in Section ","element":"span"},{"text":"3, ","element":"span"},{"text":"we bounded the value of ","element":"span"},{"text":"Z","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":") in the second (in which ","element":"span"},{"style":{"height":28.37},"width":615.92,"height":70.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-0.png","element":"img","alt":"2√qI3N ≤ T < 4qI log qIN ) by T8 log qI ","inline":true,"padRight":true},{"text":". Also, the assumption ","element":"span"},{"style":{"height":19.54},"width":319.4,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-1.png","element":"img","alt":" qI > 25 (log N)5","inline":true,"padRight":true},{"text":"guarantees that","element":"span"}],[{"style":{"width":"99%"},"width":1864,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-2.png","element":"img"}],[{"text":"7.1 ","element":"span"},{"text":"Proof strategy","element":"span"}],[{"text":"We call a recommendation ","element":"span"},{"style":{"height":17.49},"width":587.08,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-3.png","element":"img","alt":" au,t to user u at time t a bad","inline":true,"padRight":true},{"text":"(or uncertain) recommendation when the probability of ","element":"span"},{"style":{"height":19.43},"width":1228.72,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-4.png","element":"img","alt":" Lu,au,t = +1 given the history is close to (or smaller than) 1/","inline":true},{"text":"2. Conversely, recommendations for which the probability of ","element":"span"},{"style":{"height":19.43},"width":952.8,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-5.png","element":"img","alt":" Lu,au,t = +1 is close to one (much greater than","inline":true,"padRight":true},{"text":"1","element":"span"},{"text":"/","element":"span"},{"text":"2) are considered ","element":"span"},{"text":"good ","element":"span"},{"text":"recommendations. Good and bad refers only to the confidence that the recommendation is liked given the history at the moment the recommendation is made: a good recommendation is not always liked and a bad recommendation is not always disliked.","element":"span"}],[{"text":"We identify two scenarios in which recommendations are necessarily bad in the item structure only model introduced in Definition ","element":"span"},{"href":"#id-31","text":"2.2: ","element":"a"},{"text":"(i) A good estimate of item types is necessary in order to make meaningful recommendations. To determine whether or not two items are of the same type, approximately log ","element":"span"},{"style":{"height":12.4},"width":40.04,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-6.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"users should rate both of them. To formalize this, similar to the lower bound for the model with user structure only in Section ","element":"span"},{"href":"#id-12","text":"5, ","element":"a"},{"text":"we use the concept of (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-7.png","element":"img","alt":"r, η","inline":true},{"text":")-row regularity (Definition ","element":"span"},{"href":"#id-80","text":"7.1)","element":"a"},{"text":". Lemma ","element":"span"},{"href":"#id-81","text":"7.3 ","element":"a"},{"text":"shows that for a (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-8.png","element":"img","alt":"r, η","inline":true},{"text":")-row regular preference matrix, items with fewer than ","element":"span"},{"text":"r ","element":"span"},{"text":"ratings are liked by any user with probability roughly half, even if the preference matrix is known. (ii) Even when we know the item types (i.e., clustering of items), if a given user ","element":"span"},{"text":"u ","element":"span"},{"text":"has not rated any item with the same type as item ","element":"span"},{"text":"i ","element":"span"},{"text":"before, the probability that user ","element":"span"},{"text":"u ","element":"span"},{"text":"likes item ","element":"span"},{"text":"i ","element":"span"},{"text":"is 1","element":"span"},{"text":"/","element":"span"},{"text":"2. Lemma ","element":"span"},{"href":"#id-82","text":"7.4 ","element":"a"},{"text":"shows this property.","element":"span"}],[{"text":"In Lemma ","element":"span"},{"href":"#id-83","text":"7.5 ","element":"a"},{"text":"we bound regret in terms of the number of good recommendations and in Lemma ","element":"span"},{"href":"#id-84","text":"7.6 ","element":"a"},{"text":"we upper bound the number of good recommendations, which entails a counting argument. The theorem follows immediately from Lemmas ","element":"span"},{"href":"#id-83","text":"7.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-84","text":"7.6.","element":"a"}],[{"text":"7.2 ","element":"span"},{"text":"Proof of Theorem ","element":"span"},{"href":"#id-25","text":"7.1","element":"a"}],[{"id":"id-80","text":"Definition 7.1 ","element":"span"},{"text":"(Row-regularity)","element":"span"},{"text":". ","element":"span"},{"text":"The matrix ","element":"span"},{"style":{"height":17.6},"width":339.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-9.png","element":"img","alt":" A ∈ {−1, +1}n×m ","inline":true,"padRight":true},{"text":"is said to be (","element":"span"},{"style":{"height":17.6},"width":350.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-10.png","element":"img","alt":"r, η)-row regular if","inline":true,"padRight":true},{"text":"its transpose is (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-11.png","element":"img","alt":"r, η","inline":true},{"text":")-column regular (Definition ","element":"span"},{"href":"#id-49","text":"5.1)","element":"a"},{"text":". We write ","element":"span"},{"style":{"height":18.29},"width":411.08,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-12.png","element":"img","alt":" A⊤ ∈ Ωr,η, where Ωr,η","inline":true,"padRight":true},{"text":"is the set of (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-13.png","element":"img","alt":"r, η","inline":true},{"text":")-column regular matrices.","element":"span"}],[{"text":"The following lemma is an immediate corollary of Lemma ","element":"span"},{"href":"#id-51","text":"5.3.","element":"a"}],[{"id":"id-85","text":"Lemma 7.2. ","element":"span"},{"text":"Let matrix ","element":"span"},{"style":{"height":17.6},"width":350.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-14.png","element":"img","alt":" A ∈ {−1, +1}n×m ","inline":true,"padRight":true},{"text":"have i.i.d. ","element":"span"},{"text":"Bern(1","element":"span"},{"text":"/","element":"span"},{"text":"2) ","element":"span"},{"text":"entries. If ","element":"span"},{"style":{"height":16.8},"width":336.28,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-15.png","element":"img","alt":" η < 1, then A is","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":17.6},"width":74.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-16.png","element":"img","alt":"r, η)","inline":true},{"text":"-row regular with probability at least","element":"span"}],[{"style":{"width":"26%"},"width":492,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-17.png","element":"img"}],[{"text":"Throughout this section we will fix","element":"span"}],[{"id":"id-89","style":{"width":"73%"},"width":1379,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/34-18.png","element":"img"}],[{"text":"Applying Lemma ","element":"span"},{"href":"#id-85","text":"7.2 ","element":"a"},{"text":"with these choices of ","element":"span"},{"style":{"height":16.4},"width":132.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-0.png","element":"img","alt":" r and η","inline":true},{"text":", we obtain that so long as ","element":"span"},{"text":"N > ","element":"span"},{"text":"32 the preference matrix Ξ is (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-1.png","element":"img","alt":"r, η","inline":true},{"text":") row-regular (","element":"span"},{"style":{"height":17.89},"width":269.48,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-2.png","element":"img","alt":"i.e., Ξ⊤ ∈ Ωr,η","inline":true},{"text":") with probability","element":"span"}],[{"id":"id-87","style":{"width":"96%"},"width":1801,"height":383,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-3.png","element":"img"}],[{"text":"The following lemma shows that if ","element":"span"},{"style":{"height":19.33},"width":30.72,"height":48.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-4.png","element":"img","alt":" cti ","inline":true,"padRight":true},{"text":"is small and the preference matrix is row-regular, then the ","element":"span"},{"text":"outcome of recommending item ","element":"span"},{"text":"i ","element":"span"},{"text":"to any user at time ","element":"span"},{"text":"t ","element":"span"},{"text":"is uncertain.","element":"span"}],[{"id":"id-81","style":{"width":"90%"},"width":1687,"height":188,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-5.png","element":"img"}],[{"text":"The next lemma shows that if a user ","element":"span"},{"text":"u ","element":"span"},{"text":"has not rated any item with the same type as item ","element":"span"},{"text":"i ","element":"span"},{"text":"before, the probability that ","element":"span"},{"text":"u ","element":"span"},{"text":"likes item ","element":"span"},{"text":"i ","element":"span"},{"text":"is 1/2. Let ","element":"span"},{"style":{"height":23.68},"width":127.24,"height":59.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-6.png","element":"img","alt":" Btu,τI(i) ","inline":true,"padRight":true},{"text":"denote the event that user ","element":"span"},{"text":"u ","element":"span"},{"text":"has rated ","element":"span"},{"text":"an item of the same type as item ","element":"span"},{"style":{"height":16.4},"width":379.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-7.png","element":"img","alt":" i by time t − 1, i.e.,","inline":true}],[{"id":"id-82","style":{"width":"99%"},"width":1866,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-8.png","element":"img"}],[{"text":"Lemmas ","element":"span"},{"href":"#id-81","text":"7.3 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-82","text":"7.4 ","element":"a"},{"text":"(proved in Section ","element":"span"},{"href":"#id-86","text":"7.3) ","element":"a"},{"text":"identify scenarios in which recommendations are bad. In the complementary scenario, recommendations are not necessarily bad (and may be good). We denote the number of such recommendations by","element":"span"}],[{"id":"id-90","style":{"width":"69%"},"width":1301,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-9.png","element":"img"}],[{"text":"The proof of the following lemma uses Lemmas ","element":"span"},{"href":"#id-81","text":"7.3 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-82","text":"7.4 ","element":"a"},{"text":"to lower bound regret in terms of expectation of ","element":"span"},{"text":"good","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":").","element":"span"}],[{"id":"id-83","style":{"width":"75%"},"width":1407,"height":189,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/35-10.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"We partition the liked recommendations based on ","element":"span"},{"style":{"height":23.83},"width":279.88,"height":59.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-0.png","element":"img","alt":" ctau,t, Btu,τI(au,t)","inline":true},{"text":", and row regularity of Ξ:","element":"span"}],[{"style":{"width":"97%"},"width":1821,"height":494,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-1.png","element":"img"}],[{"text":"The proof is obtained by plugging in the four bounds below and simplifying.","element":"span"}],[{"text":"Bounding ","element":"span"},{"text":"A1","element":"span"},{"text":". ","element":"span"},{"text":"Plugging in the probability of Ξ being row regular from ","element":"span"},{"href":"#id-87","text":"(38) ","element":"a"},{"text":"gives","element":"span"}],[{"style":{"width":"72%"},"width":1361,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-2.png","element":"img"}],[{"text":"Bounding ","element":"span"},{"text":"A2","element":"span"},{"text":". ","element":"span"},{"text":"Multiplying the statement of Lemma ","element":"span"},{"href":"#id-81","text":"7.3 ","element":"a"},{"text":"by ","element":"span"},{"style":{"height":20.63},"width":652.32,"height":51.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-3.png","element":"img","alt":" P[au,t = i, cti < r, ΞT ∈ Ωr,η] and","inline":true,"padRight":true},{"text":"summing over ","element":"span"},{"text":"i ","element":"span"},{"text":"gives","element":"span"}],[{"style":{"width":"67%"},"width":1255,"height":287,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-4.png","element":"img"}],[{"text":"Bounding ","element":"span"},{"text":"A3","element":"span"},{"text":". ","element":"span"},{"text":"Lemma ","element":"span"},{"href":"#id-82","text":"7.4 ","element":"a"},{"text":"gives","element":"span"}],[{"style":{"width":"74%"},"width":1399,"height":423,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-5.png","element":"img"}],[{"text":"Bounding ","element":"span"},{"text":"A4","element":"span"},{"text":". ","element":"span"},{"text":"We bound by one the probability that a good recommendation is liked to obtain","element":"span"}],[{"style":{"width":"104%"},"width":1957,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/36-6.png","element":"img"}],[{"text":"Next, we upper bound the expected number of ","element":"span"},{"text":"good ","element":"span"},{"text":"recommendations made by the algorithm in terms of parameters of the model.","element":"span"}],[{"id":"id-84","text":"Lemma 7.6 ","element":"span"},{"text":"(Upper Bound for expected number of good recommendations)","element":"span"},{"text":". ","element":"span"},{"text":"For any algorithm,","element":"span"}],[{"style":{"width":"73%"},"width":1368,"height":206,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-0.png","element":"img"}],[{"text":"We prove this lemma in the next subsection. Theorem ","element":"span"},{"href":"#id-25","text":"7.1 ","element":"a"},{"text":"is an immediate consequence of Lemmas ","element":"span"},{"href":"#id-83","text":"7.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-84","text":"7.6.","element":"a"}],[{"id":"id-86","text":"7.3 ","element":"span"},{"text":"Proofs of lemmas","element":"span"}],[{"text":"7.3.1 ","element":"span"},{"text":"Proof of Lemma ","element":"span"},{"href":"#id-81","text":"7.3","element":"a"}],[{"text":"We show that if an item has been rated by fewer than ","element":"span"},{"text":"r ","element":"span"},{"text":"users, its type is uncertain, because many item types are consistent with the history even if the preference matrix is known. For a row-regular preference matrix, uncertainty in the type of an item makes it impossible to accurately predict user preferences for that item.","element":"span"}],[{"text":"Consider item ","element":"span"},{"style":{"height":18.29},"width":862.96,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-1.png","element":"img","alt":" i at time t. Let w = {u ∈ [N] : au,s = i, s < t}","inline":true,"padRight":true},{"text":"be the ordered tuple corresponding to the set of users that were recommended item ","element":"span"},{"text":"i ","element":"span"},{"text":"up to time ","element":"span"},{"style":{"height":18.29},"width":682.88,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-2.png","element":"img","alt":" t − 1 , and let b = {Lu,i}u∈w be the","inline":true,"padRight":true},{"text":"vector of feedback from users in ","element":"span"},{"text":"w ","element":"span"},{"text":"about item ","element":"span"},{"style":{"height":19.33},"width":664.8,"height":48.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-3.png","element":"img","alt":" i . Note that cti < r implies |w| < r .","inline":true,"padRight":true},{"text":"We re-introduce ","element":"span"},{"text":"the notation from Definition ","element":"span"},{"href":"#id-49","text":"5.1: ","element":"a"},{"text":"if ","element":"span"},{"text":"M ","element":"span"},{"text":"is the matrix obtained by concatenating the rows of Ξ indexed by ","element":"span"},{"style":{"height":18.48},"width":311.68,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-4.png","element":"img","alt":" w, then Kb,w(Ξ⊤","inline":true},{"text":") is the set of columns of ","element":"span"},{"text":"M ","element":"span"},{"text":"(corresponding to the item types) equal to ","element":"span"},{"text":"b","element":"span"},{"text":". This is the set of item types consistent with the ratings ","element":"span"},{"text":"b ","element":"span"},{"text":"of users ","element":"span"},{"text":"w ","element":"span"},{"text":"for item ","element":"span"},{"text":"i","element":"span"},{"text":".","element":"span"}],[{"text":"Conditional on Ξ, ","element":"span"},{"style":{"height":17.6},"width":674.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-5.png","element":"img","alt":" w, and b, the type τI(i) of item i","inline":true,"padRight":true},{"text":"at the end of time ","element":"span"},{"style":{"height":11.6},"width":63.28,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-6.png","element":"img","alt":" t −","inline":true,"padRight":true},{"text":"1 is uniformly distributed over the set of item types ","element":"span"},{"style":{"height":18.48},"width":150.4,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-7.png","element":"img","alt":" Kb,w(Ξ⊤","inline":true},{"text":"). This allows us to relate the posterior probability of ","element":"span"},{"text":"i ","element":"span"},{"text":"being liked to row regularity of Ξ as follows. Let ","element":"span"},{"style":{"height":20.34},"width":497,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-8.png","element":"img","alt":" b+ = [b 1] ∈ {−1, +1}|w|+1 ","inline":true,"padRight":true},{"text":"be obtained from ","element":"span"},{"text":"b ","element":"span"},{"text":"by appending +1 ","element":"span"},{"text":". ","element":"span"},{"text":"For a given user ","element":"span"},{"style":{"height":20.05},"width":1210.08,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-9.png","element":"img","alt":" u /∈ w , we have Lu,i = +1 precisely when τI(i) ∈ Kb+,{w,u}(Ξ⊤) ,","inline":true,"padRight":true},{"text":"which in words reads “item ","element":"span"},{"text":"i ","element":"span"},{"text":"is among those types that are consistent with the ratings of ","element":"span"},{"text":"i ","element":"span"},{"text":"up to time ","element":"span"},{"style":{"height":11.6},"width":54.16,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-10.png","element":"img","alt":" t−","inline":true},{"text":"1 and have preference vector with ‘+1’ for user u”. It follows that for any preference matrix Ξ and any user ","element":"span"},{"text":"u ","element":"span"},{"text":"which has not rated ","element":"span"},{"text":"i ","element":"span"},{"text":"up to time ","element":"span"},{"style":{"height":14.8},"width":109.92,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-11.png","element":"img","alt":" t − 1 ,","inline":true}],[{"id":"id-88","style":{"width":"88%"},"width":1661,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-12.png","element":"img"}],[{"text":"The second equality is due to: (i) ","element":"span"},{"text":"w ","element":"span"},{"text":"and ","element":"span"},{"text":"b ","element":"span"},{"text":"are functions of the history up to time ","element":"span"},{"style":{"height":15.2},"width":233.76,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-13.png","element":"img","alt":" t − 1, Ht−1;","inline":true,"padRight":true},{"text":"(ii) for fixed ","element":"span"},{"style":{"height":18.48},"width":471.04,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-14.png","element":"img","alt":" w and b, the set Kb,w(Ξ⊤","inline":true},{"text":") is determined by Ξ","element":"span"},{"style":{"height":17.6},"width":205.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-15.png","element":"img","alt":"⊤ ; (ii) τI(i","inline":true},{"text":") is uniformly distributed on ","element":"span"},{"style":{"height":18.48},"width":414.24,"height":46.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/37-16.png","element":"img","alt":"Kb,w(Ξ⊤) given Ht−1.","inline":true}],[{"text":"Recall that Ξ","element":"span"},{"style":{"height":17.89},"width":151.88,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-0.png","element":"img","alt":"⊤ ∈ Ωr,η","inline":true,"padRight":true},{"text":"if the preference matrix Ξ is (","element":"span"},{"style":{"height":12.4},"width":56.56,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-1.png","element":"img","alt":"r, η","inline":true},{"text":")-row regular. We have","element":"span"}],[{"style":{"height":32.93},"width":1969.32,"height":82.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-2.png","element":"img","alt":"P�Lu,i = +1,au,t = i | cti ≤ r, Ξ⊤ ∈ Ωr,η� (a)= E�P[Lu,i = +1 | Ht−1, Ξ] P[au,t = i|Ht−1 ]�� cti ≤ r, Ξ⊤ ∈ Ωr,η�","inline":true}],[{"style":{"width":"87%"},"width":1629,"height":270,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-3.png","element":"img"}],[{"text":"and this proves the lemma. It remain to justify the steps above. (a) follows since conditional on ","element":"span"},{"style":{"height":14.69},"width":92.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-4.png","element":"img","alt":"Ht−1","inline":true},{"text":", the random variable ","element":"span"},{"style":{"height":13.09},"width":64.32,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-5.png","element":"img","alt":" au,t","inline":true,"padRight":true},{"text":"is independent of all other random variables. (b) uses ","element":"span"},{"href":"#id-88","text":"(42) ","element":"a"},{"text":"and the fact that ","element":"span"},{"style":{"height":18.29},"width":282.44,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-6.png","element":"img","alt":" P[au,t = i|Ht−1","inline":true},{"text":"] is nonzero only if ","element":"span"},{"text":"u ","element":"span"},{"text":"has not rated item ","element":"span"},{"text":"i","element":"span"},{"text":", so we may add ","element":"span"},{"style":{"height":17.6},"width":290.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-7.png","element":"img","alt":" 1[u /∈ w]. (c) is","inline":true,"padRight":true},{"text":"justified as follows: if Ξ","element":"span"},{"style":{"height":17.89},"width":160.52,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-8.png","element":"img","alt":"⊤ ∈ Ωr,η","inline":true},{"text":", then by Claim ","element":"span"},{"href":"#id-50","text":"5.2, ","element":"a"},{"text":"Ξ","element":"span"},{"style":{"height":17.89},"width":218.4,"height":44.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-9.png","element":"img","alt":"⊤ ∈ Ωr−1,η.","inline":true,"padRight":true},{"text":"By Definition ","element":"span"},{"href":"#id-49","text":"5.1, ","element":"a"},{"text":"this means that ","element":"span"},{"href":"#id-89","style":{"height":22.78},"width":1778.12,"height":56.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-10.png","element":"img","alt":" kb,w(Ξ⊤) ≥ (1 − η)qI/2|w| and kb+,{w,u}(Ξ⊤) ≤ (1 + η)qI/2|w|+1. (d) If N > 32, then η in (37)","inline":true,"padRight":true},{"text":"is less than 1","element":"span"},{"text":"/ ","element":"span"},{"text":"log 32 and (1 + ","element":"span"},{"style":{"height":17.6},"width":384,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-11.png","element":"img","alt":" η)/(1 − η) ≤ 1 + 3η.","inline":true}],[{"text":"7.3.2 ","element":"span"},{"text":"Proof of Lemma ","element":"span"},{"href":"#id-82","text":"7.4","element":"a"}],[{"text":"At a high level, we make two observations: (i) if user ","element":"span"},{"text":"u ","element":"span"},{"text":"has not rated any item with type ","element":"span"},{"style":{"height":17.6},"width":226.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-12.png","element":"img","alt":" τI(i) before,","inline":true,"padRight":true},{"text":"the feedback in the history ","element":"span"},{"style":{"height":14.69},"width":92.36,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-13.png","element":"img","alt":" Ht−1","inline":true,"padRight":true},{"text":"is independent of the value of ","element":"span"},{"style":{"height":19.06},"width":117.64,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-14.png","element":"img","alt":" ξu,τI(i)","inline":true,"padRight":true},{"text":"given all other elements of matrix Ξ and the item types; (ii) the types of items (function ","element":"span"},{"style":{"height":17.6},"width":66.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-15.png","element":"img","alt":" τI(·","inline":true},{"text":")) are independent of matrix Ξ, and the elements of Ξ are independent. Hence, conditional on (","element":"span"},{"style":{"height":23.49},"width":161.4,"height":58.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-16.png","element":"img","alt":"Btu,τI(i))c","inline":true},{"text":", the posterior distribution ","element":"span"},{"text":"at time ","element":"span"},{"style":{"height":17.49},"width":151.2,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-17.png","element":"img","alt":" t of Lu,i","inline":true,"padRight":true},{"text":"is uniform on ","element":"span"},{"style":{"height":17.6},"width":185.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-18.png","element":"img","alt":" {−1, +1}.","inline":true}],[{"text":"Concretely, we may think of revealing the entries of Ξ on a “need-to-know” basis. Conditional on (","element":"span"},{"style":{"height":23.49},"width":487.72,"height":58.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-19.png","element":"img","alt":"Btu,τI(i))c, entry ξτU(u),τI(i)","inline":true,"padRight":true},{"text":"has not yet been touched, so","element":"span"}],[{"style":{"width":"76%"},"width":1433,"height":163,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-20.png","element":"img"}],[{"text":"The lemma now follows by the tower property.","element":"span"}],[{"text":"7.3.3 ","element":"span"},{"text":"Proof of Lemma ","element":"span"},{"href":"#id-84","text":"7.6","element":"a"}],[{"text":"Lemma ","element":"span"},{"href":"#id-84","text":"7.6 ","element":"a"},{"text":"upper bounds the expected number of good recommendations. To prepare for the proof of this lemma we introduce some notation. Recall that ","element":"span"},{"style":{"height":19.14},"width":30.71,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-21.png","element":"img","alt":" cti ","inline":true,"padRight":true},{"text":"(defined in ","element":"span"},{"href":"#id-87","text":"(39)","element":"a"},{"text":") is the number of users ","element":"span"},{"text":"who have rated item ","element":"span"},{"text":"i ","element":"span"},{"text":"before time ","element":"span"},{"style":{"height":19.14},"width":429.52,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-22.png","element":"img","alt":" t. Let Ft = {i : cti ≥ r}","inline":true,"padRight":true},{"text":"be the set of items with at least ","element":"span"},{"text":"r ","element":"span"},{"text":"ratings ","element":"span"},{"text":"before time ","element":"span"},{"style":{"height":17.6},"width":290.88,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-23.png","element":"img","alt":" t and ft := |Ft|","inline":true,"padRight":true},{"text":"their number,","element":"span"}],[{"style":{"width":"17%"},"width":328,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/38-24.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":19.14},"width":382.96,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-0.png","element":"img","alt":" Gt = {i : 0 < cti < r}","inline":true,"padRight":true},{"text":"be the set of items that have been rated by at least one and fewer than ","element":"span"},{"text":"r ","element":"span"},{"text":"users by time ","element":"span"},{"style":{"height":17.6},"width":284.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-1.png","element":"img","alt":" t and gt := |Gt|","inline":true,"padRight":true},{"text":"their number,","element":"span"}],[{"id":"id-91","style":{"width":"21%"},"width":410,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-2.png","element":"img"}],[{"text":"In the following claim, we bound the number of good recommendations up to time ","element":"span"},{"text":"T ","element":"span"},{"text":"in terms of ","element":"span"},{"style":{"height":16.4},"width":206.4,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-3.png","element":"img","alt":" fT and gT .","inline":true}],[{"id":"id-92","text":"Claim 7.7. ","element":"span"},{"text":"The number of good recommendations ","element":"span"},{"text":"good","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":")","element":"span"},{"text":", defined in ","element":"span"},{"href":"#id-90","text":"(41)","element":"a"},{"text":", satisfies ","element":"span"},{"style":{"height":17.6},"width":205.36,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-4.png","element":"img","alt":" good(T) ≤","inline":true},{"style":{"height":16.4},"width":301.48,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-5.png","element":"img","alt":"TN − gT − fT r.","inline":true}],[{"style":{"height":16.8},"width":359.04,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-6.png","element":"img","alt":"Proof. Any i ∈ FT","inline":true,"padRight":true},{"text":"is recommended to at least ","element":"span"},{"text":"r ","element":"span"},{"text":"users by the end of time ","element":"span"},{"text":"T . ","element":"span"},{"text":"So, for any ","element":"span"},{"style":{"height":15.6},"width":149.76,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-7.png","element":"img","alt":" i ∈ FT ,","inline":true,"padRight":true},{"text":"there are ","element":"span"},{"text":"r ","element":"span"},{"text":"recommendations in which ","element":"span"},{"text":"i ","element":"span"},{"text":"has been rated fewer than ","element":"span"},{"text":"r ","element":"span"},{"text":"previous times:","element":"span"}],[{"style":{"width":"72%"},"width":1350,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-8.png","element":"img"}],[{"text":"Any ","element":"span"},{"style":{"height":15.09},"width":118.56,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-9.png","element":"img","alt":" i ∈ GT","inline":true,"padRight":true},{"text":"has been recommended at least once, so for these items","element":"span"}],[{"style":{"width":"44%"},"width":842,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-10.png","element":"img"}],[{"text":"All recommended items are either in ","element":"span"},{"style":{"height":15.09},"width":175.2,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-11.png","element":"img","alt":" FT or GT","inline":true,"padRight":true},{"text":". So, the total number of recommendations satisfies","element":"span"}],[{"style":{"width":"87%"},"width":1634,"height":387,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-12.png","element":"img"}],[{"text":"where we used ","element":"span"},{"style":{"height":17.65},"width":260.08,"height":44.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-13.png","element":"img","alt":" good(T) ≤ �","inline":true}],[{"text":"The rest of the section contains the proof of Lemma ","element":"span"},{"href":"#id-84","text":"7.6.","element":"a"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-84","text":"7.6. ","element":"a"},{"text":"The lemma consists of three bounds on ","element":"span"},{"text":"good","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":"). ","element":"span"},{"style":{"height":17.6},"width":874.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/39-14.png","element":"img","alt":"Proof of inequality E [good(T)] ≤ TN − Tr","inline":true},{"text":". This bound is the easiest, so we start with this.","element":"span"}],[{"style":{"width":"68%"},"width":1291,"height":321,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-0.png","element":"img"}],[{"text":"(a) uses the definition of ","element":"span"},{"text":"good","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":"). For any item ","element":"span"},{"text":"i","element":"span"},{"text":", the sequence ","element":"span"},{"style":{"height":19.33},"width":30.72,"height":48.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-1.png","element":"img","alt":" cti ","inline":true,"padRight":true},{"text":"is nondecreasing in ","element":"span"},{"text":"t","element":"span"},{"text":". This shows ","element":"span"},{"text":"that if for an item ","element":"span"},{"style":{"height":19.14},"width":277.24,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-2.png","element":"img","alt":" cti > r at t ≤ T","inline":true},{"text":", then item ","element":"span"},{"style":{"height":19.94},"width":305.88,"height":49.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-3.png","element":"img","alt":" i satisfies cTi ≥ r","inline":true},{"text":". (b) is derived by changing the order ","element":"span"},{"text":"of summations, the definition of ","element":"span"},{"style":{"height":15.09},"width":55.2,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-4.png","element":"img","alt":" FT","inline":true,"padRight":true},{"text":"and equality ","element":"span"},{"style":{"height":19.42},"width":939.36,"height":48.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-5.png","element":"img","alt":" 1[cti ≥ r, au,t = i] = 1[au,t = i]−1[cti < r, au,t = i].","inline":true,"padRight":true},{"text":"(c) Because each item is recommended at most once to each user, each item (and specifically items in ","element":"span"},{"style":{"height":15.09},"width":55.2,"height":37.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-6.png","element":"img","alt":" FT","inline":true,"padRight":true},{"text":") are recommended at most ","element":"span"},{"text":"N ","element":"span"},{"text":"times. This gives for ","element":"span"},{"style":{"height":28.08},"width":739.04,"height":70.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-7.png","element":"img","alt":" i ∈ FT , �t∈[T ]u∈[N] 1[au,t = i] ≤ N as the","inline":true,"padRight":true},{"text":"bound for the first term in (c). The second term in (c) is bounded by Equation ","element":"span"},{"href":"#id-91","text":"(43)","element":"a"},{"text":".","element":"span"}],[{"text":"Plugging this into the statement of Claim ","element":"span"},{"href":"#id-92","text":"7.7 ","element":"a"},{"text":"gives for any values of ","element":"span"},{"style":{"height":16.4},"width":206.4,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-8.png","element":"img","alt":" fT and gT ,","inline":true}],[{"style":{"width":"60%"},"width":1134,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-9.png","element":"img"}],[{"text":"We now prove the other two bounds on ","element":"span"},{"text":"good","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":"), this time in terms of the number of item types each user has rated by time ","element":"span"},{"text":"T","element":"span"},{"text":". Let Γ","element":"span"},{"style":{"height":19.36},"width":24,"height":48.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-10.png","element":"img","alt":"Tu ","inline":true,"padRight":true},{"text":"be the set of item types that are recommended to user ","element":"span"},{"text":"u ","element":"span"},{"text":"up to time ","element":"span"},{"text":"T","element":"span"},{"text":",","element":"span"}],[{"style":{"width":"99%"},"width":1865,"height":263,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-11.png","element":"img"}],[{"id":"id-93","text":"Proof. ","element":"span"},{"text":"For user ","element":"span"},{"text":"u","element":"span"},{"text":", the number of times an item type is rated for the first time by this user is equal to the number of item types rated by user ","element":"span"},{"style":{"height":24.62},"width":865.44,"height":61.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-12.png","element":"img","alt":" u. Hence, �t∈[T] 1[(Btu,τI(au,t))c] = γTu and","inline":true},{"style":{"height":24.62},"width":563.04,"height":61.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-13.png","element":"img","alt":"�t∈[T] 1[Btu,τI(au,t)] = T − γTu ","inline":true,"padRight":true},{"text":". Summing over ","element":"span"},{"style":{"height":29.02},"width":975.56,"height":72.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-14.png","element":"img","alt":" u and using good(T) ≤ �t∈[T ]u∈[N] 1[Btu,τI(au,t)] proves","inline":true,"padRight":true},{"text":"the claim.","element":"span"}],[{"style":{"height":17.6},"width":889.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-15.png","element":"img","alt":"Proof of inequality E [good(T)] ≤ NT − N","inline":true},{"text":". This follows from Claim ","element":"span"},{"href":"#id-93","text":"7.8 ","element":"a"},{"text":"since ","element":"span"},{"style":{"height":19.55},"width":273.12,"height":48.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-16.png","element":"img","alt":" γTu ≥ 1 for all","inline":true},{"style":{"height":17.6},"width":154.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-17.png","element":"img","alt":"u ∈ [N].","inline":true}],[{"style":{"height":21.27},"width":994.16,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-18.png","element":"img","alt":"Proof of inequality E [good(T)] ≤ NT − 12NZ(T).","inline":true,"padRight":true},{"text":"This last inequality is more involved than the ","element":"span"},{"text":"others. Let ","element":"span"},{"style":{"height":21.73},"width":33.68,"height":54.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-19.png","element":"img","alt":" rtj ","inline":true,"padRight":true},{"text":"be the number of items with type ","element":"span"},{"text":"j ","element":"span"},{"text":"that have been recommended (to any user) by ","element":"span"},{"text":"time ","element":"span"},{"text":"t","element":"span"},{"text":":","element":"span"}],[{"id":"id-94","style":{"width":"99%"},"width":1868,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/40-20.png","element":"img"}],[{"style":{"width":"99%"},"width":1865,"height":227,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-0.png","element":"img"}],[{"text":"We get a lower bound on min","element":"span"},{"style":{"height":19.36},"width":84.96,"height":48.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-1.png","element":"img","alt":"u γTu ","inline":true,"padRight":true},{"text":"via an upper bound on max","element":"span"},{"style":{"height":22.33},"width":69.6,"height":55.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-2.png","element":"img","alt":"j rTj ","inline":true,"padRight":true},{"text":", which is in turn obtained via ","element":"span"},{"text":"martingale concentration bounds for each ","element":"span"},{"style":{"height":22.53},"width":45.12,"height":56.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-3.png","element":"img","alt":" rTj ","inline":true,"padRight":true},{"text":". This is essentially just a question of bounding the ","element":"span"},{"text":"fullest bin in a balls and bins scenario, with the added complication that the time-steps in which balls are thrown is random. Thus the number of balls (i.e., ","element":"span"},{"style":{"height":16.4},"width":147.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-4.png","element":"img","alt":" fT + gT","inline":true,"padRight":true},{"text":") is random, and the decision to throw a ball at a given time may depend on the configuration of balls in bins.","element":"span"}],[{"id":"id-101","style":{"width":"77%"},"width":1450,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-5.png","element":"img"}],[{"text":"2 ","element":"span"},{"text":", ","element":"span"},{"style":{"height":19.63},"width":365.68,"height":49.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-6.png","element":"img","alt":"if k < k0 := √qI/3","inline":true}],[{"text":"3 ","element":"span"},{"text":"log ","element":"span"},{"style":{"height":24.42},"width":727.6,"height":61.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-7.png","element":"img","alt":" qIlog qI−log k , if k0 ≤ k < k1 := qI/2","inline":true}],[{"text":"8 log ","element":"span"},{"style":{"height":17.2},"width":757.64,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-8.png","element":"img","alt":" qI , if k1 ≤ k < k2 := 2qI log qI","inline":true}],[{"style":{"width":"64%"},"width":1202,"height":199,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-9.png","element":"img"}],[{"text":"Proof. ","element":"span"},{"text":"First, we define a useful martinagle. Let ","element":"span"},{"style":{"height":22.21},"width":492.56,"height":55.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-10.png","element":"img","alt":" rt = (rt1, . . . , rtqI) where rtj ","inline":true,"padRight":true},{"text":"is defined in ","element":"span"},{"href":"#id-94","text":"(46)","element":"a"},{"text":". Note ","element":"span"},{"text":"that ","element":"span"},{"style":{"height":21.92},"width":298.16,"height":54.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-11.png","element":"img","alt":" ft + gt = �j rtj ","inline":true,"padRight":true},{"text":"is the total number of recommended items at the end of time ","element":"span"},{"text":"t","element":"span"},{"text":". Any new ","element":"span"},{"text":"item has type uniformly distributed on [","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-12.png","element":"img","alt":"qI","inline":true},{"text":"]; as a consequence, the sequence ","element":"span"},{"style":{"height":21.54},"width":399.28,"height":53.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-13.png","element":"img","alt":" rtj − (ft + gt)/qI is a","inline":true,"padRight":true},{"text":"martingale with respect to filtration ","element":"span"},{"style":{"height":19.14},"width":696.48,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-14.png","element":"img","alt":" Ft = σ(r0, r1, . . . , rt), because ft + gt","inline":true,"padRight":true},{"text":"is incremented whenever a new item is recommended and each new item increases ","element":"span"},{"style":{"height":21.54},"width":34.68,"height":53.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-15.png","element":"img","alt":" rtj ","inline":true,"padRight":true},{"text":"by one with probability 1","element":"span"},{"style":{"height":17.81},"width":76.32,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-16.png","element":"img","alt":"/qI.","inline":true}],[{"text":"It turns out to be easier to work with a different martingale that considers recommendations to each user separately, so that the item counts are incremented by at most one at each step. Consider the lexicographical ordering on pairs (","element":"span"},{"style":{"height":17.6},"width":466.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-17.png","element":"img","alt":"t, u), where (s, v) ≤ (t, u","inline":true},{"text":") if either ","element":"span"},{"style":{"height":14.8},"width":474.28,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-18.png","element":"img","alt":" s < t or s = t and v ≤ u","inline":true,"padRight":true},{"text":"(such that the recommendation to user ","element":"span"},{"text":"v ","element":"span"},{"text":"at time ","element":"span"},{"text":"s ","element":"span"},{"text":"occurred before that of user ","element":"span"},{"text":"u ","element":"span"},{"text":"at time ","element":"span"},{"text":"t","element":"span"},{"text":"). For","element":"span"}],[{"style":{"width":"77%"},"width":1452,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-19.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":22.4},"width":368.96,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-20.png","element":"img","alt":" rt,u = (rt,u1 , . . . , rt,uqI ","inline":true,"padRight":true},{"text":") and define ","element":"span"},{"style":{"height":24.03},"width":263.84,"height":60.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-21.png","element":"img","alt":" ρt,u = �j rt,uj","inline":true,"padRight":true},{"text":"to be the total number of items recommended by (","element":"span"},{"style":{"height":19.54},"width":488.16,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-22.png","element":"img","alt":"t, u), e.g., ρT,N = fT + gT","inline":true,"padRight":true},{"text":". We now define a sequence of stopping times ","element":"span"},{"style":{"height":17.6},"width":264.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-23.png","element":"img","alt":" Zk ∈ N × [N],","inline":true}],[{"style":{"width":"41%"},"width":769,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-24.png","element":"img"}],[{"text":"where (","element":"span"},{"text":"t, ","element":"span"},{"text":"0) is interpreted as (","element":"span"},{"style":{"height":17.6},"width":566.16,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-25.png","element":"img","alt":"t − 1, N) and Z0 = (0, N). Zk","inline":true,"padRight":true},{"text":"is the first (","element":"span"},{"text":"t, u","element":"span"},{"text":") such that a new item is recommended by the algorithm for the ","element":"span"},{"text":"k","element":"span"},{"text":"-th time, so ","element":"span"},{"style":{"height":19.14},"width":327.12,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-26.png","element":"img","alt":" ρZk = k. The Zk","inline":true,"padRight":true},{"text":"are stopping times with respect to (","element":"span"},{"style":{"height":18.34},"width":64.16,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-27.png","element":"img","alt":"ρt,u","inline":true},{"text":"), and observe that ","element":"span"},{"style":{"height":19.54},"width":1224.8,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-28.png","element":"img","alt":" k∗ = max{k : Zk ≤ (T, N)} = ρT,N = fT +gT since fT +gT is the","inline":true,"padRight":true},{"text":"total number of items recommended by the algorithm by the end of time ","element":"span"},{"style":{"height":19.14},"width":461.28,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-29.png","element":"img","alt":" T. Also, ρZk∗ = fT + gT","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":26.4},"width":537.6,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/41-30.png","element":"img","alt":" rZk∗j = r(T,N)j for all j ∈ [qI].","inline":true}],[{"text":"Fix item type ","element":"span"},{"style":{"height":17.81},"width":127.88,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-0.png","element":"img","alt":" j ∈ [qI","inline":true},{"text":"]. The sequence ","element":"span"},{"style":{"height":24.03},"width":396.68,"height":60.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-1.png","element":"img","alt":" Mt,uj = rt,uj − ρt,u/qI","inline":true,"padRight":true},{"text":"is a martingale with respect to the filtration ","element":"span"},{"style":{"height":19.54},"width":435,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-2.png","element":"img","alt":" Ft,u = σ(r1,1, . . . rt,u) †","inline":true},{"text":". It follows that ","element":"span"},{"style":{"height":26.4},"width":335.2,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-3.png","element":"img","alt":"�Mkj := M(T,N)∧Zkj","inline":true,"padRight":true},{"text":"is martingale as well, this time with respect to ","element":"span"},{"style":{"height":25.22},"width":1200.6,"height":63.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-4.png","element":"img","alt":"�Fk := F(T,N)∧Zk. Since Zk∗ ≤ (T, N), we have �Mk∗j = MZk∗j","inline":true,"padRight":true},{"text":". We will use this notation to prove statement of the claim in three different regimes. First, we would like to apply martingale concentration (Lemma ","element":"span"},{"href":"#id-95","text":"A.2) ","element":"a"},{"text":"to ","element":"span"},{"style":{"height":22.53},"width":65.04,"height":56.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-5.png","element":"img","alt":"�Mkj ","inline":true,"padRight":true},{"text":", and to this end observe that Var(","element":"span"},{"style":{"height":22.53},"width":338.12,"height":56.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-6.png","element":"img","alt":"�Mkj | �Fk−1) ≤ 1/qI","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":22.34},"width":301.84,"height":55.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-7.png","element":"img","alt":" |�Mkj − �Mk−1| ≤","inline":true,"padRight":true},{"text":"1 almost surely.","element":"span"}],[{"style":{"height":17.01},"width":713.48,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-8.png","element":"img","alt":"Step 1 For any k ≥ k2 := 2qI log qI","inline":true},{"text":", Lemma ","element":"span"},{"href":"#id-95","text":"A.2 ","element":"a"},{"text":"gives","element":"span"}],[{"style":{"width":"56%"},"width":1049,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-9.png","element":"img"}],[{"text":"This gives","element":"span"}],[{"style":{"height":36.72},"width":1234.8,"height":91.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-10.png","element":"img","alt":"P�maxj∈[qI] rZk∗j ≥ ΘqI(k∗), k∗ ≥ k2�≤ P�∃k ≥ k2 s.t. maxj∈[qI] rZkj ≥ 4kqI","inline":true}],[{"id":"id-97","style":{"width":"82%"},"width":1543,"height":142,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-11.png","element":"img"}],[{"text":"where (a) uses ","element":"span"},{"style":{"height":19.14},"width":155.96,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-12.png","element":"img","alt":" ρZk = k","inline":true},{"text":". (b) uses a union bound and the inequality in the last display. (c) uses definition of ","element":"span"},{"style":{"height":21.27},"width":559.88,"height":53.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-13.png","element":"img","alt":" k2 and 1 − exp(−2/qI) > q−2I","inline":true,"padRight":true},{"text":"(which is derived using ","element":"span"},{"style":{"height":19.34},"width":604.8,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-14.png","element":"img","alt":" e−a ≤ 1 − a + a2/2 and qI > 1).","inline":true}],[{"style":{"height":16.4},"width":605,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-15.png","element":"img","alt":"Step 2 For any k2 > k we get","inline":true}],[{"style":{"width":"82%"},"width":1547,"height":114,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-16.png","element":"img"}],[{"text":"This gives","element":"span"}],[{"id":"id-98","style":{"width":"85%"},"width":1606,"height":238,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-17.png","element":"img"}],[{"text":"(a) uses ","element":"span"},{"style":{"height":19.14},"width":147.32,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/42-18.png","element":"img","alt":" ρZk = k","inline":true},{"text":". (b) uses the inequality in the above display.","element":"span"}],[{"style":{"height":17.81},"width":678.16,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-0.png","element":"img","alt":"Step 3 This step, k < k1 := qI/","inline":true},{"text":"2, corresponds to bounding the number of balls in the fullest bin when the number of balls, ","element":"span"},{"text":"k","element":"span"},{"text":", is sublinear in the number of bins, ","element":"span"},{"style":{"height":22.47},"width":493.44,"height":56.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-1.png","element":"img","alt":" qI (since k = q1−3δI with","inline":true},{"style":{"height":24.91},"width":186.8,"height":62.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-2.png","element":"img","alt":"δ > 18 log qI ","inline":true,"padRight":true},{"text":"). We will show that in this regime, the number of balls in the fullest bin is bounded by ","element":"span"},{"text":"1","element":"span"},{"style":{"height":17.6},"width":41.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-3.png","element":"img","alt":"/δ","inline":true},{"text":". For given ","element":"span"},{"style":{"height":27.42},"width":1145.28,"height":68.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-4.png","element":"img","alt":" k < k1, define δ = 13log qI−log klog qI (such that k = q1−3δI ). Then,","inline":true}],[{"id":"id-96","style":{"width":"80%"},"width":1513,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-5.png","element":"img"}],[{"text":"(a) is a union bound over ","element":"span"},{"style":{"height":17.81},"width":118.76,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-6.png","element":"img","alt":" j ∈ [qI","inline":true},{"text":"]. (b) uses the fact that ","element":"span"},{"style":{"height":23.39},"width":219.52,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-7.png","element":"img","alt":" rZk+11 = rZk1","inline":true,"padRight":true},{"text":"+ 1 with probability 1","element":"span"},{"style":{"height":17.81},"width":147.84,"height":44.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-8.png","element":"img","alt":"/qI and","inline":true},{"style":{"height":23.39},"width":219.52,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-9.png","element":"img","alt":"rZk+11 = rZk1","inline":true,"padRight":true},{"text":"with probability 1 ","element":"span"},{"style":{"height":18},"width":126.44,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-10.png","element":"img","alt":" − 1/qI","inline":true,"padRight":true},{"text":"independently of ","element":"span"},{"style":{"height":22.05},"width":60.16,"height":55.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-11.png","element":"img","alt":" rZk1 ","inline":true,"padRight":true},{"text":". (c) holds for every ","element":"span"},{"style":{"height":25.11},"width":349.92,"height":62.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-12.png","element":"img","alt":" δ > 18 log qI ( which","inline":true,"padRight":true},{"text":"is due to ","element":"span"},{"style":{"height":25.83},"width":581.28,"height":64.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-13.png","element":"img","alt":" k < k1) using� k1/δ�≤ (keδ)1/δ.","inline":true}],[{"id":"id-99","style":{"width":"92%"},"width":1740,"height":235,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-14.png","element":"img"}],[{"text":"(a) uses ","element":"span"},{"style":{"height":26.4},"width":249.16,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-15.png","element":"img","alt":" rZk∗j = r(T,N)j","inline":true,"padRight":true},{"text":". (b) uses a union bound and ","element":"span"},{"href":"#id-96","text":"(51)","element":"a"},{"text":". Last inequality uses ","element":"span"},{"style":{"height":17.01},"width":153.6,"height":42.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-16.png","element":"img","alt":" k1 < qI.","inline":true}],[{"text":"Step 4 ","element":"span"},{"text":"This step uses a variation of the Birthday Paradox to bound max","element":"span"},{"style":{"height":24.83},"width":197.28,"height":62.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-17.png","element":"img","alt":"j∈[qI] rT,Nj .","inline":true}],[{"id":"id-100","style":{"width":"95%"},"width":1795,"height":630,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-18.png","element":"img"}],[{"text":"(a) and (b) use the fact that ","element":"span"},{"style":{"height":24.64},"width":60.16,"height":61.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-19.png","element":"img","alt":" rZkj","inline":true,"padRight":true},{"text":"is a nondecreasing function of ","element":"span"},{"style":{"height":24.26},"width":628.16,"height":60.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-20.png","element":"img","alt":" k. We define rZ0j = 0. The type","inline":true,"padRight":true},{"text":"of the (","element":"span"},{"text":"k ","element":"span"},{"text":"+ 1)-th drawn item is independent of the type of the previous ","element":"span"},{"text":"k ","element":"span"},{"text":"drawn items. Hence, conditional on ","element":"span"},{"style":{"height":24.93},"width":394.88,"height":62.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-21.png","element":"img","alt":" rZk =�rZk1 , · · · , rZkqI�","inline":true},{"text":", the random variable ","element":"span"},{"style":{"height":15.54},"width":98.04,"height":38.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-22.png","element":"img","alt":" rZk+1 ","inline":true,"padRight":true},{"text":"is independent of ","element":"span"},{"style":{"height":15.54},"width":98.52,"height":38.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-23.png","element":"img","alt":" rZk−1","inline":true},{"text":". This gives equality (c). (d) uses exp","element":"span"},{"style":{"height":20.8},"width":1380.2,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-24.png","element":"img","alt":"�k log(1 − k/qI)�≥ exp�− k2/(qI − k)�≥ exp�− 2k2/qI�≥ 1 − 2k2/qI","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":19.82},"width":416.16,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/43-25.png","element":"img","alt":" k ≤ k0 ≤ √qI ≤ qI/2.","inline":true}],[{"text":"We put it all together,","element":"span"}],[{"style":{"height":37.25},"width":1059.24,"height":93.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-0.png","element":"img","alt":"P�maxj∈[qI] rTj ≥ ΘqI(fT + gT )� (a)= P�maxj∈[qI] rZk∗j ≥ ΘqI(k∗)�","inline":true}],[{"style":{"width":"81%"},"width":1531,"height":335,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-1.png","element":"img"}],[{"text":"(a) uses the definition of ","element":"span"},{"style":{"height":15.28},"width":215.04,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-2.png","element":"img","alt":" Zk and k∗.","inline":true,"padRight":true},{"text":"(b) uses ","element":"span"},{"href":"#id-97","text":"(49)","element":"a"},{"text":", ","element":"span"},{"href":"#id-98","text":"(50)","element":"a"},{"text":", ","element":"span"},{"href":"#id-99","text":"(52) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-100","text":"(53)","element":"a"},{"text":". Last inequality uses ","element":"span"},{"style":{"height":16.21},"width":155.52,"height":40.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-3.png","element":"img","alt":"qI > 10.","inline":true}],[{"style":{"width":"80%"},"width":1516,"height":223,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-4.png","element":"img"}],[{"text":"This statement and Claim ","element":"span"},{"href":"#id-93","text":"7.8 ","element":"a"},{"text":"imply that with probability at least 1","element":"span"},{"text":"/","element":"span"},{"text":"2 we have","element":"span"}],[{"style":{"width":"35%"},"width":656,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-5.png","element":"img"}],[{"text":"Combining this bound and Claim ","element":"span"},{"href":"#id-92","text":"7.7 ","element":"a"},{"text":"gives that with probability at least 1/2,","element":"span"}],[{"style":{"width":"75%"},"width":1416,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-6.png","element":"img"}],[{"text":"Using the definition of the function Θ","element":"span"},{"style":{"height":19.34},"width":75.32,"height":48.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-7.png","element":"img","alt":"qI(k","inline":true},{"text":") in Claim ","element":"span"},{"href":"#id-101","text":"7.9 ","element":"a"},{"text":"and some algebra, one can show that for any ","element":"span"},{"text":"k > ","element":"span"},{"text":"0","element":"span"},{"text":",","element":"span"}],[{"style":{"width":"89%"},"width":1684,"height":394,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-8.png","element":"img"}],[{"text":"or alternatively, max","element":"span"},{"style":{"height":26.66},"width":630.52,"height":66.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-9.png","element":"img","alt":"�k, NTΘqI (k)�≥ NZ(T) where Z(T","inline":true},{"text":") is defined in ","element":"span"},{"href":"#id-102","text":"(36)","element":"a"},{"text":". To prove this, we show that if ","element":"span"},{"style":{"height":25.06},"width":567.88,"height":62.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/44-10.png","element":"img","alt":" k < Z(T), then ΘqI(k) ≤ NTZ(T) ","inline":true,"padRight":true},{"text":"for each regime of parameter ","element":"span"},{"text":"T","element":"span"},{"text":". Note that the above bound is ","element":"span"},{"text":"not tight, but it is chosen such that this lower bound (and consequently the function ","element":"span"},{"text":"Z","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":") which is a scaling of the above lower bound as defined in ","element":"span"},{"href":"#id-102","text":"(36)","element":"a"},{"text":") is continuous up to a multiplicative constant factor.","element":"span"}],[{"text":"Equation ","element":"span"},{"href":"#id-90","text":"(41) ","element":"a"},{"text":"shows that ","element":"span"},{"style":{"height":17.6},"width":284.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/45-0.png","element":"img","alt":" good(T) ≤ NT","inline":true,"padRight":true},{"text":"with probability one. Hence,","element":"span"}],[{"style":{"width":"58%"},"width":1099,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/45-1.png","element":"img"}],[{"text":"This completes the proof of the lemma.","element":"span"}]]},{"heading":"8 Discussion","paragraphs":[[{"text":"In this paper, we analyzed the performance of online collaborative filtering within a latent variable model for the preferences of users for items. We proposed variants of user-user CF and item-item CF that explicitly explore the preference space. We also proved lower bounds for regret in the extreme regimes of parameters corresponding to user-structure only (no structure in item space) and item-structure only (no structure in user space). The lower bounds showed that the proposed algorithms are almost information-theoretically optimal in these parameter regimes.","element":"span"}],[{"text":"Adaptivity to unknown time time horizon ","element":"span"},{"text":"T","element":"span"},{"text":", as required to bound the anytime regret, is achieved via a doubling trick whereby the algorithm is run afresh at a growing sequence of epochs. In practice one would surely benefit from using knowledge gained from exploration in earlier epochs instead of starting from scratch at each epoch. We mentioned how the user-user algorithm can be modified to be adaptive to the number ","element":"span"},{"style":{"height":11.6},"width":45.68,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/45-2.png","element":"img","alt":" qU","inline":true,"padRight":true},{"text":"of user types, but it is less obvious how to make the item-item algorithm adaptive without resorting to an impractical trick analogous to the doubling trick used for ","element":"span"},{"text":"T","element":"span"},{"text":".","element":"span"}],[{"text":"It is possible to modify all of the proposed algorithms to handle i.i.d. noise in the user feedback. We did this only for the user-user algorithm, but it is straightforward to do so also for the item-item algorithm. A hybrid algorithm, exploiting structure in both user space and item space, that is nearly information-theoretically optimal in all regimes appears in a forthcoming paper.","element":"span"}],[{"text":"While various insights were obtained through the analysis carried out in the paper, the assumed randomly generated user preference matrix is unrealistic. A reasonable next objective is to perform a similar analysis with a more flexible model for user preferences, perhaps described by a low-rank matrix or a graphical model.","element":"span"}]]},{"heading":"Acknowledgment","paragraphs":[[{"text":"We are indebted to Devavrat Shah, George Chen, and Luis Voloch for many inspiring discussions and thank Alexander Rakhlin for suggesting several references. This work was supported in part by grants NSF CCF-1565516, ONR N00014-17-1-2147, and DARPA W911NF-16-1-0551.","element":"span"}]]},{"heading":"A Concentration Lemmas","paragraphs":[[{"text":"The following lemma is derived by application of Chernoff bound to Binomial variables ","element":"span"},{"href":"#id-20","referenceIndex":27,"text":"[28]","element":"a"},{"text":".","element":"span"}],[{"id":"id-37","text":"Lemma A.1 ","element":"span"},{"text":"(Chernoff bound)","element":"span"},{"style":{"height":17.6},"width":489.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-0.png","element":"img","alt":". Let X1, · · · , Xn ∈ [0, 1]","inline":true,"padRight":true},{"text":"be independent random variables. Let ","element":"span"},{"style":{"height":20},"width":643.2,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-1.png","element":"img","alt":"X = �ni=1 Xi and ¯X = �ni=1 EXi","inline":true},{"text":". Then, for any ","element":"span"},{"style":{"height":14.8},"width":110.44,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-2.png","element":"img","alt":" ǫ > 0,","inline":true}],[{"id":"id-95","style":{"width":"100%"},"width":1872,"height":886,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-3.png","element":"img"}],[{"id":"id-36","text":"Lemma A.3 ","element":"span"},{"text":"(Balls and bins: tail bound for number of nonempty bins)","element":"span"},{"style":{"height":17.6},"width":488.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-4.png","element":"img","alt":". Suppose m ≤ n/4. If m","inline":true,"padRight":true},{"text":"balls are placed into ","element":"span"},{"text":"n ","element":"span"},{"text":"bins each independently and uniformly at random, then with probability at least ","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":17.6},"width":509.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-5.png","element":"img","alt":" − exp(−m/2) at least m/2","inline":true,"padRight":true},{"text":"bins are nonempty.","element":"span"}],[{"text":"Proof. ","element":"span"},{"text":"Any configuration with at most ","element":"span"},{"text":"m/","element":"span"},{"text":"2 nonempty bins has at least ","element":"span"},{"style":{"height":17.6},"width":148.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-6.png","element":"img","alt":" n − m/","inline":true},{"text":"2 empty bins. Thus we may bound the probability of having ","element":"span"},{"style":{"height":17.6},"width":391.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-7.png","element":"img","alt":" some set of n − m/","inline":true},{"text":"2 bins be empty. There are","element":"span"}],[{"style":{"width":"99%"},"width":1863,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-8.png","element":"img"}],[{"text":"which has probability [(","element":"span"},{"style":{"height":17.6},"width":203.52,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-9.png","element":"img","alt":"m/2)/n]m.","inline":true,"padRight":true},{"text":"Thus, using union bound, the probability of at most ","element":"span"},{"text":"m/","element":"span"},{"text":"2 nonempty bins is bounded by","element":"span"}],[{"style":{"width":"71%"},"width":1343,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-10.png","element":"img"}],[{"text":"where we used ","element":"span"},{"style":{"height":17.6},"width":178.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-11.png","element":"img","alt":" m ≤ n/4.","inline":true}],[{"text":"The following generalization of the above lemma is used in the analysis of the recommendation system in noisy setup.","element":"span"}],[{"text":"Lemma A.4 ","element":"span"},{"text":"(Generalized Balls and bins: tail bound for number of nonempty bins)","element":"span"},{"text":". ","element":"span"},{"text":"Fix ","element":"span"},{"text":"0 ","element":"span"},{"text":"< a < ","element":"span"},{"text":"1 ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":17.6},"width":1788.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-12.png","element":"img","alt":" c > 0. Define b ≜ min{t : −(1 − a/t) log a + (1 − a)/t log(1 − a) > c}. Suppose m ≤ n/b. If m","inline":true,"padRight":true},{"text":"balls are placed into ","element":"span"},{"text":"n ","element":"span"},{"text":"bins each independently and uniformly at random, then with probability at least ","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":17.6},"width":452.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-13.png","element":"img","alt":" − exp(−cm) at least na","inline":true,"padRight":true},{"text":"bins are nonempty.","element":"span"}],[{"text":"Proof. ","element":"span"},{"text":"Any configuration with at most ","element":"span"},{"text":"m/","element":"span"},{"text":"2 nonempty bins has at least ","element":"span"},{"style":{"height":17.6},"width":148.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-14.png","element":"img","alt":" n − m/","inline":true},{"text":"2 empty bins. Thus we may bound the probability of having ","element":"span"},{"style":{"height":17.6},"width":391.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-15.png","element":"img","alt":" some set of n − m/","inline":true},{"text":"2 bins be empty. There are","element":"span"}],[{"style":{"width":"99%"},"width":1863,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/46-16.png","element":"img"}],[{"text":"which has probability [(","element":"span"},{"style":{"height":17.6},"width":188.4,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-0.png","element":"img","alt":"m/2)/n]m","inline":true},{"text":". Thus, the probability of at most ","element":"span"},{"text":"m/","element":"span"},{"text":"2 nonempty bins is bounded by","element":"span"}],[{"style":{"width":"71%"},"width":1343,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-1.png","element":"img"}],[{"text":"where we used ","element":"span"},{"style":{"height":17.6},"width":178.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-2.png","element":"img","alt":" m ≤ n/4.","inline":true}],[{"text":"The following lemma records a simple consequence of linearity of expectation.","element":"span"}],[{"id":"id-54","text":"Lemma A.5 ","element":"span"},{"text":"(Balls and bins: bound for the expected number of nonempty bins)","element":"span"},{"text":". ","element":"span"},{"text":"If we throw ","element":"span"},{"text":"m ","element":"span"},{"text":"balls into ","element":"span"},{"text":"n ","element":"span"},{"text":"bins independently uniformly at random, then, the expected number of nonempty bins is ","element":"span"},{"style":{"height":17.6},"width":355.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-3.png","element":"img","alt":" n[1 − (1 − 1/n)m] .","inline":true}]]},{"heading":"B Converting to anytime regret","paragraphs":[[{"text":"The ","element":"span"},{"text":"doubling trick ","element":"span"},{"text":"converts an online algorithm designed for a finite known time horizon to an algorithm that does not require knowledge of the time horizon and yet achieves the same regret (up to multiplicative constant) at any time ","element":"span"},{"href":"#id-20","referenceIndex":27,"text":"[27] ","element":"a"},{"text":"(i.e., ","element":"span"},{"text":"anytime regret","element":"span"},{"text":").","element":"span"}],[{"text":"The trick is to divide time into intervals and restart algorithm at the beginning of each interval. Let ","element":"span"},{"text":"A","element":"span"},{"text":"(","element":"span"},{"text":"T","element":"span"},{"text":") be an online algorithm taking the known time horizon as input and achieving regret R(","element":"span"},{"text":"T","element":"span"},{"text":") at time ","element":"span"},{"text":"T","element":"span"},{"text":". There are two regret scalings of interest. (1) If R(","element":"span"},{"style":{"height":17.6},"width":221.2,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-4.png","element":"img","alt":"T) = O(T α","inline":true},{"text":") for some 0 ","element":"span"},{"style":{"height":14.8},"width":179.52,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-5.png","element":"img","alt":" < α < 1,","inline":true,"padRight":true},{"text":"then to achieve anytime regret, the doubling trick uses time intervals of length 2","element":"span"},{"style":{"height":17.74},"width":353.48,"height":44.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-6.png","element":"img","alt":", 22, 23, .., 2m. This","inline":true,"padRight":true},{"text":"achieves regret of at most R(","element":"span"},{"style":{"height":17.6},"width":592.6,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-7.png","element":"img","alt":"T)/(1−2α) at time T for any T","inline":true},{"text":". (2) Alternatively, if R(","element":"span"},{"text":"T","element":"span"},{"text":") = ","element":"span"},{"text":"O","element":"span"},{"text":"(log ","element":"span"},{"text":"T","element":"span"},{"text":"), then using intervals of length 2","element":"span"},{"style":{"height":20.35},"width":222.32,"height":50.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-8.png","element":"img","alt":"2, 222, .., 22m ","inline":true,"padRight":true},{"text":"achieves regret of at most 4R(","element":"span"},{"text":"T","element":"span"},{"text":") at time ","element":"span"},{"text":"T ","element":"span"},{"text":"for any ","element":"span"},{"text":"T","element":"span"},{"text":".","element":"span"}],[{"text":"Clearly, different scalings can be used before and after ","element":"span"},{"style":{"height":14.69},"width":42.44,"height":36.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-9.png","element":"img","alt":" T1","inline":true,"padRight":true},{"text":"if the algorithm achieves regret ","element":"span"},{"style":{"height":19.99},"width":793.64,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-10.png","element":"img","alt":"O(log T) for T < T1 and O(√T) if T ≥ T1","inline":true},{"text":", as is the case for the proposed item-item CF algorithm.","element":"span"}]]},{"heading":"C Proof of Lemma 4.5","paragraphs":[[{"text":"We show that ","element":"span"},{"style":{"height":19.15},"width":336.2,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-11.png","element":"img","alt":" P[Bcuv] ≤ 2ǫ/(N 2","inline":true},{"text":") for any pair of users ","element":"span"},{"style":{"height":17.6},"width":216.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-12.png","element":"img","alt":" u, v ∈ [N].","inline":true,"padRight":true},{"text":"Using a union bound over the ","element":"span"},{"style":{"height":22.62},"width":70.4,"height":56.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-13.png","element":"img","alt":"�N2�","inline":true,"padRight":true},{"text":"pairs of users gives ","element":"span"},{"style":{"height":17.6},"width":220.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-14.png","element":"img","alt":" P[Bc] ≤ ǫ.","inline":true,"padRight":true},{"text":"According to Line 7 of the Algorithm ","element":"span"},{"href":"#id-46","text":"4.5, ","element":"a"},{"style":{"height":13.09},"width":130.48,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-15.png","element":"img","alt":" gu,v =","inline":true},{"style":{"height":19.89},"width":715.8,"height":49.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-16.png","element":"img","alt":"1{�rs=1 Lu,au,sLv,av,s ≥ λr} . At s ≤ r","inline":true},{"text":", the same item is recommended to all users: ","element":"span"},{"style":{"height":13.09},"width":318.72,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-17.png","element":"img","alt":" as := au,s = av,s.","inline":true,"padRight":true},{"text":"Hence, ","element":"span"},{"style":{"height":19.06},"width":1050.8,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-18.png","element":"img","alt":" Lu,au,s = ξτU (u),τI(as)zu,as and Lu,au,s = ξτU (v),τI(as)zv,as","inline":true},{"text":". First, we look at the users of the same type:","element":"span"}],[{"style":{"width":"100%"},"width":1876,"height":412,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/47-19.png","element":"img"}],[{"text":"where (a) uses the definition of ","element":"span"},{"style":{"height":13.09},"width":66.92,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-0.png","element":"img","alt":" gu,v","inline":true,"padRight":true},{"text":"according to the Line 7 of Algorithm ","element":"span"},{"href":"#id-45","text":"2. ","element":"a"},{"text":"(b) uses the noise model ","element":"span"},{"href":"#id-48","text":"(11)","element":"a"},{"text":". If ","element":"span"},{"style":{"height":20.05},"width":874.12,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-1.png","element":"img","alt":" τU(u) = τU(v) then ξτU(u),τI(as) = ξτU(v),τI(as)","inline":true,"padRight":true},{"text":"which gives (c). The variables ","element":"span"},{"style":{"height":13.09},"width":61.44,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-2.png","element":"img","alt":" zu,i","inline":true,"padRight":true},{"text":"are independent of other variables in the model which implies (d). For users ","element":"span"},{"style":{"height":16.8},"width":104.04,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-3.png","element":"img","alt":" u ̸= v","inline":true,"padRight":true},{"text":"and any item ","element":"span"},{"text":"i","element":"span"},{"text":", ","element":"span"},{"style":{"height":19.82},"width":415.4,"height":49.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-4.png","element":"img","alt":"E[zu,izv,i] = (1 − 2γ)2","inline":true},{"text":". Lemma ","element":"span"},{"href":"#id-37","text":"A.1 ","element":"a"},{"text":"and the choice of ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-5.png","element":"img","alt":" λ","inline":true,"padRight":true},{"text":"in Algorithm ","element":"span"},{"href":"#id-45","text":"2 ","element":"a"},{"text":"gives (e). The choice of ","element":"span"},{"text":"r ","element":"span"},{"text":"gives the result (f).","element":"span"}],[{"text":"Now, we look at the pair of users ","element":"span"},{"style":{"height":22.43},"width":1178.8,"height":56.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-6.png","element":"img","alt":" u and v such that τU(u) ̸= τU(v). Define Lu,v =��{j : ξτU(u),j ̸=","inline":true},{"style":{"height":22.24},"width":166.2,"height":55.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-7.png","element":"img","alt":"ξτU(v),j}��","inline":true,"padRight":true},{"text":"to be the set of item types on which user types ","element":"span"},{"style":{"height":17.6},"width":290.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-8.png","element":"img","alt":" τU(u) and τU(v","inline":true},{"text":") disagree.","element":"span"}],[{"style":{"width":"89%"},"width":1673,"height":484,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-9.png","element":"img"}],[{"text":"(a) uses the definition of ","element":"span"},{"style":{"height":13.09},"width":66.92,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-10.png","element":"img","alt":" gu,v","inline":true,"padRight":true},{"text":"in Line 7 of Algorithm ","element":"span"},{"href":"#id-45","text":"2. ","element":"a"},{"text":"Total probability lemma gives (b). Conditional on ","element":"span"},{"style":{"height":17.6},"width":249.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-11.png","element":"img","alt":" τU(u) ̸= τU(v","inline":true},{"text":"), the variables ","element":"span"},{"style":{"height":19.06},"width":355.32,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-12.png","element":"img","alt":" ξτU (u),j and ξτU (v),j","inline":true,"padRight":true},{"text":"are independently uniformly distributed. Using the definition on ","element":"span"},{"style":{"height":17.49},"width":76.52,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-13.png","element":"img","alt":" Lu,v","inline":true},{"text":", this implies that ","element":"span"},{"style":{"height":18.29},"width":102.72,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-14.png","element":"img","alt":" |Lu,v|","inline":true,"padRight":true},{"text":"is the sum of ","element":"span"},{"style":{"height":12.21},"width":40.04,"height":30.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-15.png","element":"img","alt":" qI","inline":true,"padRight":true},{"text":"i.i.d. uniform Bernouli random variables. Hence, Chernoff bound in Lemma ","element":"span"},{"href":"#id-37","text":"A.1 ","element":"a"},{"text":"gives the bound on the first term in (c). To bound the second term, note that the items ","element":"span"},{"style":{"height":10.69},"width":39.04,"height":26.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-16.png","element":"img","alt":" as","inline":true,"padRight":true},{"text":"are chosen independently of feedback for ","element":"span"},{"style":{"height":15.2},"width":270.24,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-17.png","element":"img","alt":" s ≤ r. Hence,","inline":true,"padRight":true},{"text":"conditional on ","element":"span"},{"style":{"height":18.29},"width":248.08,"height":45.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-18.png","element":"img","alt":" |Lu,v| ≥ 5qI/","inline":true},{"text":"12, the variables ","element":"span"},{"style":{"height":19.06},"width":508.24,"height":47.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-19.png","element":"img","alt":" ξτU(u),τI(as)ξτU (v),τI(as) = −","inline":true},{"text":"1 with probability at least 5","element":"span"},{"text":"/","element":"span"},{"text":"12. The variables ","element":"span"},{"style":{"height":17.49},"width":258.8,"height":43.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-20.png","element":"img","alt":" zu,as and zv,as","inline":true,"padRight":true},{"text":"are independent of other parameters of the model and algorithm and ","element":"span"},{"style":{"height":13.09},"width":263.92,"height":32.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-21.png","element":"img","alt":" zu,aszv,as = −","inline":true},{"text":"1 with probability 2","element":"span"},{"style":{"height":20.05},"width":1138.56,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-22.png","element":"img","alt":"γ(1 − γ). Hence, ξτU(u),τI(as)ξτU(v),τI(as)zu,aszv,as = −1 with","inline":true,"padRight":true},{"text":"probability at least [5 + 4","element":"span"},{"style":{"height":17.6},"width":189.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-23.png","element":"img","alt":"γ(1 − γ)]/","inline":true},{"text":"12. Using Chernoff bound in Lemma ","element":"span"},{"href":"#id-37","text":"A.1 ","element":"a"},{"text":"gives the second term in (c). The assumption ","element":"span"},{"style":{"height":19.54},"width":344.4,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1711.02198/images/48-24.png","element":"img","alt":" qI > 144 log(N 2/ǫ","inline":true},{"text":") and the choice of ","element":"span"},{"text":"r ","element":"span"},{"text":"gives the result.","element":"span"}]]},{"heading":"References","paragraphs":[[{"id":"id-0","text":"[1] S. Aditya, O. Dabeer, and B. K. Dey, “A channel coding perspective of collaborative filtering,” ","element":"span"},{"text":"IEEE Transactions on Information Theory","element":"span"},{"text":", vol. 57, no. 4, pp. 2327–2341, 2011.","element":"span"}],[{"id":"id-1","text":"[2] G. Biau, B. Cadre, and L. Rouviere, “Statistical analysis of k-nearest neighbor collaborative ","element":"span"},{"text":"recommendation,” ","element":"span"},{"text":"The Annals of Statistics","element":"span"},{"text":", vol. 38, no. 3, pp. 1568–1592, 2010.","element":"span"}],[{"id":"id-2","text":"[3] E. J. Cand`es and B. Recht, “Exact matrix completion via convex optimization,” ","element":"span"},{"text":"Foundations of Computational Mathematics","element":"span"},{"text":", vol. 9, no. 6, p. 717, 2009.","element":"span"}],[{"text":"[4] P. Jain, P. Netrapalli, and S. Sanghavi, “Low-rank matrix completion using alternating minimization,” in ","element":"span"},{"text":"Proceedings of the forty-fifth annual ACM Symposium on Theory of Computing","element":"span"},{"text":". ACM, 2013, pp. 665–674.","element":"span"}],[{"id":"id-3","text":"[5] R. H. Keshavan, A. Montanari, and S. Oh, “Matrix completion from a few entries,” ","element":"span"},{"text":"IEEE Transactions on Information Theory","element":"span"},{"text":", vol. 56, no. 6, pp. 2980–2998, 2010.","element":"span"}],[{"text":"[6] S. Negahban and M. J. Wainwright, “Restricted strong convexity and weighted matrix completion: Optimal bounds with noise,” ","element":"span"},{"text":"Journal of Machine Learning Research","element":"span"},{"text":", vol. 13, no. May, pp. 1665–1697, 2012.","element":"span"}],[{"id":"id-4","text":"[7] A. Rohde and A. B. Tsybakov, “Estimation of high-dimensional low-rank matrices,” ","element":"span"},{"text":"The Annals of Statistics","element":"span"},{"text":", vol. 39, no. 2, pp. 887–930, 2011.","element":"span"}],[{"text":"[8] N. Srebro, N. Alon, and T. S. Jaakkola, “Generalization error bounds for collaborative prediction with low-rank matrices,” in ","element":"span"},{"text":"Advances In Neural Information Processing Systems","element":"span"},{"text":", 2005, pp. 1321–1328.","element":"span"}],[{"id":"id-5","text":"[9] S. Bubeck and N. Cesa-Bianchi, “Regret analysis of stochastic and nonstochastic multi-armed ","element":"span"},{"text":"bandit problems,” ","element":"span"},{"text":"Foundations and Trends in Machine Learning","element":"span"},{"text":", vol. 5, no. 1, pp. 1–122, 2012.","element":"span"}],[{"id":"id-6","text":"[10] T. L. Lai and H. Robbins, “Asymptotically efficient adaptive allocation rules,” ","element":"span"},{"text":"Advances in applied mathematics","element":"span"},{"text":", vol. 6, no. 1, pp. 4–22, 1985.","element":"span"}],[{"id":"id-7","text":"[11] D. Russo and B. Van Roy, “Learning to optimize via information-directed sampling,” in ","element":"span"},{"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", 2014, pp. 1583–1591.","element":"span"}],[{"id":"id-8","text":"[12] G. Bresler, G. H. Chen, and D. Shah, “A latent source model for online collaborative filtering,” ","element":"span"},{"text":"in ","element":"span"},{"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", 2014, pp. 3347–3355.","element":"span"}],[{"text":"[13] G. Bresler, D. Shah, and L. F. Voloch, “Collaborative filtering with low regret,” in ","element":"span"},{"text":"SIGMETRICS Performance Evaluation Review","element":"span"},{"text":", vol. 44, no. 1. ","element":"span"},{"text":"New York, NY, USA: ACM, Jun. 2016, pp. 207–220. [Online]. Available: ","element":"span"},{"href":"http://doi.acm.org/10.1145/2964791.2901469","text":"http://doi.acm.org/10.1145/2964791.2901469","element":"a"}],[{"text":"[14] B.-H. Shen, S. Ji, and J. Ye, “Mining discrete patterns via binary matrix factorization,” in ","element":"span"},{"text":"Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining","element":"span"},{"text":". ","element":"span"},{"text":"ACM, 2009, pp. 757–766.","element":"span"}],[{"text":"[15] J. Xu, R. Wu, K. Zhu, B. Hajek, R. Srikant, and L. Ying, “Jointly clustering rows and ","element":"span"},{"text":"columns of binary matrices: Algorithms and trade-offs,” in ","element":"span"},{"text":"ACM SIGMETRICS Performance Evaluation Review","element":"span"},{"text":", vol. 42, no. 1. ","element":"span"},{"text":"ACM, 2014, pp. 29–41.","element":"span"}],[{"id":"id-9","text":"[16] A. Bellogin and J. Parapar, “Using graph partitioning techniques for neighbour selection in ","element":"span"},{"text":"user-based collaborative filtering,” in ","element":"span"},{"text":"Proceedings of the sixth ACM Conference on Recommender Systems","element":"span"},{"text":". ","element":"span"},{"text":"ACM, 2012, pp. 213–216.","element":"span"}],[{"id":"id-10","text":"[17] A. S. Das, M. Datar, A. Garg, and S. Rajaram, “Google news personalization: scalable online ","element":"span"},{"text":"collaborative filtering,” in ","element":"span"},{"text":"Proceedings of the 16th international conference on World Wide Web","element":"span"},{"text":". ","element":"span"},{"text":"ACM, 2007, pp. 271–280.","element":"span"}],[{"id":"id-11","text":"[18] G. Linden, B. Smith, and J. York, “Amazon.com recommendations: Item-to-item collaborative ","element":"span"},{"text":"filtering,” ","element":"span"},{"text":"IEEE Internet computing","element":"span"},{"text":", vol. 7, no. 1, pp. 76–80, 2003.","element":"span"}],[{"text":"[19] B. Sarwar, G. Karypis, J. Konstan, and J. Riedl, “Item-based collaborative filtering recommendation algorithms,” in ","element":"span"},{"text":"Proceedings of the 10th international conference on World Wide Web","element":"span"},{"text":". ","element":"span"},{"text":"ACM, 2001, pp. 285–295.","element":"span"}],[{"id":"id-13","text":"[20] O. Dabeer, “Adaptive collaborating filtering: The low noise regime,” in ","element":"span"},{"text":"International Symposium on Information Theory Proceedings (ISIT)","element":"span"},{"text":". ","element":"span"},{"text":"IEEE, 2013, pp. 1197–1201.","element":"span"}],[{"id":"id-14","text":"[21] I. ","element":"span"},{"text":"Kerenidis ","element":"span"},{"text":"and ","element":"span"},{"text":"A. ","element":"span"},{"text":"Prakash, ","element":"span"},{"text":"“Quantum ","element":"span"},{"text":"recommendation ","element":"span"},{"text":"systems,” ","element":"span"},{"text":"arXiv ","element":"span"},{"text":"preprint arXiv:1603.08675","element":"span"},{"text":", 2016.","element":"span"}],[{"id":"id-15","text":"[22] K. Barman and O. Dabeer, “Analysis of a collaborative filter based on popularity amongst ","element":"span"},{"text":"neighbors,” ","element":"span"},{"text":"IEEE Transactions on Information Theory","element":"span"},{"text":", vol. 58, no. 12, pp. 7110–7134, 2012.","element":"span"}],[{"text":"[23] D. Song, C. E. Lee, Y. Li, and D. Shah, “Blind regression: Nonparametric regression for latent variable models via collaborative filtering,” in ","element":"span"},{"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", 2016, pp. 2155–2163.","element":"span"}],[{"id":"id-16","text":"[24] J. Wang, A. P. De Vries, and M. J. Reinders, “Unifying user-based and item-based collaborative ","element":"span"},{"text":"filtering approaches by similarity fusion,” in ","element":"span"},{"text":"Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval","element":"span"},{"text":". ","element":"span"},{"text":"ACM, 2006, pp. 501–508.","element":"span"}],[{"id":"id-17","text":"[25] B.-H. Kim, A. Yedla, and H. D. Pfister, “Imp: A message-passing algorithm for matrix com- ","element":"span"},{"text":"pletion,” in ","element":"span"},{"text":"Turbo Codes and Iterative Information Processing (ISTC), 2010 6th International Symposium on","element":"span"},{"text":". ","element":"span"},{"text":"IEEE, 2010, pp. 462–466.","element":"span"}],[{"id":"id-18","text":"[26] C. Borgs, J. Chayes, C. E. Lee, and D. Shah, “Thy friend is my friend: Iterative collaborative ","element":"span"},{"text":"filtering for sparse matrix estimation,” in ","element":"span"},{"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", 2017, pp. 4715–4726.","element":"span"}],[{"id":"id-20","text":"[27] N. Cesa-Bianchi and G. Lugosi, ","element":"span"},{"text":"Prediction, learning, and games","element":"span"},{"text":". Cambridge University Press, 2006.","element":"span"}],[{"text":"[28] F. Chung and L. Lu, “Concentration inequalities and martingale inequalities: a survey,” ","element":"span"},{"text":"Internet Mathematics","element":"span"},{"text":", vol. 3, no. 1, pp. 79–127, 2006.","element":"span"}],[{"text":"[29] C. McDiarmid, “Concentration,” in ","element":"span"},{"text":"Probabilistic methods for algorithmic discrete mathematics","element":"span"},{"text":". Springer, 1998, pp. 195–248.","element":"span"}]]}],"_version":"3.3.4"},"paperNode":"$28:props:children:props:children:0:props:product"}]]