35:[["$","audio",null,{"id":"tts"}],["$","$L3a",null,{"paperID":"2002.00495","publisher":"arxiv","paperJSON":{"title":"Active Learning for Identification of Linear Dynamical Systems","paperID":"2002.00495","avgLineHeight":13.55,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"We propose an algorithm to actively estimate the parameters of a linear dynamical system. Given complete control over the system’s input, our algorithm adaptively chooses the inputs to accelerate estimation. We show a finite time bound quantifying the estimation rate our algorithm attains and prove matching upper and lower bounds which guarantee its asymptotic optimality, up to constants. In addition, we show that this optimal rate is unattainable when using Gaussian noise to excite the system, even with optimally tuned covariance, and analyze several examples where our algorithm provably improves over rates obtained by playing noise. Our analysis critically relies on a novel result quantifying the error in estimating the parameters of a dynamical system when arbitrary periodic inputs are being played. We conclude with numerical examples that illustrate the effectiveness of our algorithm in practice.","element":"span"}],[{"style":{"width":"90%"},"width":1564,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-0.png","element":"img"}]]},{"heading":"1. Introduction","paragraphs":[[{"text":"System identification is a fundamental problem in control theory, reinforcement learning, econometrics, and time-series modeling. Given observations of the input-output behavior of a dynamical system, system identification seeks to estimate the parameters of the system. When the governing dynamics cannot be derived from first principles, this is an important tool for modeling the behavior of a system, allowing for downstream analysis and engineering. 
In this work we focus on the simplest possible dynamical system model—discrete-time, linear dynamical systems. Several recent works ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") have shown sharp rates for estimating the parameters of such systems in the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"passive ","element":"span"},{"text":"case—where the system is driven by random noise. Here we seek to understand ","element":"span"},{"style":{"fontStyle":"italic"},"text":"active ","element":"span"},{"text":"system identification—given complete control over the inputs, how can we best excite the system to accelerate estimation? Dating back to the 1970s, significant attention has been given to the problem of how to best excite systems for estimation ","element":"span"},{"href":"#id-2","referenceIndex":30,"text":"Mehra ","element":"a"},{"text":"(","element":"span"},{"href":"#id-2","referenceIndex":30,"text":"1976","element":"a"},{"text":"); ","element":"span"},{"href":"#id-3","referenceIndex":15,"text":"Goodwin and Payne ","element":"a"},{"text":"(","element":"span"},{"href":"#id-3","referenceIndex":15,"text":"1977","element":"a"},{"text":"); ","element":"span"},{"href":"#id-4","referenceIndex":4,"text":"Bombois et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-4","referenceIndex":4,"text":"2011","element":"a"},{"text":") yet these works typically lack theoretical guarantees. To the best of our knowledge, we present the first provably correct method for active system identification. 
We show finite time and asymptotic sample complexity guarantees and characterize settings in which active input design yields performance improvements.","element":"span"}],[{"text":"Formally, we consider linear dynamical systems (LDS) of the form:","element":"span"}],[{"id":"id-101","style":{"width":"63%"},"width":1094,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-1.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.75},"width":209.82,"height":44.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-2.png","element":"img","alt":" A∗ ∈ Rd×d ","inline":true,"padRight":true},{"text":"is unknown, ","element":"span"},{"style":{"height":19.13},"width":347.64,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-3.png","element":"img","alt":" B∗ ∈ Rd×p, and ηt","inline":true,"padRight":true},{"text":"is unobserved process noise. We choose the input ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-4.png","element":"img","alt":" ut","inline":true,"padRight":true},{"text":"sequentially, observe the state ","element":"span"},{"style":{"height":10.62},"width":36.94,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-5.png","element":"img","alt":" xt","inline":true},{"text":", and wish to estimate ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-6.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"from this data. 
For simplicity","element":"span"}],[{"style":{"width":"30%"},"width":523,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/0-7.png","element":"img"}],[{"text":"and ease of exposition, we assume ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-0.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"is known, though all our results can be extended to the case where ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-1.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"is unknown. From an engineering perspective, assuming ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-2.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"is known is a reasonable assumption as one may have knowledge of ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-3.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"from the design of the system actuation. Throughout, we assume that ","element":"span"},{"style":{"height":17.6},"width":422.63,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-4.png","element":"img","alt":" ρ(A∗) < 1 where ρ(A∗)","inline":true,"padRight":true},{"text":"is the spectral radius of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-5.png","element":"img","alt":" A∗","inline":true},{"text":". 
We are interested in estimating ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-6.png","element":"img","alt":"A∗","inline":true,"padRight":true},{"text":"in the spectral norm, in the case where our input is constrained to have bounded energy, that is: ","element":"span"},{"style":{"height":32.4},"width":422.4,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-7.png","element":"img","alt":"E[(1/T) Σ_{t=1}^T u_t^⊤ u_t] ≤ γ^2 ","inline":true,"padRight":true},{"text":"for some constant ","element":"span"},{"style":{"height":18.33},"width":54.95,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-8.png","element":"img","alt":" γ^2.","inline":true}],[{"text":"As we will show, the fundamental quantity that determines the sample complexity of estimation is the minimum eigenvalue of the covariates: ","element":"span"},{"style":{"height":31.6},"width":346.52,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-9.png","element":"img","alt":" λ_min(Σ_{t=1}^T x_t x_t^⊤)","inline":true},{"text":". Optimally exciting the system is then equivalent to maximizing this quantity subject to the input power constraints. 
This quantity, however, depends on ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-10.png","element":"img","alt":" A∗","inline":true},{"text":", the parameter we wish to estimate, so it cannot be optimized directly in practice.","element":"span"}],[{"text":"Our main contribution is an algorithm which balances this tradeoff by progressively updating the inputs as the estimates of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-11.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"improve, together with finite-time bounds quantifying the estimation rate it achieves, as well as the number of samples necessary to guarantee the optimally exciting inputs are being played. In addition, we present a lower bound and an asymptotic upper bound guaranteeing the asymptotic optimality of our algorithm. We show that playing Gaussian noise, even with an optimally tuned covariance, is insufficient to achieve this optimal rate. Our algorithm can be seen as an instance of adaptive E-optimal design ","element":"span"},{"href":"#id-5","referenceIndex":32,"text":"Pronzato and Pázman ","element":"a"},{"text":"(","element":"span"},{"href":"#id-5","referenceIndex":32,"text":"2013","element":"a"},{"text":").","element":"span"}],[{"text":"An important piece in our analysis is a new finite-time bound on the estimation error ","element":"span"},{"style":{"height":17.6},"width":119.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-12.png","element":"img","alt":" ∥A∗ −","inline":true},{"style":{"height":21.21},"width":71.54,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/1-13.png","element":"img","alt":"ˆA∥2","inline":true,"padRight":true},{"text":"that holds when arbitrary periodic inputs are being played. 
Previous works ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"Dean et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"2018","element":"a"},{"text":") only consider inputs that are Gaussian or state feedback. These works emphasize obtaining bounds that scale properly with the spectral radius of the system. Following this, we develop bounds that avoid a poor scaling with the spectral radius. To the best of our knowledge, this is a novel result and may be of independent interest.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.1. Related Works","element":"span"}],[{"text":"A significant body of work exists on how to optimally excite dynamical systems for identification ","element":"span"},{"href":"#id-2","referenceIndex":30,"text":"Mehra ","element":"a"},{"text":"(","element":"span"},{"href":"#id-2","referenceIndex":30,"text":"1976","element":"a"},{"text":"); ","element":"span"},{"href":"#id-3","referenceIndex":15,"text":"Goodwin and Payne ","element":"a"},{"text":"(","element":"span"},{"href":"#id-3","referenceIndex":15,"text":"1977","element":"a"},{"text":"); ","element":"span"},{"href":"#id-7","referenceIndex":22,"text":"Jansson and Hjalmarsson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-7","referenceIndex":22,"text":"2005","element":"a"},{"text":"); ","element":"span"},{"href":"#id-8","referenceIndex":13,"text":"Gevers et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-8","referenceIndex":13,"text":"2009","element":"a"},{"text":"); ","element":"span"},{"href":"#id-9","referenceIndex":27,"text":"Manchester ","element":"a"},{"text":"(","element":"span"},{"href":"#id-9","referenceIndex":27,"text":"2010","element":"a"},{"text":"); ","element":"span"},{"href":"#id-10","referenceIndex":16,"text":"Hägg et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-10","referenceIndex":16,"text":"2013","element":"a"},{"text":"). An excellent survey of classical results can be found in ","element":"span"},{"href":"#id-11","referenceIndex":29,"text":"Mehra ","element":"a"},{"text":"(","element":"span"},{"href":"#id-11","referenceIndex":29,"text":"1974","element":"a"},{"text":") and a more recent survey in ","element":"span"},{"href":"#id-4","referenceIndex":4,"text":"Bombois et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-4","referenceIndex":4,"text":"2011","element":"a"},{"text":"). Broadly speaking, earlier works tended to focus on designing inputs so as to be optimal with respect to traditional experimental design objectives. More recent works ","element":"span"},{"href":"#id-12","referenceIndex":20,"text":"Hjalmarsson et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-12","referenceIndex":20,"text":"1996","element":"a"},{"text":"); ","element":"span"},{"href":"#id-13","referenceIndex":19,"text":"Hildebrand and Gevers ","element":"a"},{"text":"(","element":"span"},{"href":"#id-13","referenceIndex":19,"text":"2002","element":"a"},{"text":"); ","element":"span"},{"href":"#id-14","referenceIndex":24,"text":"Katselis et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-14","referenceIndex":24,"text":"2012","element":"a"},{"text":") have focused on designing inputs to meet certain task-specific objectives, for instance identifying a system for the purpose of control.","element":"span"}],[{"text":"A primary difficulty in designing inputs for identification is that the design criteria, often some function of the Fisher Information Matrix, depend on the unknown parameters of the system. Several different approaches have been proposed to overcome this challenge. One line of work ","element":"span"},{"href":"#id-15","referenceIndex":33,"text":"Rojas ","element":"a"},{"href":"#id-15","referenceIndex":33,"text":"et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-15","referenceIndex":33,"text":"2007","element":"a"},{"text":", ","element":"span"},{"href":"#id-16","referenceIndex":34,"text":"2011","element":"a"},{"text":"); ","element":"span"},{"href":"#id-17","referenceIndex":25,"text":"Larsson et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-17","referenceIndex":25,"text":"2012","element":"a"},{"text":"); ","element":"span"},{"href":"#id-10","referenceIndex":16,"text":"Hägg et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-10","referenceIndex":16,"text":"2013","element":"a"},{"text":") performs robust experimental design and optimizes a minimax objective. 
More comparable to our approach are works which perform adaptive experimental design ","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"Lindqvist and Hjalmarsson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"2001","element":"a"},{"text":"); ","element":"span"},{"href":"#id-19","referenceIndex":10,"text":"Gerencsér and Hjalmarsson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-19","referenceIndex":10,"text":"2005","element":"a"},{"text":"); ","element":"span"},{"href":"#id-20","referenceIndex":3,"text":"Barenthin et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-20","referenceIndex":3,"text":"2005","element":"a"},{"text":"); ","element":"span"},{"href":"#id-21","referenceIndex":11,"text":"Gerencsér et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-21","referenceIndex":11,"text":"2007","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":12,"text":"2009","element":"a"},{"text":"), alternating between estimating the unknown parameters and designing inputs based on the current estimates.","element":"span"}],[{"text":"Existing works in active system identification lack sound theoretical guarantees and too often specialize results to single-input single-output systems. While several results guarantee asymptotic consistency ","element":"span"},{"href":"#id-21","referenceIndex":11,"text":"Gerencsér et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-21","referenceIndex":11,"text":"2007","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":12,"text":"2009","element":"a"},{"text":"), most proposed approaches are heuristic and are validated only through examples. To our knowledge, no finite-time performance bounds exist. 
In addition, many works seek to optimize quantities that only describe the asymptotic behavior of the system, for instance minimizing the asymptotic variance, and it is unclear whether these are the correct quantities to optimize over a finite time interval. Finally, existing works do not give precise, explicit algorithms.","element":"span"}],[{"text":"Recently, the machine learning community has shown considerable interest in obtaining finite-time performance guarantees for system identification and control problems. Work on the latter has primarily centered on developing finite-time regret bounds for the LQR problem with unknown dynamics ","element":"span"},{"href":"#id-23","referenceIndex":1,"text":"Abbasi-Yadkori and Szepesvári ","element":"a"},{"text":"(","element":"span"},{"href":"#id-23","referenceIndex":1,"text":"2011","element":"a"},{"text":"); ","element":"span"},{"href":"#id-24","referenceIndex":6,"text":"Dean et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-24","referenceIndex":6,"text":"2017","element":"a"},{"text":", ","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-25","referenceIndex":28,"text":"Mania et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-25","referenceIndex":28,"text":"2019","element":"a"},{"text":"); ","element":"span"},{"href":"#id-26","referenceIndex":8,"text":"Dean et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-26","referenceIndex":8,"text":"2019","element":"a"},{"text":"); ","element":"span"},{"href":"#id-27","referenceIndex":5,"text":"Cohen et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-27","referenceIndex":5,"text":"2019","element":"a"},{"text":"). 
Recent results in system identification have focused on obtaining finite-time, high-probability bounds on the estimation error of the system’s parameters when observing the evolution over time ","element":"span"},{"href":"#id-28","referenceIndex":40,"text":"Tu et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-28","referenceIndex":40,"text":"2017","element":"a"},{"text":"); ","element":"span"},{"href":"#id-29","referenceIndex":9,"text":"Faradonbeh et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-29","referenceIndex":9,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-30","referenceIndex":18,"text":"Hazan ","element":"a"},{"href":"#id-30","referenceIndex":18,"text":"et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-30","referenceIndex":18,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-31","referenceIndex":17,"text":"Hardt et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-31","referenceIndex":17,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-32","referenceIndex":31,"text":"Oymak and ","element":"a"},{"href":"#id-32","referenceIndex":31,"text":"Ozay ","element":"a"},{"text":"(","element":"span"},{"href":"#id-32","referenceIndex":31,"text":"2019","element":"a"},{"text":"); ","element":"span"},{"href":"#id-33","referenceIndex":38,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-33","referenceIndex":38,"text":"2019","element":"a"},{"text":"); ","element":"span"},{"href":"#id-34","referenceIndex":36,"text":"Sarkar et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-34","referenceIndex":36,"text":"2019","element":"a"},{"text":"); ","element":"span"},{"href":"#id-35","referenceIndex":39,"text":"Tsiamis and Pappas ","element":"a"},{"text":"(","element":"span"},{"href":"#id-35","referenceIndex":39,"text":"2019","element":"a"},{"text":"). Existing results rely on excitation from random noise to guarantee learning and do not consider the problem of learning with arbitrary sequences of inputs or optimally choosing inputs for excitation.","element":"span"}],[{"text":"In the context of the existing literature, this work can be seen as the first rigorous treatment of active system identification and the first work to provide finite-time performance guarantees for the problem, bridging the gap between classical approaches and modern machine learning techniques. Indeed, our algorithm is similar to the adaptive input design approach in ","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"Lindqvist and Hjalmarsson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"2001","element":"a"},{"text":"); our work can be seen as making their algorithm more precise and providing finite-time performance and asymptotic optimality guarantees. Our analysis framework is general enough that it could be extended to different experimental design criteria proposed in the existing literature.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.2. 
Notation","element":"span"}],[{"text":"We will let ","element":"span"},{"style":{"height":17.6},"width":89.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-0.png","element":"img","alt":" ρ(A)","inline":true,"padRight":true},{"text":"denote the spectral radius of ","element":"span"},{"style":{"height":17.6},"width":157.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-1.png","element":"img","alt":" A. ∥ · ∥2","inline":true,"padRight":true},{"text":"denotes the spectral norm of a matrix. ","element":"span"},{"style":{"height":20.41},"width":101.14,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-2.png","element":"img","alt":"˜O( · )","inline":true,"padRight":true},{"text":"hides log factors. We assume throughout that ","element":"span"},{"style":{"height":19.13},"width":279.94,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-3.png","element":"img","alt":" ηt ∼ N(0, σ2I)","inline":true,"padRight":true},{"text":"though all results can be extended to more general noise distributions. Let:","element":"span"}],[{"style":{"width":"62%"},"width":1085,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-4.png","element":"img"}],[{"text":"and ","element":"span"},{"style":{"height":21.03},"width":642.05,"height":52.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-5.png","element":"img","alt":" Γt := Γt(A∗), ΓB∗t := ΓB∗t (A∗). 
Γt","inline":true,"padRight":true},{"text":"is the expected value of ","element":"span"},{"style":{"height":21.03},"width":562.92,"height":52.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-6.png","element":"img","alt":" xtx⊤t when ut = 0, ∀t, and ΓB∗t","inline":true,"padRight":true},{"text":"is the expected value of ","element":"span"},{"style":{"height":18.4},"width":685.16,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-7.png","element":"img","alt":" xtx⊤t when ut ∼ N(0, I), ηt = 0, ∀t","inline":true},{"text":". In the case when the input is a ","element":"span"},{"text":"deterministic, periodic signal of period ","element":"span"},{"style":{"height":22.86},"width":448.17,"height":57.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-8.png","element":"img","alt":" k and 1k�kt=1 u⊤t ut = γ2","inline":true},{"text":", then setting ","element":"span"},{"style":{"height":15.6},"width":116.29,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-9.png","element":"img","alt":" ηt = 0","inline":true,"padRight":true},{"text":"and applying ","element":"span"},{"text":"this input on the system with parameters ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"B ","element":"span"},{"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", we denote the steady state covariates as:","element":"span"}],[{"style":{"width":"97%"},"width":1687,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-10.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":14.84},"width":45.8,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-11.png","element":"img","alt":" Uℓ","inline":true,"padRight":true},{"text":"denotes the Discrete Fourier 
Transform of ","element":"span"},{"style":{"height":19.81},"width":314.29,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-12.png","element":"img","alt":" {ut}kt=1. Here (a)","inline":true,"padRight":true},{"text":"holds by Parseval’s Theorem. ","element":"span"},{"text":"Let ","element":"span"},{"style":{"height":20.12},"width":403.14,"height":50.31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-13.png","element":"img","alt":" Γuk := Γuk(A∗, B∗). ¯ΓT","inline":true,"padRight":true},{"text":"will denote an upper bound on the covariates: ","element":"span"},{"style":{"height":22},"width":431.2,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-14.png","element":"img","alt":" �Tt=1 xtx⊤t ⪯ T ¯ΓT . We","inline":true,"padRight":true},{"text":"will specify its precise form as needed. To aid in analyzing the transient behavior of a system, let:","element":"span"}],[{"style":{"width":"51%"},"width":888,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-15.png","element":"img"}],[{"style":{"height":17.6},"width":93.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-16.png","element":"img","alt":"β(A)","inline":true,"padRight":true},{"text":"is then the smallest value such that ","element":"span"},{"style":{"height":19.53},"width":979.54,"height":48.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/2-17.png","element":"img","alt":" ∥Ak∥2 ≤ β(A)(1/2 + ρ(A)/2)k for all k ≥ 0, and is","inline":true,"padRight":true},{"text":"always finite. We give a more thorough discussion of this parameter in Appendix ","element":"span"},{"text":"A","element":"span"},{"text":". To determine the optimal inputs, we will solve the following optimization problem. 
As we make clear in Section ","element":"span"},{"href":"#id-36","text":"2.1","element":"a"},{"text":", the fundamental quantity that controls the sample complexity of estimation is the minimum eigenvalue of the covariates, the quantity ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"maximizes:","element":"span"}],[{"style":{"width":"97%"},"width":1682,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-0.png","element":"img"}],[{"text":"Here ","element":"span"},{"style":{"height":17.6},"width":141.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-1.png","element":"img","alt":" I ⊆ [k]","inline":true,"padRight":true},{"text":"is the set of frequencies we are optimizing over, ","element":"span"},{"style":{"height":14.81},"width":31,"height":37.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-2.png","element":"img","alt":"¯T","inline":true,"padRight":true},{"text":"is the time horizon we will play the inputs for, ","element":"span"},{"style":{"height":14.84},"width":45.79,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-3.png","element":"img","alt":" Uℓ","inline":true,"padRight":true},{"text":"is the DFT of ","element":"span"},{"style":{"height":21.4},"width":324.04,"height":53.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-4.png","element":"img","alt":" u1, ..., uk, and ¯Uγ2","inline":true,"padRight":true},{"text":"is the set of mean-zero signals of length ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"with average power bounded by ","element":"span"},{"style":{"height":18.33},"width":42.02,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-5.png","element":"img","alt":" γ2","inline":true},{"text":". The constraint that the signal be mean zero is for technical reasons and does not affect the results. 
We let ","element":"span"},{"style":{"height":18.58},"width":61.64,"height":46.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-6.png","element":"img","alt":" Uγ2","inline":true,"padRight":true},{"text":"denote the same set without the constraint that the signal be mean 0. In some cases we will overload notation, letting ","element":"span"},{"style":{"height":19.13},"width":672.74,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-7.png","element":"img","alt":" OptInputk(A, B∗, γ2, I, M) denote","inline":true},{"style":{"height":19.81},"width":637.47,"height":49.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-8.png","element":"img","alt":"OptInputk(A, B∗, γ2, I, {xt}Tt=1)","inline":true,"padRight":true},{"text":"but with the ","element":"span"},{"style":{"height":22},"width":190.94,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-9.png","element":"img","alt":" �Tt=1 xtx⊤t ","inline":true,"padRight":true},{"text":"term in the optimization replaced by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":". In addition, we will sometimes use ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"to refer to the maximum value of the optimization, and sometimes to refer to the inputs attaining that maximum—it will be clear from context which we are referring to.","element":"span"}]]},{"heading":"2. Main Results","paragraphs":[[{"text":"Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"proceeds in epochs, successively improving its input design as its estimate of ","element":"span"},{"style":{"height":15.42},"width":125.32,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-10.png","element":"img","alt":" A∗ im-","inline":true,"padRight":true},{"text":"proves. 
At each epoch, the input computed in the previous epoch is played (line ","element":"span"},{"text":"11","element":"span"},{"text":"), and ","element":"span"},{"style":{"height":15.42},"width":139.98,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-11.png","element":"img","alt":" A∗ esti-","inline":true,"padRight":true},{"text":"mated from the data collected (line ","element":"span"},{"text":"12","element":"span"},{"text":"). Using this estimate, a set of inputs is designed to excite the estimated system (line ","element":"span"},{"text":"15","element":"span"},{"text":"), and these inputs are played on the real system in the subsequent epoch, yielding a new estimate of ","element":"span"},{"style":{"height":15.42},"width":49.72,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-12.png","element":"img","alt":" A∗","inline":true},{"text":". This procedure continues with exponentially growing epoch length.","element":"span"}],[{"id":"id-37","style":{"width":"102%"},"width":1769,"height":1068,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/3-13.png","element":"img"}],[{"text":"UpdateInputs ","element":"span"},{"text":"pseudocode (full definition in Appendix ","element":"span"},{"text":"A","element":"span"},{"text":")","element":"span"}],[{"text":"1: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"function ","element":"span"},{"style":{"height":19.81},"width":768.92,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-0.png","element":"img","alt":" UpdateInputs(A, B, {x_t}_{t=1}^T, γ^2, k, ϵ, FT)","inline":true,"padRight":true},{"text":"2: ","element":"span"},{"text":"Check if ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-1.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"is small enough to plan with all frequencies; if so, set 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"I ","element":"span"},{"text":"= [","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"] ","element":"span"},{"text":"3: ","element":"span"},{"text":"Otherwise set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I ","element":"span"},{"text":"to include frequencies we can guarantee will sufficiently excite the system ","element":"span"},{"text":"4: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"if ","element":"span"},{"style":{"height":24.81},"width":1039.9,"height":62.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-2.png","element":"img","alt":" FT == True: return OptInputk(A, B, γ22 , I, {xt}Tt=1)","inline":true}],[{"style":{"width":"100%"},"width":1728,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-3.png","element":"img"}],[{"text":"The ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"flag in ","element":"span"},{"text":"UpdateInputs ","element":"span"},{"text":"controls how the inputs are designed. With ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"text":"True ","element":"span"},{"text":"(the finite time case), the algorithm does not take into account the expected future contribution due to noise when designing the inputs. Results for this case are outlined in Section ","element":"span"},{"href":"#id-38","text":"2.3","element":"a"},{"text":". With ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"text":"False ","element":"span"},{"text":"(the asymptotic case), the algorithm does take into account the estimated future contribution due to noise when designing the inputs. 
Results for this case are outlined in Section ","element":"span"},{"href":"#id-36","text":"2.1","element":"a"},{"text":".","element":"span"}],[{"id":"id-36","style":{"fontWeight":"bold"},"text":"2.1. Asymptotic Optimality of Algorithm ","element":"span"},{"href":"#id-37","style":{"fontWeight":"bold"},"text":"1","element":"a"}],[{"text":"We show that our algorithm is asymptotically optimal—up to constants, no algorithm can estimate ","element":"span"},{"style":{"height":15.42},"width":49.72,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-4.png","element":"img","alt":"A∗","inline":true,"padRight":true},{"text":"more quickly as ","element":"span"},{"style":{"height":12.8},"width":111.24,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-5.png","element":"img","alt":" δ → 0","inline":true},{"text":". We first present a lower bound for estimating linear dynamical systems actively. We call an algorithm ","element":"span"},{"style":{"height":17.6},"width":92.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-6.png","element":"img","alt":" (ϵ, δ)","inline":true},{"text":"-locally-stable in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"text":"if there exists a finite time ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-7.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"such that for all ","element":"span"},{"style":{"height":21.21},"width":1258.8,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-8.png","element":"img","alt":" t ≥ τ and all A′ ∈ B(A, 3ϵ): PA′(∥ ˆAt − A′∥2 ≤ ϵ) ≥ 1 − δ. 
Here PA′","inline":true,"padRight":true},{"text":"is the measure induced when the true matrix is ","element":"span"},{"style":{"height":21.21},"width":1042.29,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-9.png","element":"img","alt":" A′, B(A, 3ϵ) := {A′ ∈ Rd×d : ∥A − A′∥2 ≤ 3ϵ}, and ˆAt","inline":true,"padRight":true},{"text":"is the estimate obtained by the algorithm after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"observations. The sample complexity ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-10.png","element":"img","alt":" τϵδ","inline":true,"padRight":true},{"text":"is the infimum of all times ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-11.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"satisfying the above definition. This condition was introduced in ","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"Jedra and Proutiere ","element":"a"},{"text":"(","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"2019","element":"a"},{"text":") and allows us to avoid trivial algorithms that simply return ","element":"span"},{"style":{"height":19.43},"width":154.86,"height":48.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-12.png","element":"img","alt":"ˆAt = A∗","inline":true,"padRight":true},{"text":"for all time. 
Also define:","element":"span"}],[{"style":{"width":"65%"},"width":1131,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-13.png","element":"img"}],[{"text":"Note that, by Lemma ","element":"span"},{"href":"#id-40","text":"H.2 ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-41","text":"H.3","element":"a"},{"text":", this limit exists and is equal to the limit obtained by replacing ","element":"span"},{"style":{"height":14.73},"width":33.82,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-14.png","element":"img","alt":" 2^i ","inline":true,"padRight":true},{"text":"with any other sequence ","element":"span"},{"style":{"height":14.62},"width":346.95,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-15.png","element":"img","alt":" ni → ∞ as i → ∞.","inline":true}],[{"id":"id-42","style":{"fontWeight":"bold"},"text":"Theorem 2.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume there exists a finite ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"such that the input ","element":"span"},{"style":{"height":22.86},"width":618.22,"height":57.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-16.png","element":"img","alt":" ut satisfies (1/k) ∑_{t=1}^{k} u_{s+t}^⊤ u_{s+t} ≤ γ^2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for any ","element":"span"},{"style":{"height":17.6},"width":377.02,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-17.png","element":"img","alt":" s ≥ 0. 
Then for (ϵ, δ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"small enough, any ","element":"span"},{"style":{"height":17.6},"width":92.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-18.png","element":"img","alt":" (ϵ, δ)","inline":true},{"style":{"fontStyle":"italic"},"text":"-locally-stable in ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-19.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"algorithm will have:","element":"span"}],[{"style":{"width":"50%"},"width":867,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-20.png","element":"img"}],[{"id":"id-43","style":{"fontWeight":"bold"},"text":"Theorem 2.2 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume we are running Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"False","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
Then for any ","element":"span"},{"style":{"height":17.6},"width":218.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-21.png","element":"img","alt":" δ, ϵ ∈ (0, 1),","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"there exists a deterministic ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-22.png","element":"img","alt":" τϵδ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that, for any ","element":"span"},{"style":{"height":15.24},"width":318.42,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-23.png","element":"img","alt":" T ≥ τϵδ where T","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is at an epoch boundary, we have: ","element":"span"},{"style":{"height":32.4},"width":429.51,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-24.png","element":"img","alt":" P�∥ ˆA − A∗∥2 > ϵ�≤ δ","inline":true},{"style":{"fontStyle":"italic"},"text":", and, for small enough ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-25.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and some universal constant ","element":"span"},{"style":{"fontStyle":"italic"},"text":"C","element":"span"},{"style":{"fontStyle":"italic"},"text":":","element":"span"}],[{"style":{"width":"52%"},"width":902,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/4-26.png","element":"img"}],[{"text":"The proof of Theorem ","element":"span"},{"href":"#id-42","text":"2.1 ","element":"a"},{"text":"is given in Section ","element":"span"},{"text":"G ","element":"span"},{"text":"and the proof of Theorem ","element":"span"},{"href":"#id-43","text":"2.2 
","element":"a"},{"text":"is given in Section ","element":"span"},{"href":"#id-44","text":"B.3","element":"a"},{"text":". It follows that up to constant factors, Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"is asymptotically optimal. The fundamental value present in both the upper and lower bound controlling the sample complexity of estimation is ","element":"span"},{"style":{"height":19.13},"width":390.8,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-0.png","element":"img","alt":" λmin(σ2Γ∞ + γ2Γu∞)","inline":true},{"text":", the minimum eigenvalue of the expected covariates when the input ","element":"span"},{"style":{"fontStyle":"italic"},"text":"u ","element":"span"},{"text":"is ","element":"span"},{"text":"being played. Optimally exciting the system for identification is then equivalent to choosing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"u ","element":"span"},{"text":"so as to maximize ","element":"span"},{"style":{"height":19.13},"width":397.83,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-1.png","element":"img","alt":" λmin(σ2Γ∞ + γ2Γu∞).","inline":true}],[{"id":"id-51","style":{"fontWeight":"bold"},"text":"2.2. 
Suboptimality of Colored Noise","element":"span"}],[{"text":"While Theorem ","element":"span"},{"href":"#id-42","text":"2.1 ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-43","text":"2.2 ","element":"a"},{"text":"together show that the optimal performance can be attained in the limit by periodic inputs, it may seem reasonable that one could attain a similar rate by playing the optimal noise—setting ","element":"span"},{"style":{"height":18.4},"width":265.24,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-2.png","element":"img","alt":" ut ∼ N(0, Σ∗)","inline":true,"padRight":true},{"text":"for the optimal choice of ","element":"span"},{"style":{"height":12.33},"width":48.52,"height":30.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-3.png","element":"img","alt":" Σ∗ ","inline":true,"padRight":true},{"text":"that satisfies the expected power constraint. We show this is false. Consider the following example. Let ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-4.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"be PSD with eigenvalues ","element":"span"},{"style":{"height":17.6},"width":462.35,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-5.png","element":"img","alt":"λ = [λ1, . . . , λd], B∗ = I","inline":true},{"text":", and assume that ","element":"span"},{"style":{"height":18.33},"width":161.4,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-6.png","element":"img","alt":" γ2 ≫ σ2","inline":true},{"text":". 
We show in the proof of Corollary ","element":"span"},{"href":"#id-45","text":"3.1 ","element":"a"},{"text":"that ","element":"span"},{"style":{"height":24.16},"width":934.61,"height":60.39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-7.png","element":"img","alt":"maxu∈Uγ2 λmin(σ2Γ∞+γ2Γu∞) = Θ(γ2/∥1 − λ∥22)","inline":true},{"text":". In contrast, when playing ","element":"span"},{"style":{"height":18.4},"width":321.97,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-8.png","element":"img","alt":" ut ∼ N(0, Σ∗), as","inline":true,"padRight":true},{"text":"we show in Appendix ","element":"span"},{"text":"I","element":"span"},{"text":", we will have that ","element":"span"},{"style":{"height":19.89},"width":983.01,"height":49.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-9.png","element":"img","alt":" λmin(σ2Γ∞ + ∑∞s=0 AsΣ∗(As)⊤) = Θ(γ2/∥1 − λ∥1).","inline":true,"padRight":true},{"text":"Note here that ","element":"span"},{"style":{"height":19.9},"width":615.49,"height":49.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-10.png","element":"img","alt":" λmin(σ2Γ∞ + ∑∞s=0 AsΣ∗(As)⊤)","inline":true,"padRight":true},{"text":"upper bounds the minimum eigenvalue of the ","element":"span"},{"text":"expected covariates when ","element":"span"},{"style":{"height":18.4},"width":265.25,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-11.png","element":"img","alt":" ut ∼ N(0, Σ∗)","inline":true},{"text":". Depending on the values of ","element":"span"},{"style":{"height":12.8},"width":26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-12.png","element":"img","alt":" λ","inline":true},{"text":", there is a clear gap between these quantities. 
For example, if ","element":"span"},{"style":{"height":17.6},"width":546.05,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-13.png","element":"img","alt":" λi = 1 − 1/d for i = 1, ..., d","inline":true},{"text":", the upper bound on the sample complexity of our algorithm is ","element":"span"},{"style":{"height":19.13},"width":298.75,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-14.png","element":"img","alt":" Θ(σ2ϵ−2/(dγ2))","inline":true,"padRight":true},{"text":"while the lower bound on the sample complexity when playing optimal noise is ","element":"span"},{"style":{"height":19.13},"width":518.04,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-15.png","element":"img","alt":" Θ(σ2ϵ−2/γ2), a gap of Θ(d)","inline":true},{"text":". Note that existing works on system identification ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") only apply to the case when the input is zero-mean noise and are thus insufficient to guarantee optimal rates.","element":"span"}],[{"id":"id-38","style":{"fontWeight":"bold"},"text":"2.3. Finite Time Performance of Algorithm ","element":"span"},{"href":"#id-37","style":{"fontWeight":"bold"},"text":"1","element":"a"}],[{"text":"We next present our main result quantifying the finite time performance of Algorithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":". 
Throughout, we let ","element":"span"},{"style":{"height":24},"width":254.74,"height":60.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-16.png","element":"img","alt":" T = ∑_{j=0}^{i} T_j","inline":true},{"text":", the total time elapsed after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"epochs, and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"text":") ","element":"span"},{"text":"the value of ","element":"span"},{"style":{"height":15.02},"width":174.59,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-17.png","element":"img","alt":" ki after T","inline":true,"padRight":true},{"text":"steps. If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is at an epoch boundary, ","element":"span"},{"style":{"height":20.33},"width":884.31,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-18.png","element":"img","alt":" k(T) = k_0 2^{log(2T/T_0+1)/log 3 − 1} ≈ O((T/T_0)^{0.63}).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Theorem 2.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Informal","element":"span"},{"style":{"fontStyle":"italic"},"text":") Assume that ","element":"span"},{"style":{"height":14.62},"width":42.5,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-19.png","element":"img","alt":" T0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is chosen sufficiently large relative to ","element":"span"},{"style":{"height":16},"width":277.05,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-20.png","element":"img","alt":" k0. 
Then for T","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"large enough, with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"True","element":"span"},{"style":{"fontStyle":"italic"},"text":", Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"will achieve the following rate:","element":"span"}],[{"id":"id-46","style":{"width":"94%"},"width":1633,"height":240,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-21.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and will produce inputs satisfying ","element":"span"},{"style":{"height":32.4},"width":620.11,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-22.png","element":"img","alt":" E�1/T �Tt=1 u⊤t ut�≤ γ2. Here C","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a universal constant, ","element":"span"},{"style":{"height":12.73},"width":83.66,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-23.png","element":"img","alt":" u∗ is","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the solution to ","element":"span"},{"style":{"height":21.56},"width":1460.13,"height":53.91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-24.png","element":"img","alt":" OptInputk(T)(A∗, B∗, γ2, k(T), 0), and ¯ΓT = I · O(β(A∗)2γ2T/(1 − ρ(A∗))2).","inline":true}],[{"text":"Note that our finite time rate critically depends on the minimum eigenvalue of the expected covariates. 
At a high level, Theorem ","element":"span"},{"href":"#id-46","text":"2.3 ","element":"a"},{"text":"provides a finite sample bound on the error in the estimates produced by Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"and states that once ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is large enough, despite lacking knowledge of the true system parameters, Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"will play inputs that maximize ","element":"span"},{"style":{"height":20.05},"width":210.7,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-25.png","element":"img","alt":" λmin(γ2Γuk)","inline":true},{"text":". As was shown ","element":"span"},{"text":"in Section ","element":"span"},{"href":"#id-36","text":"2.1","element":"a"},{"text":", the fundamental quantity that controls the estimation rate is ","element":"span"},{"style":{"height":19.13},"width":390.06,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-26.png","element":"img","alt":" λmin(σ2Γ∞ + γ2Γu∞)","inline":true,"padRight":true},{"text":"which, in finite time, can be thought of as ","element":"span"},{"style":{"height":20.05},"width":701.82,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-27.png","element":"img","alt":" λmin(σ2Γk + γ2Γuk). When γ2 ≫ σ2","inline":true},{"text":", maximizing ","element":"span"},{"style":{"height":20.05},"width":210.69,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-28.png","element":"img","alt":"λmin(γ2Γuk)","inline":true,"padRight":true},{"text":"is essentially equivalent to maximizing ","element":"span"},{"href":"#id-46","style":{"height":20.05},"width":614.42,"height":50.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/5-29.png","element":"img","alt":" λmin(σ2Γk + γ2Γuk). 
Theorem 2.3","inline":true,"padRight":true},{"text":"then guarantees in this case that Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"plays the inputs that best excite the system for estimation.","element":"span"}],[{"text":"The proof of this theorem is sketched in Section ","element":"span"},{"text":"4 ","element":"span"},{"text":"and given in full in Section ","element":"span"},{"href":"#id-47","text":"B.1","element":"a"},{"text":". A full version of this result is presented as Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"in Appendix ","element":"span"},{"text":"B","element":"span"},{"text":", where we quantify formally how large ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"must be for the rate given in Theorem ","element":"span"},{"href":"#id-46","text":"2.3 ","element":"a"},{"text":"to apply. Corollary ","element":"span"},{"href":"#id-45","text":"3.1 ","element":"a"},{"text":"works this out explicitly in a simplified setting. Intuitively, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"must be large enough for the transient effects of the last input to have dissipated, and for ","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-0.png","element":"img","alt":" ϵi−1","inline":true,"padRight":true},{"text":"to be small enough to guarantee we are playing inputs that achieve nearly optimal performance. The former quantity scales as ","element":"span"},{"style":{"height":20.41},"width":338.09,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-1.png","element":"img","alt":"˜O (1/(1 − ρ(A∗)))","inline":true},{"text":". The latter depends on the system parameters in a complicated fashion. 
In the case where ","element":"span"},{"style":{"height":15.42},"width":49.72,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-2.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"is diagonalizable with largest and smallest magnitude eigenvalues ","element":"span"},{"style":{"height":15.24},"width":181.07,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-3.png","element":"img","alt":" λ1 and λd","inline":true},{"text":", respectively, and ","element":"span"},{"style":{"height":14.62},"width":50.09,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-4.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"allows for sufficient excitation of all modes, then when ","element":"span"},{"style":{"height":24.72},"width":274.72,"height":61.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-5.png","element":"img","alt":"1/(1−|λ1|) ≫ 1/(1−|λd|)","inline":true},{"text":", it will behave like ","element":"span"},{"style":{"height":21.87},"width":509.83,"height":54.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-6.png","element":"img","alt":" ˜O((1 − |λd|)^4/(1 − |λ1|)^4).","inline":true,"padRight":true},{"text":"If ","element":"span"},{"style":{"height":17.6},"width":195.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-7.png","element":"img","alt":" |λ1| ≈ |λd|","inline":true,"padRight":true},{"text":"it will behave like ","element":"span"},{"style":{"height":20.41},"width":321.18,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-8.png","element":"img","alt":"˜O(1/(1 − |λ1|)^2).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Remark 2.4 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"If 
","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-9.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is also unknown, it is still possible to run a procedure similar to Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1","element":"a"},{"style":{"fontStyle":"italic"},"text":", choosing the inputs to improve estimation of both ","element":"span"},{"style":{"height":15.42},"width":195.29,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-10.png","element":"img","alt":" A∗ and B∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"simultaneously. In this case, we minimize the same least squares objective but now over both ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"B","element":"span"},{"style":{"fontStyle":"italic"},"text":". Theorem ","element":"span"},{"href":"#id-49","style":{"fontStyle":"italic"},"text":"2.6 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"can be modified to bound the error","element":"span"},{"style":{"height":24.11},"width":442.95,"height":60.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-11.png","element":"img","alt":"��A∗ B∗�−� ˆA ˆB��2","inline":true},{"style":{"fontStyle":"italic"},"text":", but the error scales instead with:","element":"span"}],[{"style":{"width":"28%"},"width":490,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"In this setting, the optimal design is one that maximizes this minimum eigenvalue. 
To obtain a result similar to Theorem ","element":"span"},{"href":"#id-46","style":{"fontStyle":"italic"},"text":"2.3","element":"a"},{"style":{"fontStyle":"italic"},"text":", a version of Theorem ","element":"span"},{"href":"#id-50","style":{"fontStyle":"italic"},"text":"4.1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is needed to quantify how suboptimal our choice of input may be given only estimates of ","element":"span"},{"style":{"height":15.42},"width":187.58,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-13.png","element":"img","alt":" A∗ and B∗","inline":true},{"style":{"fontStyle":"italic"},"text":". A fairly straightforward extension of the argument used to obtain Theorem ","element":"span"},{"href":"#id-50","style":{"fontStyle":"italic"},"text":"4.1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"can be used to argue such a bound, allowing a version of Theorem ","element":"span"},{"href":"#id-46","style":{"fontStyle":"italic"},"text":"2.3 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"to be proved.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark 2.5 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The update of ","element":"span"},{"style":{"height":10.22},"width":29.71,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-14.png","element":"img","alt":" ϵi","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"in Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"requires knowledge of the true system parameters to compute ","element":"span"},{"style":{"height":23.99},"width":219.85,"height":59.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-15.png","element":"img","alt":"¯ΓT , Γki, ΓB∗ki ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":". 
In practice, bootstrapped estimates of these quantities could be used. Further, these terms only appear logarithmically and will not be the dominant terms in the expression. Experimentally, we found that greedily designing our inputs with respect to ","element":"span"},{"style":{"height":19.43},"width":44.73,"height":48.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-16.png","element":"img","alt":"ˆAi","inline":true},{"style":{"fontStyle":"italic"},"text":", equivalent to solving ","element":"span"},{"style":{"height":21.49},"width":945.96,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-17.png","element":"img","alt":" UpdateInputs( ˆAi, B∗, {xt}Tt=1, γ2, 2k(T), 0, FT)","inline":true},{"style":{"fontStyle":"italic"},"text":", yielded better performance and did not ","element":"span"},{"style":{"fontStyle":"italic"},"text":"require any estimate of ","element":"span"},{"style":{"height":10.22},"width":42.24,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-18.png","element":"img","alt":" ϵi.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"2.4. Estimating Dynamical Systems With Periodic Inputs","element":"span"}],[{"text":"As was shown in Section ","element":"span"},{"href":"#id-51","text":"2.2","element":"a"},{"text":", exciting a system with random noise is insufficient to obtain optimal estimation rates. Relying on carefully designed periodic inputs, Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"is able to attain this optimal rate. Showing this critically requires bounding the estimation error when arbitrary periodic inputs are being played. The following result quantifies this and can be thought of as a novel extension of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") to non-noise inputs. This result may be of independent interest and is proved in Section ","element":"span"},{"text":"E","element":"span"},{"text":".","element":"span"}],[{"id":"id-49","style":{"fontWeight":"bold"},"text":"Theorem 2.6 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that we start from initial state ","element":"span"},{"style":{"height":10.62},"width":41.94,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-19.png","element":"img","alt":" x0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and play input ","element":"span"},{"style":{"height":16.72},"width":452.64,"height":41.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-20.png","element":"img","alt":" ut = ˜ut + ηut where ˜ut is","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"deterministic with period ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and average power ","element":"span"},{"style":{"height":19.13},"width":892.18,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-21.png","element":"img","alt":" γ2 ≥ 0, and ηut ∼ N(0, σ2uI) with σ2u ≥ 0. Let Tss","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be some value satisfying ","element":"span"},{"style":{"height":20.41},"width":447.82,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-22.png","element":"img","alt":" Tss = ˜O(1/(1 − ρ(A∗)))","inline":true},{"style":{"fontStyle":"italic"},"text":". 
Then as long as:","element":"span"}],[{"style":{"width":"94%"},"width":1626,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/6-23.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"we have:","element":"span"}],[{"style":{"width":"90%"},"width":1565,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":31.6},"width":1191.95,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-1.png","element":"img","alt":"¯ΓT = 4�1T�Tt=0 x˜ut x˜ut⊤ + Tr(σ2ΓT + σ2uΓB∗T )(1 + log 2δ)I�, c, C","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are universal constants,","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-2.png","element":"img","alt":" x˜ut ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the (deterministic) response of the system to ","element":"span"},{"style":{"height":14.22},"width":147.59,"height":35.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-3.png","element":"img","alt":" ut = ˜ut.","inline":true}],[{"text":"Note, critically, the ","element":"span"},{"style":{"height":17.32},"width":47.27,"height":43.29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-4.png","element":"img","alt":" Γuk ","inline":true,"padRight":true},{"text":"term in the denominator. This term quantifies how the estimation error ","element":"span"},{"text":"scales in terms of the interaction between the input and the system.","element":"span"}]]},{"heading":"3. Interpreting the Results","paragraphs":[[{"text":"We next present several corollaries to Theorem ","element":"span"},{"href":"#id-46","text":"2.3","element":"a"},{"text":". 
Let ","element":"span"},{"style":{"height":15.42},"width":246.72,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-5.png","element":"img","alt":" A∗ = V ΛV ⊤ ","inline":true,"padRight":true},{"text":"for orthogonal ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V ","element":"span"},{"text":", real, diagonal ","element":"span"},{"style":{"height":15.6},"width":398.5,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-6.png","element":"img","alt":" Λ ⪰ 0, and B∗ = I","inline":true},{"text":". Denote the eigenvalues of ","element":"span"},{"style":{"height":15.64},"width":631.05,"height":39.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-7.png","element":"img","alt":" A∗ as λ1 ≥ λ2 ≥ . . . ≥ λd and","inline":true},{"style":{"height":17.6},"width":339.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-8.png","element":"img","alt":"λ = [λ1, λ2, ..., λd]","inline":true},{"text":". To aid in interpretability, assume that ","element":"span"},{"style":{"height":23.54},"width":350.82,"height":58.86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-9.png","element":"img","alt":"11−λ1 ≫ 11−λ2 , 11−λd ","inline":true,"padRight":true},{"text":"is small enough to ","element":"span"},{"text":"be thought of as a small constant factor, ","element":"span"},{"style":{"height":21.29},"width":424.52,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-10.png","element":"img","alt":" γ2 > σ2, and, log 1δ > 1","inline":true},{"text":". 
We then have the following.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Corollary 3.1 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Symmetric ","element":"span"},{"style":{"height":16.4},"width":224.92,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-11.png","element":"img","alt":" A∗) Let Tinit","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be some value satisfying:","element":"span"}],[{"style":{"width":"67%"},"width":1164,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"then after ","element":"span"},{"style":{"height":14.62},"width":170.89,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-13.png","element":"img","alt":" T ≥ Tinit","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"steps, running Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"True ","element":"span"},{"style":{"fontStyle":"italic"},"text":"will produce an estimate satisfying, with high probability:","element":"span"}],[{"id":"id-45","style":{"width":"46%"},"width":803,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-14.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"while instead playing ","element":"span"},{"style":{"height":24.81},"width":282.9,"height":62.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-15.png","element":"img","alt":" ut ∼ N(0, γ2d I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all time, our estimate will satisfy, with high 
probability:","element":"span"}],[{"style":{"width":"38%"},"width":659,"height":138,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-16.png","element":"img"}],[{"text":"In the high SNR regime of ","element":"span"},{"style":{"height":18.33},"width":191.28,"height":45.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-17.png","element":"img","alt":" γ2 ≫ dσ2","inline":true},{"text":", the leading constant for the rate attained by Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"behaves as ","element":"span"},{"style":{"height":27.02},"width":135.01,"height":67.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-18.png","element":"img","alt":"σ∥1−λ∥2γ","inline":true,"padRight":true},{"text":"compared to a leading constant of ","element":"span"},{"style":{"height":27.37},"width":66.73,"height":68.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-19.png","element":"img","alt":"σ√dγ","inline":true,"padRight":true},{"text":"when playing ","element":"span"},{"style":{"height":24.81},"width":409.46,"height":62.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-20.png","element":"img","alt":" ut ∼ N(0, γ2d I). 
Note","inline":true,"padRight":true},{"text":"that in both cases the expected average power is ","element":"span"},{"style":{"height":18.33},"width":54.94,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-21.png","element":"img","alt":" γ2.","inline":true}],[{"text":"Now let ","element":"span"},{"style":{"height":15.42},"width":190.81,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-22.png","element":"img","alt":" A∗ and B∗","inline":true,"padRight":true},{"text":"be block diagonal matrices where ","element":"span"},{"style":{"height":20.15},"width":693.72,"height":50.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-23.png","element":"img","alt":" Aj ∈ Rdj×dj and Bj ∈ Rdj×pj denote","inline":true,"padRight":true},{"text":"their ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":"th blocks. Assume that it is known that ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-24.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"has this structure. For simplicity, assume ","element":"span"},{"style":{"height":18.33},"width":155.32,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-25.png","element":"img","alt":" γ2 ≫ σ2","inline":true,"padRight":true},{"text":"so that ","element":"span"},{"style":{"height":31.6},"width":1125.45,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-26.png","element":"img","alt":" λmin�σ2Γjk + γ2Γu∗,jk �≈ λmin�γ2Γu∗,jk �. 
Here Γjk and Γu∗,jk","inline":true,"padRight":true},{"text":"denote the expected noise and input covariates of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":"th subsystem.","element":"span"}],[{"id":"id-52","style":{"fontWeight":"bold"},"text":"Corollary 3.2 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Block Diagonal ","element":"span"},{"style":{"height":16.4},"width":175.81,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-27.png","element":"img","alt":" A∗) For T","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"large enough, a version of Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"slightly modified to account for the block structure, will have, with high probability, when ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"True","element":"span"},{"style":{"fontStyle":"italic"},"text":":","element":"span"}],[{"style":{"width":"49%"},"width":857,"height":166,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/7-28.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"In contrast, simply playing ","element":"span"},{"style":{"height":27.21},"width":282.91,"height":68.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-0.png","element":"img","alt":" ut ∼ N(0, γ2p I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"will, with high probability, achieve the following rate:","element":"span"}],[{"style":{"width":"53%"},"width":929,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-1.png","element":"img"}],[{"text":"Intuitively, the rate obtained by Algorithm ","element":"span"},{"href":"#id-37","text":"1 
","element":"a"},{"text":"scales as the average error in estimating each block, while the rate obtained by playing ","element":"span"},{"style":{"height":27.21},"width":282.9,"height":68.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-2.png","element":"img","alt":" ut ∼ N(0, γ2p I)","inline":true,"padRight":true},{"text":"scales as the error of the worst case block. Note ","element":"span"},{"text":"that while in both Corollary ","element":"span"},{"href":"#id-45","text":"3.1 ","element":"a"},{"text":"and Corollary ","element":"span"},{"href":"#id-52","text":"3.2 ","element":"a"},{"text":"we are comparing upper bounds, the leading constants in these bounds are identical to those obtained in the asymptotic lower bound, Theorem ","element":"span"},{"href":"#id-42","text":"2.1","element":"a"},{"text":", and are thus unimprovable—the improvement in upper bounds we see in performing active estimation compared to playing noise are matched by the lower bound. Both corollaries are proved in Section ","element":"span"},{"text":"C","element":"span"},{"text":".","element":"span"}],[{"text":"It is difficult to work out analytically what the performance will be when ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-3.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"is a Jordan block. However, at an intuitive level, our algorithm should yield a large improvement over isotropic noise as the proper excitation of a Jordan block focuses nearly all the energy on the last coordinate in the block. This conjecture is supported by our experiments in Section ","element":"span"},{"text":"5","element":"span"},{"text":".","element":"span"}]]},{"heading":"4. 
Proof Sketch of Theorem 2.3","paragraphs":[[{"text":"To prove Theorem ","element":"span"},{"href":"#id-46","text":"2.3","element":"a"},{"text":", our primary upper bound on the error in the estimates of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-4.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"produced by Algorithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":", we first bound the error in the estimate of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-5.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"obtained at the ","element":"span"},{"style":{"height":17.6},"width":128.67,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-6.png","element":"img","alt":" (i − 1)","inline":true},{"text":"th epoch, then bound the suboptimality of the inputs computed from this estimate, and finally bound the estimation error at the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch in terms of these inputs.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Controlling the estimation error ","element":"span"},{"style":{"height":21.21},"width":700.54,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-7.png","element":"img","alt":" ∥ ˆAi−1 − A∗∥2 at the (i − 1)th epoch.","inline":true,"padRight":true},{"text":"We rely on excitation due to noise to guarantee learning and bound ","element":"span"},{"style":{"height":21.21},"width":254.33,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-8.png","element":"img","alt":" ∥ ˆAi−1 − A∗∥2","inline":true},{"text":". 
This proof is similar to those given in ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") and is outlined in the appendix.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Bounding the suboptimality of the inputs. ","element":"span"},{"text":"Given the estimate ","element":"span"},{"style":{"height":19.43},"width":87.61,"height":48.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-9.png","element":"img","alt":"ˆAi−1","inline":true,"padRight":true},{"text":"and past data ","element":"span"},{"style":{"height":21.76},"width":253.9,"height":54.39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-10.png","element":"img","alt":" {xt}T−Tit=1 , and","inline":true,"padRight":true},{"text":"letting ","element":"span"},{"style":{"height":15.02},"width":36.98,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-11.png","element":"img","alt":" ˆui","inline":true,"padRight":true},{"text":"denote the optimal inputs on the estimated system and ","element":"span"},{"style":{"height":17.22},"width":41.98,"height":43.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-12.png","element":"img","alt":" u∗i ","inline":true,"padRight":true},{"text":"the optimal inputs on the true ","element":"span"},{"text":"system, we wish to bound:","element":"span"}],[{"id":"id-50","style":{"width":"78%"},"width":1352,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-13.png","element":"img"}],[{"text":"in terms of 
","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-14.png","element":"img","alt":" ϵi−1","inline":true},{"text":", as this will quantify how suboptimal our input’s response on the true system is. Theorem ","element":"span"},{"href":"#id-50","text":"4.1 ","element":"a"},{"text":"provides such a bound in terms of ","element":"span"},{"style":{"height":10.22},"width":85.52,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-15.png","element":"img","alt":" ϵi−1.","inline":true}],[{"style":{"width":"98%"},"width":1699,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":17.6},"width":371.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-17.png","element":"img","alt":" L(A∗, B∗, U, ϵ, I, w)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a measure of the smoothness of ","element":"span"},{"style":{"height":19.26},"width":55.91,"height":48.15,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-18.png","element":"img","alt":" Γuki ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"with respect to ","element":"span"},{"style":{"height":15.42},"width":62.66,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-19.png","element":"img","alt":" A∗.","inline":true}],[{"text":"The full version of Theorem ","element":"span"},{"href":"#id-50","text":"4.1 ","element":"a"},{"text":"is stated and proved in Appendix ","element":"span"},{"text":"F","element":"span"},{"text":". 
At a high level, the proof follows by upper bounding (","element":"span"},{"href":"#id-50","text":"2","element":"a"},{"text":") in terms of the difference between ","element":"span"},{"style":{"height":24.07},"width":552.7,"height":60.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-20.png","element":"img","alt":" Γuki and ˆΓuki := Γuki( ˆAi−1, B∗).","inline":true,"padRight":true},{"text":"This difference can be quantified in terms of the sensitivity of ","element":"span"},{"style":{"height":19.26},"width":55.91,"height":48.15,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-21.png","element":"img","alt":" Γuki ","inline":true,"padRight":true},{"text":"to changes in ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/8-22.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"and, critically, ","element":"span"},{"text":"does not require bounding the difference between ","element":"span"},{"style":{"height":17.29},"width":174.74,"height":43.22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-0.png","element":"img","alt":" ˆui and u∗i ","inline":true,"padRight":true},{"text":". 
The primary challenge in proving ","element":"span"},{"text":"Theorem ","element":"span"},{"href":"#id-50","text":"4.1 ","element":"a"},{"text":"is in avoiding standard matrix perturbation bounds of the form:","element":"span"}],[{"id":"id-53","style":{"width":"86%"},"width":1495,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-1.png","element":"img"}],[{"text":"Depending on the structure of ","element":"span"},{"style":{"height":20.06},"width":133.22,"height":50.14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-2.png","element":"img","alt":" A∗, Γuki ","inline":true,"padRight":true},{"text":"could be very ill-conditioned and (","element":"span"},{"href":"#id-53","text":"3","element":"a"},{"text":") could be very loose. ","element":"span"},{"text":"We instead show that it is sufficient to bound:","element":"span"}],[{"id":"id-54","style":{"width":"97%"},"width":1677,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-3.png","element":"img"}],[{"text":"for a set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"guaranteed to include the eigenvectors corresponding to the minimum eigenvalues of ","element":"span"},{"style":{"height":26.62},"width":776.67,"height":66.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-4.png","element":"img","alt":"�T−Tit=1 xtx⊤t + Γˆuiki and �T−Tit=1 xtx⊤t + Γu∗iki ","inline":true,"padRight":true},{"text":". 
Applying (","element":"span"},{"href":"#id-54","text":"4","element":"a"},{"text":") instead of (","element":"span"},{"href":"#id-53","text":"3","element":"a"},{"text":") with this ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"can save a ","element":"span"},{"text":"factor of as much as ","element":"span"},{"style":{"height":17.6},"width":260.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-5.png","element":"img","alt":" 1/(1 − ρ(A∗))","inline":true,"padRight":true},{"text":"in the final perturbation bound.","element":"span"}],[{"text":"Given this perturbation bound, we can quantify how suboptimal the inputs computed by solving ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"on our estimated system are. As we make precise in Appendix ","element":"span"},{"text":"F","element":"span"},{"text":", the suboptimality depends on the frequencies our input signal contains. ","element":"span"},{"text":"UpdateInputs ","element":"span"},{"text":"carefully takes this into account, only playing inputs for which it can guarantee the system will be sufficiently excited. Ultimately, we are interested in exciting the system optimally, which requires that we have learned the system well enough to guarantee the performance at every frequency. We quantify this in Lemma ","element":"span"},{"href":"#id-55","text":"D.1 ","element":"a"},{"text":"and show that for sufficiently large ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"text":", we will be playing inputs that attain the optimal response. 
","element":"span"},{"style":{"fontWeight":"bold"},"text":"Controlling the estimation error ","element":"span"},{"style":{"height":21.21},"width":211.26,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-6.png","element":"img","alt":" ∥ ˆAi − A∗∥2","inline":true,"padRight":true},{"style":{"fontWeight":"bold"},"text":"in terms of the inputs. ","element":"span"},{"text":"The final piece in the proof involves showing that, for the inputs being played, ","element":"span"},{"style":{"height":15.02},"width":36.98,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-7.png","element":"img","alt":" ˆui","inline":true},{"text":", the estimation error will scale in accordance with how these inputs excite the true system. We can decompose the error in our estimate of ","element":"span"},{"style":{"height":15.42},"width":110.8,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-8.png","element":"img","alt":" A∗ as:","inline":true}],[{"style":{"width":"70%"},"width":1222,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-9.png","element":"img"}],[{"style":{"height":22},"width":564.04,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-10.png","element":"img","alt":"∥(�Tt=1xtx⊤t )−1/2�Tt=1xtη⊤t ∥2 ","inline":true,"padRight":true},{"text":"scales like ","element":"span"},{"style":{"height":20.8},"width":466.45,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-11.png","element":"img","alt":" O(�d + log 1/δ + log T)","inline":true,"padRight":true},{"text":"and can be handled using a self-normalized bound ","element":"span"},{"href":"#id-56","referenceIndex":2,"text":"Abbasi-Yadkori et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-56","referenceIndex":2,"text":"2011","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":"). The primary difficulty is obtaining a lower bound on ","element":"span"},{"style":{"height":22},"width":313.88,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-12.png","element":"img","alt":" λmin(�Tt=1xtx⊤t )","inline":true,"padRight":true},{"text":"in terms of the inputs being played. We ","element":"span"},{"text":"in fact want to show something even stronger, that ","element":"span"},{"style":{"height":24.42},"width":551.16,"height":61.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-13.png","element":"img","alt":"�Tt=T−Ti xtx⊤t ⪰ c(T −Ti)Γˆuiki","inline":true},{"text":", as this allows us ","element":"span"},{"text":"to quantify precisely how an input affects the covariates, and how we can adjust the input to increase ","element":"span"},{"style":{"height":22},"width":313.88,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-14.png","element":"img","alt":"λmin(�Tt=1xtx⊤t )","inline":true},{"text":". The following proposition is the key piece in proving such a lower bound. 
","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proposition 4.2 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Informal) ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider ","element":"span"},{"style":{"height":17.75},"width":355.85,"height":44.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-15.png","element":"img","alt":" w ∈ Sd−1 and let ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be a deterministic signal with period ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"style":{"fontStyle":"italic"},"text":". Assuming that ","element":"span"},{"style":{"height":14.62},"width":57.15,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-16.png","element":"img","alt":" Tss","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is large enough that the transient effects of the input have dissipated, we have:","element":"span"}],[{"id":"id-57","style":{"width":"83%"},"width":1441,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-17.png","element":"img"}],[{"text":"The proof of this proposition is given in Section ","element":"span"},{"text":"E","element":"span"},{"text":". The main technical challenge comes in handling the interactions between the inputs and the noise. To avoid directly bounding these cross terms, we prove that the covariates over one period of the input are, with constant probability, lower bounded by the covariates obtained if running the system with no process noise. After enough periods, we show that with high probability the bound (","element":"span"},{"href":"#id-57","text":"5","element":"a"},{"text":") holds. Given this pointwise lower bound, we can apply a similar argument to that in ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") to show the estimation error bound given in Theorem ","element":"span"},{"href":"#id-49","text":"2.6","element":"a"},{"text":".","element":"span"}],[{"text":"To complete the proof of Theorem ","element":"span"},{"href":"#id-46","text":"2.3","element":"a"},{"text":", we effectively apply Theorem ","element":"span"},{"href":"#id-49","text":"2.6 ","element":"a"},{"text":"to bound the estimation error in the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch in terms of ","element":"span"},{"style":{"height":24.34},"width":57.88,"height":60.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-18.png","element":"img","alt":" Γˆuiki","inline":true},{"text":", and using the fact that ","element":"span"},{"style":{"height":15.02},"width":36.98,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/9-19.png","element":"img","alt":" ˆui","inline":true,"padRight":true},{"text":"excites the system nearly optimally, ","element":"span"},{"text":"conclude that we attain the optimal estimation rate.","element":"span"}],[{"id":"id-58","style":{"width":"98%"},"width":1702,"height":625,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-0.png","element":"img"}],[{"text":"Figure 1: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-1.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"diagonalizable by unitary matrix, 
","element":"figcaption","subtype":"caption"},{"style":{"height":16},"width":295.97,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-2.png","element":"img","alt":"d = 6, p = 4, B∗","inline":true,"padRight":true},{"text":"randomly generated","element":"figcaption","subtype":"caption"}],[{"text":"Figure 2: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":186.58,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-3.png","element":"img","alt":" A∗ and B∗","inline":true,"padRight":true},{"text":"randomly generated, ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"d ","element":"figcaption","subtype":"caption"},{"text":"= 5","element":"figcaption","subtype":"caption"},{"text":", ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"p ","element":"figcaption","subtype":"caption"},{"text":"= 3","element":"figcaption","subtype":"caption"}],[{"id":"id-59","style":{"width":"98%"},"width":1702,"height":626,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-4.png","element":"img"}],[{"text":"Figure 3: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-5.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"Jordan block with ","element":"figcaption","subtype":"caption"},{"style":{"height":17.6},"width":500.65,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-6.png","element":"img","alt":"d = 4, ρ(A∗) = 0.9, B∗ = I","inline":true}],[{"text":"Figure 4: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-7.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"Jordan block with 
","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"d ","element":"figcaption","subtype":"caption"},{"text":"= 4","element":"figcaption","subtype":"caption"},{"text":", ","element":"figcaption","subtype":"caption"},{"style":{"height":19.13},"width":586.61,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-8.png","element":"img","alt":"ρ(A∗) = 0.9, B∗ = I, varying σ2u","inline":true}]]},{"heading":"5. Experimental Results","paragraphs":[[{"text":"We next validate our algorithm on several examples. Additional trials are included in Section ","element":"span"},{"text":"J","element":"span"},{"text":". We compare Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"against three baselines: playing ","element":"span"},{"style":{"height":19.13},"width":774.05,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-9.png","element":"img","alt":" ut ∼ N(0, γ2I/p), playing ut ∼ N(0, Σ∗),","inline":true,"padRight":true},{"text":"and playing the oracle set of inputs as computed by solving ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"on the true system parameters. ","element":"span"},{"style":{"height":12.33},"width":48.51,"height":30.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-10.png","element":"img","alt":" Σ∗ ","inline":true,"padRight":true},{"text":"is the covariance yielding the optimal noise excitation and can be computed via an SDP. 
We do not compare against existing works in active system identification as these works typically either require knowledge of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-11.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"to implement, and so are not directly comparable, or propose approaches similar enough to ours (","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"Lindqvist and Hjalmarsson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-18","referenceIndex":26,"text":"2001","element":"a"},{"text":")) that a comparison is not relevant.","element":"span"}],[{"text":"We set ","element":"span"},{"style":{"height":15.6},"width":373.86,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-12.png","element":"img","alt":" T0 = 100, k0 = 20","inline":true},{"text":". Rather than running the ","element":"span"},{"text":"UpdateInputs ","element":"span"},{"text":"function as stated, we plan greedily with respect to ","element":"span"},{"style":{"height":16.81},"width":33.51,"height":42.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-13.png","element":"img","alt":"ˆA","inline":true},{"text":": we do not restrict the set of allowable frequencies and set ","element":"span"},{"style":{"height":22.29},"width":885.64,"height":55.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/10-14.png","element":"img","alt":"U ← OptInputki( ˆAi, B∗, γ2/2, [ki+1], {xt}Tt=1)","inline":true},{"text":". In every experiment we solve ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"from ","element":"span"},{"text":"a single random initialization and do not restart multiple times to obtain a globally optimal solution. 
We plot the error ","element":"span"},{"style":{"height":21.21},"width":200.56,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-0.png","element":"img","alt":" ∥ ˆA − A∗∥2","inline":true,"padRight":true},{"text":"against the iteration number. The solid lines show the averages over 50 trials (100 for Figure ","element":"span"},{"href":"#id-58","text":"2","element":"a"},{"text":") and the shaded regions indicate the 10% and 90% percentiles.","element":"span"}],[{"text":"Figures ","element":"span"},{"href":"#id-58","text":"1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-58","text":"2 ","element":"a"},{"text":"illustrate the effectiveness of our approach as compared to exciting the system with noise—Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"dramatically outperforms noise-based approaches and performs nearly as well as the optimal. Figure ","element":"span"},{"href":"#id-59","text":"3 ","element":"a"},{"text":"investigates the performance of our algorithm when ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-1.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"is unknown. Here we simultaneously solve for ","element":"span"},{"style":{"height":15.42},"width":197.78,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-2.png","element":"img","alt":" A∗ and B∗","inline":true,"padRight":true},{"text":"and use our estimate of ","element":"span"},{"style":{"height":14.62},"width":50.1,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-3.png","element":"img","alt":" B∗","inline":true,"padRight":true},{"text":"when optimizing our inputs. 
As can be seen, this barely affects the algorithm’s performance.","element":"span"}],[{"text":"At each epoch, Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"devotes some amount of input energy to playing random noise. Let ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-4.png","element":"img","alt":"σ2u ","inline":true,"padRight":true},{"text":"denote the variance of this noise. By default in Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"we set ","element":"span"},{"href":"#id-59","style":{"height":27.21},"width":321.24,"height":68.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-5.png","element":"img","alt":" σ2u = γ22p. Figure 4","inline":true,"padRight":true},{"text":"illustrates ","element":"span"},{"text":"the performance of Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"when ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-6.png","element":"img","alt":" σ2u ","inline":true,"padRight":true},{"text":"is varied. For a given ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/11-7.png","element":"img","alt":" σ2u","inline":true},{"text":", all additional energy is devoted ","element":"span"},{"text":"to the sinusoidal component of the input. As this plot illustrates, noise is not needed in practice to effectively learn and, when all energy is devoted to the sinusoidal inputs, the performance of Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"almost immediately matches that of the optimal.","element":"span"}]]},{"heading":"6. 
Discussion","paragraphs":[[{"text":"In this work we have presented an algorithm for active identification of linear dynamical systems. We show that our algorithm achieves optimal asymptotic rates and present finite time performance bounds quantifying how the interactions between the input and the system affect the estimation. This work opens up several possible directions for future work.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"is nonconvex so a globally optimal solution cannot be efficiently found. In practice, an alternating minimization approach can be used to compute a local optimum. While solving ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"may be difficult, as our bounds show, the quantity being optimized is intrinsic to the problem. Developing algorithms to efficiently solve ","element":"span"},{"text":"OptInput ","element":"span"},{"text":"is an interesting future direction.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"Recent works in system identification ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"); ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") have emphasized obtaining bounds that do not scale with the mixing time of the system. Our error bounds do not scale with this quantity yet they require the transient effects of the inputs to have decayed. 
This condition seems necessary to cleanly quantify the performance and design inputs, yet it may be possible to remove it with a careful analysis of the transient behavior.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"This work only considers exciting the system with sinusoidal inputs. While we show this is sufficient to achieve optimal rates, one could also imagine choosing inputs that were a function of the current state. ","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"Dean et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"2018","element":"a"},{"text":") provide rates when a linear state feedback controller is used, but do not discuss how the choice of feedback could improve estimation. It is unclear a priori how effective such feedback would be. At minimum, a carefully designed state feedback controller could be used to mitigate transient effects. We leave this direction for future work.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"text":"A recent work ","element":"span"},{"href":"#id-60","referenceIndex":14,"text":"González and Rojas ","element":"a"},{"text":"(","element":"span"},{"href":"#id-60","referenceIndex":14,"text":"2019","element":"a"},{"text":") develops finite time bounds for estimating SISO AR(","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":") systems with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n > ","element":"span"},{"text":"1","element":"span"},{"text":". Extending this to MIMO AR(","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":") systems and allowing for active input design is an open problem and an exciting future direction.","element":"span"}]]},{"heading":"Acknowledgements","paragraphs":[[{"text":"The authors would like to thank Yue Sun and Max Simchowitz for helpful comments. 
The work of AW was supported by an NSF GRFP Fellowship DGE-1762114. The work of KJ was supported in part by grant NSF RI 1907907.","element":"span"}]]},{"heading":"References","paragraphs":[[{"id":"id-23","text":"Yasin Abbasi-Yadkori and Csaba Szepesvári. ","element":"span"},{"text":"Regret bounds for the adaptive control of linear quadratic systems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proceedings of the 24th Annual Conference on Learning Theory","element":"span"},{"text":", pages 1–26, 2011.","element":"span"}],[{"id":"id-56","text":"Yasin Abbasi-Yadkori, Dávid Pál, and Csaba Szepesvári. Improved algorithms for linear stochastic ","element":"span"},{"text":"bandits. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", 2011.","element":"span"}],[{"id":"id-20","text":"Märta Barenthin, Henrik Jansson, and Håkan Hjalmarsson. Applications of mixed H2 and H∞ ","element":"span"},{"text":"input design in identification. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IFAC Proceedings Volumes","element":"span"},{"text":", 38(1):458–463, 2005.","element":"span"}],[{"id":"id-4","text":"Xavier Bombois, Michel Gevers, Roland Hildebrand, and Gabriel Solari. Optimal experiment ","element":"span"},{"text":"design for open and closed-loop system identification. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Communications in Information and Systems","element":"span"},{"text":", 11(3):197–224, 2011.","element":"span"}],[{"id":"id-27","text":"Alon Cohen, Tomer Koren, and Yishay Mansour. 
Learning linear-quadratic regulators efficiently ","element":"span"},{"text":"with only ","element":"span"},{"style":{"height":19.58},"width":190.94,"height":48.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/12-0.png","element":"img","alt":"√T regret.","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1902.06223","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-24","text":"Sarah Dean, Horia Mania, Nikolai Matni, Benjamin Recht, and Stephen Tu. On the sample ","element":"span"},{"text":"complexity of the linear quadratic regulator. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1710.01688","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-6","text":"Sarah Dean, Horia Mania, Nikolai Matni, Benjamin Recht, and Stephen Tu. Regret bounds for ","element":"span"},{"text":"robust adaptive control of the linear quadratic regulator. ","element":"span"},{"text":"In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", pages 4188–4197, 2018.","element":"span"}],[{"id":"id-26","text":"Sarah Dean, Stephen Tu, Nikolai Matni, and Benjamin Recht. Safely learning to control the ","element":"span"},{"text":"constrained linear quadratic regulator. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"2019 American Control Conference (ACC)","element":"span"},{"text":", pages 5582–5588. IEEE, 2019.","element":"span"}],[{"id":"id-29","text":"Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, and George Michailidis. Finite time ","element":"span"},{"text":"identification in unstable linear systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Automatica","element":"span"},{"text":", 96:342–353, 2018.","element":"span"}],[{"id":"id-19","text":"László Gerencsér and Håkan Hjalmarsson. Adaptive input design in system identification. 
In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proceedings of the 44th IEEE Conference on Decision and Control","element":"span"},{"text":", pages 4988–4993. IEEE, 2005.","element":"span"}],[{"id":"id-21","text":"László Gerencsér, Jonas Mårtensson, and Håkan Hjalmarsson. Adaptive input design for ARX ","element":"span"},{"text":"systems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"2007 European Control Conference (ECC)","element":"span"},{"text":", pages 5707–5714. IEEE, 2007.","element":"span"}],[{"id":"id-22","text":"László Gerencsér, Håkan Hjalmarsson, and Jonas Mårtensson. Identification of ARX systems with ","element":"span"},{"text":"non-stationary inputs - asymptotic analysis with application to adaptive input design. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Automatica","element":"span"},{"text":", 45(3):623–633, 2009.","element":"span"}],[{"id":"id-8","text":"Michel Gevers, Alexandre S Bazanella, Xavier Bombois, and Ljubisa Miskovic. ","element":"span"},{"text":"Identification and the information matrix: how to get just sufficiently rich? ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 54:2828–2840, 2009.","element":"span"}],[{"id":"id-60","text":"Rodrigo González and Cristian Rojas. A finite-sample deviation bound for stable autoregressive ","element":"span"},{"text":"processes. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1912.08103","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-3","text":"Graham Clifford Goodwin and Robert L Payne. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Dynamic system identification: experiment design and data analysis","element":"span"},{"text":". Academic Press, 1977.","element":"span"}],[{"id":"id-10","text":"Per Hägg, Christian A Larsson, and Håkan Hjalmarsson. 
Robust and adaptive excitation signal ","element":"span"},{"text":"generation for input and output constrained systems. ","element":"span"},{"text":"In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"2013 European Control Conference (ECC)","element":"span"},{"text":", pages 1416–1421. IEEE, 2013.","element":"span"}],[{"id":"id-31","text":"Moritz Hardt, Tengyu Ma, and Benjamin Recht. Gradient descent learns linear dynamical systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The Journal of Machine Learning Research","element":"span"},{"text":", 19(1):1025–1068, 2018.","element":"span"}],[{"id":"id-30","text":"Elad Hazan, Holden Lee, Karan Singh, Cyril Zhang, and Yi Zhang. Spectral filtering for general ","element":"span"},{"text":"linear dynamical systems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Advances in Neural Information Processing Systems","element":"span"},{"text":", pages 4634–4643, 2018.","element":"span"}],[{"id":"id-13","text":"Roland Hildebrand and Michel Gevers. Identification for control: optimal input design with respect ","element":"span"},{"text":"to a worst-case ","element":"span"},{"style":{"height":8},"width":24,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/13-0.png","element":"img","alt":" ν","inline":true},{"text":"-gap cost function. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"SIAM Journal on Control and Optimization","element":"span"},{"text":", 41(5):1586–1608, 2002.","element":"span"}],[{"id":"id-12","text":"Håkan Hjalmarsson, Michel Gevers, and Franky De Bruyne. ","element":"span"},{"text":"For model-based control design, closed-loop identification gives better performance. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Automatica","element":"span"},{"text":", 32(12):1659–1673, 1996.","element":"span"}],[{"id":"id-118","text":"Roger A Horn and Charles R Johnson. 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"Matrix analysis","element":"span"},{"text":". Cambridge university press, 2012.","element":"span"}],[{"id":"id-7","text":"Henrik Jansson and H˚akan Hjalmarsson. Input design via lmis admitting frequency-wise model ","element":"span"},{"text":"specifications in confidence regions. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE transactions on Automatic Control","element":"span"},{"text":", 50(10):1534– 1549, 2005.","element":"span"}],[{"id":"id-39","text":"Yassir Jedra and Alexandre Proutiere. Sample complexity lower bounds for linear system identifi- ","element":"span"},{"text":"cation. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1903.10343","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-14","text":"Dimitrios Katselis, Cristian R Rojas, H˚akan Hjalmarsson, and Mats Bengtsson. ","element":"span"},{"text":"Applicationoriented finite sample experiment design: A semidefinite relaxation approach. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IFAC Proceedings Volumes","element":"span"},{"text":", 45(16):1635–1640, 2012.","element":"span"}],[{"id":"id-17","text":"Christian Larsson, Egon Geerardyn, and Johan Schoukens. Robust input design for resonant systems ","element":"span"},{"text":"under limited a priori information. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IFAC Proceedings Volumes","element":"span"},{"text":", 45(16):1611–1616, 2012.","element":"span"}],[{"id":"id-18","text":"Kristian Lindqvist and H˚akan Hjalmarsson. Identification for control: Adaptive input design using ","element":"span"},{"text":"convex optimization. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No. 01CH37228)","element":"span"},{"text":", volume 5, pages 4326–4331. IEEE, 2001.","element":"span"}],[{"id":"id-9","text":"Ian R Manchester. 
Input design for system identification via convex relaxation. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"49th IEEE Conference on Decision and Control (CDC)","element":"span"},{"text":", pages 2041–2046. IEEE, 2010.","element":"span"}],[{"id":"id-25","text":"Horia Mania, Stephen Tu, and Benjamin Recht. Certainty equivalent control of LQR is efficient. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1902.07826","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-11","text":"Raman Mehra. Optimal input signals for parameter estimation in dynamic systems–survey and new ","element":"span"},{"text":"results. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 19(6):753–768, 1974.","element":"span"}],[{"id":"id-2","text":"Raman K Mehra. Synthesis of optimal inputs for multiinput-multioutput (MIMO) systems with ","element":"span"},{"text":"process noise part I: Frequency-domain synthesis part II: Time-domain synthesis. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mathematics in Science and Engineering","element":"span"},{"text":", volume 126, pages 211–249. Elsevier, 1976.","element":"span"}],[{"id":"id-32","text":"Samet Oymak and Necmiye Ozay. Non-asymptotic identification of LTI systems from a single ","element":"span"},{"text":"trajectory. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"2019 American Control Conference (ACC)","element":"span"},{"text":", pages 5655–5661. IEEE, 2019.","element":"span"}],[{"id":"id-5","text":"Luc Pronzato and Andrej Pázman. Design of experiments in nonlinear models. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Lecture Notes in Statistics","element":"span"},{"text":", 212, 2013.","element":"span"}],[{"id":"id-15","text":"Cristian R Rojas, James S Welsh, Graham C Goodwin, and Arie Feuer. 
Robust optimal experiment ","element":"span"},{"text":"design for system identification. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Automatica","element":"span"},{"text":", 43(6):993–1008, 2007.","element":"span"}],[{"id":"id-16","text":"Cristian R Rojas, Juan-Carlos Aguero, James S Welsh, Graham C Goodwin, and Arie Feuer. ","element":"span"},{"text":"Robustness in experiment design. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 57(4):860–874, 2011.","element":"span"}],[{"id":"id-1","text":"Tuhin Sarkar and Alexander Rakhlin. How fast can linear dynamical systems be learned? ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1812.01251","element":"span"},{"text":", 2018.","element":"span"}],[{"id":"id-34","text":"Tuhin Sarkar, Alexander Rakhlin, and Munther A Dahleh. Finite-time system identification for ","element":"span"},{"text":"partially observed LTI systems of unknown order. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1902.01848","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-0","text":"Max Simchowitz, Horia Mania, Stephen Tu, Michael I Jordan, and Benjamin Recht. ","element":"span"},{"text":"Learning without mixing: Towards a sharp analysis of linear system identification. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1802.08334","element":"span"},{"text":", 2018.","element":"span"}],[{"id":"id-33","text":"Max Simchowitz, Ross Boczar, and Benjamin Recht. Learning linear dynamical systems with semi-","element":"span"},{"text":"parametric least squares. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1902.00768","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-35","text":"Anastasios Tsiamis and George J Pappas. Finite sample analysis of stochastic system identification. 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1903.09122","element":"span"},{"text":", 2019.","element":"span"}],[{"id":"id-28","text":"Stephen Tu, Ross Boczar, Andrew Packard, and Benjamin Recht. Non-asymptotic analysis of robust ","element":"span"},{"text":"control from coarse-grained identification. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1707.04791","element":"span"},{"text":", 2017.","element":"span"}]]},{"heading":"Appendix A. Notation","paragraphs":[[{"style":{"width":"102%"},"width":1772,"height":2032,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/16-0.png","element":"img"}],[{"id":"id-62","style":{"width":"103%"},"width":1782,"height":2346,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/17-0.png","element":"img"}],[{"text":"Full Definition of ","element":"span"},{"text":"UpdateInputs","element":"span"}],[{"style":{"width":"97%"},"width":1686,"height":740,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-0.png","element":"img"}],[{"text":"10: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"// Otherwise, set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I ","element":"span"},{"style":{"fontStyle":"italic"},"text":"to include frequencies we can plan effectively with","element":"span"}],[{"style":{"width":"35%"},"width":608,"height":85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-1.png","element":"img"}],[{"text":"13: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"// Check if we can plan optimally with frequency ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-2.png","element":"img","alt":" 
ℓ","inline":true}],[{"style":{"width":"99%"},"width":1716,"height":281,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-3.png","element":"img"}],[{"text":"17: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"end for","element":"span"}],[{"text":"18: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"end if","element":"span"}],[{"text":"19: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"// Update inputs","element":"span"}],[{"style":{"width":"82%"},"width":1427,"height":208,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-4.png","element":"img"}],[{"text":"24: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"end if","element":"span"}],[{"text":"25: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"return ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U","element":"span"}],[{"text":"26: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"end function","element":"span"}],[{"text":"Several comments on notation are in order. First, note that ","element":"span"},{"style":{"height":17.6},"width":135.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-5.png","element":"img","alt":" β(A, ρ)","inline":true,"padRight":true},{"text":"is the smallest value such that ","element":"span"},{"style":{"height":19.53},"width":764.08,"height":48.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-6.png","element":"img","alt":"∥Ak∥2 ≤ β(A, r)ρk for all k ≥ 0. β(A, ρ)","inline":true,"padRight":true},{"text":"is finite as long as ","element":"span"},{"style":{"height":17.6},"width":176,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-7.png","element":"img","alt":" ρ > ρ(A)","inline":true},{"text":". 
More generally, we can upper bound ","element":"span"},{"href":"#id-28","referenceIndex":40,"style":{"height":17.6},"width":475.27,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-8.png","element":"img","alt":" β(A, ρ) as Tu et al. (2017):","inline":true}],[{"style":{"width":"63%"},"width":1089,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-9.png","element":"img"}],[{"text":"As ","element":"span"},{"style":{"fontStyle":"italic"},"text":"r ","element":"span"},{"text":"is increased, ","element":"span"},{"style":{"height":17.6},"width":135.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-10.png","element":"img","alt":" β(A, ρ)","inline":true,"padRight":true},{"text":"will decrease, but the decay rate will be slower. Note that if we set","element":"span"}],[{"style":{"width":"89%"},"width":1550,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/18-11.png","element":"img"}],[{"text":"so the cumulative behavior of the transient, which corresponds to ","element":"span"},{"style":{"height":24.09},"width":60.29,"height":60.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-0.png","element":"img","alt":"11−ρ","inline":true},{"text":", and the upper bound on ","element":"span"},{"style":{"height":17.6},"width":135.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-1.png","element":"img","alt":"β(A, ρ)","inline":true},{"text":", will each be within a factor of 2 of their optimal possible values. Throughout the appendix, we will upper bound ","element":"span"},{"style":{"height":17.6},"width":374.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-2.png","element":"img","alt":" ∥Aℓ∥2 ≤ β(A)¯ρ(A)ℓ","inline":true},{"text":". 
In nearly all cases, however, the expressions obtained that contain ","element":"span"},{"style":{"height":17.6},"width":89.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-3.png","element":"img","alt":" ¯ρ(A)","inline":true,"padRight":true},{"text":"can be replaced with a ","element":"span"},{"style":{"height":17.6},"width":89.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-4.png","element":"img","alt":" ρ(A)","inline":true,"padRight":true},{"text":"by adding a factor of 2.","element":"span"}],[{"text":"To simplify notation throughout the proofs, we will let ","element":"span"},{"style":{"height":22.04},"width":684.2,"height":55.1,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-5.png","element":"img","alt":" Γηk = σ2Γk + σ2uΓB∗k and ˜Γuk = γ2Γuk.","inline":true,"padRight":true},{"text":"Throughout the appendix, we will let ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-6.png","element":"img","alt":" σ2u ","inline":true,"padRight":true},{"text":"refer to the variance of the exploration noise, which is set ","element":"span"},{"text":"by default to ","element":"span"},{"style":{"height":19.13},"width":154.48,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-7.png","element":"img","alt":" γ2/(2p).","inline":true}]]},{"heading":"Appendix B. 
Algorithm 1 Performance Results","paragraphs":[[{"id":"id-48","text":"We first present the full version of Theorem ","element":"span"},{"href":"#id-46","text":"2.3","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Theorem B.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Full version of Theorem ","element":"span"},{"href":"#id-46","style":{"fontStyle":"italic","fontWeight":"bold"},"text":"2.3","element":"a"},{"style":{"fontStyle":"italic"},"text":") Assume that ","element":"span"},{"style":{"height":29.18},"width":386.53,"height":72.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-8.png","element":"img","alt":" γ2 ≥ (1−ρ(A∗))22β(A∗)2 , and:","inline":true}],[{"style":{"width":"89%"},"width":1553,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-9.png","element":"img"}],[{"id":"id-61","style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":135.71,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-10.png","element":"img","alt":"0 ≥ ck0","inline":true}],[{"style":{"width":"88%"},"width":1530,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-11.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Then for any:","element":"span"}],[{"style":{"width":"88%"},"width":1532,"height":341,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"True 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"will achieve the following rate:","element":"span"}],[{"style":{"width":"94%"},"width":1633,"height":240,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-13.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and will produce inputs satisfying ","element":"span"},{"style":{"height":32.4},"width":766.07,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-14.png","element":"img","alt":" E[1/T ∑Tt=1 u⊤t ut] ≤ γ2. Here c1, c2, C","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are universal constants, ","element":"span"},{"style":{"height":12.73},"width":41.98,"height":31.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-15.png","element":"img","alt":" u∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to ","element":"span"},{"style":{"height":29.18},"width":1210.6,"height":72.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-16.png","element":"img","alt":" OptInputk(T)(A∗, B∗, γ2, k(T), 0), and ¯ΓT = 16 β(A∗)2γ2(1−ρ(A∗))2 (1 +","inline":true}],[{"style":{"width":"47%"},"width":819,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-17.png","element":"img"}],[{"text":"Several additional remarks are in order.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark B.2 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For Theorem ","element":"span"},{"href":"#id-48","style":{"fontStyle":"italic"},"text":"B.1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"to hold, ","element":"span"},{"style":{"height":15.02},"width":173.64,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-18.png","element":"img","alt":" T0 and k0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"must be set to satisfy 
(","element":"span"},{"href":"#id-61","style":{"fontStyle":"italic"},"text":"6","element":"a"},{"style":{"fontStyle":"italic"},"text":"). This condition is necessary to guarantee that the burn-in time required by Theorem ","element":"span"},{"href":"#id-49","style":{"fontStyle":"italic"},"text":"2.6 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is met at each epoch. Satisfying this condition requires knowledge of the unknown system, so in practice we cannot guarantee that it will be met for a given choice of ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-19.png","element":"img","alt":" T0, k0","inline":true},{"style":{"fontStyle":"italic"},"text":". However, since Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"increases ","element":"span"},{"style":{"height":14.62},"width":37.5,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-20.png","element":"img","alt":" Ti","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"faster than ","element":"span"},{"style":{"height":15.02},"width":34.72,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-21.png","element":"img","alt":" ki","inline":true},{"style":{"fontStyle":"italic"},"text":", regardless of how ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/19-22.png","element":"img","alt":" T0, k0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are set, it will eventually satisfy the burn-in condition of Theorem ","element":"span"},{"href":"#id-49","style":{"fontStyle":"italic"},"text":"2.6","element":"a"},{"style":{"fontStyle":"italic"},"text":", and so the conclusion of Theorem 
","element":"span"},{"href":"#id-48","style":{"fontStyle":"italic"},"text":"B.1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"will eventually hold.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark B.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Every line in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"UpdateInputs","element":"span"},{"style":{"fontStyle":"italic"},"text":", with the exception of solving ","element":"span"},{"style":{"fontStyle":"italic"},"text":"OptInput","element":"span"},{"style":{"fontStyle":"italic"},"text":", is at worst a convex program and can be solved efficiently. Computing ","element":"span"},{"href":"#id-62","style":{"height":19.81},"width":431.98,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-0.png","element":"img","alt":" M(A, {xt}Tt=1) in line 3","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"involves a linear search over ","element":"span"},{"style":{"height":17.6},"width":123.67,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-1.png","element":"img","alt":" ℓ ∈ [k]","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and the computation of a minimum eigenvalue for each ","element":"span"},{"style":{"height":19.81},"width":324.62,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-2.png","element":"img","alt":" ℓ. M(A, {xt}Tt=1)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"will be an ellipsoid. 
Line ","element":"span"},{"href":"#id-62","style":{"fontStyle":"italic"},"text":"7 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"and line ","element":"span"},{"href":"#id-62","style":{"fontStyle":"italic"},"text":"12 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"also involve iterating over all ","element":"span"},{"style":{"height":17.6},"width":123.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-3.png","element":"img","alt":" ℓ ∈ [k]","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and for each ","element":"span"},{"style":{"height":14.8},"width":129.14,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-4.png","element":"img","alt":" ℓ, max-","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"imizing a quadratic over an ellipsoid. Since the maximization of a quadratic over an ellipsoid can be solved via a single SVD, this step can be efficiently completed. While ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is growing exponentially with the epoch, we only call ","element":"span"},{"style":{"fontStyle":"italic"},"text":"UpdateInputs ","element":"span"},{"style":{"fontStyle":"italic"},"text":"once per epoch. Since the epoch length is also increasing exponentially, the number of epochs is only logarithmic in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"fontStyle":"italic"},"text":". Thus, the total number of flops is only linear in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"fontStyle":"italic"},"text":". In practice, one should simply stop increasing k when a sufficiently fine discretization of the space is reached to obtain close to optimal performance. 
Experimentally, we found this worked quite well.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark B.4 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The only constraint we place on the inputs is that their average power is bounded by some value. This constraint allows for signals with large amplitudes, a situation which is often highly undesirable in practice. To avoid this possibility, further constraints could be added to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"OptInput ","element":"span"},{"style":{"fontStyle":"italic"},"text":"to guarantee that the input computed has bounded amplitude as well as power. Unfortunately, amplitude constraints are non-trivial to enforce when optimizing in the frequency domain. Further, adding this constraint would cause us to lose the guarantee of global optimality of inputs. In practice, we have observed that the optimal inputs typically do not exhibit large spikes or other such undesirable behavior.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark B.5 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The restriction that ","element":"span"},{"style":{"height":17.6},"width":196.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-5.png","element":"img","alt":" ρ(A∗) < 1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is necessary to guarantee that the system will reach steady state when a new input is played. As such, all our finite-time results fundamentally depend on this assumption. A first step towards relaxing it would be proving a version of Proposition ","element":"span"},{"href":"#id-63","style":{"fontStyle":"italic"},"text":"E.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"that does not require the system to have reached steady state. 
We leave this for future work.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"We also note that, in some sense, the interesting regime for active system identification is when ","element":"span"},{"style":{"height":17.6},"width":195.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-6.png","element":"img","alt":"ρ(A∗) < 1","inline":true},{"style":{"fontStyle":"italic"},"text":". As was shown in ","element":"span"},{"href":"#id-1","referenceIndex":35,"style":{"fontStyle":"italic"},"text":"Sarkar and Rakhlin ","element":"a"},{"style":{"fontStyle":"italic"},"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"style":{"fontStyle":"italic"},"text":"2018","element":"a"},{"style":{"fontStyle":"italic"},"text":"), when all modes in ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-7.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are unstable, the system can be estimated at an exponential rate. Thus, in this case, active identification is likely unnecessary. A more interesting regime may be when some eigenvalues of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-8.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"have magnitude greater than 1, and some have magnitude less than 1. In this case active identification could be used to excite the modes corresponding to the smaller eigenvalues. We leave this direction for future work.","element":"span"}],[{"text":"We next present our master theorem quantifying the performance of Algorithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":". 
Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"operates in three regimes. In the first regime, when ","element":"span"},{"style":{"height":14.62},"width":37.5,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-9.png","element":"img","alt":" Ti","inline":true,"padRight":true},{"text":"is not large enough for the system to reach steady state, we are only able to guarantee learning due to the contribution of the noise. In the second regime, ","element":"span"},{"style":{"height":14.62},"width":37.5,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-10.png","element":"img","alt":" Ti","inline":true,"padRight":true},{"text":"is large enough for the system to reach steady state but ","element":"span"},{"style":{"height":10.22},"width":29.71,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-11.png","element":"img","alt":" ϵi","inline":true,"padRight":true},{"text":"is not small enough for all frequencies to be playable. Finally, in the third regime, ","element":"span"},{"style":{"height":14.62},"width":37.5,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-12.png","element":"img","alt":" Ti","inline":true,"padRight":true},{"text":"is large enough to reach steady state and all frequencies are playable, allowing us to attain the optimal performance. 
All three regimes are quantified in Theorem ","element":"span"},{"href":"#id-64","text":"B.6","element":"a"},{"text":".","element":"span"}],[{"id":"id-64","style":{"fontWeight":"bold"},"text":"Theorem B.6 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":29.18},"width":386.53,"height":72.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-13.png","element":"img","alt":" γ2 ≥ (1−ρ(A∗))22β(A∗)2 , and:","inline":true}],[{"style":{"width":"90%"},"width":1556,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-14.png","element":"img"}],[{"id":"id-78","style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":135.71,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-15.png","element":"img","alt":"0 ≥ ck0","inline":true}],[{"style":{"width":"88%"},"width":1532,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/20-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"height":31.6},"width":1235.1,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-0.png","element":"img","alt":"¯ΓT = 16 β(A∗)2γ2(1−ρ(A∗))2 (1 + T)I + 4�tr�σ2ΓT + γ2p ΓB∗T � �1 + log 2δ�I�","inline":true},{"style":{"fontStyle":"italic"},"text":". 
Then Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"FT ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"True ","element":"span"},{"style":{"fontStyle":"italic"},"text":"will have:","element":"span"}],[{"id":"id-72","style":{"width":"94%"},"width":1632,"height":1477,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-1.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"In all cases, the inputs produced will satisfy:","element":"span"}],[{"style":{"width":"22%"},"width":397,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-2.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Here ","element":"span"},{"style":{"height":15.6},"width":256.72,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-3.png","element":"img","alt":" C1, C2, C3, C4","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are universal constants.","element":"span"}],[{"id":"id-47","style":{"fontWeight":"bold"},"text":"B.1. Proof of Theorem ","element":"span"},{"href":"#id-46","style":{"fontWeight":"bold"},"text":"2.3 ","element":"a"},{"style":{"fontWeight":"bold"},"text":"and Theorem ","element":"span"},{"href":"#id-48","style":{"fontWeight":"bold"},"text":"B.1","element":"a"}],[{"text":"The proof of Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"follows an event-based analysis. We define several events, show that they all hold with high probability, and that together they imply the rate given in Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"holds. 
We outline the steps at a high level here.","element":"span"}],[{"text":"We first must show that the estimate attained at the ","element":"span"},{"style":{"height":12.4},"width":97.25,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-4.png","element":"img","alt":" i − 1","inline":true},{"text":"th epoch is sufficiently accurate to guarantee that we are playing inputs that achieve a response close to optimal. Defining the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-5.png","element":"img","alt":" E7","inline":true,"padRight":true},{"text":"to be the event that ","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/21-6.png","element":"img","alt":" ϵi−1","inline":true,"padRight":true},{"text":"is this small, Theorem ","element":"span"},{"href":"#id-64","text":"B.6 ","element":"a"},{"text":"shows that this holds with high probability. To show that the value of ","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-0.png","element":"img","alt":" ϵi−1","inline":true,"padRight":true},{"text":"is sufficiently small to guarantee that our inputs are nearly optimal, we must show that ","element":"span"},{"style":{"height":13.9},"width":139.98,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-1.png","element":"img","alt":" ϵS ≥ ¯ϵS","inline":true},{"text":". This requires controlling the covariates in a specific direction, ","element":"span"},{"style":{"height":15.02},"width":206.74,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-2.png","element":"img","alt":" wmin which","inline":true,"padRight":true},{"text":"we define below. 
Event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-3.png","element":"img","alt":" E6","inline":true,"padRight":true},{"text":"is the event on which this is controlled, and Lemma ","element":"span"},{"href":"#id-65","text":"E.7 ","element":"a"},{"text":"shows that it holds with high probability. On this event, Theorem ","element":"span"},{"href":"#id-66","text":"F.1 ","element":"a"},{"text":"and Lemmas ","element":"span"},{"href":"#id-67","text":"D.2 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-55","text":"D.1 ","element":"a"},{"text":"guarantee that, given ","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-4.png","element":"img","alt":" ϵi−1","inline":true,"padRight":true},{"text":"this small, our inputs achieve a nearly optimal response.","element":"span"}],[{"text":"The remaining events are needed to guarantee that our estimation rate at epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"holds. Event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-5.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"guarantees an upper bound on the covariates. ","element":"span"},{"style":{"height":15.02},"width":171.98,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-6.png","element":"img","alt":" E2 and E3","inline":true,"padRight":true},{"text":"both give lower bounds on the covariates. 
","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-7.png","element":"img","alt":"E2","inline":true,"padRight":true},{"text":"lower bounds the covariates from all epochs prior to epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"in terms of the noise, and ","element":"span"},{"style":{"height":15.02},"width":152.79,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-8.png","element":"img","alt":" E3 lower","inline":true,"padRight":true},{"text":"bounds the covariates from the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch in terms of the input. The former is necessary for more technical reasons, while the latter allows us to lower bound the covariates in terms of the inputs, which ultimately yields the rate that depends on the input response. ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-9.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"is shown to hold with high probability by Lemma ","element":"span"},{"href":"#id-65","text":"E.7 ","element":"a"},{"text":"and, conditioned on the covariates being upper bounded, Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"shows that ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-10.png","element":"img","alt":" E2","inline":true,"padRight":true},{"text":"holds with high probability.","element":"span"}],[{"text":"A slightly more subtle issue arises in showing that ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-11.png","element":"img","alt":" 
E3","inline":true,"padRight":true},{"text":"holds with high probability. For ","element":"span"},{"style":{"height":15.02},"width":90.71,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-12.png","element":"img","alt":" E3 to","inline":true,"padRight":true},{"text":"hold, we must have that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is large enough to guarantee that the system has reached steady state in the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch. Guaranteeing the steady state condition is reached requires the initial state at the start of the epoch, ","element":"span"},{"style":{"height":12.65},"width":106.43,"height":31.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-13.png","element":"img","alt":" xT−Ti","inline":true},{"text":", to be bounded. Given that such a bound holds, we can guarantee, in terms of this bound, that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"will be sufficiently large for the system to reach steady state. Event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-14.png","element":"img","alt":" E5","inline":true,"padRight":true},{"text":"gives this upper bound on ","element":"span"},{"href":"#id-69","style":{"height":17.05},"width":411.91,"height":42.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-15.png","element":"img","alt":" xT−Ti and Lemma D.7","inline":true,"padRight":true},{"text":"shows that it holds with high probability. 
Given this and the burn-in condition required by Theorem ","element":"span"},{"href":"#id-48","text":"B.1","element":"a"},{"text":", it follows that the system will have reached steady state at epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":". This, combined with ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-16.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"holding, allows us to apply Corollary ","element":"span"},{"href":"#id-70","text":"E.5 ","element":"a"},{"text":"to show that ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-17.png","element":"img","alt":" E3","inline":true,"padRight":true},{"text":"holds with high probability.","element":"span"}],[{"text":"Event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-18.png","element":"img","alt":" E4","inline":true,"padRight":true},{"text":"next shows that the self-normalized term in the error is bounded. On the event that the covariates are upper and lower bounded as in ","element":"span"},{"style":{"height":15.6},"width":228.07,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-19.png","element":"img","alt":" E1, E2, E3, E4","inline":true,"padRight":true},{"text":"holds with high probability by Lemma ","element":"span"},{"href":"#id-71","text":"E.6","element":"a"},{"text":".","element":"span"}],[{"text":"Finally, we show that if all of these events hold simultaneously, the “good” event, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":", which guarantees that the rate in Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"holds, is always true. 
Since all of these events hold with high probability, it then follows that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"text":"holds with high probability.","element":"span"}],[{"style":{"width":"91%"},"width":1577,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-20.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":12.73},"width":41.98,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-21.png","element":"img","alt":" ¯u∗ ","inline":true,"padRight":true},{"text":"be the solution to ","element":"span"},{"style":{"height":20.22},"width":558.97,"height":50.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-22.png","element":"img","alt":" OptInputki(A∗, B∗, γ2, ki, 0)","inline":true,"padRight":true},{"text":"and define the following events:","element":"span"}],[{"style":{"width":"76%"},"width":1325,"height":669,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/22-23.png","element":"img"}],[{"style":{"width":"96%"},"width":1661,"height":538,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-0.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":18.33},"width":383.75,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-1.png","element":"img","alt":" A∗ = PJP −1 and pi","inline":true,"padRight":true},{"text":"denote the columns of ","element":"span"},{"style":{"height":14.62},"width":251.4,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-2.png","element":"img","alt":" P. 
Here wmin","inline":true,"padRight":true},{"text":"is any unit norm vector such that ","element":"span"},{"style":{"height":17.41},"width":372.67,"height":43.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-3.png","element":"img","alt":"w⊤minpi = 0 for all pi","inline":true,"padRight":true},{"text":"that do not correspond to the minimum eigenvalue of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-4.png","element":"img","alt":" A∗","inline":true},{"text":". We wish to bound ","element":"span"},{"style":{"height":17.6},"width":102.29,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-5.png","element":"img","alt":"P[Ac]","inline":true},{"text":". The following set of inequalities obviously holds:","element":"span"}],[{"style":{"width":"98%"},"width":1696,"height":640,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-6.png","element":"img"}],[{"text":"By part ","element":"span"},{"href":"#id-72","text":"(a) ","element":"a"},{"text":"of Theorem ","element":"span"},{"href":"#id-64","text":"B.6 ","element":"a"},{"text":"it follows that if:","element":"span"}],[{"style":{"width":"89%"},"width":1549,"height":458,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-7.png","element":"img"}],[{"text":"which allows us to deterministically upper bound:","element":"span"}],[{"style":{"width":"101%"},"width":1749,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-8.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-65","text":"E.7 ","element":"a"},{"text":"we then have that ","element":"span"},{"href":"#id-65","style":{"height":17.88},"width":665.54,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-9.png","element":"img","alt":" P[Ec1] ≤ δ. By Lemma E.7, P[Ec6] ≤ δ","inline":true},{"text":". 
Note that on the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/23-10.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"by Lemmas ","element":"span"},{"href":"#id-73","text":"D.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-74","text":"D.4 ","element":"a"},{"text":"the burn-in time required by Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"will be met at the end of epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"assuming that ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-0.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are chosen to satisfy (","element":"span"},{"href":"#id-61","text":"6","element":"a"},{"text":"). Since ","element":"span"},{"style":{"height":10.62},"width":79.86,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-1.png","element":"img","alt":" ui−1","inline":true,"padRight":true},{"text":"is random, we cannot apply Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"to bound this directly; however:","element":"span"}],[{"style":{"width":"104%"},"width":1802,"height":649,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-2.png","element":"img"}],[{"text":"have that ","element":"span"},{"style":{"height":17.88},"width":433.14,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-3.png","element":"img","alt":" P[E1 ∩ E2 ∩ E5 ∩ Ec3] ≤ δ","inline":true,"padRight":true},{"text":"so long as the steady-state condition required by Proposition ","element":"span"},{"href":"#id-63","text":"E.2 ","element":"a"},{"text":"is met for every 
","element":"span"},{"style":{"height":24.42},"width":859.4,"height":61.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-4.png","element":"img","alt":" w ∈ Sd−1 where cTiw⊤˜Γuikiw ≥ ∑T−Tit=1 (w⊤xt)2","inline":true},{"text":". That is, we need:","element":"span"}],[{"style":{"width":"73%"},"width":1274,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-5.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w ","element":"span"},{"text":"meeting this condition. On the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-6.png","element":"img","alt":" E5","inline":true},{"text":", by Corollary ","element":"span"},{"href":"#id-75","text":"D.8","element":"a"},{"text":", this burn-in time will be reached as long as:","element":"span"}],[{"style":{"width":"63%"},"width":1090,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-7.png","element":"img"}],[{"text":"Note that on the event ","element":"span"},{"style":{"height":15.02},"width":284.52,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-8.png","element":"img","alt":" E2 and since Tss","inline":true,"padRight":true},{"text":"increases as its first argument decreases, we have:","element":"span"}],[{"style":{"width":"69%"},"width":1198,"height":532,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-9.png","element":"img"}],[{"text":"By Lemmas ","element":"span"},{"href":"#id-73","text":"D.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-74","text":"D.4 ","element":"a"},{"text":"and on the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-10.png","element":"img","alt":" E1","inline":true},{"text":", assuming 
that ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-11.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are chosen to satisfy (","element":"span"},{"href":"#id-61","text":"6","element":"a"},{"text":"), we will have that:","element":"span"}],[{"style":{"width":"45%"},"width":792,"height":92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-12.png","element":"img"}],[{"text":"so if ","element":"span"},{"style":{"height":31.6},"width":617.04,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-13.png","element":"img","alt":" Ti ≥ 3Tss(c5λmin(Γηki), ki), then:","inline":true}],[{"style":{"width":"74%"},"width":1286,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/24-14.png","element":"img"}],[{"text":"Noting that on ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-0.png","element":"img","alt":" E1","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":22},"width":371.92,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-1.png","element":"img","alt":"∑_{t=1}^{T−Ti} xtx⊤t ⪯ T ¯ΓT ","inline":true,"padRight":true},{"text":", we see then that the burn-in time required ","element":"span"},{"text":"by Corollary ","element":"span"},{"href":"#id-70","text":"E.5 ","element":"a"},{"text":"will be met if ","element":"span"},{"style":{"height":31.6},"width":516.75,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-2.png","element":"img","alt":" Ti ≥ 3Tss(c5λmin(Γηki), ki)","inline":true},{"text":". 
Repeating the same calculation we","element":"span"}],[{"text":"used to bound ","element":"span"},{"href":"#id-70","style":{"height":17.88},"width":186.58,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-3.png","element":"img","alt":" P[E1 ∩ Ec2]","inline":true,"padRight":true},{"text":"to handle the fact that ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-4.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"is random, we conclude, by Corollary ","element":"span"},{"href":"#id-70","text":"E.5","element":"a"},{"text":", that","element":"span"}],[{"style":{"height":17.88},"width":456.06,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-5.png","element":"img","alt":"P[E1 ∩ E2 ∩ E5 ∩ Ec3] ≤ δ.","inline":true}],[{"style":{"width":"96%"},"width":1661,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-6.png","element":"img"}],[{"text":"as:","element":"span"}],[{"style":{"width":"55%"},"width":954,"height":202,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-7.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":15.02},"width":582.7,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-8.png","element":"img","alt":" E1 ∩ E2 ∩ E3 ∩ E4 ∩ E5 ∩ E6 ∩ E7","inline":true},{"text":", we will have that:","element":"span"}],[{"style":{"width":"75%"},"width":1313,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-9.png","element":"img"}],[{"text":"Furthermore, on this event, we will have that ","element":"span"},{"style":{"height":19.13},"width":488.33,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-10.png","element":"img","alt":" ϵi−1 ≤ ¯ϵS(A∗, B∗, γ2, T, 
δ)","inline":true,"padRight":true},{"text":"and all the conditions of Lemma ","element":"span"},{"href":"#id-67","text":"D.2 ","element":"a"},{"text":"will be met so ","element":"span"},{"style":{"height":19.81},"width":929.37,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-11.png","element":"img","alt":" ϵS(A∗, B∗, γ2, ki, {xt}Tt=1, δ) ≥ ¯ϵS(A∗, B∗, γ2, T, δ)","inline":true},{"text":". This implies that ","element":"span"},{"style":{"height":21.76},"width":721.86,"height":54.39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-12.png","element":"img","alt":"ϵi−1 ≤ ϵS(A∗, B∗, γ2, ki−1, {xt}T−Tit=1 , δ)","inline":true,"padRight":true},{"text":"so by Lemma ","element":"span"},{"href":"#id-55","text":"D.1","element":"a"},{"text":", ","element":"span"},{"style":{"height":17.6},"width":241.87,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-13.png","element":"img","alt":" Ii = [ki] and:","inline":true}],[{"style":{"height":53.5},"width":614.88,"height":133.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-14.png","element":"img","alt":"��λmin�Tik2iHki(A∗, B∗, U∗, [ki]) +","inline":true}],[{"id":"id-76","style":{"width":"98%"},"width":1709,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-15.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-16.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"text":"is the solution to ","element":"span"},{"style":{"height":21.6},"width":864.13,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-17.png","element":"img","alt":" OptInputki(A∗, B∗, γ2/2, [ki], {xt}Tt=1) and ˆU","inline":true,"padRight":true},{"text":"the solution to 
","element":"span"},{"style":{"height":21.6},"width":783.21,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-18.png","element":"img","alt":"OptInputki( ˆAi−1, B∗, γ2/2, [ki], {xt}Tt=1)","inline":true},{"text":". Furthermore, on this event we will have that:","element":"span"}],[{"style":{"width":"55%"},"width":954,"height":557,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-19.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.6},"width":362.42,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-20.png","element":"img","alt":" (a) holds on E3, (b)","inline":true,"padRight":true},{"text":"holds given (","element":"span"},{"href":"#id-76","text":"9","element":"a"},{"text":"), ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"holds since the inputs ","element":"span"},{"style":{"height":17.22},"width":41.98,"height":43.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-21.png","element":"img","alt":" u∗i ","inline":true,"padRight":true},{"text":"maximize the quantity","element":"span"}],[{"style":{"height":31.6},"width":26,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-22.png","element":"img","alt":"�","inline":true},{"text":"under the power constraint, and ","element":"span"},{"style":{"height":21.29},"width":603.45,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-23.png","element":"img","alt":" (d) holds on E2. 
Ti = (2/3)T + (1/3)T0","inline":true,"padRight":true},{"text":"which implies that both ","element":"span"},{"style":{"height":15.02},"width":204.23,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-24.png","element":"img","alt":" Ti and Ti−1","inline":true,"padRight":true},{"text":"are greater than ","element":"span"},{"style":{"height":21.29},"width":114.98,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-25.png","element":"img","alt":"(1/5)T so:","inline":true}],[{"style":{"width":"53%"},"width":929,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/25-26.png","element":"img"}],[{"text":"By the error decomposition above, on the event ","element":"span"},{"style":{"height":15.02},"width":548.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-0.png","element":"img","alt":" E1 ∩ E2 ∩ E3 ∩ E4 ∩ E5 ∩ E6 ∩ E7","inline":true,"padRight":true},{"text":"it then follows that:","element":"span"}],[{"style":{"width":"83%"},"width":1437,"height":561,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-1.png","element":"img"}],[{"text":"and ","element":"span"},{"style":{"height":31.6},"width":606.13,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-2.png","element":"img","alt":" Ti ≥ 3Tss(c5λmin(Γηki), ki) then:","inline":true}],[{"style":{"width":"82%"},"width":1417,"height":190,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-3.png","element":"img"}],[{"text":"To eliminate dependence on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":", note that ","element":"span"},{"style":{"height":24},"width":633.04,"height":60.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-4.png","element":"img","alt":" T = ∑_{j=0}^{i} 3^j T0 = (T0/2)(3^{i+1} − 1)","inline":true,"padRight":true},{"text":"which implies that 
","element":"span"},{"style":{"height":21.29},"width":1391.59,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-5.png","element":"img","alt":"i = log(2T/T0 + 1)/log 3 − 1, and that Ti = (2/3)T + (1/3)T0 and Ti−1 = (2/9)T + (1/9)T0","inline":true},{"text":". We then have that","element":"span"}],[{"style":{"width":"100%"},"width":1728,"height":635,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-6.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"B.2. Proof of Theorem ","element":"span"},{"href":"#id-64","style":{"fontWeight":"bold"},"text":"B.6","element":"a"}],[{"text":"Throughout we will let ","element":"span"},{"style":{"height":24},"width":241.39,"height":60,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/26-7.png","element":"img","alt":" T = ∑_{j=0}^{i} Ti","inline":true},{"text":", the total time that has elapsed after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"epochs. 
","element":"span"},{"text":"We first note that the bound on expected power of the inputs follows directly from Lemma ","element":"span"},{"href":"#id-77","text":"D.6","element":"a"}],[{"text":"and by the power constraint imposed in ","element":"span"},{"text":"OptInput","element":"span"},{"text":".","element":"span"}],[{"style":{"width":"45%"},"width":778,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-0.png","element":"img"}],[{"text":"Let:","element":"span"}],[{"style":{"width":"70%"},"width":1210,"height":152,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-1.png","element":"img"}],[{"text":"be the event that our desired error bound holds, and define the following events:","element":"span"}],[{"style":{"width":"89%"},"width":1546,"height":463,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-2.png","element":"img"}],[{"text":"We wish to bound ","element":"span"},{"style":{"height":17.6},"width":102.29,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-3.png","element":"img","alt":" P[Ac]","inline":true},{"text":". The following inequalities obviously hold:","element":"span"}],[{"style":{"width":"71%"},"width":1244,"height":177,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-4.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-65","text":"E.7","element":"a"},{"text":", and since, following the proof of Lemma ","element":"span"},{"href":"#id-74","text":"D.4","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"97%"},"width":1684,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-5.png","element":"img"}],[{"text":"we have that ","element":"span"},{"style":{"height":17.88},"width":192.26,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-6.png","element":"img","alt":" P[Ec1] ≤ δ","inline":true},{"text":". 
Note that on the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-7.png","element":"img","alt":" E1","inline":true},{"text":", by Lemmas ","element":"span"},{"href":"#id-73","text":"D.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-74","text":"D.4 ","element":"a"},{"text":"the burn-in time ","element":"span"},{"text":"required by Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"will be met at the end of epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"assuming that ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-8.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are chosen to satisfy","element":"span"}],[{"style":{"width":"77%"},"width":1340,"height":393,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-9.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":15.02},"width":220.92,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-10.png","element":"img","alt":" E1 ∩ E2 ∩ E3","inline":true},{"text":", we have that:","element":"span"}],[{"style":{"width":"38%"},"width":658,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-11.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"69%"},"width":1194,"height":94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/27-12.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"59%"},"width":1025,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-0.png","element":"img"}],[{"text":"so 
","element":"span"},{"style":{"height":17.6},"width":453.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-1.png","element":"img","alt":" P[Ac ∩ E1 ∩ E2 ∩ E3] = 0","inline":true},{"text":". Combining everything, it follows that:","element":"span"}],[{"style":{"width":"87%"},"width":1514,"height":247,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-2.png","element":"img"}],[{"id":"id-82","text":"The proof of this mirrors closely the proof above but now with inputs included. Let:","element":"span"}],[{"style":{"width":"76%"},"width":1324,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-3.png","element":"img"}],[{"text":"be the event that our desired error bound holds, and define the following events:","element":"span"}],[{"style":{"width":"96%"},"width":1661,"height":769,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-4.png","element":"img"}],[{"text":"We wish to bound ","element":"span"},{"style":{"height":17.6},"width":102.3,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-5.png","element":"img","alt":" P[Ac]","inline":true},{"text":". The following set of inequalities hold:","element":"span"}],[{"style":{"width":"85%"},"width":1482,"height":376,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-6.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-69","text":"D.7 ","element":"a"},{"text":"we have that ","element":"span"},{"href":"#id-65","style":{"height":17.88},"width":463.79,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-7.png","element":"img","alt":" P[Ec5] ≤ δ. 
By Lemma E.7","inline":true,"padRight":true},{"text":"and since:","element":"span"}],[{"style":{"width":"97%"},"width":1684,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/28-8.png","element":"img"}],[{"text":"we have that ","element":"span"},{"style":{"height":17.88},"width":172.69,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-0.png","element":"img","alt":" P[Ec1] ≤ δ","inline":true},{"text":". Note that on the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-1.png","element":"img","alt":" E1","inline":true},{"text":", by Lemmas ","element":"span"},{"href":"#id-73","text":"D.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-74","text":"D.4 ","element":"a"},{"text":"the burn-in time required ","element":"span"},{"text":"by Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"will be met at the end of epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"assuming that ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-2.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are chosen to satisfy (","element":"span"},{"href":"#id-78","text":"8","element":"a"},{"text":"). 
Since ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-3.png","element":"img","alt":"ui","inline":true,"padRight":true},{"text":"is random, we cannot apply Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"to bound this directly; however:","element":"span"}],[{"style":{"width":"80%"},"width":1390,"height":349,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-4.png","element":"img"}],[{"text":"where the last inequality follows by applying Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"since ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-5.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"is deterministic on ","element":"span"},{"style":{"height":17.05},"width":193.41,"height":42.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-6.png","element":"img","alt":" FT−Ti and","inline":true,"padRight":true},{"text":"noting that ","element":"span"},{"style":{"height":21.29},"width":290.53,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-7.png","element":"img","alt":" Ti = (2/3)T + (1/3)T0.","inline":true}],[{"text":"A similar argument can be applied to bound ","element":"span"},{"style":{"height":17.88},"width":287.17,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-8.png","element":"img","alt":" P[E1 ∩ E5 ∩ Ec3]","inline":true},{"text":". 
Note that on the event ","element":"span"},{"style":{"height":16.4},"width":112.62,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-9.png","element":"img","alt":" E1, by","inline":true,"padRight":true},{"text":"Lemmas ","element":"span"},{"href":"#id-73","text":"D.5 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-74","text":"D.4 ","element":"a"},{"text":"and assuming that ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-10.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are chosen to satisfy (","element":"span"},{"href":"#id-78","text":"8","element":"a"},{"text":"), we will have that:","element":"span"}],[{"style":{"width":"45%"},"width":792,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-11.png","element":"img"}],[{"text":"so if ","element":"span"},{"style":{"height":31.6},"width":624.64,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-12.png","element":"img","alt":" Ti ≥ 3Tss((1/10)λmin(˜Γuiki), ki), then:","inline":true}],[{"style":{"width":"76%"},"width":1316,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-13.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-14.png","element":"img","alt":" E5","inline":true},{"text":", by Corollary ","element":"span"},{"href":"#id-75","text":"D.8","element":"a"},{"text":", ","element":"span"},{"style":{"height":31.6},"width":396.6,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-15.png","element":"img","alt":" Tss((1/10)λmin(˜Γuiki), ki)","inline":true},{"text":"will then be sufficiently large for the","element":"span"}],[{"text":"system to reach steady state so 
the burn-in time required by Lemma ","element":"span"},{"href":"#id-79","text":"E.4 ","element":"a"},{"text":"will be met. Then repeating","element":"span"}],[{"style":{"width":"99%"},"width":1725,"height":394,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-16.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":15.02},"width":401.81,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-17.png","element":"img","alt":" E1 ∩ E2 ∩ E3 ∩ E4 ∩ E5","inline":true},{"text":", we have that:","element":"span"}],[{"style":{"width":"44%"},"width":772,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-18.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"75%"},"width":1309,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-19.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"65%"},"width":1139,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/29-20.png","element":"img"}],[{"text":"so ","element":"span"},{"style":{"height":17.6},"width":634.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-0.png","element":"img","alt":" P[Ac ∩ E1 ∩ E2 ∩ E3 ∩ E4 ∩ E5] = 0","inline":true},{"text":". 
Combining everything, it follows that:","element":"span"}],[{"style":{"width":"90%"},"width":1571,"height":254,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-1.png","element":"img"}],[{"text":"To complete the result, we must show that the inputs ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-2.png","element":"img","alt":" ui","inline":true},{"text":", which are computed based on our estimate of the system ","element":"span"},{"style":{"height":19.43},"width":87.6,"height":48.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-3.png","element":"img","alt":"ˆAi−1","inline":true},{"text":", are close to the optimal inputs computed on the true system, for a specific set of frequencies ","element":"span"},{"style":{"height":15.02},"width":193.72,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-4.png","element":"img","alt":" Ii. 
That is:","inline":true}],[{"style":{"width":"96%"},"width":1661,"height":287,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-5.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-6.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"text":"is the solution to ","element":"span"},{"style":{"height":21.81},"width":868.56,"height":54.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-7.png","element":"img","alt":" OptInputki(A∗, B∗, γ2/2, Ii, {xt}T−Tit=1 ) and ˆU","inline":true,"padRight":true},{"text":"the solution to","element":"span"}],[{"style":{"width":"100%"},"width":1741,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-8.png","element":"img"}],[{"text":"then:","element":"span"}],[{"style":{"width":"77%"},"width":1346,"height":225,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-9.png","element":"img"}],[{"text":"this then implies that ","element":"span"},{"style":{"height":25.93},"width":593.84,"height":64.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-10.png","element":"img","alt":" ϵi−1 ≤ (3∥(ej 2πℓki I − A∗)−1∥2)−1","inline":true,"padRight":true},{"text":"so, again by Lemma ","element":"span"},{"href":"#id-80","text":"F.9","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"48%"},"width":844,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-11.png","element":"img"}],[{"text":"Assuming this condition is satisfied for a particular ","element":"span"},{"style":{"height":15.2},"width":127.14,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-12.png","element":"img","alt":" ℓ, 
then:","inline":true}],[{"style":{"width":"96%"},"width":1671,"height":306,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-13.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":31.6},"width":1205.62,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-14.png","element":"img","alt":" OptInputki(A∗, B∗, γ2/2, Ii, {xt}T−Tit=1 ) ≥ λmin��T−Tit=1 xtx⊤t�","inline":true},{"text":". So linking these together, if ","element":"span"},{"style":{"height":15.2},"width":217.76,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-15.png","element":"img","alt":" ℓ ∈ Ii, then:","inline":true}],[{"style":{"width":"83%"},"width":1441,"height":249,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/30-16.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":25.94},"width":830.28,"height":64.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-0.png","element":"img","alt":" ϵi−1 ≤ (3∥(ej 2πℓki I − A∗)−1∥2)−1 for all ℓ ∈ Ii","inline":true},{"text":", we can invoke Lemma ","element":"span"},{"href":"#id-81","text":"F.4 ","element":"a"},{"text":"to get that:","element":"span"}],[{"style":{"width":"96%"},"width":1670,"height":939,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-1.png","element":"img"}],[{"text":"which is the desired conclusion.","element":"span"}],[{"style":{"width":"44%"},"width":776,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-2.png","element":"img"}],[{"text":"Let:","element":"span"}],[{"style":{"width":"92%"},"width":1600,"height":500,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-3.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":18.33},"width":383.75,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-4.png","element":"img","alt":" A∗ = PJP −1 
and pi","inline":true,"padRight":true},{"text":"denote the columns of ","element":"span"},{"style":{"height":14.62},"width":251.4,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-5.png","element":"img","alt":" P. Here wmin","inline":true,"padRight":true},{"text":"is any unit norm vector such that ","element":"span"},{"style":{"height":17.41},"width":372.3,"height":43.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-6.png","element":"img","alt":"w⊤minpi = 0 for all pi","inline":true,"padRight":true},{"text":"that do not correspond to the minimum eigenvalue of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-7.png","element":"img","alt":" A∗","inline":true},{"text":". We can follow the ","element":"span"},{"text":"proof outlined in Section ","element":"span"},{"href":"#id-82","text":"B.2.2 ","element":"a"},{"text":"up to the final step, adding in the events ","element":"span"},{"style":{"height":15.6},"width":115.31,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-8.png","element":"img","alt":" E6, E7:","inline":true}],[{"style":{"width":"84%"},"width":1458,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-9.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-65","text":"E.7","element":"a"},{"text":", ","element":"span"},{"href":"#id-72","style":{"height":17.88},"width":428.16,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-10.png","element":"img","alt":" P[Ec7] ≤ δ. By part (a)","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":17.6},"width":292.14,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-11.png","element":"img","alt":" P[E6] ≥ 1 − 3δ","inline":true},{"text":". 
We would like to ","element":"span"},{"text":"guarantee that ","element":"span"},{"style":{"height":19.13},"width":583.76,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-12.png","element":"img","alt":" ϵi−1 ≤ ¯ϵS(A∗, B∗, γ2, T − Ti, δ)","inline":true},{"text":". On the event ","element":"span"},{"style":{"height":15.02},"width":40.03,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-13.png","element":"img","alt":" E6","inline":true},{"text":", a sufficient condition to achieve this is:","element":"span"}],[{"style":{"width":"57%"},"width":988,"height":172,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/31-14.png","element":"img"}],[{"text":"On the event ","element":"span"},{"href":"#id-67","style":{"height":16.4},"width":674.88,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-0.png","element":"img","alt":" E1∩E2∩E3∩E5∩E6∩E7, by Lemma D.2","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":21.76},"width":599.58,"height":54.39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-1.png","element":"img","alt":" ϵS(A, B, γ2, ki−1, {xt}T−Tit=1 , δ) ≥","inline":true},{"style":{"height":19.13},"width":446.23,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-2.png","element":"img","alt":"¯ϵS(A∗, B∗, γ2, T − Ti, δ)","inline":true},{"text":", so by Lemma ","element":"span"},{"href":"#id-55","text":"D.1","element":"a"},{"text":", then we will have that ","element":"span"},{"style":{"height":17.6},"width":318.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-3.png","element":"img","alt":" Ii = [ki] and that:","inline":true}],[{"style":{"width":"98%"},"width":1707,"height":286,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-4.png","element":"img"}],[{"text":"where 
","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-5.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"text":"is the solution to ","element":"span"},{"style":{"height":21.81},"width":891.76,"height":54.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-6.png","element":"img","alt":" OptInputki(A∗, B∗, γ2/2, [ki], {xt}T−Tit=1 ) and ˆU","inline":true,"padRight":true},{"text":"the solution to","element":"span"}],[{"style":{"height":21.81},"width":810.84,"height":54.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-7.png","element":"img","alt":"OptInputki( ˆAi−1, B∗, γ2/2, [ki], {xt}T−Tit=1 )","inline":true},{"text":". So it follows that on the event ","element":"span"},{"style":{"height":15.02},"width":355.05,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-8.png","element":"img","alt":" E1 ∩ E2 ∩ E3 ∩ E5 ∩","inline":true},{"style":{"height":15.6},"width":191.06,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-9.png","element":"img","alt":"E6 ∩ E7, A","inline":true,"padRight":true},{"text":"will also hold, so ","element":"span"},{"style":{"height":17.6},"width":729.18,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-10.png","element":"img","alt":" P[Ac ∩ E1 ∩ E2 ∩ E3 ∩ E5 ∩ E6 ∩ E7] = 0","inline":true},{"text":". 
We can then apply part ","element":"span"},{"href":"#id-72","text":"(a) ","element":"a"},{"text":"to get that, so long as ","element":"span"},{"style":{"height":14.62},"width":80.38,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-11.png","element":"img","alt":" Ti−1","inline":true,"padRight":true},{"text":"meets the condition above and ","element":"span"},{"style":{"height":31.6},"width":532.3,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-12.png","element":"img","alt":" Ti ≥ 3Tss((1/10)λmin(˜Γu∗iki), ki):","inline":true}],[{"style":{"width":"82%"},"width":1421,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-13.png","element":"img"}],[{"id":"id-44","style":{"fontWeight":"bold"},"text":"B.3. Proof of Theorem ","element":"span"},{"href":"#id-43","style":{"fontWeight":"bold"},"text":"2.2","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Throughout we will let ","element":"span"},{"style":{"height":24},"width":241.39,"height":60.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-14.png","element":"img","alt":" T = ∑_{j=0}^{i} T_j","inline":true},{"text":", the total time that has elapsed after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"epochs. 
","element":"span"},{"text":"By Lemma ","element":"span"},{"href":"#id-41","text":"H.3","element":"a"},{"text":", we know that:","element":"span"}],[{"style":{"width":"34%"},"width":592,"height":70,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-15.png","element":"img"}],[{"text":"exists and is finite, where here ","element":"span"},{"style":{"height":12.73},"width":41.98,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-16.png","element":"img","alt":" u∗ ","inline":true,"padRight":true},{"text":"is the set of inputs in ","element":"span"},{"style":{"height":18.58},"width":61.64,"height":46.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-17.png","element":"img","alt":" Uγ2","inline":true,"padRight":true},{"text":"that maximizes ","element":"span"},{"style":{"height":24.21},"width":412.2,"height":60.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-18.png","element":"img","alt":" λmin(σ2Γk02i + ˜Γuk02i).","inline":true,"padRight":true},{"text":"It follows then that there exists ","element":"span"},{"style":{"height":14.62},"width":32.03,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-19.png","element":"img","alt":" i0","inline":true,"padRight":true},{"text":"such that, for all ","element":"span"},{"style":{"height":14.62},"width":105.25,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-20.png","element":"img","alt":" i ≥ i0","inline":true},{"text":", we will have:","element":"span"}],[{"style":{"width":"37%"},"width":641,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-21.png","element":"img"}],[{"text":"By Corollary ","element":"span"},{"href":"#id-83","text":"F.3 ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-84","text":"F.6","element":"a"},{"text":", for small enough 
","element":"span"},{"style":{"height":15.02},"width":237.6,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-22.png","element":"img","alt":" ϵ and some i1","inline":true},{"text":", we will have that: ","element":"span"},{"style":{"height":35.38},"width":1013.44,"height":88.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-23.png","element":"img","alt":"��λmin(σ2Γk02i + ˜Γu∗k02i) − λmin(σ2Γk02i + ˜Γˆuk02i)�� ≤ 14c∗","inline":true}],[{"style":{"width":"104%"},"width":1799,"height":670,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/32-24.png","element":"img"}],[{"text":"which implies:","element":"span"}],[{"id":"id-85","style":{"width":"65%"},"width":1125,"height":84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-0.png","element":"img"}],[{"text":"Modifying the burn-in time of Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"to:","element":"span"}],[{"style":{"width":"85%"},"width":1473,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-1.png","element":"img"}],[{"text":"Assuming this burn in time is met and:","element":"span"}],[{"style":{"width":"89%"},"width":1543,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-2.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":125.31,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-3.png","element":"img","alt":"i ≥ cki","inline":true}],[{"text":"then by Theorem ","element":"span"},{"href":"#id-48","text":"B.1","element":"a"},{"text":", we will have that:","element":"span"}],[{"style":{"width":"24%"},"width":429,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-4.png","element":"img"}],[{"text":"so long as (where here we use the fact that 
","element":"span"},{"style":{"height":17.6},"width":210.54,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-5.png","element":"img","alt":" ki = k(T)):","inline":true}],[{"style":{"width":"63%"},"width":1091,"height":211,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-6.png","element":"img"}],[{"text":"or equivalently:","element":"span"}],[{"id":"id-86","style":{"width":"81%"},"width":1401,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-7.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":12.73},"width":41.98,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-8.png","element":"img","alt":" u∗ ","inline":true,"padRight":true},{"text":"is defined as above and here we use (","element":"span"},{"href":"#id-85","text":"10","element":"a"},{"text":"). Note that by modifying the burn-in time of Theorem ","element":"span"},{"href":"#id-48","text":"B.1","element":"a"},{"text":", replacing ","element":"span"},{"style":{"height":19.13},"width":491.1,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-9.png","element":"img","alt":" ¯ϵ(A∗, B∗, γ2, T, δ) with ϵ∞","inline":true},{"text":", by the definition of ","element":"span"},{"style":{"height":10.22},"width":51.71,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-10.png","element":"img","alt":" ϵ∞","inline":true},{"text":", we will have the inputs being played are optimal with the flag ","element":"span"},{"style":{"height":19.13},"width":917.55,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-11.png","element":"img","alt":" FT = False, since ϵ∞ ≤ ¯ϵ(A∗, B∗, γ2, T, δ). 
As","inline":true,"padRight":true},{"text":"noted above, ","element":"span"},{"style":{"height":31.6},"width":393.61,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-12.png","element":"img","alt":" λmin(Γηk(T) + ˜Γu∗k(T))","inline":true},{"text":"is upper bounded by a constant independent of ","element":"span"},{"style":{"height":15.2},"width":262.19,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-13.png","element":"img","alt":" T and δ. Thus,","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":12.8},"width":111.27,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-14.png","element":"img","alt":" δ → 0","inline":true},{"text":", the condition (","element":"span"},{"href":"#id-86","text":"11","element":"a"},{"text":") will force ","element":"span"},{"style":{"height":12.4},"width":143.8,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-15.png","element":"img","alt":" T → ∞","inline":true},{"text":". This implies that for small enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-16.png","element":"img","alt":" δ","inline":true},{"text":", we will have ","element":"span"},{"style":{"height":20.33},"width":373.9,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-17.png","element":"img","alt":"k(T) ≥ k0 2^max{i0,i1}","inline":true},{"text":". 
In this case, then, we will have:","element":"span"}],[{"style":{"width":"94%"},"width":1628,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-18.png","element":"img"}],[{"text":"Defining ","element":"span"},{"style":{"height":13.24},"width":48.79,"height":33.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-19.png","element":"img","alt":" ¯τϵδ","inline":true,"padRight":true},{"text":"to be a solution to:","element":"span"}],[{"style":{"width":"41%"},"width":710,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/33-20.png","element":"img"}],[{"text":"for small enough ","element":"span"},{"style":{"height":15.6},"width":57.14,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-0.png","element":"img","alt":" ϵ, δ","inline":true},{"text":", it then follows by Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"that for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"at an epoch boundary, so long as ","element":"span"},{"style":{"height":14.84},"width":138.53,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-1.png","element":"img","alt":" T ≥ ¯τϵδ","inline":true,"padRight":true},{"text":"and the burn-in condition is met, we will have that:","element":"span"}],[{"style":{"width":"24%"},"width":429,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-2.png","element":"img"}],[{"text":"The above definition of ","element":"span"},{"style":{"height":13.24},"width":48.79,"height":33.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-3.png","element":"img","alt":" ¯τϵδ","inline":true,"padRight":true},{"text":"implies that necessarily 
","element":"span"},{"style":{"height":28.5},"width":498.82,"height":71.26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-4.png","element":"img","alt":" ¯τϵδ ≥C′σ2 log 1δϵ2c∗ so as δ → 0","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":13.25},"width":162.83,"height":33.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-5.png","element":"img","alt":"¯τϵδ → ∞","inline":true},{"text":". By definition:","element":"span"}],[{"style":{"width":"63%"},"width":1089,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-6.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"93%"},"width":1614,"height":255,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-7.png","element":"img"}],[{"text":"where the inequality will hold for small enough ","element":"span"},{"style":{"height":15.24},"width":525.24,"height":38.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-8.png","element":"img","alt":" δ. 
Since ¯τϵδ → ∞ as δ → 0","inline":true},{"text":", it follows that for small enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-9.png","element":"img","alt":" δ","inline":true},{"text":", we will have that:","element":"span"}],[{"style":{"width":"49%"},"width":851,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-10.png","element":"img"}],[{"text":"Thus, for small enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-11.png","element":"img","alt":" δ","inline":true},{"text":", we will have that:","element":"span"}],[{"id":"id-87","style":{"width":"98%"},"width":1702,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-12.png","element":"img"}],[{"text":"So if:","element":"span"}],[{"style":{"width":"55%"},"width":962,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-13.png","element":"img"}],[{"text":"we will have that for any ","element":"span"},{"style":{"height":14.84},"width":138.53,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-14.png","element":"img","alt":" T ≥ ¯τϵδ","inline":true},{"text":", so long as the burn-in condition is met:","element":"span"}],[{"style":{"width":"24%"},"width":429,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-15.png","element":"img"}],[{"text":"We can set:","element":"span"}],[{"style":{"width":"56%"},"width":975,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-16.png","element":"img"}],[{"text":"and then:","element":"span"}],[{"style":{"width":"18%"},"width":316,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-17.png","element":"img"}],[{"text":"It remains to show that the modified burn-in time 
required by Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"is met as ","element":"span"},{"style":{"height":12.8},"width":224.91,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-18.png","element":"img","alt":" δ → 0. That","inline":true,"padRight":true},{"text":"is, we need to ensure that as ","element":"span"},{"style":{"height":12.8},"width":122.74,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-19.png","element":"img","alt":" δ → 0:","inline":true}],[{"style":{"width":"86%"},"width":1495,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/34-20.png","element":"img"}],[{"text":"where here we have replaced ","element":"span"},{"style":{"height":16.4},"width":154,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-0.png","element":"img","alt":" Ti by τϵδ","inline":true,"padRight":true},{"text":"by noting that ","element":"span"},{"style":{"height":19.28},"width":250.03,"height":48.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-1.png","element":"img","alt":" Ti ≥ τϵδ/2 if τϵδ","inline":true,"padRight":true},{"text":"is at an epoch boundary, since ","element":"span"},{"style":{"height":21.29},"width":278.78,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-2.png","element":"img","alt":"Ti = (2/3)T + (1/3)T0","inline":true},{"text":". 
By what we have shown and by definition of ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-3.png","element":"img","alt":" τϵδ","inline":true},{"text":", so long as ","element":"span"},{"style":{"height":12.22},"width":128.44,"height":30.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-4.png","element":"img","alt":" ϵ < ϵ∞","inline":true,"padRight":true},{"text":"and for small ","element":"span"},{"text":"enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-5.png","element":"img","alt":" δ","inline":true},{"text":", we automatically have that:","element":"span"}],[{"style":{"width":"76%"},"width":1325,"height":331,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-6.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":21.67},"width":257.57,"height":54.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-7.png","element":"img","alt":" λmin(Γηki) > 0","inline":true},{"text":", and that the dependence in ","element":"span"},{"style":{"height":14.62},"width":57.15,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-8.png","element":"img","alt":" Tss","inline":true,"padRight":true},{"text":"is logarithmic in ","element":"span"},{"style":{"height":10.44},"width":48.78,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-9.png","element":"img","alt":" τϵδ","inline":true},{"text":", and scales as ","element":"span"},{"style":{"height":21.29},"width":164.78,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-10.png","element":"img","alt":" log log(1/δ).","inline":true,"padRight":true},{"text":"Thus, using the same argument as in 
(","element":"span"},{"href":"#id-87","text":"12","element":"a"},{"text":"), since ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-11.png","element":"img","alt":" τϵδ","inline":true,"padRight":true},{"text":"increases as ","element":"span"},{"style":{"height":21.29},"width":139.9,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-12.png","element":"img","alt":" log(1/δ), a","inline":true,"padRight":true},{"text":"term linear in ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-13.png","element":"img","alt":" τϵδ","inline":true,"padRight":true},{"text":"will eventually exceed a term logarithmic in ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-14.png","element":"img","alt":" τϵδ","inline":true,"padRight":true},{"text":"for small enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-15.png","element":"img","alt":" δ","inline":true},{"text":", so we will eventually have that the burn-in condition is met. 
Finally, we see that the condition:","element":"span"}],[{"style":{"width":"89%"},"width":1543,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":125.31,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-17.png","element":"img","alt":"i ≥ cki","inline":true}],[{"text":"will be met eventually regardless of how ","element":"span"},{"style":{"height":15.6},"width":103.54,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-18.png","element":"img","alt":" k0, T0","inline":true,"padRight":true},{"text":"are set since, as noted, ","element":"span"},{"style":{"height":16.4},"width":507.78,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-19.png","element":"img","alt":" τϵδ → ∞ as δ → 0, implying","inline":true,"padRight":true},{"text":"that the number of epochs will go to infinity as ","element":"span"},{"style":{"height":15.02},"width":282.56,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-20.png","element":"img","alt":" δ → 0. Since Ti","inline":true,"padRight":true},{"text":"increases faster than ","element":"span"},{"style":{"height":15.02},"width":34.72,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-21.png","element":"img","alt":" ki","inline":true},{"text":", eventually the left hand side of the above inequality will be greater than the right hand side.","element":"span"}]]},{"heading":"Appendix C. 
Special Cases of Theorem B.1","paragraphs":[[{"id":"id-88","style":{"fontWeight":"bold"},"text":"Corollary C.1 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Full version of Corollary ","element":"span"},{"href":"#id-45","style":{"fontStyle":"italic","fontWeight":"bold"},"text":"3.1","element":"a"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose the assumptions outlined in Section ","element":"span"},{"style":{"fontStyle":"italic"},"text":"3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"for the case where ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-22.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is diagonalizable by a unitary matrix hold. Then after:","element":"span"}],[{"style":{"width":"89%"},"width":1554,"height":347,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-23.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"steps, Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"will attain the following rate:","element":"span"}],[{"style":{"width":"98%"},"width":1702,"height":210,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-24.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"while simply playing ","element":"span"},{"style":{"height":24.81},"width":282.9,"height":62.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-25.png","element":"img","alt":" ut ∼ N(0, (γ²/d) I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all time will yield the following 
rate:","element":"span"}],[{"style":{"width":"88%"},"width":1527,"height":211,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/35-26.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"C.1. Proof of Corollary ","element":"span"},{"href":"#id-45","style":{"fontWeight":"bold"},"text":"3.1 ","element":"a"},{"style":{"fontWeight":"bold"},"text":"and Corollary ","element":"span"},{"href":"#id-88","style":{"fontWeight":"bold"},"text":"C.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"The above rate can be attained by the input:","element":"span"}],[{"style":{"width":"70%"},"width":1211,"height":442,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/36-0.png","element":"img"}],[{"text":"To see this, note that with this input we will have that:","element":"span"}],[{"style":{"width":"85%"},"width":1474,"height":440,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/36-1.png","element":"img"}],[{"text":"Note that:","element":"span"}],[{"style":{"width":"70%"},"width":1223,"height":315,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/36-2.png","element":"img"}],[{"text":"where the last equality will hold as long as:","element":"span"}],[{"style":{"width":"41%"},"width":719,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/36-3.png","element":"img"}],[{"text":"Assume that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"satisfies this, then:","element":"span"}],[{"style":{"width":"78%"},"width":1351,"height":438,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/36-4.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"78%"},"width":1348,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-0.png","element":"img"}],[{"text":"Thus, we will have that 
","element":"span"},{"style":{"height":31.6},"width":451.95,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-1.png","element":"img","alt":" λmin�˜Γuk�= O� γ2∥1−λ∥22","inline":true}],[{"text":"Since we have constructed a feasible input and Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"constructs the optimal input on the true system (assuming ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is large enough), it follows that Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"will perform at least this well.","element":"span"}],[{"style":{"width":"98%"},"width":1704,"height":753,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-2.png","element":"img"}],[{"text":"It remains then to quantify how large ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"must be to achieve this rate. From Theorem ","element":"span"},{"href":"#id-48","text":"B.1","element":"a"},{"text":", we know that we must have:","element":"span"}],[{"style":{"width":"91%"},"width":1576,"height":204,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-3.png","element":"img"}],[{"text":"and from above we need ","element":"span"},{"style":{"height":31.6},"width":447.07,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-4.png","element":"img","alt":" k = O�maxi=1,...,d i1−λi","inline":true}],[{"text":"lets us lower bound ","element":"span"},{"style":{"height":14.8},"width":171.59,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-5.png","element":"img","alt":" k as k ≥","inline":true}],[{"text":"sufficiently 
large.","element":"span"}],[{"style":{"width":"102%"},"width":1766,"height":590,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/37-6.png","element":"img"}],[{"style":{"width":"80%"},"width":1392,"height":602,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-0.png","element":"img"}],[{"text":"where the first inequality holds since ","element":"span"},{"style":{"height":24.72},"width":599.01,"height":61.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-1.png","element":"img","alt":" log(1/¯ρ(A∗)) ≈ 1 − ¯ρ(A∗) for ¯ρ(A∗)","inline":true,"padRight":true},{"text":"close to 1 and the second ","element":"span"},{"text":"holds by our lower bound on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":".","element":"span"}],[{"text":"To bound ","element":"span"},{"style":{"height":19.13},"width":351.44,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-2.png","element":"img","alt":" ¯ϵS(A∗, B∗, γ2, T, δ)","inline":true},{"text":", we must first bound ","element":"span"},{"style":{"height":19.21},"width":374.92,"height":48.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-3.png","element":"img","alt":"¯Mk(A∗, B∗, δ, γ2/2)","inline":true},{"text":". 
We see in our case that:","element":"span"}],[{"style":{"width":"75%"},"width":1303,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-4.png","element":"img"}],[{"style":{"height":20.01},"width":421.01,"height":50.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-5.png","element":"img","alt":"¯Mk(A∗, B∗, δ, γ2/2) ⊆","inline":true}],[{"style":{"width":"75%"},"width":1303,"height":23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-6.png","element":"img"}],[{"text":"Note that this implies that, for any ","element":"span"},{"style":{"height":19.21},"width":865.83,"height":48.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-7.png","element":"img","alt":" u ∈ ¯Mk(A∗, B∗, δ, γ2/2), denoting wi = [V ⊤u]i","inline":true},{"text":", we will have:","element":"span"}],[{"style":{"width":"85%"},"width":1475,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-8.png","element":"img"}],[{"text":"Then we will have that:","element":"span"}],[{"style":{"width":"101%"},"width":1752,"height":237,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-9.png","element":"img"}],[{"text":"Based on our choice of inputs:","element":"span"}],[{"style":{"width":"73%"},"width":1263,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-10.png","element":"img"}],[{"text":"Combining these, we can lower bound ","element":"span"},{"style":{"height":19.13},"width":410.67,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-11.png","element":"img","alt":" ¯ϵS(A∗, B∗, γ2, T, δ) as:","inline":true}],[{"style":{"width":"45%"},"width":784,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-12.png","element":"img"}],[{"text":"We can then write the burn-in time from Theorem ","element":"span"},{"href":"#id-48","text":"B.1 
","element":"a"},{"text":"as:","element":"span"}],[{"style":{"width":"89%"},"width":1554,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/38-13.png","element":"img"}],[{"style":{"width":"61%"},"width":1055,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-0.png","element":"img"}],[{"text":"The rate in the case where we simply play ","element":"span"},{"style":{"height":24.81},"width":287.34,"height":62.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-1.png","element":"img","alt":" ut ∼ N(0, γ2d I)","inline":true,"padRight":true},{"text":"for all time follows from Theorem","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"C.2. Proof of Corollary ","element":"span"},{"href":"#id-52","style":{"fontWeight":"bold"},"text":"3.2","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Since ","element":"span"},{"style":{"height":21.83},"width":965.45,"height":54.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-2.png","element":"img","alt":" ∥A∗ − ˆA∥2 = maxj=1,...,m ∥Aj − ˆAj∥2 (assuming ˆA","inline":true,"padRight":true},{"text":"has the same block diagonal structure), to minimize the error in the estimate we want to minimize the maximum error in the estimate of each subsystem. 
By Theorem ","element":"span"},{"href":"#id-48","text":"B.1","element":"a"},{"text":", once the burn-in time is reached, the estimation error for each subsystem will behave as:","element":"span"}],[{"style":{"width":"86%"},"width":1501,"height":241,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-3.png","element":"img"}],[{"text":"where we let ","element":"span"},{"style":{"height":14.73},"width":42.27,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-4.png","element":"img","alt":" Γj ","inline":true,"padRight":true},{"text":"denote the covariates for the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":"th subsystem. For simplicity, assume that:","element":"span"}],[{"style":{"width":"51%"},"width":887,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-5.png","element":"img"}],[{"text":"where we let ","element":"span"},{"style":{"height":22.22},"width":82.08,"height":55.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-6.png","element":"img","alt":" λ∗,jmin ","inline":true,"padRight":true},{"text":"denote the optimal response of the system to inputs with power 1, and ","element":"span"},{"style":{"height":22.02},"width":42.02,"height":55.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-7.png","element":"img","alt":" γ2j","inline":true,"padRight":true},{"text":"the true amount of power input to the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":"th block. 
Ignoring log factors, the optimal thing to do is to then set:","element":"span"}],[{"id":"id-89","style":{"width":"59%"},"width":1024,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-8.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":17.6},"width":179.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-9.png","element":"img","alt":" ℓ, j ∈ [m]","inline":true},{"text":", as this will make the estimation error equal for each subsystem, minimizing the overall error. Meeting this constraint and the power constraint, the following condition will then be met for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":":","element":"span"}],[{"style":{"width":"101%"},"width":1749,"height":494,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-10.png","element":"img"}],[{"text":"Thus, with high probability, we will have that:","element":"span"}],[{"style":{"width":"24%"},"width":429,"height":192,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/39-11.png","element":"img"}],[{"text":"In contrast, if we simply input random noise into the system—that is, set ","element":"span"},{"style":{"height":27.21},"width":447.38,"height":68.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-0.png","element":"img","alt":" ut ∼ N(0, γ2p I)—then in","inline":true,"padRight":true},{"text":"the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th block we will achieve the rate:","element":"span"}],[{"style":{"width":"92%"},"width":1592,"height":241,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-1.png","element":"img"}],[{"text":"so, with high probability, noting that by construction 
","element":"span"},{"style":{"height":14.8},"width":130.45,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-2.png","element":"img","alt":" p ≥ m:","inline":true}],[{"style":{"width":"40%"},"width":697,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-3.png","element":"img"}],[{"text":"To achieve the adaptive rate, Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"can be run separately for each subsystem. After the optimal solution for each subsystem is found, the power ","element":"span"},{"style":{"height":22.02},"width":42.02,"height":55.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-4.png","element":"img","alt":" γ2j ","inline":true,"padRight":true},{"text":"input to each subsystem can then be ","element":"span"},{"text":"adjusted so that the empirical version of (","element":"span"},{"href":"#id-89","text":"14","element":"a"},{"text":") is satisfied. Once the burn-in time from Theorem ","element":"span"},{"href":"#id-48","text":"B.1 ","element":"a"},{"text":"is met for each subsystem, our estimates of ","element":"span"},{"style":{"height":22.22},"width":82.07,"height":55.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-5.png","element":"img","alt":" λ∗,jmin ","inline":true,"padRight":true},{"text":"will be sufficiently accurate to guarantee that (","element":"span"},{"href":"#id-89","text":"14","element":"a"},{"text":") ","element":"span"},{"text":"will be met on the true system, and we will then achieve the optimal adaptive rate.","element":"span"}]]},{"heading":"Appendix D. Algorithm 1 Performance Lemmas","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"D.1. 
Quantifying When ","element":"span"},{"style":{"height":10.22},"width":72.59,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-6.png","element":"img","alt":" ϵi−1","inline":true,"padRight":true},{"style":{"fontWeight":"bold"},"text":"Small Enough for ","element":"span"},{"style":{"height":17.22},"width":138.67,"height":43.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-7.png","element":"img","alt":" ui ≈ u∗i","inline":true}],[{"id":"id-55","style":{"fontWeight":"bold"},"text":"Lemma D.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"If:","element":"span"}],[{"style":{"width":"110%"},"width":1918,"height":731,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-8.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-9.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to ","element":"span"},{"style":{"height":23.2},"width":953.92,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-10.png","element":"img","alt":" OptInputki+1(A∗, B∗, γ2/2, [ki+1], {xt}Tt=1) and ˆU","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the solution to ","element":"span"},{"style":{"height":23.2},"width":831.16,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-11.png","element":"img","alt":"OptInputki+1( ˆAi, B∗, γ2/2, [ki+1], {xt}Tt=1).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"By Lemma ","element":"span"},{"href":"#id-80","text":"F.9","element":"a"},{"text":", if 
","element":"span"},{"style":{"height":21.21},"width":617.97,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-12.png","element":"img","alt":" ϵi ≤ (4∥(ejθI − ˆAi)−1∥2)−1, then:","inline":true}],[{"style":{"width":"41%"},"width":724,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/40-13.png","element":"img"}],[{"text":"this then implies that ","element":"span"},{"style":{"height":19.53},"width":512.49,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-0.png","element":"img","alt":" ϵi ≤ (3∥(ejθI − A∗)−1∥2)−1 ","inline":true,"padRight":true},{"text":"so, again by Lemma ","element":"span"},{"href":"#id-80","text":"F.9","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"41%"},"width":724,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-1.png","element":"img"}],[{"text":"Thus, if ","element":"span"},{"style":{"height":21.21},"width":507.08,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-2.png","element":"img","alt":" ϵi ≤ (4∥(ejθI − ˆAi)−1∥2)−1","inline":true},{"text":", we can upper bound:","element":"span"}],[{"style":{"width":"86%"},"width":1498,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"∥","element":"span"},{"style":{"height":17.6},"width":97.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-4.png","element":"img","alt":"w⊤(e","inline":true}],[{"id":"id-91","style":{"width":"83%"},"width":1450,"height":186,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-5.png","element":"img"}],[{"text":"Applying Lemma ","element":"span"},{"href":"#id-80","text":"F.9 ","element":"a"},{"text":"again, a sufficient condition for 
","element":"span"},{"style":{"height":21.21},"width":813.5,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-6.png","element":"img","alt":" ϵi ≤ (4∥(ejθI− ˆAi)−1∥2)−1 is ϵi ≤ (5∥(ejθI−","inline":true},{"style":{"height":19.13},"width":227.89,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-7.png","element":"img","alt":"A∗)−1∥2)−1.","inline":true}],[{"style":{"width":"96%"},"width":1663,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-8.png","element":"img"}],[{"text":"of Theorem ","element":"span"},{"href":"#id-66","text":"F.1","element":"a"},{"text":", it follows that:","element":"span"}],[{"style":{"width":"103%"},"width":1782,"height":629,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-9.png","element":"img"}],[{"text":"where the inequality ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"follows from Lemma ","element":"span"},{"href":"#id-81","text":"F.4 ","element":"a"},{"text":"(with a slight readjustment of constants) and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"follows from Lemma ","element":"span"},{"href":"#id-90","text":"D.3","element":"a"},{"text":". 
Thus, if we can guarantee that:","element":"span"}],[{"id":"id-92","style":{"width":"90%"},"width":1561,"height":280,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-10.png","element":"img"}],[{"text":"then it will follow that:","element":"span"}],[{"style":{"width":"99%"},"width":1720,"height":60,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-11.png","element":"img"}],[{"text":"Assume ","element":"span"},{"style":{"height":10.22},"width":29.71,"height":25.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-12.png","element":"img","alt":" ϵi","inline":true,"padRight":true},{"text":"is small enough to satisfy this. Then, with (","element":"span"},{"href":"#id-91","text":"15","element":"a"},{"text":"), it follows that if:","element":"span"}],[{"style":{"width":"84%"},"width":1454,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/41-13.png","element":"img"}],[{"style":{"width":"51%"},"width":896,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-0.png","element":"img"}],[{"text":"then:","element":"span"}],[{"style":{"width":"79%"},"width":1368,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-1.png","element":"img"}],[{"text":"so ","element":"span"},{"style":{"height":16.22},"width":167.15,"height":40.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-2.png","element":"img","alt":" ℓ ∈ Ii+1","inline":true},{"text":". Note that this condition will also imply that (","element":"span"},{"href":"#id-92","text":"16","element":"a"},{"text":") holds. 
Combining all of this, it follows that if:","element":"span"}],[{"style":{"width":"107%"},"width":1849,"height":318,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-3.png","element":"img"}],[{"text":"then ","element":"span"},{"style":{"height":17.6},"width":242.4,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-4.png","element":"img","alt":" Ii+1 = [ki+1]","inline":true},{"text":". Finally, we see that the perturbation bound holds by applying Theorem ","element":"span"},{"href":"#id-66","text":"F.1 ","element":"a"},{"text":"and our condition on ","element":"span"},{"style":{"height":14.8},"width":154.72,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-5.png","element":"img","alt":" ϵi, since:","inline":true}],[{"id":"id-67","style":{"width":"100%"},"width":1728,"height":1313,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/42-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":8.4},"width":48.42,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-0.png","element":"img","alt":" w′ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"to be specified, we will have:","element":"span"}],[{"style":{"width":"55%"},"width":966,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-1.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where:","element":"span"}],[{"style":{"height":20.01},"width":351.44,"height":50.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-2.png","element":"img","alt":"¯ϵS(A∗, B∗, γ2, T, δ)","inline":true}],[{"style":{"width":"103%"},"width":1780,"height":326,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-3.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"From the definition of 
","element":"span"},{"text":"OptInput","element":"span"},{"text":", it is clear that:","element":"span"}],[{"style":{"width":"94%"},"width":1627,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-4.png","element":"img"}],[{"text":"on the event ","element":"span"},{"style":{"height":24.42},"width":396.06,"height":61.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-5.png","element":"img","alt":"∑Tt=1 xtx⊤t ⪰ c2TΓηki","inline":true},{"text":". Further, conditioned on all three events assumed to hold, by ","element":"span"},{"text":"Lemma ","element":"span"},{"href":"#id-93","text":"F.5","element":"a"},{"text":", we have that:","element":"span"}],[{"style":{"width":"44%"},"width":769,"height":57,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-6.png","element":"img"}],[{"text":"Finally, recall that ","element":"span"},{"style":{"height":17.6},"width":184.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-7.png","element":"img","alt":" ki = k(T)","inline":true},{"text":". 
Combining all of this we have:","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"ϵ","element":"span"},{"style":{"height":20.41},"width":543.82,"height":51.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-8.png","element":"img","alt":"S(A∗, B∗, γ2, ki+1, {xt}Tt=1, δ)","inline":true}],[{"style":{"width":"110%"},"width":1907,"height":750,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-9.png","element":"img"}],[{"id":"id-90","style":{"fontWeight":"bold"},"text":"Lemma D.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"When calling ","element":"span"},{"text":"UpdateInputs","element":"span"},{"style":{"fontStyle":"italic"},"text":", we will always have that:","element":"span"}],[{"style":{"width":"42%"},"width":729,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/43-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Recall that:","element":"span"}],[{"style":{"width":"99%"},"width":1727,"height":302,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-0.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"111%"},"width":1928,"height":254,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-1.png","element":"img"}],[{"text":"for any ","element":"span"},{"style":{"height":17.6},"width":119.73,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-2.png","element":"img","alt":" ℓ ∈ [k]","inline":true,"padRight":true},{"text":"satisfying ","element":"span"},{"href":"#id-80","style":{"height":34.85},"width":812.47,"height":87.13,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-3.png","element":"img","alt":" ϵ ≤ (4∥(ej 2πℓk I − ˆA)−1∥2)−1, by Lemma F.9","inline":true},{"text":", we will have 
that:","element":"span"}],[{"style":{"width":"45%"},"width":786,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-4.png","element":"img"}],[{"text":"Since ","element":"span"},{"text":"UpdateInputs ","element":"span"},{"text":"only includes frequencies ","element":"span"},{"style":{"height":34.85},"width":828.41,"height":87.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-5.png","element":"img","alt":" ℓ in I if ϵ ≤ (4∥(ej 2πℓk I − ˆA)−1∥2)−1, it fol-","inline":true,"padRight":true},{"text":"lows that:","element":"span"}],[{"style":{"width":"95%"},"width":1656,"height":150,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-6.png","element":"img"}],[{"text":"from which it follows that ","element":"span"},{"style":{"height":21.49},"width":744.15,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-7.png","element":"img","alt":" M(A∗, ˆA, {xt}Tt=1, I) ⊆ M( ˆA, {xt}Tt=1).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"D.2. Meeting the Burn-In Time of Theorem ","element":"span"},{"href":"#id-94","style":{"fontWeight":"bold"},"text":"E.1","element":"a"}],[{"id":"id-74","style":{"width":"104%"},"width":1813,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"We have that ","element":"span"},{"style":{"height":31.6},"width":1357.92,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-9.png","element":"img","alt":"¯ΓT = 4(˜ΓuT,0 + Tr(ΓηT )(1 + log 2δ)I) where ˜ΓuT,0 = 1T ∑Tt=1 xut xut⊤. 
Note","inline":true,"padRight":true},{"text":"that:","element":"span"}],[{"style":{"width":"82%"},"width":1433,"height":261,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-10.png","element":"img"}],[{"text":"which implies that:","element":"span"}],[{"style":{"width":"25%"},"width":447,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/44-11.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"68%"},"width":1186,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-0.png","element":"img"}],[{"text":"We also have that:","element":"span"}],[{"style":{"width":"69%"},"width":1206,"height":715,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-1.png","element":"img"}],[{"text":"This gives that:","element":"span"}],[{"style":{"width":"78%"},"width":1357,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-2.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"111%"},"width":1932,"height":454,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-3.png","element":"img"}],[{"id":"id-73","style":{"fontWeight":"bold"},"text":"Lemma D.5 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":29.18},"width":541.72,"height":72.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-4.png","element":"img","alt":" T ≥ 16, γ2 ≥ (1−¯ρ(A∗))22β(A∗)2 , and:","inline":true}],[{"style":{"width":"89%"},"width":1543,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-5.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":125.31,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-6.png","element":"img","alt":"i ≥ 
cki","inline":true}],[{"style":{"fontStyle":"italic"},"text":"then:","element":"span"}],[{"style":{"width":"95%"},"width":1658,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-7.png","element":"img"}],[{"style":{"height":15.02},"width":194.44,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/45-8.png","element":"img","alt":"3Ti ≥ c2ki","inline":true}],[{"style":{"width":"91%"},"width":1576,"height":463,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-0.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"101%"},"width":1758,"height":150,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-1.png","element":"img"}],[{"style":{"height":15.02},"width":120.27,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-2.png","element":"img","alt":"≤ c2ki","inline":true}],[{"style":{"width":"96%"},"width":1676,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-3.png","element":"img"}],[{"text":"where the second to last inequality follows assuming that ","element":"span"},{"style":{"height":29.18},"width":513.46,"height":72.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-4.png","element":"img","alt":" T ≥ 16 and γ2 ≥ (1−¯ρ(A∗))22β(A∗)2 .","inline":true}],[{"text":"A direct corollary of Lemma ","element":"span"},{"text":"D.5 ","element":"span"},{"text":"and Lemma ","element":"span"},{"text":"D.4 ","element":"span"},{"text":"is that, assuming ","element":"span"},{"href":"#id-73","style":{"height":49.05},"width":1723.28,"height":122.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-5.png","element":"img","alt":" T0 ≥ 16 and γ2 ≥(1−¯ρ(A∗))22β(A∗)2","inline":true,"padRight":true},{"text":", then, as long 
as:","element":"span"}],[{"style":{"width":"90%"},"width":1556,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":15.02},"width":135.71,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-7.png","element":"img","alt":"0 ≥ ck0","inline":true}],[{"text":"the ","element":"span"},{"style":{"height":15.02},"width":158.57,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-8.png","element":"img","alt":" ki and Ti","inline":true,"padRight":true},{"text":"used by Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"will satisfy:","element":"span"}],[{"style":{"width":"41%"},"width":723,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-9.png","element":"img"}],[{"text":"for any ","element":"span"},{"style":{"height":14.8},"width":272.68,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-10.png","element":"img","alt":" Γ ⪰ 0 and all i.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"D.3. 
Additional Lemmas","element":"span"}],[{"id":"id-77","style":{"fontWeight":"bold"},"text":"Lemma D.6 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any ","element":"span"},{"style":{"height":17.6},"width":566.19,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-11.png","element":"img","alt":" i and any t ∈ [T − Ti, Ti − ki]","inline":true},{"style":{"fontStyle":"italic"},"text":", the inputs generated by Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"will satisfy:","element":"span"}],[{"style":{"width":"23%"},"width":411,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Denote ","element":"span"},{"style":{"height":16.72},"width":390.19,"height":41.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-13.png","element":"img","alt":" ut = ˜ut +ηut where ˜ut ","inline":true,"padRight":true},{"text":"is the solution to ","element":"span"},{"style":{"height":19.81},"width":770.37,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-14.png","element":"img","alt":" OptInputk(A∗, B∗, γ2 −pσ2u, I, {xt}Tt=1)","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":19.13},"width":289.58,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-15.png","element":"img","alt":" ηut ∼ N(0, σ2uI)","inline":true},{"text":". Assume that ","element":"span"},{"style":{"height":19.05},"width":252.65,"height":47.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-16.png","element":"img","alt":" σ2u ̸= 0. 
Then:","inline":true}],[{"style":{"width":"61%"},"width":1064,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/46-17.png","element":"img"}],[{"style":{"width":"37%"},"width":645,"height":351,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"follows since ","element":"span"},{"style":{"height":16.71},"width":180.96,"height":41.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-1.png","element":"img","alt":" ˜ut and ηut ","inline":true,"padRight":true},{"text":"are independent, ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"follows by our choice of ","element":"span"},{"style":{"height":19.05},"width":218.27,"height":47.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-2.png","element":"img","alt":" σ2u in Algo-","inline":true,"padRight":true},{"text":"rithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":". 
","element":"span"},{"text":"The final equality follows since, by construction, the inputs that are the solution to ","element":"span"},{"style":{"height":19.81},"width":778.22,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-3.png","element":"img","alt":"OptInputk(A∗, B∗, γ2 − pσ2u, I, {xt}Tt=1)","inline":true,"padRight":true},{"text":"will satisfy:","element":"span"}],[{"style":{"width":"24%"},"width":424,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-4.png","element":"img"}],[{"text":"for any ","element":"span"},{"style":{"height":14},"width":106.76,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-5.png","element":"img","alt":" t ≥ 0.","inline":true}],[{"id":"id-69","style":{"fontWeight":"bold"},"text":"Lemma D.7 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"After ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"style":{"fontStyle":"italic"},"text":"epochs of running Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1","element":"a"},{"style":{"fontStyle":"italic"},"text":", we will have, with probability ","element":"span"},{"style":{"height":12.8},"width":111.19,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-6.png","element":"img","alt":" 1 − δ:","inline":true}],[{"style":{"width":"73%"},"width":1264,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-7.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Let ","element":"span"},{"style":{"height":18.72},"width":601.62,"height":46.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-8.png","element":"img","alt":" xt = xut + xη,pt + xη,ut where xut ","inline":true,"padRight":true},{"text":"is the response of the system due to the sinusoidal ","element":"span"},{"text":"component of 
the input, ","element":"span"},{"style":{"height":18.72},"width":70.49,"height":46.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-9.png","element":"img","alt":" xη,pt","inline":true,"padRight":true},{"text":"is the response due to the process noise, and ","element":"span"},{"style":{"height":18.72},"width":72.49,"height":46.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-10.png","element":"img","alt":" xη,ut","inline":true,"padRight":true},{"text":"is the response due to the input noise. Note that this decomposition holds by linearity. Given this, we have ","element":"span"},{"style":{"height":17.6},"width":153,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-11.png","element":"img","alt":" ∥xt∥2 ≤","inline":true}],[{"style":{"width":"34%"},"width":604,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"∥","element":"span"},{"style":{"height":17.6},"width":133.42,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-13.png","element":"img","alt":"xut ∥2 ≤","inline":true}],[{"style":{"width":"93%"},"width":1617,"height":334,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-14.png","element":"img"}],[{"text":"By construction, we will have that ","element":"span"},{"style":{"height":26.38},"width":473.87,"height":65.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-15.png","element":"img","alt":"∑ki(ℓ+1)−1s=kiℓ ∥us∥22 ≤ kiγ2","inline":true,"padRight":true},{"text":"so long as ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-16.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"is large enough that 
","element":"span"},{"style":{"height":15.02},"width":54.24,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-17.png","element":"img","alt":"kiℓ","inline":true,"padRight":true},{"text":"is in epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":". However, since ","element":"span"},{"style":{"height":15.02},"width":34.72,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-18.png","element":"img","alt":" ki","inline":true,"padRight":true},{"text":"is doubled at each epoch, this sum will contain an integer multiple of the period of the input regardless of what the value of ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-19.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"is, so we see that this inequality will hold for all values of ","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-20.png","element":"img","alt":" ℓ","inline":true},{"text":". 
This implies that for all ","element":"span"},{"style":{"height":17.77},"width":776.15,"height":44.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-21.png","element":"img","alt":" ℓ (since ∥x∥1 ≤ √n∥x∥2 for any x ∈ Rn),","inline":true}],[{"style":{"width":"96%"},"width":1670,"height":253,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/47-22.png","element":"img"}],[{"style":{"width":"76%"},"width":1320,"height":672,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-0.png","element":"img"}],[{"text":"where the last inequality holds since if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"is divisible by ","element":"span"},{"style":{"height":31.37},"width":785.05,"height":78.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-1.png","element":"img","alt":" ki, ki⌈t/ki⌉−t+1 = 1 so ¯ρ(A∗)ki¯ρ(A∗)ki⌈t/ki⌉−t+1 ≤","inline":true,"padRight":true},{"text":"1","element":"span"},{"text":", and if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"is not divisible by ","element":"span"},{"style":{"height":17.6},"width":1003.25,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-2.png","element":"img","alt":" ki, ki⌈t/ki⌉ − t + 1 < ki(t/ki + 1) − t + 1 = ki + 1","inline":true},{"text":", and since ","element":"span"},{"style":{"height":17.6},"width":293.27,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-3.png","element":"img","alt":"ki⌈t/ki⌉ − t + 1","inline":true,"padRight":true},{"text":"is an integer, it follows that ","element":"span"},{"style":{"height":31.37},"width":366.53,"height":78.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-4.png","element":"img","alt":"¯ρ(A∗)ki¯ρ(A∗)ki⌈t/ki⌉−t+1 ≤ 1.","inline":true}],[{"text":"By 
definition:","element":"span"}],[{"style":{"width":"82%"},"width":1417,"height":410,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-5.png","element":"img"}],[{"text":"where:","element":"span"}],[{"style":{"width":"50%"},"width":866,"height":225,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-6.png","element":"img"}],[{"text":"Noting that ","element":"span"},{"style":{"height":20.41},"width":429.22,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-7.png","element":"img","alt":" E˜η⊤ ˜A⊤ ˜A˜η = σ2Tr(Γt)","inline":true},{"text":", we can then apply the Hanson-Wright inequality to get:","element":"span"}],[{"style":{"width":"99%"},"width":1725,"height":668,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/48-8.png","element":"img"}],[{"style":{"width":"28%"},"width":487,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-0.png","element":"img"}],[{"text":"where the inequality holds since ","element":"span"},{"style":{"height":21.19},"width":1133.8,"height":52.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-1.png","element":"img","alt":" ∥ ˜A⊤ ˜A∥2F ≤ ∥ ˜A⊤ ˜A∥2Tr( ˜A⊤ ˜A) = ∥ ˜A⊤ ˜A∥2Tr(Γt). 
Denoting","inline":true},{"style":{"height":16.41},"width":267.38,"height":41.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-2.png","element":"img","alt":"˜A⊤ ˜A = UΛU ⊤","inline":true},{"text":", we see this is true since:","element":"span"}],[{"style":{"width":"93%"},"width":1617,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-3.png","element":"img"}],[{"text":"A similar calculation reveals that with probability at least ","element":"span"},{"style":{"height":17.6},"width":151.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-4.png","element":"img","alt":" 1 − δ/2:","inline":true}],[{"style":{"width":"69%"},"width":1202,"height":183,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-5.png","element":"img"}],[{"id":"id-75","style":{"fontWeight":"bold"},"text":"Corollary D.8 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"After ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"style":{"fontStyle":"italic"},"text":"epochs of running Algorithm ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1","element":"a"},{"style":{"fontStyle":"italic"},"text":", on the event that:","element":"span"}],[{"style":{"width":"71%"},"width":1242,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"we will have:","element":"span"}],[{"style":{"width":"91%"},"width":1574,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-7.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":19.67},"width":406.53,"height":49.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-8.png","element":"img","alt":"ss(ζ, ki+1, x ¯Ti) ≤ 
max","inline":true}],[{"style":{"width":"91%"},"width":1574,"height":477,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is the amount of time elapsed after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"style":{"fontStyle":"italic"},"text":"epochs.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"From Lemma ","element":"span"},{"href":"#id-95","text":"E.10","element":"a"},{"text":", we have:","element":"span"}],[{"style":{"width":"92%"},"width":1598,"height":462,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/49-10.png","element":"img"}],[{"style":{"width":"63%"},"width":1093,"height":164,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":10.7},"width":48.94,"height":26.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-1.png","element":"img","alt":" xT","inline":true,"padRight":true},{"text":"is the state at the start of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"+ 1","element":"span"},{"text":"th epoch, and ","element":"span"},{"style":{"height":21.89},"width":120.54,"height":54.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-2.png","element":"img","alt":" xss,i+10","inline":true,"padRight":true},{"text":"is the initial state of the steady state response of the system to the inputs played at the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"+ 1","element":"span"},{"text":"th epoch. 
From Lemma ","element":"span"},{"href":"#id-69","text":"D.7","element":"a"},{"text":", since the noise term will be 0, we can deterministically upper bound:","element":"span"}],[{"style":{"width":"34%"},"width":597,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-3.png","element":"img"}],[{"text":"and also:","element":"span"}],[{"style":{"width":"77%"},"width":1332,"height":281,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-4.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"87%"},"width":1508,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-5.png","element":"img"}],[{"text":"it follows then that:","element":"span"}],[{"style":{"width":"91%"},"width":1574,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":19.67},"width":406.53,"height":49.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-7.png","element":"img","alt":"ss(ζ, ki+1, x ¯Ti) ≤ max","inline":true}],[{"style":{"width":"91%"},"width":1574,"height":484,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-8.png","element":"img"}],[{"text":"Finally, we must upper bound ","element":"span"},{"style":{"height":25.19},"width":269.78,"height":62.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-9.png","element":"img","alt":" ki+1w⊤˜Γui+1ki+1w","inline":true},{"text":". 
Upper bounding this over all ","element":"span"},{"style":{"height":15.93},"width":176.22,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-10.png","element":"img","alt":" w ∈ Sd−1","inline":true,"padRight":true},{"text":"is equivalent ","element":"span"},{"text":"to bounding:","element":"span"}],[{"style":{"width":"106%"},"width":1832,"height":500,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/50-11.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Lemma D.9 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"After ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"style":{"fontStyle":"italic"},"text":"epochs, we will have that:","element":"span"}],[{"style":{"width":"16%"},"width":283,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-0.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"After the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch, we will have that:","element":"span"}],[{"style":{"width":"32%"},"width":562,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-1.png","element":"img"}],[{"text":"Solving this for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"gives:","element":"span"}],[{"style":{"width":"22%"},"width":384,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-2.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"58%"},"width":1010,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-3.png","element":"img"}],[{"text":"Noting that ","element":"span"},{"style":{"height":17.6},"width":335.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-4.png","element":"img","alt":" log 2/ log 3 ≈ 
0.63","inline":true},{"text":", we can lower bound this as:","element":"span"}],[{"style":{"width":"67%"},"width":1173,"height":173,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-5.png","element":"img"}]]},{"heading":"Appendix E. Estimation of Linear Dynamical Systems with Periodic Inputs","paragraphs":[[{"id":"id-94","style":{"fontWeight":"bold"},"text":"Theorem E.1 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Full version of Theorem ","element":"span"},{"href":"#id-49","style":{"fontStyle":"italic","fontWeight":"bold"},"text":"2.6","element":"a"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that we start from some initial state ","element":"span"},{"style":{"height":15.02},"width":123.4,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-6.png","element":"img","alt":" x0 and","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"we are playing some input ","element":"span"},{"style":{"height":16.71},"width":374.36,"height":41.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-7.png","element":"img","alt":" ut = ˜ut+ηut where ˜ut ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is deterministic with period ","element":"span"},{"style":{"height":19.13},"width":406.76,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-8.png","element":"img","alt":" k and ηut ∼ N(0, σ2uI).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Then as long as:","element":"span"}],[{"id":"id-98","style":{"width":"71%"},"width":1228,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"we will have 
that:","element":"span"}],[{"id":"id-97","style":{"width":"89%"},"width":1553,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-10.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and if:","element":"span"}],[{"style":{"width":"88%"},"width":1537,"height":23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-11.png","element":"img"}],[{"id":"id-96","style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"height":14.62},"width":125.03,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-12.png","element":"img","alt":" ≥ 2Tss","inline":true}],[{"text":"(19) ","element":"span"},{"style":{"fontStyle":"italic"},"text":"then:","element":"span"}],[{"id":"id-99","style":{"width":"96%"},"width":1667,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-13.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":31.6},"width":963.09,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/51-14.png","element":"img","alt":"¯ΓT = 4�˜ΓuT,0 + Tr(ΓηT )(1 + log 2δ)I�and c, c′, C, C′ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are universal constants.","element":"span"}],[{"text":"Note that ","element":"span"},{"href":"#id-96","style":{"height":31.6},"width":548.56,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-0.png","element":"img","alt":" Tss�110λmin(˜Γuk), k, x0�in (19","inline":true},{"text":") can be replaced with ","element":"span"},{"style":{"height":20.8},"width":546.07,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-1.png","element":"img","alt":" Tss�c′′λmin(Γηk), k, x0�, which","inline":true}],[{"text":"may be helpful if our system is not controllable, in which case it’s possible 
","element":"span"},{"style":{"height":21.33},"width":352.95,"height":53.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-2.png","element":"img","alt":" λmin(˜Γuk) = 0. An","inline":true,"padRight":true},{"text":"example of this argument can be found in the proof of Theorem ","element":"span"},{"href":"#id-46","text":"2.3","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"E.1. Proof of Theorem ","element":"span"},{"href":"#id-49","style":{"fontWeight":"bold"},"text":"2.6 ","element":"a"},{"style":{"fontWeight":"bold"},"text":"and Theorem ","element":"span"},{"href":"#id-94","style":{"fontWeight":"bold"},"text":"E.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Define the following events:","element":"span"}],[{"style":{"width":"88%"},"width":1534,"height":641,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-3.png","element":"img"}],[{"text":"(","element":"span"},{"href":"#id-97","text":"18","element":"a"},{"text":") follows directly from bounding ","element":"span"},{"style":{"height":17.88},"width":104.56,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-4.png","element":"img","alt":" P[Ac1]","inline":true},{"text":". The following clearly holds:","element":"span"}],[{"style":{"width":"72%"},"width":1249,"height":178,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-5.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-65","text":"E.7","element":"a"},{"text":", we will have that ","element":"span"},{"href":"#id-98","style":{"height":17.88},"width":297.4,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-6.png","element":"img","alt":" P[Ec1] ≤ δ. 
If (17","inline":true},{"text":") holds the burn in time required by Lemma ","element":"span"},{"href":"#id-68","text":"E.3 ","element":"a"},{"text":"will be met, so by Lemma ","element":"span"},{"href":"#id-68","text":"E.3","element":"a"},{"text":", ","element":"span"},{"style":{"height":17.88},"width":264.19,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-7.png","element":"img","alt":" P[Ec2 ∩ E1] ≤ δ","inline":true},{"text":". Similarly, by Lemma ","element":"span"},{"href":"#id-71","text":"E.6","element":"a"},{"text":", ","element":"span"},{"style":{"height":17.88},"width":426.2,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-8.png","element":"img","alt":" P[Ec3 ∩ E1 ∩ E2] ≤ δ. To","inline":true,"padRight":true},{"text":"bound ","element":"span"},{"style":{"height":17.88},"width":374.88,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-9.png","element":"img","alt":" P[Ac1 ∩ E1 ∩ E2 ∩ E3]","inline":true},{"text":", note that we can decompose the error of the least squares estimate as:","element":"span"}],[{"style":{"width":"54%"},"width":940,"height":201,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-10.png","element":"img"}],[{"text":"On the event ","element":"span"},{"style":{"height":15.02},"width":220.92,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-11.png","element":"img","alt":" E1 ∩ E2 ∩ E3","inline":true},{"text":", we will have that:","element":"span"}],[{"style":{"width":"68%"},"width":1186,"height":281,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-12.png","element":"img"}],[{"text":"Combining these it follows that on this event:","element":"span"}],[{"style":{"width":"71%"},"width":1230,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/52-13.png","element":"img"}],[{"text":"so 
","element":"span"},{"style":{"height":17.88},"width":456.24,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-0.png","element":"img","alt":" P[Ac1 ∩ E1 ∩ E2 ∩ E3] = 0","inline":true},{"text":". It follows then that ","element":"span"},{"style":{"height":17.88},"width":204.68,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-1.png","element":"img","alt":" P[Ac1] ≤ 3δ","inline":true,"padRight":true},{"text":"which proves (","element":"span"},{"href":"#id-97","text":"18","element":"a"},{"text":"). ","element":"span"},{"text":"To show (","element":"span"},{"href":"#id-99","text":"20","element":"a"},{"text":"), define the following events:","element":"span"}],[{"style":{"width":"98%"},"width":1695,"height":775,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-2.png","element":"img"}],[{"text":"As before, we have that ","element":"span"},{"style":{"height":17.88},"width":185.28,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-3.png","element":"img","alt":" P[Ec1] ≤ δ","inline":true,"padRight":true},{"text":"and, assuming (","element":"span"},{"href":"#id-96","text":"19","element":"a"},{"text":") holds, ","element":"span"},{"href":"#id-96","style":{"height":17.88},"width":418.18,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-4.png","element":"img","alt":" P[E1 ∩ Ec2] ≤ δ. If (19","inline":true},{"text":") holds, by ","element":"span"},{"text":"Corollary ","element":"span"},{"href":"#id-100","text":"E.11 ","element":"a"},{"text":"the burn in condition required by Lemma ","element":"span"},{"href":"#id-79","text":"E.4 ","element":"a"},{"text":"will be met so we will also have that ","element":"span"},{"href":"#id-71","style":{"height":17.88},"width":574.96,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-5.png","element":"img","alt":"P[E1 ∩ Ec4] ≤ δ. 
By Lemma E.6","inline":true,"padRight":true},{"text":"and the error decomposition of ","element":"span"},{"style":{"height":21.21},"width":201.37,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-6.png","element":"img","alt":" ∥ ˆA − A∗∥2","inline":true,"padRight":true},{"text":"used above, we have ","element":"span"},{"text":"that ","element":"span"},{"style":{"height":17.88},"width":1435.08,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-7.png","element":"img","alt":" P[E1 ∩ E2 ∩ E4 ∩ Ec5] ≤ δ and P[Ac2 ∩ E1 ∩ E2 ∩ E4 ∩ E5] = 0. Thus, P[Ac2] ≤ 4δ","inline":true,"padRight":true},{"text":"from which ","element":"span"},{"text":"(","element":"span"},{"href":"#id-99","text":"20","element":"a"},{"text":") follows directly.","element":"span"}],[{"style":{"width":"1%"},"width":30,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"E.2. 
Lower Bounds on Covariates and Self-Normalized Bounds","element":"span"}],[{"text":"The following proposition is crucial to proving a high probability bound on the error in the presence of non-random inputs.","element":"span"}],[{"id":"id-63","style":{"fontWeight":"bold"},"text":"Proposition E.2 ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"(Full version of Proposition ","element":"span"},{"href":"#id-57","style":{"fontStyle":"italic","fontWeight":"bold"},"text":"4.2","element":"a"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider any ","element":"span"},{"style":{"height":17.75},"width":526.07,"height":44.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-9.png","element":"img","alt":" w ∈ Sd−1 and let xt evolve","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"according to the dynamical system (","element":"span"},{"href":"#id-101","style":{"fontStyle":"italic"},"text":"1","element":"a"},{"style":{"fontStyle":"italic"},"text":"). Let ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-10.png","element":"img","alt":" ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be a deterministic periodic signal and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be an integer multiple of its period. 
Let ","element":"span"},{"style":{"height":18.72},"width":85.6,"height":46.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-11.png","element":"img","alt":" xu,sst","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the steady state response of the system to this input and let ","element":"span"},{"style":{"height":22},"width":407.17,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-12.png","element":"img","alt":"α := �k−1t=0 (w⊤xu,sst )2","inline":true},{"style":{"fontStyle":"italic"},"text":". Assume that ","element":"span"},{"style":{"height":14.62},"width":57.16,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-13.png","element":"img","alt":" Tss","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is chosen large enough so that, for any ","element":"span"},{"style":{"height":14.4},"width":126.56,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-14.png","element":"img","alt":" T ≥ 0:","inline":true}],[{"id":"id-103","style":{"width":"73%"},"width":1269,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-15.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where:","element":"span"}],[{"style":{"width":"20%"},"width":348,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Then we will have that:","element":"span"}],[{"id":"id-104","style":{"width":"78%"},"width":1360,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/53-17.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"We first note that, since our system is linear, the output of the system due to the input, 
","element":"span"},{"style":{"height":16.25},"width":57.54,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-0.png","element":"img","alt":" xut ,","inline":true,"padRight":true},{"text":"will contain only the frequencies present in the input, ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-1.png","element":"img","alt":" ut","inline":true},{"text":", with possibly some phase shift. Thus, the period of the periodic part of our output will be identical to that of the input once the system is in steady state.","element":"span"}],[{"text":"Let:","element":"span"}],[{"style":{"width":"92%"},"width":1600,"height":344,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-2.png","element":"img"}],[{"text":"for some ","element":"span"},{"style":{"height":10.62},"width":35.88,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-3.png","element":"img","alt":" c1","inline":true,"padRight":true},{"text":"to be specified, where ","element":"span"},{"text":"I ","element":"span"},{"text":"is the indicator function. Then ","element":"span"},{"style":{"height":31.6},"width":638.94,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-4.png","element":"img","alt":"�ki=1 z2jk+i ≥�c1�ki=1 µ2jk+i�Bj.","inline":true,"padRight":true},{"text":"Let ","element":"span"},{"style":{"height":17.6},"width":323.21,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-5.png","element":"img","alt":" S = ⌊T/k⌋ and c2","inline":true,"padRight":true},{"text":"be some constant to be specified. Then:","element":"span"}],[{"id":"id-102","style":{"width":"98%"},"width":1703,"height":331,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-6.png","element":"img"}],[{"text":"where the last inequality is simply Chernoff’s bound. 
To compute the expectation, we will use the tower property. To do so, it will be convenient to first calculate the conditional expectation of ","element":"span"},{"style":{"height":17.02},"width":61.63,"height":42.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-7.png","element":"img","alt":" Bj.","inline":true,"padRight":true},{"text":"Letting ","element":"span"},{"style":{"height":17.42},"width":46.36,"height":43.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-8.png","element":"img","alt":" Fj","inline":true,"padRight":true},{"text":"denote the ","element":"span"},{"style":{"height":8},"width":25,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-9.png","element":"img","alt":" σ","inline":true},{"text":"-field generated by ","element":"span"},{"style":{"height":13.24},"width":246.98,"height":33.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-10.png","element":"img","alt":" η0, ..., ηTss+jk","inline":true},{"text":", we have that:","element":"span"}],[{"style":{"width":"112%"},"width":1947,"height":461,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-11.png","element":"img"}],[{"text":"where the last equality follows since:","element":"span"}],[{"style":{"width":"49%"},"width":849,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-12.png","element":"img"}],[{"text":"Note that, conditioned on the ","element":"span"},{"style":{"height":22.77},"width":598.79,"height":56.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-13.png","element":"img","alt":" Fj, w⊤Ai∗xηTss+jk and w⊤¯xuTss+jk ","inline":true,"padRight":true},{"text":"are deterministic. 
Further, since ","element":"span"},{"style":{"height":12},"width":33.67,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-14.png","element":"img","alt":" ηt","inline":true,"padRight":true},{"text":"is mean 0, ","element":"span"},{"style":{"height":21.69},"width":483.89,"height":54.22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-15.png","element":"img","alt":" w⊤ �i−1s=0 Ai−s−1∗ ηTss+jk+i","inline":true,"padRight":true},{"text":"will simply be a linear combination of mean 0 Gaussians and so will itself be a mean 0 Gaussian. This implies that ","element":"span"},{"style":{"height":21.69},"width":750.96,"height":54.22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/54-16.png","element":"img","alt":" P[w⊤ �i−1s=0 Ai−s−1∗ ηTss+jk+i ≥ 0] = 1/2.","inline":true}],[{"text":"Since we have constructed ","element":"span"},{"style":{"height":12},"width":38.29,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-0.png","element":"img","alt":" µt","inline":true,"padRight":true},{"text":"in such a way as to be mean zero over a block of length ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":", for any fixed ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":":","element":"span"}],[{"style":{"width":"82%"},"width":1419,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-1.png","element":"img"}],[{"text":"In particular then:","element":"span"}],[{"style":{"width":"82%"},"width":1417,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-2.png","element":"img"}],[{"text":"which implies:","element":"span"}],[{"style":{"width":"102%"},"width":1766,"height":1060,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-3.png","element":"img"}],[{"text":"where the last inequality follows by a reverse Markov inequality which 
states that, for any random variable ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Z ","element":"span"},{"text":"supported in ","element":"span"},{"text":"[0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1] ","element":"span"},{"text":"almost surely and with ","element":"span"},{"style":{"height":17.6},"width":783.7,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-4.png","element":"img","alt":" E[Z] ≥ p ∈ (0, 1), for all t ∈ [0, p], P[Z ≥","inline":true},{"style":{"height":21.91},"width":146.24,"height":54.77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-5.png","element":"img","alt":"t] ≥ (p−t)/(1−t) ","inline":true,"padRight":true},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"). Noting that, since the noise is zero-mean Gaussian, we have:","element":"span"}],[{"style":{"width":"108%"},"width":1870,"height":434,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/55-6.png","element":"img"}],[{"style":{"width":"71%"},"width":1244,"height":281,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-0.png","element":"img"}],[{"text":"From this ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"follows by simple manipulations. 
Since we can choose ","element":"span"},{"style":{"height":10.62},"width":35.88,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-1.png","element":"img","alt":" c1","inline":true,"padRight":true},{"text":"as we wish, we set it equal to ","element":"span"},{"style":{"height":17.6},"width":161.63,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-2.png","element":"img","alt":" c1 = 1/4","inline":true,"padRight":true},{"text":"and conclude that:","element":"span"}],[{"style":{"height":35.78},"width":249.94,"height":89.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-3.png","element":"img","alt":"E[Bj|Fj] ≥ 1/3","inline":true,"padRight":true},{"text":"Returning to (","element":"span"},{"href":"#id-102","text":"23","element":"a"},{"text":"), we can now use this result to bound the expectation. Note that:","element":"span"}],[{"style":{"width":"99%"},"width":1718,"height":332,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-4.png","element":"img"}],[{"text":"Then by what we just proved and applying Hoeffding’s Lemma, since ","element":"span"},{"style":{"height":15.2},"width":281.92,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-5.png","element":"img","alt":" λ < 0, we have:","inline":true}],[{"style":{"width":"113%"},"width":1953,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-6.png","element":"img"}],[{"text":"Repeating this procedure, conditioning on each ","element":"span"},{"style":{"height":16.4},"width":193.82,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-7.png","element":"img","alt":" Fi, we get:","inline":true}],[{"style":{"width":"103%"},"width":1788,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-8.png","element":"img"}],[{"text":"and 
so:","element":"span"}],[{"style":{"width":"112%"},"width":1951,"height":733,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/56-9.png","element":"img"}],[{"text":"where the final inequality follows from choosing the optimal ","element":"span"},{"style":{"height":13.2},"width":109.3,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-0.png","element":"img","alt":" λ < 0","inline":true,"padRight":true},{"text":"(and assuming ","element":"span"},{"style":{"height":10.62},"width":35.88,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-1.png","element":"img","alt":" c2","inline":true,"padRight":true},{"text":"is chosen such that ","element":"span"},{"style":{"height":17.6},"width":169.01,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-2.png","element":"img","alt":" c1/3 − c2","inline":true,"padRight":true},{"text":"is positive) and the final equality uses ","element":"span"},{"style":{"height":19.41},"width":395.82,"height":48.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-3.png","element":"img","alt":" C = 2(c1/3 − c2)^2/c1^2","inline":true},{"text":". By our assumption on ","element":"span"},{"text":"the power (","element":"span"},{"href":"#id-103","text":"21","element":"a"},{"text":"), we will have that ","element":"span"},{"style":{"height":24.56},"width":916.19,"height":61.39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-4.png","element":"img","alt":"Σ_{i=1}^k µ_{jk+i}^2 = α + α_j for some |α_j| ≤ α/10. Thus:","inline":true}],[{"style":{"width":"85%"},"width":1478,"height":824,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-5.png","element":"img"}],[{"text":"It remains then to write this in the form of (","element":"span"},{"href":"#id-104","text":"22","element":"a"},{"text":"). 
Plugging in our definitions of ","element":"span"},{"style":{"height":16.4},"width":159.92,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-6.png","element":"img","alt":" µt and zt","inline":true},{"text":", we have that the above is equivalent to:","element":"span"}],[{"style":{"width":"89%"},"width":1544,"height":463,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-7.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"holds by our assumption on the power (","element":"span"},{"href":"#id-103","text":"21","element":"a"},{"text":") and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"follows by Parseval’s Theorem. Choosing ","element":"span"},{"style":{"height":10.62},"width":35.88,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-8.png","element":"img","alt":" c2","inline":true,"padRight":true},{"text":"to balance the constants, we get that:","element":"span"}],[{"style":{"width":"57%"},"width":989,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-9.png","element":"img"}],[{"text":"which completes the proof.","element":"span"}],[{"id":"id-68","style":{"fontWeight":"bold"},"text":"Lemma E.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that our system is driven by some input ","element":"span"},{"style":{"height":16.72},"width":382.08,"height":41.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-10.png","element":"img","alt":" ut = ˜ut+ηut where ˜ut ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is deterministic ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and 
","element":"span"},{"style":{"height":19.13},"width":289.58,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-11.png","element":"img","alt":" ηut ∼ N(0, σ2uI)","inline":true},{"style":{"fontStyle":"italic"},"text":". Then on the event that:","element":"span"}],[{"style":{"width":"17%"},"width":302,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/57-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":17.51},"width":51.27,"height":43.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-0.png","element":"img","alt":"¯ΓT","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":", choosing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"so that:","element":"span"}],[{"id":"id-105","style":{"width":"80%"},"width":1392,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-1.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"we will have with probability less than ","element":"span"},{"style":{"height":12.8},"width":36.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-2.png","element":"img","alt":" δ:","inline":true}],[{"style":{"width":"23%"},"width":414,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-3.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Take some ","element":"span"},{"style":{"height":15.2},"width":209.42,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-4.png","element":"img","alt":" s ≥ 0, then:","inline":true}],[{"style":{"width":"70%"},"width":1223,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-5.png","element":"img"}],[{"text":"where 
","element":"span"},{"style":{"height":17.45},"width":78.95,"height":43.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-6.png","element":"img","alt":" xus+t ","inline":true,"padRight":true},{"text":"is the state obtained by driving the system with the input in the absence of noise, ","element":"span"},{"text":"which is deterministic conditioned on ","element":"span"},{"style":{"height":15.02},"width":47.36,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-7.png","element":"img","alt":" Fs","inline":true},{"text":". Given this, we have that ","element":"span"},{"style":{"height":11.82},"width":78.95,"height":29.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-8.png","element":"img","alt":" xs+t","inline":true,"padRight":true},{"text":"satisfies the ","element":"span"},{"style":{"height":19.13},"width":220.53,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-9.png","element":"img","alt":" (2k, σ2Γk +","inline":true},{"style":{"height":22.04},"width":242.29,"height":55.1,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-10.png","element":"img","alt":"σ2uΓB∗k , 3/20)","inline":true},{"text":"-BMSB condition, as defined in ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"). The proof of this closely ","element":"span"},{"text":"mirrors the proof of Proposition 3.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"). 
The primary difference is that the mean of ","element":"span"},{"style":{"height":17.6},"width":201.41,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-11.png","element":"img","alt":" w⊤xs+t|Fs","inline":true,"padRight":true},{"text":"differs from that of the signal considered in ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), but this does not affect the argument, so we omit the proof here. We can then apply Proposition 2.5 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") to get that:","element":"span"}],[{"style":{"width":"52%"},"width":910,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-12.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"20","element":"span"},{"text":". Following the proof of Theorem 2.4 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"be a 1/4-net in the norm ","element":"span"},{"style":{"height":20.8},"width":730.78,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-13.png","element":"img","alt":" T ¯ΓT of {w : k⌊T/k⌋p2w⊤Γηkw/8 = 1}","inline":true},{"text":". By Lemma 4.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), ","element":"span"},{"style":{"height":22.97},"width":710.39,"height":57.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-14.png","element":"img","alt":" |T | ≤ 2d log(10/p) + log det(¯ΓT Γηk−1)","inline":true},{"text":". Then by Lemma 4.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") we have:","element":"span"}],[{"style":{"width":"72%"},"width":1259,"height":627,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-15.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.6},"width":397.88,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-16.png","element":"img","alt":" (a) holds if T ≥ 4k","inline":true},{"text":", which is true by (","element":"span"},{"href":"#id-105","text":"25","element":"a"},{"text":"), and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"holds by (","element":"span"},{"href":"#id-105","text":"25","element":"a"},{"text":"). 
","element":"span"},{"text":"Lower bounding","element":"span"}],[{"id":"id-79","style":{"width":"99%"},"width":1724,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/58-17.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Lemma E.4 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-0.png","element":"img","alt":" ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be a deterministic input with period ","element":"span"},{"style":{"height":15.02},"width":215.67,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-1.png","element":"img","alt":" k and let Tss","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the time such that condition (","element":"span"},{"href":"#id-103","style":{"fontStyle":"italic"},"text":"21","element":"a"},{"style":{"fontStyle":"italic"},"text":") in Proposition ","element":"span"},{"href":"#id-63","style":{"fontStyle":"italic"},"text":"E.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is met for all ","element":"span"},{"style":{"height":15.93},"width":176.22,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-2.png","element":"img","alt":" w ∈ Sd−1","inline":true},{"style":{"fontStyle":"italic"},"text":". 
On the event that:","element":"span"}],[{"style":{"width":"17%"},"width":302,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":17.51},"width":51.27,"height":43.78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-4.png","element":"img","alt":"¯ΓT","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":", then as long as:","element":"span"}],[{"id":"id-106","style":{"width":"99%"},"width":1726,"height":312,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"The proof of this follows ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") closely but replacing Proposition 2.5 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") with our Proposition ","element":"span"},{"href":"#id-63","text":"E.2","element":"a"},{"text":". By Proposition ","element":"span"},{"href":"#id-63","text":"E.2 ","element":"a"},{"text":"we will have that:","element":"span"}],[{"style":{"width":"101%"},"width":1753,"height":207,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-6.png","element":"img"}],[{"text":"Following the proof of Theorem 2.4 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"be a 1/4-net in the norm ","element":"span"},{"style":{"height":32},"width":746.12,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-7.png","element":"img","alt":"T ¯ΓT of�w : 2k⌊T/k⌋w⊤˜Γukw/81 = 1�","inline":true},{"text":". By Lemma D.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), we have","element":"span"}],[{"text":"that ","element":"span"},{"style":{"height":21.33},"width":732.9,"height":53.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-8.png","element":"img","alt":" |T | ≤ 2d log(45/2)+log det(¯ΓT (˜Γuk)−1)","inline":true},{"text":". Then by Lemma 4.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") we ","element":"span"},{"text":"have:","element":"span"}],[{"style":{"width":"81%"},"width":1413,"height":782,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-9.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"holds so long as ","element":"span"},{"style":{"height":15.02},"width":255.44,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-10.png","element":"img","alt":" T ≥ Tss + 4k","inline":true},{"text":", which will be true by (","element":"span"},{"href":"#id-106","text":"26","element":"a"},{"text":"), and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"holds by (","element":"span"},{"href":"#id-106","text":"26","element":"a"},{"text":"). 
The following holds by ","element":"span"},{"href":"#id-106","style":{"height":16.4},"width":471.87,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-11.png","element":"img","alt":" T ≥ Tss + 4k and by (26):","inline":true}],[{"style":{"width":"45%"},"width":778,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/59-12.png","element":"img"}],[{"text":"which completes the proof.","element":"span"}],[{"id":"id-70","style":{"fontWeight":"bold"},"text":"Corollary E.5 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let:","element":"span"}],[{"style":{"width":"51%"},"width":890,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":14.62},"width":291.72,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-1.png","element":"img","alt":" M ⪰ 0. Let ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be a deterministic input with period ","element":"span"},{"style":{"height":15.02},"width":237.29,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-2.png","element":"img","alt":" k and let Tss","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the time such that condition (","element":"span"},{"href":"#id-103","style":{"fontStyle":"italic"},"text":"21","element":"a"},{"style":{"fontStyle":"italic"},"text":") in Proposition ","element":"span"},{"href":"#id-63","style":{"fontStyle":"italic"},"text":"E.2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is met for all ","element":"span"},{"style":{"height":12.8},"width":131.78,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-3.png","element":"img","alt":" w ∈ W","inline":true},{"style":{"fontStyle":"italic"},"text":". 
On the event that:","element":"span"}],[{"style":{"width":"17%"},"width":302,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":17.52},"width":51.27,"height":43.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-5.png","element":"img","alt":"¯ΓT","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":", then as long as:","element":"span"}],[{"id":"id-108","style":{"width":"87%"},"width":1516,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"with probability less than ","element":"span"},{"style":{"height":12.8},"width":36.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-7.png","element":"img","alt":" δ:","inline":true}],[{"style":{"width":"35%"},"width":606,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"The proof of this result is very similar to that of Lemma ","element":"span"},{"href":"#id-79","text":"E.4","element":"a"},{"text":". 
For any ","element":"span"},{"style":{"height":15.93},"width":301.99,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-9.png","element":"img","alt":" w ∈ Sd−1 ∩ Wc:","inline":true}],[{"id":"id-107","style":{"width":"83%"},"width":1446,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-10.png","element":"img"}],[{"text":"For any ","element":"span"},{"style":{"height":12.8},"width":131.77,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-11.png","element":"img","alt":" w ∈ W","inline":true},{"text":", by Proposition ","element":"span"},{"href":"#id-63","text":"E.2","element":"a"},{"text":", given the definition of ","element":"span"},{"style":{"height":14.62},"width":57.16,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-12.png","element":"img","alt":" Tss","inline":true},{"text":", we will have that:","element":"span"}],[{"style":{"width":"101%"},"width":1753,"height":207,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-13.png","element":"img"}],[{"text":"Following the proof of Theorem 2.4 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"be a 1/4-net in the norm ","element":"span"},{"style":{"height":32},"width":828.47,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-14.png","element":"img","alt":"T ¯ΓT +M of {w : 2k⌊T/k⌋w⊤˜Γukw/81 = 1}","inline":true},{"text":". By Lemma D.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. 
","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":"), ","element":"span"},{"style":{"height":17.6},"width":105.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-15.png","element":"img","alt":" |T | ≤","inline":true}],[{"style":{"height":22.03},"width":795.14,"height":55.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-16.png","element":"img","alt":"2d log(45/2) + log det((¯ΓT + 1T M)(˜Γuk)−1)","inline":true},{"text":". Then by Lemma 4.1 of ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"Simchowitz et al. ","element":"a"},{"text":"(","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"2018","element":"a"},{"text":") we ","element":"span"},{"text":"have:","element":"span"}],[{"style":{"width":"84%"},"width":1466,"height":284,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/60-17.png","element":"img"}],[{"style":{"width":"96%"},"width":1673,"height":593,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"holds by (","element":"span"},{"href":"#id-107","text":"28","element":"a"},{"text":") and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"b","element":"span"},{"text":") ","element":"span"},{"text":"and the final inequalities hold so long as ","element":"span"},{"href":"#id-108","style":{"height":15.6},"width":419.87,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-1.png","element":"img","alt":" T ≥ Tss + 4k and (27)","inline":true,"padRight":true},{"text":"holds, since in that case we will have that 
","element":"span"},{"style":{"height":17.6},"width":839.99,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-2.png","element":"img","alt":" k⌊(T − Tss)/k⌋/81 ≥ (T − Tss)/54 ≥ T/108.","inline":true}],[{"id":"id-71","style":{"fontWeight":"bold"},"text":"Lemma E.6 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":10.62},"width":36.94,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-3.png","element":"img","alt":" xt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is generated from some input ","element":"span"},{"style":{"height":16.72},"width":666.17,"height":41.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-4.png","element":"img","alt":" ut = ˜ut + ηut where ˜ut is Ft−1 mea-","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"surable and ","element":"span"},{"style":{"height":19.13},"width":295.92,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-5.png","element":"img","alt":" ηut ∼ N(0, σ2uI)","inline":true},{"style":{"fontStyle":"italic"},"text":". 
On the event that ","element":"span"},{"style":{"height":22},"width":437.58,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-6.png","element":"img","alt":" V+ ⪰ ∑Tt=1 xtx⊤t ⪰ V−","inline":true},{"style":{"fontStyle":"italic"},"text":", we will have that, with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"probability less than ","element":"span"},{"style":{"height":12.8},"width":36.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-7.png","element":"img","alt":" δ:","inline":true}],[{"style":{"width":"85%"},"width":1479,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Note that Proposition 8.2 of ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":") applies even when ","element":"span"},{"style":{"height":10.62},"width":36.94,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-9.png","element":"img","alt":" xt","inline":true,"padRight":true},{"text":"is driven by an input ","element":"span"},{"style":{"height":14.22},"width":36.98,"height":35.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-10.png","element":"img","alt":" ˜ut","inline":true,"padRight":true},{"text":"that changes over time, since we choose ","element":"span"},{"style":{"height":15.02},"width":233.27,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-11.png","element":"img","alt":" ˜ut to be Ft−1","inline":true,"padRight":true},{"text":"measurable, so ","element":"span"},{"style":{"height":15.02},"width":252.69,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-12.png","element":"img","alt":" xt 
is still Ft−1","inline":true,"padRight":true},{"text":"measurable. Therefore, for any deterministic ","element":"span"},{"style":{"height":12.8},"width":127.15,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-13.png","element":"img","alt":" V ≻ 0:","inline":true}],[{"style":{"width":"101%"},"width":1760,"height":800,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-14.png","element":"img"}],[{"text":"with probability less than ","element":"span"},{"style":{"height":12.8},"width":32.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/61-15.png","element":"img","alt":" δ.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"E.3. Upper Bounds on Covariates","element":"span"}],[{"id":"id-65","style":{"fontWeight":"bold"},"text":"Lemma E.7 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume ","element":"span"},{"style":{"height":16.25},"width":233.02,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-0.png","element":"img","alt":" ut = ˜ut + ηut ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for some deterministic ","element":"span"},{"style":{"height":19.13},"width":350.59,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-1.png","element":"img","alt":" ˜ut, ηut ∼ N(0, σ2uI)","inline":true},{"style":{"fontStyle":"italic"},"text":", and for any initial ","element":"span"},{"style":{"fontStyle":"italic"},"text":"state, then with probability at least ","element":"span"},{"style":{"height":12.8},"width":111.2,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-2.png","element":"img","alt":" 1 − δ:","inline":true}],[{"style":{"width":"63%"},"width":1105,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and for any 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"w","element":"span"},{"style":{"fontStyle":"italic"},"text":", with probability at least ","element":"span"},{"style":{"height":12.8},"width":111.2,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-4.png","element":"img","alt":" 1 − δ:","inline":true}],[{"style":{"width":"68%"},"width":1192,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"We note that:","element":"span"}],[{"style":{"width":"55%"},"width":952,"height":282,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-6.png","element":"img"}],[{"text":"Where here we let ","element":"span"},{"style":{"height":19.05},"width":44.94,"height":47.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-7.png","element":"img","alt":" x˜ut ","inline":true,"padRight":true},{"text":"denote the response of the system to the deterministic part of the input and ","element":"span"},{"style":{"height":22.02},"width":61.08,"height":55.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-8.png","element":"img","alt":"xηut","inline":true,"padRight":true},{"text":"the response due to the random part of the input. The term ","element":"span"},{"style":{"height":22},"width":219.93,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-9.png","element":"img","alt":"�Tt=1 x˜ut x˜ut⊤","inline":true,"padRight":true},{"text":"is then deterministic. 
Following Proposition 8.4 of ","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"Sarkar and Rakhlin ","element":"a"},{"text":"(","element":"span"},{"href":"#id-1","referenceIndex":35,"text":"2018","element":"a"},{"text":"), we can bound the second and third terms each with probability ","element":"span"},{"style":{"height":17.6},"width":199.1,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-10.png","element":"img","alt":" 1 − δ/2 as:","inline":true}],[{"style":{"width":"82%"},"width":1433,"height":321,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-11.png","element":"img"}],[{"text":"Combining these bounds gives the first inequality. For the second inequality, following the same argument as in the proof of Proposition 8.4 of","element":"span"}],[{"style":{"width":"70%"},"width":1225,"height":219,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-12.png","element":"img"}],[{"text":"combining this with the above gives the result.","element":"span"}],[{"id":"id-122","style":{"fontWeight":"bold"},"text":"Lemma E.8 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that the input ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-13.png","element":"img","alt":" ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies, for some ","element":"span"},{"style":{"height":16},"width":300.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-14.png","element":"img","alt":" k and any s ≥ 
0:","inline":true}],[{"style":{"width":"21%"},"width":369,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/62-15.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"then:","element":"span"}],[{"style":{"width":"112%"},"width":1944,"height":1475,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"uses Lemma ","element":"span"},{"href":"#id-109","text":"E.12","element":"a"},{"text":". Since ","element":"span"},{"style":{"height":17.77},"width":586,"height":44.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-1.png","element":"img","alt":" ∥x∥1 ≤ √n∥x∥2 for any x ∈ Rn","inline":true},{"text":", we will have, by Parseval’s Theorem and our assumption on ","element":"span"},{"style":{"height":11.02},"width":51.2,"height":27.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-2.png","element":"img","alt":" ut:","inline":true}],[{"style":{"width":"87%"},"width":1513,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-3.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"96%"},"width":1668,"height":260,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-4.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"56%"},"width":970,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/63-5.png","element":"img"}],[{"style":{"width":"87%"},"width":1512,"height":486,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-0.png","element":"img"}],[{"id":"id-115","style":{"fontWeight":"bold"},"text":"Lemma E.9 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that we are running Algorithm 
","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"1 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"and that we started from initial condition ","element":"span"},{"style":{"height":17.35},"width":516.11,"height":43.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-1.png","element":"img","alt":"x0 = 0. Let A∗ = PJP −1 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the Jordan decomposition of ","element":"span"},{"style":{"height":15.42},"width":49.72,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-2.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and consider some ","element":"span"},{"style":{"height":15.93},"width":192.91,"height":39.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-3.png","element":"img","alt":" w ∈ Sd−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that ","element":"span"},{"style":{"height":19.95},"width":381.78,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-4.png","element":"img","alt":" ∥w⊤Pn(j):n(j)∥2 = 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"except for ","element":"span"},{"style":{"height":17.6},"width":502.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-5.png","element":"img","alt":" j = ℓ. 
Here n(j) and n(j)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the start and stop indices of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"style":{"fontStyle":"italic"},"text":"th Jordan block (so in particular, if ","element":"span"},{"style":{"height":17.42},"width":183.71,"height":43.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-6.png","element":"img","alt":" Jj is the j","inline":true},{"style":{"fontStyle":"italic"},"text":"th Jordan block, we have that ","element":"span"},{"style":{"height":17.02},"width":92.68,"height":42.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-7.png","element":"img","alt":" Jj =","inline":true},{"style":{"height":19.95},"width":330.43,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-8.png","element":"img","alt":"[J]n(j):n(j),n(j):n(j)","inline":true},{"style":{"fontStyle":"italic"},"text":"). Assume that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is chosen to be within epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
Then, after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"fontStyle":"italic"},"text":"steps:","element":"span"}],[{"style":{"width":"96%"},"width":1676,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-9.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Adopting the notation used in Algorithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":", let ","element":"span"},{"style":{"height":14.62},"width":37.5,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-10.png","element":"img","alt":" T_i","inline":true,"padRight":true},{"text":"denote the length of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch. Let ","element":"span"},{"style":{"height":24},"width":251.85,"height":60.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-11.png","element":"img","alt":"T̄_i = Σ_{j=0}^{i−1} T_j","inline":true,"padRight":true},{"text":"be the start time of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th epoch.","element":"span"}],[{"text":"Following the analysis used in Section ","element":"span"},{"href":"#id-110","text":"E.4","element":"a"},{"text":", we can break up the response into its steady state and transient components and write:","element":"span"}],[{"style":{"width":"31%"},"width":545,"height":59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-12.png","element":"img"}],[{"text":"for ","element":"span"},{"style":{"height":19.44},"width":565.69,"height":48.59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-13.png","element":"img","alt":" t ∈ [T̄_i + 1, T̄_i + T_i], where x_t^{ss_i}","inline":true,"padRight":true},{"text":"denotes the steady state response of the system at time 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"to the inputs used at epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":". We then have:","element":"span"}],[{"style":{"width":"78%"},"width":1356,"height":438,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-14.png","element":"img"}],[{"text":"Note that:","element":"span"}],[{"style":{"width":"55%"},"width":959,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-15.png","element":"img"}],[{"text":"where, relying on the periodicity of ","element":"span"},{"style":{"height":17.9},"width":435.74,"height":44.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-16.png","element":"img","alt":" us, we let us = us%ki+ki","inline":true,"padRight":true},{"text":"for negative ","element":"span"},{"style":{"fontStyle":"italic"},"text":"s","element":"span"},{"text":". So:","element":"span"}],[{"style":{"width":"74%"},"width":1291,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/64-17.png","element":"img"}],[{"style":{"width":"62%"},"width":1074,"height":877,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/65-0.png","element":"img"}],[{"text":"Repeating this calculation:","element":"span"}],[{"style":{"width":"98%"},"width":1698,"height":1477,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/65-1.png","element":"img"}],[{"text":"where the last inequality follows since ","element":"span"},{"style":{"height":17.42},"width":201.84,"height":43.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-0.png","element":"img","alt":" kj = 2kj−1","inline":true},{"text":". 
Therefore:","element":"span"}],[{"style":{"width":"69%"},"width":1205,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-1.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"69%"},"width":1199,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-2.png","element":"img"}],[{"text":"Finally, by Parseval’s Theorem:","element":"span"}],[{"style":{"width":"57%"},"width":998,"height":232,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-3.png","element":"img"}],[{"text":"Combining this, we have:","element":"span"}],[{"style":{"width":"104%"},"width":1810,"height":622,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-4.png","element":"img"}],[{"id":"id-110","style":{"fontWeight":"bold"},"text":"E.4. Transients","element":"span"}],[{"text":"Consider the response of a system to a deterministic, periodic, zero-mean input ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-5.png","element":"img","alt":" ut","inline":true,"padRight":true},{"text":"starting from some initial state ","element":"span"},{"style":{"height":16.61},"width":220.06,"height":41.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-6.png","element":"img","alt":" xu0 at t = 0","inline":true,"padRight":true},{"text":"(here the mean is taken over a full period). 
We can break up the ","element":"span"},{"text":"response into the steady state response, ","element":"span"},{"style":{"height":16.25},"width":56.59,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-7.png","element":"img","alt":" x_t^{ss} ","inline":true,"padRight":true},{"text":", and the transient response, ","element":"span"},{"style":{"height":18.65},"width":407.77,"height":46.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-8.png","element":"img","alt":" x_t^{tr}: x_t^u = x_t^{ss} + x_t^{tr}.","inline":true,"padRight":true},{"text":"Precisely, ","element":"span"},{"style":{"height":16.25},"width":56.6,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-9.png","element":"img","alt":" x_t^{ss} ","inline":true,"padRight":true},{"text":"is the response of the system if the input ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-10.png","element":"img","alt":" u_t ","inline":true,"padRight":true},{"text":"has been on for all time in the past and, ","element":"span"},{"text":"to attain the desired response, we can set:","element":"span"}],[{"style":{"width":"60%"},"width":1037,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/66-11.png","element":"img"}],[{"text":"With these definitions, we will have:","element":"span"}],[{"style":{"width":"52%"},"width":913,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-0.png","element":"img"}],[{"text":"Assuming that ","element":"span"},{"style":{"height":17.6},"width":188.34,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-1.png","element":"img","alt":" ρ(A∗) < 1","inline":true},{"text":", we will have that 
","element":"span"},{"style":{"height":17.6},"width":461.55,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-2.png","element":"img","alt":" limt→∞ ∥xut − xsst ∥2 = 0.","inline":true,"padRight":true},{"text":"Take ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"to be an integer multiple of the period of the input and note that, by linearity, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"will also be an integer multiple of the period of ","element":"span"},{"style":{"height":16.25},"width":69.26,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-3.png","element":"img","alt":" xsst .","inline":true}],[{"id":"id-95","style":{"fontWeight":"bold"},"text":"Lemma E.10 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Using the definitions above, let:","element":"span"}],[{"style":{"width":"84%"},"width":1456,"height":292,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Then if ","element":"span"},{"style":{"height":17.88},"width":325.44,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-5.png","element":"img","alt":" T ′ ≥ Tss(ζ, k, xu0)","inline":true},{"style":{"fontStyle":"italic"},"text":", we will have that:","element":"span"}],[{"style":{"width":"111%"},"width":1933,"height":1319,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"≤ ∥","element":"span"},{"style":{"height":20.01},"width":196.96,"height":50.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-7.png","element":"img","alt":"xu0 − xss0 
∥22","inline":true}],[{"style":{"width":"49%"},"width":861,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/67-8.png","element":"img"}],[{"style":{"width":"103%"},"width":1792,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-0.png","element":"img"}],[{"style":{"height":54.7},"width":1336.44,"height":136.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-1.png","element":"img","alt":"≤ ∥xu0 − xss0 ∥22β(A∗)2¯ρ(A∗)2T ′1 − ¯ρ(A∗)2 +2∥xu0 − xss0 ∥2β(A∗)�kw⊤˜Γukw¯ρ(A∗)T ′","inline":true}],[{"style":{"width":"12%"},"width":213,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-2.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"holds by our assumption on ","element":"span"},{"style":{"height":12.4},"width":53.74,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-3.png","element":"img","alt":" T ′.","inline":true}],[{"id":"id-100","style":{"fontWeight":"bold"},"text":"Corollary E.11 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under the same assumptions as Lemma ","element":"span"},{"href":"#id-95","style":{"fontStyle":"italic"},"text":"E.10","element":"a"},{"style":{"fontStyle":"italic"},"text":", we will have that:","element":"span"}],[{"style":{"width":"46%"},"width":809,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where:","element":"span"}],[{"style":{"width":"99%"},"width":1714,"height":557,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-5.png","element":"img"}],[{"text":"Since, by assumption 
","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-6.png","element":"img","alt":" ut","inline":true,"padRight":true},{"text":"is zero-mean, it follows that ","element":"span"},{"style":{"height":16.25},"width":56.59,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-7.png","element":"img","alt":" xsst","inline":true,"padRight":true},{"text":"is zero-mean. Thus, the only non-zero mean component of ","element":"span"},{"style":{"height":16.25},"width":44.94,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-8.png","element":"img","alt":" xut ","inline":true,"padRight":true},{"text":"is that due to the transient so:","element":"span"}],[{"style":{"width":"36%"},"width":634,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-9.png","element":"img"}],[{"text":"from which it follows that ","element":"span"},{"style":{"height":19.01},"width":461.4,"height":47.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-10.png","element":"img","alt":" w⊤At∗(xu0 − xss0 ) − w⊤¯xu","inline":true,"padRight":true},{"text":"is a zero-mean signal. 
Denoting by ","element":"span"},{"style":{"height":19.53},"width":223.28,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-11.png","element":"img","alt":" X^{tr}(e^{jθ}) the","inline":true,"padRight":true},{"text":"DFT of ","element":"span"},{"style":{"height":18.65},"width":540.77,"height":46.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-12.png","element":"img","alt":" x_t^{tr} over t = T′, ..., T′ + k − 1","inline":true},{"text":", by Parseval’s Theorem, we will have that:","element":"span"}],[{"style":{"width":"76%"},"width":1316,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-13.png","element":"img"}],[{"text":"where, crucially, since ","element":"span"},{"style":{"height":19.01},"width":429.04,"height":47.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-14.png","element":"img","alt":" w^⊤A_∗^t(x_0^u − x_0^{ss}) − w^⊤x̄^u","inline":true,"padRight":true},{"text":"is zero-mean, we only sum over frequencies starting ","element":"span"},{"text":"at ","element":"span"},{"style":{"height":21.29},"width":121.6,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-15.png","element":"img","alt":" θ = 2π/k ","inline":true,"padRight":true},{"text":"(that is, we do not sum over the DC component). 
Thus:","element":"span"}],[{"style":{"width":"76%"},"width":1316,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/68-16.png","element":"img"}],[{"style":{"width":"36%"},"width":635,"height":289,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-0.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"98%"},"width":1696,"height":395,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-1.png","element":"img"}],[{"text":"where the last inequality follows since we have assumed Lemma ","element":"span"},{"href":"#id-95","text":"E.10 ","element":"a"},{"text":"holds.","element":"span"}],[{"id":"id-109","style":{"fontWeight":"bold"},"text":"Lemma E.12 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that the input ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-2.png","element":"img","alt":" ut","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies, for some ","element":"span"},{"style":{"height":16},"width":300.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-3.png","element":"img","alt":" k and any s ≥ 0:","inline":true}],[{"style":{"width":"21%"},"width":369,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"then:","element":"span"}],[{"style":{"width":"58%"},"width":1013,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-5.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":19.53},"width":128.14,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-6.png","element":"img","alt":" X(ejθ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denotes the response 
of the noiseless system running for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"style":{"fontStyle":"italic"},"text":"steps when the input ","element":"span"},{"style":{"height":19.53},"width":123.11,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-7.png","element":"img","alt":"U(ejθ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is applied.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Note that:","element":"span"}],[{"style":{"width":"84%"},"width":1457,"height":285,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-8.png","element":"img"}],[{"text":"and:","element":"span"}],[{"style":{"width":"78%"},"width":1351,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-9.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"113%"},"width":1962,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/69-10.png","element":"img"}],[{"style":{"width":"58%"},"width":1018,"height":1035,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-0.png","element":"img"}],[{"text":"where the last inequality follows from the proof of Lemma ","element":"span"},{"href":"#id-69","text":"D.7","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"1%"},"width":30,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-1.png","element":"img"}]]},{"heading":"Appendix F. Optimal Design Perturbation Bounds","paragraphs":[[{"text":"Throughout this section we assume we are running Algorithm ","element":"span"},{"href":"#id-37","text":"1 ","element":"a"},{"text":"and that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is the elapsed time after ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"epochs. 
We will let ","element":"span"},{"style":{"height":15.02},"width":116.99,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-2.png","element":"img","alt":" k = k_i","inline":true,"padRight":true},{"text":"to simplify expressions. We will also often simplify notation by writing ","element":"span"},{"style":{"height":24.91},"width":531.8,"height":62.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-3.png","element":"img","alt":"θ_i := 2πi/k and U_i := U(e^{j2πi/k}).","inline":true}],[{"text":"Let:","element":"span"}],[{"style":{"width":"101%"},"width":1756,"height":479,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":18.33},"width":42.02,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-5.png","element":"img","alt":" γ^2 ","inline":true,"padRight":true},{"text":"is simply some value constraining the power of our input signal and ","element":"span"},{"style":{"height":20.33},"width":339.06,"height":50.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-6.png","element":"img","alt":" U(e^{j2πℓ/k}) denotes","inline":true,"padRight":true},{"text":"the DFT of ","element":"span"},{"style":{"height":11.2},"width":162.04,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-7.png","element":"img","alt":" u_1, ..., u_k","inline":true},{"text":", the time domain signal. 
Note that the normalization ","element":"span"},{"style":{"height":23.32},"width":411.85,"height":58.3,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-8.png","element":"img","alt":"(2T + T_0)/k^2 of Σ_{t=1}^T x_t x_t^⊤ is","inline":true,"padRight":true},{"text":"due to the fact that, by Parseval’s Theorem:","element":"span"}],[{"style":{"width":"50%"},"width":873,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/70-9.png","element":"img"}],[{"text":"assuming that ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-0.png","element":"img","alt":" u_t","inline":true,"padRight":true},{"text":"has period ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"and that we are in steady state. Further, by the update rule of Algorithm ","element":"span"},{"href":"#id-37","text":"1","element":"a"},{"text":", ","element":"span"},{"style":{"height":22.46},"width":1162.9,"height":56.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-1.png","element":"img","alt":" T = Σ_{ℓ=0}^{i−1} 3^ℓ T_0 = (1/2)(3^i − 1)T_0 = (1/2)T_i − (1/2)T_0, so T_i = 2T + T_0","inline":true},{"text":", which is the expected ","element":"span"},{"text":"amount of time we will play these inputs for.","element":"span"}],[{"text":"It is worth noting that the constraint ","element":"span"},{"style":{"height":22},"width":688.07,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-2.png","element":"img","alt":"Σ_{ℓ=1}^k U(e^{j2πℓ/k})^H U(e^{j2πℓ/k}) ≤ k^2 γ^2","inline":true,"padRight":true},{"text":"is equivalent, by ","element":"span"},{"text":"Parseval’s Theorem, to the constraint:","element":"span"}],[{"style":{"width":"17%"},"width":299,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-3.png","element":"img"}],[{"text":"We will denote the optimal set of inputs on the true system as 
","element":"span"},{"style":{"height":12.73},"width":41.98,"height":31.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-4.png","element":"img","alt":" u∗ ","inline":true,"padRight":true},{"text":"and the optimal set of inputs on the estimated system as ","element":"span"},{"style":{"height":15.6},"width":202.7,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-5.png","element":"img","alt":" ˆu (that is, ˆu","inline":true,"padRight":true},{"text":"is the solution to ","element":"span"},{"style":{"height":21.49},"width":662.96,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-6.png","element":"img","alt":" OptInputk( ˆA, B∗, γ2, I, {xt}Tt=1)).","inline":true,"padRight":true},{"text":"Our main perturbation result is as follows.","element":"span"}],[{"id":"id-66","style":{"fontWeight":"bold"},"text":"Theorem F.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Full version of Theorem ","element":"span"},{"href":"#id-50","style":{"fontStyle":"italic","fontWeight":"bold"},"text":"4.1","element":"a"},{"style":{"fontStyle":"italic"},"text":") Assuming that ","element":"span"},{"style":{"height":21.21},"width":292.66,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-7.png","element":"img","alt":" ∥A∗ − ˆA∥2 ≤ ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":", then we will have that:","element":"span"}],[{"style":{"width":"113%"},"width":1967,"height":297,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-8.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":19.81},"width":138.36,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-9.png","element":"img","alt":" {xt}Tt=1 
","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is generated from a system with parameter ","element":"span"},{"style":{"height":15.42},"width":122.93,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-10.png","element":"img","alt":" A∗, U ∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to ","element":"span"},{"style":{"height":19.81},"width":668.36,"height":49.53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-11.png","element":"img","alt":" OptInputk(A∗, B∗, γ2, I, {xt}Tt=1),","inline":true},{"style":{"height":17.21},"width":34,"height":43.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-12.png","element":"img","alt":"ˆU","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to ","element":"span"},{"style":{"height":21.49},"width":739.71,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-13.png","element":"img","alt":" OptInputk( ˆA, B∗, γ2, I, {xt}Tt=1), and:","inline":true}],[{"style":{"width":"112%"},"width":1937,"height":282,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-14.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"style":{"height":17.6},"width":341.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-15.png","element":"img","alt":"(A∗, B∗, U, ϵ, I, w)","inline":true}],[{"style":{"width":"116%"},"width":2012,"height":653,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/71-16.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"between the largest and smallest eigenvalues of ","element":"span"},{"style":{"height":21.49},"width":469.83,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-0.png","element":"img","alt":" A∗, M(A∗, ˆA, {xt}Tt=1, 
I)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"will not include vectors ","element":"span"},{"style":{"fontStyle":"italic"},"text":"corresponding to the subspace spanned by the eigenvectors corresponding to the largest eigenvalues, as these will be sufficiently excited by noise to make ","element":"span"},{"style":{"height":22},"width":254.43,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-1.png","element":"img","alt":"Σ_{t=1}^T (w^⊤x_t)^2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"large. In that case one can ","element":"span"},{"style":{"fontStyle":"italic"},"text":"show that for all ","element":"span"},{"style":{"height":23.8},"width":1393.81,"height":59.5,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-2.png","element":"img","alt":" w ∈ M(A∗, ˆA, {x_t}_{t=1}^T, I), ∥(e^{jθ_i}I − A∗)^{−H}w∥_2 = O(∥(e^{jθ_i}I − A∗)^{−1}∥_2^{1/2}).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"F.1. Proof of Theorem ","element":"span"},{"href":"#id-50","style":{"fontWeight":"bold"},"text":"4.1 ","element":"a"},{"style":{"fontWeight":"bold"},"text":"and Theorem ","element":"span"},{"href":"#id-66","style":{"fontWeight":"bold"},"text":"F.1","element":"a"}],[{"style":{"width":"113%"},"width":1961,"height":385,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-3.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16.47},"width":257.54,"height":41.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-4.png","element":"img","alt":" w_{A∗,U∗}, w_{A∗,ˆU}","inline":true,"padRight":true},{"text":"are the eigenvectors corresponding to the minimum eigenvalues of the matrices ","element":"span"},{"style":{"height":22},"width":1287.48,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-5.png","element":"img","alt":" ξHk(A∗, B∗, U∗, I) + Σ_{t=1}^T x_t x_t^⊤ and ξHk(A∗, B∗, ˆU, I) + Σ_{t=1}^T x_t x_t^⊤ 
","inline":true,"padRight":true},{"text":", respectively. We ","element":"span"},{"text":"wish to show that:","element":"span"}],[{"style":{"height":53.5},"width":611.38,"height":133.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-6.png","element":"img","alt":"��wA∗,U∗⊤�ξHk(A∗, B∗, U∗, I) +","inline":true}],[{"style":{"width":"113%"},"width":1953,"height":213,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-7.png","element":"img"}],[{"text":"Denote ","element":"span"},{"style":{"height":16.47},"width":92.02,"height":41.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-8.png","element":"img","alt":" w ˆA, ˆU ","inline":true,"padRight":true},{"text":"the solution of the above minimization. Denote also ","element":"span"},{"style":{"height":16.47},"width":107.33,"height":41.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-9.png","element":"img","alt":" w ˆA,U∗","inline":true,"padRight":true},{"text":"the eigenvector corre- ","element":"span"},{"text":"sponding to the minimum eigenvalue of ","element":"span"},{"style":{"height":22},"width":574.04,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-10.png","element":"img","alt":" ξHk( ˆA, B∗, U∗, I) + �Tt=1 xtx⊤t ","inline":true,"padRight":true},{"text":". Then if for all ","element":"span"},{"style":{"height":18.58},"width":165.12,"height":46.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-11.png","element":"img","alt":" U ∈ Uγ2:","inline":true}],[{"id":"id-111","style":{"width":"110%"},"width":1909,"height":127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-12.png","element":"img"}],[{"text":"(29) and:","element":"span"}],[{"id":"id-112","style":{"width":"110%"},"width":1909,"height":127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-13.png","element":"img"}],[{"text":"(30) the above will follow. 
To see this, assume that:","element":"span"}],[{"style":{"width":"109%"},"width":1890,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-14.png","element":"img"}],[{"text":"then:","element":"span"}],[{"style":{"width":"53%"},"width":916,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/72-15.png","element":"img"}],[{"style":{"width":"57%"},"width":1001,"height":591,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"follows by optimality of ","element":"span"},{"style":{"height":17.6},"width":130.59,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-1.png","element":"img","alt":" U ∗, (b)","inline":true,"padRight":true},{"text":"follows by our assumption (","element":"span"},{"href":"#id-111","text":"29","element":"a"},{"text":"), and ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"follows since ","element":"span"},{"style":{"height":16.47},"width":92.02,"height":41.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-2.png","element":"img","alt":"w ˆA, ˆU ","inline":true,"padRight":true},{"text":"corresponds to the minimum eigenvalue of ","element":"span"},{"style":{"height":22},"width":557.22,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-3.png","element":"img","alt":" ξHk( ˆA, B∗, ˆU, I) + �Tt=1 xtx⊤t ","inline":true,"padRight":true},{"text":". 
This is clearly a ","element":"span"},{"text":"contradiction, which implies that:","element":"span"}],[{"style":{"width":"109%"},"width":1890,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-4.png","element":"img"}],[{"text":"We can repeat this argument identically in the opposite direction:","element":"span"}],[{"id":"id-113","style":{"width":"112%"},"width":1951,"height":981,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-5.png","element":"img"}],[{"text":"(31) We now return to bounding the difference assuming (","element":"span"},{"href":"#id-111","text":"29","element":"a"},{"text":") and (","element":"span"},{"href":"#id-112","text":"30","element":"a"},{"text":") hold:","element":"span"}],[{"style":{"width":"112%"},"width":1948,"height":286,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/73-6.png","element":"img"}],[{"text":"First, assume that:","element":"span"}],[{"style":{"width":"114%"},"width":1972,"height":1052,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/74-0.png","element":"img"}],[{"text":"where the final inequality follows by (","element":"span"},{"href":"#id-111","text":"29","element":"a"},{"text":") and (","element":"span"},{"href":"#id-113","text":"31","element":"a"},{"text":"). Assume instead that:","element":"span"}],[{"style":{"width":"113%"},"width":1955,"height":726,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/74-1.png","element":"img"}],[{"style":{"height":14.8},"width":120.77,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/74-2.png","element":"img","alt":"≤ δ′ +","inline":true}],[{"style":{"width":"103%"},"width":1790,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/74-3.png","element":"img"}],[{"text":"where the final equality follows by (","element":"span"},{"href":"#id-113","text":"31","element":"a"},{"text":"). 
If we assume that:","element":"span"}],[{"style":{"width":"102%"},"width":1774,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/74-4.png","element":"img"}],[{"style":{"width":"108%"},"width":1883,"height":358,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-0.png","element":"img"}],[{"style":{"height":14.8},"width":81.89,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-1.png","element":"img","alt":"≤ δ′","inline":true}],[{"style":{"width":"110%"},"width":1902,"height":367,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-2.png","element":"img"}],[{"style":{"height":14.8},"width":81.89,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-3.png","element":"img","alt":"≤ δ′","inline":true}],[{"text":"where the first inequality holds since ","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-4.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"text":"are the optimal inputs and the final equality follows by (","element":"span"},{"href":"#id-113","text":"31","element":"a"},{"text":"). Combining these, we conclude that:","element":"span"}],[{"style":{"width":"114%"},"width":1973,"height":637,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-5.png","element":"img"}],[{"text":"We want to guarantee that such a condition holds for ","element":"span"},{"style":{"height":20.87},"width":315.92,"height":52.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-6.png","element":"img","alt":" w ˆA,U∗ and wA∗, ˆU","inline":true},{"text":". 
In practice we cannot ","element":"span"},{"text":"determine what these are exactly since this requires knowledge of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-7.png","element":"img","alt":" A∗","inline":true},{"text":". Thus, instead, we will find a set ","element":"span"},{"style":{"height":21.49},"width":396.23,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-8.png","element":"img","alt":" M(A∗, ˆA, {xt}Tt=1, I)","inline":true,"padRight":true},{"text":"which is guaranteed to contain them. Setting:","element":"span"}],[{"style":{"width":"95%"},"width":1645,"height":254,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-9.png","element":"img"}],[{"text":"this will be satisfied. To see why, note that","element":"span"}],[{"style":{"width":"94%"},"width":1639,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/75-10.png","element":"img"}],[{"text":"upper bounds","element":"span"}],[{"style":{"width":"40%"},"width":696,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-0.png","element":"img"}],[{"text":"and","element":"span"}],[{"style":{"width":"39%"},"width":676,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-1.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":18.58},"width":253.77,"height":46.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-2.png","element":"img","alt":" U ∈ Uγ2, so if","inline":true}],[{"style":{"width":"106%"},"width":1833,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-3.png","element":"img"}],[{"text":"then ","element":"span"},{"style":{"fontStyle":"italic"},"text":"w ","element":"span"},{"text":"cannot possibly correspond to the minimum eigenvalue of 
either","element":"span"}],[{"style":{"width":"31%"},"width":542,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-4.png","element":"img"}],[{"text":"or","element":"span"}],[{"style":{"width":"96%"},"width":1660,"height":599,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"F.2. Perturbation Lemmas","element":"span"}],[{"id":"id-83","style":{"fontWeight":"bold"},"text":"Corollary F.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assuming that ","element":"span"},{"style":{"height":21.21},"width":258.19,"height":53.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-6.png","element":"img","alt":" ∥A∗− ˆA∥2 ≤ ϵ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and that the largest Jordan block of ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-7.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"has dimension ","element":"span"},{"style":{"fontStyle":"italic"},"text":"q","element":"span"},{"style":{"fontStyle":"italic"},"text":", we will have that, for small enough ","element":"span"},{"style":{"height":8.4},"width":32.71,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-8.png","element":"img","alt":" ϵ:","inline":true}],[{"style":{"width":"92%"},"width":1606,"height":238,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where here ","element":"span"},{"style":{"height":12.73},"width":51.55,"height":31.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-10.png","element":"img","alt":" U ∗ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to 
","element":"span"},{"style":{"height":22.67},"width":697.76,"height":56.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-11.png","element":"img","alt":" OptInputk�A∗, B∗, γ2, I, M�and ˆU","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the solution to ","element":"span"},{"style":{"height":31.6},"width":580.68,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/76-12.png","element":"img","alt":"OptInputk�ˆA, B∗, γ2, I, ˆM�.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"The proof of this result follows identically the proof of Theorem ","element":"span"},{"href":"#id-66","text":"F.1 ","element":"a"},{"text":"except now instead of showing:","element":"span"}],[{"style":{"width":"108%"},"width":1878,"height":891,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-0.png","element":"img"}],[{"text":"Given this, the rest of the proof of Theorem ","element":"span"},{"href":"#id-66","text":"F.1 ","element":"a"},{"text":"follows identically now.","element":"span"}],[{"text":"It is not clear in general how large ","element":"span"},{"style":{"height":17.6},"width":371.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-1.png","element":"img","alt":" L(A∗, B∗, U, ϵ, I, w)","inline":true,"padRight":true},{"text":"is and how it scales with ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-2.png","element":"img","alt":" ϵ","inline":true},{"text":". 
The following lemma provides an interpretable upper bound on ","element":"span"},{"style":{"height":17.6},"width":505.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-3.png","element":"img","alt":" L(A∗, B∗, U, ϵ, I, w) when ϵ","inline":true,"padRight":true},{"text":"is small enough.","element":"span"}],[{"id":"id-81","style":{"fontWeight":"bold"},"text":"Lemma F.4 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"style":{"fontStyle":"italic"},"text":"has period ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"style":{"fontStyle":"italic"},"text":". If:","element":"span"}],[{"style":{"width":"33%"},"width":584,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-4.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a > ","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", then:","element":"span"}],[{"style":{"width":"114%"},"width":1978,"height":389,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof","element":"span"}],[{"style":{"width":"116%"},"width":2010,"height":307,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/77-6.png","element":"img"}],[{"style":{"width":"117%"},"width":2039,"height":400,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/78-0.png","element":"img"}],[{"text":"where the final inequality holds since, by Parseval’s Theorem:","element":"span"}],[{"style":{"width":"71%"},"width":1238,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/78-1.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-114","text":"F.8 
","element":"a"},{"text":"and our condition on ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/78-2.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"we have that:","element":"span"}],[{"style":{"width":"66%"},"width":1153,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/78-3.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"111%"},"width":1935,"height":1211,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/78-4.png","element":"img"}],[{"style":{"width":"100%"},"width":1728,"height":1064,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-0.png","element":"img"}],[{"text":"To get deterministic bounds on the algorithm performance, it is helpful to deterministically upper bound ","element":"span"},{"style":{"height":21.49},"width":349.85,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-1.png","element":"img","alt":" M(A∗, ˆA, {xt}Tt=1)","inline":true},{"text":". 
The following lemma provides such a bound.","element":"span"}],[{"id":"id-93","style":{"fontWeight":"bold"},"text":"Lemma F.5 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":17.35},"width":256.39,"height":43.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-2.png","element":"img","alt":" A∗ = PJP −1 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the Jordan decomposition of ","element":"span"},{"style":{"height":15.64},"width":172.26,"height":39.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-3.png","element":"img","alt":" A∗, let Jℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"denote the ","element":"span"},{"style":{"height":12.8},"width":52.31,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-4.png","element":"img","alt":" ℓth","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Jordan block, and assume ","element":"span"},{"style":{"height":15.42},"width":154.09,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-5.png","element":"img","alt":" A∗ has r","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Jordan blocks. 
On the event that:","element":"span"}],[{"style":{"width":"18%"},"width":314,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-6.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and:","element":"span"}],[{"style":{"width":"78%"},"width":1354,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-7.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":8.4},"width":48.42,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-8.png","element":"img","alt":" w′ ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"to be specified, and if:","element":"span"}],[{"style":{"width":"33%"},"width":582,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"then:","element":"span"}],[{"style":{"width":"45%"},"width":781,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-10.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and:","element":"span"}],[{"style":{"width":"38%"},"width":663,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/79-11.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where:","element":"span"}],[{"style":{"width":"113%"},"width":1967,"height":231,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"and here ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is the frequency discretization at the epoch with end-time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"By 
definition:","element":"span"}],[{"style":{"width":"95%"},"width":1647,"height":606,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-1.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-114","text":"F.8 ","element":"a"},{"text":"and our condition on ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-2.png","element":"img","alt":" ϵ","inline":true},{"text":", we have that:","element":"span"}],[{"style":{"width":"95%"},"width":1643,"height":518,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-3.png","element":"img"}],[{"text":"By assumption:","element":"span"}],[{"style":{"width":"78%"},"width":1354,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-4.png","element":"img"}],[{"text":"Lemma ","element":"span"},{"href":"#id-115","text":"E.9 ","element":"a"},{"text":"implies that, assuming we choose ","element":"span"},{"style":{"height":33.02},"width":922.01,"height":82.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-5.png","element":"img","alt":" w′ such that��w′⊤Pn(j):n(j)��2 = 0 for j ̸= ℓ and that","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"is chosen such that it lies within epoch ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":":","element":"span"}],[{"style":{"width":"98%"},"width":1698,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/80-6.png","element":"img"}],[{"text":"Following the computation from Lemma ","element":"span"},{"href":"#id-116","text":"F.11 ","element":"a"},{"text":"and noting that:","element":"span"}],[{"style":{"width":"57%"},"width":994,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-0.png","element":"img"}],[{"text":"and that the inverse of a block diagonal matrix is 
equal to the matrix formed from each of the blocks inverted individually, we then have that:","element":"span"}],[{"style":{"width":"73%"},"width":1267,"height":85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-1.png","element":"img"}],[{"text":"In addition, Lemma ","element":"span"},{"href":"#id-116","text":"F.11","element":"a"},{"text":" gives that:","element":"span"}],[{"style":{"width":"110%"},"width":1907,"height":1565,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-2.png","element":"img"}],[{"text":"Assume that ","element":"span"},{"style":{"height":17.6},"width":865.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-3.png","element":"img","alt":" β(Jk)¯ρ(Jk) ≤ β(Ji)¯ρ(Ji) for all i ̸= k, and let w′ ","inline":true,"padRight":true},{"text":"be some vector such that","element":"span"},{"style":{"height":33.02},"width":341.75,"height":82.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-4.png","element":"img","alt":"��w′⊤Pn(i):n(i)��2 =","inline":true},{"style":{"height":16.8},"width":190.73,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-5.png","element":"img","alt":"0 for i ̸= k","inline":true},{"text":". 
Note that ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"a","element":"span"},{"text":") ","element":"span"},{"text":"will also upper bound:","element":"span"}],[{"style":{"width":"69%"},"width":1201,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/81-6.png","element":"img"}],[{"style":{"width":"78%"},"width":1360,"height":285,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-0.png","element":"img"}],[{"text":"and the result follows.","element":"span"}],[{"text":"Finally, for Theorem ","element":"span"},{"href":"#id-43","text":"2.2","element":"a"},{"text":", it is necessary to quantify how close ","element":"span"},{"style":{"height":21.21},"width":509.23,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-1.png","element":"img","alt":" Γt(A∗) is to Γt( ˆA). This is","inline":true,"padRight":true},{"text":"quantified below.","element":"span"}],[{"id":"id-84","style":{"fontWeight":"bold"},"text":"Lemma F.6 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"q ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be the dimension of the largest Jordan block of ","element":"span"},{"style":{"height":21.21},"width":561.72,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-2.png","element":"img","alt":" A∗. 
Then if ∥ ˆA − A∗∥2 ≤ ϵ, for","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"small enough ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-3.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":", where at least ","element":"span"},{"style":{"height":20.8},"width":616.37,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-4.png","element":"img","alt":" ρ(A∗) + q�2κ(A∗)ϵ < 1, we have:","inline":true}],[{"style":{"width":"96%"},"width":1675,"height":238,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"We first compute the directional derivative of ","element":"span"},{"style":{"height":21.6},"width":267.52,"height":54.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-6.png","element":"img","alt":"�t−1s=0 As∗(As∗)⊤","inline":true,"padRight":true},{"text":"with respect to ","element":"span"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-7.png","element":"img","alt":" A∗ ","inline":true,"padRight":true},{"text":"in direction","element":"span"}],[{"style":{"width":"108%"},"width":1880,"height":1379,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/82-8.png","element":"img"}],[{"style":{"width":"37%"},"width":643,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-0.png","element":"img"}],[{"text":"We can upper bound ","element":"span"},{"style":{"height":17.6},"width":164.09,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-1.png","element":"img","alt":" β(A′) 
as:","inline":true}],[{"style":{"width":"36%"},"width":626,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-2.png","element":"img"}],[{"text":"Writing ","element":"span"},{"href":"#id-114","style":{"height":27.41},"width":1592.12,"height":68.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-3.png","element":"img","alt":" A′ = A∗+δ∆ for δ ∈ [0, ϵ] and ∥∆∥2 = 1, by Lemma F.8, if ϵ ≤ 1maxθ∈[0,2π] 2∥(ejθI−A∗)−1∥2 :","inline":true}],[{"style":{"width":"98%"},"width":1703,"height":376,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-4.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-117","text":"F.10 ","element":"a"},{"text":"we will have that:","element":"span"}],[{"style":{"width":"44%"},"width":765,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-5.png","element":"img"}],[{"text":"Combining these we have that:","element":"span"}],[{"style":{"width":"98%"},"width":1698,"height":301,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-6.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"F.3. 
Additional Lemmas","element":"span"}],[{"style":{"width":"85%"},"width":1474,"height":167,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-7.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where:","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"L","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"A, B, U, ϵ, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I","element":"span"},{"style":{"fontStyle":"italic"},"text":", w","element":"span"},{"text":")","element":"span"}],[{"style":{"width":"108%"},"width":1883,"height":157,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"To bound","element":"span"},{"style":{"height":32.55},"width":816.36,"height":81.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-9.png","element":"img","alt":"��w⊤Hk(A, B, U, I)w − w⊤Hk( ˆA, B, U, I)w��","inline":true},{"text":", we calculate the directional derivative of ","element":"span"},{"style":{"height":17.6},"width":365.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-10.png","element":"img","alt":" w⊤Hk(A, B, U, I)w","inline":true,"padRight":true},{"text":"with respect to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"text":"and use this to bound the Lipschitz constant of the function ","element":"span"},{"style":{"height":17.6},"width":365.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-11.png","element":"img","alt":" w⊤Hk(A, B, U, I)w","inline":true},{"text":". 
The directional derivative is given by:","element":"span"}],[{"style":{"width":"88%"},"width":1523,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/83-12.png","element":"img"}],[{"text":"Lemma ","element":"span"},{"href":"#id-114","text":"F.8 ","element":"a"},{"text":"gives that, for small enough ","element":"span"},{"style":{"height":12.8},"width":33.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-0.png","element":"img","alt":" δ:","inline":true}],[{"style":{"width":"61%"},"width":1061,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-1.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"108%"},"width":1880,"height":505,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-2.png","element":"img"}],[{"text":"and thus:","element":"span"}],[{"style":{"width":"116%"},"width":2019,"height":1097,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-3.png","element":"img"}],[{"id":"id-114","style":{"fontWeight":"bold"},"text":"Lemma F.8 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For ","element":"span"},{"style":{"height":25.43},"width":335.35,"height":63.57,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-4.png","element":"img","alt":" δ < 1∥(ejθI−A)−1∥2 :","inline":true}],[{"style":{"width":"61%"},"width":1061,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/84-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"To see that this is true, we can simply multiply the right hand side above by ","element":"span"},{"style":{"height":19.53},"width":293.24,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-0.png","element":"img","alt":" (ejθI −A−δ∆)","inline":true,"padRight":true},{"text":"and observe that the result is 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"I","element":"span"},{"text":". We wish to show that:","element":"span"}],[{"style":{"width":"109%"},"width":1893,"height":864,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-1.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":25.43},"width":313.96,"height":63.57,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-2.png","element":"img","alt":" δ < 1∥(ejθI−A)−1∥2 ","inline":true,"padRight":true},{"text":", we can make ","element":"span"},{"style":{"height":20.74},"width":423.98,"height":51.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-3.png","element":"img","alt":" δn+1∥(ejθI −A)−1∥n+12","inline":true,"padRight":true},{"text":"arbitrarily small by making ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"text":"large. Thus, for any ","element":"span"},{"style":{"height":12.4},"width":97.89,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-4.png","element":"img","alt":" ϵ > 0","inline":true},{"text":", we can find an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N ","element":"span"},{"text":"such that for all ","element":"span"},{"style":{"height":14.4},"width":136.19,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-5.png","element":"img","alt":" n ≥ N:","inline":true}],[{"style":{"width":"72%"},"width":1254,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-6.png","element":"img"}],[{"text":"This implies that:","element":"span"}],[{"style":{"width":"84%"},"width":1463,"height":432,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-7.png","element":"img"}],[{"id":"id-80","style":{"fontWeight":"bold"},"text":"Lemma F.9 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"If:","element":"span"}],[{"style":{"width":"76%"},"width":1317,"height":390,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/85-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Denote ","element":"span"},{"href":"#id-114","style":{"height":26.97},"width":1308.19,"height":67.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-0.png","element":"img","alt":"ˆA = A + δ∆ for some ∥∆∥2 = 1 and δ ≤ 1a∥(ejθI−A)−1∥2 . By Lemma F.8:","inline":true}],[{"style":{"width":"40%"},"width":694,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-1.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"∥","element":"span"},{"style":{"height":20.41},"width":420.69,"height":51.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-2.png","element":"img","alt":"(ejθI−A − δ∆)−1∥2 =","inline":true}],[{"style":{"width":"93%"},"width":1621,"height":313,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-3.png","element":"img"}],[{"text":"For the second inequality, we can simply multiply the first term in the expression by ","element":"span"},{"style":{"height":12.8},"width":261.69,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-4.png","element":"img","alt":" w⊤ and we see","inline":true,"padRight":true},{"text":"that the result holds.","element":"span"}],[{"id":"id-117","style":{"fontWeight":"bold"},"text":"Lemma F.10 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":21.21},"width":265.5,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-5.png","element":"img","alt":" ∥A − ˆA∥2 ≤ ϵ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for some small enough 
","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-6.png","element":"img","alt":" ϵ","inline":true},{"style":{"fontStyle":"italic"},"text":". Denote by ","element":"span"},{"style":{"height":21.21},"width":89.26,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-7.png","element":"img","alt":" ρ( ˆA)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the spectral radius of ","element":"span"},{"style":{"height":17.21},"width":374.44,"height":43.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-8.png","element":"img","alt":"ˆA. Let A = PJP −1 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the Jordan decomposition of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"style":{"fontStyle":"italic"},"text":". Then if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is diagonal, we will have that:","element":"span"}],[{"style":{"width":"23%"},"width":398,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":19.14},"width":504.5,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-10.png","element":"img","alt":" κ(A) = ∥P∥2∥P −1∥2. 
If J","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is not diagonal then, letting ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"style":{"fontStyle":"italic"},"text":"be the dimension of its largest Jordan block:","element":"span"}],[{"style":{"width":"27%"},"width":470,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-11.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Let ","element":"span"},{"style":{"height":15.14},"width":245.44,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-12.png","element":"img","alt":" A = PJP −1 ","inline":true,"padRight":true},{"text":"be the Jordan decomposition of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":". Assume that ","element":"span"},{"style":{"height":18.41},"width":376.96,"height":46.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-13.png","element":"img","alt":"ˆA = A + δ∆ where","inline":true},{"style":{"height":17.6},"width":543.66,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-14.png","element":"img","alt":"δ ∈ [0, ϵ] and ∥∆∥2 = 1. Let µ","inline":true,"padRight":true},{"text":"be the eigenvalue of ","element":"span"},{"style":{"height":16.81},"width":33.52,"height":42.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-15.png","element":"img","alt":"ˆA","inline":true,"padRight":true},{"text":"with largest magnitude and assume that ","element":"span"},{"style":{"height":16},"width":66.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-16.png","element":"img","alt":" µ is","inline":true,"padRight":true},{"text":"not an eigenvalue of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"text":"(otherwise we are trivially done). 
Since ","element":"span"},{"style":{"height":12},"width":26,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-17.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"is an eigenvalue of ","element":"span"},{"style":{"height":16.81},"width":33.52,"height":42.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-18.png","element":"img","alt":"ˆA","inline":true},{"text":", following a standard proof of the Bauer-Fike Theorem we have:","element":"span"}],[{"style":{"width":"65%"},"width":1131,"height":192,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-19.png","element":"img"}],[{"text":"Since by assumption ","element":"span"},{"style":{"height":12},"width":26,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-20.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"is not an eigenvalue of ","element":"span"},{"style":{"height":17.6},"width":375.57,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-21.png","element":"img","alt":" A, det(J − µI) ̸= 0","inline":true},{"text":", which implies that ","element":"span"},{"style":{"height":12.4},"width":152.93,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-22.png","element":"img","alt":" −1 is an","inline":true,"padRight":true},{"text":"eigenvalue of ","element":"span"},{"style":{"height":19.13},"width":380.6,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-23.png","element":"img","alt":" δ(J − µI)−1P −1∆P","inline":true},{"text":". 
Since the spectral norm upper bounds all eigenvalues:","element":"span"}],[{"style":{"width":"35%"},"width":606,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-24.png","element":"img"}],[{"text":"so:","element":"span"}],[{"id":"id-119","style":{"width":"62%"},"width":1084,"height":85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-25.png","element":"img"}],[{"text":"If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J ","element":"span"},{"text":"is diagonal, then ","element":"span"},{"style":{"height":24.72},"width":1338.17,"height":61.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-26.png","element":"img","alt":" ∥(J − µI)−1∥2 = 1mini |λi(A)−µ|. Denoting i∗ = arg mini |λi(A) − µ|, we","inline":true,"padRight":true},{"text":"then have:","element":"span"}],[{"style":{"width":"71%"},"width":1244,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/86-27.png","element":"img"}],[{"text":"where the implication follows by the reverse triangle inequality. If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J ","element":"span"},{"text":"is not diagonal, then ","element":"span"},{"style":{"height":16},"width":132.16,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-0.png","element":"img","alt":" J − µI","inline":true,"padRight":true},{"text":"will be a Jordan form with eigenvalues ","element":"span"},{"style":{"height":16.4},"width":120.44,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-1.png","element":"img","alt":" λi − µ","inline":true},{"text":". 
In particular then we have:","element":"span"}],[{"style":{"width":"105%"},"width":1818,"height":94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-2.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"J","element":"span"},{"style":{"height":16},"width":128.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-3.png","element":"img","alt":"−µI =","inline":true}],[{"style":{"width":"105%"},"width":1818,"height":78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":18.63},"width":167.88,"height":46.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-5.png","element":"img","alt":"˜Ji is the i","inline":true},{"text":"th Jordan block of ","element":"span"},{"style":{"height":17.6},"width":227.64,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-6.png","element":"img","alt":" J − µI, n(i)","inline":true,"padRight":true},{"text":"is the dimension of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"th Jordan block, and:","element":"span"}],[{"style":{"width":"36%"},"width":623,"height":289,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-7.png","element":"img"}],[{"text":"Since the inverse of a block diagonal matrix is simply formed by inverting each block, we can calculate ","element":"span"},{"style":{"height":19.13},"width":210.86,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-8.png","element":"img","alt":" (J − µI)−1 ","inline":true,"padRight":true},{"text":"by calculating the inverse of each block ","element":"span"},{"style":{"height":19.95},"width":385.76,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-9.png","element":"img","alt":" (λi − µ)In(i) + 
Dn(i)","inline":true,"padRight":true},{"text":"individually. Note that each block is invertible since we have assumed that ","element":"span"},{"style":{"height":12},"width":26,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-10.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"is not an eigenvalue of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":". By Taylor expanding, and the fact that ","element":"span"},{"style":{"height":18.75},"width":94.38,"height":46.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-11.png","element":"img","alt":" Dn(i)","inline":true,"padRight":true},{"text":"is nilpotent, we have:","element":"span"}],[{"style":{"width":"49%"},"width":856,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-12.png","element":"img"}],[{"text":"so:","element":"span"}],[{"style":{"width":"89%"},"width":1548,"height":294,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-13.png","element":"img"}],[{"text":"Since eigenvalues are continuous functions of the entries of a matrix ","element":"span"},{"href":"#id-118","referenceIndex":21,"text":"Horn and Johnson ","element":"a"},{"text":"(","element":"span"},{"href":"#id-118","referenceIndex":21,"text":"2012","element":"a"},{"text":"), for small enough ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-14.png","element":"img","alt":" δ","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":17.6},"width":457.21,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-15.png","element":"img","alt":" |µ − λi| ≤ 1/2 for some i","inline":true},{"text":". 
If this holds then:","element":"span"}],[{"style":{"width":"78%"},"width":1355,"height":566,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/87-16.png","element":"img"}],[{"text":"Then:","element":"span"}],[{"style":{"width":"49%"},"width":862,"height":599,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-0.png","element":"img"}],[{"text":"Combining this with (","element":"span"},{"href":"#id-119","text":"32","element":"a"},{"text":") and denoting ","element":"span"},{"style":{"height":16.33},"width":90.82,"height":40.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-1.png","element":"img","alt":" i∗, j∗ ","inline":true,"padRight":true},{"text":"the indices at which the above maximum is achieved, we get that:","element":"span"}],[{"style":{"width":"79%"},"width":1381,"height":291,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-2.png","element":"img"}],[{"id":"id-116","style":{"fontWeight":"bold"},"text":"Lemma F.11 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Let ","element":"span"},{"style":{"height":15.14},"width":239.85,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-3.png","element":"img","alt":" A = PJP −1 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be the Jordan decomposition of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
Assume that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontStyle":"italic"},"text":"has ","element":"span"},{"style":{"fontStyle":"italic"},"text":"r ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Jordan blocks and denote by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"the start and stop indices of the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":"th Jordan block (so in particular, if ","element":"span"},{"style":{"height":15.02},"width":174.81,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-4.png","element":"img","alt":" Ji is the i","inline":true},{"style":{"fontStyle":"italic"},"text":"th Jordan block, we have that ","element":"span"},{"style":{"height":19.95},"width":978.71,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-5.png","element":"img","alt":" Ji = [J]n(i):n(i),n(i):n(i)). Let Pi:j to denote [pi, ..., pj],","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the matrix with columns equal to the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":"th to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"style":{"fontStyle":"italic"},"text":"th columns of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"P","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
Then:","element":"span"}],[{"style":{"width":"53%"},"width":929,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-6.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"We have:","element":"span"}],[{"style":{"width":"89%"},"width":1554,"height":179,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-7.png","element":"img"}],[{"text":"Since, for nonnegative ","element":"span"},{"style":{"height":19.9},"width":443.05,"height":49.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-8.png","element":"img","alt":" a, b, √(a + b) ≤ √a + √b","inline":true,"padRight":true},{"text":"(by virtue of the fact that ","element":"span"},{"style":{"height":20.08},"width":423.12,"height":50.19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-9.png","element":"img","alt":" a + b ≤ (√a + √b)2 =","inline":true},{"style":{"height":19.9},"width":285.07,"height":49.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-10.png","element":"img","alt":"a + b + 2√a√b","inline":true},{"text":"), it then follows that:","element":"span"}],[{"style":{"width":"0%"},"width":13,"height":2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-11.png","element":"img"}],[{"style":{"height":32.7},"width":156.68,"height":81.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-12.png","element":"img","alt":"∥[w⊤p1","inline":true},{"style":{"fontStyle":"italic"},"text":", . . . , w","element":"span"},{"style":{"height":19.95},"width":427.21,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-13.png","element":"img","alt":"⊤pn(1)]Jℓ1, . . . , [w⊤pn(r)","inline":true},{"style":{"fontStyle":"italic"},"text":", . . . 
, w","element":"span"},{"style":{"height":33.02},"width":281.65,"height":82.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-14.png","element":"img","alt":"⊤pn(r)]Jℓr∥2 ≤","inline":true}],[{"style":{"width":"25%"},"width":440,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/88-15.png","element":"img"}],[{"style":{"width":"36%"},"width":630,"height":349,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-0.png","element":"img"}]]},{"heading":"Appendix G. Lower Bound","paragraphs":[[{"text":"We base our analysis on the lower bound presented in ","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"Jedra and Proutiere ","element":"a"},{"text":"(","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"2019","element":"a"},{"text":"). A slight modification of their analysis to our situation yields the following result.","element":"span"}],[{"id":"id-120","style":{"fontWeight":"bold"},"text":"Theorem G.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any matrix ","element":"span"},{"style":{"height":17.6},"width":535.91,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-1.png","element":"img","alt":" A∗, for all ϵ > 0, δ ∈ (0, 1)","inline":true},{"style":{"fontStyle":"italic"},"text":", the sample complexity ","element":"span"},{"style":{"height":15.6},"width":182.72,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-2.png","element":"img","alt":" τϵδ of any","inline":true},{"style":{"height":17.6},"width":92.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-3.png","element":"img","alt":"(ϵ, δ)","inline":true},{"style":{"fontStyle":"italic"},"text":"-locally-stable algorithm in ","element":"span"},{"style":{"height":16.8},"width":215.77,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-4.png","element":"img","alt":" A∗ 
satisfies:","inline":true}],[{"style":{"width":"39%"},"width":688,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"The proof of this result is essentially identical to the proof of Theorem 1 in ","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"Jedra and ","element":"a"},{"href":"#id-39","referenceIndex":23,"text":"Proutiere ","element":"a"},{"text":"(","element":"span"},{"href":"#id-39","referenceIndex":23,"text":"2019","element":"a"},{"text":") and we omit it here.","element":"span"}],[{"text":"Denoting ","element":"span"},{"style":{"height":16.25},"width":44.94,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-6.png","element":"img","alt":" xut ","inline":true,"padRight":true},{"text":"the response of the system due to the input and ","element":"span"},{"style":{"height":18.72},"width":41.94,"height":46.79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-7.png","element":"img","alt":" xηt ","inline":true,"padRight":true},{"text":"the response due to the noise, ","element":"span"},{"text":"we can write:","element":"span"}],[{"style":{"width":"85%"},"width":1472,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-8.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"63%"},"width":1090,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-9.png","element":"img"}],[{"text":"so, Theorem ","element":"span"},{"href":"#id-120","text":"G.1 ","element":"a"},{"text":"gives that:","element":"span"}],[{"id":"id-121","style":{"width":"98%"},"width":1698,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"G.1. 
Proof of Theorem ","element":"span"},{"href":"#id-42","style":{"fontWeight":"bold"},"text":"2.1","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Since (","element":"span"},{"href":"#id-121","text":"33","element":"a"},{"text":") holds for all input sequences ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-11.png","element":"img","alt":" ut","inline":true},{"text":", and since we wish to minimize the lower bound, we will have in particular:","element":"span"}],[{"style":{"width":"60%"},"width":1046,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/89-12.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":16.25},"width":44.94,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-0.png","element":"img","alt":" xut ","inline":true,"padRight":true},{"text":"is deterministic conditioned on ","element":"span"},{"style":{"height":10.62},"width":36.98,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-1.png","element":"img","alt":" ut","inline":true},{"text":", maximizing ","element":"span"},{"style":{"height":20.8},"width":727.82,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-2.png","element":"img","alt":" λmin(E[∑τϵδt=1 xut xut⊤] + ∑τϵδt=1 σ2Γt) is","inline":true,"padRight":true},{"text":"equivalent to maximizing ","element":"span"},{"style":{"height":20.8},"width":614.54,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-3.png","element":"img","alt":" λmin(∑τϵδt=1 xut xut⊤ + ∑τϵδt=1 σ2Γt)","inline":true},{"text":". 
For any input ","element":"span"},{"style":{"fontStyle":"italic"},"text":"u ","element":"span"},{"text":"satisfying the power constraint given in the statement of Theorem ","element":"span"},{"href":"#id-42","text":"2.1","element":"a"},{"text":", by Lemma ","element":"span"},{"href":"#id-122","text":"E.8","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"93%"},"width":1616,"height":429,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-4.png","element":"img"}],[{"text":"Note that the term ","element":"span"},{"style":{"height":23.54},"width":1016.3,"height":58.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-5.png","element":"img","alt":"(1/τϵδ) ∑τϵδt=1 G(ejθt)U(ejθt)U(ejθt)HG(ejθt)H + ∑τϵδt=1 σ2Γt","inline":true,"padRight":true},{"text":"is scaling as ","element":"span"},{"style":{"height":14.84},"width":151.4,"height":37.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-6.png","element":"img","alt":" τϵδ since","inline":true},{"style":{"height":14.62},"width":121.68,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-7.png","element":"img","alt":"Γt ⪰ I","inline":true},{"text":". 
Thus, for large enough ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-8.png","element":"img","alt":" τϵδ","inline":true},{"text":", since the left hand side is only scaling as ","element":"span"},{"style":{"height":17.6},"width":99.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-9.png","element":"img","alt":"√τϵδ:","inline":true}],[{"style":{"width":"83%"},"width":1446,"height":255,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-10.png","element":"img"}],[{"text":"so, for large enough ","element":"span"},{"style":{"height":11.24},"width":62.94,"height":28.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-11.png","element":"img","alt":" τϵδ:","inline":true}],[{"style":{"width":"94%"},"width":1634,"height":429,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-12.png","element":"img"}],[{"text":"For small enough ","element":"span"},{"style":{"height":10.44},"width":88.32,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-13.png","element":"img","alt":" ϵ, τϵδ","inline":true,"padRight":true},{"text":"will be sufficiently large for this to hold. 
We have then that:","element":"span"}],[{"style":{"width":"71%"},"width":1231,"height":280,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-14.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-40","text":"H.2","element":"a"},{"text":", we know that:","element":"span"}],[{"style":{"width":"59%"},"width":1031,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-15.png","element":"img"}],[{"text":"exists and, further, that:","element":"span"}],[{"style":{"width":"103%"},"width":1786,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/90-16.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":10.44},"width":48.79,"height":26.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-0.png","element":"img","alt":" τϵδ","inline":true},{"text":". Thus, for small enough ","element":"span"},{"style":{"height":8},"width":18,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-1.png","element":"img","alt":" ϵ","inline":true},{"text":", we will have that:","element":"span"}],[{"style":{"width":"75%"},"width":1309,"height":207,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-2.png","element":"img"}]]},{"heading":"Appendix H. Additional Lemmas","paragraphs":[[{"id":"id-123","style":{"fontWeight":"bold"},"text":"Lemma H.1 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume that ","element":"span"},{"style":{"height":17.6},"width":169.41,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-3.png","element":"img","alt":" ρ(A) < 1","inline":true},{"style":{"fontStyle":"italic"},"text":". 
Then for any ","element":"span"},{"style":{"height":15.6},"width":96.29,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-4.png","element":"img","alt":" θ1, θ2","inline":true},{"style":{"fontStyle":"italic"},"text":", we will have that:","element":"span"}],[{"style":{"width":"77%"},"width":1342,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-5.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"so it follows that ","element":"span"},{"style":{"height":19.53},"width":240.54,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-6.png","element":"img","alt":" (ejθI − A)−1 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is Lipschitz continuous in ","element":"span"},{"style":{"height":12.8},"width":32.7,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-7.png","element":"img","alt":" θ.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Noting that, since we assume ","element":"span"},{"style":{"height":17.6},"width":169.41,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-8.png","element":"img","alt":" ρ(A) < 1","inline":true},{"text":", using the identity that ","element":"span"},{"style":{"height":19.13},"width":501.28,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-9.png","element":"img","alt":" (I + A)−1 = I − A + A2 −","inline":true}],[{"style":{"width":"76%"},"width":1322,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-10.png","element":"img"}],[{"text":"Thus:","element":"span"}],[{"style":{"width":"47%"},"width":813,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-11.png","element":"img"}],[{"text":"For any matrix 
","element":"span"},{"style":{"height":17.6},"width":466.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-12.png","element":"img","alt":" A with ρ(A) < 1 we have:","inline":true}],[{"text":"(","element":"span"},{"style":{"height":20.01},"width":1375.55,"height":50.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-13.png","element":"img","alt":"I + 2A + 3A2 + 4A3 + ...)(I − A)2 = (I + A + A2 + A3 + ...)(I − A) = I","inline":true,"padRight":true},{"text":"=","element":"span"},{"style":{"height":20.01},"width":863.81,"height":50.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-14.png","element":"img","alt":"⇒ (I + 2A + 3A2 + 4A3 + ...)−1 = (I − A)−2","inline":true}],[{"text":"which implies:","element":"span"}],[{"style":{"width":"85%"},"width":1477,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-15.png","element":"img"}],[{"text":"So the Lipschitz constant of ","element":"span"},{"style":{"height":19.53},"width":240.54,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-16.png","element":"img","alt":" (ejθI − A)−1 ","inline":true,"padRight":true},{"text":"is bounded by:","element":"span"}],[{"style":{"width":"63%"},"width":1104,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-17.png","element":"img"}],[{"text":"from which the result follows directly.","element":"span"}],[{"id":"id-40","style":{"fontWeight":"bold"},"text":"Lemma H.2 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any sequences of integers ","element":"span"},{"style":{"height":15.6},"width":935.7,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-18.png","element":"img","alt":" ni, mi such that limi→∞ ni = limi→∞ mi = ∞, 
we","inline":true}],[{"style":{"width":"67%"},"width":1166,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-19.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"assuming the limit of each exists. Further, for any finite ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"style":{"fontStyle":"italic"},"text":", we will have:","element":"span"}],[{"style":{"width":"29%"},"width":516,"height":70,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/91-20.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof ","element":"span"},{"text":"Assume the opposite, that there exists some sequence of integers ","element":"span"},{"style":{"height":11.2},"width":109.43,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-0.png","element":"img","alt":" ni, mi","inline":true,"padRight":true},{"text":"satisfying the above condition such that ","element":"span"},{"style":{"height":24.27},"width":754.42,"height":60.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-1.png","element":"img","alt":" limi→∞ λmin(˜Γu∗ni ) > limj→∞ λmin(˜Γu∗mj)","inline":true},{"text":". 
By the definition of a limit, this ","element":"span"},{"text":"implies that there exists some finite ","element":"span"},{"style":{"height":14.62},"width":32.04,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-2.png","element":"img","alt":" i0","inline":true,"padRight":true},{"text":"such that for any ","element":"span"},{"style":{"height":14.62},"width":119.82,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-3.png","element":"img","alt":" i ≥ i0","inline":true},{"text":", we will have that ","element":"span"},{"style":{"height":22.27},"width":237.4,"height":55.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-4.png","element":"img","alt":" λmin(˜Γu∗ni ) >","inline":true},{"style":{"height":24.27},"width":660.91,"height":60.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-5.png","element":"img","alt":"λmin(˜Γu∗mj) for all j. For any ℓ ∈ [ni0]","inline":true},{"text":", note that we can make:","element":"span"}],[{"style":{"width":"12%"},"width":218,"height":107,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-6.png","element":"img"}],[{"text":"arbitrarily small for large enough ","element":"span"},{"style":{"height":17.02},"width":313.67,"height":42.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-7.png","element":"img","alt":" j (since mj → ∞","inline":true,"padRight":true},{"text":"and by proper choice of ","element":"span"},{"style":{"height":17.6},"width":72.62,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-8.png","element":"img","alt":" ℓ(j)","inline":true},{"text":"). 
By Lemma ","element":"span"},{"href":"#id-123","text":"H.1","element":"a"},{"text":", this implies that we can make:","element":"span"}],[{"style":{"width":"41%"},"width":712,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-9.png","element":"img"}],[{"text":"arbitrarily small. Thus, for large enough ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":", we can simply set the inputs at positions ","element":"span"},{"style":{"height":28.16},"width":223.3,"height":70.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-10.png","element":"img","alt":"ℓ(j)mj identical","inline":true}],[{"text":"to those at positions ","element":"span"},{"style":{"height":26.84},"width":634.27,"height":67.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-11.png","element":"img","alt":"ℓni0 for each ℓ, and make λmin(˜Γu∗mj)","inline":true,"padRight":true},{"text":"arbitrarily close to ","element":"span"},{"style":{"height":25.14},"width":196.94,"height":62.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-12.png","element":"img","alt":" λmin(˜Γu∗ni0)","inline":true,"padRight":true},{"text":"while still ","element":"span"},{"text":"meeting the feasibility constraint on the input. 
This contradicts the fact that ","element":"span"},{"style":{"height":22.27},"width":382.1,"height":55.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-13.png","element":"img","alt":" limi→∞ λmin(˜Γu∗ni ) >","inline":true},{"style":{"height":24.27},"width":345.86,"height":60.67,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-14.png","element":"img","alt":"limj→∞ λmin(˜Γu∗mj)","inline":true},{"text":", which implies that ","element":"span"},{"style":{"height":24.27},"width":748.15,"height":60.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-15.png","element":"img","alt":" limi→∞ λmin(˜Γu∗ni ) = limj→∞ λmin(˜Γu∗mj).","inline":true}],[{"style":{"width":"103%"},"width":1797,"height":327,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-16.png","element":"img"}],[{"id":"id-41","style":{"fontWeight":"bold"},"text":"Lemma H.3 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"For any integer ","element":"span"},{"style":{"height":15.02},"width":39.72,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-17.png","element":"img","alt":" k0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and finite input power budget ","element":"span"},{"style":{"height":18.34},"width":55.94,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-18.png","element":"img","alt":" γ2,","inline":true}],[{"style":{"width":"34%"},"width":592,"height":87,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-19.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"exists and is finite.","element":"span"}],[{"style":{"width":"107%"},"width":1860,"height":459,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/92-20.png","element":"img"}],[{"text":"exists and is finite.","element":"span"}]]},{"heading":"Appendix I. 
Suboptimality of Colored Noise","paragraphs":[[{"text":"First, note that satisfying the power constraint in this setting is equivalent to ","element":"span"},{"style":{"height":19.13},"width":368.12,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-0.png","element":"img","alt":" Tr(Σ) ≤ γ2. Under","inline":true,"padRight":true},{"text":"this constraint, the optimal noise covariance can be obtained by solving:","element":"span"}],[{"style":{"width":"56%"},"width":974,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-1.png","element":"img"}],[{"text":"In our setting, with ","element":"span"},{"style":{"height":18.33},"width":155.32,"height":45.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-2.png","element":"img","alt":" γ2 ≫ σ2","inline":true},{"text":", solving this is approximately equivalent to solving:","element":"span"}],[{"style":{"width":"25%"},"width":440,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-3.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16.41},"width":382.08,"height":41.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-4.png","element":"img","alt":"˜Σ = V ⊤ΣV . 
Let ˜Σ∗ ","inline":true,"padRight":true},{"text":"be the optimal diagonal solution, and note that, in this case, we will have:","element":"span"}],[{"style":{"width":"29%"},"width":514,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-5.png","element":"img"}],[{"text":"To see this, note that for any diagonal ","element":"span"},{"style":{"height":20.9},"width":395.81,"height":52.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-6.png","element":"img","alt":"˜Σ with ith element γ2i :","inline":true}],[{"style":{"width":"30%"},"width":534,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-7.png","element":"img"}],[{"text":"The optimal solution will clearly be the solution that balances the energy in every diagonal element, that is:","element":"span"}],[{"style":{"width":"28%"},"width":500,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-8.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":17.6},"width":155.06,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-9.png","element":"img","alt":" i, j ∈ [d]","inline":true},{"text":", so combining this constraint with the trace constraint yields:","element":"span"}],[{"style":{"width":"65%"},"width":1138,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-10.png","element":"img"}],[{"text":"and thus the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":"th diagonal element will be:","element":"span"}],[{"style":{"width":"12%"},"width":216,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-11.png","element":"img"}],[{"text":"Consider now some other matrix ","element":"span"},{"style":{"height":12.8},"width":37,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-12.png","element":"img","alt":" 
∆","inline":true,"padRight":true},{"text":"that is not necessarily diagonal. Note then that:","element":"span"}],[{"style":{"width":"67%"},"width":1159,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/93-13.png","element":"img"}],[{"style":{"width":"40%"},"width":697,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-0.png","element":"img"}],[{"text":"For ","element":"span"},{"style":{"height":17.61},"width":141.82,"height":44.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-1.png","element":"img","alt":"˜Σ∗ + ∆","inline":true,"padRight":true},{"text":"to be in the constraint set, we must have that ","element":"span"},{"style":{"height":20.41},"width":711.72,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-2.png","element":"img","alt":" Tr(˜Σ∗ + ∆) = γ2 + Tr(∆) ≤ γ2 =⇒","inline":true},{"style":{"height":17.6},"width":202.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-3.png","element":"img","alt":"Tr(∆) ≤ 0","inline":true},{"text":". To have that:","element":"span"}],[{"style":{"width":"52%"},"width":910,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-4.png","element":"img"}],[{"text":"we must have that ","element":"span"},{"style":{"height":22},"width":234.04,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-5.png","element":"img","alt":"∑_{t=0}^{k} Λ^t ∆ Λ^t","inline":true,"padRight":true},{"text":"is positive definite. 
However, this is not possible since the diagonal ","element":"span"},{"text":"elements of ","element":"span"},{"style":{"height":22},"width":234.04,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-6.png","element":"img","alt":"∑_{t=0}^{k} Λ^t ∆ Λ^t","inline":true,"padRight":true},{"text":"are sums of non-negative scalings of the diagonal elements of ","element":"span"},{"style":{"height":15.6},"width":47.36,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-7.png","element":"img","alt":" ∆,","inline":true,"padRight":true},{"text":"and since ","element":"span"},{"style":{"height":12.8},"width":37,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-8.png","element":"img","alt":" ∆","inline":true,"padRight":true},{"text":"must have at least one non-positive element on the diagonal to meet the constraint ","element":"span"},{"style":{"height":17.6},"width":205.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-9.png","element":"img","alt":"Tr(∆) ≤ 0","inline":true},{"text":", it follows that ","element":"span"},{"style":{"height":22},"width":234.04,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-10.png","element":"img","alt":"∑_{t=0}^{k} Λ^t ∆ Λ^t","inline":true,"padRight":true},{"text":"has at least one non-positive diagonal element. 
Since the ","element":"span"},{"text":"diagonal elements of every positive definite matrix are positive, ","element":"span"},{"style":{"height":22},"width":234.03,"height":55.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-11.png","element":"img","alt":"∑_{t=0}^{k} Λ^t ∆ Λ^t","inline":true,"padRight":true},{"text":"cannot be positive ","element":"span"},{"text":"definite, so we cannot increase the value of ","element":"span"},{"style":{"height":31.6},"width":517.21,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-12.png","element":"img","alt":" λ_min(∑_{t=0}^{k} Λ^t (˜Σ∗ + ∆) Λ^t)","inline":true},{"text":". By convexity of the constraint set, it follows that the directional derivative of the objective toward any other point in our constraint set is non-positive. Since the objective is concave, it follows that ","element":"span"},{"style":{"height":16.01},"width":48.52,"height":40.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-13.png","element":"img","alt":"˜Σ∗ ","inline":true,"padRight":true},{"text":"is optimal.","element":"span"}],[{"style":{"width":"96%"},"width":1659,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-14.png","element":"img"}],[{"text":"sufficiently large, we have that:","element":"span"}],[{"style":{"width":"31%"},"width":552,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/94-15.png","element":"img"}]]},{"heading":"Appendix J. 
Additional Experimental Results","paragraphs":[[{"id":"id-124","style":{"width":"98%"},"width":1703,"height":712,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-0.png","element":"img"}],[{"text":"Figure 5: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":49.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-1.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"Jordan block with ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"d ","element":"figcaption","subtype":"caption"},{"text":"= 4","element":"figcaption","subtype":"caption"},{"text":", ","element":"figcaption","subtype":"caption"},{"style":{"height":17.6},"width":294.01,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-2.png","element":"img","alt":"ρ(A∗) = 0.9, B∗","inline":true,"padRight":true},{"text":"randomly generated with specified value of ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"p","element":"figcaption","subtype":"caption"}],[{"text":"Figure 6: ","element":"figcaption","subtype":"caption"},{"style":{"height":15.42},"width":49.72,"height":38.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-3.png","element":"img","alt":" A∗","inline":true,"padRight":true},{"text":"diagonalizable by a unitary matrix and has given spectral radius, ","element":"figcaption","subtype":"caption"},{"style":{"height":16},"width":236.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-4.png","element":"img","alt":" p = 4 and B∗","inline":true,"padRight":true},{"text":"randomly generated. 
Dotted lines illustrate the performance of ","element":"figcaption","subtype":"caption"},{"style":{"height":19.13},"width":325.12,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-5.png","element":"img","alt":" ut ∼ N(0, γ2I/p)","inline":true,"padRight":true},{"text":"for each value of ","element":"figcaption","subtype":"caption"},{"style":{"height":12},"width":23,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-6.png","element":"img","alt":" ρ","inline":true}],[{"text":"Figure ","element":"span"},{"href":"#id-124","text":"5 ","element":"a"},{"text":"illustrates how the shape of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"B ","element":"span"},{"text":"can influence the effectiveness of active system identification. With ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p ","element":"span"},{"text":"= 1","element":"span"},{"text":", it is not possible to control the direction of the input, which can greatly reduce the effectiveness of input design. Interestingly, for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p > ","element":"span"},{"text":"1","element":"span"},{"text":", the performance is roughly the same: increasing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p ","element":"span"},{"text":"beyond 2 does not provide a large gain in the effectiveness of input design.","element":"span"}],[{"text":"Figure ","element":"span"},{"href":"#id-124","text":"6 ","element":"a"},{"text":"plots how the estimation rate depends on the spectral radius. Here the performance of our algorithm is plotted as the solid line and the performance of isotropic noise as the dotted line. As our theory predicts, systems with a larger spectral radius are easier to estimate. 
Further, as Corollary ","element":"span"},{"href":"#id-45","text":"3.1 ","element":"a"},{"text":"states, the gap between our algorithm and isotropic noise increases as ","element":"span"},{"style":{"height":12},"width":23,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-7.png","element":"img","alt":" ρ","inline":true,"padRight":true},{"text":"increases: for ","element":"span"},{"style":{"height":15.6},"width":145.68,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-8.png","element":"img","alt":"ρ = 0.2","inline":true,"padRight":true},{"text":"there is almost no gain in designing inputs actively, but as ","element":"span"},{"style":{"height":12},"width":23,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.00495/images/95-9.png","element":"img","alt":" ρ","inline":true,"padRight":true},{"text":"increases, the gains of active ","element":"span"},{"text":"input design also increase.","element":"span"}]]}],"_version":"3.3.2"},"paperNode":"$28:props:children:props:children:0:props:product"}]]