35:[["$","audio",null,{"id":"tts"}],["$","$L3a",null,{"paperID":"1912.11213","publisher":"arxiv","paperJSON":{"title":"Optimal short-term memory before the edge of chaos in driven random recurrent networks","paperID":"1912.11213","avgLineHeight":11.52,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"The ability of discrete-time nonlinear recurrent neural networks to store time-varying small input signals is investigated by mean-field theory. The combination of a small input strength and mean-field assumptions makes it possible to derive an approximate expression for the conditional probability density of the state of a neuron given a past input signal. From this conditional probability density, we can analytically calculate short-term memory measures, such as memory capacity, mutual information, and Fisher information, and determine the relationships among these measures, which have not been clarified to date to the best of our knowledge. We show that the network contribution of these short-term memory measures peaks before the edge of chaos, where the dynamics of input-driven networks is stable but corresponding systems without input signals are unstable.","element":"span"}]]},{"heading":"I. INTRODUCTION","paragraphs":[[{"text":"Natural and artificial high-dimensional nonlinear dynamical systems can be used as resources for real-time computing. By nonlinearly mapping time-varying input signals into a high-dimensional space, the signals can be learned in a supervised manner if the dynamical systems have enough ability to store the signals in their present state and separate different signals ","element":"span"},{"href":"#id-0","referenceIndex":1,"text":"[1, ","element":"a"},{"href":"#id-1","referenceIndex":2,"text":"2]","element":"a"},{"text":". A high computational performance can be achieved by tuning only the weights of linear connections to the output layer while keeping the parameters of the dynamical systems fixed ","element":"span"},{"href":"#id-2","referenceIndex":3,"text":"[3–","element":"a"},{"href":"#id-3","referenceIndex":6,"text":"6]","element":"a"},{"text":". Such dynamical systems called reservoirs can be artificial recurrent neural networks (RNNs) or physical systems, such as optical media ","element":"span"},{"href":"#id-4","referenceIndex":7,"text":"[7, ","element":"a"},{"href":"#id-5","referenceIndex":8,"text":"8]","element":"a"},{"text":", nanoscale magnetization dynamics ","element":"span"},{"href":"#id-6","referenceIndex":9,"text":"[9, ","element":"a"},{"href":"#id-7","referenceIndex":10,"text":"10]","element":"a"},{"text":", soft materials ","element":"span"},{"href":"#id-8","referenceIndex":11,"text":"[11]","element":"a"},{"text":", and quantum systems ","element":"span"},{"href":"#id-9","referenceIndex":12,"text":"[12]","element":"a"},{"text":".","element":"span"}],[{"text":"As mentioned above, a requirement for real-time computing is the ability to memorize past input signals. Such short-term memory of dynamical systems has been studied extensively by assessing a quantity called memory capacity ","element":"span"},{"href":"#id-10","referenceIndex":13,"text":"[13, ","element":"a"},{"href":"#id-11","referenceIndex":14,"text":"14]","element":"a"},{"text":". For input-driven RNNs, it has been suggested that the part of memory capacity representing indirect memory through network takes a maximum value near the edge of chaos, namely, near the critical boundary between the stable and unstable dynamical regimes ","element":"span"},{"href":"#id-12","referenceIndex":15,"text":"[15, ","element":"a"},{"href":"#id-13","referenceIndex":16,"text":"16]","element":"a"},{"text":". Near criticality, different inputs are expected to lead to different states while suppressing the influence of the initial conditions. Hence, it seems reasonable for a dynamical system to be near the critical point for optimal memory capacity. However, it has also been pointed out that the dependence on network parameters is not straightforward based on a systematic numerical simulation ","element":"span"},{"href":"#id-14","referenceIndex":17,"text":"[17]","element":"a"},{"text":".","element":"span"}],[{"text":"For linear RNNs, detailed analytic studies of memory capacity can be performed for both discrete-time ","element":"span"},{"href":"#id-15","referenceIndex":18,"text":"[18, ","element":"a"},{"href":"#id-16","referenceIndex":19,"text":"19] ","element":"a"},{"text":"and continuous-time systems ","element":"span"},{"href":"#id-17","referenceIndex":20,"text":"[20]","element":"a"},{"text":". ","element":"span"},{"text":"The ability to predict future inputs, which is complementary to memory capacity, has also been studied in linear systems with correlated input signals ","element":"span"},{"href":"#id-18","referenceIndex":21,"text":"[21]","element":"a"},{"text":". However, the memory capacity of nonlinear RNNs is difficult to study by analytical methods ","element":"span"},{"href":"#id-19","referenceIndex":22,"text":"[22]","element":"a"},{"text":". Recently, Schuecker et al. ","element":"span"},{"href":"#id-20","referenceIndex":23,"text":"[23] ","element":"a"},{"text":"successfully derived an analytical expression for memory capacity for continuous-time nonlinear RNNs ","element":"span"},{"href":"#id-21","referenceIndex":24,"text":"[24] ","element":"a"},{"text":"in which each neuron is driven by independent input signals following a white-noise Gaussian process. Toyoizumi and Abbott ","element":"span"},{"href":"#id-22","referenceIndex":25,"text":"[25] ","element":"a"},{"text":"analytically calculated the signal-to-noise ratio, which is equivalent to the inverse of memory capacity at the limit of zero input strength, for discrete-time nonlinear RNNs driven by a common time-varying input signal.","element":"span"}],[{"text":"In this paper, we analytically investigate the memory capacity of discrete-time nonlinear RNNs called echo state networks (ESNs) ","element":"span"},{"href":"#id-0","referenceIndex":1,"text":"[1] ","element":"a"},{"text":"by a mean-field theory when the strength of input signals is small but non-zero. The main idea of our approach is that the conditional probability density of the present state of a neuron given a past input signal can be approximately calculated from a functional derivative with respect to past input signals under the assumption of a small input strength. Once we obtain this conditional probability density, it is straightforward to derive the memory capacity and other alternative memory measures, such as mutual information and Fisher information ","element":"span"},{"href":"#id-19","referenceIndex":22,"text":"[22]","element":"a"},{"text":". We show that all three measures of short-term memory through network behave similarly and take a maximum value before the edge of chaos, where the dynamics is stable in the presence of input signals but unstable in the absence of input signals. We also discuss the breakdown of the mean-field theory for calculating memory measures in the ordered regime and show that the linear approximation provides good predictions.","element":"span"}]]},{"heading":"II. RESULTS","paragraphs":[[{"text":"A. ","element":"span"},{"text":"Echo State Networks","element":"span"}],[{"text":"We consider ESNs consisting of ","element":"span"},{"text":"N ","element":"span"},{"text":"artificial neurons. The state of neuron ","element":"span"},{"text":"i ","element":"span"},{"text":"at discrete time step ","element":"span"},{"text":"t ","element":"span"},{"text":"is denoted ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-0.png","element":"img","alt":"xi(t","inline":true},{"text":"). We assume that the time evolution of state ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-1.png","element":"img","alt":" xi(t","inline":true},{"text":") is governed by","element":"span"}],[{"style":{"width":"67%"},"width":660,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-2.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"f ","element":"span"},{"text":"is an activation function. ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-3.png","element":"img","alt":" ai(t","inline":true},{"text":") is the activation potential of neuron ","element":"span"},{"text":"i ","element":"span"},{"text":"at time step ","element":"span"},{"text":"t ","element":"span"},{"text":"given by","element":"span"}],[{"style":{"width":"74%"},"width":735,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-4.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"s","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":") is a time-dependent input signal, ","element":"span"},{"style":{"height":11.5},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-5.png","element":"img","alt":" wij","inline":true,"padRight":true},{"text":"is a time-independent weight of the connection from neuron ","element":"span"},{"text":"j ","element":"span"},{"text":"to neuron ","element":"span"},{"text":"i","element":"span"},{"text":", and ","element":"span"},{"style":{"height":9.1},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-6.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"is a time-independent weight representing the strength of the coupling from the input signal to neuron ","element":"span"},{"text":"i","element":"span"},{"text":". We use the matrix and vector notations ","element":"span"},{"style":{"height":21.14},"width":752.6,"height":52.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-7.png","element":"img","alt":" W := (wij)1≤i,j≤N, u := (u1, u2, . . . , uN)T","inline":true},{"text":", and ","element":"span"},{"style":{"height":17.36},"width":566.36,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-8.png","element":"img","alt":"x(t) := (x1(t), x2(t), . . . , xN(t))T","inline":true},{"text":".","element":"span"}],[{"text":"In the following analytical calculations and numerical simulations, the activation function is assumed to be a sigmoid function satisfying lim","element":"span"},{"style":{"height":16},"width":295,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-9.png","element":"img","alt":"a→±∞ f(a) = ±","inline":true},{"text":"1, ","element":"span"},{"style":{"height":16},"width":231.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-10.png","element":"img","alt":"f(−a) = −a","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":189.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-11.png","element":"img","alt":" f ′(0) = 1.","inline":true,"padRight":true},{"text":"In particular, we adopt","element":"span"}],[{"style":{"width":"99%"},"width":974,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-12.png","element":"img"}],[{"text":"are chosen independently at random from an identical Gaussian distribution with mean zero and variance ","element":"span"},{"style":{"height":17.36},"width":92.16,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-13.png","element":"img","alt":" g2/N","inline":true},{"text":", where ","element":"span"},{"style":{"height":16.56},"width":83.8,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-14.png","element":"img","alt":" g2 >","inline":true,"padRight":true},{"text":"0 is a control parameter. For simplicity, ","element":"span"},{"style":{"height":9.1},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-15.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"are assumed to be independent variables taking ","element":"span"},{"style":{"height":10.8},"width":31,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-16.png","element":"img","alt":" ±","inline":true},{"text":"1 with probability ","element":"span"},{"style":{"height":19.31},"width":16,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-17.png","element":"img","alt":"12","inline":true},{"text":". Since our primary concern is the memory ","element":"span"},{"text":"capacity of ESNs, we consider an independent and identically distributed Gaussian input signal ","element":"span"},{"text":"s","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":") with mean zero and variance ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-18.png","element":"img","alt":" s2","inline":true},{"text":". All numerical results in this paper were obtained in the following way unless otherwise stated. ","element":"span"},{"text":"We simulated ESNs with ","element":"span"},{"text":"N ","element":"span"},{"text":"= 1000 artificial neurons over 40 trials. For a single trial, each quantity (for example, stationary variance of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-19.png","element":"img","alt":" xi(t","inline":true},{"text":")) was calculated from its values over 10","element":"span"},{"style":{"height":8},"width":16,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-20.png","element":"img","alt":"5","inline":true,"padRight":true},{"text":"time steps after discarding the initial 10","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-21.png","element":"img","alt":"4","inline":true,"padRight":true},{"text":"time steps. Then, averages were obtained over all artificial neurons and all trials. All infinite sums appearing in the following sections were evaluated by truncation at the 500-th term.","element":"span"}],[{"text":"The mean-field theory of ESNs ","element":"span"},{"href":"#id-22","referenceIndex":25,"text":"[25–","element":"a"},{"href":"#id-23","referenceIndex":28,"text":"28] ","element":"a"},{"text":"makes it possible to calculate the stationary variance of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-22.png","element":"img","alt":" xi(t","inline":true},{"text":") and the largest Lyapunov exponent in the limit ","element":"span"},{"style":{"height":11.2},"width":143.2,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-23.png","element":"img","alt":" N → ∞","inline":true},{"text":". It assumes that ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-24.png","element":"img","alt":" xi(t","inline":true},{"text":") are independent and identically distributed random variables. ","element":"span"},{"text":"They are also assumed to be independent of ","element":"span"},{"style":{"height":11.5},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-25.png","element":"img","alt":" wij","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.1},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-26.png","element":"img","alt":" ui","inline":true},{"text":". This assumption can be","element":"span"}],[{"style":{"width":"92%"},"width":902,"height":607,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-27.png","element":"img"}],[{"text":"FIG. 1. ","element":"figcaption","subtype":"caption"},{"id":"id-27","text":"Numerical results (marks) and mean-field predic- ","element":"figcaption","subtype":"caption"},{"text":"tions (solid lines) for the stationary variance of ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":61,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-28.png","element":"img","alt":" xi(t","inline":true},{"text":") are shown as functions of ","element":"figcaption","subtype":"caption"},{"style":{"height":16.14},"width":484.72,"height":40.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-29.png","element":"img","alt":" g2 for s2 = 0.01 (red), s2 = 0.","inline":true},{"text":"02 (green), and ","element":"figcaption","subtype":"caption"},{"style":{"height":12.54},"width":111.28,"height":31.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-30.png","element":"img","alt":"s2 = 0.","inline":true},{"text":"04 (blue). Vertical broken lines indicate the mean-field predictions of the critical value of ","element":"figcaption","subtype":"caption"},{"style":{"height":15.15},"width":34.2,"height":37.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-31.png","element":"img","alt":" g2 ","inline":true,"padRight":true},{"text":"obtained from Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-24","text":"(4) ","element":"a","subtype":"caption"},{"text":"for ","element":"figcaption","subtype":"caption"},{"style":{"height":15.95},"width":979.44,"height":39.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-32.png","element":"img","alt":"s2 = 0 (g2 = 1, black), s2 = 0.01 (g2 ≈ 1.39, red), s2 = 0.02","inline":true,"padRight":true},{"text":"(","element":"figcaption","subtype":"caption"},{"style":{"height":15.35},"width":112.72,"height":38.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-33.png","element":"img","alt":"g2 ≈ 1.","inline":true},{"text":"50, green), and ","element":"figcaption","subtype":"caption"},{"style":{"height":16.15},"width":439.6,"height":40.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-34.png","element":"img","alt":" s2 = 0.04 (g2 ≈ 1.64, blue).","inline":true}],[{"text":"justified in the limit ","element":"span"},{"style":{"height":11.2},"width":138.4,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-35.png","element":"img","alt":" N → ∞","inline":true,"padRight":true},{"text":"when there is no input signal. Since ","element":"span"},{"text":"f ","element":"span"},{"text":"is odd, we can self-consistently assume that the mean of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-36.png","element":"img","alt":" xi(t","inline":true},{"text":") is equal to zero. Let ","element":"span"},{"style":{"height":17.36},"width":278.08,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-37.png","element":"img","alt":" σ2(t) = ⟨xi(t)2⟩","inline":true,"padRight":true},{"text":"be the variance of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-38.png","element":"img","alt":" xi(t","inline":true},{"text":"), where ","element":"span"},{"style":{"height":16},"width":84.63,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-39.png","element":"img","alt":" ⟨· · · ⟩","inline":true,"padRight":true},{"text":"indicates the average over trials with the same ","element":"span"},{"style":{"height":11.5},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-40.png","element":"img","alt":" wij","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.1},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-41.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"but possibly different realizations of the input signal ","element":"span"},{"text":"s","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":") and initial conditions. By the central limit theorem, ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-42.png","element":"img","alt":" ai(t","inline":true},{"text":") follows a Gaussian distribution with mean zero and variance ","element":"span"},{"style":{"height":17.36},"width":201.76,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-43.png","element":"img","alt":"g2σ2(t)+s2","inline":true,"padRight":true},{"text":"+","element":"span"},{"style":{"height":19.58},"width":126.8,"height":48.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-44.png","element":"img","alt":"O(N − 12","inline":true,"padRight":true},{"text":"), where we use ","element":"span"},{"style":{"height":17.58},"width":324.79,"height":43.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-45.png","element":"img","alt":" u2i = 1. Neglecting","inline":true,"padRight":true},{"text":"the ","element":"span"},{"style":{"height":19.78},"width":127.28,"height":49.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-46.png","element":"img","alt":" O(N − 12","inline":true,"padRight":true},{"text":") term, the variance of ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-47.png","element":"img","alt":" ai(t","inline":true},{"text":") does not depend on specific realizations of ","element":"span"},{"style":{"height":11.51},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-48.png","element":"img","alt":" wij","inline":true,"padRight":true},{"href":"#id-22","referenceIndex":25,"text":"[25]","element":"a"},{"text":". In the following, we omit quantities that approach 0 as ","element":"span"},{"style":{"height":11.2},"width":145.12,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-49.png","element":"img","alt":" N → ∞","inline":true,"padRight":true},{"text":"unless otherwise stated. Consequently, ","element":"span"},{"style":{"height":17.36},"width":71.6,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-50.png","element":"img","alt":" σ2(t","inline":true},{"text":") follows the following recurrence equation ","element":"span"},{"href":"#id-25","referenceIndex":27,"text":"[27]","element":"a"},{"text":":","element":"span"}],[{"id":"id-26","style":{"width":"88%"},"width":864,"height":229,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-51.png","element":"img"}],[{"text":"where Σ","element":"span"},{"style":{"height":17.36},"width":251.6,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-52.png","element":"img","alt":"2(t) = g2σ2(t","inline":true},{"text":") + ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-53.png","element":"img","alt":" s2","inline":true},{"text":". ","element":"span"},{"text":"By numerically solving Eq. ","element":"span"},{"href":"#id-26","text":"(3) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":17.36},"width":403.36,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-54.png","element":"img","alt":" σ2(t + 1) = σ2(t) = σ2","inline":true},{"text":", we can obtain the mean-field prediction of the stationary variance of ","element":"span"},{"style":{"height":16},"width":65.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-55.png","element":"img","alt":" xi(t","inline":true},{"text":"). We write Σ","element":"span"},{"style":{"height":16.56},"width":162.4,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-56.png","element":"img","alt":"2 = g2σ2","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-57.png","element":"img","alt":" s2","inline":true},{"text":". These values of ","element":"span"},{"style":{"height":13.36},"width":40,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-58.png","element":"img","alt":" σ2","inline":true,"padRight":true},{"text":"and Σ","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-59.png","element":"img","alt":"2","inline":true,"padRight":true},{"text":"have been used for plotting theoretical results later in the paper.","element":"span"}],[{"text":"The largest Lyapunov exponent derived from the mean-field theory is ","element":"span"},{"href":"#id-22","referenceIndex":25,"text":"[25, ","element":"a"},{"href":"#id-25","referenceIndex":27,"text":"27, ","element":"a"},{"href":"#id-23","referenceIndex":28,"text":"28]","element":"a"}],[{"id":"id-24","style":{"width":"87%"},"width":857,"height":263,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/1-60.png","element":"img"}],[{"text":"Here, ","element":"span"},{"style":{"height":11.6},"width":75.64,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-0.png","element":"img","alt":" λ >","inline":true,"padRight":true},{"text":"0 indicates that a small perturbation to a state of the system leads to exponential growth, while ","element":"span"},{"style":{"height":11.6},"width":67.48,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-1.png","element":"img","alt":"λ <","inline":true,"padRight":true},{"text":"0 implies that the perturbation eventually becomes undetectable. The dynamics is called chaotic or unstable in the former case and called ordered or stable in the latter. ","element":"span"},{"text":"When an input signal is absent, the boundary between chaos and stability ","element":"span"},{"style":{"height":16.56},"width":493.88,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-2.png","element":"img","alt":" λ = 0 corresponds to g2 = 1.","inline":true,"padRight":true},{"text":"The presence of input signals shifts the boundary towards the chaotic side ","element":"span"},{"href":"#id-25","referenceIndex":27,"text":"[27, ","element":"a"},{"href":"#id-23","referenceIndex":28,"text":"28]","element":"a"},{"text":".","element":"span"}],[{"text":"In Fig. ","element":"span"},{"href":"#id-27","text":"1, ","element":"a"},{"text":"we confirm that the mean-field prediction of the stationary variance of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-3.png","element":"img","alt":" xi(t","inline":true},{"text":") agrees well with the result obtained by numerical simulations. ","element":"span"},{"text":"We note that the difference is negligible, even for the input-driven regime ","element":"span"},{"style":{"height":16.56},"width":92.32,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-4.png","element":"img","alt":"g2 ≪","inline":true,"padRight":true},{"text":"1 where the mean-field assumption is expected to be violated. This will be explained when we discuss the breakdown of the mean-field theory in the calculation of memory capacity.","element":"span"}],[{"text":"B. ","element":"span"},{"text":"Memory Capacity","element":"span"}],[{"text":"The memory capacity of an ESN is defined as the quality of the optimal linear estimator of the past input ","element":"span"},{"style":{"height":16},"width":118.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-5.png","element":"img","alt":"s(t − n","inline":true},{"text":") using the present state of neurons ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-6.png","element":"img","alt":" xi(t","inline":true},{"text":"). Following previous work ","element":"span"},{"href":"#id-20","referenceIndex":23,"text":"[23, ","element":"a"},{"href":"#id-22","referenceIndex":25,"text":"25]","element":"a"},{"text":", we assume a sparse readout, namely, there are ","element":"span"},{"text":"K ","element":"span"},{"text":"= ","element":"span"},{"text":"O","element":"span"},{"text":"(1) readout neurons 1 ","element":"span"},{"style":{"height":13.2},"width":147.84,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-7.png","element":"img","alt":" ≤ i ≤ K","inline":true,"padRight":true},{"text":"and consider linear readout ˆ","element":"span"},{"style":{"height":20.46},"width":326,"height":51.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-8.png","element":"img","alt":"s(t) = �Ki=1 vixi(t","inline":true},{"text":"). Given ","element":"span"},{"text":"time-delay ","element":"span"},{"text":"n ","element":"span"},{"text":"(","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . .","element":"span"},{"text":"), the weights ","element":"span"},{"style":{"height":9.1},"width":30.2,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-9.png","element":"img","alt":" vi","inline":true,"padRight":true},{"text":"are determined by minimizing the mean squared error between ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-10.png","element":"img","alt":"s(t − n","inline":true},{"text":") and ˆ","element":"span"},{"text":"s","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":") over a sufficiently long time period ","element":"span"},{"text":"T ","element":"span"},{"text":". ","element":"span"},{"text":"The optimal mean squared error as a function of time-delay ","element":"span"},{"text":"n ","element":"span"},{"text":"is called the memory function and is given by ","element":"span"},{"href":"#id-10","referenceIndex":13,"text":"[13, ","element":"a"},{"href":"#id-11","referenceIndex":14,"text":"14]","element":"a"}],[{"style":{"width":"73%"},"width":717,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-11.png","element":"img"}],[{"id":"id-28","style":{"height":13.1},"width":102.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-12.png","element":"img","alt":"Mn =","inline":true}],[{"style":{"width":"19%"},"width":194,"height":15,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-13.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.51},"width":44.68,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-14.png","element":"img","alt":" dij","inline":true,"padRight":true},{"text":"is the (","element":"span"},{"text":"i, j","element":"span"},{"text":")-th element of the matrix ","element":"span"},{"style":{"height":13.36},"width":72.16,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-15.png","element":"img","alt":" C−1","inline":true},{"text":", which is the inverse of the matrix ","element":"span"},{"style":{"height":19.77},"width":472,"height":49.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-16.png","element":"img","alt":" C = (⟨xi(t)xj(t)⟩T )1≤i,j≤K","inline":true},{"text":", and ","element":"span"},{"style":{"height":16},"width":107,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-17.png","element":"img","alt":" ⟨· · · ⟩T","inline":true,"padRight":true},{"text":"indicates the time average over the period of length ","element":"span"},{"text":"T ","element":"span"},{"text":". The memory capacity is defined as the sum of","element":"span"}],[{"style":{"width":"100%"},"width":981,"height":689,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-18.png","element":"img"}],[{"style":{"height":13.1},"width":58.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-19.png","element":"img","alt":"Mn","inline":true,"padRight":true},{"text":"over all time-delays ","element":"span"},{"text":"n","element":"span"},{"text":":","element":"span"}],[{"style":{"width":"61%"},"width":603,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-20.png","element":"img"}],[{"text":"It is known that ","element":"span"},{"style":{"height":13.2},"width":132,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-21.png","element":"img","alt":" M ≤ K","inline":true,"padRight":true},{"text":"holds ","element":"span"},{"href":"#id-10","referenceIndex":13,"text":"[13, ","element":"a"},{"href":"#id-11","referenceIndex":14,"text":"14]","element":"a"},{"text":".","element":"span"}],[{"text":"To compute ","element":"span"},{"style":{"height":13.1},"width":58.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-22.png","element":"img","alt":" Mn","inline":true,"padRight":true},{"text":"by the mean-field theory, we replace the time average ","element":"span"},{"style":{"height":16},"width":107,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-23.png","element":"img","alt":" ⟨· · · ⟩T","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":11.2},"width":133.6,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-24.png","element":"img","alt":" T → ∞","inline":true,"padRight":true},{"text":"by the average over trials ","element":"span"},{"style":{"height":16},"width":84.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-25.png","element":"img","alt":" ⟨· · · ⟩","inline":true,"padRight":true},{"text":"in stationary states. ","element":"span"},{"text":"Since ","element":"span"},{"style":{"height":16.7},"width":197.44,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-26.png","element":"img","alt":" ⟨xi(t)xj(t)⟩","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":15.2},"width":94.28,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-27.png","element":"img","alt":"i ̸= j","inline":true,"padRight":true},{"text":"vanishes as ","element":"span"},{"style":{"height":11.2},"width":148.96,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-28.png","element":"img","alt":" N → ∞","inline":true},{"text":", the contribution of the off-diagonal terms of ","element":"span"},{"style":{"height":13.36},"width":72.16,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-29.png","element":"img","alt":" C−1","inline":true,"padRight":true},{"text":"to Eq. ","element":"span"},{"href":"#id-28","text":"(5) ","element":"a"},{"text":"can be neglected for ","element":"span"},{"style":{"height":11.2},"width":149.92,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-30.png","element":"img","alt":"N → ∞","inline":true,"padRight":true},{"text":"by the sparse readout assumption ","element":"span"},{"text":"K ","element":"span"},{"text":"= ","element":"span"},{"text":"O","element":"span"},{"text":"(1). Thus, in the mean-field calculation, Eq. ","element":"span"},{"href":"#id-28","text":"(5) ","element":"a"},{"text":"is just ","element":"span"},{"text":"K ","element":"span"},{"text":"times ","element":"span"},{"style":{"height":13.1},"width":58.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-31.png","element":"img","alt":" Mn","inline":true,"padRight":true},{"text":"for ","element":"span"},{"text":"K ","element":"span"},{"text":"= 1. Since","element":"span"}],[{"id":"id-38","style":{"width":"69%"},"width":680,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-32.png","element":"img"}],[{"text":"when ","element":"span"},{"text":"K ","element":"span"},{"text":"= 1, the task to obtain ","element":"span"},{"text":"M ","element":"span"},{"text":"reduces to calculating ","element":"span"},{"style":{"height":16},"width":257.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-33.png","element":"img","alt":"⟨xi(t)s(t − n)⟩","inline":true,"padRight":true},{"text":"for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . .","element":"span"},{"text":". ","element":"span"},{"text":"In the following, we perform the calculation by assuming that the strength of the input signal is small, namely, ","element":"span"},{"style":{"height":14.56},"width":87.52,"height":36.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-34.png","element":"img","alt":" s2 ≪","inline":true,"padRight":true},{"text":"1.","element":"span"}],[{"text":"The main idea to calculate ","element":"span"},{"style":{"height":16},"width":248.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-35.png","element":"img","alt":" ⟨xi(t)s(t − n)⟩","inline":true,"padRight":true},{"text":"is that conditioning of ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-36.png","element":"img","alt":" ai(t","inline":true},{"text":") on ","element":"span"},{"style":{"height":16},"width":128.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-37.png","element":"img","alt":" s(t − n","inline":true},{"text":") (","element":"span"},{"text":"n ","element":"span"},{"text":"= 0","element":"span"},{"text":", ","element":"span"},{"text":"1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . .","element":"span"},{"text":") can be regarded as a small perturbation to dependence of ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-38.png","element":"img","alt":" ai(t","inline":true},{"text":") on ","element":"span"},{"style":{"height":16},"width":127.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-39.png","element":"img","alt":" s(t − n","inline":true},{"text":"), when ","element":"span"},{"style":{"height":14.56},"width":95.68,"height":36.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-40.png","element":"img","alt":" s2 ≪","inline":true,"padRight":true},{"text":"1. ","element":"span"},{"text":"We assume that ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-41.png","element":"img","alt":" xi(t","inline":true},{"text":") are independent and identically distributed and are also independent of ","element":"span"},{"style":{"height":11.5},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-42.png","element":"img","alt":" wij","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.1},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-43.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"even after conditioning. By the central limit theorem, ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-44.png","element":"img","alt":" ai(t","inline":true},{"text":") conditioned on ","element":"span"},{"style":{"height":16},"width":214.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-45.png","element":"img","alt":" s(t − n) = c","inline":true,"padRight":true},{"text":"follows a Gaussian distribution in the limit of large ","element":"span"},{"text":"N","element":"span"},{"text":". Hence, it is sufficient to calculate its mean ","element":"span"},{"style":{"height":11.5},"width":88.28,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-46.png","element":"img","alt":" µn,c,i","inline":true,"padRight":true},{"text":"and variance Σ","element":"span"},{"style":{"height":19.98},"width":40.28,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-47.png","element":"img","alt":"2n,i","inline":true,"padRight":true},{"text":"to determine the conditional probability ","element":"span"},{"text":"density ","element":"span"},{"style":{"height":17.68},"width":266.16,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-48.png","element":"img","alt":" Pai(t)|s(t−n)(a|c","inline":true},{"text":").","element":"span"}],[{"text":"First, we calculate the mean of ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-49.png","element":"img","alt":" ai(t","inline":true},{"text":") given ","element":"span"},{"style":{"height":16},"width":182.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-50.png","element":"img","alt":" s(t − n) =","inline":true,"padRight":true},{"text":"c","element":"span"},{"text":". ","element":"span"},{"text":"We regard ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-51.png","element":"img","alt":" ai(t","inline":true},{"text":") as a functional of stochastic variables ","element":"span"},{"style":{"height":16},"width":177.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-52.png","element":"img","alt":" s(t), s(t −","inline":true,"padRight":true},{"text":"1)","element":"span"},{"style":{"height":16},"width":399.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-53.png","element":"img","alt":", . . . , s(t − n), x(t − n","inline":true},{"text":"). ","element":"span"},{"text":"We write ","element":"span"},{"style":{"height":16},"width":455.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-54.png","element":"img","alt":"ai(t) = Fi [s0:n(t), x(t − n","inline":true},{"text":")], where ","element":"span"},{"style":{"height":16},"width":356.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-55.png","element":"img","alt":" s0:n(t) = (s(t), s(t −","inline":true,"padRight":true},{"text":"1)","element":"span"},{"style":{"height":16},"width":209.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-56.png","element":"img","alt":", . . . , s(t − n","inline":true},{"text":")). We consider a norm defined by the average over trials ","element":"span"},{"style":{"height":19.2},"width":391.83,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-57.png","element":"img","alt":" ⟨· · · ⟩ (∥X∥ :=�⟨X2⟩","inline":true,"padRight":true},{"text":"for a stochastic variable ","element":"span"},{"text":"X","element":"span"},{"text":"). Conditioning of ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-58.png","element":"img","alt":" ai(t","inline":true},{"text":") on ","element":"span"},{"style":{"height":16},"width":206.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-59.png","element":"img","alt":" s(t − n) = c","inline":true,"padRight":true},{"text":"corresponds to replacing argument ","element":"span"},{"style":{"height":16},"width":119.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-60.png","element":"img","alt":" s(t − n","inline":true},{"text":") with the constant stochastic variable ","element":"span"},{"text":"c","element":"span"},{"text":". If ","element":"span"},{"style":{"height":16.56},"width":140.32,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-61.png","element":"img","alt":" c2, s2 ≪","inline":true,"padRight":true},{"text":"1, then ","element":"span"},{"style":{"height":17.36},"width":285.4,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-62.png","element":"img","alt":" ∥c−s(t−n)∥2 =","inline":true},{"style":{"height":17.55},"width":431.2,"height":43.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-63.png","element":"img","alt":"⟨c−s(t−n)⟩2 = c2+s2 ≪","inline":true,"padRight":true},{"text":"1. Thus, ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-64.png","element":"img","alt":" ai(t","inline":true},{"text":") given ","element":"span"},{"style":{"height":16},"width":192.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-65.png","element":"img","alt":" s(t−n) = c","inline":true,"padRight":true},{"text":"can be approximated by the following:","element":"span"}],[{"style":{"height":16},"width":166.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-66.png","element":"img","alt":"Fi [s0:n−1","inline":true},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":")","element":"span"},{"text":", c, ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-67.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"n","element":"span"},{"text":")] ","element":"span"},{"style":{"height":16},"width":167.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-68.png","element":"img","alt":" ≃ Fi [s0:n","inline":true},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":")","element":"span"},{"text":", ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-69.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"n","element":"span"},{"text":")] + ","element":"span"},{"text":"δ","element":"span"},{"style":{"height":16},"width":125.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-70.png","element":"img","alt":"Fi [s0:n","inline":true},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":")","element":"span"},{"text":", ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-71.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"n","element":"span"},{"text":")]","element":"span"},{"text":"δs","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-72.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"n","element":"span"},{"text":") ","element":"span"},{"text":"(","element":"span"},{"text":"c ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-73.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"s","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-74.png","element":"img","alt":" −","inline":true,"padRight":true},{"text":"n","element":"span"},{"text":"))","element":"span"}],[{"id":"id-30","style":{"width":"65%"},"width":1325,"height":177,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-75.png","element":"img"}],[{"text":"By taking the average over trials, we have","element":"span"}],[{"style":{"width":"94%"},"width":922,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/2-76.png","element":"img"}],[{"style":{"width":"88%"},"width":1808,"height":635,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-0.png","element":"img"}],[{"text":"FIG. 2. ","element":"figcaption","subtype":"caption"},{"id":"id-34","text":"Conditional probability density of the activation potential given a past input. Theoretical predictions and numerical ","element":"figcaption","subtype":"caption"},{"text":"results are compared for (a) the mean of ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":382,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-1.png","element":"img","alt":" a1(t) given s(t − n) = 0.","inline":true},{"text":"1 and (b) the conditional probability density ","element":"figcaption","subtype":"caption"},{"style":{"height":15.51},"width":271.12,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-2.png","element":"img","alt":" Pa1(t)|s(t−9)(a|c).","inline":true,"padRight":true},{"text":"We set ","element":"figcaption","subtype":"caption"},{"style":{"height":12.35},"width":111.28,"height":30.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-3.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01 and use single specific realizations of ","element":"figcaption","subtype":"caption"},{"text":"W ","element":"figcaption","subtype":"caption"},{"text":"and ","element":"figcaption","subtype":"caption"},{"text":"u","element":"figcaption","subtype":"caption"},{"text":".","element":"figcaption","subtype":"caption"}],[{"text":"where","element":"span"}],[{"id":"id-35","style":{"width":"66%"},"width":649,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-4.png","element":"img"}],[{"text":"Since ","element":"span"},{"text":"f ","element":"span"},{"text":"is an odd function, we have (Appendix ","element":"span"},{"href":"#id-29","text":"B)","element":"a"}],[{"id":"id-31","style":{"width":"71%"},"width":703,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-5.png","element":"img"}],[{"text":"From Eqs. ","element":"span"},{"href":"#id-30","text":"(10) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-31","text":"(12)","element":"a"},{"text":", the mean-field theory predicts","element":"span"}],[{"id":"id-36","style":{"width":"64%"},"width":633,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-6.png","element":"img"}],[{"text":"when ","element":"span"},{"style":{"height":16.56},"width":140.32,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-7.png","element":"img","alt":" c2, s2 ≪","inline":true,"padRight":true},{"text":"1.","element":"span"}],[{"text":"Second, the variance of ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-8.png","element":"img","alt":" ai(t","inline":true},{"text":") given ","element":"span"},{"style":{"height":16},"width":226.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-9.png","element":"img","alt":" s(t − n) = c","inline":true,"padRight":true},{"text":"can be obtained as follows. The variance of ","element":"span"},{"style":{"height":16},"width":63.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-10.png","element":"img","alt":" ai(t","inline":true},{"text":") can be expressed as","element":"span"}],[{"style":{"width":"93%"},"width":919,"height":166,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-11.png","element":"img"}],[{"text":"Thus, we have","element":"span"}],[{"id":"id-33","style":{"width":"70%"},"width":688,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-12.png","element":"img"}],[{"text":"Note that the population variance of (","element":"span"},{"style":{"height":20.46},"width":112.48,"height":51.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-13.png","element":"img","alt":"V nu)2i","inline":true,"padRight":true},{"text":"takes a ","element":"span"},{"text":"nonzero finite value even in the limit of large ","element":"span"},{"text":"N","element":"span"},{"text":", as we will see in the linear case (Appendix ","element":"span"},{"href":"#id-32","text":"E) ","element":"a"},{"text":"when we discuss the breakdown of the mean-field theory in the ordered regime. This implies that the value of Σ","element":"span"},{"style":{"height":19.79},"width":40.28,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-14.png","element":"img","alt":"2n,i","inline":true,"padRight":true},{"text":"depends on ","element":"span"},{"text":"i ","element":"span"},{"text":"or, equivalently, realizations of ","element":"span"},{"text":"W ","element":"span"},{"text":"and ","element":"span"},{"text":"u","element":"span"},{"text":", even after discarding the ","element":"span"},{"style":{"height":19.78},"width":127.28,"height":49.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-15.png","element":"img","alt":" O(N − 12","inline":true,"padRight":true},{"text":") term. Another related remark is that Eq. ","element":"span"},{"href":"#id-33","text":"(15) ","element":"a"},{"text":"holds only in the limit of small ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-16.png","element":"img","alt":" s2","inline":true},{"text":". Otherwise, the right-hand side may become negative even in the limit of large ","element":"span"},{"text":"N","element":"span"},{"text":", since (","element":"span"},{"style":{"height":16.8},"width":107.48,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-17.png","element":"img","alt":"V nu)i","inline":true,"padRight":true},{"text":"follows a Gaussian distribution with a variance of ","element":"span"},{"text":"O","element":"span"},{"text":"(1) and thus can take an arbitrarily large value.","element":"span"}],[{"text":"In Fig. ","element":"span"},{"href":"#id-34","text":"2, ","element":"a"},{"text":"the mean of ","element":"span"},{"style":{"height":16},"width":68.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-18.png","element":"img","alt":" a1(t","inline":true},{"text":") given ","element":"span"},{"style":{"height":16},"width":229.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-19.png","element":"img","alt":" s(t − n) = 0.","inline":true},{"text":"1 and the conditional probability density ","element":"span"},{"style":{"height":17.68},"width":265.2,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-20.png","element":"img","alt":" Pa1(t)|s(t−9)(a|c","inline":true},{"text":") are shown for single specific realizations of ","element":"span"},{"text":"W ","element":"span"},{"text":"and ","element":"span"},{"text":"u","element":"span"},{"text":". Here, we set ","element":"span"},{"style":{"height":13.36},"width":137.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-21.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01. ","element":"span"},{"text":"The numerical results are obtained by first generating a single orbit of length 10","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-22.png","element":"img","alt":"6","inline":true,"padRight":true},{"text":"time steps after discarding the initial 10","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-23.png","element":"img","alt":"4","inline":true,"padRight":true},{"text":"time steps and then sampling the value of ","element":"span"},{"style":{"height":16},"width":68.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-24.png","element":"img","alt":" a1(t","inline":true},{"text":") with 0","element":"span"},{"style":{"height":16},"width":331.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-25.png","element":"img","alt":".1 ≤ s(t − n) < 0.","inline":true},{"text":"11 for each ","element":"span"},{"text":"n","element":"span"},{"text":". The theoretical values for ","element":"span"},{"style":{"height":11.5},"width":120.16,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-26.png","element":"img","alt":" µn,0.1,1","inline":true,"padRight":true},{"text":"and Σ","element":"span"},{"style":{"height":19.79},"width":45.28,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-27.png","element":"img","alt":"2n,1","inline":true,"padRight":true},{"text":"are ","element":"span"},{"text":"calculated from Eqs. ","element":"span"},{"href":"#id-35","text":"(11)","element":"a"},{"text":", ","element":"span"},{"href":"#id-36","text":"(13)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-33","text":"(15)","element":"a"},{"text":", where ","element":"span"},{"text":"W ","element":"span"},{"text":"and ","element":"span"},{"text":"u ","element":"span"},{"text":"are the same as those used in the numerical simulation. We can see that the numerical results and the theoretical predictions agree well.","element":"span"}],[{"text":"Using Eqs. ","element":"span"},{"href":"#id-36","text":"(13) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-33","text":"(15)","element":"a"},{"text":", we obtain (Appendix ","element":"span"},{"href":"#id-37","text":"C)","element":"a"}],[{"id":"id-39","style":{"width":"76%"},"width":748,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-28.png","element":"img"}],[{"text":"for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . . ","element":"span"},{"text":". From Eqs. ","element":"span"},{"href":"#id-38","text":"(7) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-39","text":"(16)","element":"a"},{"text":", we have","element":"span"}],[{"style":{"width":"67%"},"width":662,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-29.png","element":"img"}],[{"text":"for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . . ","element":"span"},{"text":". The population average of ","element":"span"},{"style":{"height":13.11},"width":58.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-30.png","element":"img","alt":" Mn","inline":true},{"text":", which is equivalent to the average over realizations of ","element":"span"},{"text":"W ","element":"span"},{"text":"and ","element":"span"},{"text":"u","element":"span"},{"text":", is (Appendix ","element":"span"},{"href":"#id-40","text":"D)","element":"a"}],[{"id":"id-52","style":{"width":"78%"},"width":768,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-31.png","element":"img"}],[{"text":"where","element":"span"}],[{"id":"id-41","style":{"width":"61%"},"width":605,"height":101,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-32.png","element":"img"}],[{"text":"The population average of ","element":"span"},{"text":"M ","element":"span"},{"text":"is","element":"span"}],[{"style":{"width":"80%"},"width":793,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/3-33.png","element":"img"}],[{"style":{"width":"87%"},"width":861,"height":578,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-0.png","element":"img"}],[{"text":"FIG. 3. ","element":"figcaption","subtype":"caption"},{"id":"id-42","text":"Memory capacity and network memory capacity of ESNs. Theoretical predictions and numerical results are compared ","element":"figcaption","subtype":"caption"},{"text":"for the population averages of (a) memory function ","element":"figcaption","subtype":"caption"},{"style":{"height":11.54},"width":53.52,"height":28.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-1.png","element":"img","alt":" Mn","inline":true},{"text":", (b) memory capacity ","element":"figcaption","subtype":"caption"},{"text":"M","element":"figcaption","subtype":"caption"},{"text":", and (c) network memory capacity ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":154.16,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-2.png","element":"img","alt":" Mnet. (d)","inline":true,"padRight":true},{"text":"Mean-field lines for the effective measurement of the nonlinear response ","element":"figcaption","subtype":"caption"},{"text":"r ","element":"figcaption","subtype":"caption"},{"text":"(Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-41","text":"(19)","element":"a","subtype":"caption"},{"text":"). In (a), we set ","element":"figcaption","subtype":"caption"},{"style":{"height":12.54},"width":117.04,"height":31.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-3.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01. The vertical broken lines indicate the transition point between the ordered and chaotic regimes for the value of ","element":"figcaption","subtype":"caption"},{"style":{"height":12.35},"width":32.28,"height":30.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-4.png","element":"img","alt":" s2 ","inline":true,"padRight":true},{"text":"with the same color.","element":"figcaption","subtype":"caption"}],[{"style":{"width":"88%"},"width":1807,"height":641,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-5.png","element":"img"}],[{"text":"FIG. 4. ","element":"figcaption","subtype":"caption"},{"id":"id-43","text":"Linear approximation of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.54},"width":50.52,"height":28.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-6.png","element":"img","alt":" M1","inline":true},{"text":". (a) Numerical values of ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":120.96,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-7.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] for different system sizes ","element":"figcaption","subtype":"caption"},{"text":"N","element":"figcaption","subtype":"caption"},{"text":". (b) Comparison among numerical results, linear approximation, and mean-field theory for ","element":"figcaption","subtype":"caption"},{"style":{"height":16.15},"width":365.68,"height":40.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-8.png","element":"img","alt":" E [M1]. We set s2 = 0.","inline":true},{"text":"01 in both panels.","element":"figcaption","subtype":"caption"}],[{"text":"M ","element":"span"},{"text":"can be decomposed into two parts ","element":"span"},{"href":"#id-14","referenceIndex":17,"text":"[17, ","element":"a"},{"href":"#id-20","referenceIndex":23,"text":"23]","element":"a"},{"text":": the direct memory ","element":"span"},{"style":{"height":13.1},"width":54.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-9.png","element":"img","alt":" M1","inline":true,"padRight":true},{"text":"and the indirect memory through network ","element":"span"},{"style":{"height":13.1},"width":286.72,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-10.png","element":"img","alt":"Mnet := M −M1","inline":true},{"text":". We call the latter the ","element":"span"},{"text":"network memory","element":"span"}],[{"text":"capacity","element":"span"},{"text":". The population average of the latter is","element":"span"}],[{"style":{"width":"79%"},"width":784,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-11.png","element":"img"}],[{"text":"Figure ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"(a) shows the population average of ","element":"span"},{"style":{"height":13.11},"width":58.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/4-12.png","element":"img","alt":" Mn","inline":true,"padRight":true},{"text":"for","element":"span"}],[{"text":"different values of ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-0.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":13.36},"width":136.76,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-1.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01. The population averages of ","element":"span"},{"text":"M ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":13.11},"width":83.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-2.png","element":"img","alt":" Mnet","inline":true,"padRight":true},{"text":"are shown in Fig. ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"(b) and (c), respectively. ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-3.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] peaks in the range 1 ","element":"span"},{"style":{"height":17.2},"width":185.44,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-4.png","element":"img","alt":" < g2 < g2∗","inline":true},{"text":", ","element":"span"},{"text":"where ","element":"span"},{"style":{"height":17.39},"width":36.64,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-5.png","element":"img","alt":" g2∗","inline":true,"padRight":true},{"text":"is the value of ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-6.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"such that ","element":"span"},{"style":{"height":11.2},"width":313.12,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-7.png","element":"img","alt":" λ = 0. The exact","inline":true,"padRight":true},{"text":"location of the maximum point depends on the value of ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-8.png","element":"img","alt":"s2","inline":true,"padRight":true},{"text":"and shifts to a larger value as ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-9.png","element":"img","alt":" s2","inline":true,"padRight":true},{"text":"increases. In the mean-field theory, ","element":"span"},{"style":{"height":16},"width":130.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-10.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] is given as the product between ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"M","element":"span"},{"text":"] and ","element":"span"},{"text":"r","element":"span"},{"text":". Hence, its qualitative behavior can be understood from those of ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"M","element":"span"},{"text":"] and ","element":"span"},{"text":"r ","element":"span"},{"text":"(Fig. ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"(b) and (d), respectively). Since ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"M","element":"span"},{"text":"] is a measure of the linear short-term memory, it is expected to decrease as the nonlinearity of the system increases. On the other hand, ","element":"span"},{"text":"r ","element":"span"},{"text":"can be interpreted as an effective measure of the nonlinear response of the system, which reaches saturation for sufficiently large ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-11.png","element":"img","alt":" g2","inline":true},{"text":", since the activation function ","element":"span"},{"text":"f ","element":"span"},{"text":"is a sigmoid function. Indeed, ","element":"span"},{"style":{"height":19.5},"width":110.2,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-12.png","element":"img","alt":" r → 2π","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":16.56},"width":145.6,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-13.png","element":"img","alt":" g2 → ∞","inline":true},{"text":", since ","element":"span"},{"style":{"height":13.95},"width":99.52,"height":34.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-14.png","element":"img","alt":"σ2 →","inline":true,"padRight":true},{"text":"1 as ","element":"span"},{"style":{"height":16.75},"width":152.8,"height":41.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-15.png","element":"img","alt":" g2 → ∞","inline":true,"padRight":true},{"text":"in Eq. ","element":"span"},{"href":"#id-41","text":"(19) ","element":"a"},{"text":"(However, this cannot be seen from Fig. ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"(d) because the range of ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-16.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"shown is restricted upto 2).","element":"span"}],[{"text":"The mean-field predictions and the numerical results agree well over the whole range of ","element":"span"},{"style":{"height":16.75},"width":36.64,"height":41.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-17.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"for ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"M","element":"span"},{"text":"]. However, there is a clear discrepancy for ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-18.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] in the ordered regime (Fig. ","element":"span"},{"href":"#id-42","text":"3(","element":"a"},{"text":"c)). This is due to the breakdown of the mean-field theory. ","element":"span"},{"text":"That is, the assumption that ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-19.png","element":"img","alt":" xi(t","inline":true},{"text":") are independent and identically distributed and are also independent of ","element":"span"},{"style":{"height":11.51},"width":52.36,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-20.png","element":"img","alt":" wij","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.11},"width":34.04,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-21.png","element":"img","alt":" ui","inline":true,"padRight":true},{"text":"is violated when the ESN dynamics is driven by input signals. Indeed, in a certain range of ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-22.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"in the ordered regime (0","element":"span"},{"style":{"height":16.56},"width":242.36,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-23.png","element":"img","alt":".2 < g2 < 0.","inline":true},{"text":"7 in Fig. ","element":"span"},{"href":"#id-43","text":"4 ","element":"a"},{"text":"(a) where ","element":"span"},{"style":{"height":13.36},"width":137.24,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-24.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01), the numerically obtained values of ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-25.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] do not approach the mean-field value as the system size ","element":"span"},{"text":"N ","element":"span"},{"text":"increases. To understand the quantitative influence of the violation of the mean-field assumption on ","element":"span"},{"style":{"height":16},"width":130.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-26.png","element":"img","alt":" E [Mnet","inline":true},{"text":"], we consider the regime ","element":"span"},{"style":{"height":16.56},"width":95.68,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-27.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1, where the activation function ","element":"span"},{"text":"f ","element":"span"},{"text":"can be approximated by the identity function ","element":"span"},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"x","element":"span"},{"text":") = ","element":"span"},{"text":"x","element":"span"},{"text":". When ","element":"span"},{"style":{"height":16.56},"width":94.24,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-28.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1, we can approximately calculate ","element":"span"},{"style":{"height":16},"width":106.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-29.png","element":"img","alt":" E [Mn","inline":true},{"text":"] without the mean-field theory. Since both the mean-field theory and the linear approximation lead to ","element":"span"},{"style":{"height":17.55},"width":346.72,"height":43.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-30.png","element":"img","alt":" E [M] = 1 for g2 ≪","inline":true,"padRight":true},{"text":"1, the differ-ence in ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-31.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] is reduced to that in ","element":"span"},{"style":{"height":16},"width":102.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-32.png","element":"img","alt":" E [M1","inline":true},{"text":"]. The linear approximation predicts (Appendix ","element":"span"},{"href":"#id-32","text":"E)","element":"a"}],[{"id":"id-73","style":{"width":"93%"},"width":913,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-33.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.58},"width":40,"height":43.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-34.png","element":"img","alt":" σ2i","inline":true,"padRight":true},{"text":"is the stationary variance of ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-35.png","element":"img","alt":" xi(t","inline":true},{"text":"). Note that ","element":"span"},{"text":"we have ","element":"span"},{"style":{"height":24.51},"width":258.8,"height":61.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-36.png","element":"img","alt":" E�σ2i�= s21−g2","inline":true,"padRight":true},{"text":"in both the mean-field theory ","element":"span"},{"text":"and the linear approximation. Indeed, in the mean-field theory, Eq. ","element":"span"},{"href":"#id-26","text":"(3) ","element":"a"},{"text":"reduces to ","element":"span"},{"style":{"height":16.56},"width":298.72,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-37.png","element":"img","alt":" σ2 = Σ2 = g2σ2","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-38.png","element":"img","alt":" s2","inline":true,"padRight":true},{"text":"when ","element":"span"},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"x","element":"span"},{"text":") = ","element":"span"},{"text":"x","element":"span"},{"text":". ","element":"span"},{"text":"The equation for the linear approximation is derived in Appendix ","element":"span"},{"href":"#id-32","text":"E ","element":"a"},{"text":"(Eq. ","element":"span"},{"href":"#id-44","text":"(E3)","element":"a"},{"text":"). However, the former predicts ","element":"span"},{"style":{"height":28.8},"width":93.68,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-39.png","element":"img","alt":" E�1σ2i","inline":true}],[{"style":{"height":16},"width":102.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-40.png","element":"img","alt":"E [M1","inline":true},{"text":"] for ","element":"span"},{"style":{"height":16.56},"width":88.96,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-41.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1, the mean-field theory fails to capture the variance of ","element":"span"},{"style":{"height":17.39},"width":40,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-42.png","element":"img","alt":" σ2i","inline":true,"padRight":true},{"text":".","element":"span"}],[{"text":"We compare the values of ","element":"span"},{"style":{"height":16},"width":102.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-43.png","element":"img","alt":" E [M1","inline":true},{"text":"] obtained from the numerical simulation, the linear approximation, and the mean-field theory in Fig. ","element":"span"},{"href":"#id-43","text":"4 ","element":"a"},{"text":"(b). Although the mean-field line does not fit the numerical result, the linear approximation can explain it well.","element":"span"}],[{"text":"C. ","element":"span"},{"text":"Mutual Information and Fisher Memory","element":"span"}],[{"text":"Once we obtain the conditional probability density ","element":"span"},{"style":{"height":17.68},"width":265.68,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-44.png","element":"img","alt":"Pai(t)|s(t−n)(a|c","inline":true},{"text":") for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 0","element":"span"},{"text":", ","element":"span"},{"text":"1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . .","element":"span"},{"text":", we can immediately calculate the mutual information between ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-45.png","element":"img","alt":" xi(t","inline":true,"padRight":true},{"text":"+ 1) and ","element":"span"},{"style":{"height":16},"width":120.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-46.png","element":"img","alt":"s(t − n","inline":true},{"text":") as","element":"span"}],[{"id":"id-45","style":{"height":16},"width":101.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-47.png","element":"img","alt":"I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"style":{"height":16},"width":522.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-48.png","element":"img","alt":" s(t − n)) = I(ai(t); s(t − n)) ≃","inline":true,"padRight":true},{"text":"1","element":"span"},{"text":"2 ","element":"span"},{"text":"log Σ","element":"span"},{"style":{"height":41.87},"width":69.08,"height":104.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-49.png","element":"img","alt":"2Σ2n,i","inline":true}],[{"style":{"width":"6%"},"width":64,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-50.png","element":"img"}],[{"text":"when the mean-field assumption is valid. We would like to take the population average of Eq. ","element":"span"},{"href":"#id-45","text":"(23)","element":"a"},{"text":". Recall that we assumed ","element":"span"},{"style":{"height":14.56},"width":88.48,"height":36.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-51.png","element":"img","alt":" s2 ≪","inline":true,"padRight":true},{"text":"1. Let us suppose Σ","element":"span"},{"style":{"height":13.36},"width":102.48,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-52.png","element":"img","alt":"2 = O","inline":true},{"text":"(1) in the limit ","element":"span"},{"style":{"height":13.76},"width":93.28,"height":34.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-53.png","element":"img","alt":"s2 →","inline":true,"padRight":true},{"text":"0. In particular, this holds when ","element":"span"},{"style":{"height":16.56},"width":85.72,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-54.png","element":"img","alt":" g2 >","inline":true,"padRight":true},{"text":"1. Then, we can approximate the population average of ","element":"span"},{"style":{"height":16},"width":101.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-55.png","element":"img","alt":" I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"style":{"height":16},"width":120.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-56.png","element":"img","alt":" s(t − n","inline":true},{"text":")) as","element":"span"}],[{"id":"id-50","style":{"width":"82%"},"width":805,"height":166,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-57.png","element":"img"}],[{"text":"Let us consider the summation of Eq. ","element":"span"},{"href":"#id-45","text":"(23) ","element":"a"},{"text":"over ","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . . ","element":"span"},{"text":"defined by","element":"span"}],[{"id":"id-48","style":{"width":"76%"},"width":754,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-58.png","element":"img"}],[{"text":"where the subscript ot indicates the mutual information between the future state and a one-time past input. Note that the ","element":"span"},{"text":"n ","element":"span"},{"text":"= 0 term is not included in the summation. Thus, ","element":"span"},{"style":{"height":13.1},"width":45.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-59.png","element":"img","alt":" Iot","inline":true,"padRight":true},{"text":"is a measure of network short-term memory analogous to ","element":"span"},{"style":{"height":13.1},"width":83.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-60.png","element":"img","alt":" Mnet","inline":true,"padRight":true},{"text":"based on the mutual information. The population average ","element":"span"},{"style":{"height":16},"width":92.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-61.png","element":"img","alt":" E [Iot","inline":true},{"text":"] calculated based on the mean-field theory is shown in Fig. ","element":"span"},{"href":"#id-46","text":"5(","element":"a"},{"text":"a) and is compared with the numerical results. ","element":"span"},{"text":"Since the direct numerical estimate of the mutual information between ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-62.png","element":"img","alt":" xi(t","inline":true,"padRight":true},{"text":"+ 1) and ","element":"span"},{"style":{"height":16},"width":117.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-63.png","element":"img","alt":"s(t − n","inline":true},{"text":") for all ","element":"span"},{"text":"n ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . . , ","element":"span"},{"text":"500 is computationally hard, we estimated the mutual information from the correlation coefficient between ","element":"span"},{"style":{"height":16},"width":63.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-64.png","element":"img","alt":" ai(t","inline":true},{"text":") and ","element":"span"},{"style":{"height":16},"width":118.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-65.png","element":"img","alt":" s(t − n","inline":true},{"text":") assuming that ","element":"span"},{"style":{"height":17.68},"width":265.68,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-66.png","element":"img","alt":"Pai(t)|s(t−n)(a|c","inline":true},{"text":") is Gaussian, which is valid both in the linear regime and the mean-field regime. As in the case of ","element":"span"},{"style":{"height":16},"width":266.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-67.png","element":"img","alt":" E [Mnet], E [Iot","inline":true},{"text":"] also takes a maximum value in the range 1 ","element":"span"},{"style":{"height":17.39},"width":170.08,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-68.png","element":"img","alt":" < g2 < g2∗","inline":true},{"text":".","element":"span"}],[{"text":"As we have seen in the calculation of memory capacity, the mean-field theory is not applicable to the linear regime ","element":"span"},{"style":{"height":16.56},"width":97.6,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-69.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1. ","element":"span"},{"text":"Indeed, although the discrepancy between the numerical results and the mean-field predictions appears to be small on the scale of Fig. ","element":"span"},{"href":"#id-46","text":"5(","element":"a"},{"text":"a), the calculation of ","element":"span"},{"style":{"height":13.1},"width":45.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-70.png","element":"img","alt":" Iot","inline":true,"padRight":true},{"text":"based on the linear approximation (Appendix ","element":"span"},{"href":"#id-32","text":"E) ","element":"a"},{"text":"provides much better fits to the numerical results than the mean-field theory for ","element":"span"},{"style":{"height":16.56},"width":90.88,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-71.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1, as shown in Fig. ","element":"span"},{"href":"#id-46","text":"5(","element":"a"},{"text":"b).","element":"span"}],[{"text":"Another familiar information-theoretic memory measure is the Fisher information ","element":"span"},{"href":"#id-19","referenceIndex":22,"text":"[22, ","element":"a"},{"href":"#id-47","referenceIndex":29,"text":"29]","element":"a"},{"text":". Here, we regard the past input ","element":"span"},{"style":{"height":16},"width":122.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-72.png","element":"img","alt":" s(t − n","inline":true},{"text":") as a parameter and consider the Fisher information with respect to the conditional probability density ","element":"span"},{"style":{"height":17.68},"width":308.4,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/5-73.png","element":"img","alt":" Pxi(t+1)|s(t−n)(x|c","inline":true},{"text":"), namely, information","element":"span"}],[{"style":{"width":"88%"},"width":1807,"height":641,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-0.png","element":"img"}],[{"text":"FIG. 5. ","element":"figcaption","subtype":"caption"},{"id":"id-46","text":"Network memory measure based on the mutual information between the future state and a one-time past input. ","element":"figcaption","subtype":"caption"},{"text":"(a) The theoretical predictions and the numerical results are compared for the population averages of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.54},"width":42.72,"height":28.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-1.png","element":"img","alt":" Iot","inline":true,"padRight":true},{"text":"(Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-48","text":"(25)","element":"a","subtype":"caption"},{"text":"). ","element":"figcaption","subtype":"caption"},{"text":"(b) Comparison among numerical results, linear approximation, and mean-field theory for ","element":"figcaption","subtype":"caption"},{"style":{"height":16.15},"width":352.72,"height":40.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-2.png","element":"img","alt":" E [Iot] with s2 = 0.01.","inline":true}],[{"style":{"width":"92%"},"width":903,"height":605,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-3.png","element":"img"}],[{"text":"FIG. 6. ","element":"figcaption","subtype":"caption"},{"text":"Population average of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.54},"width":46.56,"height":28.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-4.png","element":"img","alt":" Jot","inline":true,"padRight":true},{"text":"(Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-49","text":"(28)","element":"a","subtype":"caption"},{"text":") calculated based on the mean-field theory under the same assumption as for Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-50","text":"(24)","element":"a","subtype":"caption"},{"text":". To adjust the scale of the vertical axis, ","element":"figcaption","subtype":"caption"},{"style":{"height":11.54},"width":85.56,"height":28.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-5.png","element":"img","alt":" Jot is","inline":true,"padRight":true},{"text":"multiplied by ","element":"figcaption","subtype":"caption"},{"style":{"height":12.55},"width":44.08,"height":31.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-6.png","element":"img","alt":" s2.","inline":true}],[{"text":"about ","element":"span"},{"style":{"height":16},"width":123.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-7.png","element":"img","alt":" s(t − n","inline":true},{"text":") contained in ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-8.png","element":"img","alt":" xi(t","inline":true,"padRight":true},{"text":"+ 1). Since the activation function ","element":"span"},{"text":"f ","element":"span"},{"text":"is invertible and the Fisher information is invariant under an invertible transformation of stochastic variables, we can use ","element":"span"},{"style":{"height":17.68},"width":265.68,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-9.png","element":"img","alt":" Pai(t)|s(t−n)(a|c","inline":true},{"text":") to calculate the Fisher information. When ","element":"span"},{"style":{"height":14.75},"width":87.52,"height":36.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-10.png","element":"img","alt":" s2 ≪","inline":true,"padRight":true},{"text":"1 and the mean-field assumption is valid, the Fisher information for ","element":"span"},{"style":{"height":16},"width":129.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-11.png","element":"img","alt":" s(t − n","inline":true},{"text":") contained in ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-12.png","element":"img","alt":" xi(t","inline":true,"padRight":true},{"text":"+ 1) is calculated as","element":"span"}],[{"id":"id-51","style":{"width":"98%"},"width":962,"height":321,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-13.png","element":"img"}],[{"text":"The population average of Eq. ","element":"span"},{"href":"#id-51","text":"(26) ","element":"a"},{"text":"can be approximately obtained under the same assumption as for Eq. ","element":"span"},{"href":"#id-50","text":"(24) ","element":"a"},{"text":"as","element":"span"}],[{"text":"follows:","element":"span"}],[{"id":"id-53","style":{"width":"82%"},"width":813,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-14.png","element":"img"}],[{"text":"We define the network Fisher memory with respect to a one-time past input by","element":"span"}],[{"id":"id-49","style":{"width":"62%"},"width":617,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-15.png","element":"img"}],[{"text":"As in the case of ","element":"span"},{"style":{"height":13.1},"width":45.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-16.png","element":"img","alt":" Iot","inline":true},{"text":", we exclude the ","element":"span"},{"text":"n ","element":"span"},{"text":"= 0 term representing direct memory from the sum in the right-hand side of Eq. ","element":"span"},{"href":"#id-49","text":"(28)","element":"a"},{"text":". The population average ","element":"span"},{"style":{"height":16},"width":97.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-17.png","element":"img","alt":" E [Jot","inline":true},{"text":"] calculated based on the mean-field theory under the same assumption as for Eq. ","element":"span"},{"href":"#id-50","text":"(24) ","element":"a"},{"text":"is shown in Fig. ","element":"span"},{"text":"6. ","element":"span"},{"style":{"height":16},"width":97.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-18.png","element":"img","alt":"E [Jot","inline":true},{"text":"] behaves qualitatively similarly to ","element":"span"},{"style":{"height":16},"width":131.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-19.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] and ","element":"span"},{"style":{"height":16},"width":92.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-20.png","element":"img","alt":"E [Iot","inline":true},{"text":"] at least in the range where the mean-field theory is valid. Note that there is a close relationship between the mean-field predictions of memory function, mutual information, and Fisher information through Eqs. ","element":"span"},{"href":"#id-52","text":"(18)","element":"a"},{"text":", ","element":"span"},{"href":"#id-50","text":"(24) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-53","text":"(27)","element":"a"},{"text":": ","element":"span"},{"style":{"height":16},"width":149.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-21.png","element":"img","alt":"E [I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"style":{"height":16},"width":222.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-22.png","element":"img","alt":" s(t − n))] =","inline":true}],[{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-23.png","element":"img","alt":"2","inline":true,"padRight":true},{"text":"log","element":"span"},{"style":{"height":28.8},"width":480.16,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-24.png","element":"img","alt":"�1 −�1 − s2Σ2�E [Mn]�= 12","inline":true,"padRight":true},{"text":"log","element":"span"},{"style":{"height":19.2},"width":18,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-25.png","element":"img","alt":"�","inline":true},{"text":"1 + ","element":"span"},{"style":{"height":19.2},"width":200.88,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-26.png","element":"img","alt":" E [J(n)] s2�","inline":true},{"text":".","element":"span"}],[{"text":"The derivation of the conditional probability density ","element":"span"},{"style":{"height":17.68},"width":265.68,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-27.png","element":"img","alt":"Pai(t)|s(t−n)(a|c","inline":true},{"text":") under the mean-field assumption can be extended in a straightforward manner to the conditioning on a set of past inputs at multiple time steps. In particular, the conditional probability density ","element":"span"},{"style":{"height":17.68},"width":267.24,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-28.png","element":"img","alt":" Pai(t)|s1:n(t)(a|c","inline":true},{"text":") can be approximated as a Gaussian distribution with mean ","element":"span"},{"style":{"height":11.51},"width":115.16,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-29.png","element":"img","alt":" µ1:n,c,i","inline":true,"padRight":true},{"text":"and variance Σ","element":"span"},{"style":{"height":19.99},"width":65.24,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-30.png","element":"img","alt":"21:n,i","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":16},"width":276.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-31.png","element":"img","alt":" s1:n(t) = (s(t −","inline":true}],[{"text":"1)","element":"span"},{"text":", s","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"text":"− ","element":"span"},{"text":"2)","element":"span"},{"text":", . . . , s","element":"span"},{"text":"(","element":"span"},{"text":"t ","element":"span"},{"text":"− ","element":"span"},{"text":"n","element":"span"},{"text":")), ","element":"span"},{"text":"c ","element":"span"},{"text":"= (","element":"span"},{"text":"c","element":"span"},{"text":"1","element":"span"},{"text":", c","element":"span"},{"text":"2","element":"span"},{"text":", . . . , c","element":"span"},{"text":"n","element":"span"},{"text":"),","element":"span"}],[{"style":{"width":"70%"},"width":691,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-32.png","element":"img"}],[{"text":"and","element":"span"}],[{"style":{"width":"74%"},"width":733,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/6-33.png","element":"img"}],[{"style":{"width":"89%"},"width":1812,"height":639,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-0.png","element":"img"}],[{"text":"FIG. 7. ","element":"figcaption","subtype":"caption"},{"id":"id-57","text":"Population averages of (a) ","element":"figcaption","subtype":"caption"},{"text":"I ","element":"figcaption","subtype":"caption"},{"text":"(Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-54","text":"(32)","element":"a","subtype":"caption"},{"text":") and (b) ","element":"figcaption","subtype":"caption"},{"style":{"height":12.35},"width":75.48,"height":30.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-1.png","element":"img","alt":" J ·s2 ","inline":true,"padRight":true},{"text":"(Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-55","text":"(35) ","element":"a","subtype":"caption"},{"text":"multiplied by ","element":"figcaption","subtype":"caption"},{"style":{"height":12.35},"width":32.28,"height":30.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-2.png","element":"img","alt":" s2","inline":true},{"text":") calculated from the mean-field theory under the same assumption as for Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-50","text":"(24) ","element":"a","subtype":"caption"},{"text":"are shown.","element":"figcaption","subtype":"caption"}],[{"style":{"width":"92%"},"width":906,"height":607,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-3.png","element":"img"}],[{"text":"FIG. 8. ","element":"figcaption","subtype":"caption"},{"id":"id-58","text":"The mean-field values of ","element":"figcaption","subtype":"caption"},{"style":{"height":17.42},"width":396.56,"height":43.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-4.png","element":"img","alt":" g2 (g2χ, g2ι , g2φ, and g2ω)","inline":true,"padRight":true},{"text":"that attain the maxima of the five network memory measures (","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":650.88,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-5.png","element":"img","alt":"E [Mnet], E [Iot], E [Jot], and E [I] (E [J","inline":true},{"text":"]), respectively) are shown as functions of ","element":"figcaption","subtype":"caption"},{"style":{"height":12.55},"width":32.28,"height":31.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-6.png","element":"img","alt":" s2","inline":true},{"text":". Note that ","element":"figcaption","subtype":"caption"},{"text":"E ","element":"figcaption","subtype":"caption"},{"text":"[","element":"figcaption","subtype":"caption"},{"text":"I","element":"figcaption","subtype":"caption"},{"text":"] and ","element":"figcaption","subtype":"caption"},{"text":"E ","element":"figcaption","subtype":"caption"},{"text":"[","element":"figcaption","subtype":"caption"},{"text":"J","element":"figcaption","subtype":"caption"},{"text":"] peak at the same value of ","element":"figcaption","subtype":"caption"},{"style":{"height":15.34},"width":34.2,"height":38.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-7.png","element":"img","alt":" g2","inline":true},{"text":". The critical line ","element":"figcaption","subtype":"caption"},{"style":{"height":15.34},"width":118.68,"height":38.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-8.png","element":"img","alt":" g2 = g2∗ ","inline":true,"padRight":true},{"text":"is also shown.","element":"figcaption","subtype":"caption"}],[{"text":"An alternative network memory measure to ","element":"span"},{"style":{"height":13.1},"width":45.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-9.png","element":"img","alt":" Iot","inline":true,"padRight":true},{"text":"is the limit ","element":"span"},{"style":{"height":8.8},"width":125.92,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-10.png","element":"img","alt":" n → ∞","inline":true,"padRight":true},{"text":"of the mutual information between ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-11.png","element":"img","alt":" xi(t","inline":true},{"text":"+1) and ","element":"span"},{"style":{"height":16},"width":97.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-12.png","element":"img","alt":" s1:n(t","inline":true},{"text":"). When the mean-field assumption is valid, it is given by","element":"span"}],[{"style":{"width":"88%"},"width":871,"height":178,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-13.png","element":"img"}],[{"text":"where Σ","element":"span"},{"style":{"height":23.63},"width":519.04,"height":59.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-14.png","element":"img","alt":"21:∞,i = Σ2 − �∞k=1�V ku�2i s2","inline":true},{"text":". Under the same ","element":"span"},{"text":"assumption as for Eq. ","element":"span"},{"href":"#id-50","text":"(24)","element":"a"},{"text":", its population average is approximately given by","element":"span"}],[{"id":"id-54","style":{"width":"75%"},"width":745,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-15.png","element":"img"}],[{"text":"Similarly, we can consider an alternative network memory measure based on the Fisher information matrix with","element":"span"}],[{"style":{"width":"92%"},"width":905,"height":614,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-16.png","element":"img"}],[{"text":"FIG. 9. ","element":"figcaption","subtype":"caption"},{"id":"id-59","style":{"height":14.4},"width":491.56,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-17.png","element":"img","alt":" I = limn→∞ I(xi(t + 1); s1:n(t","inline":true},{"text":")) can be represented as the difference between ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":545.96,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-18.png","element":"img","alt":" I1 = I(xi(t + 1); x(t)) and I2 =","inline":true,"padRight":true},{"text":"lim","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":439.72,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-19.png","element":"img","alt":"n→∞ I(xi(t + 1); x(t)|s1:n(t","inline":true},{"text":")) (Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-56","text":"(36)","element":"a","subtype":"caption"},{"text":"). The population-averaged values of these quantities calculated from the mean-field theory under the same assumption as for Eq. ","element":"figcaption","subtype":"caption"},{"href":"#id-50","text":"(24) ","element":"a","subtype":"caption"},{"text":"are shown for ","element":"figcaption","subtype":"caption"},{"style":{"height":12.35},"width":157.84,"height":30.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-20.png","element":"img","alt":" s2 = 0.01.","inline":true}],[{"text":"respect to ","element":"span"},{"style":{"height":17.68},"width":266.76,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-21.png","element":"img","alt":" Pai(t)|s1:n(t)(a|c","inline":true},{"text":"). ","element":"span"},{"text":"When the mean-field assumption is valid, the (","element":"span"},{"text":"k, l","element":"span"},{"text":")-th element of the Fisher information matrix is given by","element":"span"}],[{"style":{"width":"89%"},"width":875,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-22.png","element":"img"}],[{"text":"The Fisher memory with respect to the whole past input history is defined by","element":"span"}],[{"style":{"width":"64%"},"width":632,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-23.png","element":"img"}],[{"text":"and its approximate population average is found to be","element":"span"}],[{"id":"id-55","style":{"width":"70%"},"width":694,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-24.png","element":"img"}],[{"text":"under the same assumption as for Eq. ","element":"span"},{"href":"#id-50","text":"(24)","element":"a"},{"text":". Note that ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"I","element":"span"},{"text":"] and ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"J","element":"span"},{"text":"] are related by ","element":"span"},{"style":{"height":19.5},"width":153.28,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-25.png","element":"img","alt":" E [I] ≃ 12","inline":true,"padRight":true},{"text":"log","element":"span"},{"style":{"height":19.2},"width":18,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-26.png","element":"img","alt":"�","inline":true},{"text":"1 + ","element":"span"},{"style":{"height":19.2},"width":146.16,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/7-27.png","element":"img","alt":" E [J] s2�","inline":true},{"text":". Figure ","element":"span"},{"href":"#id-57","text":"7 ","element":"a"},{"text":"shows Eqs. ","element":"span"},{"href":"#id-54","text":"(32) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-55","text":"(35) ","element":"a"},{"text":"for ","element":"span"},{"style":{"height":16.56},"width":210.2,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-0.png","element":"img","alt":" s2 = 0.01, 0.","inline":true},{"text":"02, and 0","element":"span"},{"text":".","element":"span"},{"text":"04. Both ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"I","element":"span"},{"text":"] and ","element":"span"},{"text":"E ","element":"span"},{"text":"[","element":"span"},{"text":"J","element":"span"},{"text":"] take maximum values at points close to those for ","element":"span"},{"style":{"height":16},"width":261.12,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-1.png","element":"img","alt":" E [Mnet], E [Iot","inline":true},{"text":"], and ","element":"span"},{"style":{"height":16},"width":97.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-2.png","element":"img","alt":" E [Jot","inline":true},{"text":"] as long as the mean-field theory is valid. Figure ","element":"span"},{"href":"#id-58","text":"8 ","element":"a"},{"text":"summarizes the maximum points of these network memory measures in the range 0 ","element":"span"},{"style":{"height":15.76},"width":182.84,"height":39.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-3.png","element":"img","alt":" < s2 ≤ 0.","inline":true},{"text":"04 obtained from the mean-field theory.","element":"span"}],[{"text":"Finally, we remark on the behavior of ","element":"span"},{"text":"I","element":"span"},{"text":". Because ","element":"span"},{"style":{"height":16},"width":65.83,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-4.png","element":"img","alt":" xi(t","inline":true},{"text":"+ 1) and ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":") are conditionally independent given ","element":"span"},{"style":{"height":16},"width":97.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-5.png","element":"img","alt":" s1:n(t","inline":true},{"text":"), we obtain","element":"span"}],[{"id":"id-56","style":{"width":"97%"},"width":952,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-6.png","element":"img"}],[{"text":"The first term on the right-hand side of Eq. ","element":"span"},{"href":"#id-56","text":"(36) ","element":"a"},{"text":"becomes ","element":"span"},{"text":"I ","element":"span"},{"text":"by taking the limit ","element":"span"},{"style":{"height":8.8},"width":135.04,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-7.png","element":"img","alt":" n → ∞","inline":true},{"text":". The second term ","element":"span"},{"style":{"height":16},"width":101.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-8.png","element":"img","alt":"I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"style":{"height":16},"width":179.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-9.png","element":"img","alt":" x(t)|s1:n(t","inline":true},{"text":")) will be negligible when the system is driven by input signals (","element":"span"},{"style":{"height":16.56},"width":94.72,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-10.png","element":"img","alt":"g2 ≪","inline":true,"padRight":true},{"text":"1). On the other hand, the chaotic dynamics dominates as ","element":"span"},{"style":{"height":16.56},"width":150.4,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-11.png","element":"img","alt":" g2 → ∞","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":101.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-12.png","element":"img","alt":"I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"style":{"height":16},"width":180.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-13.png","element":"img","alt":" x(t)|s1:n(t","inline":true},{"text":")) will approach ","element":"span"},{"style":{"height":16},"width":101.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-14.png","element":"img","alt":" I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":")). Thus, as ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-15.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"varies from 0 to ","element":"span"},{"style":{"height":14},"width":85.28,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-16.png","element":"img","alt":" ∞, I","inline":true,"padRight":true},{"text":"will increase together with ","element":"span"},{"style":{"height":16},"width":101.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-17.png","element":"img","alt":" I(xi(t","inline":true,"padRight":true},{"text":"+ 1); ","element":"span"},{"text":"x","element":"span"},{"text":"(","element":"span"},{"text":"t","element":"span"},{"text":")) in the ordered regime but will decrease in the sufficiently chaotic regime. Hence, ","element":"span"},{"text":"I ","element":"span"},{"text":"is expected to take a maximum value between the two extremes. In Fig. ","element":"span"},{"href":"#id-59","text":"9, ","element":"a"},{"text":"the population-averaged values of the three terms in Eq. ","element":"span"},{"href":"#id-56","text":"(36) ","element":"a"},{"text":"in the limit ","element":"span"},{"style":{"height":8.8},"width":136.96,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-18.png","element":"img","alt":" n → ∞","inline":true,"padRight":true},{"text":"calculated from the mean-field theory under the same assumption as for Eq. ","element":"span"},{"href":"#id-50","text":"(24) ","element":"a"},{"text":"are shown, where ","element":"span"},{"style":{"height":16},"width":101.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-19.png","element":"img","alt":" I(xi(t","inline":true},{"text":"+1); ","element":"span"},{"style":{"height":19.31},"width":161.44,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-20.png","element":"img","alt":" x(t)) = 12","inline":true,"padRight":true},{"text":"log ","element":"span"},{"style":{"height":22.11},"width":36.56,"height":55.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-21.png","element":"img","alt":" Σ2s2","inline":true,"padRight":true},{"text":"due to the mean-field assumption.","element":"span"}]]},{"heading":"III. DISCUSSION","paragraphs":[[{"text":"The three network memory measures studied in this paper take maximum values in the ordered regime for ESNs with small input signals. ","element":"span"},{"text":"The value of ","element":"span"},{"style":{"height":16.56},"width":36.64,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-22.png","element":"img","alt":" g2","inline":true,"padRight":true},{"text":"that attains the maximum is always greater than 1, which is the boundary between the ordered and chaotic regimes in the corresponding autonomous system. ","element":"span"},{"text":"However, it is far from the critical ","element":"span"},{"style":{"height":17.39},"width":36.64,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-23.png","element":"img","alt":" g2∗","inline":true,"padRight":true},{"text":"(Fig. ","element":"span"},{"href":"#id-58","text":"8)","element":"a"},{"text":". In previous work, it ","element":"span"},{"text":"was argued that the maximal Fisher information can be used to detect the edge of chaos ","element":"span"},{"href":"#id-60","referenceIndex":30,"text":"[30, ","element":"a"},{"href":"#id-61","referenceIndex":31,"text":"31]","element":"a"},{"text":". ","element":"span"},{"text":"Our results suggest that such an approach is not necessarily effective for driven dynamical systems.","element":"span"}],[{"text":"In the context of physical reservoir computing ","element":"span"},{"href":"#id-4","referenceIndex":7,"text":"[7–","element":"a"},{"href":"#id-9","referenceIndex":12,"text":"12]","element":"a"},{"text":", it is generally difficult to tune the parameters of a given physical system for optimal computational performance. An alternative method is to choose an optimal input strength. Although the analysis presented in this paper assumes that ","element":"span"},{"style":{"height":13.36},"width":34.72,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-24.png","element":"img","alt":" s2","inline":true,"padRight":true},{"text":"is small, our results theoretically suggest that tuning the input strength is meaningful. For example, the value of ","element":"span"},{"style":{"height":16},"width":130.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-25.png","element":"img","alt":" E [Mnet","inline":true},{"text":"] for ","element":"span"},{"style":{"height":13.36},"width":131.48,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-26.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"02 is greater than those for ","element":"span"},{"style":{"height":13.55},"width":133.88,"height":33.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-27.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"01 and ","element":"span"},{"style":{"height":13.55},"width":133.88,"height":33.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-28.png","element":"img","alt":" s2 = 0.","inline":true},{"text":"04 around ","element":"span"},{"style":{"height":16.75},"width":135.32,"height":41.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-29.png","element":"img","alt":" g2 ≈ 1.","inline":true},{"text":"3 in Fig. ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"(c).","element":"span"}],[{"text":"Toyoizumi and Abbott ","element":"span"},{"href":"#id-22","referenceIndex":25,"text":"[25] ","element":"a"},{"text":"analytically showed that the signal-to-noise ratio of ESNs decreases rapidly on the left side of the criticality ","element":"span"},{"style":{"height":16.75},"width":510.76,"height":41.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-30.png","element":"img","alt":" g2 = 1 when inputs are ab-","inline":true,"padRight":true},{"text":"sent, but decreases much slowly on the right side. They suggested that high computational performance can be achieved without fine tuning in the latter. Our results confirm this expectation because all of the short-term memory measures in the presence of inputs peak near ","element":"span"},{"style":{"height":17.2},"width":560.32,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-31.png","element":"img","alt":"g2 = 1 in the region 1 < g2 < g2∗","inline":true},{"text":".","element":"span"}],[{"text":"In general, dynamical regimes of autonomous systems beyond stable fixed points are candidates for computational resources. For example, RNNs with sinusoidal activation functions achieve a high computational performance in the non-chaotic window regions after transition to chaos occurs in their autonomous dynamics ","element":"span"},{"href":"#id-62","referenceIndex":32,"text":"[32]","element":"a"},{"text":". An online supervised learning algorithm for RNNs proposed by Sussillo and Abbott ","element":"span"},{"href":"#id-63","referenceIndex":33,"text":"[33] ","element":"a"},{"text":"exhibits its best performance when their autonomous dynamics is adjusted to the chaotic region not far from the critical point where chaotic dynamics can be suppressed by input signals. Schuecker et al. ","element":"span"},{"href":"#id-20","referenceIndex":23,"text":"[23] ","element":"a"},{"text":"showed that the network memory capacity for continuous-time nonlinear RNNs peaks in the ordered regime with ","element":"span"},{"style":{"height":16.56},"width":85.72,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-32.png","element":"img","alt":" g2 >","inline":true,"padRight":true},{"text":"1, which is consistent to our result. They argued that the dynamic suppression of chaos (DSC), which results from the fact that the onset of local instability precedes that of asymptotic instability, contributes to optimal information processing. However, DSC cannot occur in discrete-time ESNs where the two onsets coincide. In ESNs, the shift of the critical ","element":"span"},{"style":{"height":17.39},"width":36.64,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/8-33.png","element":"img","alt":" g2∗","inline":true,"padRight":true},{"text":"to- ","element":"span"},{"text":"ward the chaotic regime induced by input signals is solely due to a mechanism called the static suppression of chaos (SSC), which increases the frequency with which an orbit visits the contracting region of the phase space. Unlike SSC, DSC is conjectured to occur based on fast switching among different unstable directions caused by input signals ","element":"span"},{"href":"#id-20","referenceIndex":23,"text":"[23]","element":"a"},{"text":". However, ESNs with leaky neurons ","element":"span"},{"href":"#id-64","referenceIndex":34,"text":"[34] ","element":"a"},{"text":"are expected to exhibit DSC ","element":"span"},{"href":"#id-20","referenceIndex":23,"text":"[23]","element":"a"},{"text":". Future analyses of network memory measures for leaky ESNs based on the presented theory could deepen the understanding of the relationship between DSC and the information processing ability of dynamical systems.","element":"span"}],[{"text":"It has been suggested that there exists a trade-off between nonlinearity of dynamical systems and their memory capacity ","element":"span"},{"href":"#id-11","referenceIndex":14,"text":"[14]","element":"a"},{"text":". Inubushi and Yoshimura ","element":"span"},{"href":"#id-65","referenceIndex":35,"text":"[35] ","element":"a"},{"text":"theoretically investigated the trade-off in terms of how nonlinearity degrades small initial differences of input signals. The mean-field theory presented in this paper makes it possible to study the trade-off when input strength is small by directly calculating the nonlinear memory capacity proposed by Damble et al. ","element":"span"},{"href":"#id-11","referenceIndex":14,"text":"[14]","element":"a"},{"text":". Performing the detailed calculation is also left as future work.","element":"span"}],[{"id":"id-66","style":{"width":"95%"},"width":937,"height":468,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-0.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":16.7},"width":159.72,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-1.png","element":"img","alt":" ajk(t − k","inline":true},{"text":") (","element":"span"},{"text":"k ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", . . ., n","element":"span"},{"text":") are independent and ","element":"span"},{"style":{"height":18.26},"width":361.6,"height":45.64,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-2.png","element":"img","alt":"ajk(t − k) ∼ N(0, Σ2","inline":true},{"text":") by the mean-field assumption, we have","element":"span"}],[{"id":"id-67","style":{"width":"100%"},"width":981,"height":1984,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-3.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":20.35},"width":241.52,"height":50.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-4.png","element":"img","alt":" f ′(a) = e− π4 a2","inline":true,"padRight":true},{"text":"is used for ","element":"span"},{"style":{"height":21.73},"width":266.28,"height":54.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-5.png","element":"img","alt":" f(a) = erf(√π2 a","inline":true},{"text":"). Equa- ","element":"span"},{"text":"tion ","element":"span"},{"href":"#id-30","text":"(10) ","element":"a"},{"text":"follows from Eqs. ","element":"span"},{"href":"#id-35","text":"(11)","element":"a"},{"text":", ","element":"span"},{"href":"#id-66","text":"(A1) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-67","text":"(A2)","element":"a"},{"text":".","element":"span"}],[{"id":"id-29","style":{"width":"99%"},"width":980,"height":696,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-6.png","element":"img"}],[{"text":"We set","element":"span"}],[{"style":{"width":"68%"},"width":672,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-7.png","element":"img"}],[{"text":"and","element":"span"}],[{"text":"g","element":"span"},{"style":{"height":7.6},"width":61.64,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-8.png","element":"img","alt":"n−k","inline":true},{"text":"(","element":"span"},{"text":"y, z","element":"span"},{"style":{"height":7.6},"width":61.64,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-9.png","element":"img","alt":"n−k","inline":true},{"text":", . . . , z","element":"span"},{"style":{"height":22.4},"width":170.8,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-10.png","element":"img","alt":"n) = �","inline":true},{"style":{"height":10.53},"width":102.32,"height":26.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-11.png","element":"img","alt":"jn−k+1","inline":true,"padRight":true},{"text":"w","element":"span"},{"style":{"height":10.53},"width":172.4,"height":26.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-12.png","element":"img","alt":"jn−kjn−k+1","inline":true},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"g","element":"span"},{"style":{"height":9.2},"width":102.4,"height":23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-13.png","element":"img","alt":"n−k+1","inline":true},{"text":"(","element":"span"},{"text":"y, z","element":"span"},{"style":{"height":9.2},"width":102.4,"height":23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-14.png","element":"img","alt":"n−k+1","inline":true},{"text":", . . . , z","element":"span"},{"style":{"height":4.8},"width":20,"height":12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-15.png","element":"img","alt":"n","inline":true},{"text":")) + ","element":"span"},{"text":"u","element":"span"},{"style":{"height":11.5},"width":151.88,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-16.png","element":"img","alt":"jn−kzn−k","inline":true,"padRight":true},{"text":"(B4)","element":"span"}],[{"text":"for ","element":"span"},{"style":{"height":14},"width":247.48,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-17.png","element":"img","alt":" k = 1, . . . , n −","inline":true,"padRight":true},{"text":"1. Let us introduce","element":"span"}],[{"style":{"width":"95%"},"width":935,"height":171,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-18.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":28.18},"width":325.48,"height":70.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-19.png","element":"img","alt":" Dy = dy√2πg2σ2 e−","inline":true}],[{"text":"Eq. ","element":"span"},{"href":"#id-29","text":"(B2) ","element":"a"},{"text":"can be written as","element":"span"}],[{"style":{"width":"86%"},"width":850,"height":231,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-20.png","element":"img"}],[{"style":{"width":"100%"},"width":981,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-21.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":23.01},"width":280.36,"height":57.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-22.png","element":"img","alt":" Dzk = dzk√2πs2 e−","inline":true}],[{"text":"odd function with respect to (","element":"span"},{"style":{"height":10},"width":163.52,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-23.png","element":"img","alt":"z1, . . . , zn","inline":true},{"text":"), namely,","element":"span"}],[{"id":"id-69","style":{"width":"80%"},"width":785,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-24.png","element":"img"}],[{"text":"holds. ","element":"span"},{"text":"This yields the desired result. ","element":"span"},{"text":"First, note that ","element":"span"},{"style":{"height":16},"width":347.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-25.png","element":"img","alt":"gn−k(y, zn−k, . . . , zn","inline":true},{"text":") ","element":"span"},{"text":"is ","element":"span"},{"text":"odd ","element":"span"},{"text":"with ","element":"span"},{"text":"respect ","element":"span"},{"text":"to (","element":"span"},{"style":{"height":10},"width":248.96,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-26.png","element":"img","alt":"y, zn−k, . . . , zn","inline":true},{"text":") for ","element":"span"},{"style":{"height":14},"width":282.52,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-27.png","element":"img","alt":" k = 0, 1, . . . , n −","inline":true,"padRight":true},{"text":"1. Namely, we have","element":"span"}],[{"id":"id-68","style":{"width":"96%"},"width":945,"height":84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-28.png","element":"img"}],[{"text":"Indeed, Eq. ","element":"span"},{"href":"#id-68","text":"(B8) ","element":"a"},{"text":"can be proved by mathematical induction. ","element":"span"},{"text":"First, ","element":"span"},{"style":{"height":16.7},"width":731.36,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-29.png","element":"img","alt":" gn(−y, −zn) = −y − ujnzn = −gn(y, zn","inline":true},{"text":") for ","element":"span"},{"text":"k ","element":"span"},{"text":"= 0. ","element":"span"},{"text":"Assume that ","element":"span"},{"style":{"height":16},"width":509.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-30.png","element":"img","alt":" gn−k(−y, −zn−k, . . . , −zn) =","inline":true},{"style":{"height":16},"width":378.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-31.png","element":"img","alt":"−gn−k(y, zn−k, . . . , zn","inline":true},{"text":") for ","element":"span"},{"style":{"height":14},"width":287.8,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/9-32.png","element":"img","alt":" k = 0, 1, . . ., n −","inline":true,"padRight":true},{"text":"2. Then, we obtain","element":"span"}],[{"text":"g","element":"span"},{"style":{"height":17.68},"width":391.2,"height":44.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-0.png","element":"img","alt":"n−(k+1)(−y, −zn−(k+1)","inline":true},{"text":", . . . , ","element":"span"},{"style":{"height":22.4},"width":203.92,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-1.png","element":"img","alt":" −zn) =�","inline":true},{"style":{"height":9.6},"width":67.32,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-2.png","element":"img","alt":"jn−k","inline":true,"padRight":true},{"text":"w","element":"span"},{"style":{"height":11.31},"width":193.08,"height":28.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-3.png","element":"img","alt":"jn−(k+1)jn−k","inline":true},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"g","element":"span"},{"style":{"height":16},"width":260.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-4.png","element":"img","alt":"n−k(−y, −zn−k","inline":true},{"text":", . . . , ","element":"span"},{"style":{"height":18.41},"width":446.4,"height":46.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-5.png","element":"img","alt":" −zn)) − ujn−(k+1)zn−(k+1)","inline":true}],[{"style":{"height":22.4},"width":106,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-6.png","element":"img","alt":"=�","inline":true},{"style":{"height":9.6},"width":67.32,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-7.png","element":"img","alt":"jn−k","inline":true,"padRight":true},{"text":"w","element":"span"},{"style":{"height":11.31},"width":193.08,"height":28.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-8.png","element":"img","alt":"jn−(k+1)jn−k","inline":true},{"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"height":10},"width":112.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-9.png","element":"img","alt":"−gn−k","inline":true},{"text":"(","element":"span"},{"text":"y, z","element":"span"},{"style":{"height":7.6},"width":61.64,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-10.png","element":"img","alt":"n−k","inline":true},{"text":", . . . , z","element":"span"},{"style":{"height":18.42},"width":396.96,"height":46.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-11.png","element":"img","alt":"n)) − ujn−(k+1)zn−(k+1)","inline":true}],[{"style":{"height":22.4},"width":143.44,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-12.png","element":"img","alt":"= −�","inline":true},{"style":{"height":9.6},"width":67.32,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-13.png","element":"img","alt":"jn−k","inline":true,"padRight":true},{"text":"w","element":"span"},{"style":{"height":11.31},"width":193.08,"height":28.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-14.png","element":"img","alt":"jn−(k+1)jn−k","inline":true},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"g","element":"span"},{"style":{"height":7.6},"width":61.64,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-15.png","element":"img","alt":"n−k","inline":true},{"text":"(","element":"span"},{"text":"y, z","element":"span"},{"style":{"height":7.6},"width":61.64,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-16.png","element":"img","alt":"n−k","inline":true},{"text":", . . . , z","element":"span"},{"style":{"height":18.41},"width":396.96,"height":46.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-17.png","element":"img","alt":"n)) − ujn−(k+1)zn−(k+1)","inline":true}],[{"style":{"height":12.48},"width":218.88,"height":31.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-18.png","element":"img","alt":"= −gn−(k+1)","inline":true},{"text":"(","element":"span"},{"text":"y, z","element":"span"},{"style":{"height":11.2},"width":126.72,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-19.png","element":"img","alt":"n−(k+1)","inline":true},{"text":", . . . , z","element":"span"},{"style":{"height":4.8},"width":20,"height":12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-20.png","element":"img","alt":"n","inline":true},{"text":")","element":"span"},{"text":", ","element":"span"},{"text":"(B9)","element":"span"}],[{"style":{"width":"100%"},"width":981,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-21.png","element":"img"}],[{"text":"where we applied the induction hypothesis for the third equality and we used the fact that ","element":"span"},{"text":"f ","element":"span"},{"text":"is an odd function for the fourth equality. Now, Eq. ","element":"span"},{"href":"#id-69","text":"(B7) ","element":"a"},{"text":"is obtained by","element":"span"}],[{"style":{"width":"89%"},"width":878,"height":637,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-22.png","element":"img"}],[{"text":"where we used Eq. ","element":"span"},{"href":"#id-68","text":"(B8) ","element":"a"},{"text":"for the third equality and the fact that ","element":"span"},{"style":{"height":14},"width":38,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-23.png","element":"img","alt":" f ′","inline":true,"padRight":true},{"text":"is an even function for the fourth equality.","element":"span"}],[{"id":"id-37","style":{"width":"66%"},"width":656,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-24.png","element":"img"}],[{"text":"We have","element":"span"}],[{"id":"id-70","style":{"width":"93%"},"width":912,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-25.png","element":"img"}],[{"text":"=","element":"span"}],[{"style":{"width":"26%"},"width":256,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-26.png","element":"img"}],[{"text":"By the change of variable ","element":"span"},{"style":{"height":16.7},"width":119.48,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-27.png","element":"img","alt":" y = aΣ","inline":true},{"text":", the integral on the ","element":"span"},{"text":"right-hand side of Eq. ","element":"span"},{"href":"#id-70","text":"(C1) ","element":"a"},{"text":"can be calculated as","element":"span"}],[{"id":"id-71","style":{"width":"96%"},"width":946,"height":456,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-28.png","element":"img"}],[{"style":{"width":"100%"},"width":981,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-29.png","element":"img"}],[{"text":"From Eqs. ","element":"span"},{"href":"#id-70","text":"(C1) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-71","text":"(C2)","element":"a"},{"text":", we obtain Eq. ","element":"span"},{"href":"#id-39","text":"(16)","element":"a"},{"text":".","element":"span"}],[{"id":"id-40","style":{"width":"67%"},"width":658,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-30.png","element":"img"}],[{"text":"Since (","element":"span"},{"style":{"height":29.54},"width":503.2,"height":73.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-31.png","element":"img","alt":"V nu)2i =� 11+ π2 Σ2�n(W nu)2i","inline":true,"padRight":true},{"text":", it is sufficient to ","element":"span"},{"text":"show ","element":"span"},{"style":{"height":28.8},"width":325.28,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-32.png","element":"img","alt":" E�(W nu)2i�= g2n","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"text":"O","element":"span"},{"text":"( ","element":"span"},{"style":{"height":19.31},"width":27,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-33.png","element":"img","alt":"1N","inline":true,"padRight":true},{"text":") for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 0","element":"span"},{"text":", ","element":"span"},{"text":"1","element":"span"},{"text":", . . . ","element":"span"},{"text":".","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":23.47},"width":73.44,"height":58.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-34.png","element":"img","alt":" w(n)ij","inline":true,"padRight":true},{"text":"be the (","element":"span"},{"text":"i, j","element":"span"},{"text":")-th element of ","element":"span"},{"style":{"height":10.8},"width":63.2,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-35.png","element":"img","alt":" W n","inline":true},{"text":". We have","element":"span"}],[{"style":{"width":"85%"},"width":839,"height":232,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-36.png","element":"img"}],[{"text":"where we used ","element":"span"},{"style":{"height":16.7},"width":251.72,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-37.png","element":"img","alt":" E [ujuk] = δjk","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16.3},"width":49.16,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-38.png","element":"img","alt":" δjk","inline":true,"padRight":true},{"text":"is the Kronecker delta. The population average of","element":"span"},{"style":{"height":32.34},"width":139.84,"height":80.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-39.png","element":"img","alt":"�w(n)ij �2","inline":true},{"text":"is given by","element":"span"}],[{"style":{"width":"78%"},"width":770,"height":138,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-40.png","element":"img"}],[{"text":"because ","element":"span"},{"style":{"height":23.07},"width":242.96,"height":57.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-41.png","element":"img","alt":" wij ∼ N(0, g2N","inline":true,"padRight":true},{"text":") are independent. Thus, we ob- ","element":"span"},{"text":"tain","element":"span"}],[{"id":"id-32","style":{"width":"83%"},"width":816,"height":223,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-42.png","element":"img"}],[{"text":"When ","element":"span"},{"style":{"height":16.56},"width":91.84,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-43.png","element":"img","alt":" g2 ≪","inline":true,"padRight":true},{"text":"1, we can approximate ","element":"span"},{"text":"f","element":"span"},{"text":"(","element":"span"},{"text":"a","element":"span"},{"text":") = ","element":"span"},{"text":"a ","element":"span"},{"text":"and obtain","element":"span"}],[{"style":{"width":"75%"},"width":737,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-44.png","element":"img"}],[{"text":"Thus,","element":"span"}],[{"style":{"width":"80%"},"width":786,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-45.png","element":"img"}],[{"text":"holds. ","element":"span"},{"text":"Ignoring the ","element":"span"},{"text":"O","element":"span"},{"text":"( ","element":"span"},{"style":{"height":19.5},"width":27,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-46.png","element":"img","alt":"1N","inline":true,"padRight":true},{"text":") terms, the mean and vari- ","element":"span"},{"text":"ance of (","element":"span"},{"style":{"height":20.46},"width":123.52,"height":51.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-47.png","element":"img","alt":"W nu)2i","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":12.8},"width":72.76,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-48.png","element":"img","alt":" n ≥","inline":true,"padRight":true},{"text":"1 are given by ","element":"span"},{"style":{"height":16.56},"width":56.48,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-49.png","element":"img","alt":" g2n","inline":true,"padRight":true},{"text":"and 2","element":"span"},{"style":{"height":16.56},"width":56.48,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/10-50.png","element":"img","alt":"g4n","inline":true},{"text":",","element":"span"}],[{"text":"respectively, because ","element":"span"},{"style":{"height":11.51},"width":59.72,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-0.png","element":"img","alt":" wjk","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":11.51},"width":36.04,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-1.png","element":"img","alt":" uj","inline":true,"padRight":true},{"text":"are independent and ","element":"span"},{"style":{"height":23.07},"width":240.56,"height":57.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-2.png","element":"img","alt":"wjk ∼ N(0, g2N","inline":true,"padRight":true},{"text":") and ","element":"span"},{"style":{"height":15.5},"width":123.64,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-3.png","element":"img","alt":" uj = ±","inline":true},{"text":"1. The population average of ","element":"span"},{"style":{"height":17.58},"width":40,"height":43.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-4.png","element":"img","alt":"σ2i","inline":true,"padRight":true},{"text":"is given by","element":"span"}],[{"id":"id-44","style":{"width":"75%"},"width":742,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-5.png","element":"img"}],[{"text":"Its variance is","element":"span"}],[{"style":{"width":"74%"},"width":726,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-6.png","element":"img"}],[{"text":"Var","element":"span"},{"style":{"height":19.66},"width":163.36,"height":49.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-7.png","element":"img","alt":"�σ2i�= s4","inline":true}],[{"id":"id-72","style":{"width":"100%"},"width":981,"height":416,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-8.png","element":"img"}],[{"style":{"height":28.8},"width":535.44,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-9.png","element":"img","alt":"E�(W mu)2i�E�(W nu)2i�+ O","inline":true},{"text":"( ","element":"span"},{"style":{"height":19.31},"width":27,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-10.png","element":"img","alt":"1N","inline":true,"padRight":true},{"text":") for ","element":"span"},{"style":{"height":15.2},"width":139.2,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-11.png","element":"img","alt":" m ̸= n","inline":true},{"text":". ","element":"span"},{"text":"From Eqs. ","element":"span"},{"href":"#id-44","text":"(E3) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-72","text":"(E4)","element":"a"},{"text":", we obtain","element":"span"}],[{"id":"id-74","style":{"width":"81%"},"width":800,"height":234,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-12.png","element":"img"}],[{"text":"Equation ","element":"span"},{"href":"#id-73","text":"(22) ","element":"a"},{"text":"follows from Eq. ","element":"span"},{"href":"#id-74","text":"(E5)","element":"a"},{"text":".","element":"span"}],[{"text":"Similarly, we can compute the mutual information between ","element":"span"},{"style":{"height":16},"width":65.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-13.png","element":"img","alt":" xi(t","inline":true,"padRight":true},{"text":"+ 1) and ","element":"span"},{"style":{"height":16},"width":133.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-14.png","element":"img","alt":" s(t − n","inline":true},{"text":") in the linear regime ","element":"span"},{"style":{"height":16.56},"width":105.76,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-15.png","element":"img","alt":"g2 ≪","inline":true,"padRight":true},{"text":"1 as follows. ","element":"span"},{"text":"Let ","element":"span"},{"style":{"height":18.06},"width":243.52,"height":45.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-16.png","element":"img","alt":" X := �∞m=0","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":20.46},"width":132.16,"height":51.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-17.png","element":"img","alt":"W mu)2i","inline":true,"padRight":true},{"text":"and","element":"span"}],[{"style":{"width":"97%"},"width":959,"height":1010,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-18.png","element":"img"}],[{"style":{"width":"99%"},"width":976,"height":202,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-19.png","element":"img"}],[{"text":"Since ","element":"span"},{"text":"E ","element":"span"},{"text":"[log ","element":"span"},{"text":"X","element":"span"},{"text":"] can be approximated as","element":"span"}],[{"style":{"width":"78%"},"width":773,"height":101,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-20.png","element":"img"}],[{"text":"and ","element":"span"},{"text":"a ","element":"span"},{"text":"similar ","element":"span"},{"text":"approximation can ","element":"span"},{"text":"be ","element":"span"},{"text":"obtained ","element":"span"},{"text":"for ","element":"span"},{"text":"E ","element":"span"},{"text":"[log ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-21.png","element":"img","alt":" Xn","inline":true},{"text":"], we obtain","element":"span"}],[{"id":"id-75","style":{"width":"99%"},"width":979,"height":276,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-22.png","element":"img"}],[{"style":{"height":11.27},"width":71.12,"height":28.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-23.png","element":"img","alt":"1−g4","inline":true,"padRight":true},{"text":"and Var [","element":"span"},{"style":{"height":25.47},"width":473.28,"height":63.68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-24.png","element":"img","alt":"Xn] = 2g41−g4 − 2g4n. E [Iot","inline":true},{"text":"] can be com- ","element":"span"},{"text":"puted by summing Eq. ","element":"span"},{"href":"#id-75","text":"(E8) ","element":"a"},{"text":"over ","element":"span"},{"style":{"height":12.8},"width":66.04,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-25.png","element":"img","alt":" n ≥","inline":true,"padRight":true},{"text":"1.","element":"span"}]]},{"heading":"ACKNOWLEDGMENTS","paragraphs":[[{"text":"The authors are grateful to the anonymous reviewers for their comments and suggestions that improved the manuscript. ","element":"span"},{"text":"TH was supported by JSPS KAKENHI Grant Number JP18K03423. KN was supported by JSPS KAKENHI Grant Number JP18H05472 and by MEXT Quantum Leap Flagship Program (MEXT QLEAP) Grant Number JPMXS0118067394. This work is partially based on results obtained from a project commissioned by the New Energy and Industrial Technology Development Organization (NEDO).","element":"span"}],[{"id":"id-0","text":"[1] H. Jaeger, “The “echo state” approach to analysing ","element":"span"},{"text":"and training recurrent neural networks,” (2001), GMDReport 148, GMD-German National Research Institute for Computer Science.","element":"span"}],[{"id":"id-1","text":"[2] W. Maass, T. Natschl¨ager, ","element":"span"},{"text":"and H. Markram, Neural Comput. ","element":"span"},{"text":"14","element":"span"},{"text":", 2531 (2002).","element":"span"}],[{"id":"id-2","text":"[3] H. Jaeger and H. Haas, Science ","element":"span"},{"text":"304","element":"span"},{"text":", 78 (2004).","element":"span"}],[{"text":"[4] D. Verstraeten, ","element":"span"},{"text":"M. Schrauwen, ","element":"span"},{"text":"B. D’Haene, ","element":"span"},{"text":"and D. Stroobandt, Neural Netw. ","element":"span"},{"text":"20","element":"span"},{"text":", 391 (2007).","element":"span"}],[{"text":"[5] M. Lukoˇseviˇcius and H. Jaeger, Comput. Sci. Rev. ","element":"span"},{"text":"3","element":"span"},{"text":", 127 (2009).","element":"span"}],[{"id":"id-3","text":"[6] J. Pathak, B. Hunt, M. Girvan, Z. Lu, and E. Ott, Phys. ","element":"span"},{"text":"Rev. Lett. ","element":"span"},{"text":"120","element":"span"},{"text":", 024102 (2018).","element":"span"}],[{"id":"id-4","text":"[7] L. Larger, M. C. Soriano, D. Brunner, L. Appeltant, J. M. ","element":"span"},{"text":"Gutierrez, L. Pesquera, C. R. Mirasso, ","element":"span"},{"text":"and I. Fischer, Optics Express ","element":"span"},{"text":"20","element":"span"},{"text":", 3241 (2012).","element":"span"}],[{"id":"id-5","text":"[8] L. Larger, A. Bayl´on-Fuentes, R. Martinenghi, V. S. ","element":"span"},{"text":"Udaltsov, Y. K. Chembo, ","element":"span"},{"text":"and M. Jacquot, Phys. Rev. X ","element":"span"},{"text":"7","element":"span"},{"text":", 011015 (2017).","element":"span"}],[{"id":"id-6","text":"[9] J. Torrejon, ","element":"span"},{"text":"M. Riou, ","element":"span"},{"text":"F. A. Araujo, ","element":"span"},{"text":"S. Tsunegi, G. ","element":"span"},{"text":"Khalsa, ","element":"span"},{"text":"D. ","element":"span"},{"text":"Querlioz, ","element":"span"},{"text":"P. ","element":"span"},{"text":"Bortolotti, ","element":"span"},{"text":"V. ","element":"span"},{"text":"Cros, K. Yakushiji, A. Fukushima, H. Kubota, S. Yuasa, M. D.","element":"span"}],[{"style":{"width":"100%"},"width":981,"height":1666,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.11213/images/11-26.png","element":"img"}],[{"text":"Stiles, and J. Grollier, Nature ","element":"span"},{"text":"547","element":"span"},{"text":", 428 (2017).","element":"span"}],[{"id":"id-7","text":"[10] S. Tsunegi, ","element":"span"},{"text":"T. Taniguchi, ","element":"span"},{"text":"K. Nakajima, ","element":"span"},{"text":"S. Miwa, K. Yakushiji, A. Fukushima, S. Yuasa, and H. Kubota, Appl. Phys. Lett. ","element":"span"},{"text":"114","element":"span"},{"text":", 164101 (2019).","element":"span"}],[{"id":"id-8","text":"[11] K. Nakajima, H. Hauser, T. Li, ","element":"span"},{"text":"and R. Pfeifer, Soft Robotics ","element":"span"},{"text":"5","element":"span"},{"text":", 339 (2018).","element":"span"}],[{"id":"id-9","text":"[12] K. Fujii and K. Nakajima, Phys. Rev. Appl. ","element":"span"},{"text":"8","element":"span"},{"text":", 024030 (2017).","element":"span"}],[{"id":"id-10","text":"[13] H. Jaeger, “Short term memory in echo state networks,” ","element":"span"},{"text":"(2002), GMD-Report 152, GMD-German National Research Institute for Computer Science.","element":"span"}],[{"id":"id-11","text":"[14] J. Dambre, D. Verstraeten, B. Schrauwen, and S. Mas- ","element":"span"},{"text":"sar, Sci. Rep. ","element":"span"},{"text":"2","element":"span"},{"text":", 514 (2012).","element":"span"}],[{"id":"id-12","text":"[15] N. Bertschinger and T. Natschl¨ager, Neural Comput. ","element":"span"},{"text":"16","element":"span"},{"text":", 1413 (2004).","element":"span"}],[{"id":"id-13","text":"[16] J. Boedecker, O. Obst, J. T. Lizier, N. M. Mayer, and ","element":"span"},{"text":"M. Asada, Theory Biosci. ","element":"span"},{"text":"131","element":"span"},{"text":", 205 (2012).","element":"span"}],[{"id":"id-14","text":"[17] I. Farkaˇs, R. Bos´ak, and P. Gerge","element":"span"},{"text":"ˇ","element":"span"},{"text":"l, Neural Netw. ","element":"span"},{"text":"83","element":"span"},{"text":", 109 (2016).","element":"span"}],[{"id":"id-15","text":"[18] O. L. White, D. D. Lee, and H. Sompolinsky, Phys. Rev. ","element":"span"},{"text":"Lett. ","element":"span"},{"text":"92","element":"span"},{"text":", 148102 (2004).","element":"span"}],[{"id":"id-16","text":"[19] A. Rodan and P. Tino, IEEE Trans Neural Netw. ","element":"span"},{"text":"22","element":"span"},{"text":", 131","element":"span"}],[{"text":"(2011).","element":"span"}],[{"id":"id-17","text":"[20] M. Hermans and B. Schrauwen, Neural Netw. ","element":"span"},{"text":"23","element":"span"},{"text":", 341 (2010).","element":"span"}],[{"id":"id-18","text":"[21] S. Marzen, Phys. Rev. E ","element":"span"},{"text":"96","element":"span"},{"text":", 032308 (2017).","element":"span"}],[{"id":"id-19","text":"[22] S. Ganguli, D. Huh, ","element":"span"},{"text":"and H. Sompolinsky, Proc. Natl. Acad. Sci. USA ","element":"span"},{"text":"105","element":"span"},{"text":", 18970 (2008).","element":"span"}],[{"id":"id-20","text":"[23] J. Schuecker, S. Goedeke, and M. Helias, Phys. Rev. X ","element":"span"},{"text":"8","element":"span"},{"text":", 041029 (2018).","element":"span"}],[{"id":"id-21","text":"[24] H. Sompolinsky, A. Crisanti, and H. J. Sommers, Phys. ","element":"span"},{"text":"Rev. Lett. ","element":"span"},{"text":"61","element":"span"},{"text":", 259 (1988).","element":"span"}],[{"id":"id-22","text":"[25] T. Toyoizumi and L. F. Abbott, Phys. Rev. E ","element":"span"},{"text":"84","element":"span"},{"text":", 051908 (2011).","element":"span"}],[{"text":"[26] B. Cessac and M. Samuelides, Eur. Phys. J. Spec. Top. ","element":"span"},{"text":"142","element":"span"},{"text":", 7 (2007).","element":"span"}],[{"id":"id-25","text":"[27] M. Massar and S. Massar, Phys. Rev. E ","element":"span"},{"text":"87","element":"span"},{"text":", 042809 (2013).","element":"span"}],[{"id":"id-23","text":"[28] L. Molgedey, J. Schuchhardt, and H. G. Schuster, Phys. ","element":"span"},{"text":"Rev. Lett. ","element":"span"},{"text":"69","element":"span"},{"text":", 3717 (1992).","element":"span"}],[{"id":"id-47","text":"[29] T. M. Cover and J. A. Thomas, ","element":"span"},{"text":"Elements of Information Theory, 2nd ed. ","element":"span"},{"text":"(John Wiley & Sons, Hoboken, NJ, 2006).","element":"span"}],[{"id":"id-60","text":"[30] M. Prokopenko, J. T. Lizier, O. Obst, and X. R. Wang, ","element":"span"},{"text":"Phys. Rev. E ","element":"span"},{"text":"84","element":"span"},{"text":", 041116 (2011).","element":"span"}],[{"id":"id-61","text":"[31] L. Livi, F. M. Bianchi, and C. Alippi, IEEE Trans. Neu- ","element":"span"},{"text":"ral Netw. Learn. Syst. ","element":"span"},{"text":"29","element":"span"},{"text":", 706 (2017).","element":"span"}],[{"id":"id-62","text":"[32] B. A. Marquez, L. Larger, M. Jacquot, Y. K. Chembo, ","element":"span"},{"text":"and D. Brunner, Sci. Rep. ","element":"span"},{"text":"8","element":"span"},{"text":", 3319 (2018).","element":"span"}],[{"id":"id-63","text":"[33] D. Sussillo and L. F. Abbott, Neuron ","element":"span"},{"text":"63","element":"span"},{"text":", 544 (2009).","element":"span"}],[{"id":"id-64","text":"[34] H. Jaeger, M. Lukoˇseviˇcius, D. Popovici, and U. Siewert, ","element":"span"},{"text":"Neural Netw. ","element":"span"},{"text":"20","element":"span"},{"text":", 335 (2007).","element":"span"}],[{"id":"id-65","text":"[35] M. Inubushi and K. Yoshimura, Sci. Rep. ","element":"span"},{"text":"7","element":"span"},{"text":", 10199 (2017).","element":"span"}]]}],"_version":"3.3.4"},"paperNode":"$28:props:children:props:children:0:props:product"}]]