36:[["$","audio",null,{"id":"tts"}],["$","$L3b",null,{"paperID":"1912.12728","publisher":"arxiv","paperJSON":{"title":"Discovery of Dynamics Using Linear Multistep Methods","paperID":"1912.12728","avgLineHeight":13.55,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"text":"Linear multistep methods (LMMs) are popular time discretization techniques for the numerical solution of differential equations. ","element":"span"},{"text":"Traditionally they are applied to solve for the state given the dynamics (the forward problem), but here we consider their application for learning the dynamics given the state (the inverse problem). This repurposing of LMMs is largely motivated by growing interest in data-driven modeling of dynamics, but the behavior and analysis of LMMs for discovery turn out to be significantly different from the well-known, existing theory for the forward problem. Assuming a highly idealized setting of being given the exact state with a zero residual of the discrete dynamics, we establish for the first time a rigorous framework based on refined notions of consistency and stability to yield convergence using LMMs for discovery. 
When applying these concepts to three popular ","element":"span"},{"style":{"height":10},"width":68.49,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/0-0.png","element":"img","alt":" M−","inline":true},{"text":"step LMMs, the Adams-Bashforth, Adams-Moulton, and Backwards Differentiation Formula schemes, the new theory suggests that Adams-Bashforth for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"ranging from 1 to 6, Adams-Moulton for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 0 and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 1, and Backwards Differentiation Formula for all positive ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"are convergent, and, otherwise, the methods are not convergent in general. In addition, we provide numerical experiments to both motivate and substantiate our theoretical analysis.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Key words. ","element":"span"},{"text":"discovery of dynamics, data-driven modeling, linear multistep methods, stability and convergence, root condition, learning dynamics, artificial intelligence","element":"span"}],[{"style":{"width":"59%"},"width":1052,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/0-1.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"1. Introduction. 
","element":"span"},{"text":"In this work, we focus on developing a new numerical analysis framework for the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"discovery ","element":"span"},{"text":"of dynamical systems with given states, where finitely many discrete measurements are used to approximately recover the unknown dynamical system – a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"data-driven ","element":"span"},{"text":"discovery of dynamics [","element":"span"},{"href":"#id-0","referenceIndex":5,"text":"5","element":"a"},{"text":", ","element":"span"},{"href":"#id-1","referenceIndex":44,"text":"44","element":"a"},{"text":"]. ","element":"span"},{"text":"Data-driven discovery of dynamical systems is experiencing a renaissance as the costs of sensors, data storage, and computational resources have decreased [","element":"span"},{"href":"#id-2","referenceIndex":42,"text":"42","element":"a"},{"text":"]. ","element":"span"},{"text":"Meanwhile, advancements in the fields of machine learning and data science [","element":"span"},{"href":"#id-3","referenceIndex":17,"text":"17","element":"a"},{"text":", ","element":"span"},{"href":"#id-4","referenceIndex":22,"text":"22","element":"a"},{"text":", ","element":"span"},{"href":"#id-5","referenceIndex":27,"text":"27","element":"a"},{"text":", ","element":"span"},{"href":"#id-6","referenceIndex":28,"text":"28","element":"a"},{"text":", ","element":"span"},{"href":"#id-7","referenceIndex":45,"text":"45","element":"a"},{"text":"] have brought renewed vigor and enabled an expansive view of this field. ","element":"span"},{"text":"At the same time, the growth of data-driven discovery of dynamical systems has also led to new solution methods and model reduction approaches for studying multiscale and high-dimensional complex problems. 
For more discussions, we refer to works such as [","element":"span"},{"href":"#id-8","referenceIndex":3,"text":"3","element":"a"},{"text":", ","element":"span"},{"href":"#id-9","referenceIndex":6,"text":"6","element":"a"},{"text":", ","element":"span"},{"href":"#id-10","referenceIndex":18,"text":"18","element":"a"},{"text":", ","element":"span"},{"href":"#id-11","referenceIndex":19,"text":"19","element":"a"},{"text":", ","element":"span"},{"href":"#id-12","referenceIndex":23,"text":"23","element":"a"},{"text":", ","element":"span"},{"href":"#id-13","referenceIndex":25,"text":"25","element":"a"},{"text":", ","element":"span"},{"href":"#id-14","referenceIndex":26,"text":"26","element":"a"},{"text":", ","element":"span"},{"href":"#id-15","referenceIndex":29,"text":"29","element":"a"},{"text":", ","element":"span"},{"href":"#id-16","referenceIndex":30,"text":"30","element":"a"},{"text":", ","element":"span"},{"href":"#id-17","referenceIndex":31,"text":"31","element":"a"},{"text":", ","element":"span"},{"href":"#id-18","referenceIndex":35,"text":"35","element":"a"},{"text":", ","element":"span"},{"href":"#id-19","referenceIndex":36,"text":"36","element":"a"},{"text":", ","element":"span"},{"href":"#id-20","referenceIndex":37,"text":"37","element":"a"},{"text":", ","element":"span"},{"href":"#id-21","referenceIndex":38,"text":"38","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-23","referenceIndex":40,"text":"40","element":"a"},{"text":", ","element":"span"},{"href":"#id-24","referenceIndex":41,"text":"41","element":"a"},{"text":", ","element":"span"},{"href":"#id-25","referenceIndex":43,"text":"43","element":"a"},{"text":", ","element":"span"},{"href":"#id-26","referenceIndex":48,"text":"48","element":"a"},{"text":", ","element":"span"},{"href":"#id-27","referenceIndex":50,"text":"50","element":"a"},{"text":", 
","element":"span"},{"href":"#id-28","referenceIndex":51,"text":"51","element":"a"},{"text":", ","element":"span"},{"href":"#id-29","referenceIndex":53,"text":"53","element":"a"},{"text":", ","element":"span"},{"href":"#id-30","referenceIndex":54,"text":"54","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":", ","element":"span"},{"href":"#id-32","referenceIndex":56,"text":"56","element":"a"},{"text":"].","element":"span"}],[{"style":{"width":"95%"},"width":1695,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/0-2.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"methods. ","element":"span"},{"text":"In this work, we consider using linear multistep methods (LMMs) to discover unspecified dynamics given the state at equidistant time steps and contribute to the fundamental theory of using LMMs for data-driven discovery. ","element":"span"},{"text":"Historically, LMMs have been developed as popular schemes for numerically integrating known dynamic systems [","element":"span"},{"href":"#id-33","referenceIndex":16,"text":"16","element":"a"},{"text":"], with well-established mathematical theory in the last century [","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":", ","element":"span"},{"href":"#id-35","referenceIndex":12,"text":"12","element":"a"},{"text":", ","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-37","referenceIndex":21,"text":"21","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":"]. 
Recent works","element":"span"}]]},{"heading":"2 KELLER AND DU.","paragraphs":[[{"text":"combine the classical numerical technique of linear multistep methods with neural networks ","element":"span"},{"id":"id-40","text":"for dynamics discovery [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-27","referenceIndex":50,"text":"50","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"].","element":"span"}],[{"id":"id-41","style":{"width":"88%"},"width":1563,"height":1320,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/1-0.png","element":"img"}],[{"text":"Figure 1: Absolute ","element":"figcaption","subtype":"caption"},{"style":{"height":15.02},"width":35.18,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/1-1.png","element":"img","alt":" ℓ2","inline":true},{"text":"-errors for the first coordinate of the 2D Damped Cubic System (","element":"figcaption","subtype":"caption"},{"href":"#id-39","text":"6.1","element":"a","subtype":"caption"},{"text":") on ","element":"figcaption","subtype":"caption"},{"style":{"height":17.6},"width":133.34,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/1-2.png","element":"img","alt":" t ∈ [0,","inline":true,"padRight":true},{"text":"5] with varying time mesh size ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"h ","element":"figcaption","subtype":"caption"},{"text":"= 0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"01","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":", 
","element":"figcaption","subtype":"caption"},{"text":"0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"02","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":", ","element":"figcaption","subtype":"caption"},{"text":"0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"03, using a single-hidden-layer neural network with a tanh activation function, as used in [","element":"figcaption","subtype":"caption"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a","subtype":"caption"},{"text":"], after a fixed number of training iterations for each ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"M","element":"figcaption","subtype":"caption"},{"text":".","element":"figcaption","subtype":"caption"}],[{"text":"In the approach coined “LMNet,” LMMs are combined with neural networks for discovery of dynamics in [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-27","referenceIndex":50,"text":"50","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"]. Figure ","element":"span"},{"href":"#id-40","text":"1 ","element":"a"},{"text":"shows the absolute errors associated with learning ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"for a nonlinearly damped 2D cubic oscillator (","element":"span"},{"href":"#id-39","text":"6.1","element":"a"},{"text":") using neural networks with three representative schemes of LMMs – Adams-Moulton (AM), Adams-Bashforth (AB), and Backwards Differentiation Formula (BDF). 
These results are generated using the code repository built for [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":"]; reported are the errors of the dynamics rather than the integrated dynamics, which are shown in [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":"]. For solving differential equations with smooth solutions, increasing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"corresponds to higher accuracy if the scheme is also stable. The AM scheme is an example of such a method; hence, the perplexing behavior in the errors of AM as observed in [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"] (see Tables 1 and 2 of [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":"]","element":"span"}]]},{"heading":"DISCOVERY OF DYNAMICS 3","paragraphs":[[{"text":"and Table 1 of [","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"]). As ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"increases and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"decreases, the errors do not decrease. Further, as we expand the width, thereby increasing the expressibility of the network, the scheme still does not exhibit stable behavior. 
On the other hand, as shown in Figure ","element":"span"},{"href":"#id-40","text":"1","element":"a"},{"text":", the AB and BDF methods with a fixed network size of 256 show a trend of convergence as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"and the mesh size ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"decrease, while the AM methods show erratic behavior for the same width, persistent even with more expressibility of the network by widening the hidden layer (Figure ","element":"span"},{"href":"#id-41","text":"1b","element":"a"},{"text":"). Since AM is a stable method as a time integrator, these findings warrant further investigation. Indeed, it has also been observed by others that increased resolution does not necessarily imply better neural network representation and prediction without a mathematically sound formulation of the learning problem [","element":"span"},{"href":"#id-8","referenceIndex":3,"text":"3","element":"a"},{"text":"]. While there are many contributing factors such as the neural network structure and size as well as the training process, it is the goal of this paper to investigate these findings and provide a theoretical explanation of the phenomena.","element":"span"}],[{"text":"$3c","element":"span"},{"href":"#id-40","text":"1 ","element":"a"},{"text":"and lays a rigorous foundation for further elucidating the effect of neural networks on dynamics discovery via LMMs through follow-up studies. Therefore, this helps the scientific community broadly in our goal of making machine learning more transparent, explainable, stable and trustworthy.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.2. Summary of Results. 
","element":"span"},{"text":"We present a framework in Section ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"consisting of nuanced notions of consistency and stability to handle unique challenges presented by using LMMs for discovery. These concepts are then combined to prove convergence. A set of algebraic criteria is developed to check the consistency and stability, and thus convergence, of LMMs for dynamics discovery. With this foundation, in Theorems ","element":"span"},{"href":"#id-43","text":"4.1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-44","text":"4.2 ","element":"a"},{"text":", we outline consistency and stability properties of the Adams-Bashforth, Adams-Moulton, and Backwards Differentiation Formula schemes and, consequently, in Corollary ","element":"span"},{"href":"#id-45","text":"4.3","element":"a"},{"text":", their convergence guarantees.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"1.3. Outline. ","element":"span"},{"text":"This paper is organized as follows. In Section ","element":"span"},{"href":"#id-46","text":"2 ","element":"a"},{"text":"we briefly review LMMs and their theory for solving ordinary differential equations, including the standard notions in numerical analysis of truncation error, consistency, stability, and convergence, along with","element":"span"}]]},{"heading":"4 KELLER AND DU.","paragraphs":[[{"text":"an algebraic root condition for stability. ","element":"span"},{"text":"In Section ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"we frame the problem of discovery using LMMs and develop nuanced versions of consistency and stability for discovery. 
","element":"span"},{"text":"In particular, in Section ","element":"span"},{"href":"#id-47","text":"3.3","element":"a"},{"text":", we discuss how truncation error for discovery is inherited from the forward problem and introduce a stronger notion of consistency; in Section ","element":"span"},{"href":"#id-48","text":"3.4 ","element":"a"},{"text":"we refine the traditional definition of stability and the algebraic root condition, and we establish equivalence theorems connecting the root conditions and the refined notions of stability. In Section ","element":"span"},{"href":"#id-49","text":"4","element":"a"},{"text":", the discovery framework of Section ","element":"span"},{"href":"#id-42","text":"3 ","element":"a"},{"text":"is applied to characterize convergence properties of the Adams-Bashforth, Adams-Moulton, and Backwards Differentiation Formula schemes. Long-time dynamics discovery is discussed in Section ","element":"span"},{"href":"#id-50","text":"5","element":"a"},{"text":". Then, in Section ","element":"span"},{"href":"#id-51","text":"6","element":"a"},{"text":", we present the results of numerical experiments. Finally, in Section ","element":"span"},{"href":"#id-52","text":"7","element":"a"},{"text":", we summarize the results and discuss future directions.","element":"span"}],[{"id":"id-46","style":{"fontWeight":"bold"},"text":"2. LMMs: Quick Review. ","element":"span"},{"text":"In this section, we introduce notation used throughout this work and briefly review the theory of LMMs as time integrators. 
","element":"span"},{"text":"While LMMs are well-documented in standard textbooks for solving ordinary differential equations (see [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":", ","element":"span"},{"href":"#id-37","referenceIndex":21,"text":"21","element":"a"},{"text":"]), we include the salient points to facilitate direct comparison with the new theory for the discovery of unknown dynamics developed in the next section.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.1. LMMs: Notation and Concepts. ","element":"span"},{"text":"Consider the ordinary differential equation (ODE)","element":"span"}],[{"id":"id-53","style":{"width":"71%"},"width":1265,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":19.53},"width":418.13,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-1.png","element":"img","alt":" x ∈ C∞(0, ∞)d and f","inline":true,"padRight":true},{"text":"is assumed to be a Lipschitz continuous, smooth, and bounded function. 
Discretizing the model problem (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":"), we assume a grid on the interval [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] defined to be a set of points: ","element":"span"},{"style":{"height":15.1},"width":551.02,"height":37.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-2.png","element":"img","alt":" a = t0 < t1 < · · · < tN = b","inline":true,"padRight":true},{"text":"with equidistant mesh ","element":"span"},{"style":{"height":16.22},"width":321.56,"height":40.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-3.png","element":"img","alt":" tn+1 − tn = h =","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":17.6},"width":868.98,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-4.png","element":"img","alt":"b − a)/(N + 1), n ∈ {0, 1, . . . , N}. Let [ a, b ]h","inline":true,"padRight":true},{"text":"denote this ordered set. 
We denote the set of grid functions Γ","element":"span"},{"href":"#id-36","referenceIndex":15,"style":{"height":37.61},"width":1428.33,"height":94.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-5.png","element":"img","alt":"h[ a, b ] =�z | z ∈ R(N+1)×d, zn = z(tn) ∈ Rd, tn ∈ [ a, b ]h�[15].An M","inline":true},{"text":"-step LMM approximates the ","element":"span"},{"style":{"height":19.53},"width":401.32,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-6.png","element":"img","alt":" nth value xn = x(tn","inline":true},{"text":") in terms of the previous ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"(","element":"span"},{"style":{"height":14.4},"width":93.21,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-7.png","element":"img","alt":"M ≥","inline":true,"padRight":true},{"text":"1) time steps ","element":"span"},{"href":"#id-36","referenceIndex":15,"style":{"height":17.6},"width":852.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-8.png","element":"img","alt":" xn−1, xn−2, . . . , xn−M [15, 32, 2, 21]. An M−","inline":true},{"text":"step linear multistep method is given by","element":"span"}],[{"style":{"width":"80%"},"width":1429,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-9.png","element":"img"}],[{"id":"id-54","text":"where ","element":"span"},{"style":{"height":17.6},"width":231.73,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-10.png","element":"img","alt":" x ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"], the coefficients ","element":"span"},{"style":{"height":16.8},"width":1044.09,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-11.png","element":"img","alt":" αm, βm ∈ R for m = 0, 1, . . . , M, and α0 ̸= 0. 
The","inline":true,"padRight":true},{"text":"function ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"is assumed to be given and Lipschitz, and the LMM scheme (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") defines an iterative procedure stepping forward in the independent variable ","element":"span"},{"style":{"height":17.6},"width":158.17,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-12.png","element":"img","alt":" t ∈ [ a, b","inline":true,"padRight":true},{"text":"] to solve for ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") at the gridpoints. Associated with an ","element":"span"},{"style":{"height":12},"width":81.09,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-13.png","element":"img","alt":" M−","inline":true},{"text":"step LMM are its first and second characteristic polynomials, given, respectively, by","element":"span"}],[{"id":"id-60","style":{"width":"77%"},"width":1370,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-14.png","element":"img"}],[{"text":"where it is assumed that ","element":"span"},{"href":"#id-38","referenceIndex":32,"style":{"height":17.6},"width":221.29,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/3-15.png","element":"img","alt":" α0 ̸= 0 [32].","inline":true}]]},{"heading":"DISCOVERY OF DYNAMICS 5","paragraphs":[[{"text":"For the numerical integration of differential equations, the method (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") is called explicit if ","element":"span"},{"style":{"height":16.4},"width":41.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-0.png","element":"img","alt":" 
β0","inline":true,"padRight":true},{"text":"= 0 and implicit otherwise [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"]. Implicit methods require a nonlinear solver for the resulting system of equations, whereas explicit methods do not. Existence and uniqueness of solutions in the case of implicit schemes are shown in [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-37","referenceIndex":21,"text":"21","element":"a"},{"text":"]. For both implicit and explicit methods, a kickstarting method for the initial ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"values must be chosen, and so a critical component of analyzing any multistep scheme is to understand how errors in the initial values pollute the subsequent calculations [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]. 
This aspect of numerical methods is called numerical stability [","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"].","element":"span"}],[{"text":"Finally, for any index set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"S ","element":"span"},{"text":"with cardinality ","element":"span"},{"text":"¯","element":"span"},{"style":{"height":19.16},"width":801.23,"height":47.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-1.png","element":"img","alt":"S, we let ∥z∥1 = �i∈S |zi| and ∥z∥∞ =","inline":true,"padRight":true},{"text":"max","element":"span"},{"style":{"height":17.6},"width":126.18,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-2.png","element":"img","alt":"i∈S |zi|","inline":true,"padRight":true},{"text":"denote the standard discrete norms for any vector ","element":"span"},{"style":{"fontWeight":"bold"},"text":"z ","element":"span"},{"text":"naturally embedded in ","element":"span"},{"style":{"height":17.16},"width":98.24,"height":42.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-3.png","element":"img","alt":" R ¯S×d","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":17.6},"width":59.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-4.png","element":"img","alt":" |zi|","inline":true,"padRight":true},{"text":"can be any vector norm of ","element":"span"},{"style":{"height":17.75},"width":141.66,"height":44.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-5.png","element":"img","alt":" zi ∈ Rd","inline":true},{"text":". 
The same notations are used also for discrete grid functions given either in Γ","element":"span"},{"style":{"height":17.6},"width":95.13,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-6.png","element":"img","alt":"h[a, b","inline":true},{"text":"] or its subsets.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Remark ","element":"span"},{"text":"2.1. ","element":"span"},{"text":"To fix ideas, we use the hat notation ˆ to mark grid functions generated by the discretization (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":"). In the forward problem, the state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") is iteratively produced by LMMs, and hence we study ˆ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":", whereas for dynamics discovery, we study ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f","element":"span"},{"text":", see Section ","element":"span"},{"href":"#id-42","text":"3","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.2. The Adams Family and BDF. ","element":"span"},{"text":"Adams-Bashforth (AB), Adams-Moulton (AM), and the Backwards Differentiation Formula (BDF) are three popular multistep method schemes that arise from a Lagrange interpolating polynomial of the state or dynamics at time ","element":"span"},{"style":{"height":15.6},"width":151.17,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-7.png","element":"img","alt":" tn using","inline":true,"padRight":true},{"text":"data from previous time steps. Without loss of generality, we consider the scalar model problem in this section; for higher dimensions, the theory need only be applied in each dimension. 
Let Λ","element":"span"},{"style":{"height":17.6},"width":1241.18,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-8.png","element":"img","alt":"0 = {−M, −M + 1, . . . , −1, 0} and Λ1 = {−M, −M + 1, . . . , −1}","inline":true},{"text":". The Lagrange interpolating polynomial of a function ","element":"span"},{"style":{"height":12.4},"width":205.08,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-9.png","element":"img","alt":" u : R → R","inline":true,"padRight":true},{"text":"over the set ","element":"span"},{"style":{"height":20.41},"width":244.23,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-10.png","element":"img","alt":" {tn+i, i ∈ ˜Λ}","inline":true,"padRight":true},{"text":"is the polynomial of degree ","element":"span"},{"style":{"height":18.63},"width":268.02,"height":46.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-11.png","element":"img","alt":" M for ˜Λ = Λ0","inline":true,"padRight":true},{"text":"and degree ","element":"span"},{"style":{"height":18.63},"width":344.12,"height":46.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-12.png","element":"img","alt":" M − 1 for ˜Λ = Λ1","inline":true,"padRight":true},{"text":"obtained from the linear combination of basis functions","element":"span"}],[{"style":{"width":"70%"},"width":1250,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-13.png","element":"img"}],[{"text":"with ","element":"span"},{"style":{"height":20.41},"width":425.47,"height":51.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-14.png","element":"img","alt":" u(tn+k) for each k ∈ ˜","inline":true},{"text":"Λ as the coefficient of the linear combination. 
The M-step Adams-Moulton (or AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":") and Adams-Bashforth (or AB-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":") schemes are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":"-step LMMs that arise from interpolating the dynamics ","element":"span"},{"style":{"height":17.6},"width":169.19,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-15.png","element":"img","alt":" f(x(tn)),","inline":true,"padRight":true},{"text":"with Lagrange interpolating polynomials corresponding to ","element":"span"},{"text":"˜","element":"span"},{"text":"Λ = Λ","element":"span"},{"style":{"height":18.63},"width":257.03,"height":46.58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-16.png","element":"img","alt":"0 and ˜Λ = Λ1","inline":true},{"text":", respectively, and then applying the fundamental theorem of calculus to the model problem (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":"). 
Letting ","element":"span"},{"style":{"height":17.6},"width":375.15,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-17.png","element":"img","alt":" f(tn) denote f(x(tn","inline":true},{"text":")) for brevity, we have","element":"span"}],[{"id":"id-56","style":{"width":"74%"},"width":1328,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-18.png","element":"img"}],[{"text":"BDF-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":", on the other hand, is an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":"-step LMM for ","element":"span"},{"style":{"height":14.4},"width":98.32,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-19.png","element":"img","alt":" M ≥","inline":true,"padRight":true},{"text":"1 derived from interpolating the state ","element":"span"},{"href":"#id-53","style":{"height":17.6},"width":352.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-20.png","element":"img","alt":" x ∈ Γh[a, b] in (2.1","inline":true},{"text":") directly on the lattice Λ","element":"span"},{"style":{"height":15.6},"width":179.24,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-21.png","element":"img","alt":"0, so that","inline":true}],[{"style":{"width":"47%"},"width":839,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/4-22.png","element":"img"}]]},{"heading":"6 KELLER AND DU.","paragraphs":[[{"text":"By the change of variables ","element":"span"},{"style":{"height":17.6},"width":320,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-0.png","element":"img","alt":" u = (t − tn−1)/h","inline":true},{"text":", we have a scaled Lagrange interpolating polynomial, denoted 
","element":"span"},{"style":{"height":20.45},"width":226.38,"height":51.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-1.png","element":"img","alt":" ℓhk, given by","inline":true}],[{"id":"id-55","style":{"width":"68%"},"width":1218,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-2.png","element":"img"}],[{"text":"With (","element":"span"},{"href":"#id-55","text":"2.6","element":"a"},{"text":"), the integrand of (","element":"span"},{"href":"#id-56","text":"2.5","element":"a"},{"text":") may be written independent of the time step, so that","element":"span"}],[{"style":{"width":"73%"},"width":1303,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-3.png","element":"img"}],[{"text":"The simplified coefficients for the BDF method with uniform mesh can be obtained similarly.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.3. Truncation Error and Consistency. ","element":"span"},{"text":"In this section, we introduce the residual and notions related to analytical error for LMMs. The residual operator is given by [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]:","element":"span"}],[{"id":"id-74","style":{"width":"87%"},"width":1546,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-4.png","element":"img"}],[{"text":"defined for ˆ","element":"span"},{"style":{"height":17.6},"width":209.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-5.png","element":"img","alt":"x ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"]. 
How accurately the discretization (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") approximates the solution of (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") is measured by the truncation error, defined below.","element":"span"}],[{"id":"id-106","href":"#id-38","referenceIndex":32,"style":{"height":17.6},"width":1420.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-6.png","element":"img","alt":"Definition 2.2 (Local Truncation Error [32, 2, 21, 15]). Let x ∈ Γh[a, b","inline":true},{"text":"] be the exact solution of the dynamic system (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") defined at the grid coordinates. The local truncation error ","element":"span"},{"style":{"height":20.33},"width":953.54,"height":50.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-7.png","element":"img","alt":" τ h = ((τ h)M, (τ h)M+1, . . . , (τ h)N) ∈ R(N−M+1)×d ","inline":true,"padRight":true},{"text":"is given by","element":"span"}],[{"id":"id-72","style":{"width":"74%"},"width":1317,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-8.png","element":"img"}],[{"text":"For smooth functions ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":", we have","element":"span"}],[{"style":{"width":"64%"},"width":1143,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-9.png","element":"img"}],[{"text":"where","element":"span"}],[{"id":"id-59","style":{"width":"96%"},"width":1713,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-10.png","element":"img"}],[{"text":"Now, we proceed to define order of error and the notion of 
consistency.","element":"span"}],[{"id":"id-65","style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"2.3 ","element":"span"},{"text":"(Order of Error [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]). ","element":"span"},{"text":"A linear multistep method has error order of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p ","element":"span"},{"text":"if ","element":"span"},{"style":{"height":18.36},"width":451.26,"height":45.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-11.png","element":"img","alt":"∥τ h∥∞ = O(hp) as h →","inline":true,"padRight":true},{"text":"0 and admits a ","element":"span"},{"style":{"height":17.6},"width":707.42,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-12.png","element":"img","alt":" principal error function e(t) ∈ C[ a, b","inline":true,"padRight":true},{"text":"] provided","element":"span"}],[{"style":{"width":"54%"},"width":962,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-13.png","element":"img"}],[{"id":"id-57","text":"or simply, ","element":"span"},{"style":{"height":19.89},"width":477.68,"height":49.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/5-14.png","element":"img","alt":" ∥τ h − hpe∥∞ = O(hp+1).","inline":true}]]},{"heading":"DISCOVERY OF DYNAMICS 7","paragraphs":[[{"style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"2.4 ","element":"span"},{"text":"(Consistency [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]). 
","element":"span"},{"text":"A linear multistep method is consistent with the differential equation provided ","element":"span"},{"style":{"height":18.36},"width":414.28,"height":45.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-0.png","element":"img","alt":" ∥τ h∥∞ → 0 as h → 0.","inline":true}],[{"text":"The Adams family and BDF are consistent in the sense of Definition ","element":"span"},{"href":"#id-57","text":"2.4","element":"a"},{"text":". Moreover, the local truncation error associated with the ","element":"span"},{"style":{"height":12},"width":81.1,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-1.png","element":"img","alt":" M−","inline":true},{"text":"step AB and BDF schemes are ","element":"span"},{"style":{"height":19.53},"width":307.93,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-2.png","element":"img","alt":" O(hM), whereas","inline":true,"padRight":true},{"text":"for the ","element":"span"},{"style":{"height":12},"width":81.09,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-3.png","element":"img","alt":" M−","inline":true},{"text":"step AM, the local truncation error is ","element":"span"},{"href":"#id-38","referenceIndex":32,"style":{"height":19.53},"width":319.19,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-4.png","element":"img","alt":" O(hM+1) [32, 2].","inline":true}],[{"text":"It is well-known that consistency can be formulated algebraically in terms of the characteristic polynomials [","element":"span"},{"href":"#id-58","referenceIndex":11,"text":"11","element":"a"},{"text":"]. 
In particular, the consistency condition, i.e., ","element":"span"},{"href":"#id-59","style":{"height":17.6},"width":426.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-5.png","element":"img","alt":" C0 = C1 = 0 in (2.10),","inline":true,"padRight":true},{"text":"is equivalent to ","element":"span"},{"style":{"height":12},"width":23,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-6.png","element":"img","alt":" ρ","inline":true},{"text":"(1) = 0 and ","element":"span"},{"style":{"height":17.6},"width":172.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-7.png","element":"img","alt":" ρ′(1) = σ","inline":true},{"text":"(1). Moreover, the truncation error is order ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"if","element":"span"}],[{"id":"id-62","style":{"width":"71%"},"width":1273,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-8.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"2.4. Stability and the Root Condition. ","element":"span"},{"text":"In this section, we review definitions of stability and the root condition for LMMs. Stability is defined as follows.","element":"span"}],[{"href":"#id-36","referenceIndex":15,"style":{"height":17.6},"width":894.43,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-9.png","element":"img","alt":"Definition 2.5 (Stability [15]). 
A linear M−","inline":true},{"text":"step method for the ordinary differential equation ˙","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t, ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":")) is called stable on [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] provided there exists a constant ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K ","element":"span"},{"text":"not depending on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"such that, for any two grid functions ","element":"span"},{"style":{"height":17.6},"width":279.23,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-10.png","element":"img","alt":" u, v ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"], we have for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"sufficiently small","element":"span"}],[{"style":{"width":"60%"},"width":1067,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-11.png","element":"img"}],[{"text":"The characteristic polynomials defined in (","element":"span"},{"href":"#id-60","text":"2.3","element":"a"},{"text":") may be used to determine the stability of a linear multistep method via the root condition.","element":"span"}],[{"id":"id-61","style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"2.6 ","element":"span"},{"text":"(Algebraic Root Condition [","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", 
","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"] ). ","element":"span"},{"text":"A polynomial satisfies the root condition provided the roots of the polynomial do not exceed magnitude 1, and those of magnitude 1 are simple.","element":"span"}],[{"text":"The following theorem states the equivalence between the stability and the root condition.","element":"span"}],[{"id":"id-81","text":"Theorem 2.7 (Stability and the Root Condition [","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", ","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A linear multistep method is stable if and only if its first characteristic polynomial ","element":"span"},{"style":{"height":17.6},"width":78.74,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-12.png","element":"img","alt":" ρ(z)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies the algebraic root condition given by Definition ","element":"span"},{"href":"#id-61","style":{"fontStyle":"italic"},"text":"2.6","element":"a"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"text":"Note that all AB and AM schemes satisfy the root condition and are stable by Definition ","element":"span"},{"href":"#id-62","text":"2.5","element":"a"},{"text":", whereas BDF-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"satisfies the root condition and is stable only for 1 ","element":"span"},{"href":"#id-37","referenceIndex":21,"style":{"height":17.6},"width":267.58,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-13.png","element":"img","alt":" ≤ M ≤ 6 [21].","inline":true}],[{"style":{"fontWeight":"bold"},"text":"2.5. Convergence. 
","element":"span"},{"text":"Finally, we introduce the definition of convergence for LMMs and the celebrated equivalence theorem for determining it.","element":"span"}],[{"id":"id-63","style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"2.8 ","element":"span"},{"text":"(Convergence [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]). ","element":"span"},{"text":"Consider the initial value problem (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") and a fixed linear multistep method defined by (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":"). Let ˆ","element":"span"},{"style":{"height":17.6},"width":213.8,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-14.png","element":"img","alt":"x ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"] be the grid function obtained by applying (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") on a uniform, real-valued grid of [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] with mesh size ","element":"span"},{"style":{"height":17.6},"width":478.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-15.png","element":"img","alt":" h, and let x ∈ Γh[ a, b ] be","inline":true,"padRight":true},{"text":"the exact solution of (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") at the grid points. 
The linear multistep method is said to converge on [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] if","element":"span"}],[{"style":{"width":"71%"},"width":1265,"height":69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/6-16.png","element":"img"}]]},{"heading":"8 KELLER AND DU.","paragraphs":[[{"id":"id-64","text":"With Definition ","element":"span"},{"href":"#id-63","text":"2.8","element":"a"},{"text":", one can obtain the Dahlquist Equivalence Theorem, Theorem ","element":"span"},{"href":"#id-64","text":"2.9 ","element":"a"},{"text":"[","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":"].","element":"span"}],[{"text":"Theorem 2.9 (Equivalence Theorem [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"]). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The multistep method ","element":"span"},{"text":"(","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"converges in the sense of Definition ","element":"span"},{"href":"#id-63","style":{"fontStyle":"italic"},"text":"2.8 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"for all Lipschitz ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"style":{"fontStyle":"italic"},"text":"if and only if it is consistent and stable.","element":"span"}],[{"text":"From the Equivalence Theorem, it can be shown that the order of the error ","element":"span"},{"style":{"height":18.36},"width":230.25,"height":45.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-0.png","element":"img","alt":" ∥x − ˆx∥∞ is","inline":true,"padRight":true},{"text":"the same order as the truncation error (Definition ","element":"span"},{"href":"#id-65","text":"2.3","element":"a"},{"text":") and thus the order of approximation, 
provided the initial error max","element":"span"},{"style":{"height":18.44},"width":397.67,"height":46.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-1.png","element":"img","alt":"0≤k≤M−1 |ˆxk − x(tk)|","inline":true,"padRight":true},{"text":"is also of the same order.","element":"span"}],[{"text":"In this work, we develop an analogous theory for multistep methods, modifying these theorems to address the discovery of dynamics rather than the solution of the differential equation. In particular, we show how the second characteristic polynomial determines stability for discovery, and we establish which members of the Adams family and BDF are stable in this setting.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3. Discovery of Dynamics. ","element":"span"},{"text":"In this study, we consider a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"data-driven ","element":"span"},{"text":"technique to solve ","element":"span"},{"id":"id-42","text":"for the dynamics ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"given information on the state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"at equidistant time steps [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":"]. We first introduce the problem and then discuss the notions of consistency, stability, and convergence. We now proceed to define the problem of LMMs for discovery.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.1. Problem Definition. ","element":"span"},{"text":"Following earlier discussions, we are concerned with the initial value problem (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":"). 
In this section and the next, multivariate functions representing the continuum models are denoted by scalar notations, i.e., ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":") and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":"), so that boldface symbols can be reserved for vectors corresponding to discrete forms of dynamics, which should be clear in context without ambiguity. ","element":"span"},{"text":"The task of learning is to produce a function to approximately represent the dynamics, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":"), based on a set of observed states, that conforms with the discrete dynamics described by a linear multistep method. In practice, one often encounters situations with only partial (incomplete) data or data containing observation errors and uncertainties; these complications are typical for inverse problems. 
When combined with deep networks, the approximation is produced by a network in a learned parametrized form, which introduces further approximations as well as implicit regularizations.","element":"span"}],[{"text":"As a first step toward a rigorous numerical analysis framework, we consider a highly idealized setting in this work by assuming that (A1) a complete set of exact values of the state, ","element":"span"},{"style":{"height":19.81},"width":317.9,"height":49.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-2.png","element":"img","alt":"{xn = x(tn)}Nn=0","inline":true},{"text":", is given at equally spaced, ordered grid points ","element":"span"},{"style":{"height":19.81},"width":145.82,"height":49.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-3.png","element":"img","alt":" {tn}Nn=0","inline":true},{"text":"; (A2) the neural ","element":"span"},{"text":"networks (or the underlying function classes used to represent the dynamics) have sufficient approximation capability to produce zero residual for the discrete dynamical system; and (A3) approximate values of the exact dynamics at some observed initial states are available.","element":"span"}],[{"text":"Although these assumptions make the setting highly idealized, the study is a constructive step toward understanding the mathematical and computational issues in data-driven modeling with neural networks and discretized forms of the unknown dynamics, which are the focus of our ongoing work. The findings made here shed light on future studies of similar issues under more realistic conditions, as discussed in Section ","element":"span"},{"href":"#id-66","text":"3.2 ","element":"a"},{"text":"and further in Section ","element":"span"},{"href":"#id-52","text":"7","element":"a"},{"text":". 
","element":"span"},{"text":"Under the assumptions (A1), (A2), and (A3) stated above, the procedure of learning dynamics can be described as follows. Given ","element":"span"},{"style":{"height":17.6},"width":488.42,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-4.png","element":"img","alt":" xn = x(tn) for 0 ≤ n ≤ N","inline":true,"padRight":true},{"text":"and ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.19},"width":41.6,"height":40.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-5.png","element":"img","alt":"f i","inline":true,"padRight":true},{"text":"as suitable approximations of ","element":"span"},{"style":{"height":17.6},"width":197.63,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-6.png","element":"img","alt":" f(xi) for i","inline":true,"padRight":true},{"text":"in a suitable subset of ","element":"span"},{"style":{"height":17.6},"width":402.01,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-7.png","element":"img","alt":" {0 ≤ i ≤ M − 1}, we","inline":true,"padRight":true},{"text":"have zero residuals for the discrete dynamics based on the LMM discretization for ","element":"span"},{"style":{"height":15.02},"width":141.06,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/7-8.png","element":"img","alt":" tn with","inline":true}]]},{"heading":"DISCOVERY OF DYNAMICS 9","paragraphs":[[{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M, . . . 
, N","element":"span"},{"text":", i.e.,","element":"span"}],[{"style":{"width":"60%"},"width":1068,"height":129,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-0.png","element":"img"}],[{"text":"Indeed, the above system for ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"is simply (","element":"span"},{"href":"#id-54","text":"2.2","element":"a"},{"text":") rewritten for learning the dynamics rather than the state. For later discussion, we let ","element":"span"},{"style":{"height":14.7},"width":287.98,"height":36.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-1.png","element":"img","alt":" NM = N − M","inline":true,"padRight":true},{"text":"+ 1 denote the number of linear equations in the system. Because the values of ","element":"span"},{"style":{"height":19.81},"width":173.57,"height":49.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-2.png","element":"img","alt":" {βm}Mm=0 ","inline":true,"padRight":true},{"text":"affect the structure of the ","element":"span"},{"text":"resulting system, we let ","element":"span"},{"style":{"height":15.02},"width":211.76,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-3.png","element":"img","alt":" m0 and M0","inline":true,"padRight":true},{"text":"be the smallest and the largest index, respectively, among those ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"’s satisfying ","element":"span"},{"style":{"height":16.8},"width":218.95,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-4.png","element":"img","alt":" βm ≠ 0, i.e.","inline":true}],[{"style":{"width":"86%"},"width":1539,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-5.png","element":"img"}],[{"text":"We collect the ordered coefficients of the 
LMM scheme in the vectors ","element":"span"},{"style":{"height":17.6},"width":408.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-6.png","element":"img","alt":" α = (α0, α1, . . . , αM)","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":17.67},"width":516.33,"height":44.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-7.png","element":"img","alt":" β = (βm0, βm0+1, . . . , βM0).","inline":true,"padRight":true},{"text":"The system for ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"in this reduced notation is then","element":"span"}],[{"id":"id-67","style":{"width":"80%"},"width":1429,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-8.png","element":"img"}],[{"text":"For brevity, we introduce the index sets ","element":"span"},{"style":{"height":17.6},"width":908.24,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-9.png","element":"img","alt":" I = {n ∈ N | M − m0 ≤ n ≤ N − m0} for the","inline":true,"padRight":true},{"text":"set of indices of the grid associated with the values of unknown dynamics and ","element":"span"},{"style":{"height":17.6},"width":233.77,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-10.png","element":"img","alt":" IM := {n ∈","inline":true},{"style":{"height":17.6},"width":558.96,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-11.png","element":"img","alt":"N | M − M0 ≤ n < M − m0}","inline":true,"padRight":true},{"text":"for the set of indices for supplied initial dynamics. 
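The index bookkeeping above can be sketched in a few lines of Python. The sketch assumes the LMM is written so that the coefficient β_m multiplies the dynamics at grid index n − m for n = M, ..., N; the helper name lmm_index_sets and the sample coefficient lists are hypothetical.

```python
def lmm_index_sets(beta, N):
    # beta: coefficients beta_0, ..., beta_M (assumed to multiply f at index n - m).
    # N: index of the last grid point, so the states are x_0, ..., x_N.
    M = len(beta) - 1
    nonzero = [m for m, b in enumerate(beta) if b != 0]
    m0, M0 = nonzero[0], nonzero[-1]
    unknown = list(range(M - m0, N - m0 + 1))  # index set I
    starting = list(range(M - M0, M - m0))     # index set I_M
    return m0, M0, unknown, starting

# AB-2 in the assumed convention: beta = (0, 3/2, -1/2).
ab2 = lmm_index_sets([0.0, 1.5, -0.5], N=10)
# BDF-2 in the assumed convention: only beta_0 is nonzero.
bdf2 = lmm_index_sets([2.0 / 3.0, 0.0, 0.0], N=10)
```

Under this convention, AB-2 needs one supplied starting value of the dynamics (index 0), whereas BDF-2 needs none, and in both cases the number of unknowns equals N − M + 1.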
The linear system (","element":"span"},{"href":"#id-67","text":"3.1","element":"a"},{"text":") may be written in compact matrix-vector form:","element":"span"}],[{"id":"id-68","style":{"width":"59%"},"width":1053,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-12.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.6},"width":342.61,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-13.png","element":"img","alt":" A is the NM × (N","inline":true,"padRight":true},{"text":"+ 1) matrix of coefficients for ","element":"span"},{"style":{"height":8},"width":33,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-14.png","element":"img","alt":" α","inline":true,"padRight":true},{"text":"corresponding to ","element":"span"},{"href":"#id-67","style":{"height":17.6},"width":346.17,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-15.png","element":"img","alt":" xn−m in (3.1); the","inline":true,"padRight":true},{"text":"matrix ","element":"span"},{"style":{"height":14.7},"width":355,"height":36.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-16.png","element":"img","alt":" B is an NM × NM","inline":true,"padRight":true},{"text":"banded lower-triangular matrix with its diagonal entries given by ","element":"span"},{"style":{"height":17.19},"width":269.23,"height":42.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-17.png","element":"img","alt":"βm0 and the k","inline":true},{"text":"-th subdiagonal entries given by ","element":"span"},{"style":{"height":21.62},"width":877.5,"height":54.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-18.png","element":"img","alt":" βm0+k for k = 1, 2, ..., M0 − m0; ˆf ∈ RNM×d is","inline":true,"padRight":true},{"text":"the ordered vector of unknowns 
","element":"span"},{"style":{"height":22.4},"width":918.97,"height":55.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-19.png","element":"img","alt":" {ˆf n}n∈I; and ˆg = (ˆgM, ˆgM+1, . . . , ˆgN) ∈ RNM×d ","inline":true,"padRight":true},{"text":"is defined as","element":"span"}],[{"style":{"width":"56%"},"width":997,"height":211,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-20.png","element":"img"}],[{"text":"which can be generated from the assumed, suitably approximated starting values ","element":"span"},{"style":{"height":21.85},"width":203.93,"height":54.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-21.png","element":"img","alt":" {ˆf n}n∈IM .","inline":true,"padRight":true},{"text":"We note that since ","element":"span"},{"style":{"height":19.12},"width":269.6,"height":47.81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-22.png","element":"img","alt":" βm0 ̸= 0, B−1 ","inline":true,"padRight":true},{"text":"always exists so that (","element":"span"},{"href":"#id-68","text":"3.2","element":"a"},{"text":") is solvable whenever the right hand terms are prescribed.","element":"span"}],[{"style":{"width":"95%"},"width":1697,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-23.png","element":"img"}],[{"id":"id-66","text":"see how the theory developed in this work is connected to the increasingly popular machine ","element":"span"},{"text":"learning based data-driven discovery of dynamics, we briefly recall the relevant learning problems here. 
For more extensive treatments of machine learning, we refer to [","element":"span"},{"href":"#id-69","referenceIndex":4,"text":"4","element":"a"},{"text":", ","element":"span"},{"href":"#id-3","referenceIndex":17,"text":"17","element":"a"},{"text":", ","element":"span"},{"href":"#id-70","referenceIndex":33,"text":"33","element":"a"},{"text":", ","element":"span"},{"href":"#id-71","referenceIndex":34,"text":"34","element":"a"},{"text":", ","element":"span"},{"href":"#id-7","referenceIndex":45,"text":"45","element":"a"},{"text":"].","element":"span"}],[{"text":"In a generic supervised machine learning setting for learning an unknown function ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f","element":"span"},{"text":", one often assumes knowledge of ","element":"span"},{"text":"˜","element":"span"},{"style":{"fontStyle":"italic"},"text":"N ","element":"span"},{"text":"samples of input-output data, ","element":"span"},{"style":{"height":22.64},"width":558.52,"height":56.59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/8-24.png","element":"img","alt":" D = {(xn, f(xn))} ˜Nn=1. This","inline":true}]]},{"heading":"10 KELLER AND DU.","paragraphs":[[{"text":"sample dataset is often divided into training and test sets, and one attempts to find a neural network (NN) representation of ","element":"span"},{"style":{"height":16.19},"width":226.02,"height":40.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-0.png","element":"img","alt":" f, say f NN","inline":true},{"text":", through empirical loss minimization over the training set. 
We let ˜","element":"span"},{"style":{"height":19.81},"width":163.11,"height":49.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-1.png","element":"img","alt":"x and ˜f","inline":true,"padRight":true},{"text":"denote an ordered subset of ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":18.41},"width":152.75,"height":46.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-2.png","element":"img","alt":"K ≤ ˜N","inline":true,"padRight":true},{"text":"data, so that (˜","element":"span"},{"style":{"height":22.07},"width":550.34,"height":55.17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-3.png","element":"img","alt":"xk, ˜f k) =�xnk, f(xnk)�∈ D.","inline":true,"padRight":true},{"text":"The loss is a suitably-defined function ","element":"span"},{"style":{"height":20.61},"width":219.92,"height":51.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-4.png","element":"img","alt":" ℓ(˜x, ˜f, f NN","inline":true},{"text":") measuring a distance between ","element":"span"},{"style":{"height":20.88},"width":877.72,"height":52.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-5.png","element":"img","alt":" f(xnk) and f NN(xnk) for each k = 1, 2, . . . , ˜K","inline":true},{"text":". When evaluated over only training data, this loss leads to the training error. The desired goal is to learn ","element":"span"},{"style":{"height":16.59},"width":264.27,"height":41.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-6.png","element":"img","alt":" f NN that not","inline":true,"padRight":true},{"text":"only minimizes the loss in the training set (i.e., the training error), but also achieves a small loss in the remaining test samples (i.e., the generalization error).","element":"span"}],[{"text":"In the setting of dynamics discovery, it is important to note that the dynamics, or output, data is not given directly. 
Instead, only the state, or the input, is provided, and information on the true dynamics ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f ","element":"span"},{"text":"is inferred by constraining the data to conform with some dynamical system. For the LMM discretization of the dynamics given by (","element":"span"},{"href":"#id-67","text":"3.1","element":"a"},{"text":"), conformity is achieved by minimizing the error associated with the LMM system, which we call the LMM residual. A total loss function of the optimization problem may be effectively viewed as","element":"span"}],[{"style":{"width":"41%"},"width":729,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-7.png","element":"img"}],[{"text":"where the loss ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-8.png","element":"img","alt":"ℓ","inline":true,"padRight":true},{"text":"is an increasing function of the LMM residual and vanishes at the origin, e.g., ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":20.88},"width":568.8,"height":52.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-9.png","element":"img","alt":"ℓ(˜x, ˜f) = ∥B˜f − h−1A˜x + ˆg∥22","inline":true},{"text":". A network approximation with sufficient accuracy would ","element":"span"},{"text":"attempt to conform with the discretized LMM dynamics by minimizing the LMM residual to find the unknown data ","element":"span"},{"text":"˜","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"f","element":"span"},{"text":". 
Alternatively, as done in LMNet, the neural network approximation may be supplied to the LMM residual ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":12.8},"width":18,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-10.png","element":"img","alt":"ℓ","inline":true},{"text":", where the initial dynamics in ˆ","element":"span"},{"style":{"fontWeight":"bold"},"text":"g ","element":"span"},{"text":"are also learned. If the approximation can be as accurate as desired, we are led to the idealized setting in which, given sufficient width and training, the neural network converges to ˆ","element":"span"},{"style":{"height":20.61},"width":210.46,"height":51.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-11.png","element":"img","alt":"f, where ˆf","inline":true,"padRight":true},{"text":"satisfies (","element":"span"},{"href":"#id-68","text":"3.2","element":"a"},{"text":").","element":"span"}],[{"text":"Naturally, due to other practical considerations as well as the finite approximation power of neural networks, more general loss functions, regularization techniques, and network architectures may also be taken into account; see Section ","element":"span"},{"href":"#id-52","text":"7 ","element":"a"},{"text":"for further discussion. ","element":"span"},{"text":"Our main focus here is to illustrate the impact of using different LMMs on the learning process by developing a rigorous mathematical theory of consistency, stability, and convergence for dynamics discovery, beginning with the highly idealized setting of exact state data.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.3. Truncation Error and Consistency. ","element":"span"},{"text":"LMMs for discovery inherit the truncation error ","element":"span"},{"id":"id-47","text":"of solving ordinary differential equations with LMMs. 
Indeed, truncation error is specific to ","element":"span"},{"text":"the discretization of the continuous problem; therefore, the truncation error ","element":"span"},{"style":{"height":10.84},"width":48.6,"height":27.11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-12.png","element":"img","alt":" τ h","inline":true,"padRight":true},{"text":"of a scheme for dynamics discovery remains the same as that for solving an ordinary differential equation for the state defined by (","element":"span"},{"href":"#id-72","text":"2.9","element":"a"},{"text":"). However, in addition to inheriting the same concept of consistency from Section ","element":"span"},{"href":"#id-46","text":"2","element":"a"},{"text":", Definition ","element":"span"},{"href":"#id-57","text":"2.4","element":"a"},{"text":", we also introduce some strengthened notions of consistency for dynamics discovery. We complement these concepts later on with refined notions of stability for a more nuanced discussion of convergence for discovery using LMMs. Consistency and its strengthened forms are defined below.","element":"span"}],[{"id":"id-73","style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"3.1 ","element":"span"},{"text":"(Consistency for Dynamics Discovery). ","element":"span"},{"text":"An LMM is consistent with the differential equation for dynamics discovery provided ","element":"span"},{"style":{"height":18.36},"width":367.12,"height":45.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-13.png","element":"img","alt":" ∥τ h∥∞ → 0 as h →","inline":true,"padRight":true},{"text":"0, and it is strongly consistent if ","element":"span"},{"style":{"height":18.36},"width":349.48,"height":45.9,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-14.png","element":"img","alt":" ∥τ h∥1 → 0 as h →","inline":true,"padRight":true},{"text":"0. 
Furthermore, a method is consistent of degree ","element":"span"},{"style":{"height":15.6},"width":231.58,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-15.png","element":"img","alt":" k, for k ≥ 1,","inline":true,"padRight":true},{"id":"id-75","text":"provided ","element":"span"},{"style":{"height":20.29},"width":525.13,"height":50.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/9-16.png","element":"img","alt":" N^{k−1} ∥τ_h∥_∞ → 0 as h → 0.","inline":true}]]},{"heading":"DISCOVERY OF DYNAMICS 11","paragraphs":[[{"style":{"fontStyle":"italic"},"text":"Remark ","element":"span"},{"text":"3.2. ","element":"span"},{"text":"With Definition ","element":"span"},{"href":"#id-73","text":"3.1","element":"a"},{"text":", all LMMs having at least ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":"-th order truncation error are consistent of degree at least ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":". Moreover, since","element":"span"}],[{"style":{"width":"36%"},"width":638,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-0.png","element":"img"}],[{"text":"LMMs having at least second-order truncation error are automatically consistent of degree 2 and thus strongly consistent.","element":"span"}],[{"text":"The classical truncation error analysis for LMMs yields the following algebraic criteria for consistency.","element":"span"}],[{"id":"id-89","text":"Lemma 3.3 (Consistency). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A linear multistep method for dynamics discovery is consistent provided that ","element":"span"},{"style":{"height":17.6},"width":497.66,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-1.png","element":"img","alt":" ρ(1) = 0 and ρ′(1) = σ(1)","inline":true},{"style":{"fontStyle":"italic"},"text":". 
Furthermore, it is consistent of degree ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"if it is of order ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"style":{"fontStyle":"italic"},"text":"in the sense of Definition ","element":"span"},{"href":"#id-65","style":{"fontStyle":"italic"},"text":"2.3","element":"a"},{"style":{"fontStyle":"italic"},"text":", that is, if ","element":"span"},{"style":{"height":19.53},"width":680.18,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-2.png","element":"img","alt":" ρ(e^z) − zσ(e^z) = O(z^{k+1}) as z → 0.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"3.4. Stability and the Root Condition for Discovery. ","element":"span"},{"text":"In this section, we develop stability ","element":"span"},{"id":"id-48","text":"in a spirit similar to that of Section ","element":"span"},{"href":"#id-46","text":"2 ","element":"a"},{"text":"but also introduce more refined notions of stability for convergence analysis. For discovery, the main distinction from the theory for the forward problem is that we now consider perturbations of the recovered dynamics rather than of the integrated states in the numerical solution of the differential equation. 
To begin we introduce a linear operator given by","element":"span"}],[{"id":"id-84","style":{"width":"76%"},"width":1356,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-3.png","element":"img"}],[{"text":"Notice ( ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":21.41},"width":122.22,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-4.png","element":"img","alt":"Rhˆf)n","inline":true,"padRight":true},{"text":"arises from its forward counterpart (","element":"span"},{"href":"#id-74","text":"2.8","element":"a"},{"text":") with the reduced ","element":"span"},{"style":{"height":15.6},"width":216.87,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-5.png","element":"img","alt":" β notation.","inline":true}],[{"style":{"height":17.6},"width":1233.72,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-6.png","element":"img","alt":"Definition 3.4 (Stability for Dynamics Discovery). 
A linear M−","inline":true},{"text":"step method for the dynamics discovery is called stable on [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] provided there exists a constant ","element":"span"},{"style":{"height":15.2},"width":262.61,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-7.png","element":"img","alt":" K < ∞, not","inline":true,"padRight":true},{"text":"depending on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N","element":"span"},{"text":", such that, for any two grid functions ","element":"span"},{"style":{"height":17.6},"width":256.77,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-8.png","element":"img","alt":" u, v ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"], we have","element":"span"}],[{"style":{"width":"95%"},"width":1696,"height":189,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-9.png","element":"img"}],[{"id":"id-76","text":"for the dynamics discovery is called marginally stable on [ ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a, b ","element":"span"},{"text":"] provided that there exists a constant ","element":"span"},{"style":{"height":12.8},"width":142.36,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-10.png","element":"img","alt":" K < ∞","inline":true},{"text":", not depending on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N","element":"span"},{"text":", such that, for any two grid functions ","element":"span"},{"style":{"height":17.6},"width":287.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-11.png","element":"img","alt":" u, v ∈ Γh[ a, b ],","inline":true,"padRight":true},{"text":"we 
have","element":"span"}],[{"style":{"width":"95%"},"width":1696,"height":161,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-12.png","element":"img"}],[{"id":"id-85","text":"method for the dynamics discovery is called weakly stable of degree ","element":"span"},{"style":{"height":17.6},"width":430.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-13.png","element":"img","alt":" −k for k ≥ 2 on [ a, b ]","inline":true,"padRight":true},{"text":"provided that there exists a constant ","element":"span"},{"style":{"height":12.8},"width":150.7,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-14.png","element":"img","alt":" K < ∞","inline":true},{"text":", not depending on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N","element":"span"},{"text":", such that, for any two grid functions ","element":"span"},{"style":{"height":17.6},"width":256.77,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-15.png","element":"img","alt":" u, v ∈ Γh[ a, b","inline":true,"padRight":true},{"text":"], we have","element":"span"}],[{"style":{"width":"66%"},"width":1178,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/10-16.png","element":"img"}]]},{"heading":"12 KELLER AND DU.","paragraphs":[[{"text":"In all cases, the norm on the left-hand-side is taken over the learned components ","element":"span"},{"style":{"height":17.6},"width":160.18,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-0.png","element":"img","alt":" {un}n∈I","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":17.6},"width":158.79,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-1.png","element":"img","alt":" {vn}n∈I","inline":true},{"text":". This convention is used in the rest of the paper. 
Note that we choose to use negative degree values so that more negative degrees correspond to weaker stability. Similar to the observation given in Remark ","element":"span"},{"href":"#id-75","text":"3.2","element":"a"},{"text":", we see that weak stability of degree ","element":"span"},{"style":{"height":4.8},"width":34,"height":12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-2.png","element":"img","alt":" −","inline":true},{"text":"2 follows from the marginal stability in Definition ","element":"span"},{"href":"#id-76","text":"3.5","element":"a"},{"text":".","element":"span"}],[{"text":"We would like to turn the property of stability into an algebraic condition as for the case of numerical solution to ODEs. For the forward problem, the algebraic root condition (Definition ","element":"span"},{"href":"#id-61","text":"2.6","element":"a"},{"text":") serves this purpose; however, for the inverse problem, we require a more subtle treatment of the root condition to capture the nuances in stability for dynamics discovery.","element":"span"}],[{"id":"id-83","style":{"fontStyle":"italic"},"text":"Definition ","element":"span"},{"text":"3.7 ","element":"span"},{"text":"(Strong Root Condition [","element":"span"},{"href":"#id-77","referenceIndex":1,"text":"1","element":"a"},{"text":", ","element":"span"},{"href":"#id-78","referenceIndex":46,"text":"46","element":"a"},{"text":", ","element":"span"},{"href":"#id-79","referenceIndex":13,"text":"13","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"] ). 
","element":"span"},{"text":"A polynomial satisfies the strong root condition provided the roots of the polynomial have magnitude less than 1.","element":"span"}],[{"id":"id-80","text":"Likewise, we also generalize the above root conditions.","element":"span"}],[{"style":{"height":19.53},"width":897.07,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-3.png","element":"img","alt":"Definition 3.8 (kth-multiplicity Root Condition).","inline":true,"padRight":true},{"text":"A polynomial satisfies the root condition of degree ","element":"span"},{"style":{"height":13.2},"width":113.1,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-4.png","element":"img","alt":" k ∈ N","inline":true,"padRight":true},{"text":"provided the roots of the polynomial do not exceed magnitude 1, and those of magnitude 1 have multiplicity no larger than ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Remark ","element":"span"},{"text":"3.9. ","element":"span"},{"text":"One may view the conventional (algebraic) root condition (Definition ","element":"span"},{"href":"#id-61","text":"2.6","element":"a"},{"text":") and the strong root condition (Definition ","element":"span"},{"href":"#id-80","text":"3.8","element":"a"},{"text":") as special cases of the ","element":"span"},{"style":{"height":15.53},"width":56.33,"height":38.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-5.png","element":"img","alt":" kth","inline":true},{"text":"-multiplicity root condition of Definition ","element":"span"},{"href":"#id-80","text":"3.8 ","element":"a"},{"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"= 1 and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"= 0 respectively. 
The strong root condition has been used in the numerical analysis, control theory, and linear recurrence relation literature to study the relative stability of LMMs as time integrators and the asymptotic properties of linear recurrence relations [","element":"span"},{"href":"#id-77","referenceIndex":1,"text":"1","element":"a"},{"text":", ","element":"span"},{"href":"#id-78","referenceIndex":46,"text":"46","element":"a"},{"text":", ","element":"span"},{"href":"#id-79","referenceIndex":13,"text":"13","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"].","element":"span"}],[{"text":"The notions of stability for discovery with LMMs are naturally tied to bounds on the solutions of the linear recurrence equations determined by the coefficients ","element":"span"},{"style":{"height":15.6},"width":29,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-6.png","element":"img","alt":"β","inline":true},{"text":". We now relate them to the root conditions. Notice that while the stability in Theorem ","element":"span"},{"href":"#id-81","text":"2.7 ","element":"a"},{"text":"for numerical integration of the given dynamics is concerned with the first characteristic polynomial ","element":"span"},{"style":{"height":17.6},"width":59.53,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-7.png","element":"img","alt":" ρ(r","inline":true},{"text":"), the stability in Theorem ","element":"span"},{"href":"#id-82","text":"3.10 ","element":"a"},{"text":"for the discovery of dynamics is concerned with the second characteristic polynomial ","element":"span"},{"style":{"height":17.6},"width":63.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-8.png","element":"img","alt":" σ(r","inline":true},{"text":") defined by (","element":"span"},{"href":"#id-60","text":"2.3","element":"a"},{"text":"). 
More precisely, the root condition can be stated for a reduced second characteristic polynomial","element":"span"}],[{"style":{"width":"62%"},"width":1104,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-9.png","element":"img"}],[{"text":"Hence, we see a fundamental difference in the two stability notions. ","element":"span"},{"text":"The dependence of stability on ","element":"span"},{"style":{"height":17.6},"width":242.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-10.png","element":"img","alt":" σ(r) (or ˆσ(r","inline":true},{"text":")) might be unexpected as it has not appeared in the numerical differential equation literature. However, it is also not surprising given the inverse problem nature of using LMMs for dynamics discovery.","element":"span"}],[{"id":"id-82","text":"Theorem 3.10 (Stability for Discovery). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A linear multistep method for discovery of dynamics is stable provided that the second characteristic polynomial ","element":"span"},{"style":{"height":17.6},"width":81.37,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-11.png","element":"img","alt":" σ(r)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"or the reduced ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":17.6},"width":81.37,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-12.png","element":"img","alt":"σ(r)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies the strong root condition in Definition ","element":"span"},{"href":"#id-83","style":{"fontStyle":"italic"},"text":"3.7","element":"a"},{"style":{"fontStyle":"italic"},"text":". 
Likewise, an LMM for discovery of dynamics is marginally stable provided that ","element":"span"},{"style":{"height":17.6},"width":233.39,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/11-13.png","element":"img","alt":" σ(r) or ˆσ(r)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies the algebraic root condition in Definition ","element":"span"},{"href":"#id-61","style":{"fontStyle":"italic"},"text":"2.6","element":"a"},{"style":{"fontStyle":"italic"},"text":". Furthermore, an LMM for discovery of dynamics is weakly stable of degree","element":"span"}]]},{"heading":"DISCOVERY OF DYNAMICS 13","paragraphs":[[{"style":{"height":17.6},"width":264.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-0.png","element":"img","alt":"−k (for k ≥ 2","inline":true},{"style":{"fontStyle":"italic"},"text":") provided that ","element":"span"},{"style":{"height":17.6},"width":234.07,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-1.png","element":"img","alt":" σ(r) or ˆσ(r)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies the ","element":"span"},{"text":"(","element":"span"},{"style":{"height":19.53},"width":147.23,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-2.png","element":"img","alt":"k − 1)th","inline":true},{"style":{"fontStyle":"italic"},"text":"-multiplicity root condition in Definition ","element":"span"},{"href":"#id-80","style":{"fontStyle":"italic"},"text":"3.8","element":"a"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"style":{"height":17.6},"width":805.07,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-3.png","element":"img","alt":"Proof. 
Let ˆe = u−v, where u, v ∈ Γh[0, T","inline":true},{"text":"] are both generated by solving the LMM (","element":"span"},{"href":"#id-68","text":"3.2","element":"a"},{"text":"). By setting ","element":"span"},{"style":{"height":21.21},"width":173.46,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-4.png","element":"img","alt":" r = ˆRh(e","inline":true},{"text":") with the operator ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":14.84},"width":53.13,"height":37.1,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-5.png","element":"img","alt":"Rh","inline":true,"padRight":true},{"text":"defined in (","element":"span"},{"href":"#id-84","text":"3.3","element":"a"},{"text":"), we have","element":"span"}],[{"style":{"width":"47%"},"width":847,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-6.png","element":"img"}],[{"text":"By standard recurrence and linear algebra theory [","element":"span"},{"href":"#id-77","referenceIndex":1,"text":"1","element":"a"},{"text":", ","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":"], the difference ˆ","element":"span"},{"style":{"fontWeight":"bold"},"text":"e ","element":"span"},{"text":"can be determined by the companion matrix of the above recurrence relation, denoted by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Z","element":"span"},{"text":". This matrix is an (","element":"span"},{"style":{"height":17.6},"width":409.3,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-7.png","element":"img","alt":"M0 −m0)×(M0 −m0","inline":true},{"text":") matrix with its first row given by ","element":"span"},{"style":{"height":31.6},"width":675.06,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-8.png","element":"img","alt":" −� βm0+1βm0 , βm0+2βm0 , . . . 
,βM0βm0�, and the","inline":true,"padRight":true},{"text":"rest of the rows are of the form (","element":"span"},{"style":{"fontWeight":"bold"},"text":"I","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"style":{"fontWeight":"bold"},"text":"0","element":"span"},{"text":") where ","element":"span"},{"style":{"fontWeight":"bold"},"text":"I ","element":"span"},{"text":"is the identity matrix of size (","element":"span"},{"style":{"height":17.6},"width":309.09,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-9.png","element":"img","alt":"M0 − m0 − 1) ×","inline":true,"padRight":true},{"text":"(","element":"span"},{"style":{"height":17.6},"width":415.1,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-10.png","element":"img","alt":"M0 − m0 − 1) and 0","inline":true,"padRight":true},{"text":"is the zero column vector in ","element":"span"},{"style":{"height":15.13},"width":196.91,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-11.png","element":"img","alt":" RM0−m0−1","inline":true},{"text":". 
The matrix ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Z ","element":"span"},{"text":"is associated with a characteristic polynomial given by ˆ","element":"span"},{"style":{"height":17.6},"width":63.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-12.png","element":"img","alt":"σ(r","inline":true},{"text":") that shares the same set of roots as that of ","element":"span"},{"style":{"height":17.6},"width":63.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-13.png","element":"img","alt":"σ(r","inline":true},{"text":"), except a possible root at 0.","element":"span"}],[{"text":"To consider the propagation of the difference ˆ","element":"span"},{"style":{"fontWeight":"bold"},"text":"e","element":"span"},{"text":", we form the matrix ","element":"span"},{"style":{"height":14.62},"width":366.84,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-14.png","element":"img","alt":" En = ZEn−1 + Rn","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":18.55},"width":334.55,"height":46.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-15.png","element":"img","alt":" En ∈ R(M0−m0)×d ","inline":true,"padRight":true},{"text":"has its rows given by ","element":"span"},{"style":{"height":21.18},"width":877.37,"height":52.94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/12-16.png","element":"img","alt":" {ˆen−k}0≤k 0. Notice","inline":true}],[{"id":"id-98","style":{"width":"81%"},"width":1447,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-1.png","element":"img"}],[{"text":"We prove (","element":"span"},{"href":"#id-98","text":"4.4","element":"a"},{"text":") by induction. ","element":"span"},{"text":"As the base case, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 2. 
For ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 2, we have ","element":"span"},{"style":{"height":17.6},"width":244.23,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-2.png","element":"img","alt":" β1 = 8/12 >","inline":true},{"style":{"height":17.6},"width":145.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-3.png","element":"img","alt":"β0 = 5/","inline":true},{"text":"12. Now assume (","element":"span"},{"href":"#id-98","text":"4.4","element":"a"},{"text":") holds up to some arbitrary ","element":"span"},{"style":{"height":15.6},"width":351.96,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-4.png","element":"img","alt":" M ∈ N, with M >","inline":true,"padRight":true},{"text":"2. We will show the result for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"+ 1","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"id":"id-99","style":{"width":"95%"},"width":1684,"height":591,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-5.png","element":"img"}],[{"text":"as desired. Note we used the inductive hypothesis on the second term in (","element":"span"},{"href":"#id-99","text":"4.6","element":"a"},{"text":"). The proof by induction showing for ","element":"span"},{"style":{"height":16.4},"width":312.3,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-6.png","element":"img","alt":" M ≥ 2, β1 > β0","inline":true,"padRight":true},{"text":"is complete. To prove Part ","element":"span"},{"href":"#id-100","text":"2","element":"a"},{"text":", note that the relation of signs between coefficients follows from the sign of the Lagrange basis polynomials in the integrand of the coefficients. 
For ","element":"span"},{"style":{"height":17.6},"width":362.73,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-7.png","element":"img","alt":" m ∈ {2, 3, . . . , M},","inline":true,"padRight":true},{"text":"the integrands in (","element":"span"},{"href":"#id-101","text":"4.3","element":"a"},{"text":") are of the same sign, and therefore the sign of ","element":"span"},{"style":{"height":16.4},"width":54.68,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-8.png","element":"img","alt":" βm","inline":true,"padRight":true},{"text":"depends only on the multiplier (","element":"span"},{"style":{"height":17.6},"width":116.68,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-9.png","element":"img","alt":"−1)^m.","inline":true,"padRight":true},{"text":"Hence, Part ","element":"span"},{"href":"#id-100","text":"2 ","element":"a"},{"text":"of Lemma ","element":"span"},{"href":"#id-100","text":"4.6 ","element":"a"},{"text":"follows.","element":"span"}],[{"text":"Finally, for Part ","element":"span"},{"href":"#id-102","text":"3","element":"a"},{"text":", we note that","element":"span"}],[{"style":{"width":"65%"},"width":1161,"height":284,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-10.png","element":"img"}],[{"text":"This completes the proof.","element":"span"}],[{"style":{"height":17.6},"width":865.19,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-11.png","element":"img","alt":"Lemma 4.7 (General Instability of AM M ≥ 2).","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"The linear multistep method formed by the ","element":"span"},{"id":"id-96","style":{"fontStyle":"italic"},"text":"Adams-Moulton scheme for ","element":"span"},{"style":{"height":14.4},"width":127.27,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-12.png","element":"img","alt":" M ≥ 
2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"does not satisfy the root condition.","element":"span"}],[{"style":{"width":"99%"},"width":1770,"height":374,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/18-13.png","element":"img"}]]},{"heading":"20 KELLER AND DU.","paragraphs":[[{"text":"Taking the limit as ","element":"span"},{"style":{"height":12},"width":166.72,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-0.png","element":"img","alt":" r → +∞","inline":true},{"text":", we see that (","element":"span"},{"style":{"height":19.53},"width":607.54,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-1.png","element":"img","alt":"−1)Mσ(−∞) = ∞ since β0 > 0.","inline":true,"padRight":true},{"text":"Meanwhile,","element":"span"}],[{"style":{"width":"55%"},"width":990,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-2.png","element":"img"}],[{"text":"Hence, it follows from the Intermediate Value Theorem that there is at least one root of ","element":"span"},{"style":{"height":17.6},"width":63.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-3.png","element":"img","alt":"σ(r","inline":true},{"text":") that is real in (","element":"span"},{"style":{"height":17.6},"width":464.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-4.png","element":"img","alt":"−∞, −β1/β0) ⊂ (−∞, −","inline":true},{"text":"1), violating the root condition. The result thus follows.","element":"span"}],[{"id":"id-104","text":"Theorem 4.8 (Root Condition of AB, AM, BDF). 
","element":"span"},{"style":{"fontStyle":"italic"},"text":"The strong root condition for discovery is satisfied by BDF-","element":"span"},{"style":{"height":16.8},"width":478.97,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-5.png","element":"img","alt":"M for all M ∈ N, AB-M","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"scheme for ","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":16.8},"width":645,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-6.png","element":"img","alt":" ≤ M ≤ 6, and AM-M for M = 0.","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"The algebraic root condition, or the ","element":"span"},{"style":{"height":15.53},"width":56.33,"height":38.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-7.png","element":"img","alt":" kth ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"root condition with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k ","element":"span"},{"text":"= 1","element":"span"},{"style":{"fontStyle":"italic"},"text":", is satisfied for AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"style":{"fontStyle":"italic"},"text":"with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 1","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
On the other hand, the root condition is not satisfied for the AB-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"style":{"fontStyle":"italic"},"text":"scheme with ","element":"span"},{"text":"7 ","element":"span"},{"style":{"height":14.4},"width":195.15,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-8.png","element":"img","alt":" ≤ M ≤ 10","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"or the AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"style":{"fontStyle":"italic"},"text":"scheme with ","element":"span"},{"style":{"height":14.4},"width":140.09,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-9.png","element":"img","alt":" M ≥ 2.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"The case of AM-0 is trivial. Lemma ","element":"span"},{"href":"#id-103","text":"4.5 ","element":"a"},{"text":"implies the results of Theorem ","element":"span"},{"href":"#id-104","text":"4.8 ","element":"a"},{"text":"for AB-","element":"span"},{"style":{"height":14.8},"width":335.21,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-10.png","element":"img","alt":"M with 1 ≤ M ≤","inline":true,"padRight":true},{"text":"10 and for AM-","element":"span"},{"style":{"height":14.8},"width":335.2,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-11.png","element":"img","alt":"M with 1 ≤ M ≤","inline":true,"padRight":true},{"text":"10. 
Furthermore, by Lemma ","element":"span"},{"href":"#id-96","text":"4.7","element":"a"},{"text":", the AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"scheme for ","element":"span"},{"style":{"height":14.4},"width":93.21,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-12.png","element":"img","alt":" M ≥","inline":true,"padRight":true},{"text":"2 violates the root condition and hence is unstable. Finally, BDF-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"has ","element":"span"},{"style":{"height":19.53},"width":412.84,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-13.png","element":"img","alt":" σ(r) = rM−1 and ˆσ(r","inline":true},{"text":") = 1, for all ","element":"span"},{"style":{"height":14.4},"width":95.83,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-14.png","element":"img","alt":" M ≥","inline":true,"padRight":true},{"text":"1. Hence, the root condition is always satisfied for the BDF scheme for arbitrary ","element":"span"},{"style":{"height":14.4},"width":93.21,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-15.png","element":"img","alt":" M ≥","inline":true,"padRight":true},{"text":"1. As a result, AM-0, identical to BDF-1, satisfies the root condition as well.","element":"span"}],[{"text":"Finally, Theorem ","element":"span"},{"href":"#id-44","text":"4.2 ","element":"a"},{"text":"follows directly from Theorems ","element":"span"},{"href":"#id-104","text":"4.8 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-82","text":"3.10","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"4.4. Discussions on the Effect of Initial Conditions. 
","element":"span"},{"text":"The theory developed so far is ","element":"span"},{"id":"id-111","text":"under the assumption that some initial data of the dynamics are provided, which leads to ","element":"span"},{"text":"learning the approximated dynamics at later times. One may consider a situation where some terminal data are given instead. In such cases, the approximate dynamics would be solved backward in time, yielding a modified system of equations. It is not hard to check that the stability would become dependent on a modified second characteristic polynomial whose roots are the reciprocals of those of ˆ","element":"span"},{"style":{"height":8},"width":25,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-16.png","element":"img","alt":"σ","inline":true},{"text":". Naturally, it is of interest to check root conditions for the three classes of LMMs as well. For BDF, we clearly see the strong root condition holds as ˆ","element":"span"},{"style":{"height":17.6},"width":63.46,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-17.png","element":"img","alt":"σ(r","inline":true},{"text":") = 1. For AM-0 and AB-1, the same also holds. Likewise for AM-1, the root condition but not the strong root condition remains true. For AM-","element":"span"},{"href":"#id-102","style":{"height":15.6},"width":490.34,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-18.png","element":"img","alt":"M with M ≥ 2, Part 3 of","inline":true,"padRight":true},{"text":"Lemma ","element":"span"},{"href":"#id-100","text":"4.6 ","element":"a"},{"text":"implies the product of the roots of ˆ","element":"span"},{"style":{"height":17.6},"width":204.22,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-19.png","element":"img","alt":"σ(r) = σ(r","inline":true},{"text":") is less than one. 
Therefore, there might be at least one root of the modified second characteristic polynomial outside the unit disc, and hence instability for these methods is again expected. Interestingly, unlike in the case with initial data where there is not yet rigorous theory but only computational results for the AB methods, one can prove rigorously in the next lemma a result of instability for the backwards-in-time AB-","element":"span"},{"style":{"height":15.2},"width":212.85,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-20.png","element":"img","alt":"M, M ≥ 2,","inline":true,"padRight":true},{"text":"via a similar argument as Part ","element":"span"},{"href":"#id-102","text":"3 ","element":"a"},{"text":"of Lemma ","element":"span"},{"href":"#id-100","text":"4.6","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"100%"},"width":1773,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-21.png","element":"img"}],[{"style":{"height":16.8},"width":178.41,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-22.png","element":"img","alt":"Proof. β0","inline":true,"padRight":true},{"text":"= 0 is true by construction. For ","element":"span"},{"style":{"height":14.4},"width":93.21,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-23.png","element":"img","alt":" M ≥","inline":true,"padRight":true},{"text":"2, we have","element":"span"}],[{"style":{"width":"74%"},"width":1327,"height":163,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/19-24.png","element":"img"}]]},{"heading":"DISCOVERY OF DYNAMICS 21","paragraphs":[[{"text":"for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"= 1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"2","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . 
, M ","element":"span"},{"text":"+ 1","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"The coefficients ","element":"span"},{"style":{"height":16.8},"width":341.81,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-0.png","element":"img","alt":" β1 and βM satisfy","inline":true}],[{"style":{"width":"81%"},"width":1442,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-1.png","element":"img"}],[{"text":"which completes the proof of the lemma.","element":"span"}],[{"text":"From the above, we see that root conditions do not hold for the modified second characteristic polynomial associated with AB-","element":"span"},{"style":{"height":14.8},"width":252.17,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-2.png","element":"img","alt":"M with M ≥","inline":true,"padRight":true},{"text":"2, so that instability would occur when terminal data are supplied. In practice, it is often the case that such initial dynamics are represented by neural networks as part of the unknown as well. Thus, the stability in such cases is worthy of further investigation, particularly in conjunction with the approximation properties of the neural networks to be employed. Clearly, the successful runs using neural networks in Figure ","element":"span"},{"href":"#id-40","text":"1 ","element":"a"},{"text":"have good correspondence with those schemes enjoying some stability properties in one or both types of initial/terminal data.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"5. Long Time Dynamics Discovery. 
","element":"span"},{"text":"In this section, we consider the problem of ","element":"span"},{"id":"id-50","text":"discovering dynamics of (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") over a variable interval (0","element":"span"},{"style":{"fontStyle":"italic"},"text":", T","element":"span"},{"text":")","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"with terminal time 1 ","element":"span"},{"style":{"height":15.6},"width":330.79,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-3.png","element":"img","alt":" ≪ T → ∞, and a","inline":true,"padRight":true},{"text":"fixed mesh ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h","element":"span"},{"text":". Notice that by increasing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"we increase the number of grid points ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T/h","element":"span"},{"text":"; hence we hope to relate our previous studies with variable mesh and fixed domain to this setting. 
For the numerical analysis of time integration, this study is reminiscent of that of asymptotic stability, which is often treated via the study of linear dynamics [","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"].","element":"span"}],[{"text":"By rescaling time, ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":19.22},"width":482.34,"height":48.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-4.png","element":"img","alt":"t = t/T, where 0 ≤ ˜t ≤ 1,","inline":true,"padRight":true},{"text":"and defining ˜","element":"span"},{"style":{"height":19.22},"width":190.72,"height":48.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-5.png","element":"img","alt":"x(˜t) = x(t","inline":true},{"text":"), we have via change of variables that the scaled dynamics ","element":"span"},{"text":"˜","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"may be related to that of the original variables by","element":"span"}],[{"style":{"width":"44%"},"width":788,"height":94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-6.png","element":"img"}],[{"text":"Then, if we define ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":19.22},"width":549.7,"height":48.05,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-7.png","element":"img","alt":"f(˜x(˜t)) = Tf(˜x(˜t)) = Tf(x(t","inline":true},{"text":")), the rescaled differential equation becomes","element":"span"}],[{"id":"id-105","style":{"width":"75%"},"width":1339,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-8.png","element":"img"}],[{"text":"Now, consider applying the LMM scheme to 
˜","element":"span"},{"style":{"fontStyle":"italic"},"text":"x ","element":"span"},{"text":"using the transformed model problem (","element":"span"},{"href":"#id-105","text":"5.1","element":"a"},{"text":") with a step size ","element":"span"},{"text":"˜","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"= 1","element":"span"},{"style":{"fontStyle":"italic"},"text":"/N","element":"span"},{"text":". Under this rescaling of time, one can check directly the leading truncation error term of an LMM of order ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p ","element":"span"},{"text":"in the sense of Definitions ","element":"span"},{"href":"#id-106","text":"2.2 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-65","text":"2.3 ","element":"a"},{"text":"is","element":"span"}],[{"id":"id-107","style":{"width":"84%"},"width":1492,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-9.png","element":"img"}],[{"text":"In light of (","element":"span"},{"href":"#id-107","text":"5.2","element":"a"},{"text":"), we can see that the truncation error of the discovered dynamics of (","element":"span"},{"href":"#id-53","text":"2.1","element":"a"},{"text":") in the original time scale is a multiple of the truncation error of the rescaled model (","element":"span"},{"href":"#id-105","text":"5.1","element":"a"},{"text":") by the factor ","element":"span"},{"style":{"height":14.73},"width":74.9,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-10.png","element":"img","alt":" T −1","inline":true},{"text":". 
Meanwhile, from the analysis of Section ","element":"span"},{"href":"#id-48","text":"3.4","element":"a"},{"text":", the error from stability is only directly dependent on ","element":"span"},{"style":{"height":17.6},"width":219.73,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-11.png","element":"img","alt":" σ(r) and N","inline":true},{"text":", not the specific time domain.","element":"span"}],[{"text":"Using these observations of the effects on consistency and stability, we can deduce the behavior of an LMM in the long-time regime. For a strongly stable ","element":"span"},{"style":{"height":18.73},"width":89.7,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-12.png","element":"img","alt":" pth−","inline":true},{"text":"order LMM, the global error behaves like ","element":"span"},{"style":{"height":20.8},"width":400.9,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-13.png","element":"img","alt":" O�T −1Thp�= O(hp","inline":true},{"text":") provided that max","element":"span"},{"style":{"height":22.69},"width":475.46,"height":56.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/20-14.png","element":"img","alt":"t∈(0,T) |x(p+1)(t)| remains","inline":true}]]},{"heading":"22 KELLER AND DU.","paragraphs":[[{"text":"uniformly bounded as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"increases. Hence, we may view strongly stable LMMs as A-stable, in the case of dynamics discovery, for fixed ","element":"span"},{"style":{"height":12.8},"width":236.05,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-0.png","element":"img","alt":" h as T → ∞","inline":true},{"text":". 
This can be seen as another difference with the case of the forward problem of time integration, where the order of A-stable LMMs is known to be limited by 2 due to the celebrated Dahlquist barrier theorems [","element":"span"},{"href":"#id-35","referenceIndex":12,"text":"12","element":"a"},{"text":", ","element":"span"},{"href":"#id-36","referenceIndex":15,"text":"15","element":"a"},{"text":", ","element":"span"},{"href":"#id-38","referenceIndex":32,"text":"32","element":"a"},{"text":", ","element":"span"},{"href":"#id-34","referenceIndex":2,"text":"2","element":"a"},{"text":"]. On the other hand, for unstable methods, the exponential growth in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N ","element":"span"},{"text":"of the inverse matrix ","element":"span"},{"style":{"height":14.73},"width":78.64,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-1.png","element":"img","alt":"B−1 ","inline":true,"padRight":true},{"text":"dominates over any gain in accuracy from consistency. 
Thus, lack of stability leads to an exponentially increasing error as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"grows linearly.","element":"span"}],[{"text":"As a special example, the marginally stable AM-1 is not stable for dynamics discovery, but as stated in Corollary ","element":"span"},{"href":"#id-45","text":"4.3 ","element":"a"},{"text":"and the derivation in its proof, we can use the rescaling to get the global error in the form ","element":"span"},{"style":{"height":19.13},"width":372.69,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-2.png","element":"img","alt":" O(T −1Th2) = O(h2","inline":true},{"text":"). Thus, we expect AM-1, for a fixed ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h","element":"span"},{"text":", to have a constant error as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"increases, which is supported by numerical experiments presented in the next section.","element":"span"}],[{"text":"To recap, from the analysis in this section, for dynamics discovery, BDFs enjoy asymptotic stability for a fixed time step size ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"increases. The same holds for AB-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":", at least for a small value of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"that enjoys the stability as ","element":"span"},{"style":{"height":12.8},"width":81.26,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-3.png","element":"img","alt":" h →","inline":true,"padRight":true},{"text":"0 for a given terminal time. 
While this also holds for AM-1, it does not hold for AM-","element":"span"},{"style":{"height":14.8},"width":251.71,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-4.png","element":"img","alt":"M with M ≥","inline":true,"padRight":true},{"text":"2. As shown in Figure ","element":"span"},{"href":"#id-108","text":"3","element":"a"},{"text":", the errors from AB and BDF remain fixed across various values of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"text":", while the AM methods yield exponential growth of error in ","element":"span"},{"style":{"height":14.8},"width":251.98,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-5.png","element":"img","alt":" T for M ≥ 2.","inline":true}],[{"style":{"width":"95%"},"width":1697,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-6.png","element":"img"}],[{"id":"id-51","text":"systems associated with each of the studied multistep methods and show numerical evidence ","element":"span"},{"text":"consistent with the theoretical findings. We limit ourselves to the idealized setting of numerically exact states considered for the theoretical analysis and to low dimensional dynamic systems for the sake of illustration and benchmarking. In addition, we also take the initial data for the dynamics to be exact. For a model problem, we consider the 2D Cubic System, a nonlinearly damped oscillator, specified as in [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-9","referenceIndex":6,"text":"6","element":"a"},{"text":"].","element":"span"}],[{"id":"id-39","style":{"width":"63%"},"width":1130,"height":184,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-7.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"6.1. Fixed Time Domain. 
","element":"span"},{"text":"First we study the methods on a fixed time domain, ","element":"span"},{"style":{"height":17.6},"width":202.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-8.png","element":"img","alt":" t ∈ [0, 0.2],","inline":true,"padRight":true},{"text":"with varying time step. We show in Figure ","element":"span"},{"href":"#id-109","text":"2 ","element":"a"},{"text":"the results from the Adams family and BDF methods. The exact dynamics are computed by numerically integrating (","element":"span"},{"href":"#id-39","text":"6.1","element":"a"},{"text":") on a very refined mesh. ","element":"span"},{"text":"The errors of the discovered dynamics in the ","element":"span"},{"style":{"height":12.8},"width":88.05,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-9.png","element":"img","alt":" ℓ∞−","inline":true},{"text":"norm are shown in Figure ","element":"span"},{"href":"#id-109","text":"2 ","element":"a"},{"text":"for different ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"against different number of grid points. In addition, Figure ","element":"span"},{"href":"#id-109","text":"2d ","element":"a"},{"text":"shows a segment of the approximated dynamics captured over the interval versus the true dynamics using a stable and convergent method (AB-3) when ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"= 0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"01. In this figure, the dotted and dashed lines represent the true dynamics in the first and second coordinates, i.e. ","element":"span"},{"style":{"height":16.59},"width":195.36,"height":41.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-10.png","element":"img","alt":" f 1 and f 2","inline":true},{"text":", respectively. 
The crosses and asterisks denote the learned dynamics in the first and second coordinates, i.e. ˆ","element":"span"},{"style":{"height":21.2},"width":195.24,"height":52.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-11.png","element":"img","alt":"f 1 and ˆf 2","inline":true},{"text":", respectively. The method is able to capture the twist and intersection of the two coordinates. Clearly, the numerical results support the theoretical findings of this paper.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"6.2. Long Time Behavior. ","element":"span"},{"text":"Here, we consider the problem of discovering dynamics over a changing domain [0","element":"span"},{"style":{"height":17.6},"width":215.73,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/21-12.png","element":"img","alt":", T], T ≫ 1.","inline":true,"padRight":true},{"text":"with fixed mesh size ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h","element":"span"},{"text":". 
In Figure ","element":"span"},{"href":"#id-108","text":"3","element":"a"},{"text":", we discover the dynamics","element":"span"}]]},{"heading":"DISCOVERY OF DYNAMICS 23","paragraphs":[[{"id":"id-109","style":{"width":"100%"},"width":1772,"height":1528,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/22-0.png","element":"img"}],[{"text":"Figure 2: Numerical results of the three types of schemes on the 2D cubic system (","element":"figcaption","subtype":"caption"},{"href":"#id-39","text":"6.1","element":"a","subtype":"caption"},{"text":") on the unit time interval for different choices of ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"M ","element":"figcaption","subtype":"caption"},{"text":"and ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"N","element":"figcaption","subtype":"caption"},{"text":".","element":"figcaption","subtype":"caption"}],[{"text":"of the 2D Cubic System over specified ranges of T (","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"= 12","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"5","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"25","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"37","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"5)","element":"span"},{"style":{"fontStyle":"italic"},"text":". 
","element":"span"},{"text":"For AM (Figure ","element":"span"},{"href":"#id-108","text":"3a","element":"a"},{"text":", ","element":"span"},{"href":"#id-108","text":"3b","element":"a"},{"text":", and ","element":"span"},{"href":"#id-108","text":"3c","element":"a"},{"text":") we use ","element":"span"},{"style":{"fontStyle":"italic"},"text":"h ","element":"span"},{"text":"= 0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"01 to first generate data over [0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"50] and then select the slice of data matching the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"text":"s. AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"clearly suffers from exponential error growth when ","element":"span"},{"style":{"height":15.6},"width":251.73,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/22-1.png","element":"img","alt":" M ≥ 2, while","inline":true,"padRight":true},{"text":"it has a constant error when ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 1, as predicted in Section ","element":"span"},{"href":"#id-50","text":"5","element":"a"},{"text":". Meanwhile, also consistent with the analysis of Section ","element":"span"},{"href":"#id-50","text":"5","element":"a"},{"text":", AB and BDF are robust for long-time dynamics discovery – yielding a constant error for fixed mesh as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"increases and a decreasing error for larger ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"7. Conclusions and Future Steps. 
","element":"span"},{"text":"In this paper, we extend the foundational work of ","element":"span"},{"id":"id-52","text":"solving ordinary differential equations using LMMs to the problem of dynamics discovery. We ","element":"span"},{"text":"introduce refined notions of consistency, stability, and convergence for discovery based on classical definitions, and we show how three prominent schemes – Adams-Bashforth, Adams-Moulton, and Backwards Differentiation Formula – may or may not be convergent","element":"span"}]]},{"heading":"24 KELLER AND DU.","paragraphs":[[{"id":"id-108","style":{"width":"100%"},"width":1772,"height":561,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/23-0.png","element":"img"}],[{"text":"Figure 3: Long Time Errors for Discovery of 2D Cubic System (","element":"figcaption","subtype":"caption"},{"href":"#id-39","text":"6.1","element":"a","subtype":"caption"},{"text":")","element":"figcaption","subtype":"caption"}],[{"text":"numerical methods for dynamics discovery in general. ","element":"span"},{"text":"To do so, we first derive algebraic criteria to determine the consistency and stability of the LMM, in a spirit similar to the counterpart for the classical theory. The key difference lies in the characteristic polynomial of interest; instead of the root condition for the first characteristic polynomial, as classically attributed to LMMs as time integrators, stability for discovery of dynamics is attributed to root conditions on the second characteristic polynomial. ","element":"span"},{"text":"While the conditions are trivial for the BDF class, their validity in the case of AM schemes requires the study of some new properties of the Lagrange interpolants. The case of AB, at present, has to be investigated computationally. ","element":"span"},{"text":"Numerical results are presented to show agreement with the theoretical findings. 
In conclusion, we find theoretically and numerically that the systems for BDF-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"for all ","element":"span"},{"style":{"height":16},"width":499.03,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/23-1.png","element":"img","alt":" M ∈ N, AB for 1 ≤ M ≤","inline":true,"padRight":true},{"text":"6, and AM-","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"M ","element":"span"},{"text":"= 0 and 1 are convergent, while AB-","element":"span"},{"style":{"height":14.8},"width":324.12,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/23-2.png","element":"img","alt":"M for 7 ≤ M ≤","inline":true,"padRight":true},{"text":"10 and AM-","element":"span"},{"style":{"height":14.8},"width":233.42,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/23-3.png","element":"img","alt":"M for M ≥","inline":true,"padRight":true},{"text":"2 are not, as summarized in Table ","element":"span"},{"href":"#id-110","text":"2","element":"a"},{"text":". These conclusions hold provided that some initial data on the dynamics are supplied. Modifications need to be made, as discussed in Section ","element":"span"},{"href":"#id-111","text":"4.4","element":"a"},{"text":", if other types of additional data on the dynamics are provided. ","element":"span"},{"text":"LMM schemes are well-studied for the forward problem in numerical analysis. As such, they can be useful tools for machine learning. 
For example, they can be applied to the design and training of neural networks that are seen as discrete forms of ","element":"span"}],[{"id":"id-110","style":{"width":"93%"},"width":1662,"height":467,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/23-4.png","element":"img"}],[{"text":"Table 2: LMMs: similarities and differences for integrating and learning dynamics.","element":"figcaption","subtype":"caption"}]]},{"heading":"DISCOVERY OF DYNAMICS 25","paragraphs":[[{"text":"dynamic systems [","element":"span"},{"href":"#id-112","referenceIndex":52,"text":"52","element":"a"},{"text":", ","element":"span"},{"href":"#id-113","referenceIndex":7,"text":"7","element":"a"},{"text":", ","element":"span"},{"href":"#id-114","referenceIndex":47,"text":"47","element":"a"},{"text":"]. Unlike such applications, the new study given here is motivated by recent interest in using machine learning [","element":"span"},{"href":"#id-69","referenceIndex":4,"text":"4","element":"a"},{"text":", ","element":"span"},{"href":"#id-3","referenceIndex":17,"text":"17","element":"a"},{"text":", ","element":"span"},{"href":"#id-70","referenceIndex":33,"text":"33","element":"a"},{"text":", ","element":"span"},{"href":"#id-71","referenceIndex":34,"text":"34","element":"a"},{"text":", ","element":"span"},{"href":"#id-7","referenceIndex":45,"text":"45","element":"a"},{"text":"] to formalize a variety of inverse problems such as learning dynamics using classical discretization techniques like LMMs. The change of the problem type from forward integration to inverse learning leads to different mathematical theory as illustrated in Table ","element":"span"},{"href":"#id-110","text":"2","element":"a"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-0.png","element":"img","alt":"1","inline":true},{"text":". 
Note that in particular, BDF provides a class of methods convergent for integrating and learning dynamics, while not all AB and AM methods can share the same conclusion. Our framework can be applied to check on other LMMs besides these examples. Furthermore, it will be interesting to explore if there are systematic ways to generate broader classes of LMMs good for both tasks of model-based time integration and data-driven learning.","element":"span"}],[{"text":"As discussed in Section ","element":"span"},{"href":"#id-66","text":"3.2","element":"a"},{"text":", our current study assumes the best possible case that the exact states along with suitable approximations to the initial dynamics are all given, together with the assumption that the neural network representation can produce zero residual for the LMM dynamics. While this setting is highly idealized, based on the conclusions drawn, we can speculate about the impact on the properties of stability and convergence caused by different choices of time discretization schemes for a more informed attempt at discovery of unknown dynamics in more practical settings. The latter leads to many interesting issues to be considered in the future. For instance, instead of assuming only data on the state with a loss function ","element":"span"},{"style":{"height":20.61},"width":236.6,"height":51.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-1.png","element":"img","alt":" T (˜x, ˜f, f NN","inline":true},{"text":"), we may consider a more general loss function with data on the state and dynamics, i.e. 
","element":"span"},{"style":{"height":21.41},"width":334.26,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-2.png","element":"img","alt":" T (˜x, ˜f, f NN, ˆf, ˆx","inline":true},{"text":"), given by","element":"span"}],[{"style":{"width":"81%"},"width":1437,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-3.png","element":"img"}],[{"text":"For LMMs with grid functions, the loss ","element":"span"},{"style":{"height":15.02},"width":35.18,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-4.png","element":"img","alt":" ℓ1","inline":true,"padRight":true},{"text":"associated with dynamics conformity comes from the discretization (","element":"span"},{"href":"#id-67","text":"3.1","element":"a"},{"text":"), and ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":17.6},"width":205.37,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-5.png","element":"img","alt":"f ∈ Γh[a, b","inline":true},{"text":"], the space of grid functions. The total loss can be taken as an expectation over training samplesand minimized to obtain some optimal representation of the state or dynamics. 
LMNet is an example where the conformity term is minimized over parameterized neural networks of various types, so that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.19},"width":177.68,"height":40.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/24-6.png","element":"img","alt":"f ≡ f NN","inline":true},{"text":", as studied in [","element":"span"},{"href":"#id-20","referenceIndex":37,"text":"37","element":"a"},{"text":", ","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":", ","element":"span"},{"href":"#id-27","referenceIndex":50,"text":"50","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"]. Whenever the term involving the LMM residual is accounted for, the framework developed in this paper would be relevant. ","element":"span"},{"text":"For the stable LMMs considered here, it should be possible to extend the convergence results for exact and complete data if the set of neural networks satisfies some universal approximation properties. ","element":"span"},{"text":"The convergence would be expected to be in the sense of function approximation, which would imply good generalization, at least among suitable classes of smooth dynamic systems. For systems displaying chaotic behavior and sharp transitions, new ideas are likely needed to ensure accurate discovery of the underlying complex dynamics.","element":"span"}],[{"text":"In this more general setting, neural network representations may also provide implicit regularization of the learned dynamics so that unstable LMMs could potentially be stabilized. However, regularization likely produces additional consistency error, so the convergence has to be examined more carefully. 
Moreover, we may consider compressed representations and treat incomplete data by promoting sparsity or exploring the use of partial physics as regularization ","element":"span"}]]},{"heading":"26 KELLER AND DU.","paragraphs":[[{"text":"to achieve physics-informed and data-driven discovery of the dynamics. Finally, there are many avenues of exploration to extend the results reported here. Some interesting topics for future studies include:","element":"span"}],[{"text":"1. the effects of regularization by specifying various forms of the regularization terms ","element":"span"},{"style":{"height":15.02},"width":217.25,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/25-0.png","element":"img","alt":"R1 and R2","inline":true},{"text":", such as those promoting smoothness, sparsity, and low dimensionality, and extending the above tasks to the study of the dynamics discovery problem with incomplete and uncertain data;","element":"span"}],[{"text":"2. different reduced-order models via choice of constrained representations on the dynamics or the state variables or both [","element":"span"},{"href":"#id-8","referenceIndex":3,"text":"3","element":"a"},{"text":", ","element":"span"},{"href":"#id-31","referenceIndex":55,"text":"55","element":"a"},{"text":"];","element":"span"}],[{"text":"3. extension of the stability framework to incorporate other multistep and multistage schemes such as predictor-corrector, Milne and Runge-Kutta [","element":"span"},{"href":"#id-25","referenceIndex":43,"text":"43","element":"a"},{"text":"];","element":"span"}],[{"text":"4. derivation of a general class of LMMs that are convergent for both the forward problem of time integration and the backward problem of dynamics discovery;","element":"span"}],[{"text":"5. 
the errors in numerically ","element":"span"},{"style":{"fontStyle":"italic"},"text":"integrated ","element":"span"},{"text":"states based on learned dynamics [","element":"span"},{"href":"#id-22","referenceIndex":39,"text":"39","element":"a"},{"text":"];","element":"span"}],[{"text":"6. distributed dynamic systems such as time-dependent PDEs, together with the additional effect due to spatial discretization;","element":"span"}],[{"text":"7. generalizing to the study of dynamics for a suitable set of initial conditions.","element":"span"}],[{"text":"Naturally, learning dynamics has strong connections to the subject of time-series prediction using deep learning [","element":"span"},{"href":"#id-115","referenceIndex":8,"text":"8","element":"a"},{"text":", ","element":"span"},{"href":"#id-116","referenceIndex":20,"text":"20","element":"a"},{"text":", ","element":"span"},{"href":"#id-117","referenceIndex":24,"text":"24","element":"a"},{"text":", ","element":"span"},{"href":"#id-118","referenceIndex":49,"text":"49","element":"a"},{"text":"]. Our current work may motivate further rigorous numerical analysis studies in that direction as well. To conclude, we see from this study that there are many new challenges in physics-based and data-driven modeling and simulations warranting further numerical analysis research.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"8. Acknowledgments. ","element":"span"},{"text":"The authors would like to thank the CM3 group at Columbia University for invigorating discussions, Wen Ding for his stimulating suggestions, and the referees and Associate Editor of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"SIAM Journal on Numerical Analysis ","element":"span"},{"text":"for their valuable comments.","element":"span"}]]},{"heading":"REFERENCES","paragraphs":[[{"id":"id-77","text":"[1] ","element":"span"},{"text":"R. P. 
Agarwal","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Difference equations and inequalities: theory, methods, and applications","element":"span"},{"text":", CRC Press, 2000.","element":"span"}],[{"id":"id-34","text":"[2] ","element":"span"},{"text":"K. Atkinson, W. Han, and D. E. Stewart","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Numerical solution of ordinary differential equations","element":"span"},{"text":", vol. 108, John Wiley & Sons, 2011.","element":"span"}],[{"id":"id-8","text":"[3] ","element":"span"},{"text":"K. Bhattacharya, B. Hosseini, N. B. Kovachki, and A. M. Stuart","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Model reduction and neural networks for parametric pdes","element":"span"},{"text":", arXiv preprint arXiv:2005.03180, (2020).","element":"span"}],[{"id":"id-69","text":"[4] ","element":"span"},{"text":"C. M. Bishop","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Pattern recognition and machine learning","element":"span"},{"text":", springer, 2006.","element":"span"}],[{"id":"id-0","text":"[5] ","element":"span"},{"text":"S. L. Brunton and J. N. Kutz","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Data-driven science and engineering: Machine learning, dynamical systems, and control","element":"span"},{"text":", Cambridge University Press, 2019.","element":"span"}],[{"id":"id-9","text":"[6] ","element":"span"},{"text":"S. L. Brunton, J. L. Proctor, and J. N. Kutz","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Discovering governing equations from data by sparse identification of nonlinear dynamical systems","element":"span"},{"text":", Proceedings of the National Academy of Sciences, 113 (2016), pp. 
3932–3937.","element":"span"}],[{"id":"id-113","text":"[7] ","element":"span"},{"text":"R. T. Chen, Y. Rubanova, J. Bettencourt, and D. K. Duvenaud","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Neural ordinary differential equations","element":"span"},{"text":", in Advances in neural information processing systems, 2018, pp. 6571–6583.","element":"span"}],[{"id":"id-115","text":"[8] ","element":"span"},{"text":"J. T. Connor, R. D. Martin, and L. E. Atlas","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Recurrent neural networks and robust time series prediction","element":"span"},{"text":", IEEE transactions on neural networks, 5 (1994), pp. 240–254.","element":"span"}],[{"id":"id-95","text":"[9] ","element":"span"},{"text":"D. M. Creedon and J. J. Miller","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The stability properties of q-step backward difference schemes","element":"span"},{"text":", BIT Numerical Mathematics, 15 (1975), pp. 244–249.","element":"span"}],[{"id":"id-93","text":"[10] ","element":"span"},{"text":"C. W. Cryer","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"On the instability of high order backward-difference multistep methods","element":"span"},{"text":", BIT Numerical Mathematics, 12 (1972), pp. 17–25.","element":"span"}],[{"id":"id-58","text":"[11] ","element":"span"},{"text":"G. Dahlquist","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Convergence and stability in the numerical integration of ordinary differential equations","element":"span"},{"text":", Mathematica Scandinavica, (1956), pp. 
33–53.","element":"span"}],[{"id":"id-35","text":"[12] ","element":"span"},{"text":"G. G. Dahlquist","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A special stability problem for linear multistep methods","element":"span"},{"text":", BIT Numerical Mathematics, 3 (1963), pp. 27–43.","element":"span"}],[{"id":"id-79","text":"[13] ","element":"span"},{"text":"R. C. Dorf and R. H. Bishop","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Modern control systems","element":"span"},{"text":", Pearson, 2011.","element":"span"}],[{"id":"id-94","text":"[14] ","element":"span"},{"text":"C. Fredebeul","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A-BDF: a generalization of the backward differentiation formulae","element":"span"},{"text":", SIAM journal on numerical analysis, 35 (1998), pp. 1917–1938.","element":"span"}],[{"id":"id-36","text":"[15] ","element":"span"},{"text":"W. Gautschi","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Numerical analysis","element":"span"},{"text":", Springer Science & Business Media, 1997.","element":"span"}],[{"id":"id-33","text":"[16] ","element":"span"},{"text":"H. H. Goldstine","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A History of Numerical Analysis from the 16th through the 19th Century","element":"span"},{"text":", vol. 2, Springer Science & Business Media, 2012.","element":"span"}],[{"id":"id-3","text":"[17] ","element":"span"},{"text":"I. Goodfellow, Y. Bengio, and A. Courville","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Deep learning","element":"span"},{"text":", MIT press, 2016.","element":"span"}],[{"id":"id-10","style":{"height":14.8},"width":1204.36,"height":36.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/26-0.png","element":"img","alt":"[18] N. S. 
Gulgec, Z. Shi, N. Deshmukh, S. Pakzad, and M. Takáč,","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"FD-Net with auxiliary time steps: Fast prediction of PDEs using Hessian-free trust-region methods","element":"span"},{"text":", arXiv preprint arXiv:1910.12680, (2019).","element":"span"}],[{"id":"id-11","text":"[19] ","element":"span"},{"text":"J. Han, A. Jentzen, and E. Weinan","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Solving high-dimensional partial differential equations using deep learning","element":"span"},{"text":", Proceedings of the National Academy of Sciences, 115 (2018), pp. 8505–8510.","element":"span"}],[{"id":"id-116","text":"[20] ","element":"span"},{"text":"M. Han, J. Xi, S. Xu, and F.-L. Yin","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Prediction of chaotic time series based on the recurrent predictor neural network","element":"span"},{"text":", IEEE transactions on signal processing, 52 (2004), pp. 3409–3416.","element":"span"}],[{"id":"id-37","text":"[21] ","element":"span"},{"text":"P. Henrici","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Discrete variable methods in ordinary differential equations","element":"span"},{"text":", (1962).","element":"span"}],[{"id":"id-4","text":"[22] ","element":"span"},{"text":"M. I. Jordan and T. M. Mitchell","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Machine learning: Trends, perspectives, and prospects","element":"span"},{"text":", Science, 349 (2015), pp. 255–260.","element":"span"}],[{"id":"id-12","text":"[23] ","element":"span"},{"text":"S. H. Kang, W. Liao, and Y. 
Liu","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Ident: Identifying differential equations with numerical time evolution","element":"span"},{"text":", arXiv preprint arXiv:1904.03538, (2019).","element":"span"}],[{"id":"id-117","text":"[24] ","element":"span"},{"text":"F. Karim, S. Majumdar, H. Darabi, and S. Chen","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"LSTM fully convolutional networks for time series classification","element":"span"},{"text":", IEEE access, 6 (2017), pp. 1662–1669.","element":"span"}],[{"id":"id-13","text":"[25] ","element":"span"},{"text":"I. G. Kevrekidis, C. W. Rowley, and M. O. Williams","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A kernel-based method for data-driven Koopman spectral analysis","element":"span"},{"text":", Journal of Computational Dynamics, 2 (2016), pp. 247–265.","element":"span"}],[{"id":"id-14","text":"[26] ","element":"span"},{"text":"Y. Khoo, J. Lu, and L. Ying","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Solving parametric PDE problems with artificial neural networks","element":"span"},{"text":", arXiv preprint arXiv:1707.03351, (2017).","element":"span"}],[{"id":"id-5","text":"[27] ","element":"span"},{"text":"A. Krizhevsky, I. Sutskever, and G. E. Hinton","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Imagenet classification with deep convolutional neural networks","element":"span"},{"text":", in Advances in neural information processing systems, 2012, pp. 1097–1105.","element":"span"}],[{"id":"id-6","text":"[28] ","element":"span"},{"text":"Y. LeCun, Y. Bengio, and G. Hinton","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Deep learning","element":"span"},{"text":", nature, 521 (2015), p. 
436.","element":"span"}],[{"id":"id-15","text":"[29] ","element":"span"},{"text":"Z. Long, Y. Lu, and B. Dong","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"PDE-Net 2.0: Learning PDEs from data with a numeric-symbolic hybrid deep network","element":"span"},{"text":", Journal of Computational Physics, 399 (2019), p. 108925.","element":"span"}],[{"id":"id-16","text":"[30] ","element":"span"},{"text":"F. Lu, M. Zhong, S. Tang, and M. Maggioni","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Nonparametric inference of interaction laws in systems of agents from trajectory data","element":"span"},{"text":", Proceedings of the National Academy of Sciences, 116 (2019), pp. 14424–14433.","element":"span"}],[{"id":"id-17","text":"[31] ","element":"span"},{"text":"C. Ma, J. Wang, and W. E","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Model reduction with memory and the machine learning of dynamical systems","element":"span"},{"text":", arXiv preprint arXiv:1808.04258, (2018).","element":"span"}],[{"id":"id-38","text":"[32] ","element":"span"},{"style":{"height":13.6},"width":1040.48,"height":33.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1912.12728/images/26-1.png","element":"img","alt":" D. Mayers and E. Süli, An introduction to numerical analysis","inline":true},{"text":", Cambridge University Press, 2003.","element":"span"}],[{"id":"id-70","text":"[33] ","element":"span"},{"text":"M. Mohri, A. Rostamizadeh, and A. Talwalkar","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Foundations of machine learning","element":"span"},{"text":", MIT press, 2018.","element":"span"}],[{"id":"id-71","text":"[34] ","element":"span"},{"text":"K. P. 
Murphy","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Machine learning: a probabilistic perspective","element":"span"},{"text":", MIT press, 2012.","element":"span"}],[{"id":"id-18","text":"[35] ","element":"span"},{"text":"S. Pan and K. Duraisamy","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Data-driven discovery of closure models","element":"span"},{"text":", SIAM Journal on Applied Dynamical Systems, 17 (2018), pp. 2381–2413.","element":"span"}],[{"id":"id-19","text":"[36] ","element":"span"},{"text":"E. Qian, B. Kramer, B. Peherstorfer, and K. Willcox","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Lift & learn: Physics-informed machine learning for large-scale nonlinear dynamical systems","element":"span"},{"text":", Physica D: Nonlinear Phenomena, 406 (2020), p. 132401.","element":"span"}],[{"id":"id-20","text":"[37] ","element":"span"},{"text":"T. Qin, K. Wu, and D. Xiu","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Data driven governing equations approximation using deep neural networks","element":"span"},{"text":", Journal of Computational Physics, 395 (2019), pp. 620–635.","element":"span"}],[{"id":"id-21","text":"[38] ","element":"span"},{"text":"M. Raissi","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Deep hidden physics models: Deep learning of nonlinear partial differential equations","element":"span"},{"text":", The Journal of Machine Learning Research, 19 (2018), pp. 932–955.","element":"span"}],[{"id":"id-22","text":"[39] ","element":"span"},{"text":"M. Raissi, P. Perdikaris, and G. E. 
Karniadakis","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Multistep neural networks for data-driven discovery of nonlinear dynamical systems","element":"span"},{"text":", arXiv preprint arXiv:1801.01236, (2018).","element":"span"}],[{"id":"id-23","text":"[40] ","element":"span"},{"text":"M. Raissi, P. Perdikaris, and G. E. Karniadakis","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Numerical Gaussian processes for time-dependent and nonlinear partial differential equations","element":"span"},{"text":", SIAM Journal on Scientific Computing, 40 (2018), pp. A172–A198, ","element":"span"},{"href":"https://doi.org/10.1137/17M1120762","text":"https://doi.org/10.1137/17M1120762","element":"a"},{"text":".","element":"span"}],[{"id":"id-24","text":"[41] ","element":"span"},{"text":"M. Raissi, P. Perdikaris, and G. E. Karniadakis","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations","element":"span"},{"text":", Journal of Computational Physics, 378 (2019), pp. 686–707.","element":"span"}],[{"id":"id-2","text":"[42] ","element":"span"},{"text":"S. H. Rudy, S. L. Brunton, J. L. Proctor, and J. N. Kutz","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Data-driven discovery of partial differential equations","element":"span"},{"text":", Science Advances, 3 (2017), p. e1602614.","element":"span"}],[{"id":"id-25","text":"[43] ","element":"span"},{"text":"S. H. Rudy, J. N. Kutz, and S. L. 
Brunton","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Deep learning of dynamics and signal-noise decomposition with time-stepping constraints","element":"span"},{"text":", Journal of Computational Physics, 396 (2019), pp. 483–506.","element":"span"}],[{"id":"id-1","text":"[44] ","element":"span"},{"text":"M. Schmidt and H. Lipson","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Distilling free-form natural laws from experimental data","element":"span"},{"text":", science, 324 (2009), pp. 81–85.","element":"span"}],[{"id":"id-7","text":"[45] ","element":"span"},{"text":"S. Shalev-Shwartz and S. Ben-David","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Understanding machine learning: From theory to algorithms","element":"span"},{"text":", Cambridge University Press, 2014.","element":"span"}],[{"id":"id-78","text":"[46] ","element":"span"},{"text":"S. Strelitz","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"On the Routh-Hurwitz problem","element":"span"},{"text":", The American Mathematical Monthly, 84 (1977), pp. 542–544.","element":"span"}],[{"id":"id-114","text":"[47] ","element":"span"},{"text":"Q. Sun, Y. Tao, and Q. Du","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Stochastic training of residual networks: a differential equation viewpoint","element":"span"},{"text":", arXiv preprint arXiv:1812.00174, (2018).","element":"span"}],[{"id":"id-26","text":"[48] ","element":"span"},{"text":"Y. Sun, L. Zhang, and H. 
Schaeffer","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Neupde: Neural network based ordinary and partial differential equations for modeling time-dependent data","element":"span"},{"text":", arXiv preprint arXiv:1908.03190, (2019).","element":"span"}],[{"id":"id-118","text":"[49] ","element":"span"},{"text":"Y. Tao, L. Ma, W. Zhang, J. Liu, W. Liu, and Q. Du","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Hierarchical attention-based recurrent highway networks for time series prediction","element":"span"},{"text":", arXiv preprint arXiv:1806.00685, (2018).","element":"span"}],[{"id":"id-27","text":"[50] ","element":"span"},{"text":"R. Tipireddy, P. Perdikaris, P. Stinis, and A. Tartakovsky","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A comparative study of physics-informed neural network models for learning unknown dynamics and constitutive relations","element":"span"},{"text":", 2019, ","element":"span"},{"href":"https://arxiv.org/abs/1904.04058","text":"https://arxiv.org/abs/1904.04058","element":"a"},{"text":".","element":"span"}],[{"id":"id-28","text":"[51] ","element":"span"},{"text":"M. Wang, H.-X. Li, X. Chen, and Y. Chen","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Deep learning-based model reduction for distributed parameter systems","element":"span"},{"text":", IEEE Transactions on Systems, Man, and Cybernetics: ","element":"span"},{"text":"Systems, 46 (2016), pp. 1664–1674.","element":"span"}],[{"id":"id-112","text":"[52] ","element":"span"},{"text":"E. Weinan","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A proposal on machine learning via dynamical systems","element":"span"},{"text":", Communications in Mathematics and Statistics, 5 (2017), pp. 1–11.","element":"span"}],[{"id":"id-29","text":"[53] ","element":"span"},{"text":"M. O. 
Williams, I. G. Kevrekidis, and C. W. Rowley","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A data–driven approximation of the Koopman operator: Extending dynamic mode decomposition","element":"span"},{"text":", Journal of Nonlinear Science, 25 (2015), pp. 1307–1346.","element":"span"}],[{"id":"id-30","text":"[54] ","element":"span"},{"text":"K. Wu and D. Xiu","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Data-driven deep learning of partial differential equations in modal space","element":"span"},{"text":", arXiv preprint arXiv:1910.06948, (2019).","element":"span"}],[{"id":"id-31","text":"[55] ","element":"span"},{"text":"X. Xie, G. Zhang, and C. G. Webster","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Non-intrusive inference reduced order modeling of fluid dynamics using linear multistep network","element":"span"},{"text":", arXiv preprint arXiv:1809.07820, (2018).","element":"span"}],[{"id":"id-32","text":"[56] ","element":"span"},{"text":"Y. Zhu and N. Zabaras","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Bayesian deep convolutional encoder–decoder networks for surrogate modeling and uncertainty quantification","element":"span"},{"text":", Journal of Computational Physics, 366 (2018), pp. 415–447.","element":"span"}]]}],"_version":"3.3.2"},"paperNode":"$28:props:children:props:children:0:props:product"}]]