36:[["$","audio",null,{"id":"tts"}],["$","$L3b",null,{"paperID":"1808.00924","publisher":"arxiv","paperJSON":{"title":"The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems","paperID":"1808.00924","avgLineHeight":10.91,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"style":{"fontWeight":"bold"},"text":": ","element":"span"},{"text":"$3c","element":"span"}],[{"style":{"width":"70%"},"width":1119,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/0-0.png","element":"img"}]]},{"heading":"1 Introduction","paragraphs":[[{"text":"Safety is among the foremost open problems in robotics and artificial intelligence [","element":"span"},{"href":"#id-0","referenceIndex":1,"text":"1","element":"a"},{"text":"]. Many autonomous systems, such as self-driving cars and robots for palliative care, are safety-critical due to their interaction with human life. At the same time, learning is necessary for these systems to perform well in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a priori ","element":"span"},{"text":"unknown environments. During learning, they must ","element":"span"},{"style":{"fontStyle":"italic"},"text":"safely explore ","element":"span"},{"text":"their environment by avoiding dangerous states from which they cannot recover. For example, consider an autonomous robot in an outdoor environment affected by rough terrain and adverse weather conditions. These factors introduce uncertainty about the relationship between the robot’s speed and maneuverability. While the robot should learn about its capabilities in such conditions, it must not perform a maneuver at a high speed that would cause it to crash. Conversely, traveling at only slow speeds to avoid accidents is not conducive to learning about the extent of the robot’s capabilities.","element":"span"}],[{"text":"To ensure ","element":"span"},{"style":{"fontStyle":"italic"},"text":"safe learning","element":"span"},{"text":", we must verify a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"safety certificate ","element":"span"},{"text":"for a state before it is explored. In control theory, a set of states is safe if system trajectories are bounded within it and asymptotically converge to a fixed point under a fixed control policy. Within such a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"region of attraction (ROA) ","element":"span"},{"text":"[","element":"span"},{"href":"#id-1","referenceIndex":2,"text":"2","element":"a"},{"text":"], the system can collect data during learning and can always recover to a known safe point. In this paper, we leverage Lyapunov stability theory to construct provable, neural network-based safety certificates, and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"adapt ","element":"span"},{"text":"them to the size and shape of the largest ROA of a general nonlinear dynamical system.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Related work ","element":"span"},{"text":"Lyapunov functions are convenient tools for stability (i.e., safety) certification of dynamical systems [","element":"span"},{"href":"#id-1","referenceIndex":2,"text":"2","element":"a"},{"text":"] and for ROA estimation [","element":"span"},{"href":"#id-2","referenceIndex":3,"text":"3","element":"a"},{"text":", ","element":"span"},{"href":"#id-3","referenceIndex":4,"text":"4","element":"a"},{"text":", ","element":"span"},{"href":"#id-4","referenceIndex":5,"text":"5","element":"a"},{"text":"]. These functions encode long-term behaviour of state trajectories in a scalar value [","element":"span"},{"href":"#id-5","referenceIndex":6,"text":"6","element":"a"},{"text":"], such that a ROA can be encoded as a level set of the Lyapunov function. However, Lyapunov functions for general dynamical systems are difficult to find; computational approaches are surveyed in [","element":"span"},{"href":"#id-6","referenceIndex":7,"text":"7","element":"a"},{"text":"]. A Lyapunov function can be identified efficiently via a semi-definite program (SDP, [","element":"span"},{"href":"#id-7","referenceIndex":8,"text":"8","element":"a"},{"text":"]) when the dynamics are polynomial and the Lyapunov function is restricted to be a sum-of-squares (SOS) polynomial [","element":"span"},{"href":"#id-8","referenceIndex":9,"text":"9","element":"a"},{"text":"]. Other methods to compute ROAs include maximization of a measure of ROA volume over system trajectories [","element":"span"},{"href":"#id-9","referenceIndex":10,"text":"10","element":"a"},{"text":"], and sampling-based approaches that generalize information about stability at discrete points to a continuous region [","element":"span"},{"href":"#id-10","referenceIndex":11,"text":"11","element":"a"},{"text":"].","element":"span"}],[{"text":"This paper is particularly concerned with safety certificates for dynamical systems with uncertainties in the form of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"model errors","element":"span"},{"text":". In robust control [","element":"span"},{"href":"#id-11","referenceIndex":12,"text":"12","element":"a"},{"text":"], the formulation of SDPs with SOS Lyapunov functions is used to compute ROA estimates for uncertain linear dynamical systems with the assumption of a worst-case linear perturbation from a known bounded set [","element":"span"},{"href":"#id-12","referenceIndex":13,"text":"13","element":"a"},{"text":", ","element":"span"},{"href":"#id-13","referenceIndex":14,"text":"14","element":"a"},{"text":"]. Learning-based control methods with a Gaussian process (GP, [","element":"span"},{"href":"#id-14","referenceIndex":15,"text":"15","element":"a"},{"text":"]) model of the system instead consider uncertainty in a Bayesian manner, where model errors are reduced in regions where data has been collected. The methods in [","element":"span"},{"href":"#id-15","referenceIndex":16,"text":"16","element":"a"},{"text":", ","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"] estimate a ROA with Lyapunov stability certificates computed on a discretization of the state space, which is used for safe reinforcement learning (RL, [","element":"span"},{"href":"#id-17","referenceIndex":18,"text":"18","element":"a"},{"text":"]). The Lyapunov function is assumed to be given in [","element":"span"},{"href":"#id-15","referenceIndex":16,"text":"16","element":"a"},{"text":"], while [","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"] uses the negative value (i.e., cost) function from RL with a quadratic reward. Ultimately, this approach is limited by a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"shape mismatch ","element":"span"},{"text":"between level sets of the Lyapunov function and the true largest ROA. For example, a quadratic Lyapunov function has ellipsoidal level sets, which cannot characterize a non-ellipsoidal ROA, while the SOS approach is restricted to fixed monomial features. To improve safe exploration for general nonlinear dynamics, we want to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"learn ","element":"span"},{"text":"these features to determine a Lyapunov function with suitably shaped level sets.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Contributions ","element":"span"},{"text":"In this paper, we present a novel method for learning accurate safety certificates for general nonlinear dynamical systems. We construct a neural network Lyapunov candidate and, unlike past work in [","element":"span"},{"href":"#id-18","referenceIndex":19,"text":"19","element":"a"},{"text":", ","element":"span"},{"href":"#id-19","referenceIndex":20,"text":"20","element":"a"},{"text":"], we structure our candidate such that it ","element":"span"},{"style":{"fontStyle":"italic"},"text":"always ","element":"span"},{"text":"inherently yields a provable safety certificate. Then, we specify a training algorithm that adapts the candidate to the shape of the dynamical system’s trajectories via classification of states as safe or unsafe. We do not depend on any specific structure of the dynamics for this. We show how our construction relates to SOS Lyapunov functions, and compare our approach to others on a simulated inverted pendulum benchmark. We also discuss how our method can be used to make safe learning more effective.","element":"span"}]]},{"heading":"2 Problem Statement and Background","paragraphs":[[{"text":"We consider a discrete-time, time-invariant, deterministic dynamical system of the form","element":"span"}],[{"style":{"width":"59%"},"width":936,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":11.6},"width":96.71,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-1.png","element":"img","alt":" t ∈ N","inline":true,"padRight":true},{"text":"is the time step index, and ","element":"span"},{"style":{"height":15.78},"width":229.32,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-2.png","element":"img","alt":" xt ∈ X ⊂ Rd","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":13.19},"width":225.24,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-3.png","element":"img","alt":" ut ∈ U ⊂ Rp","inline":true,"padRight":true},{"text":"are the state and control inputs respectively at time step ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". The system is controlled by a feedback policy ","element":"span"},{"style":{"height":11.2},"width":178.17,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-4.png","element":"img","alt":" π: X → U","inline":true,"padRight":true},{"text":"and the resulting closed-loop dynamical system is given by ","element":"span"},{"style":{"height":16},"width":247.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-5.png","element":"img","alt":" xt+1 = fπ(xt)","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":16},"width":330.94,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-6.png","element":"img","alt":" fπ(x) = f(x, π(x))","inline":true},{"text":". We assume this policy is given, but it can, for example, be computed online with RL or optimal control. This policy ","element":"span"},{"style":{"height":6.8},"width":23,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-7.png","element":"img","alt":" π","inline":true,"padRight":true},{"text":"is safe to use within a subset ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-8.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"of the state space ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". The set ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-9.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is a ROA for ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-10.png","element":"img","alt":" fπ","inline":true},{"text":", i.e., every system trajectory of ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-11.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"that begins at some ","element":"span"},{"style":{"height":13.19},"width":116.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-12.png","element":"img","alt":" x ∈ Sπ","inline":true,"padRight":true},{"text":"also remains in ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-13.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"and asymptotically approaches an ","element":"span"},{"style":{"fontStyle":"italic"},"text":"equilibrium point ","element":"span"},{"style":{"height":13.19},"width":149.11,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-14.png","element":"img","alt":" xO ∈ Sπ","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":16},"width":231.73,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-15.png","element":"img","alt":" fπ(xO) = xO","inline":true,"padRight":true},{"text":"[","element":"span"},{"href":"#id-1","referenceIndex":2,"text":"2","element":"a"},{"text":"]. We assume ","element":"span"},{"style":{"height":12.79},"width":133.41,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-16.png","element":"img","alt":" xO = 0","inline":true,"padRight":true},{"text":"without loss of generality. Hereafter, we use ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-17.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"to denote the true largest ROA in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"under the policy ","element":"span"},{"style":{"height":6.8},"width":23,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-18.png","element":"img","alt":" π","inline":true},{"text":".","element":"span"}],[{"text":"A reliable estimate of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-19.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is critical to online learning systems, since we need to ensure that a policy is safe to use on the real system before it can be deployed. The goal of this paper is to estimate the largest safe set ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-20.png","element":"img","alt":" Sπ","inline":true},{"text":". We must also ensure safety by never overestimating ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-21.png","element":"img","alt":" Sπ","inline":true},{"text":", i.e., we must not identify unsafe states as safe. For this to be feasible, we make a regularity assumption about the closed-loop dynamics; we assume ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-22.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"is Lipschitz continuous on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"with Lipschitz constant ","element":"span"},{"style":{"height":15.59},"width":183.39,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-23.png","element":"img","alt":" Lfπ ∈ R>0","inline":true},{"text":". This is a weak assumption and is even satisfied when a neural network policy is used [","element":"span"},{"href":"#id-20","referenceIndex":21,"text":"21","element":"a"},{"text":"].","element":"span"}],[{"id":"id-25","style":{"fontWeight":"bold"},"text":"2.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Safety Certification with Lyapunov Functions","element":"span"}],[{"text":"One way to estimate the safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-24.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is by using a Lyapunov function. Given a suitable Lyapunov function ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":", a safe region for the closed-loop dynamical system ","element":"span"},{"style":{"height":16},"width":242.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-25.png","element":"img","alt":" xt+1 = fπ(xt)","inline":true,"padRight":true},{"text":"can be determined.","element":"span"}],[{"id":"id-22","style":{"fontWeight":"bold"},"text":"Theorem 1 (Lyapunov’s stability theorem ","element":"span"},{"text":"[","element":"span"},{"href":"#id-5","referenceIndex":6,"text":"6","element":"a"},{"text":"]","element":"span"},{"style":{"fontWeight":"bold"},"text":"): ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-26.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is locally Lipschitz continuous and has an equilibrium point at ","element":"span"},{"style":{"height":12.79},"width":126.78,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-27.png","element":"img","alt":" xO = 0","inline":true},{"style":{"fontStyle":"italic"},"text":". Let ","element":"span"},{"style":{"height":11.2},"width":179.2,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-28.png","element":"img","alt":" v : X → R","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be locally Lipschitz continuous on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"style":{"fontStyle":"italic"},"text":". If there exists a set ","element":"span"},{"style":{"height":13.2},"width":144.78,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-29.png","element":"img","alt":" Dv ⊆ X","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"containing ","element":"span"},{"style":{"fontWeight":"bold"},"text":"0 ","element":"span"},{"style":{"fontStyle":"italic"},"text":"on which ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is positive-definite and ","element":"span"},{"style":{"height":16},"width":538.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-30.png","element":"img","alt":" ∆v(x) := v(fπ(x)) − v(x) < 0","inline":true},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"style":{"height":16},"width":247.91,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-31.png","element":"img","alt":"∀x ∈ Dv \\ {0}","inline":true},{"style":{"fontStyle":"italic"},"text":", then ","element":"span"},{"style":{"height":12.79},"width":128.73,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-32.png","element":"img","alt":" xO = 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is an asymptotically stable equilibrium. In this case, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is known as a Lyapunov function for the closed-loop dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-33.png","element":"img","alt":" fπ","inline":true},{"style":{"fontStyle":"italic"},"text":", and ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/1-34.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the Lyapunov decrease region for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"id":"id-21","style":{"width":"81%"},"width":1295,"height":497,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-0.png","element":"img"}],[{"text":"Figure 1. ","element":"figcaption","subtype":"caption"},{"href":"#id-21","text":"Fig. 1a ","element":"a","subtype":"caption"},{"text":"illustrates a shape mismatch between the largest level set ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"V","element":"figcaption","subtype":"caption"},{"text":"(","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"c","element":"figcaption","subtype":"caption"},{"text":") ","element":"figcaption","subtype":"caption"},{"text":"(blue ellipsoid) of a quadratic Lyapunov function ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"v ","element":"figcaption","subtype":"caption"},{"text":"contained within the decrease region ","element":"figcaption","subtype":"caption"},{"style":{"height":11.59},"width":43.47,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-1.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"text":"(green dashes), and the safe region ","element":"figcaption","subtype":"caption"},{"style":{"height":12.8},"width":163.16,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-2.png","element":"img","alt":" Sπ (black).","inline":true,"padRight":true},{"text":"We cannot certify all of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.6},"width":148.18,"height":28.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-3.png","element":"img","alt":" Sπ with v","inline":true},{"text":", which limits exploration in safe learning. Instead, we train a Lyapunov candidate ","element":"figcaption","subtype":"caption"},{"style":{"height":8.9},"width":33.9,"height":22.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-4.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"with parameters ","element":"figcaption","subtype":"caption"},{"style":{"height":11.99},"width":201.24,"height":29.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-5.png","element":"img","alt":" θ to match Sπ","inline":true,"padRight":true},{"text":"with a level set ","element":"figcaption","subtype":"caption"},{"href":"#id-21","style":{"height":14.4},"width":300.53,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-6.png","element":"img","alt":" Vθ(cS), as in Fig. 1b","inline":true},{"text":", via classification of sampled states as “safe” with ground-truth label ","element":"figcaption","subtype":"caption"},{"style":{"height":12.8},"width":305.13,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-7.png","element":"img","alt":" y = +1 (i.e., x ∈ Sπ","inline":true},{"text":") or “unsafe” with ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":329.3,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-8.png","element":"img","alt":" y = −1 (i.e., x /∈ Sπ).","inline":true}],[{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"states that a Lyapunov function ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"characterizes a “basin” of safe states where trajectories of ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-9.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"“fall” towards the origin ","element":"span"},{"style":{"height":12.79},"width":130.7,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-10.png","element":"img","alt":" xO = 0","inline":true},{"text":". If we can find a positive-definite ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"such that the dynamics always map downwards in the value of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":")","element":"span"},{"text":", then trajectories eventually reach ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") = ","element":"span"},{"style":{"fontWeight":"bold"},"text":"0","element":"span"},{"text":", thus ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontWeight":"bold"},"text":"0","element":"span"},{"text":". To find a ROA, rather than checking if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"decreases along entire trajectories, it is sufficient to verify the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"one-step decrease condition ","element":"span"},{"style":{"height":16},"width":182.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-11.png","element":"img","alt":" ∆v(x) < 0","inline":true,"padRight":true},{"text":"for every state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"in a level set of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Corollary 1 (Safe level sets ","element":"span"},{"text":"[","element":"span"},{"href":"#id-5","referenceIndex":6,"text":"6","element":"a"},{"text":"]","element":"span"},{"style":{"fontWeight":"bold"},"text":"): ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Every level set ","element":"span"},{"style":{"height":19.2},"width":585.07,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-12.png","element":"img","alt":" V(c) := �x | v(x) ≤ c�, c ∈ R>0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"contained within the decrease region ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-13.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is invariant under ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-14.png","element":"img","alt":" fπ","inline":true},{"style":{"fontStyle":"italic"},"text":". That is, ","element":"span"},{"style":{"height":16},"width":422.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-15.png","element":"img","alt":" fπ(x) ∈ V(c), ∀x ∈ V(c)","inline":true},{"style":{"fontStyle":"italic"},"text":". Furthermore, ","element":"span"},{"style":{"height":13.59},"width":253.91,"height":33.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-16.png","element":"img","alt":" limt→∞ xt = 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for every ","element":"span"},{"style":{"height":9.59},"width":36.19,"height":23.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-17.png","element":"img","alt":" xt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"in these level sets, so each one is a ROA for ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-18.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":12.79},"width":126.78,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-19.png","element":"img","alt":" xO = 0","inline":true},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"text":"Intuitively, if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") ","element":"span"},{"text":"decreases everywhere in the level set ","element":"span"},{"style":{"height":16},"width":94.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-20.png","element":"img","alt":" V(c1)","inline":true},{"text":", except at ","element":"span"},{"style":{"height":12.79},"width":131.99,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-21.png","element":"img","alt":" xO = 0","inline":true,"padRight":true},{"text":"where it is zero, then ","element":"span"},{"style":{"height":16},"width":94.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-22.png","element":"img","alt":" V(c1)","inline":true,"padRight":true},{"text":"is invariant, since the image of ","element":"span"},{"style":{"height":16},"width":94.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-23.png","element":"img","alt":" V(c1)","inline":true,"padRight":true},{"text":"under ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-24.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"is the smaller level set ","element":"span"},{"style":{"height":16},"width":94.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-25.png","element":"img","alt":" V(c2)","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":11.19},"width":121.5,"height":27.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-26.png","element":"img","alt":" c2 < c1","inline":true},{"text":". If ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"is also positive-definite, then this ensures trajectories that start in a level set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"contained in the decrease region ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-27.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"text":"remain in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"and converge to ","element":"span"},{"style":{"height":12.79},"width":139.97,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-28.png","element":"img","alt":" xO = 0","inline":true},{"text":". To identify safe level sets, we must check if a given ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Lyapunov candidate ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"satisfies the conditions of ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":". However, the decrease condition ","element":"span"},{"style":{"height":16},"width":195.47,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-29.png","element":"img","alt":" ∆v(x) < 0","inline":true,"padRight":true},{"text":"is difficult to verify throughout a continuous subset ","element":"span"},{"style":{"height":13.2},"width":149,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-30.png","element":"img","alt":" Dv ⊆ X","inline":true},{"text":". It is sufficient to verify the tightened safety certificate ","element":"span"},{"style":{"height":16},"width":293.48,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-31.png","element":"img","alt":" ∆v(x) < −L∆vτ","inline":true,"padRight":true},{"text":"at a finite set of points that cover ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-32.png","element":"img","alt":" Dv","inline":true},{"text":", where ","element":"span"},{"style":{"height":14.39},"width":201.06,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-33.png","element":"img","alt":" L∆v ∈ R>0","inline":true,"padRight":true},{"text":"is the Lipschitz constant of ","element":"span"},{"style":{"height":11.6},"width":52.21,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-34.png","element":"img","alt":" ∆v","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":14.39},"width":150.75,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-35.png","element":"img","alt":" τ ∈ R>0","inline":true,"padRight":true},{"text":"is a measure of how densely the points cover ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-36.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"text":"[","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"]. We can even couple this with bounds on ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-37.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"from a statistical model to certify high-probability safe sets with the certificate ","element":"span"},{"style":{"height":16},"width":286.5,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-38.png","element":"img","alt":" ∆ˆv(x) < −L∆vτ","inline":true},{"text":", where ","element":"span"},{"style":{"height":16},"width":109.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-39.png","element":"img","alt":" ∆ˆv(x)","inline":true,"padRight":true},{"text":"is an upper confidence bound on ","element":"span"},{"style":{"height":16},"width":109.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-40.png","element":"img","alt":" ∆v(x)","inline":true},{"text":". A GP model of ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-41.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"is used for this purpose in [","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"].","element":"span"}],[{"id":"id-26","style":{"fontWeight":"bold"},"text":"2.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Computing SOS Lyapunov Functions","element":"span"}],[{"text":"In general, a suitable Lyapunov candidate ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"is difficult to find. Computational methods often restrict ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"to a particular function class for tractability. The SOS approach restricts ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") ","element":"span"},{"text":"to be polynomial, but is limited to polynomial dynamical systems, i.e., when ","element":"span"},{"style":{"height":16},"width":96.82,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-42.png","element":"img","alt":" fπ(x)","inline":true,"padRight":true},{"text":"is a vector of polynomials in the elements of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"[","element":"span"},{"href":"#id-8","referenceIndex":9,"text":"9","element":"a"},{"text":", ","element":"span"},{"href":"#id-23","referenceIndex":22,"text":"22","element":"a"},{"text":", ","element":"span"},{"href":"#id-24","referenceIndex":23,"text":"23","element":"a"},{"text":"]. In particular, the SOS approach enforces ","element":"span"},{"style":{"height":16},"width":375.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-43.png","element":"img","alt":" v(x) = m(x)⊤Qm(x)","inline":true},{"text":", where ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") ","element":"span"},{"text":"is a vector of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a priori ","element":"span"},{"text":"fixed monomial features in the elements of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":", and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Q ","element":"span"},{"text":"is an unknown positive-semidefinite matrix. This makes ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") ","element":"span"},{"text":"a quadratic function on a monomial ","element":"span"},{"style":{"fontStyle":"italic"},"text":"feature space","element":"span"},{"text":". A SDP can be efficiently solved to yield a ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Q ","element":"span"},{"text":"that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"simultaneously ","element":"span"},{"text":"guarantees that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"satisfies the assumptions of ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"and has the largest possible level set in its decrease region ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-44.png","element":"img","alt":" Dv","inline":true},{"text":". That is, the positive-definiteness of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"and the negative-definiteness of ","element":"span"},{"style":{"height":11.6},"width":52.21,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-45.png","element":"img","alt":" ∆v","inline":true,"padRight":true},{"text":"in ","element":"span"},{"style":{"height":13.19},"width":46.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-46.png","element":"img","alt":" Dv","inline":true,"padRight":true},{"text":"are enforced as constraints in the SDP. This contrasts the more general approach described in ","element":"span"},{"href":"#id-25","text":"Sec. 2.1","element":"a"},{"text":", where a Lyapunov candidate ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"is given and then the assumptions of ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"are verified by checking discrete points.","element":"span"}],[{"text":"With the SOS approach and a suitable choice of ","element":"span"},{"style":{"height":16},"width":151.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-47.png","element":"img","alt":" m(x), Sπ","inline":true,"padRight":true},{"text":"can be estimated well with a level set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":", since the monomial features allow Lyapunov functions with shapes beyond simple ellipsoids to be found. However, the SOS approach requires polynomial dynamics, and the best choice of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":") ","element":"span"},{"text":"can be difficult to determine. Without a suitable Lyapunov function, we face the problem of a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"shape mismatch ","element":"span"},{"text":"between ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":") ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-48.png","element":"img","alt":" Sπ","inline":true},{"text":". This is exemplified in ","element":"span"},{"href":"#id-21","text":"Fig. 1a","element":"a"},{"text":", where level sets of quadratic ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"are ellipsoidal while ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/2-49.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is not, which limits the region of the state space that is certifiable as safe by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v","element":"span"},{"text":".","element":"span"}]]},{"heading":"3 Learning Lyapunov Candidates","paragraphs":[[{"text":"In this section, we establish a more flexible class of parameterized Lyapunov candidates that can satisfy the assumptions on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"in ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"by virtue of their structure and gradient-based parameter training. In particular, we show how a binary classification problem based on whether each state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"lies within the safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-0.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"can be formulated to train the parameterized Lyapunov candidate.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Construction of a Neural Network Lyapunov Function","element":"span"}],[{"text":"We take the SOS approach in ","element":"span"},{"href":"#id-26","text":"Sec. 2.2 ","element":"a"},{"text":"as a starting point to construct a neural network Lyapunov candidate. The SOS Lyapunov candidate ","element":"span"},{"style":{"height":16},"width":371.79,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-1.png","element":"img","alt":" v(x) = m(x)⊤Qm(x)","inline":true,"padRight":true},{"text":"is a Euclidean inner product on the transformed space ","element":"span"},{"style":{"height":19.2},"width":385.73,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-2.png","element":"img","alt":" Y :=�φ(x), ∀x ∈ X�","inline":true},{"text":"with ","element":"span"},{"style":{"height":19.4},"width":327.8,"height":48.49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-3.png","element":"img","alt":" φ(x) := Q1/2m(x)","inline":true},{"text":". The ability of the SOS Lyapunov candidate ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"to certify safe states for ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-4.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"depends on the choice of monomials in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":")","element":"span"},{"text":". We interpret these choices as engineered features that define the expressiveness of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"v ","element":"span"},{"text":"in delineating the decision boundary between safe and unsafe states. Rather than choose such features manually and parameterize ","element":"span"},{"style":{"height":16},"width":79.43,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-5.png","element":"img","alt":" φ(x)","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Q ","element":"span"},{"text":"only, we propose the Lyapunov candidate ","element":"span"},{"style":{"height":16},"width":365.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-6.png","element":"img","alt":" vθ(x) = φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"text":"to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"learn ","element":"span"},{"text":"the requisite features, where ","element":"span"},{"style":{"height":16.59},"width":243.35,"height":41.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-7.png","element":"img","alt":" φθ : Rd → RD","inline":true,"padRight":true},{"text":"is a feed-forward neural network with parameter vector ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-8.png","element":"img","alt":" θ","inline":true},{"text":". Feed-forward neural networks are expressive in that they can approximate any continuous function on compact subsets of ","element":"span"},{"style":{"height":13.38},"width":45.78,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-9.png","element":"img","alt":" Rd","inline":true,"padRight":true},{"text":"with a finite number of parameters [","element":"span"},{"href":"#id-27","referenceIndex":24,"text":"24","element":"a"},{"text":", ","element":"span"},{"href":"#id-28","referenceIndex":25,"text":"25","element":"a"},{"text":"]. In ","element":"span"},{"href":"#id-29","text":"Sec. 3.2","element":"a"},{"text":", we exploit this property together with gradient-based parameter training to closely match the true ROA ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-10.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"with a level set of the candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-11.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"without the need to engineer individual features of ","element":"span"},{"style":{"height":14},"width":24,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-12.png","element":"img","alt":" φ","inline":true},{"text":".","element":"span"}],[{"text":"We cannot use an arbitrary feed-forward neural network ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-13.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"in our Lyapunov candidate, since the conditions of ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"must be satisfied. Otherwise, the resulting candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-14.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"cannot provide any safety information. In general, ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-15.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"is a sequence of function compositions or layers. Each layer has the form ","element":"span"},{"style":{"height":16},"width":422.57,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-16.png","element":"img","alt":" yℓ(x) = ϕℓ(Wℓyℓ−1(x))","inline":true},{"text":", where ","element":"span"},{"style":{"height":16},"width":95.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-17.png","element":"img","alt":" yℓ(x)","inline":true,"padRight":true},{"text":"is the output of layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-18.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"for state ","element":"span"},{"style":{"height":14},"width":169.45,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-19.png","element":"img","alt":" x ∈ X, ϕℓ","inline":true,"padRight":true},{"text":"is a fixed element-wise activation function, and ","element":"span"},{"style":{"height":16},"width":199.98,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-20.png","element":"img","alt":" Wℓyℓ−1(x)","inline":true,"padRight":true},{"text":"is a linear transformation parameterized by ","element":"span"},{"style":{"height":15.77},"width":272.49,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-21.png","element":"img","alt":" Wℓ ∈ Rdℓ×dℓ−1","inline":true},{"text":". To satisfy the assumptions of ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":", ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-22.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"must be Lipschitz continuous on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"and positive-definite on some subset of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"around ","element":"span"},{"style":{"height":12.79},"width":133.44,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-23.png","element":"img","alt":" xO = 0","inline":true},{"text":". To this end, we restrict ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-24.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to be positive-definite and Lipschitz continuous on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"for ","element":"span"},{"style":{"fontStyle":"italic"},"text":"all ","element":"span"},{"text":"values of ","element":"span"},{"style":{"height":16.78},"width":202.4,"height":41.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-25.png","element":"img","alt":" θ := {Wℓ}ℓ","inline":true,"padRight":true},{"text":"with a suitable choice of structure for ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-26.png","element":"img","alt":" φθ","inline":true},{"text":".","element":"span"}],[{"id":"id-30","style":{"fontWeight":"bold"},"text":"Theorem 2 (Lyapunov neural network): ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider ","element":"span"},{"style":{"height":16},"width":370.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-27.png","element":"img","alt":" vθ(x) = φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"as a Lyapunov candidate function, where ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-28.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a feed-forward neural network. Suppose, for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-29.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"in ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-30.png","element":"img","alt":" φθ","inline":true},{"style":{"fontStyle":"italic"},"text":", the activation function ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-31.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and weight matrix ","element":"span"},{"style":{"height":15.78},"width":266.91,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-32.png","element":"img","alt":" Wℓ ∈ Rdℓ×dℓ−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"each have a trivial nullspace. Then ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-33.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"has a trivial nullspace, and ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-34.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is positive-definite with ","element":"span"},{"style":{"height":16},"width":171.06,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-35.png","element":"img","alt":" vθ(0) = 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":16},"width":436.66,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-36.png","element":"img","alt":" vθ(x) > 0, ∀x ∈ X \\ {0}","inline":true},{"style":{"fontStyle":"italic"},"text":". Furthermore, if ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-37.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is Lipschitz continuous for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-38.png","element":"img","alt":" ℓ","inline":true},{"style":{"fontStyle":"italic"},"text":", then ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-39.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is locally Lipschitz continuous.","element":"span"}],[{"text":"We provide a formal proof of ","element":"span"},{"href":"#id-30","text":"Theorem 2 ","element":"a"},{"text":"in ","element":"span"},{"text":"Appendix A ","element":"span"},{"text":"and briefly outline it here. As an inner product, ","element":"span"},{"style":{"height":16},"width":374.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-40.png","element":"img","alt":" vθ(x) = φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"text":"is already positive-definite for any neural network output ","element":"span"},{"style":{"height":16},"width":99.95,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-41.png","element":"img","alt":" φθ(x)","inline":true},{"text":", and thus is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"at least ","element":"span"},{"text":"nonnegative for any state ","element":"span"},{"style":{"height":11.6},"width":123.56,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-42.png","element":"img","alt":" x ∈ X","inline":true},{"text":". The step from nonnegativity to positive-definiteness of ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-43.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"now only depends on how the origin ","element":"span"},{"style":{"height":11.6},"width":110.71,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-44.png","element":"img","alt":" 0 ∈ X","inline":true,"padRight":true},{"text":"is mapped through ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-45.png","element":"img","alt":" φθ","inline":true},{"text":". If ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-46.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"maps ","element":"span"},{"style":{"height":11.6},"width":114.36,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-47.png","element":"img","alt":" 0 ∈ X","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"uniquely ","element":"span"},{"text":"to the zero output ","element":"span"},{"style":{"height":16},"width":184.01,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-48.png","element":"img","alt":" φθ(0) = 0","inline":true},{"text":", i.e., if ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-49.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"has a trivial nullspace, then ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-50.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is positive-definite. For this, it is sufficient that each layer of ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-51.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"has a trivial nullspace, i.e., that each layer “passes along” ","element":"span"},{"style":{"height":11.6},"width":104.64,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-52.png","element":"img","alt":" 0 ∈ X","inline":true,"padRight":true},{"text":"to its zero output ","element":"span"},{"style":{"height":16},"width":170.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-53.png","element":"img","alt":" yℓ(0) = 0","inline":true,"padRight":true},{"text":"until the final output ","element":"span"},{"style":{"height":16},"width":174.31,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-54.png","element":"img","alt":" φθ(0) = 0","inline":true},{"text":".","element":"span"}],[{"text":"In ","element":"span"},{"href":"#id-30","text":"Theorem 2","element":"a"},{"text":", each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-55.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"has a trivial nullspace as long as its weight matrix ","element":"span"},{"style":{"height":13.19},"width":62.01,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-56.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"text":"and activation function ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-57.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"text":"have trivial nullspaces. Consequently, this requires that ","element":"span"},{"style":{"height":13.2},"width":177.84,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-58.png","element":"img","alt":" dℓ ≥ dℓ−1","inline":true,"padRight":true},{"text":"for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-59.png","element":"img","alt":" ℓ","inline":true},{"text":", where ","element":"span"},{"style":{"height":13.19},"width":34.74,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-60.png","element":"img","alt":" dℓ","inline":true,"padRight":true},{"text":"is the output dimension of layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-61.png","element":"img","alt":" ℓ","inline":true},{"text":". That is, ","element":"span"},{"style":{"height":13.19},"width":62.01,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-62.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"text":"must not decrease the dimension of its ","element":"span"},{"id":"id-46","text":"input. To ensure that ","element":"span"},{"style":{"height":13.19},"width":62.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-63.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"text":"has a trivial nullspace, we structure it as","element":"span"}],[{"style":{"width":"64%"},"width":1018,"height":94,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-64.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.78},"width":273.2,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-65.png","element":"img","alt":" Gℓ1 ∈ Rqℓ×dℓ−1","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":18.98},"width":922.91,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-66.png","element":"img","alt":" qℓ ∈ N≥1, Gℓ2 ∈ R(dℓ−dℓ−1)×dℓ−1, Idℓ−1 ∈ Rdℓ−1×dℓ−1","inline":true,"padRight":true},{"text":"is the identity matrix, and ","element":"span"},{"style":{"height":14.39},"width":142.83,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-67.png","element":"img","alt":" ε ∈ R>0","inline":true,"padRight":true},{"text":"is a constant. The top partition ","element":"span"},{"style":{"height":14.88},"width":278.94,"height":37.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-68.png","element":"img","alt":" G⊤ℓ1Gℓ1 + εIdℓ−1","inline":true,"padRight":true},{"text":"is positive-definite ","element":"span"},{"text":"for ","element":"span"},{"style":{"height":11.6},"width":107.97,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-69.png","element":"img","alt":" ε > 0","inline":true},{"text":", thus ","element":"span"},{"style":{"height":13.19},"width":62.01,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-70.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"text":"always has full rank and a trivial nullspace. Otherwise, ","element":"span"},{"style":{"height":13.19},"width":62.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-71.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"text":"would have a non-empty nullspace of dimension ","element":"span"},{"style":{"height":16},"width":685.5,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-72.png","element":"img","alt":" dℓ−1 − min(dℓ, dℓ−1) = dℓ−1 − dℓ > 0","inline":true,"padRight":true},{"text":"by the rank-nullity theorem. With this choice of structure for ","element":"span"},{"style":{"height":13.19},"width":62.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-73.png","element":"img","alt":" Wℓ","inline":true},{"text":", the parameters of the neural network ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-74.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"are given by ","element":"span"},{"style":{"height":16.78},"width":312.52,"height":41.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-75.png","element":"img","alt":" θ := {Gℓ1, Gℓ2}ℓ","inline":true},{"text":". Finally, we choose activation functions that have trivial nullspaces and that are Lipschitz continuous in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":", such as ","element":"span"},{"style":{"height":16},"width":122.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-76.png","element":"img","alt":" tanh(·)","inline":true,"padRight":true},{"text":"and the leaky ReLU. We can then compute a Lipschitz constant for ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/3-77.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"[","element":"span"},{"href":"#id-20","referenceIndex":21,"text":"21","element":"a"},{"text":"].","element":"span"}],[{"id":"id-32","style":{"width":"89%"},"width":1420,"height":562,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-0.png","element":"img"}],[{"text":"Figure 2. ","element":"figcaption","subtype":"caption"},{"text":"Illustration of training the parameterized Lyapunov candidate ","element":"figcaption","subtype":"caption"},{"style":{"height":8.9},"width":33.91,"height":22.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-1.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to expand the safe level set ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":104.32,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-2.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"(blue ellipsoid) towards the true largest ROA ","element":"figcaption","subtype":"caption"},{"style":{"height":11.59},"width":39.32,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-3.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"(black). States in the gap ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":474.32,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-4.png","element":"img","alt":" G between Vθ(ck) and Vθ(αck)","inline":true,"padRight":true},{"text":"(orange ellipsoid) are simulated forward to determine regions (green) towards which we can expand the safe level set. This information is used in ","element":"figcaption","subtype":"caption"},{"href":"#id-31","text":"Algorithm 1 ","element":"a","subtype":"caption"},{"text":"to iteratively adapt safe level sets of ","element":"figcaption","subtype":"caption"},{"style":{"height":8.9},"width":33.9,"height":22.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-5.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to the shape of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.6},"width":51.55,"height":28.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-6.png","element":"img","alt":" Sπ.","inline":true}],[{"id":"id-29","style":{"fontWeight":"bold"},"text":"3.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Learning a Safe Set via Classification","element":"span"}],[{"text":"Previously, we constructed a neural network Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-7.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-30","text":"Theorem 2 ","element":"a"},{"text":"that satisfies the positive-definiteness and Lipschitz continuity requirements in ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":". As a result, we can always use the one-step decrease condition ","element":"span"},{"style":{"height":16},"width":622.22,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-8.png","element":"img","alt":" ∆vθ(x) := vθ(fπ(x)) − vθ(x) < 0","inline":true,"padRight":true},{"text":"as a provable safety certificate to identify safe level sets that are subsets of the largest safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-9.png","element":"img","alt":" Sπ","inline":true},{"text":". Now, we design a training algorithm to adapt the parameters ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-10.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"such that the resulting Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-11.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"satisfies ","element":"span"},{"style":{"height":16},"width":215.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-12.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"throughout as large of a decrease region ","element":"span"},{"style":{"height":14.88},"width":166.62,"height":37.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-13.png","element":"img","alt":" Dvθ ⊆ X","inline":true,"padRight":true},{"text":"as possible. This also makes ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-14.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"a valid Lyapunov function for the closed-loop dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-15.png","element":"img","alt":" fπ","inline":true},{"text":".","element":"span"}],[{"text":"For now, we assume the entire safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-16.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is known. We want to use a level set ","element":"span"},{"style":{"height":16},"width":93.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-17.png","element":"img","alt":" Vθ(c)","inline":true,"padRight":true},{"text":"of ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-18.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to certify the entire set ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-19.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"as safe. According to ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":", this requires the Lyapunov decrease ","element":"span"},{"id":"id-33","text":"condition ","element":"span"},{"style":{"height":16},"width":201.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-20.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"to be satisfied for each state ","element":"span"},{"style":{"height":13.19},"width":116.02,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-21.png","element":"img","alt":" x ∈ Sπ","inline":true},{"text":". We formally state this problem as","element":"span"}],[{"style":{"width":"77%"},"width":1226,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-22.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16},"width":100.13,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-23.png","element":"img","alt":" Vol(·)","inline":true,"padRight":true},{"text":"is some measure of set volume. Thus, we want to find the largest level set of ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-24.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"that is contained in the true largest ROA ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-25.png","element":"img","alt":" Sπ","inline":true},{"text":"; see ","element":"span"},{"href":"#id-32","text":"Fig. 2a","element":"a"},{"text":". We fix ","element":"span"},{"style":{"height":9.19},"width":108.62,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-26.png","element":"img","alt":" c = cS","inline":true,"padRight":true},{"text":"with some ","element":"span"},{"style":{"height":14.39},"width":159.23,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-27.png","element":"img","alt":" cS ∈ R>0","inline":true},{"text":", as it is always possible to rescale ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-28.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"by a constant, and focus on optimizing over ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-29.png","element":"img","alt":" θ","inline":true},{"text":". We can then interpret ","element":"span"},{"href":"#id-33","text":"(3) ","element":"a"},{"text":"as a classification problem. Consider ","element":"span"},{"href":"#id-21","text":"Fig. 1b","element":"a"},{"text":", where we assign the ground-truth label ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"= +1 ","element":"span"},{"text":"whenever a state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"is contained in ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-30.png","element":"img","alt":" Sπ","inline":true},{"text":", and ","element":"span"},{"style":{"height":14},"width":126.43,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-31.png","element":"img","alt":" y = −1","inline":true,"padRight":true},{"text":"otherwise. We use ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-32.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"together with ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"to classify states by their membership in the level set ","element":"span"},{"style":{"height":16},"width":100.03,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-33.png","element":"img","alt":" V(cS)","inline":true},{"text":". This is described by the decision rule","element":"span"}],[{"style":{"width":"64%"},"width":1021,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-34.png","element":"img"}],[{"id":"id-34","text":"That is, each state within the level set ","element":"span"},{"style":{"height":16},"width":100.03,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-35.png","element":"img","alt":" V(cS)","inline":true,"padRight":true},{"text":"obtains the label ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"= +1","element":"span"},{"text":". However, we must also satisfy ","element":"span"},{"id":"id-35","text":"the Lyapunov decrease condition imposed by ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":". This can be written as the constraint","element":"span"}],[{"style":{"width":"63%"},"width":1012,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-36.png","element":"img"}],[{"text":"which means that we can assign the label ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"= +1 ","element":"span"},{"text":"only if the decrease condition is also satisfied. The decision rule ","element":"span"},{"href":"#id-34","text":"(4) ","element":"a"},{"text":"together with the constraint ","element":"span"},{"href":"#id-35","text":"(5) ","element":"a"},{"text":"ensures that the resulting estimated safe set ","element":"span"},{"style":{"height":16},"width":100.03,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-37.png","element":"img","alt":" V(cS)","inline":true,"padRight":true},{"text":"satisfies all of the conditions in ","element":"span"},{"href":"#id-22","text":"Theorem 1","element":"a"},{"text":". We want to select the neural network parameters ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-38.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"so that this rule can perfectly classify ","element":"span"},{"style":{"height":13.19},"width":126.72,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-39.png","element":"img","alt":" x ∈ Sπ","inline":true,"padRight":true},{"text":"as “safe” with ","element":"span"},{"style":{"height":16},"width":210.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-40.png","element":"img","alt":" ˆyθ(x) = +1","inline":true,"padRight":true},{"text":"(i.e., ","element":"span"},{"style":{"height":16},"width":257.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-41.png","element":"img","alt":" cS − vθ(x) > 0","inline":true},{"text":") or ","element":"span"},{"style":{"height":16},"width":122.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-42.png","element":"img","alt":"x /∈ Sπ","inline":true,"padRight":true},{"text":"as “unsafe” with ","element":"span"},{"style":{"height":16},"width":206.22,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-43.png","element":"img","alt":" ˆyθ(x) = −1","inline":true,"padRight":true},{"text":"(i.e., ","element":"span"},{"style":{"height":16},"width":267.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-44.png","element":"img","alt":" cS − vθ(x) ≤ 0","inline":true},{"text":"). To this end, the decision boundary ","element":"span"},{"style":{"height":16},"width":186.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-45.png","element":"img","alt":"vθ(x) = cS","inline":true,"padRight":true},{"text":"must exactly delineate the boundary of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-46.png","element":"img","alt":" Sπ","inline":true},{"text":". Furthermore, the value of ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-47.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"must ensure ","element":"span"},{"href":"#id-35","text":"(5) ","element":"a"},{"text":"holds, such that ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-48.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"satisfies the decrease condition of ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"on ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-49.png","element":"img","alt":" Sπ","inline":true},{"text":".","element":"span"}],[{"text":"Since we have rewritten the optimization problem in ","element":"span"},{"href":"#id-33","text":"(3) ","element":"a"},{"text":"as a classification problem, we can use ideas from the corresponding literature [","element":"span"},{"href":"#id-36","referenceIndex":26,"text":"26","element":"a"},{"text":"]. In particular, we define a loss function ","element":"span"},{"style":{"height":16},"width":152.34,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-50.png","element":"img","alt":" ℓ(y, x; θ)","inline":true,"padRight":true},{"text":"that penalizes misclassification of the true label ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"at a state ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x ","element":"span"},{"text":"under the decision rule ","element":"span"},{"href":"#id-34","text":"(4) ","element":"a"},{"text":"associated with ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/4-51.png","element":"img","alt":" θ","inline":true},{"text":". Many common choices for the loss function are possible; for simplicity, we use the perceptron loss, which penalizes misclassifications more when they occur far from the decision boundary.","element":"span"}],[{"id":"id-31","style":{"width":"100%"},"width":1584,"height":572,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-0.png","element":"img"}],[{"text":"We choose not to use the “maximum margin” objective of the hinge loss, since it may be unsuitable for us to accurately delineate ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-1.png","element":"img","alt":" Sπ","inline":true},{"text":", where states can lie arbitrarily close to the decision boundary in the continuous state space ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". Since we use the level set ","element":"span"},{"style":{"height":16},"width":117.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-2.png","element":"img","alt":" Vθ(cS)","inline":true,"padRight":true},{"text":"in our classification setting, this corresponds to ","element":"span"},{"style":{"height":19.2},"width":667.43,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-3.png","element":"img","alt":" ℓ(y, x; θ) = max�0, −y ·�cS − vθ(x)��","inline":true},{"text":". Here, ","element":"span"},{"style":{"height":16},"width":187.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-4.png","element":"img","alt":" cS − vθ(x)","inline":true,"padRight":true},{"text":"is the signed distance from the decision boundary ","element":"span"},{"style":{"height":16},"width":192.18,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-5.png","element":"img","alt":" vθ(x) = cS","inline":true},{"text":", which separates the safe set ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-6.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"from the rest of the state space ","element":"span"},{"style":{"height":16},"width":110.85,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-7.png","element":"img","alt":" X \\ Sπ","inline":true},{"text":". This ","element":"span"},{"style":{"fontStyle":"italic"},"text":"classifier loss ","element":"span"},{"text":"has a magnitude of","element":"span"},{"style":{"height":19.96},"width":210.84,"height":49.91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-8.png","element":"img","alt":"��cS − vθ(x)��","inline":true,"padRight":true},{"text":"in the case of a misclassification, and zero otherwise. This ensures that decisions far from the decision boundary, such as those near the origin, are considered more important than the more difficult decisions close to the boundary.","element":"span"}],[{"text":"Ideally, we would like to minimize this loss throughout the state space with ","element":"span"},{"style":{"height":18.51},"width":324.68,"height":46.27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-9.png","element":"img","alt":" min�X l(y, x; θ) dx","inline":true,"padRight":true},{"text":"subject to the constraint ","element":"span"},{"href":"#id-35","text":"(5)","element":"a"},{"text":". Since this problem is intractable, we use gradient-based optimization together with mini-batches instead, as is typically done in machine learning. To this end, we sample states ","element":"span"},{"style":{"height":16},"width":199.37,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-10.png","element":"img","alt":" Xb = {xi}i","inline":true,"padRight":true},{"text":"from the state space ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"at random and assign the ground-truth labels ","element":"span"},{"style":{"height":16},"width":83.66,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-11.png","element":"img","alt":" {yi}i","inline":true,"padRight":true},{"text":"to them. Based on this finite set, the optimization objective can be written as","element":"span"}],[{"style":{"width":"76%"},"width":1212,"height":91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-12.png","element":"img"}],[{"id":"id-37","text":"where the batch ","element":"span"},{"style":{"height":13.19},"width":42.42,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-13.png","element":"img","alt":" Xb","inline":true,"padRight":true},{"text":"is re-sampled after every gradient step. We can apply a Lagrangian relaxation","element":"span"}],[{"style":{"width":"76%"},"width":1210,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-14.png","element":"img"}],[{"text":"in order to make the problem tractable. Here, ","element":"span"},{"style":{"height":14.39},"width":155.48,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-15.png","element":"img","alt":" λ ∈ R>0","inline":true,"padRight":true},{"text":"is a Lagrangian multiplier and the term ","element":"span"},{"style":{"height":19.2},"width":509.87,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-16.png","element":"img","alt":"λ((y + 1)/2) max�0, ∆vθ(x)�","inline":true},{"text":"is the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Lyapunov decrease loss","element":"span"},{"text":", which penalizes violations of ","element":"span"},{"href":"#id-35","text":"(5)","element":"a"},{"text":". The decrease condition ","element":"span"},{"style":{"height":16},"width":213.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-17.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"only needs to be enforced within the safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-18.png","element":"img","alt":" Sπ","inline":true},{"text":", so we do not want to incur a loss if it is violated at a state where ","element":"span"},{"style":{"height":14},"width":135.58,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-19.png","element":"img","alt":" y = −1","inline":true},{"text":". Thus, we use the multiplier ","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"+1)","element":"span"},{"style":{"fontStyle":"italic"},"text":"/","element":"span"},{"text":"2 ","element":"span"},{"text":"to map ","element":"span"},{"style":{"height":16},"width":159.48,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-20.png","element":"img","alt":" {+1, −1}","inline":true,"padRight":true},{"text":"to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"0","element":"span"},{"style":{"fontStyle":"italic"},"text":"}","element":"span"},{"text":", such that the Lyapunov decrease loss is zeroed-out if ","element":"span"},{"style":{"height":14},"width":125.1,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-21.png","element":"img","alt":" y = −1","inline":true},{"text":".","element":"span"}],[{"text":"However, there are two issues when this formulation is compared to the exact problem in ","element":"span"},{"href":"#id-33","text":"(3)","element":"a"},{"text":". Firstly, the objective ","element":"span"},{"href":"#id-37","text":"(7) ","element":"a"},{"text":"only penalizes violations of the decrease condition ","element":"span"},{"style":{"height":16},"width":206.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-22.png","element":"img","alt":" ∆vθ(x) < 0","inline":true},{"text":", rather than constraining ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-23.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"to enforce it. Thus, while ","element":"span"},{"style":{"height":16},"width":206.23,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-24.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"always ","element":"span"},{"text":"a provable safety certificate, we must ","element":"span"},{"style":{"fontStyle":"italic"},"text":"verify ","element":"span"},{"text":"that it holds over some level set whenever we update ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-25.png","element":"img","alt":" θ","inline":true},{"text":". Secondly, ground-truth labels of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-26.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"are not known in practice. To address these issues, we can use any method to check Lyapunov safety certificates over continuous state spaces to certify a level set ","element":"span"},{"style":{"height":16},"width":93.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-27.png","element":"img","alt":" Vθ(c)","inline":true,"padRight":true},{"text":"as safe, and then use ","element":"span"},{"style":{"height":16},"width":93.69,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-28.png","element":"img","alt":" Vθ(c)","inline":true,"padRight":true},{"text":"to estimate labels ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"from ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-29.png","element":"img","alt":" Sπ","inline":true},{"text":". For this work, we check the tightened certificate ","element":"span"},{"style":{"height":16.08},"width":322.55,"height":40.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-30.png","element":"img","alt":" ∆vθ(x) < −L∆vθτ","inline":true,"padRight":true},{"text":"on a discretization of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":", as described in ","element":"span"},{"href":"#id-25","text":"Sec. 2.1","element":"a"},{"text":". This method exposes the Lipschitz constant ","element":"span"},{"style":{"height":14.88},"width":84.24,"height":37.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-31.png","element":"img","alt":" L∆vθ","inline":true,"padRight":true},{"text":"of ","element":"span"},{"style":{"height":13.99},"width":70.53,"height":34.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-32.png","element":"img","alt":" ∆vθ","inline":true},{"text":", which can conveniently be used for regularization in practice [","element":"span"},{"href":"#id-20","referenceIndex":21,"text":"21","element":"a"},{"text":"]. Possible alternatives to this safety verification method include the use of an adaptive discretization for better scaling to higher-dimensional state spaces [","element":"span"},{"href":"#id-10","referenceIndex":11,"text":"11","element":"a"},{"text":"], and formal verification methods for neural networks [","element":"span"},{"href":"#id-38","referenceIndex":27,"text":"27","element":"a"},{"text":", ","element":"span"},{"href":"#id-39","referenceIndex":28,"text":"28","element":"a"},{"text":"].","element":"span"}],[{"text":"Since such an estimate of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-33.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is limited by the largest safe level set of ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-34.png","element":"img","alt":" vθ","inline":true},{"text":", we propose ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"to iteratively “grow” an estimate of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-35.png","element":"img","alt":" Sπ","inline":true},{"text":". We initialize ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-36.png","element":"img","alt":" vθ","inline":true},{"text":", then use it to identify the largest safe level set ","element":"span"},{"style":{"height":16},"width":111.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-37.png","element":"img","alt":" Vθ(c0)","inline":true,"padRight":true},{"text":"by verifying the condition ","element":"span"},{"style":{"height":16},"width":216.97,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-38.png","element":"img","alt":" ∆vθ(x) < 0","inline":true},{"text":". At first, we use ","element":"span"},{"style":{"height":16},"width":111.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-39.png","element":"img","alt":" Vθ(c0)","inline":true,"padRight":true},{"text":"to estimate ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-40.png","element":"img","alt":" Sπ","inline":true},{"text":". At iteration ","element":"span"},{"style":{"height":15.59},"width":151.89,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-41.png","element":"img","alt":" k ∈ N≥0","inline":true},{"text":", we consider the safe level set ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-42.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"and the expanded level set ","element":"span"},{"style":{"height":16},"width":138.93,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-43.png","element":"img","alt":" Vθ(αck)","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":14.39},"width":144.03,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-44.png","element":"img","alt":" α ∈ R>1","inline":true},{"text":"; see ","element":"span"},{"href":"#id-32","text":"Fig. 2b","element":"a"},{"text":". Then, states in the “gap” ","element":"span"},{"style":{"height":16},"width":370.94,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/5-45.png","element":"img","alt":" G := Vθ(αck)\\Vθ(ck)","inline":true,"padRight":true},{"text":"are forward-simulated","element":"span"}],[{"id":"id-40","style":{"width":"96%"},"width":1524,"height":692,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-0.png","element":"img"}],[{"text":"Figure 3. ","element":"figcaption","subtype":"caption"},{"text":"Results for training the neural network (NN) Lyapunov candidate ","element":"figcaption","subtype":"caption"},{"style":{"height":8.9},"width":33.9,"height":22.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-1.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"for an inverted pendulum. In ","element":"figcaption","subtype":"caption"},{"href":"#id-40","text":"Fig. 3a","element":"a","subtype":"caption"},{"text":", system trajectories (black) converge to the origin only within the largest safe region ","element":"figcaption","subtype":"caption"},{"style":{"height":11.59},"width":39.32,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-2.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"(green). The NN candidate (orange) characterizes ","element":"figcaption","subtype":"caption"},{"style":{"height":11.6},"width":39.32,"height":28.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-3.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"with a level set better than both the LQR (blue ellipsoid) and SOS (yellow) candidates, as it adapts to the shape of ","element":"figcaption","subtype":"caption"},{"href":"#id-40","style":{"height":13.2},"width":216.35,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-4.png","element":"img","alt":" Sπ. In Fig. 3b","inline":true},{"text":", the safe level ","element":"figcaption","subtype":"caption"},{"style":{"height":11.7},"width":121.24,"height":29.25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-5.png","element":"img","alt":" ck of vθ","inline":true,"padRight":true},{"text":"converges non-monotonically towards the fixed boundary ","element":"figcaption","subtype":"caption"},{"style":{"height":11.59},"width":115.35,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-6.png","element":"img","alt":" cS = 1","inline":true},{"text":", and the safe level set ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":104.32,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-7.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"grows to cover most of ","element":"figcaption","subtype":"caption"},{"style":{"height":11.59},"width":39.32,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-8.png","element":"img","alt":" Sπ","inline":true},{"text":". However, as discussed at the end of ","element":"figcaption","subtype":"caption"},{"text":"Sec. 3","element":"span","subtype":"caption"},{"text":", convergence of ","element":"figcaption","subtype":"caption"},{"style":{"height":14.4},"width":189.81,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-9.png","element":"img","alt":" Vθ(ck) to Sπ","inline":true,"padRight":true},{"text":"is not guaranteed in general by ","element":"figcaption","subtype":"caption"},{"href":"#id-31","text":"Algorithm 1","element":"a","subtype":"caption"},{"text":".","element":"figcaption","subtype":"caption"}],[{"text":"with the dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-10.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":15.59},"width":156.47,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-11.png","element":"img","alt":" T ∈ N≥1","inline":true,"padRight":true},{"text":"time steps. States that fall in ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-12.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"before or after forward-simulation form a new estimate of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-13.png","element":"img","alt":" Sπ","inline":true},{"text":", since trajectories become “trapped” in ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-14.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"and converge to the origin. We use this estimate of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-15.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"to identify labels ","element":"span"},{"style":{"fontStyle":"italic"},"text":"y ","element":"span"},{"text":"for classification, then apply SGD with the objective ","element":"span"},{"href":"#id-37","text":"(7) ","element":"a"},{"text":"to update ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-16.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"and encourage ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-17.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"to grow. Finally, we certify the new largest safe level set ","element":"span"},{"style":{"height":16},"width":153.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-18.png","element":"img","alt":" Vθ(ck+1)","inline":true},{"text":". These steps are repeated until a choice of stopping criterion is satisfied.","element":"span"}],[{"text":"In general, ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"does not guarantee convergence of the safe level set ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-19.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"to ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-20.png","element":"img","alt":" Sπ","inline":true},{"text":", nor that ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-21.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"monotonically grows in volume. Furthermore, it is not guaranteed that the iterated safe level ","element":"span"},{"style":{"height":14.39},"width":173.41,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-22.png","element":"img","alt":" ck ∈ R>0","inline":true,"padRight":true},{"text":"approaches the safe level ","element":"span"},{"style":{"height":9.19},"width":38.25,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-23.png","element":"img","alt":" cS","inline":true,"padRight":true},{"text":"that is prescribed to delineate ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-24.png","element":"img","alt":" Sπ","inline":true},{"text":". This is typical of gradient-based parameter training, since the parameters ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-25.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"can become “stuck” in local optima. However, since the Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-26.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is guaranteed to satisfy the positive-definiteness and Lipschitz continuity conditions of ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"by its construction in ","element":"span"},{"href":"#id-30","text":"Theorem 2","element":"a"},{"text":", ","element":"span"},{"style":{"height":16},"width":217.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-27.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"always ","element":"span"},{"text":"a provable safety certificate for identifying safe level sets. Thus, we can ","element":"span"},{"style":{"fontStyle":"italic"},"text":"always ","element":"span"},{"text":"use ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-28.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to identify at least a subset of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-29.png","element":"img","alt":" Sπ","inline":true},{"text":", without ever identifying unsafe states as safe.","element":"span"}]]},{"heading":"4 Experiments and Discussion","paragraphs":[[{"text":"In the previous section, we developed ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"to train the parameters ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-30.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"of a neural network Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-31.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"constructed according to ","element":"span"},{"href":"#id-30","text":"Theorem 2","element":"a"},{"text":". This construction ensures the positive-definiteness and Lipschitz continuity assumptions on ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-32.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-22","text":"Theorem 1 ","element":"a"},{"text":"are satisfied. ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"encourages ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-33.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to satisfy the decrease condition and match the true largest ROA ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-34.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"for the closed-loop dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-35.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"with a level set ","element":"span"},{"style":{"height":16},"width":117.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-36.png","element":"img","alt":" Vθ(cS)","inline":true},{"text":". In this section, we present details for the implementation of ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"to learn the largest safe region of a simulated inverted pendulum system, and experimental results in a comparison to other methods of computing Lyapunov functions.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Inverted Pendulum Benchmark ","element":"span"},{"text":"The inverted pendulum is governed by the differential equation ","element":"span"},{"style":{"height":18.21},"width":470.58,"height":45.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-37.png","element":"img","alt":"mℓ2¨θ = mgℓ sin θ − β ˙θ + u","inline":true,"padRight":true},{"text":"with state ","element":"span"},{"style":{"height":19.01},"width":179.7,"height":47.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-38.png","element":"img","alt":" x := (θ, ˙θ)","inline":true},{"text":", where ","element":"span"},{"style":{"height":10.8},"width":19,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-39.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"is the angle from the upright equilibrium ","element":"span"},{"style":{"height":12.8},"width":184.8,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-40.png","element":"img","alt":" xO = 0, u","inline":true,"padRight":true},{"text":"is the input torque, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"is the pendulum mass, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"g ","element":"span"},{"text":"is the gravitational acceleration, ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-41.png","element":"img","alt":"ℓ","inline":true,"padRight":true},{"text":"is the pole length, and ","element":"span"},{"style":{"height":14.4},"width":23,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-42.png","element":"img","alt":" β","inline":true,"padRight":true},{"text":"is the friction coefficient. We discretize the dynamics with a time step of ","element":"span"},{"style":{"height":11.6},"width":201.69,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-43.png","element":"img","alt":" ∆t = 0.01 s","inline":true,"padRight":true},{"text":"and enforce a saturation constraint ","element":"span"},{"style":{"height":16},"width":195.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-44.png","element":"img","alt":" u ∈ [−¯u, ¯u]","inline":true},{"text":", such that the pendulum falls over past a certain angle and cannot recover. For a linear policy ","element":"span"},{"style":{"height":16},"width":286.15,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-45.png","element":"img","alt":" u = π(x) = Kx","inline":true},{"text":", this yields the safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-46.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-40","text":"Fig. 3 ","element":"a"},{"text":"around the upright equilibrium for the closed-loop dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-47.png","element":"img","alt":" fπ","inline":true},{"text":". In particular, we fix ","element":"span"},{"style":{"fontWeight":"bold"},"text":"K ","element":"span"},{"text":"to the linear quadratic regulator (LQR) solution for the discretized, linearized, unconstrained form of the dynamics [","element":"span"},{"href":"#id-41","referenceIndex":29,"text":"29","element":"a"},{"text":"]. Outside of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-48.png","element":"img","alt":" Sπ","inline":true},{"text":", the pendulum falls down without the ability to recover and the system trajectories diverge away from ","element":"span"},{"style":{"height":12.79},"width":126.78,"height":31.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/6-49.png","element":"img","alt":" xO = 0","inline":true},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Practical Considerations ","element":"span"},{"text":"To train the parameters of the Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-0.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"to adapt to the shape of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-1.png","element":"img","alt":" Sπ","inline":true},{"text":", we use ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"with SGD. To certify the safety of continuous level sets of ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-2.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"whenever ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-3.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"is updated, we check the stricter decrease condition ","element":"span"},{"style":{"height":16.08},"width":329.81,"height":40.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-4.png","element":"img","alt":" ∆vθ(x) < −L∆vθτ","inline":true,"padRight":true},{"text":"at a discrete set of points that cover ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"in increasing order of the value of ","element":"span"},{"style":{"height":16},"width":95.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-5.png","element":"img","alt":" vθ(x)","inline":true},{"text":", as in [","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"]. ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"does not guarantee that the safe level set estimate ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-6.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"grows monotonically in volume towards ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-7.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"with each iteration ","element":"span"},{"style":{"fontStyle":"italic"},"text":"k","element":"span"},{"text":". In fact, the estimate ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-8.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"may shrink if ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-9.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"initially succeeds and then fails to satisfy the decrease condition ","element":"span"},{"style":{"height":16},"width":217.14,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-10.png","element":"img","alt":" ∆vθ(x) < 0","inline":true,"padRight":true},{"text":"in some regions of the state space. This tends to occur near the origin, where ","element":"span"},{"style":{"height":16},"width":346.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-11.png","element":"img","alt":" vθ(0) = ∆vθ(0) = 0","inline":true,"padRight":true},{"text":"and the “basin of attraction” characterized by ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-12.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"“flattens”. To alleviate this, we use a large Lagrange multiplier ","element":"span"},{"style":{"height":10.8},"width":159.46,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-13.png","element":"img","alt":" λ = 1000","inline":true,"padRight":true},{"text":"in the SGD objective ","element":"span"},{"href":"#id-37","text":"(7) ","element":"a"},{"text":"to strongly “push” ","element":"span"},{"style":{"height":10.8},"width":22,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-14.png","element":"img","alt":" θ","inline":true,"padRight":true},{"text":"towards values that ensure ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-15.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"continues to satisfy the decrease condition. In addition, we normalize the Lyapunov decrease loss ","element":"span"},{"style":{"height":19.2},"width":502.02,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-16.png","element":"img","alt":" λ((y + 1)/2) max�0, ∆vθ(x)�","inline":true},{"text":"in ","element":"span"},{"href":"#id-37","text":"(7) ","element":"a"},{"text":"by ","element":"span"},{"style":{"height":16},"width":95.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-17.png","element":"img","alt":" vθ(x)","inline":true},{"text":". This more heavily weighs sampled states near the origin, i.e., where ","element":"span"},{"style":{"height":16},"width":95.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-18.png","element":"img","alt":" vθ(x)","inline":true,"padRight":true},{"text":"is small.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Results ","element":"span"},{"text":"We implement ","element":"span"},{"href":"#id-31","text":"Algorithm 1 ","element":"a"},{"text":"on the inverted pendulum benchmark with the Python code available at ","element":"span"},{"href":"https://github.com/befelix/safe_learning","style":{"fontFamily":"monospace"},"text":"https://github.com/befelix/safe_learning","element":"a"},{"text":", which is based on TensorFlow [","element":"span"},{"href":"#id-42","referenceIndex":30,"text":"30","element":"a"},{"text":"]. For the neural network Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-19.png","element":"img","alt":" vθ","inline":true},{"text":", we use three layers of 64 ","element":"span"},{"style":{"height":16},"width":122.27,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-20.png","element":"img","alt":" tanh(·)","inline":true,"padRight":true},{"text":"activation units each. We prescribe ","element":"span"},{"style":{"height":16},"width":117.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-21.png","element":"img","alt":" Vθ(cS)","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":13.19},"width":113.98,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-22.png","element":"img","alt":" cS = 1","inline":true,"padRight":true},{"text":"as the level set that delineates the safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-23.png","element":"img","alt":" Sπ","inline":true},{"text":". ","element":"span"},{"href":"#id-40","text":"Fig. 3 ","element":"a"},{"text":"shows the results of training ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-24.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"with ","element":"span"},{"href":"#id-31","text":"Algorithm 1","element":"a"},{"text":", and the largest safe level set ","element":"span"},{"style":{"height":16},"width":127.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-25.png","element":"img","alt":" Vθ(c18)","inline":true,"padRight":true},{"text":"with ","element":"span"},{"text":"10 ","element":"span"},{"text":"SGD iterations per update. ","element":"span"},{"href":"#id-40","text":"Fig. 3a ","element":"a"},{"text":"visualizes how this level set has “moulded” to the shape of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-26.png","element":"img","alt":" Sπ","inline":true},{"text":". ","element":"span"},{"href":"#id-40","text":"Fig. 3b ","element":"a"},{"text":"shows how the safe level ","element":"span"},{"style":{"height":9.19},"width":34.24,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-27.png","element":"img","alt":" ck","inline":true,"padRight":true},{"text":"converges towards the prescribed level ","element":"span"},{"style":{"height":13.19},"width":118.86,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-28.png","element":"img","alt":" cS = 1","inline":true,"padRight":true},{"text":"that delineates ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-29.png","element":"img","alt":" Sπ","inline":true},{"text":", and how the fraction of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-30.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"covered by ","element":"span"},{"style":{"height":16},"width":113.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-31.png","element":"img","alt":" Vθ(ck)","inline":true,"padRight":true},{"text":"approaches ","element":"span"},{"text":"1","element":"span"},{"text":". The true largest ROA ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-32.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"is estimated by forward-simulating all of the states in a state space discretization, and set volume is estimated by counting discrete states. ","element":"span"},{"href":"#id-40","text":"Fig. 3a ","element":"a"},{"text":"also shows the largest safe sets for a LQR Lyapunov candidate and a SOS Lyapunov candidate. The LQR candidate ","element":"span"},{"style":{"height":16.79},"width":299.41,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-33.png","element":"img","alt":" vLQR(x) = x⊤Px","inline":true,"padRight":true},{"text":"is computed in closed-form for the same discretized, linearized, unconstrained form of the dynamics used to determine the LQR policy ","element":"span"},{"style":{"height":16},"width":200.1,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-34.png","element":"img","alt":" π(x) = Kx","inline":true,"padRight":true},{"text":"[","element":"span"},{"href":"#id-41","referenceIndex":29,"text":"29","element":"a"},{"text":"]. The SOS Lyapunov candidate ","element":"span"},{"style":{"height":16},"width":427.1,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-35.png","element":"img","alt":" vSOS(x) = m(x)⊤Qm(x)","inline":true,"padRight":true},{"text":"uses up to third-order monomials in ","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":", thus it is a sixth-order polynomial. It is computed with the toolbox SOSTOOLS [","element":"span"},{"href":"#id-43","referenceIndex":31,"text":"31","element":"a"},{"text":"] and the SDP solver SeDuMi [","element":"span"},{"href":"#id-44","referenceIndex":32,"text":"32","element":"a"},{"text":"] in MATLAB for the unconstrained nonlinear dynamics with a Taylor polynomial expansion of ","element":"span"},{"style":{"height":10.8},"width":74.57,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-36.png","element":"img","alt":" sin θ","inline":true},{"text":". While the SOS approach is a powerful specialized method for polynomial dynamical systems, it cannot account for the non-differentiable nonlinearity introduced by the input saturation, which drastically alters the closed-loop dynamics. As a result, while ","element":"span"},{"style":{"height":9.19},"width":79.38,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-37.png","element":"img","alt":" vSOS","inline":true,"padRight":true},{"text":"is optimized for the system without saturation, it is ill-suited to the true closed-loop dynamics and yields a small safe level set. Overall, our neural network Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-38.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"performs the best at certification of as much of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-39.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"as possible, since it only relies on inputs and outputs of ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-40.png","element":"img","alt":" fπ","inline":true},{"text":", and adapts to the shape of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-41.png","element":"img","alt":" Sπ","inline":true},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Comments on Safe Learning ","element":"span"},{"href":"#id-40","text":"Fig. 3a ","element":"a"},{"text":"demonstrates that a neural network Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-42.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"can certify more of the true largest safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-43.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"than other common Lyapunov candidates. This has important implications for safe exploration during learning for dynamical systems; with more safe states available to visit, an agent can better learn about itself and its environment under a wider range of operating conditions. For example, our method is applicable in the safe reinforcement learning framework of [","element":"span"},{"href":"#id-16","referenceIndex":17,"text":"17","element":"a"},{"text":"]. This past work provides safe exploration guarantees for a GP model of the dynamics ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-44.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"with confidence bounds on the Lyapunov stability certificate, but these guarantees are limited by the choice of Lyapunov function. As our results have shown, certain Lyapunov candidates may poorly characterize the shape of the true largest safe region ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-45.png","element":"img","alt":" Sπ","inline":true},{"text":". Since our neural network Lyapunov candidate can adapt to the shape of ","element":"span"},{"style":{"height":13.19},"width":43.13,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-46.png","element":"img","alt":" Sπ","inline":true,"padRight":true},{"text":"during learning by using, for example, the mean estimate of ","element":"span"},{"style":{"height":14},"width":38.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/7-47.png","element":"img","alt":" fπ","inline":true,"padRight":true},{"text":"from the GP model, we could enlarge the estimated safe region more quickly as data is collected. Our method is also applicable to exploration algorithms within safe motion planning that depends on knowledge of a safe region, such as in [","element":"span"},{"href":"#id-45","referenceIndex":33,"text":"33","element":"a"},{"text":"]. Overall, our method strongly warrants consideration for use in safe learning methods that leverage statistical models of dynamical systems.","element":"span"}]]},{"heading":"5 Conclusion","paragraphs":[[{"text":"We have demonstrated a novel method for learning safety certificates for general nonlinear dynamical systems. Specifically, we developed a flexible class of parameterized Lyapunov candidate functions and a training algorithm to adapt them to the shape of the largest safe region for a closed-loop dynamical system. We believe that our method is appealing due to its applicability to a wide range of dynamical systems in theory and practice. Furthermore, it can play an important role in improving safe exploration during learning for real autonomous systems in uncertain environments.","element":"span"}]]},{"heading":"Acknowledgments","paragraphs":[[{"text":"This research was supported in part by SNSF grant 200020 159557, the Vector Institute, and a fellowship by the Open Philanthropy Project.","element":"span"}]]},{"heading":"References","paragraphs":[[{"id":"id-0","text":"[1] D. Amodei, C. Olah, J. Steinhardt, P. Christiano, J. Schulman, and D. Man´e. Concrete prob- ","element":"span"},{"text":"lems in AI safety. Technical report, 2016. arXiv:1606.06565v2 [cs.AI].","element":"span"}],[{"id":"id-1","text":"[2] H. K. Khalil. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Nonlinear Systems","element":"span"},{"text":". Prentice Hall, Upper Saddle River, NJ, 3 edition, 2002.","element":"span"}],[{"id":"id-2","text":"[3] A. Vannelli and M. Vidyasagar. Maximal Lyapunov functions and domains of attraction for ","element":"span"},{"text":"autonomous nonlinear systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Automatica","element":"span"},{"text":", 21(1):69–80, 1985.","element":"span"}],[{"id":"id-3","text":"[4] D. J. Hill and I. M. Y. Mareels. Stability theory for differential/algebraic systems with ap- ","element":"span"},{"text":"plication to power systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Circuits and Systems","element":"span"},{"text":", 37(11):1416–1423, 1990.","element":"span"}],[{"id":"id-4","text":"[5] J. M. G. da Silva Jr. and S. Tarbouriech. Antiwindup design with guaranteed regions of sta- ","element":"span"},{"text":"bility: An LMI-based approach. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 50(1):106–111, 2005.","element":"span"}],[{"id":"id-5","text":"[6] R. Kalman and J. Bertram. Control system analysis and design via the “second method” of ","element":"span"},{"text":"Lyapunov II: Discrete-time systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Transactions of the American Society of Mechanical Engineers (ASME): Journal of Basic Engineering","element":"span"},{"text":", 82(2):394–400, 1960.","element":"span"}],[{"id":"id-6","text":"[7] P. Giesl and S. Hafstein. Review on computational methods for Lyapunov functions. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Discrete and Continuous Dynamical Systems, Series B","element":"span"},{"text":", 20(8):2291–2331, 2016.","element":"span"}],[{"id":"id-7","text":"[8] S. Boyd and L. Vandenberghe. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Convex Optimization","element":"span"},{"text":". Cambridge University Press, Cambridge, UK, 2009.","element":"span"}],[{"id":"id-8","text":"[9] P. A. Parrilo. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Structured semidefinite programs and semialgebraic geometry methods in robustness and optimization","element":"span"},{"text":". PhD thesis, California Institute of Technology, 2000.","element":"span"}],[{"id":"id-9","text":"[10] D. Henrion and M. Korda. Convex computation of the region of attraction of polynomial ","element":"span"},{"text":"control systems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 59(2):297–312, 2014.","element":"span"}],[{"id":"id-10","text":"[11] R. Bobiti and M. Lazar. A sampling approach to finding Lyapunov functions for nonlinear ","element":"span"},{"text":"discrete-time systems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the European Control Conference (ECC)","element":"span"},{"text":", pages 561–566, 2016.","element":"span"}],[{"id":"id-11","text":"[12] K. Zhou and J. C. Doyle. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Essentials of Robust Control","element":"span"},{"text":". Prentice Hall, Upper Saddle River, NJ, 1998.","element":"span"}],[{"id":"id-12","text":"[13] A. Trofino. Robust stability and domain of attraction of uncertain nonlinear systems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the American Control Conference (ACC)","element":"span"},{"text":", pages 3707–3711, 2000.","element":"span"}],[{"id":"id-13","text":"[14] U. Topcu, A. K. Packard, P. Seiler, and G. J. Balas. Robust region-of-attraction estimation. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 55(1):137–142, 2010.","element":"span"}],[{"id":"id-14","text":"[15] C. E. Rasmussen and C. K. I. Williams. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Gaussian Processes for Machine Learning","element":"span"},{"text":". MIT Press, Cambridge, MA, 2006.","element":"span"}],[{"id":"id-15","text":"[16] F. Berkenkamp, R. Moriconi, A. P. Schoellig, and A. Krause. Safe learning of regions of ","element":"span"},{"text":"attraction for uncertain, nonlinear systems with Gaussian processes. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the IEEE Conference on Decision and Control (CDC)","element":"span"},{"text":", pages 4661–4666, 2016.","element":"span"}],[{"id":"id-16","text":"[17] F. Berkenkamp, M. Turchetta, A. P. Schoellig, and A. Krause. Safe model-based reinforce- ","element":"span"},{"text":"ment learning with stability guarantees. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the Conference on Neural Information Processing Systems (NIPS)","element":"span"},{"text":", pages 908–918, 2017.","element":"span"}],[{"id":"id-17","text":"[18] R. S. Sutton and A. G. Barto. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Reinforcement Learning","element":"span"},{"text":". MIT Press, Cambridge, MA, 2 edition, 2018. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(draft)","element":"span"},{"text":".","element":"span"}],[{"id":"id-18","text":"[19] V. Petridis and S. Petridis. Construction of neural network based Lyapunov functions. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the IEEE International Joint Conference on Neural Network Proceedings","element":"span"},{"text":", pages 5059–5065, 2006.","element":"span"}],[{"id":"id-19","text":"[20] N. Noroozi, P. Karimaghaee, F. Safaei, and H. Javadi. Generation of Lyapunov functions by ","element":"span"},{"text":"neural networks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the World Congress on Engineering (WCE)","element":"span"},{"text":", volume 1, pages 61–65, 2008.","element":"span"}],[{"id":"id-20","text":"[21] C. Szegedy, W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. ","element":"span"},{"text":"Intriguing properties of neural networks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the International Conference on Learning Representations (ICLR)","element":"span"},{"text":", 2014.","element":"span"}],[{"id":"id-23","text":"[22] A. Papachristodoulou and S. Prajna. On the construction of Lyapunov functions using the sum ","element":"span"},{"text":"of squares decomposition. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the IEEE Conference on Decision and Control (CDC)","element":"span"},{"text":", pages 3482–3487, 2002.","element":"span"}],[{"id":"id-24","text":"[23] A. Papachristodoulou. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Scalable analysis of nonlinear systems using convex optimization","element":"span"},{"text":". PhD thesis, California Institute of Technology, 2005.","element":"span"}],[{"id":"id-27","text":"[24] G. Cybenko. Approximation by superpositions of a sigmoidal function. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mathematics of Control, Signals, and Systems","element":"span"},{"text":", 2(4):303–314, 1989.","element":"span"}],[{"id":"id-28","text":"[25] K. Hornik. Some new results on neural network approximation. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Neural Networks","element":"span"},{"text":", 6(8):1069– 1072, 2001.","element":"span"}],[{"id":"id-36","text":"[26] C. M. Bishop. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Pattern Recognition and Machine Learning","element":"span"},{"text":". Springer-Verlag, New York, NY, 2006.","element":"span"}],[{"id":"id-38","text":"[27] X. Huang, M. Kwiatkowska, S. Wang, and M. Wu. Safety verification of deep neural networks. ","element":"span"},{"text":"Technical report, 2017. arXiv:1610.06940v3 [cs.AI].","element":"span"}],[{"id":"id-39","text":"[28] G. Katz, C. Barrett, D. Dill, K. Julian, and M. Kochenderfer. Reluplex: An efficient SMT solver ","element":"span"},{"text":"for verifying deep neural networks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the International Conference on Computer Aided Verification (CAV)","element":"span"},{"text":", 2017.","element":"span"}],[{"id":"id-41","text":"[29] F. L. Lewis, D. L. Vrabie, and V. L. Syrmos. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Optimal Control","element":"span"},{"text":". John Wiley & Sons, Inc., Hoboken, NJ, 3 edition, 2012.","element":"span"}],[{"id":"id-42","text":"[30] M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, ","element":"span"},{"text":"M. Isard, M. Kudlur, J. Levenberg, R. Monga, S. Moore, D. G. Murray, B. Steiner, P. Tucker, V. Vasudevan, P. Warden, M. Wicke, Y. Yu, and X. Zheng. TensorFlow: A system for largescale machine learning. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the USENIX Symposium on Operating Systems Design and Implementation (OSDI)","element":"span"},{"text":", pages 265–283, 2016.","element":"span"}],[{"id":"id-43","text":"[31] S. Prajna, A. Papachristodoulou, and P. A. Parrilo. Introducing SOSTOOLS: A general purpose ","element":"span"},{"text":"sum of squares programming solver. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the IEEE Conference on Decision and Control (CDC)","element":"span"},{"text":", pages 741–746, 2002.","element":"span"}],[{"id":"id-44","text":"[32] J. F. Sturm. Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Optimization Methods and Software","element":"span"},{"text":", 11(1–4):625–653, 1999.","element":"span"}],[{"id":"id-45","text":"[33] T. Koller, F. Berkenkamp, M. Turchetta, and A. Krause. Learning-based model predictive ","element":"span"},{"text":"control for safe exploration. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Proc. of the IEEE Conference on Decision and Control (CDC)","element":"span"},{"text":", 2018. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"(to appear)","element":"span"},{"text":".","element":"span"}]]},{"heading":"A Proofs","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"Theorem 2 (Lyapunov neural network): ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider ","element":"span"},{"style":{"height":16},"width":370.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-0.png","element":"img","alt":" vθ(x) = φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"as a Lyapunov candidate function, where ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-1.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a feed-forward neural network. Suppose, for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-2.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"in ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-3.png","element":"img","alt":" φθ","inline":true},{"style":{"fontStyle":"italic"},"text":", the activation function ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-4.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and weight matrix ","element":"span"},{"style":{"height":15.78},"width":266.91,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-5.png","element":"img","alt":" Wℓ ∈ Rdℓ×dℓ−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"each have a trivial nullspace. Then ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-6.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"has a trivial nullspace, and ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-7.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is positive-definite with ","element":"span"},{"style":{"height":16},"width":171.05,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-8.png","element":"img","alt":" vθ(0) = 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":16},"width":436.66,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-9.png","element":"img","alt":" vθ(x) > 0, ∀x ∈ X \\ {0}","inline":true},{"style":{"fontStyle":"italic"},"text":". Furthermore, if ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-10.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is Lipschitz continuous for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-11.png","element":"img","alt":" ℓ","inline":true},{"style":{"fontStyle":"italic"},"text":", then ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-12.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is locally Lipschitz continuous.","element":"span"}],[{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Proof","element":"span"},{"style":{"fontStyle":"italic"},"text":". ","element":"span"},{"text":"We begin by showing that ","element":"span"},{"style":{"height":14},"width":41.75,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-13.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X ","element":"span"},{"text":"by induction, and then use this to prove that ","element":"span"},{"style":{"height":9.19},"width":37.31,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-14.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is positive-definite on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". Recall that a feed-forward neural network is a successive composition of its layer transformations, such that the output ","element":"span"},{"style":{"height":16},"width":95.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-15.png","element":"img","alt":" yℓ(x)","inline":true,"padRight":true},{"text":"of layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-16.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"for the state ","element":"span"},{"style":{"height":11.6},"width":112.12,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-17.png","element":"img","alt":" x ∈ X","inline":true,"padRight":true},{"text":"is the input to layer ","element":"span"},{"style":{"height":12},"width":89.71,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-18.png","element":"img","alt":" ℓ + 1","inline":true},{"text":". Consider ","element":"span"},{"style":{"height":10.8},"width":100.77,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-19.png","element":"img","alt":" ℓ = 0","inline":true,"padRight":true},{"text":"with the input ","element":"span"},{"style":{"height":16},"width":194.45,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-20.png","element":"img","alt":" y0(x) := x","inline":true},{"text":", and the first layer output ","element":"span"},{"style":{"height":16},"width":390.24,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-21.png","element":"img","alt":"y1(x) = ϕ1(W1y0(x))","inline":true},{"text":". Clearly ","element":"span"},{"style":{"height":11.1},"width":40.82,"height":27.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-22.png","element":"img","alt":" y0","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":", since it is just the identity function. Since ","element":"span"},{"style":{"height":14},"width":127.83,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-23.png","element":"img","alt":" W1, ϕ1","inline":true},{"text":", and ","element":"span"},{"style":{"height":11.1},"width":40.82,"height":27.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-24.png","element":"img","alt":" y0","inline":true,"padRight":true},{"text":"each have a trivial nullspace in their respective input spaces, the sequence of logical statements","element":"span"}],[{"style":{"width":"87%"},"width":1380,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-25.png","element":"img"}],[{"text":"holds. Thus, ","element":"span"},{"style":{"height":16},"width":532.42,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-26.png","element":"img","alt":" x = 0 ⇐⇒ ϕ1(W1y0(x)) = 0","inline":true,"padRight":true},{"text":"holds, and ","element":"span"},{"style":{"height":11.1},"width":40.82,"height":27.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-27.png","element":"img","alt":" y1","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". If we now assume ","element":"span"},{"style":{"height":11.1},"width":38.82,"height":27.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-28.png","element":"img","alt":" yℓ","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":", it is clear that ","element":"span"},{"style":{"height":12.7},"width":78.62,"height":31.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-29.png","element":"img","alt":" yℓ+1","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":", since","element":"span"}],[{"style":{"width":"90%"},"width":1433,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-30.png","element":"img"}],[{"text":"holds in a similar fashion. As a result, ","element":"span"},{"style":{"height":11.1},"width":38.82,"height":27.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-31.png","element":"img","alt":" yℓ","inline":true,"padRight":true},{"text":"has a trivial nullspace for each layer ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-32.png","element":"img","alt":" ℓ","inline":true,"padRight":true},{"text":"by induction. Since ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-33.png","element":"img","alt":"φθ","inline":true,"padRight":true},{"text":"is a composition of a finite number of layers, ","element":"span"},{"style":{"height":14.7},"width":146.57,"height":36.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-34.png","element":"img","alt":" φθ = yL","inline":true,"padRight":true},{"text":"for some ","element":"span"},{"style":{"height":15.59},"width":147.87,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-35.png","element":"img","alt":" L ∈ N≥0","inline":true},{"text":", thus ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-36.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"has a trivial nullspace in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":".","element":"span"}],[{"text":"We now use this property of ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-37.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"to prove that the Lyapunov candidate ","element":"span"},{"style":{"height":16},"width":377.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-38.png","element":"img","alt":" vθ(x) = φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"text":"is positive-definite on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":". As an inner product, ","element":"span"},{"style":{"height":16},"width":214.21,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-39.png","element":"img","alt":" φθ(x)⊤φθ(x)","inline":true,"padRight":true},{"text":"is positive-definite on the transformed space ","element":"span"},{"style":{"height":19.2},"width":404.8,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-40.png","element":"img","alt":" Y :=�φθ(x), ∀x ∈ X�","inline":true},{"text":". Thus, ","element":"span"},{"style":{"height":16},"width":503.73,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-41.png","element":"img","alt":" vθ(x) = 0 ⇐⇒ φθ(x) = 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":178.84,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-42.png","element":"img","alt":" vθ(x) > 0","inline":true,"padRight":true},{"text":"otherwise. Since we have already proven ","element":"span"},{"style":{"height":16},"width":432.59,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-43.png","element":"img","alt":" φθ(x) = 0 ⇐⇒ x = 0","inline":true},{"text":", combining these statements shows that ","element":"span"},{"style":{"height":16},"width":385.74,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-44.png","element":"img","alt":"vθ(x) = 0 ⇐⇒ x = 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16},"width":168.15,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-45.png","element":"img","alt":" vθ(x) > 0","inline":true,"padRight":true},{"text":"otherwise. As a result, ","element":"span"},{"style":{"height":16},"width":95.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-46.png","element":"img","alt":" vθ(x)","inline":true,"padRight":true},{"text":"is positive-definite on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"text":".","element":"span"}],[{"text":"Finally, we need to show that if every activation function ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-47.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"text":"is Lipschitz continuous, then ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-48.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is locally Lipschitz continuous. If the neural network ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-49.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"is Lipschitz continuous, then clearly ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-50.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is locally Lipschitz continuous, since it is quadratic and thus differentiable with respect to ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-51.png","element":"img","alt":" φθ","inline":true},{"text":". To show that ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-52.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"is Lipschitz continuous, it is sufficient to show that each layer is Lipschitz continuous. This is due to the fact that any function composition ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"g","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"x","element":"span"},{"text":")) ","element":"span"},{"text":"is Lipschitz continuous with Lipschitz constant ","element":"span"},{"style":{"height":15.59},"width":90.9,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-53.png","element":"img","alt":"LfLg","inline":true,"padRight":true},{"text":"if ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"has Lipschitz constant ","element":"span"},{"style":{"height":15.59},"width":44.12,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-54.png","element":"img","alt":" Lf","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"g ","element":"span"},{"text":"has Lipschitz constant ","element":"span"},{"style":{"height":15.59},"width":43.12,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-55.png","element":"img","alt":" Lg","inline":true},{"text":". This fact can be seen from ","element":"span"},{"style":{"height":19.96},"width":1044.53,"height":49.91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-56.png","element":"img","alt":"��f(g(x)) − f(g(x′))�� ≤ Lf��g(x) − g(x′)�� ≤ LfLg��x − x′��","inline":true},{"text":", for each pair ","element":"span"},{"style":{"height":14},"width":159,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-57.png","element":"img","alt":" x, x′ ∈ X","inline":true},{"text":". By the Lipschitz continuity of function composition and the linearity of ","element":"span"},{"style":{"height":14.7},"width":142.42,"height":36.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-58.png","element":"img","alt":" Wℓyℓ−1","inline":true},{"text":", each layer transformation ","element":"span"},{"style":{"height":16},"width":321.82,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-59.png","element":"img","alt":"yℓ = ϕℓ(Wℓyℓ−1)","inline":true,"padRight":true},{"text":"is Lipschitz continuous if ","element":"span"},{"style":{"height":10},"width":40.07,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-60.png","element":"img","alt":" ϕℓ","inline":true,"padRight":true},{"text":"is Lipschitz continuous. As a result, the neural network ","element":"span"},{"style":{"height":14},"width":41.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-61.png","element":"img","alt":" φθ","inline":true,"padRight":true},{"text":"is Lipschitz continuous, and the Lyapunov candidate ","element":"span"},{"style":{"height":9.19},"width":37.32,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-62.png","element":"img","alt":" vθ","inline":true,"padRight":true},{"text":"is locally Lipschitz continuous.","element":"span"}],[{"style":{"width":"1%"},"width":27,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-63.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Remark 1: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In ","element":"span"},{"href":"#id-46","text":"(2)","element":"a"},{"style":{"fontStyle":"italic"},"text":", we ensured each weight matrix ","element":"span"},{"style":{"height":13.19},"width":62.01,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-64.png","element":"img","alt":" Wℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"has a trivial nullspace with the structure","element":"span"}],[{"style":{"width":"28%"},"width":454,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-65.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":15.78},"width":272.03,"height":39.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-66.png","element":"img","alt":" Gℓ1 ∈ Rqℓ×dℓ−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for some ","element":"span"},{"style":{"height":18.98},"width":929.6,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-67.png","element":"img","alt":" qℓ ∈ N≥1, Gℓ2 ∈ R(dℓ−dℓ−1)×dℓ−1, Idℓ−1 ∈ Rdℓ−1×dℓ−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the identity matrix, and ","element":"span"},{"style":{"height":14.39},"width":148,"height":35.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-68.png","element":"img","alt":" ε ∈ R>0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a constant. To minimize the number of free parameters required by our neural network Lyapunov candidate, we choose ","element":"span"},{"style":{"height":10},"width":31.79,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-69.png","element":"img","alt":" qℓ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"to be the minimum integer such that each entry in ","element":"span"},{"style":{"height":17.32},"width":366,"height":43.31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-70.png","element":"img","alt":" G⊤ℓ1Gℓ1 ∈ Rdℓ−1×dℓ−1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is independent from the others. Since ","element":"span"},{"style":{"height":14.74},"width":125.98,"height":36.85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-71.png","element":"img","alt":" G⊤ℓ1Gℓ1","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is symmetric, it has ","element":"span"},{"style":{"height":22.95},"width":491.74,"height":57.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-72.png","element":"img","alt":"�dℓ−1j=1 j = dℓ−1(dℓ−1 + 1)/2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"free parameters, thereby requiring ","element":"span"},{"style":{"height":16},"width":466.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-73.png","element":"img","alt":" qℓdℓ−1 ≥ dℓ−1(dℓ−1 + 1)/2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"or ","element":"span"},{"style":{"height":16},"width":302.67,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-74.png","element":"img","alt":"qℓ ≥ (dℓ−1 + 1)/2","inline":true},{"style":{"fontStyle":"italic"},"text":". For this, we choose ","element":"span"},{"style":{"height":19.2},"width":340.42,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/1808.00924/images/10-75.png","element":"img","alt":" qℓ =�(dℓ−1 + 1)/2�","inline":true},{"style":{"fontStyle":"italic"},"text":".","element":"span"}]]}],"_version":"3.3.4"},"paperNode":"$28:props:children:props:children:0:props:product"}]]