1b:["$","$L29",null,{"isWhiteLabelled":false,"children":["$","$Lb",null,{"pt":{"compact":0,"expanded":3},"children":[["$","$L2a",null,{"noStar":true,"publisher":true,"task":true,"params":true,"size":"xl","product":{"id":"eyJwYXBlcklEIjoiMjAwMS4wNDY3OCIsInB1Ymxpc2hlciI6ImFyeGl2In0=","publisher":"arxiv","updated":"2020-01-18T09:09:22.000Z","paperID":"2001.04678","published":"2020-01-14T09:19:39.000Z","authors":"[\"David Balduzzi\",\"Wojciech M Czarnecki\",\"Thomas W Anthony\",\"Ian M Gemp\",\"Edward Hughes\",\"Joel Z Leibo\",\"Georgios Piliouras\",\"Thore Graepel\"]","title":"Smooth markets: A basic mechanism for organizing gradient-based learners","scoreTrending":null,"summary":"With the success of modern machine learning, it is becoming increasingly\nimportant to understand and control how learning algorithms interact.\nUnfortunately, negative results from game theory show there is little hope of\nunderstanding or controlling general n-player games. We therefore introduce\nsmooth markets (SM-games), a class of n-player games with pairwise zero sum\ninteractions. SM-games codify a common design pattern in machine learning that\nincludes (some) GANs, adversarial training, and other recent algorithms. We\nshow that SM-games are amenable to analysis and optimization using first-order\nmethods.","lastCheckedForCode":"2022-09-05T16:33:24.757Z","links":[],"reposConnection":{"edges":[]},"models":[],"tags":[],"summaries":[],"emailsConnection":{"edges":[]},"__typename":"paper","authorArray":["David Balduzzi","Wojciech M Czarnecki","Thomas W Anthony","Ian M Gemp","Edward Hughes","Joel Z Leibo","Georgios Piliouras","Thore Graepel"]}}],["$","$L18",null,{"container":true,"columns":100,"spacing":{"compact":0,"expanded":2,"large":3},"children":[["$","$L18",null,{"size":{"compact":100,"expanded":100,"large":68},"children":[["$","$7",null,{"children":["$","$L2b",null,{"publisher":"arxiv","paperID":"2001.04678","product":{"paper":"$1b:props:children:props:children:0:props:product","models":"$1b:props:children:props:children:0:props:product:models"},"isWhiteLabelled":false}]}],["$","$7",null,{"children":["$","$L2c",null,{"article":"$L2d","model":"$undefined"}]}]]}],["$","$L18",null,{"size":"grow","children":["$","$L2e",null,{}]}]]}],["$","$7",null,{"children":null}],[["$","audio",null,{"id":"tts"}],["$","$L2f",null,{"paperID":"2001.04678","publisher":"arxiv","paperJSON":{"title":"Smooth markets: A basic mechanism for organizing gradient-based learners","paperID":"2001.04678","avgLineHeight":10.91,"imgScale":4,"sections":[{"heading":"ABSTRACT","paragraphs":[[{"text":"With the success of modern machine learning, it is becoming increasingly important to understand and control how learning algorithms interact. Unfortunately, negative results from game theory show there is little hope of understanding or controlling general ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"-player games. We therefore introduce smooth markets (SM-games), a class of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"-player games with pairwise zero sum interactions. SM-games codify a common design pattern in machine learning that includes (some) GANs, adversarial training, and other recent algorithms. We show that SM-games are amenable to analysis and optimization using first-order methods.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"“I began to see ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"legibility ","element":"span"},{"style":{"fontStyle":"italic"},"text":"as a central problem in modern statecraft. The premodern state was, in many respects, partially blind [","element":"span"},{"style":{"fontStyle":"italic"},"text":". . .","element":"span"},{"style":{"fontStyle":"italic"},"text":"] It lacked anything like a detailed ‘map’ of its terrain and its people. It lacked, for the most part, a measure, a metric that would allow it to ‘translate’ what it knew into a common standard necessary for a synoptic view. As a result, its interventions were often crude and self-defeating.”","element":"span"}],[{"style":{"width":"42%"},"width":677,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/0-0.png","element":"img"}]]},{"heading":"1 INTRODUCTION","paragraphs":[[{"text":"As artificial agents proliferate, it is increasingly important to analyze, predict and control their collective behavior ","element":"span"},{"href":"#id-0","referenceIndex":37,"text":"(Parkes and Wellman, ","element":"a"},{"href":"#id-0","referenceIndex":37,"text":"2015; ","element":"a"},{"href":"#id-1","referenceIndex":39,"text":"Rahwan et al., ","element":"a"},{"href":"#id-1","referenceIndex":39,"text":"2019)","element":"a"},{"text":". Unfortunately, despite almost a century of intense research since ","element":"span"},{"href":"#id-2","referenceIndex":50,"text":"von Neumann ","element":"a"},{"href":"#id-2","referenceIndex":50,"text":"(1928)","element":"a"},{"text":", game theory provides little guidance outside a few special cases such as two-player zero-sum, auctions, and potential games ","element":"span"},{"href":"#id-3","referenceIndex":33,"text":"(Monderer and Shapley, ","element":"a"},{"href":"#id-3","referenceIndex":33,"text":"1996; ","element":"a"},{"href":"#id-4","referenceIndex":35,"text":"Nisan et al., ","element":"a"},{"href":"#id-4","referenceIndex":35,"text":"2007; ","element":"a"},{"href":"#id-5","referenceIndex":48,"text":"Vickrey, ","element":"a"},{"href":"#id-5","referenceIndex":48,"text":"1961; ","element":"a"},{"href":"#id-6","referenceIndex":51,"text":"von Neumann and Morgenstern, ","element":"a"},{"href":"#id-6","referenceIndex":51,"text":"1944)","element":"a"},{"text":". Nash equilibria provide a general solution concept, but are intractable in almost all cases for many different reasons ","element":"span"},{"href":"#id-7","referenceIndex":4,"text":"(Babichenko, ","element":"a"},{"href":"#id-7","referenceIndex":4,"text":"2016; ","element":"a"},{"href":"#id-8","referenceIndex":13,"text":"Daskalakis et al., ","element":"a"},{"href":"#id-8","referenceIndex":13,"text":"2009; ","element":"a"},{"href":"#id-9","referenceIndex":19,"text":"Hart and Mas-Colell, ","element":"a"},{"href":"#id-9","referenceIndex":19,"text":"2003)","element":"a"},{"text":". These and other negative results ","element":"span"},{"href":"#id-10","referenceIndex":36,"text":"(Palaiopanos et al., ","element":"a"},{"href":"#id-10","referenceIndex":36,"text":"2017) ","element":"a"},{"text":"suggest that understanding and controlling societies of artificial agents is near hopeless. Nevertheless, human societies – of billions of agents – manage to organize themselves reasonably well and mostly progress with time, suggesting game theory is missing some fundamental organizing principles.","element":"span"}],[{"text":"In this paper, we investigate how markets structure the behavior of agents. Market mechanisms have been studied extensively ","element":"span"},{"href":"#id-4","referenceIndex":35,"text":"(Nisan et al., ","element":"a"},{"href":"#id-4","referenceIndex":35,"text":"2007)","element":"a"},{"text":". However, prior work has restricted to concrete examples, such as auctions and prediction markets, and strong assumptions, such as convexity. Our approach is more abstract and more directly suited to modern machine learning where the building blocks are neural nets. Markets, for us, encompass discriminators and generators trading errors in GANs ","element":"span"},{"href":"#id-11","referenceIndex":18,"text":"(Goodfellow et al., ","element":"a"},{"href":"#id-11","referenceIndex":18,"text":"2014) ","element":"a"},{"text":"and agents trading wins and losses in StarCraft ","element":"span"},{"href":"#id-12","referenceIndex":49,"text":"(Vinyals et al., ","element":"a"},{"href":"#id-12","referenceIndex":49,"text":"2019)","element":"a"},{"text":".","element":"span"}],[{"text":"1.1 ","element":"span"},{"text":"O","element":"span"},{"text":"VERVIEW","element":"span"}],[{"text":"The paper introduces a class of games where ","element":"span"},{"style":{"fontWeight":"bold"},"text":"optimization and aggregation make sense","element":"span"},{"text":". The phrase requires unpacking. “Optimization” means gradient-based methods. Gradient descent (and friends) are the workhorse of modern machine learning. Even when gradients are not available, gradient ","element":"span"},{"style":{"fontStyle":"italic"},"text":"estimates ","element":"span"},{"text":"underpin many reinforcement learning and evolutionary algorithms. “Aggregation” means weighted sums. Sums and averages are the workhorses for analyzing ensembles and populations ","element":"span"},{"text":"across many fields. “Makes sense” means we can draw conclusions about the gradient-based dynamics of the collective by summing over properties of its members.","element":"span"}],[{"text":"As motivation, we present some pathologies that arise in even the simplest smooth games. Examples in section ","element":"span"},{"text":"2 ","element":"span"},{"text":"show that coupling strongly concave profit functions to form a game can lead to uncontrolled behavior, such as spiraling to infinity and excessive sensitivity to learning rates. Hence, one of our goals is to understand how to ‘glue together agents’ such that their collective behavior is predictable.","element":"span"}],[{"text":"Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"introduces a class of games where simultaneous gradient ascent behaves well and is amenable to analysis. In a ","element":"span"},{"style":{"fontWeight":"bold"},"text":"smooth market (SM-game)","element":"span"},{"text":", each player’s profit is composed of a personal objective and pairwise zero-sum interactions with other players. Zero-sum interactions are analogous to monetary exchange (my expenditure is your revenue), double-entry bookkeeping (credits balance debits), and conservation of energy (actions cause equal and opposite reactions). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"SM-games explicitly account for externalities","element":"span"},{"text":". Remarkably, building this simple bookkeeping mechanism into games has strong implications for the dynamics of gradient-based learners. SM-games generalize adversarial games ","element":"span"},{"href":"#id-13","referenceIndex":12,"text":"(Cai et al., ","element":"a"},{"href":"#id-13","referenceIndex":12,"text":"2016) ","element":"a"},{"text":"and codify a common ","element":"span"},{"style":{"fontWeight":"bold"},"text":"design pattern ","element":"span"},{"text":"in machine learning, see section ","element":"span"},{"href":"#id-14","text":"3.1.","element":"a"}],[{"text":"Section ","element":"span"},{"text":"4 ","element":"span"},{"text":"studies SM-games from two points of view. Firstly, from that of a rational, profit-maximizing agent that makes decisions based on first-order profit ","element":"span"},{"style":{"fontWeight":"bold"},"text":"forecasts","element":"span"},{"text":". Secondly, from that of the game as a whole. SM-games are not potential games, so the game does not optimize any single function. A collective of profit-maximizing agents is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"rational because they do not optimize a shared objective ","element":"span"},{"href":"#id-15","referenceIndex":14,"text":"(Drexler, ","element":"a"},{"href":"#id-15","referenceIndex":14,"text":"2019)","element":"a"},{"text":". We therefore introduce the notion of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"legibility","element":"span"},{"text":", which quantifies how the dynamics of the collective relate to that of individual agents.","element":"span"}],[{"text":"Finally, section ","element":"span"},{"text":"5 ","element":"span"},{"text":"applies legibility to prove some basic theorems on the dynamics of SM-games under gradient-ascent. We show that ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(i) ","element":"span"},{"text":"Nash equilibria are stable; ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(ii","element":"span"},{"text":") that if profits are strictly concave then gradient ascent converges to a Nash equilibrium for all learning rates; and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(iii) ","element":"span"},{"text":"the dynamics are bounded under reasonable assumptions.","element":"span"}],[{"text":"The results are important for two reasons. Firstly, we identify a class of games whose dynamics are, at least in some respects, amenable to analysis and control. The kinds of pathologies described in section ","element":"span"},{"text":"2 ","element":"span"},{"text":"cannot arise in SM-games. Secondly, we identify the specific quantities, forecasts, that are useful to track at the level of individual firms and can be meaningfully aggregated to draw conclusions about their global dynamics. It follows that forecasts should be a useful lever for mechanism design.","element":"span"}],[{"text":"1.2 ","element":"span"},{"text":"R","element":"span"},{"text":"ELATED WORK","element":"span"}],[{"text":"A wide variety of machine learning markets and agent-based economies have been proposed and studied: ","element":"span"},{"href":"#id-16","referenceIndex":1,"text":"Abernethy and Frongillo ","element":"a"},{"href":"#id-16","referenceIndex":1,"text":"(2011)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-17","referenceIndex":7,"text":"Balduzzi ","element":"a"},{"href":"#id-17","referenceIndex":7,"text":"(2014)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-18","referenceIndex":9,"text":"Barto et al. ","element":"a"},{"href":"#id-18","referenceIndex":9,"text":"(1983)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-19","referenceIndex":10,"text":"Baum ","element":"a"},{"href":"#id-19","referenceIndex":10,"text":"(1999)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-20","referenceIndex":20,"text":"Hu ","element":"a"},{"href":"#id-20","referenceIndex":20,"text":"and Storkey ","element":"a"},{"href":"#id-20","referenceIndex":20,"text":"(2014)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-21","referenceIndex":21,"text":"Kakade et al. ","element":"a"},{"href":"#id-21","referenceIndex":21,"text":"(2003; ","element":"a"},{"href":"#id-22","referenceIndex":22,"text":"2005)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-23","referenceIndex":23,"text":"Kearns et al. ","element":"a"},{"href":"#id-23","referenceIndex":23,"text":"(2001)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-24","referenceIndex":26,"text":"Kwee et al. ","element":"a"},{"href":"#id-24","referenceIndex":26,"text":"(2001)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-25","referenceIndex":27,"text":"Lay and ","element":"a"},{"href":"#id-25","referenceIndex":27,"text":"Barbu ","element":"a"},{"href":"#id-25","referenceIndex":27,"text":"(2010)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-26","referenceIndex":32,"text":"Minsky ","element":"a"},{"href":"#id-26","referenceIndex":32,"text":"(1986)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-27","referenceIndex":41,"text":"Selfridge ","element":"a"},{"href":"#id-27","referenceIndex":41,"text":"(1958)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-28","referenceIndex":44,"text":"Storkey ","element":"a"},{"href":"#id-28","referenceIndex":44,"text":"(2011)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-29","referenceIndex":45,"text":"Storkey et al. ","element":"a"},{"href":"#id-29","referenceIndex":45,"text":"(2012)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-30","referenceIndex":46,"text":"Sutton et al. ","element":"a"},{"href":"#id-30","referenceIndex":46,"text":"(2011)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-31","referenceIndex":52,"text":"Wellman and Wurman ","element":"a"},{"href":"#id-31","referenceIndex":52,"text":"(1998)","element":"a"},{"text":". The goal of this paper is different. Rather than propose another market mechanism, we abstract an existing design pattern and elucidate some of its consequences for interacting agents.","element":"span"}],[{"text":"Our approach draws on work studying convergence in generative adversarial networks ","element":"span"},{"href":"#id-32","referenceIndex":8,"text":"(Balduzzi ","element":"a"},{"href":"#id-32","referenceIndex":8,"text":"et al., ","element":"a"},{"href":"#id-32","referenceIndex":8,"text":"2018; ","element":"a"},{"href":"#id-33","referenceIndex":16,"text":"Gemp and Mahadevan, ","element":"a"},{"href":"#id-33","referenceIndex":16,"text":"2018; ","element":"a"},{"href":"#id-34","referenceIndex":17,"text":"Gidel et al., ","element":"a"},{"href":"#id-34","referenceIndex":17,"text":"2019; ","element":"a"},{"href":"#id-35","referenceIndex":30,"text":"Mescheder, ","element":"a"},{"href":"#id-35","referenceIndex":30,"text":"2018; ","element":"a"},{"href":"#id-36","referenceIndex":31,"text":"Mescheder et al., ","element":"a"},{"href":"#id-36","referenceIndex":31,"text":"2017)","element":"a"},{"text":", related minimax problems ","element":"span"},{"href":"#id-37","referenceIndex":3,"text":"(Abernethy et al., ","element":"a"},{"href":"#id-37","referenceIndex":3,"text":"2019; ","element":"a"},{"href":"#id-38","referenceIndex":6,"text":"Bailey and Piliouras, ","element":"a"},{"href":"#id-38","referenceIndex":6,"text":"2018)","element":"a"},{"text":", and monotone games ","element":"span"},{"href":"#id-39","referenceIndex":15,"text":"(Gemp and Mahadevan, ","element":"a"},{"href":"#id-39","referenceIndex":15,"text":"2017; ","element":"a"},{"href":"#id-40","referenceIndex":34,"text":"Nemirovski et al., ","element":"a"},{"href":"#id-40","referenceIndex":34,"text":"2010; ","element":"a"},{"href":"#id-41","referenceIndex":47,"text":"Tatarenko and Kamgarpour, ","element":"a"},{"href":"#id-41","referenceIndex":47,"text":"2019)","element":"a"},{"text":".","element":"span"}],[{"text":"1.3 ","element":"span"},{"text":"C","element":"span"},{"text":"AVEAT","element":"span"}],[{"text":"We consider dynamics in continuous time ","element":"span"},{"style":{"height":19.37},"width":190.39,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/1-0.png","element":"img","alt":"dwdt = ξ(w)","inline":true,"padRight":true},{"text":"in this paper. Discrete dynamics, ","element":"span"},{"style":{"height":12.39},"width":139.9,"height":30.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/1-1.png","element":"img","alt":" wt+1 ←","inline":true},{"style":{"height":16},"width":182.51,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/1-2.png","element":"img","alt":"wt + ξ(w)","inline":true,"padRight":true},{"text":"require a more delicate analysis, e.g. ","element":"span"},{"href":"#id-42","referenceIndex":5,"text":"Bailey et al. ","element":"a"},{"href":"#id-42","referenceIndex":5,"text":"(2019)","element":"a"},{"text":". In particular, we do not claim that optimizing GANs and SM-games is easy in discrete time. Rather, our analyis shows that it is relatively easy in continuous time, and therefore possible in discrete time, with some additional effort. The contrast is with smooth games in general, where gradient-based methods have essentially no hope of finding local Nash equilibria even in continuous time.","element":"span"}],[{"style":{"width":"97%"},"width":1551,"height":458,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-0.png","element":"img"}],[{"text":"Figure 1: ","element":"figcaption","subtype":"caption"},{"id":"id-48","style":{"fontWeight":"bold"},"text":"Effect of learning rates in two games. ","element":"figcaption","subtype":"caption"},{"text":"Note: ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"x","element":"figcaption","subtype":"caption"},{"text":"-axis is ","element":"figcaption","subtype":"caption"},{"text":"log","element":"figcaption","subtype":"caption"},{"text":"-scale. ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Left: ","element":"figcaption","subtype":"caption"},{"text":"“half a game”, e.g. ","element":"figcaption","subtype":"caption"},{"href":"#id-43","text":"2. ","element":"a","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Right: ","element":"figcaption","subtype":"caption"},{"text":"minimal SM-game, e.g. ","element":"figcaption","subtype":"caption"},{"href":"#id-44","text":"3. ","element":"a","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Top: ","element":"figcaption","subtype":"caption"},{"text":"Both players have same learning rate. ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Bottom: ","element":"figcaption","subtype":"caption"},{"text":"Second player has ","element":"figcaption","subtype":"caption"},{"style":{"height":19.37},"width":16,"height":48.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-1.png","element":"img","alt":"18 ","inline":true,"padRight":true},{"text":"learning rate of first (which is same as for top). Reducing the learning rate of the second ","element":"figcaption","subtype":"caption"},{"text":"player destabilizes the dynamics in “half a game”, whereas the SM-game is essentially unaffected.","element":"figcaption","subtype":"caption"}],[{"text":"1.4 ","element":"span"},{"text":"N","element":"span"},{"text":"OTATION","element":"span"}],[{"text":"Vectors are column-vectors. The notations ","element":"span"},{"style":{"height":11.6},"width":101.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-2.png","element":"img","alt":" S ≻ 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":11.2},"width":100.96,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-3.png","element":"img","alt":" v ≻ 0","inline":true,"padRight":true},{"text":"refer to a positive-definite matrix and vector with all entries positive respectively. Rather than losses, we work with profits. Proofs are in the appendix. We use economic terminology (firms, profits, forecasts, and sentiment) even though the examples of SM-games, such as GANs and adversarial training, are taken from mainstream machine learning. We hope the economic terminology provides an invigorating change of perspective. The underlying mathematics is no more than first and second-order derivatives.","element":"span"}],[{"style":{"width":"52%"},"width":838,"height":224,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-4.png","element":"img"}]]},{"heading":"2 SMOOTH GAMES","paragraphs":[[{"text":"Smooth games model interacting agents with differentiable objectives. They are the kind of games that are played by neural nets. In practice, the differentiability assumption can be relaxed by replacing gradients with gradient estimates.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"smooth game ","element":"span"},{"href":"#id-45","referenceIndex":28,"style":{"fontStyle":"italic"},"text":"(Letcher et al., ","element":"a"},{"href":"#id-45","referenceIndex":28,"style":{"fontStyle":"italic"},"text":"2019) ","element":"a"},{"style":{"fontStyle":"italic"},"text":"consists in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"style":{"fontStyle":"italic"},"text":"players ","element":"span"},{"text":"[","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"] = ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"text":"1","element":"span"},{"style":{"fontStyle":"italic"},"text":", . . . , n","element":"span"},{"style":{"fontStyle":"italic"},"text":"}","element":"span"},{"style":{"fontStyle":"italic"},"text":", equipped with twice continuously differentiable profit functions ","element":"span"},{"style":{"height":17.53},"width":326.03,"height":43.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-5.png","element":"img","alt":" {πi : Rd → R}ni=1","inline":true},{"style":{"fontStyle":"italic"},"text":". The parameters are ","element":"span"},{"style":{"height":18.17},"width":1130.47,"height":45.42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-6.png","element":"img","alt":"w = (w1, . . . , wn) ∈ Rd with wi ∈ Rdi where �ni=1 di = d. Player i","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"controls the parameters ","element":"span"},{"style":{"height":9.59},"width":56.36,"height":23.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-7.png","element":"img","alt":" wi.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"If players update their actions via ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"simultaneous gradient ascent","element":"span"},{"style":{"fontStyle":"italic"},"text":", then a smooth game yields a ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"dynamical system ","element":"span"},{"style":{"fontStyle":"italic"},"text":"specified by the differential equation ","element":"span"},{"style":{"height":19.37},"width":245.14,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-8.png","element":"img","alt":"dwdt = ξ(w) for","inline":true}],[{"style":{"width":"31%"},"width":492,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":16},"width":446.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-10.png","element":"img","alt":" ξi(w) := ∇wiπi(w) is a di","inline":true},{"style":{"fontStyle":"italic"},"text":"-vector. The ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Jacobian ","element":"span"},{"style":{"fontStyle":"italic"},"text":"of a game is the ","element":"span"},{"style":{"height":16},"width":121.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-11.png","element":"img","alt":" (d × d)","inline":true},{"style":{"fontStyle":"italic"},"text":"-matrix of secondderivatives ","element":"span"},{"style":{"height":28.8},"width":297.17,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-12.png","element":"img","alt":" J(w) :=�∂ξα(w)∂wβ","inline":true}],[{"style":{"fontStyle":"italic"},"text":"substituting ","element":"span"},{"style":{"height":14},"width":303.26,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-13.png","element":"img","alt":" ℓi := −πi for all i.","inline":true}],[{"text":"Smooth games are too general to be tractable since they encompass all dynamical systems.","element":"span"}],[{"id":"id-68","style":{"fontWeight":"bold"},"text":"Lemma 1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Every continuous dynamical system on ","element":"span"},{"style":{"height":13.38},"width":45.78,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/2-14.png","element":"img","alt":" Rd","inline":true},{"style":{"fontStyle":"italic"},"text":", for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d","element":"span"},{"style":{"fontStyle":"italic"},"text":", arises as simultaneous gradient ascent on the profit functions of a smooth game.","element":"span"}],[{"text":"The next two sections illustrate some problems that arise in simple smooth games.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Definition 2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"We recall some solution concepts from dynamical systems and game theory:","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontWeight":"bold"},"text":"stable fixed point","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":16},"width":751.31,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-0.png","element":"img","alt":"w∗ satisfies ξ(w∗) = 0 and v⊺ · J(w∗) · v < 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all vectors ","element":"span"},{"style":{"height":15.2},"width":110.88,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-1.png","element":"img","alt":" v ̸= 0.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontWeight":"bold"},"text":"local Nash equilibrium ","element":"span"},{"style":{"height":10.98},"width":49.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-2.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"has ","element":"span"},{"text":"neighborhoods ","element":"span"},{"style":{"height":13.19},"width":38.21,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-3.png","element":"img","alt":" Ui","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"of ","element":"span"},{"style":{"height":15.13},"width":49.73,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-4.png","element":"img","alt":" w∗i","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":", such that ","element":"span"},{"style":{"height":16.15},"width":800.58,"height":40.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-5.png","element":"img","alt":"πi(wi, w∗−i) < πi(w∗i , w∗−i) all wi ∈ Ui \\ {w∗i }.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"• ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontWeight":"bold"},"text":"classical Nash equilibrium ","element":"span"},{"style":{"height":10.99},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-6.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"satisfies ","element":"span"},{"style":{"height":16.15},"width":463.98,"height":40.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-7.png","element":"img","alt":" πi(wi, w∗−i) ≤ πi(w∗i , w∗−i)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"height":9.59},"width":44.1,"height":23.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-8.png","element":"img","alt":" wi","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"players ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"text":"Example ","element":"span"},{"href":"#id-46","text":"1 ","element":"a"},{"text":"below shows that stable fixed points and local Nash equilibria do not necessarily coincide. The notion of classical Nash equilibrium is ill-suited to nonconcave settings.","element":"span"}],[{"text":"Intuitively, a fixed point is stable if all trajectories sufficiently nearby flow into it. A joint strategy is a local Nash if each player is harmed if it makes a small unilateral deviation. Local Nash differs from the classic definition in two ways. It is weaker, because it only allows ","element":"span"},{"style":{"fontStyle":"italic"},"text":"small ","element":"span"},{"text":"unilateral deviations. This is necessary since players are neural networks and profits are not usually concave. It is also stronger, because unilateral deviations decrease (rather than ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not increase","element":"span"},{"text":") profits.","element":"span"}],[{"text":"2.1 ","element":"span"},{"text":"P","element":"span"},{"text":"ROBLEMS WITH POTENTIAL GAMES","element":"span"}],[{"text":"A game is a potential game if ","element":"span"},{"style":{"height":14},"width":131.8,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-9.png","element":"img","alt":" ξ = ∇φ","inline":true,"padRight":true},{"text":"for some function ","element":"span"},{"style":{"height":14},"width":24,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-10.png","element":"img","alt":" φ","inline":true},{"text":", see ","element":"span"},{"href":"#id-32","referenceIndex":8,"text":"Balduzzi et al. ","element":"a"},{"href":"#id-32","referenceIndex":8,"text":"(2018) ","element":"a"},{"text":"for details.","element":"span"}],[{"id":"id-46","style":{"fontWeight":"bold"},"text":"Example 1 ","element":"span"},{"text":"(potential game)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix a small ","element":"span"},{"style":{"height":11.6},"width":89.31,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-11.png","element":"img","alt":" ϵ > 0","inline":true},{"style":{"fontStyle":"italic"},"text":". Consider the two-player games with profit functions","element":"span"}],[{"style":{"width":"53%"},"width":842,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-12.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"The game has a unique local Nash equilibrium at ","element":"span"},{"style":{"height":16},"width":658.5,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-13.png","element":"img","alt":" w = (0, 0) with π1(0, 0) = 0 = π2(0, 0).","inline":true}],[{"text":"The game is chosen to be as nice as possible: ","element":"span"},{"style":{"height":9.19},"width":38.71,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-14.png","element":"img","alt":" π1","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.19},"width":38.72,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-15.png","element":"img","alt":" π2","inline":true,"padRight":true},{"text":"are strongly concave functions of ","element":"span"},{"style":{"height":9.19},"width":44.53,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-16.png","element":"img","alt":" w1","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":9.19},"width":44.52,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-17.png","element":"img","alt":" w2","inline":true,"padRight":true},{"text":"respectively. The game is a ","element":"span"},{"style":{"fontWeight":"bold"},"text":"potential game ","element":"span"},{"text":"since ","element":"span"},{"style":{"height":16},"width":570.34,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-18.png","element":"img","alt":" ξ = (w2 − ϵw1, w1 − ϵw2) = ∇φ","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":18.88},"width":483.74,"height":47.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-19.png","element":"img","alt":"φ(w) = w1w2 − ϵ2(w21 + w22)","inline":true},{"text":". Nevertheless, the game exhibits three related problems.","element":"span"}],[{"text":"Firstly, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"the Nash equilibrium is unstable. ","element":"span"},{"text":"Players at the Nash equilibrium can increase their profits via the joint update ","element":"span"},{"style":{"height":16},"width":381.65,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-20.png","element":"img","alt":" w ← (0, 0) + η · (1, 1)","inline":true},{"text":", so ","element":"span"},{"style":{"height":17.49},"width":554.48,"height":43.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-21.png","element":"img","alt":" π1(w) = η(1 − ϵ2) = π2(w) > 0","inline":true},{"text":". The existence ","element":"span"},{"text":"of a Nash equilibrium where players can improve their payoffs by ","element":"span"},{"style":{"fontStyle":"italic"},"text":"coordinated action ","element":"span"},{"text":"suggests the incentives are not well-designed.","element":"span"}],[{"text":"Secondly, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"the dynamics can diverge to infinity. ","element":"span"},{"text":"Starting at ","element":"span"},{"style":{"height":18.18},"width":218.72,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-22.png","element":"img","alt":" w(1) = (1, 1)","inline":true,"padRight":true},{"text":"and applying simultaneously gradient ascent causes the norm of vector ","element":"span"},{"style":{"height":18.19},"width":128.52,"height":45.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-23.png","element":"img","alt":" ∥w(t)∥2","inline":true,"padRight":true},{"text":"to increase without limit as ","element":"span"},{"style":{"height":10.4},"width":127.41,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-24.png","element":"img","alt":" t → ∞","inline":true,"padRight":true},{"text":"– and at an accelerating rate – due to a positive feedback loop between the players’ parameters and profits. Finally, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"players impose externalities on each other. ","element":"span"},{"text":"The decisions of the first player affect the profits of the second, and vice versa. Obviously players must interact for a game to be interesting. However, positive feedback loops arise because the interactions are not properly accounted for.","element":"span"}],[{"text":"In short, simultaneous gradient ascent does not converge to the Nash – and can diverge to infinity. It is open to debate whether the fault lies with gradients, the concept of Nash, or the game structure. In this paper, we take gradients and Nash equilibria as given and seek to design better games.","element":"span"}],[{"id":"id-49","text":"2.2 ","element":"span"},{"text":"P","element":"span"},{"text":"ROBLEMS WITH LEARNING RATES","element":"span"}],[{"text":"Gradient-based optimizers rarely follow the actual gradient. For example RMSProp and Adam use adaptive, parameter-dependent learning rates. This is not a problem when optimizing a function. Suppose ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"text":"is optimized with reweighted gradient ","element":"span"},{"style":{"height":16.79},"width":741.63,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-25.png","element":"img","alt":" (∇f)η := (η1∇1f, . . . , ηn∇nf) where η ≻ 0","inline":true,"padRight":true},{"text":"is a vector of learning rates. Even though ","element":"span"},{"style":{"height":16.79},"width":107,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-26.png","element":"img","alt":" (∇f)η","inline":true,"padRight":true},{"text":"is not necessarily the gradient of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"any ","element":"span"},{"text":"function, it behaves ","element":"span"},{"style":{"fontStyle":"italic"},"text":"like ","element":"span"},{"style":{"height":14},"width":56.21,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-27.png","element":"img","alt":" ∇f","inline":true,"padRight":true},{"text":"because they have positive inner product when ","element":"span"},{"style":{"height":15.2},"width":144.06,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-28.png","element":"img","alt":" ∇f ̸= 0:","inline":true}],[{"style":{"width":"58%"},"width":935,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-29.png","element":"img"}],[{"text":"Parameter-dependent learning rates thus behave well in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"potential ","element":"span"},{"text":"games where the dynamics derive from an implicit potential function ","element":"span"},{"style":{"height":16},"width":261.51,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/3-30.png","element":"img","alt":" ξ(w) = ∇φ(w)","inline":true},{"text":". Severe problems can arise in general games.","element":"span"}],[{"id":"id-43","style":{"fontWeight":"bold"},"text":"Example 2 ","element":"span"},{"text":"(“half a game”)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider the following game, where the ","element":"span"},{"style":{"height":9.19},"width":44.52,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-0.png","element":"img","alt":" w2","inline":true},{"style":{"fontStyle":"italic"},"text":"-player is indifferent to ","element":"span"},{"style":{"height":9.59},"width":59.41,"height":23.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-1.png","element":"img","alt":" w1:","inline":true}],[{"style":{"width":"46%"},"width":730,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-2.png","element":"img"}],[{"text":"The dynamics are clear by inspection: the ","element":"span"},{"style":{"height":9.19},"width":44.52,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-3.png","element":"img","alt":" w2","inline":true},{"text":"-player converges to ","element":"span"},{"style":{"height":13.19},"width":120.18,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-4.png","element":"img","alt":" w2 = 0","inline":true},{"text":", and then the ","element":"span"},{"style":{"height":9.19},"width":44.53,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-5.png","element":"img","alt":" w1","inline":true},{"text":"-player does the same. It is hard to imagine that anything could go wrong. In contrast, behavior in the next example should be worse because convergence is slowed down by cycling around the Nash:","element":"span"}],[{"id":"id-44","style":{"fontWeight":"bold"},"text":"Example 3 ","element":"span"},{"text":"(minimal SM-game)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A simple SM-game, see definition ","element":"span"},{"href":"#id-47","style":{"fontStyle":"italic"},"text":"3, ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is","element":"span"}],[{"style":{"width":"54%"},"width":872,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-6.png","element":"img"}],[{"text":"Figure ","element":"span"},{"href":"#id-48","text":"1 ","element":"a"},{"text":"shows the dynamics of the games, in discrete time, with small learning rates and small gradient noise. In the top panel, both players have the same learning rate. Both games converge. Example ","element":"span"},{"href":"#id-43","text":"2 ","element":"a"},{"text":"converges faster – as expected – without cycling around the Nash.","element":"span"}],[{"text":"In the bottom panels, the learning rate of the second player is decreased by a factor of eight. The SM-game’s dynamics do not change significantly. In contrast, the dynamics of example ","element":"span"},{"href":"#id-43","text":"2 ","element":"a"},{"text":"become unstable: although player 1 is attracted to the Nash, it is extremely sensitive to noise and does not stay there for long. One goal of the paper is to explain why SM-games are more robust, in general, to differences in relative learning rates.","element":"span"}],[{"text":"2.3 ","element":"span"},{"text":"S","element":"span"},{"text":"TOP GRADIENT ","element":"span"},{"text":"AND LEARNING RATES","element":"span"}],[{"text":"Tools for automatic differentiation (AD) such as TensorFlow and PyTorch include ","element":"span"},{"text":"stop gradient ","element":"span"},{"text":"operators that stop gradients from being computed. ","element":"span"},{"text":"For example, let ","element":"span"},{"style":{"height":16},"width":277.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-7.png","element":"img","alt":" f(w) = w1 ·","inline":true},{"style":{"height":18.88},"width":645.71,"height":47.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-8.png","element":"img","alt":"stop gradient(w2) − ϵ2(w21 + w22)","inline":true},{"text":". The use of ","element":"span"},{"text":"stop gradient ","element":"span"},{"text":"means ","element":"span"},{"style":{"fontStyle":"italic"},"text":"f ","element":"span"},{"text":"is not strictly ","element":"span"},{"text":"speaking a function and so we use ","element":"span"},{"style":{"height":13.19},"width":73.35,"height":32.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-9.png","element":"img","alt":" ∇AD","inline":true,"padRight":true},{"text":"to refer to its gradient under automatic differentiation. Then","element":"span"}],[{"style":{"width":"32%"},"width":512,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-10.png","element":"img"}],[{"text":"which is the simultaneous gradient from example ","element":"span"},{"href":"#id-43","text":"2. ","element":"a"},{"text":"Any smooth vector field is the gradient of a function augmented with ","element":"span"},{"text":"stop gradient ","element":"span"},{"text":"operators, see appendix ","element":"span"},{"text":"D. ","element":"span"},{"text":"Stop gradient ","element":"span"},{"text":"is often used in complex neural architectures (for example when one neural network is fed into another leading to multiplicative interactions), and is thought to be mostly harmless. Section ","element":"span"},{"href":"#id-49","text":"2.2 ","element":"a"},{"text":"shows that ","element":"span"},{"text":"stop gradient","element":"span"},{"text":"s can interact in unexpected ways with parameter-dependent learning rates.","element":"span"}],[{"text":"2.4 ","element":"span"},{"text":"S","element":"span"},{"text":"UMMARY","element":"span"}],[{"text":"It is natural to expect individually well-behaved agents to also behave well collectively. Unfortunately, this basic requirement fails in even the simplest examples.","element":"span"}],[{"text":"Maximizing a strongly concave function is well-behaved: there is a unique, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"finite ","element":"span"},{"text":"global maximum. However, example ","element":"span"},{"href":"#id-46","text":"1 ","element":"a"},{"text":"shows that coupling concave functions can cause simultaneous gradient ascent to diverge to infinity. The dynamics of the game ","element":"span"},{"style":{"fontWeight":"bold"},"text":"differs in kind ","element":"span"},{"text":"from the dynamics of the players in isolation. Example ","element":"span"},{"href":"#id-43","text":"2 ","element":"a"},{"text":"shows that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"reducing ","element":"span"},{"text":"the learning rate of a well-behaved (strongly concave) player in a simple game destabilizes the dynamics. How collectives behave is sensitive not only to profits, but also to relative learning rates. Off-the-shelf optimizers such as Adam ","element":"span"},{"href":"#id-50","referenceIndex":24,"text":"(Kingma and Ba, ","element":"a"},{"href":"#id-50","referenceIndex":24,"text":"2015) ","element":"a"},{"text":"modify learning rates under the hood, which may destabilize some games.","element":"span"}]]},{"heading":"3 SMOOTH MARKETS (SM-GAMES)","paragraphs":[[{"text":"Let us restrict to more structured games. Take an accountant’s view of the world, where the only thing we track is the flow of money. Interactions are pairwise. Money is neither created nor destroyed, so interactions are zero-sum. If we model the interactions between players by differentiable functions ","element":"span"},{"style":{"height":16.79},"width":192.53,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-11.png","element":"img","alt":"gij(wi, wj)","inline":true,"padRight":true},{"text":"that depend on their respective strategies then we have an SM-game. All interactions are explicitly tracked. There are no externalities off the books. Positive interactions, ","element":"span"},{"style":{"height":15.59},"width":124.06,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/4-12.png","element":"img","alt":" gij > 0","inline":true},{"text":", are revenue, negative are costs, and the difference is profit. The model prescribes that all firms are profit maximizers. More formally:","element":"span"}],[{"id":"id-47","style":{"fontWeight":"bold"},"text":"Definition 3 ","element":"span"},{"text":"(SM-game)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"smooth market ","element":"span"},{"style":{"fontStyle":"italic"},"text":"is a smooth game where interactions between players are pairwise zero-sum. The profits have the form","element":"span"}],[{"style":{"width":"67%"},"width":1074,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":16.79},"width":684.44,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-1.png","element":"img","alt":" gij(wi, wj) + gji(wj, wi) ≡ 0 for all i, j.","inline":true}],[{"text":"The functions ","element":"span"},{"style":{"height":14},"width":30.51,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-2.png","element":"img","alt":" fi","inline":true,"padRight":true},{"text":"can act as regularizers. Alternatively, they can be interpreted as natural resources or dummy players that react too slowly to model as players. Dummy players provide firms with easy (non-adversarial) sources of revenue.","element":"span"}],[{"text":"Humans, unlike firms, are not profit-maximizers; humans typically buy goods because they value them more than the money they spend on them. Appendix ","element":"span"},{"text":"C ","element":"span"},{"text":"briefly discusses extending the model.","element":"span"}],[{"id":"id-14","text":"3.1 ","element":"span"},{"text":"E","element":"span"},{"text":"XAMPLES OF ","element":"span"},{"text":"SM-","element":"span"},{"text":"GAMES","element":"span"}],[{"text":"SM-games codify a common design pattern:","element":"span"}],[{"text":"1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Optimizing a function. ","element":"span"},{"text":"A near-trivial case is where there is a single player with profit ","element":"span"},{"style":{"height":16},"width":270.58,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-3.png","element":"img","alt":"π1(w) = f1(w).","inline":true}],[{"text":"2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Generative adversarial networks ","element":"span"},{"text":"and related architectures like CycleGANs are zero or near zero sum ","element":"span"},{"href":"#id-11","referenceIndex":18,"text":"(Goodfellow et al., ","element":"a"},{"href":"#id-11","referenceIndex":18,"text":"2014; ","element":"a"},{"href":"#id-51","referenceIndex":53,"text":"Wu et al., ","element":"a"},{"href":"#id-51","referenceIndex":53,"text":"2019; ","element":"a"},{"href":"#id-52","referenceIndex":54,"text":"Zhu et al., ","element":"a"},{"href":"#id-52","referenceIndex":54,"text":"2017)","element":"a"},{"text":".","element":"span"}],[{"text":"3. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Zero-sum polymatrix games ","element":"span"},{"text":"are SM-games where ","element":"span"},{"style":{"height":16},"width":217.7,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-4.png","element":"img","alt":" fi(wi) ≡ 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16.79},"width":251.33,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-5.png","element":"img","alt":" gij(wi, wj) =","inline":true},{"style":{"height":17.2},"width":162.58,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-6.png","element":"img","alt":"w⊺i Aijwj","inline":true,"padRight":true},{"text":"for some matrices ","element":"span"},{"style":{"height":15.59},"width":58.92,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-7.png","element":"img","alt":" Aij","inline":true},{"text":". Weights are constrained to probability simplices. The ","element":"span"},{"text":"games have nice properties including: Nash equilibria are computed via a linear program and correlated equilibria marginalize onto Nash equilibria ","element":"span"},{"href":"#id-13","referenceIndex":12,"text":"(Cai et al., ","element":"a"},{"href":"#id-13","referenceIndex":12,"text":"2016)","element":"a"},{"text":".","element":"span"}],[{"text":"4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Intrinsic curiosity modules ","element":"span"},{"text":"use games to drive exploration. One module is rewarded for predicting the environment and an adversary is rewarded for choosing actions whose outcomes are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"predicted by the first module ","element":"span"},{"href":"#id-53","referenceIndex":38,"text":"(Pathak et al., ","element":"a"},{"href":"#id-53","referenceIndex":38,"text":"2017)","element":"a"},{"text":". The modules share some weights, so the setup is nearly, but not exactly, an SM-game.","element":"span"}],[{"text":"5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Adversarial training ","element":"span"},{"text":"is concerned with the minmax problem ","element":"span"},{"href":"#id-54","referenceIndex":25,"text":"(Kurakin et al., ","element":"a"},{"href":"#id-54","referenceIndex":25,"text":"2017; ","element":"a"},{"href":"#id-55","referenceIndex":29,"text":"Madry ","element":"a"},{"href":"#id-55","referenceIndex":29,"text":"et al., ","element":"a"},{"href":"#id-55","referenceIndex":29,"text":"2018)","element":"a"}],[{"style":{"width":"39%"},"width":629,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-8.png","element":"img"}],[{"text":"Setting ","element":"span"},{"style":{"height":19.2},"width":545.26,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-9.png","element":"img","alt":" g0i(w0, δi) = ℓ�fw0(xi + δi), yi�","inline":true},{"text":"obtains a star-shaped SM-game with the neural net (player 0) at the center and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"text":"adversaries – one per datapoint ","element":"span"},{"style":{"height":16},"width":119.46,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-10.png","element":"img","alt":" (xi, yi)","inline":true,"padRight":true},{"text":"– on the arms.","element":"span"}],[{"text":"6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Task-suites ","element":"span"},{"text":"where a population of agents are trained on a population of tasks, form a bipartite graph. If the tasks are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"parametrized ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"adversarially ","element":"span"},{"text":"rewarded based on their difficulty for agents, then the setup is an SM-game.","element":"span"}],[{"text":"7. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Homogeneous games ","element":"span"},{"text":"arise when all the coupling functions are equal up to sign (recall ","element":"span"},{"style":{"height":16.79},"width":216.34,"height":41.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-11.png","element":"img","alt":"gij = −gji)","inline":true},{"text":". An example is population self-play ","element":"span"},{"href":"#id-56","referenceIndex":42,"text":"(Silver et al., ","element":"a"},{"href":"#id-56","referenceIndex":42,"text":"2016; ","element":"a"},{"href":"#id-12","referenceIndex":49,"text":"Vinyals et al., ","element":"a"},{"href":"#id-12","referenceIndex":49,"text":"2019) ","element":"a"},{"text":"which lives on a graph where ","element":"span"},{"style":{"height":19.37},"width":585.94,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-12.png","element":"img","alt":" gij(wi, wj) := P(wi beats wj) − 12","inline":true,"padRight":true},{"text":"comes from the ","element":"span"},{"text":"probability that policy ","element":"span"},{"style":{"height":15.99},"width":208.07,"height":39.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-13.png","element":"img","alt":" wi beats wj.","inline":true}],[{"text":"Monetary exchanges in SM-games are quite general. The error signals traded between generators and discriminators and the wins and losses traded between agents in StarCraft are two very different special cases.","element":"span"}]]},{"heading":"4 FROM MICRO TO MACRO","paragraphs":[[{"text":"How to analyze the behavior of the market as a whole? Adam Smith claimed that profit-maximizing leads firms to promote the interests of society, as if by an invisible hand ","element":"span"},{"href":"#id-57","referenceIndex":43,"text":"(Smith, ","element":"a"},{"href":"#id-57","referenceIndex":43,"text":"1776)","element":"a"},{"text":". More formally, we can ask: Is there a measure that firms collectively increase or decrease? It is easy to see that firms do not collectively maximize aggregate profit (AP) or aggregate revenue (AR):","element":"span"}],[{"style":{"width":"80%"},"width":1277,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/5-14.png","element":"img"}],[{"style":{"width":"75%"},"width":1192,"height":259,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-0.png","element":"img"}],[{"text":"Figure 2: ","element":"figcaption","subtype":"caption"},{"style":{"fontWeight":"bold"},"text":"SM-game graph topologies. ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"A: ","element":"figcaption","subtype":"caption"},{"text":"two-player (e.g. GANs). ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"B: ","element":"figcaption","subtype":"caption"},{"text":"star-shaped (e.g. adversarial training). ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"C: ","element":"figcaption","subtype":"caption"},{"text":"bipartite (e.g. task-suites). ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"D: ","element":"figcaption","subtype":"caption"},{"text":"all-to-all.","element":"figcaption","subtype":"caption"}],[{"text":"Maximizing aggregate profit would require firms to ignore interactions with other firms. Maximizing aggregate revenue would require firms to ignore costs. In short, SM-games are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"potential games; there is no function that they optimize in general. However, it turns out the dynamics of SM-games aggregates the dynamics of individual firms, in a sense made precise in section ","element":"span"},{"href":"#id-58","text":"4.3.","element":"a"}],[{"text":"4.1 ","element":"span"},{"text":"R","element":"span"},{"text":"ATIONALITY","element":"span"},{"text":": S","element":"span"},{"text":"EEING LIKE A ","element":"span"},{"text":"F","element":"span"},{"text":"IRM","element":"span"}],[{"text":"Give an objective function to an agent. The ","element":"span"},{"style":{"fontWeight":"bold"},"text":"agent is rational","element":"span"},{"text":", relative to the objective, if it chooses actions because it forecasts they will lead to better outcomes as measured by the objective. In SM-games, agents are firms, the objective is profit, and forecasts are computed using gradients.","element":"span"}],[{"text":"Firms aim to increase their profit. Applying the first-order Taylor approximation obtains","element":"span"}],[{"style":{"width":"74%"},"width":1188,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-1.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"fontStyle":"italic"},"text":"{","element":"span"},{"style":{"fontStyle":"italic"},"text":"h.o.t.","element":"span"},{"style":{"fontStyle":"italic"},"text":"} ","element":"span"},{"text":"refers to higher-order terms. Firm ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"’s ","element":"span"},{"style":{"fontWeight":"bold"},"text":"forecast ","element":"span"},{"text":"of how profits will change if it modifies production by ","element":"span"},{"style":{"height":9.59},"width":35.19,"height":23.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-2.png","element":"img","alt":" vi","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":16.85},"width":331.16,"height":42.13,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-3.png","element":"img","alt":" fvi(w) := v⊺i ξi(w)","inline":true},{"text":". The Taylor expansion implies that ","element":"span"},{"style":{"height":16},"width":158.83,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-4.png","element":"img","alt":" fvi(w) ≈","inline":true},{"style":{"height":16},"width":336.5,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-5.png","element":"img","alt":"πi(w + vi) − πi(w)","inline":true,"padRight":true},{"text":"for small updates ","element":"span"},{"style":{"height":9.59},"width":35.19,"height":23.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-6.png","element":"img","alt":" vi","inline":true},{"text":". Forecasts encode how individual firms expect profits to change ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ceteris paribus","element":"span"},{"text":"2","element":"span"},{"text":".","element":"span"}],[{"text":"4.2 ","element":"span"},{"text":"P","element":"span"},{"text":"ROFIT CHANGES DO NOT ADD UP","element":"span"}],[{"text":"How does profit maximizing by individual firms look from the point of view of the market as a whole? Summing over all firms obtains","element":"span"}],[{"id":"id-59","style":{"width":"78%"},"width":1238,"height":155,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-7.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16.78},"width":421.19,"height":41.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-8.png","element":"img","alt":" fv(w) = �i fvi(w) is the","inline":true,"padRight":true},{"style":{"fontWeight":"bold"},"text":"aggregate forecast","element":"span"},{"text":". Unfortunately, the left-hand side of Eq. ","element":"span"},{"href":"#id-59","text":"(3) ","element":"a"},{"text":"is ","element":"span"},{"text":"incoherent. It sums the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"changes in profit that would be experienced by firms updating their production in isolation","element":"span"},{"text":". However, firms change their production simultaneously. Updates are ","element":"span"},{"style":{"fontWeight":"bold"},"text":"not ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ceteris paribus ","element":"span"},{"text":"and so profit is not a meaningful macroeconomic concept. The following minimal example illustrates the problem: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Example 4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose ","element":"span"},{"style":{"height":16},"width":264.75,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-9.png","element":"img","alt":" π1(w) = w1w2","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":16},"width":295.75,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-10.png","element":"img","alt":" π2(w) = −w1w2","inline":true},{"style":{"fontStyle":"italic"},"text":". Fix ","element":"span"},{"style":{"height":16},"width":244.25,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-11.png","element":"img","alt":" w = (w1, w2)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and let ","element":"span"},{"style":{"fontWeight":"bold"},"text":"v ","element":"span"},{"text":"= ","element":"span"},{"style":{"height":16},"width":173.02,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-12.png","element":"img","alt":"(w2, −w1)","inline":true},{"style":{"fontStyle":"italic"},"text":". The sum of the changes in profit expected by the firms, reasoning in isolation, is","element":"span"}],[{"style":{"width":"65%"},"width":1046,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-13.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"whereas the actual change in aggregate profit is zero because ","element":"span"},{"style":{"height":16},"width":481.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/6-14.png","element":"img","alt":" π1(x) + π2(x) = 0 for any x.","inline":true}],[{"text":"Tracking aggregate profits is therefore not useful. The next section shows forecasts are better behaved.","element":"span"}],[{"id":"id-58","text":"4.3 ","element":"span"},{"text":"L","element":"span"},{"text":"EGIBILITY","element":"span"},{"text":": S","element":"span"},{"text":"EEING LIKE AN ","element":"span"},{"text":"E","element":"span"},{"text":"CONOMY","element":"span"}],[{"text":"Give a target function to every agent in a collective. The ","element":"span"},{"style":{"fontWeight":"bold"},"text":"collective is legible","element":"span"},{"text":", relative to the targets, if it increases or decreases the aggregate target according to whether its members forecast, on aggregate, they will increase or decrease their targets. We show that SM-games are legible. The targets are profit forecasts (note: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"profits).","element":"span"}],[{"text":"Let us consider how forecasts change. Define the ","element":"span"},{"style":{"fontWeight":"bold"},"text":"sentiment ","element":"span"},{"text":"as the directional derivative of the forecast ","element":"span"},{"style":{"height":16.85},"width":421.2,"height":42.14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-0.png","element":"img","alt":" Dvifvi(w) = v⊺i ∇fvi(w)","inline":true},{"text":". The first-order Taylor expansion of the forecast shows that the ","element":"span"},{"text":"sentiment is a forecast about the profit forecast:","element":"span"}],[{"style":{"width":"74%"},"width":1183,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-1.png","element":"img"}],[{"text":"The perspective of firms can be summarized as:","element":"span"}],[{"style":{"width":"66%"},"width":1060,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-2.png","element":"img"}],[{"text":"a. If sentiment is positive then forecasts increase as the firm modifies its production – forecasts become more optimistic. The firm experiences ","element":"span"},{"style":{"fontWeight":"bold"},"text":"increasing returns-to-scale","element":"span"},{"text":".","element":"span"}],[{"text":"b. If sentiment is negative then forecasts decrease as the firm modifies its production – forecasts become more pessimistic. The firm experiences ","element":"span"},{"style":{"fontWeight":"bold"},"text":"diminishing returns-to-scale","element":"span"},{"text":".","element":"span"}],[{"text":"Our main result is that sentiment is additive, which means that forecasts are legible: ","element":"span"},{"id":"id-61","style":{"fontWeight":"bold"},"text":"Proposition 2 ","element":"span"},{"text":"(forecasts are legible in SM-games)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Sentiment is additive","element":"span"}],[{"style":{"width":"28%"},"width":456,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-3.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Thus, the aggregrate profit forecast ","element":"span"},{"style":{"height":15.2},"width":32,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-4.png","element":"img","alt":" fv","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"increases or decreases according to whether individual forecasts ","element":"span"},{"style":{"height":15.2},"width":43.05,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-5.png","element":"img","alt":" fvi","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"are expected to increase or decrease in aggregate.","element":"span"}],[{"text":"Section ","element":"span"},{"href":"#id-60","text":"5.1 ","element":"a"},{"text":"works through an example that is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"legible.","element":"span"}]]},{"heading":"5 DYNAMICS OF SMOOTH MARKETS","paragraphs":[[{"text":"Finally, we study the dynamics of gradient-based learners in SM-games. Suppose firms use gradient ascent. Firm ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":"’s updates are, infinitesimally, in the direction ","element":"span"},{"style":{"height":19.63},"width":640.58,"height":49.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-6.png","element":"img","alt":" vi = ξi(w) so that dwidt = ξi(w). Since","inline":true,"padRight":true},{"text":"updates are gradients, we can simplify our notation. Define firm ","element":"span"},{"style":{"height":19.37},"width":579.86,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-7.png","element":"img","alt":" i’s forecast as fi(w) := 12ξ⊺i ∇iπi =","inline":true}],[{"style":{"height":18.88},"width":478.96,"height":47.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-8.png","element":"img","alt":"2∥ξi(w)∥22 and its sentiment,","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"ceteris paribus","element":"span"},{"text":", as ","element":"span"},{"style":{"height":20.55},"width":292.35,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-9.png","element":"img","alt":" ξ⊺i ∇ifi = dfidt (w).","inline":true}],[{"text":"We allow firms to choose their learning rates; firms with higher learning rates are more responsive. Define the ","element":"span"},{"style":{"height":10.8},"width":24,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-10.png","element":"img","alt":" η","inline":true},{"text":"-weighted dynamics ","element":"span"},{"style":{"height":18.3},"width":555.33,"height":45.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-11.png","element":"img","alt":" ξη(w) := (η1ξ1, . . . , ηnξn) and η","inline":true},{"text":"-weighted forecast as","element":"span"}],[{"style":{"width":"42%"},"width":680,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-12.png","element":"img"}],[{"text":"In this setting, proposition ","element":"span"},{"href":"#id-61","text":"2 ","element":"a"},{"text":"implies that","element":"span"}],[{"id":"id-67","style":{"fontWeight":"bold"},"text":"Proposition 3 ","element":"span"},{"text":"(legibility under gradient dynamics)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix dynamics ","element":"span"},{"style":{"height":20.17},"width":221.03,"height":50.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-13.png","element":"img","alt":"dwdt := ξη(w)","inline":true},{"style":{"fontStyle":"italic"},"text":". Sentiment decom-","element":"span"}],[{"style":{"width":"59%"},"width":941,"height":127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-14.png","element":"img"}],[{"text":"Thus, we can read off the aggregate dynamics from the dynamics of forecasts of individual firms.","element":"span"}],[{"id":"id-60","text":"5.1 ","element":"span"},{"text":"E","element":"span"},{"text":"XAMPLE OF A FAILURE OF LEGIBILITY","element":"span"}],[{"text":"The pairwise zero-sum structure is crucial to legibility. It is instructive to take a closer look at example ","element":"span"},{"href":"#id-46","text":"1, ","element":"a"},{"text":"where the forecasts are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"not ","element":"span"},{"text":"legible.","element":"span"}],[{"text":"Suppose ","element":"span"},{"style":{"height":18.88},"width":371.01,"height":47.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-15.png","element":"img","alt":" π1(w) = w1w2 − ϵ2w21","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":18.88},"width":371.01,"height":47.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-16.png","element":"img","alt":" π2(w) = w1w2 − ϵ2w22","inline":true},{"text":". Then ","element":"span"},{"style":{"height":16},"width":503.89,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-17.png","element":"img","alt":" ξ(w) = (w2 − ϵw1, w1 − ϵw2)","inline":true,"padRight":true},{"text":"and the firms’ sentiments are ","element":"span"},{"style":{"height":20.55},"width":357.35,"height":51.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-18.png","element":"img","alt":"df1dt = −ϵ(w2 − ϵw1)2","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":20.55},"width":357.35,"height":51.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/7-19.png","element":"img","alt":" df2dt = −ϵ(w1 − ϵw2)2","inline":true,"padRight":true},{"text":"which are always","element":"span"}],[{"style":{"width":"88%"},"width":1398,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-0.png","element":"img"}],[{"text":"which for small ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-1.png","element":"img","alt":" ϵ","inline":true,"padRight":true},{"text":"is dominated by ","element":"span"},{"style":{"height":9.19},"width":90.94,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-2.png","element":"img","alt":" w1w2","inline":true},{"text":", and so can be either positive or negative.","element":"span"}],[{"text":"When ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w ","element":"span"},{"text":"= (1","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1) ","element":"span"},{"text":"we have ","element":"span"},{"style":{"height":20.55},"width":567.43,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-3.png","element":"img","alt":"dfdt = 1 − ϵ(6 − 5ϵ + 2ϵ2) ≈ 1 > 0","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":20.55},"width":484.98,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-4.png","element":"img","alt":" df1dt + df2dt = −2ϵ(1 − ϵ)2 < 0","inline":true},{"text":". ","element":"span"},{"text":"Each firm expects their forecasts to decrease, and yet the opposite happens due to a positive feedback loop that ultimately causes the dynamics to diverge to infinity.","element":"span"}],[{"text":"5.2 ","element":"span"},{"text":"S","element":"span"},{"text":"TABILITY","element":"span"},{"text":", C","element":"span"},{"text":"ONVERGENCE AND ","element":"span"},{"text":"B","element":"span"},{"text":"OUNDEDNESS","element":"span"}],[{"text":"We provide three fundamental results on the dynamics of smooth markets. Firstly, we show that stability, from dynamical systems, and local Nash equilibrium, from game theory, coincide in SM-games:","element":"span"}],[{"id":"id-65","style":{"fontWeight":"bold"},"text":"Theorem 4 ","element":"span"},{"text":"(stability)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A fixed point in an SM-game is a local Nash equilibrium iff it is stable. Thus, every local Nash equilibrium is contained in an open set that forms its basin of attraction.","element":"span"}],[{"text":"Secondly, we consider convergence. Lyapunov functions are tools for studying convergence. Given dynamical system ","element":"span"},{"style":{"height":19.37},"width":187.71,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-5.png","element":"img","alt":"dwdt = ξ(w)","inline":true,"padRight":true},{"text":"with fixed point ","element":"span"},{"style":{"height":10.98},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-6.png","element":"img","alt":" w∗","inline":true},{"text":", recall that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V ","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"text":"is a ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Lyapunov function ","element":"span"},{"text":"if: ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(i) ","element":"span"},{"style":{"height":16},"width":191.02,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-7.png","element":"img","alt":"V (w∗) = 0","inline":true},{"text":"; ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(ii) ","element":"span"},{"style":{"fontStyle":"italic"},"text":"V ","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"> ","element":"span"},{"text":"0 ","element":"span"},{"text":"for all ","element":"span"},{"style":{"height":15.2},"width":139.32,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-8.png","element":"img","alt":" w ̸= w∗","inline":true},{"text":"; and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(iii) ","element":"span"},{"style":{"height":19.37},"width":187.63,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-9.png","element":"img","alt":"dVdt (w) < 0","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"height":15.2},"width":139.32,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-10.png","element":"img","alt":" w ̸= w∗","inline":true},{"text":". If a dynamical ","element":"span"},{"text":"system has a Lyapunov function then the dynamics converge to the fixed point. Aggregate forecasts share properties ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(i) ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"(ii) ","element":"span"},{"text":"with Lyapunov functions.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"(i) Shared global minima: ","element":"span"},{"style":{"height":16.79},"width":684.17,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-11.png","element":"img","alt":" fη(w) = 0 iff fη′(w) = 0 for all η, η′ ≻ 0","inline":true},{"text":", which occurs iff ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w ","element":"span"},{"text":"is a stationary point, ","element":"span"},{"style":{"height":16},"width":315.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-12.png","element":"img","alt":" ξi(w) = 0 for all i.","inline":true}],[{"style":{"width":"79%"},"width":1260,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-13.png","element":"img"}],[{"text":"We can therefore use forecasts to study convergence and divergence ","element":"span"},{"style":{"fontStyle":"italic"},"text":"across all learning rates","element":"span"},{"text":": ","element":"span"},{"id":"id-71","style":{"fontWeight":"bold"},"text":"Theorem 5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In continuous time, for all positive learning rates ","element":"span"},{"style":{"height":14},"width":112.43,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-14.png","element":"img","alt":" η ≻ 0,","inline":true}],[{"style":{"fontStyle":"italic"},"text":"1. ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Convergence: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"If ","element":"span"},{"style":{"height":10.99},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-15.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a stable fixed point (","element":"span"},{"style":{"height":11.6},"width":106.57,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-16.png","element":"img","alt":"S ≺ 0","inline":true},{"style":{"fontStyle":"italic"},"text":"), then there is an open neighborhood ","element":"span"},{"style":{"height":11.78},"width":131.26,"height":29.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-17.png","element":"img","alt":"U ∋ w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"height":21.58},"width":191.33,"height":53.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-18.png","element":"img","alt":"dfηdt (w) < 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"height":16},"width":245.38,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-19.png","element":"img","alt":" w ∈ U \\ {w∗}","inline":true},{"style":{"fontStyle":"italic"},"text":", so the dynamics converge to ","element":"span"},{"style":{"height":10.98},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-20.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"from ","element":"span"},{"style":{"fontStyle":"italic"},"text":"anywhere in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"2. ","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"Divergence: ","element":"span"},{"style":{"fontStyle":"italic"},"text":"If ","element":"span"},{"style":{"height":10.98},"width":49.73,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-21.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is an unstable fixed point (","element":"span"},{"style":{"height":11.6},"width":116.38,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-22.png","element":"img","alt":"S ≻ 0","inline":true},{"style":{"fontStyle":"italic"},"text":"), there is an open neighborhood ","element":"span"},{"style":{"height":11.78},"width":143.03,"height":29.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-23.png","element":"img","alt":"U ∋ w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that ","element":"span"},{"style":{"height":21.58},"width":203.1,"height":53.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-24.png","element":"img","alt":"dfηdt (w) > 0","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"for all ","element":"span"},{"style":{"height":16},"width":261.86,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-25.png","element":"img","alt":" w ∈ U \\ {w∗}","inline":true},{"style":{"fontStyle":"italic"},"text":", so the dynamics within ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"style":{"fontStyle":"italic"},"text":"are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"repelled by ","element":"span"},{"style":{"height":11.38},"width":62.06,"height":28.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-26.png","element":"img","alt":" w∗.","inline":true}],[{"text":"The theorem explains why SM-games are robust to relative differences in learning rates – in contrast to the sensitivity exhibited by the game in example ","element":"span"},{"href":"#id-43","text":"2. ","element":"a"},{"text":"If a fixed point is stable, then for any dynamics","element":"span"}],[{"style":{"width":"99%"},"width":1585,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-27.png","element":"img"}],[{"text":"The aggregate forecasts provide a family of Lyapunov-like functions.","element":"span"}],[{"text":"Finally, we consider the setting where firms experience diminishing returns-to-scale for sufficiently large production vectors. The assumption is realistic for firms in a finite economy since revenues must eventually saturate whilst costs continue to increase with production.","element":"span"}],[{"id":"id-66","style":{"fontWeight":"bold"},"text":"Theorem 6 ","element":"span"},{"text":"(boundedness)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose all firms have negative sentiment for sufficiently large values of ","element":"span"},{"style":{"height":16},"width":86.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-28.png","element":"img","alt":"∥wi∥","inline":true},{"style":{"fontStyle":"italic"},"text":". Then the dynamics are bounded for all ","element":"span"},{"style":{"height":14},"width":111.43,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-29.png","element":"img","alt":" η ≻ 0.","inline":true}],[{"text":"The theorem implies that the kind of positive feedback loops that caused example ","element":"span"},{"href":"#id-46","text":"1 ","element":"a"},{"text":"to diverge to infinity, cannot occur in SM-games.","element":"span"}],[{"text":"5.3 ","element":"span"},{"text":"L","element":"span"},{"text":"EGIBILITY AND THE LANDSCAPE","element":"span"}],[{"text":"One of our themes is that legibility allows to read off the dynamics of games. We make the claim visually explicit in this section. Let us start with a concrete game. ","element":"span"},{"id":"id-63","style":{"fontWeight":"bold"},"text":"Example 5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider the SM-game with profits","element":"span"}],[{"style":{"width":"84%"},"width":1332,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/8-30.png","element":"img"}],[{"style":{"width":"95%"},"width":1520,"height":285,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-0.png","element":"img"}],[{"text":"Figure 3: ","element":"figcaption","subtype":"caption"},{"id":"id-62","style":{"fontWeight":"bold"},"text":"Legibile dynamics. ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Panels AB: ","element":"figcaption","subtype":"caption"},{"text":"Dynamics in an SM-game with both positive and negative sentiment, under different learning rates. ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"Panels CD: ","element":"figcaption","subtype":"caption"},{"text":"Cartoon maps of the dynamics.","element":"figcaption","subtype":"caption"}],[{"text":"Figure ","element":"span"},{"href":"#id-62","text":"3A","element":"a"},{"text":"B plots the dynamics of the SM-game in example ","element":"span"},{"href":"#id-63","text":"5, ","element":"a"},{"text":"under two different learning rates for player 1. There is an unstable fixed point at the origin and an ovoidal cycle. Dynamics converge to the cycle from both inside and outside the ovoid. Changing player 1’s learning rate, panel B, squashes the ovoid. Panels CD provide a cartoon map of the dynamics. There are two regions, the interior and exterior of the ovoid and the boundary formed by the ovoid itself.","element":"span"}],[{"text":"In general, the phase space of any SM-game is carved into regions where sentiment ","element":"span"},{"style":{"height":21.58},"width":117.44,"height":53.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-1.png","element":"img","alt":"dfηdt (w)","inline":true,"padRight":true},{"text":"is ","element":"span"},{"text":"positive and negative, with boundaries where sentiment is zero. The dynamics can be visualized as operating on a landscape where height at each point ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w ","element":"span"},{"text":"corresponds to the value of the aggregate forecast ","element":"span"},{"style":{"height":16.79},"width":100.52,"height":41.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-2.png","element":"img","alt":" fη(w)","inline":true},{"text":". The dynamics does not always ascend or always descend the landscape. Rather, sentiment determines whether the dynamics ascends, descends, or remains on a level-set. Since sentiment is additive, ","element":"span"},{"style":{"height":21.58},"width":397.25,"height":53.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-3.png","element":"img","alt":"dfηdt (w) = �i ηi · dfdt(w)","inline":true},{"text":", the decision to ascend or descend comes down to a ","element":"span"},{"text":"weighted sum of the sentiments of the firms.","element":"span"},{"href":"#id-64","referenceIndex":1,"text":"3 ","element":"a"},{"text":"Changing learning rates changes the emphasis given to different firms’ opinions, and thus changes the shapes of the boundaries between regions in a relatively straightforward manner.","element":"span"}],[{"text":"SM-games can thus express richer dynamics than potential games (cycles will not occur when performing gradient ascent on a fixed objective), which still admit a relatively simple visual description in terms of a landscape and decisions about which direction to go (upwards or downwards). Computing the landscape for general SM-games, as for neural nets, is intractable.","element":"span"}]]},{"heading":"6 DISCUSSION","paragraphs":[[{"text":"Machine learning has got a lot of mileage out of treating differentiable modules like plug-and-play lego blocks. This works when the modules optimize a single loss and the gradients chain together seamlessly. Unfortunately, agents with differing objectives are far from plug-and-play. Interacting agents form games, and games are intractable in general. Worse, positive feedback loops can cause individually well-behaved agents to collectively spiral out of control.","element":"span"}],[{"text":"It is therefore necessary to find organizing principles – constraints – on how agents interact that ensure their collective behavior is amenable to analysis and control. The pairwise zero-sum condition that underpins SM-games is one such organizing principle, which happens to admit an economic interpretation. Our main result is that SM-games are legible: changes in aggregate forecasts are the sum of how individual firms expect their forecasts to change. It follows that we can translate properties of the individual firms into guarantees on collective convergence, stability and boundedness in SM-games, see theorems ","element":"span"},{"href":"#id-65","text":"4-","element":"a"},{"href":"#id-66","text":"6.","element":"a"}],[{"text":"Legibility is a local-to-global principle, whereby we can draw qualitative conclusions about the behavior of collectives based on the nature of their individual members. Identifying and exploiting games that embed local-to-global principles will become increasingly important as artificial agents become more common.","element":"span"}]]},{"heading":"REFERENCES","paragraphs":[[{"id":"id-16","text":"Abernethy, J. and Frongillo, R. (2011). A Collaborative Mechanism for Crowdsourcing Prediction ","element":"span"},{"text":"Problems. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"NeurIPS","element":"span"},{"text":".","element":"span"}],[{"text":"3","element":"span"},{"id":"id-64","text":"Note: the dynamics do not necessarily follow the gradient of ","element":"span"},{"style":{"height":13.6},"width":58.37,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-4.png","element":"img","alt":" ±fη","inline":true},{"text":". Rather, they move in directions with","element":"span"}],[{"text":"positive or negative inner product with ","element":"span"},{"style":{"height":13.6},"width":60.42,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/9-5.png","element":"img","alt":" ∇fη","inline":true,"padRight":true},{"text":"according to sentiment.","element":"span"}],[{"id":"id-37","text":"Abernethy, J., Lai, K. A., and Wibisono, A. (2019). Last-iterate convergence rates for min-max ","element":"span"},{"text":"optimization. In ","element":"span"},{"href":"http://arxiv.org/abs/1906.02027","style":{"fontStyle":"italic"},"text":"arXiv:1906.02027","element":"a"},{"text":".","element":"span"}],[{"id":"id-7","text":"Babichenko, Y. (2016). Query Complexity of Approximate Nash Equilibria. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal ACM","element":"span"},{"text":", 63(4).","element":"span"}],[{"id":"id-42","text":"Bailey, J. P., Gidel, G., and Piliouras, G. (2019). Finite Regret and Cycles with Fixed Step-Size via ","element":"span"},{"text":"Alternating Gradient Descent-Ascent. In ","element":"span"},{"href":"http://arxiv.org/abs/1907.04392","style":{"fontStyle":"italic"},"text":"arXiv:1907.04392","element":"a"},{"text":".","element":"span"}],[{"id":"id-38","text":"Bailey, J. P. and Piliouras, G. (2018). Multiplicative Weights Update in Zero-Sum Games. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ACM EC","element":"span"},{"text":".","element":"span"}],[{"id":"id-17","text":"Balduzzi, D. (2014). Cortical prediction markets. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"AAMAS","element":"span"},{"text":".","element":"span"}],[{"id":"id-32","text":"Balduzzi, D., Racani","element":"span"},{"text":"`ere, S., Martens, J., Foerster, J., Tuyls, K., and Graepel, T. (2018). The mechanics of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n","element":"span"},{"text":"-player differentiable games. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICML","element":"span"},{"text":".","element":"span"}],[{"id":"id-18","text":"Barto, A. G., Sutton, R. S., and Anderson, C. W. (1983). Neuronlike Adapative Elements That Can ","element":"span"},{"text":"Solve Difficult Learning Control Problems. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Trans. Systems, Man, Cyb","element":"span"},{"text":", 13(5):834–846.","element":"span"}],[{"id":"id-19","text":"Baum, E. B. (1999). Toward a Model of Intelligence as an Economy of Agents. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Machine Learning","element":"span"},{"text":", 35(155-185).","element":"span"}],[{"text":"Berard, H., Gidel, G., Almahairi, A., Vincent, P., and Lacoste-Julien, S. (2019). A Closer Look at the ","element":"span"},{"text":"Optimization Landscapes of Generative Adversarial Networks. In ","element":"span"},{"href":"http://arxiv.org/abs/1906.04848","style":{"fontStyle":"italic"},"text":"arXiv:1906.04848","element":"a"},{"text":".","element":"span"}],[{"id":"id-13","text":"Cai, Y., Candogan, O., Daskalakis, C., and Papadimitriou, C. (2016). Zero-sum Polymatrix Games: ","element":"span"},{"text":"A Generalization of Minmax. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mathematics of Operations Research","element":"span"},{"text":", 41(2):648–655.","element":"span"}],[{"id":"id-8","text":"Daskalakis, C., Goldberg, P. W., and Papadimitriou, C. (2009). The Complexity of Computing a ","element":"span"},{"text":"Nash Equilibrium. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"SIAM J. Computing","element":"span"},{"text":", 39(1):195–259.","element":"span"}],[{"id":"id-15","text":"Drexler, K. E. (2019). Reframing Superintelligence: Comprehensive AI Services as General Intelli- ","element":"span"},{"text":"gence. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Future of Humanity Institute, University of Oxford, Technical Report #2019-1","element":"span"},{"text":".","element":"span"}],[{"id":"id-39","text":"Gemp, I. and Mahadevan, S. (2017). Online Monotone Games. In ","element":"span"},{"href":"http://arxiv.org/abs/1710.07328","style":{"fontStyle":"italic"},"text":"arXiv:1710.07328","element":"a"},{"text":".","element":"span"}],[{"id":"id-33","text":"Gemp, I. and Mahadevan, S. (2018). Global Convergence to the Equilibrium of GANs using ","element":"span"},{"text":"Variational Inequalities. In ","element":"span"},{"href":"http://arxiv.org/abs/1808.01531","style":{"fontStyle":"italic"},"text":"arXiv:1808.01531","element":"a"},{"text":".","element":"span"}],[{"id":"id-34","text":"Gidel, G., Hemmat, R. A., Pezeshki, M., Lepriol, R., Huang, G., Lacoste-Julien, S., and Mitliagkas, I. ","element":"span"},{"text":"(2019). Negative Momentum for Improved Game Dynamics. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"AISTATS","element":"span"},{"text":".","element":"span"}],[{"id":"id-11","text":"Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., ","element":"span"},{"text":"and Bengio, Y. (2014). Generative Adversarial Nets. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"NeurIPS","element":"span"},{"text":".","element":"span"}],[{"id":"id-9","text":"Hart, S. and Mas-Colell, A. (2003). Uncoupled Dynamics Do Not Lead to Nash Equilibrium. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"American Economic Review","element":"span"},{"text":", 93(5):1830–1836.","element":"span"}],[{"id":"id-20","text":"Hu, J. and Storkey, A. (2014). Multi-period Trading Prediction Markets with Connections to Machine ","element":"span"},{"text":"Learning. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICML","element":"span"},{"text":".","element":"span"}],[{"id":"id-21","text":"Kakade, S., Kearns, M., and Ortiz, L. (2003). Graphical economics. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"COLT","element":"span"},{"text":".","element":"span"}],[{"id":"id-22","text":"Kakade, S., Kearns, M., Ortiz, L., Pemantle, R., and Suri, S. (2005). Economic properties of social ","element":"span"},{"text":"networks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"NeurIPS","element":"span"},{"text":".","element":"span"}],[{"id":"id-23","text":"Kearns, M., Littman, M., and Singh, S. (2001). Graphical models for game theory. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"UAI","element":"span"},{"text":".","element":"span"}],[{"id":"id-50","text":"Kingma, D. P. and Ba, J. L. (2015). Adam: A method for stochastic optimization. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICLR","element":"span"},{"text":".","element":"span"}],[{"id":"id-54","text":"Kurakin, A., Goodfellow, I., and Bengio, S. (2017). Adversarial Machine Learning at Scale. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICLR","element":"span"},{"text":".","element":"span"}],[{"id":"id-24","text":"Kwee, I., Hutter, M., and Schmidhuber, J. (2001). Market-based reinforcement learning in partially ","element":"span"},{"text":"observable worlds. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICANN","element":"span"},{"text":".","element":"span"}],[{"id":"id-25","text":"Lay, N. and Barbu, A. (2010). Supervised aggregation of classifiers using artificial prediction markets. ","element":"span"},{"text":"In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICML","element":"span"},{"text":".","element":"span"}],[{"id":"id-45","text":"Letcher, A., Balduzzi, D., Racani","element":"span"},{"text":"`ere, S., Martens, J., Foerster, J., Tuyls, K., and Graepel, T. (2019). Differentiable Game Mechanics. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"JMLR","element":"span"},{"text":", 20:1–40.","element":"span"}],[{"id":"id-55","text":"Madry, A., Makelov, A., Schmidt, L., Tsipras, D., and Vladu, A. (2018). Towards Deep Learning ","element":"span"},{"text":"Models Resistant to Adversarial Attacks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICLR","element":"span"},{"text":".","element":"span"}],[{"id":"id-35","text":"Mescheder, L. (2018). On the convergence properties of GAN training. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ArXiv:1801:04406","element":"span"},{"text":".","element":"span"}],[{"id":"id-36","text":"Mescheder, L., Nowozin, S., and Geiger, A. (2017). The Numerics of GANs. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"NeurIPS","element":"span"},{"text":".","element":"span"}],[{"id":"id-26","text":"Minsky, M. (1986). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The society of mind","element":"span"},{"text":". Simon and Schuster, New York NY.","element":"span"}],[{"id":"id-3","text":"Monderer, D. and Shapley, L. S. (1996). Potential Games. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Games and Economic Behavior","element":"span"},{"text":", 14:124– 143.","element":"span"}],[{"id":"id-40","text":"Nemirovski, A., Onn, S., and Rothblum, U. G. (2010). Accuracy certificates for computational ","element":"span"},{"text":"problems with convex structure. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mathematics of Operations Research","element":"span"},{"text":", 35(1).","element":"span"}],[{"id":"id-4","text":"Nisan, N., Roughgarden, T., Tardos, ","element":"span"},{"text":"´","element":"span"},{"text":"E., and Vazirani, V., editors (2007). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Algorithmic Game Theory","element":"span"},{"text":". Cambridge University Press, Cambridge.","element":"span"}],[{"id":"id-10","text":"Palaiopanos, G., Panageas, I., and Piliouras, G. (2017). Multiplicative Weights Update with Constant ","element":"span"},{"text":"Step-Size in Congestion Games: Convergence, Limit Cycles and Chaos. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"NeurIPS","element":"span"},{"text":".","element":"span"}],[{"id":"id-0","text":"Parkes, D. C. and Wellman, M. P. (2015). Economic reasoning and artificial intelligence. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Science","element":"span"},{"text":", 349(6245):267–272.","element":"span"}],[{"id":"id-53","text":"Pathak, D., Agrawal, P., Efros, A. A., and Darrell, T. (2017). Curiosity-driven Exploration by ","element":"span"},{"text":"Self-supervised Prediction. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICML","element":"span"},{"text":".","element":"span"}],[{"id":"id-1","text":"Rahwan, I., Cebrian, M., Obradovich, N., Bongard, J., Bonnefon, J.-F., Breazeal, C., Crandall, J. W., ","element":"span"},{"text":"Christakis, N. A., Couzin, I. D., Jackson, M. O., Jennings, N. R., Kamar, E., Kloumann, I. M., Larochelle, H., Lazer, D., Mcelreath, R., Mislove, A., Parkes, D. C., Pentland, A. S., Roberts, M. E., Shariff, A., Tenenbaum, J. B., and Wellman, M. (2019). Machine behaviour. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Nature","element":"span"},{"text":", 568:477–486.","element":"span"}],[{"text":"Scott, J. (1999). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Seeing Like a State: How Certain Schemes to Improve the Human Condition Have Failed","element":"span"},{"text":". Yale University Press.","element":"span"}],[{"id":"id-27","text":"Selfridge, O. G. (1958). Pandemonium: a paradigm for learning. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mechanisation of Thought Processes: Proc Symposium Held at the National Physics Laboratory","element":"span"},{"text":".","element":"span"}],[{"id":"id-56","text":"Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., ","element":"span"},{"text":"Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T. P., Leach, M., Kavukcuoglu, K., Graepel, T., and Hassabis, D. (2016). Mastering the game of Go with deep neural networks and tree search. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Nature","element":"span"},{"text":", 529(7587):484–489.","element":"span"}],[{"id":"id-57","text":"Smith, A. (1776). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The Wealth of Nations","element":"span"},{"text":". W. Strahan and T. Cadell, London.","element":"span"}],[{"id":"id-28","text":"Storkey, A. (2011). Machine Learning Markets. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"AISTATS","element":"span"},{"text":".","element":"span"}],[{"id":"id-29","text":"Storkey, A., Millin, J., and Geras, K. (2012). Isoelastic Agents and Wealth Udates in Machine ","element":"span"},{"text":"Learning Markets. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"ICML","element":"span"},{"text":".","element":"span"}],[{"id":"id-30","text":"Sutton, R., Modayil, J., Delp, M., Degris, T., Pilarski, P. M., White, A., and Precup, D. (2011). Horde: ","element":"span"},{"text":"A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Motor Interaction. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"AAMAS","element":"span"},{"text":".","element":"span"}],[{"id":"id-41","text":"Tatarenko, T. and Kamgarpour, M. (2019). Learning Generalized Nash Equilibria in a Class of ","element":"span"},{"text":"Convex Games. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE Transactions on Automatic Control","element":"span"},{"text":", 64(4):1426–1439.","element":"span"}],[{"id":"id-5","text":"Vickrey, W. (1961). Counterspeculation, Auctions and Competitive Sealed Tenders. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J Finance","element":"span"},{"text":", 16:8–37.","element":"span"}],[{"id":"id-12","text":"Vinyals, O., Babuschkin, I., Chung, J., Mathieu, M., Jaderberg, M., Czarnecki, W. M., Dudzik, ","element":"span"},{"text":"A., Huang, A., Georgiev, P., Powell, R., Ewalds, T., Horgan, D., Kroiss, M., Danihelka, I., Agapiou, J., Oh, J., Dalibard, V., Choi, D., Sifre, L., Sulsky, Y., Vezhnevets, S., Molloy, J., Cai, T., Budden, D., Paine, T., Gulcehre, C., Wang, Z., Pfaff, T., Pohlen, T., Wu, Y., Yogatama, D., Cohen, J., McKinney, K., Smith, O., Schaul, T., Lillicrap, T., Apps, C., Kavukcuoglu, K., Hassabis, D., and Silver, D. (2019). AlphaStar: Mastering the Real-Time Strategy Game StarCraft II. ","element":"span"},{"text":"https://deepmind.com/blog/alphastar-mastering -real-time-strategy-game-starcraft-ii/","element":"span"},{"text":".","element":"span"}],[{"id":"id-2","text":"von Neumann, J. (1928). Zur Theorie der Gesellschaftsspiele. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Mathematische Annalen","element":"span"},{"text":", 100(1):295– 320.","element":"span"}],[{"id":"id-6","text":"von Neumann, J. and Morgenstern, O. (1944). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Theory of Games and Economic Behavior","element":"span"},{"text":". Princeton University Press, Princeton NJ.","element":"span"}],[{"id":"id-31","text":"Wellman, M. P. and Wurman, P. R. (1998). Market-aware agents for a multiagent world. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Robotics and Autonomous Systems","element":"span"},{"text":", 24:115–125.","element":"span"}],[{"id":"id-51","text":"Wu, Y., Donahue, J., Balduzzi, D., Simonyan, K., and Lillicrap, T. (2019). ","element":"span"},{"text":"LOGAN: Latent Optimisation for Generative Adversarial Networks. In ","element":"span"},{"href":"http://arxiv.org/abs/1912.00953","style":{"fontStyle":"italic"},"text":"arXiv:1912.00953","element":"a"},{"text":".","element":"span"}],[{"id":"id-52","text":"Zhu, J.-Y., Park, T., Isola, P., and Efros, A. A. (2017). Unpaired Image-to-Image Translation using ","element":"span"},{"text":"Cycle-Consistent Adversarial Networks. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"CVPR","element":"span"},{"text":".","element":"span"}]]},{"heading":"APPENDIX","paragraphs":[[{"style":{"width":"51%"},"width":812,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-0.png","element":"img"}],[{"text":"This section provides a physics-inspired perspective on smooth markets. Consider a dynamical system with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"n ","element":"span"},{"text":"particles moving according to the differential equations:","element":"span"}],[{"style":{"width":"25%"},"width":406,"height":270,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-1.png","element":"img"}],[{"text":"The kinetic energy of a particle is mass times velocity squared, ","element":"span"},{"style":{"height":13.39},"width":71.74,"height":33.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-2.png","element":"img","alt":" mv2","inline":true},{"text":", or in our case","element":"span"}],[{"style":{"width":"47%"},"width":749,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-3.png","element":"img"}],[{"text":"where we interpret the learning rate squared ","element":"span"},{"style":{"height":17.53},"width":37.22,"height":43.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-4.png","element":"img","alt":" η2i","inline":true,"padRight":true},{"text":"of particle ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"as its mass and ","element":"span"},{"style":{"height":14.7},"width":32.46,"height":36.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-5.png","element":"img","alt":" ξi","inline":true,"padRight":true},{"text":"as its velocity. The ","element":"span"},{"text":"total energy of the system is the sum over the kinetic energies of the particles:","element":"span"}],[{"style":{"width":"39%"},"width":627,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-6.png","element":"img"}],[{"text":"For example, in a Hamiltonian game we have that energy is conserved:","element":"span"}],[{"style":{"width":"50%"},"width":801,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-7.png","element":"img"}],[{"text":"since ","element":"span"},{"style":{"height":17.39},"width":455.38,"height":43.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-8.png","element":"img","alt":" ξ⊺ · ∇∥ξ∥2 = ξ⊺ · A⊺ξ = 0","inline":true},{"text":", see ","element":"span"},{"href":"#id-32","referenceIndex":8,"text":"Balduzzi et al. ","element":"a"},{"href":"#id-32","referenceIndex":8,"text":"(2018)","element":"a"},{"text":"; ","element":"span"},{"href":"#id-45","referenceIndex":28,"text":"Letcher et al. ","element":"a"},{"href":"#id-45","referenceIndex":28,"text":"(2019) ","element":"a"},{"text":"for details.","element":"span"}],[{"text":"Energy is measured in joules (","element":"span"},{"style":{"height":16.58},"width":198.11,"height":41.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-9.png","element":"img","alt":"kg · m · s−2","inline":true},{"text":"). The rate of change of energy with respect to time is ","element":"span"},{"style":{"fontStyle":"italic"},"text":"power","element":"span"},{"text":", measured in joules per second or watts (","element":"span"},{"style":{"height":16.58},"width":194.66,"height":41.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/12-10.png","element":"img","alt":"kg · m · s−3","inline":true},{"text":"). Conservation of energy means that a (closed) Hamiltonian system, in aggregate, generates no power. The existence of an invariant function makes Hamiltonian systems easy to reason about in many ways.","element":"span"}],[{"text":"Smooth markets are more general than Hamiltonian games in that total energy is not necessarily conserved. Nevertheless, they are much more constrained than general dynamical systems. Legibility, proposition ","element":"span"},{"href":"#id-67","text":"3, ","element":"a"},{"text":"says that the total power (total rate of energy generation) in smooth markets is the sum of the power (rate of energy generation) of the individual particles:","element":"span"}],[{"style":{"width":"22%"},"width":352,"height":102,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-0.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Example where legibility fails. ","element":"span"},{"text":"Once again, it is instructive to look at a concrete example where legibility fails. Recall the potential game in example ","element":"span"},{"href":"#id-46","text":"1 ","element":"a"},{"text":"with profits","element":"span"}],[{"style":{"width":"56%"},"width":898,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-1.png","element":"img"}],[{"text":"and sentiments","element":"span"}],[{"style":{"width":"63%"},"width":1007,"height":83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-2.png","element":"img"}],[{"text":"Physically, the negative sentiments ","element":"span"},{"style":{"height":20.55},"width":324.3,"height":51.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-3.png","element":"img","alt":"df1dt < 0 and df2dt < 0","inline":true,"padRight":true},{"text":"mean that that each “particle” in the system, ","element":"span"},{"text":"considered in isolation, is always dissipating energy. Nevertheless as shown in section ","element":"span"},{"href":"#id-60","text":"5.1 ","element":"a"},{"text":"the system as a whole has","element":"span"}],[{"style":{"width":"76%"},"width":1211,"height":84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-4.png","element":"img"}],[{"text":"which is positive for some values of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":". Thus, the system as a whole can generate energy through interaction effects between the (dissipative) particles.","element":"span"}],[{"style":{"width":"15%"},"width":240,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-5.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof of lemma ","element":"span"},{"href":"#id-68","style":{"fontWeight":"bold"},"text":"1.","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Every continuous dynamical system on ","element":"span"},{"style":{"height":13.38},"width":45.78,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-6.png","element":"img","alt":" Rd","inline":true},{"style":{"fontStyle":"italic"},"text":", for any ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d","element":"span"},{"style":{"fontStyle":"italic"},"text":", arises as simultaneous gradient ascent on the profit functions of a smooth game.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Specifically, we mean that every dynamical system of the form ","element":"span"},{"style":{"height":19.37},"width":187.68,"height":48.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-7.png","element":"img","alt":"dwdt = ξ(w)","inline":true,"padRight":true},{"text":"arises as simulta- ","element":"span"},{"text":"neous gradient ascent on the profits of a smooth game.","element":"span"}],[{"text":"Given continuous vector field ","element":"span"},{"style":{"height":14},"width":20,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-8.png","element":"img","alt":" ξ","inline":true,"padRight":true},{"text":"on ","element":"span"},{"style":{"height":13.38},"width":45.78,"height":33.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-9.png","element":"img","alt":" Rd","inline":true},{"text":", we need to construct a smooth game with dynamics given by ","element":"span"},{"style":{"height":14},"width":20,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-10.png","element":"img","alt":" ξ","inline":true},{"text":". To that end, consider a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d","element":"span"},{"text":"-player game where player ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"controls coordinate ","element":"span"},{"style":{"height":9.19},"width":39.52,"height":22.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-11.png","element":"img","alt":" wi","inline":true},{"text":". Set the profit of player ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"to","element":"span"}],[{"style":{"width":"99%"},"width":1581,"height":164,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-12.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof of proposition ","element":"span"},{"href":"#id-61","style":{"fontWeight":"bold"},"text":"2. ","element":"a"},{"text":"Before proving proposition ","element":"span"},{"href":"#id-61","text":"2, ","element":"a"},{"text":"we first prove a lemma.","element":"span"}],[{"id":"id-70","style":{"fontWeight":"bold"},"text":"Lemma 7 ","element":"span"},{"text":"(generalized Helmholtz decomposition)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The Jacobian decomposes into ","element":"span"},{"style":{"fontWeight":"bold"},"text":"J","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") = ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") + ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"where ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"style":{"fontStyle":"italic"},"text":"are symmetric and antisymmetric, respectively, for all ","element":"span"},{"style":{"height":14.18},"width":139.8,"height":35.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-13.png","element":"img","alt":" w ∈ Rd.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Follows immediately. See ","element":"span"},{"href":"#id-45","referenceIndex":28,"text":"Letcher et al. ","element":"a"},{"href":"#id-45","referenceIndex":28,"text":"(2019) ","element":"a"},{"text":"for details and explanation.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Sentiment is additive: ","element":"span"},{"style":{"height":16.78},"width":456.94,"height":41.95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-14.png","element":"img","alt":" Dvfv(w) = �i Dvifvi(w).","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"For any collection of updates ","element":"span"},{"style":{"height":16.15},"width":120.19,"height":40.37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-15.png","element":"img","alt":" (vi)ni=1","inline":true},{"text":", we need to show that","element":"span"}],[{"style":{"width":"34%"},"width":550,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-16.png","element":"img"}],[{"text":"Direct computation obtains ","element":"span"},{"style":{"height":17.19},"width":1160.36,"height":42.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/13-17.png","element":"img","alt":" v⊺∇fv(w) = v⊺Jv = v⊺Sv+v⊺Av = �i v⊺Siiv = �i v⊺i ·∇fvi(w)","inline":true,"padRight":true},{"text":"because ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A ","element":"span"},{"text":"is antisymmetric and ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is block-diagonal.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof of proposition ","element":"span"},{"href":"#id-67","style":{"fontWeight":"bold"},"text":"3. ","element":"a"},{"text":"First we prove a lemma.","element":"span"}],[{"id":"id-69","style":{"fontWeight":"bold"},"text":"Lemma 8. ","element":"span"},{"style":{"height":19.72},"width":669.51,"height":49.31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-0.png","element":"img","alt":" ξ⊺η(w) · ∇fη(w) = �ni=1 η2i · ξ⊺i ∇fi(w).","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Observe by direct computation that","element":"span"}],[{"style":{"width":"58%"},"width":926,"height":267,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-1.png","element":"img"}],[{"text":"It is then easy to see that ","element":"span"},{"style":{"height":18.25},"width":812.92,"height":45.63,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-2.png","element":"img","alt":" ∇fη = �i ηi · ∇fi = �i ηi · J⊺ξi = J⊺ξη. Thus,","inline":true}],[{"style":{"width":"46%"},"width":737,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-3.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":10.8},"width":249.53,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-4.png","element":"img","alt":" S = S⊺ since S","inline":true,"padRight":true},{"text":"is symmetric. By antisymmetry of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A","element":"span"},{"text":", we have that ","element":"span"},{"style":{"height":10.8},"width":417.99,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-5.png","element":"img","alt":" v⊺A⊺v = 0 for all v. The","inline":true,"padRight":true},{"text":"expression thus simplifies to","element":"span"}],[{"style":{"width":"66%"},"width":1061,"height":110,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-6.png","element":"img"}],[{"text":"by the block-diagonal structure of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S","element":"span"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 3 ","element":"span"},{"text":"(legibility under gradient dynamics)","element":"span"},{"style":{"fontWeight":"bold"},"text":". ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Fix dynamics ","element":"span"},{"style":{"height":20.17},"width":221.03,"height":50.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-7.png","element":"img","alt":"dwdt := ξη(w)","inline":true},{"style":{"fontStyle":"italic"},"text":". Sentiment decom-","element":"span"}],[{"style":{"width":"59%"},"width":941,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-8.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Applying the chain rule obtains that","element":"span"}],[{"style":{"width":"43%"},"width":686,"height":96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-9.png","element":"img"}],[{"text":"where the second equality follows by construction of the dynamical system as ","element":"span"},{"href":"#id-69","style":{"height":20.17},"width":377.94,"height":50.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-10.png","element":"img","alt":"dwdt = ξη(w). Lemma 8","inline":true,"padRight":true},{"text":"shows that","element":"span"}],[{"style":{"width":"74%"},"width":1188,"height":676,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/14-11.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"as required.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof of theorem ","element":"span"},{"href":"#id-65","style":{"fontWeight":"bold"},"text":"4.","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A fixed point in an SM-game is a local Nash equilibrium iff it is stable.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Suppose that ","element":"span"},{"style":{"height":10.98},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-0.png","element":"img","alt":" w∗ ","inline":true,"padRight":true},{"text":"is a fixed point of the game, that is suppose ","element":"span"},{"style":{"height":16},"width":190.56,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-1.png","element":"img","alt":" ξ(w∗) = 0.","inline":true}],[{"text":"Recall from lemma ","element":"span"},{"href":"#id-70","text":"7 ","element":"a"},{"text":"that the Jacobian of ","element":"span"},{"style":{"height":14},"width":20,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-2.png","element":"img","alt":" ξ","inline":true,"padRight":true},{"text":"decomposes uniquely into two components ","element":"span"},{"style":{"fontWeight":"bold"},"text":"J","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") = ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") + ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A","element":"span"},{"text":"(","element":"span"},{"style":{"fontWeight":"bold"},"text":"w","element":"span"},{"text":") ","element":"span"},{"text":"where ","element":"span"},{"style":{"height":10.8},"width":132.39,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-3.png","element":"img","alt":" S ≡ S⊺","inline":true,"padRight":true},{"text":"is symmetric and ","element":"span"},{"style":{"height":12},"width":225.64,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-4.png","element":"img","alt":" A + A⊺ ≡ 0","inline":true,"padRight":true},{"text":"is antisymmetric. It follows that ","element":"span"},{"style":{"height":12},"width":560.25,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-5.png","element":"img","alt":"v⊺Jv = v⊺Sv + v⊺Av = v⊺Sv","inline":true,"padRight":true},{"text":"since ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A ","element":"span"},{"text":"is antisymmetric. Thus, ","element":"span"},{"style":{"height":10.99},"width":49.74,"height":27.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-6.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"text":"is a stable fixed point iff ","element":"span"},{"style":{"height":16},"width":184.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-7.png","element":"img","alt":"S(w∗) ≻ 0","inline":true,"padRight":true},{"text":"is negative definite.","element":"span"}],[{"text":"In an SM-game, the antisymmetric component is arbitrary and the symmetric component is block diagonal – where blocks correspond to players’ parameters. That is, ","element":"span"},{"style":{"height":15.59},"width":129.66,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-8.png","element":"img","alt":" Sij = 0","inline":true,"padRight":true},{"text":"for ","element":"span"},{"style":{"height":15.2},"width":83.86,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-9.png","element":"img","alt":" i ̸= j","inline":true,"padRight":true},{"text":"because the interactions between players ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j ","element":"span"},{"text":"are pairwise zero-sum – and are therefore necessarily confined to the antisymmetric component of the Jacobian. Since ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is block-diagonal, it follows that ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is negative definite iff the submatrices ","element":"span"},{"style":{"height":13.19},"width":47.73,"height":32.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-10.png","element":"img","alt":" Sii","inline":true,"padRight":true},{"text":"along the diagonal are negative definite for all players ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":".","element":"span"}],[{"text":"Finally, ","element":"span"},{"style":{"height":17.53},"width":363.58,"height":43.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-11.png","element":"img","alt":" Sii(w∗) = ∇2iiπi(w∗)","inline":true,"padRight":true},{"text":"is negative definite iff profit ","element":"span"},{"style":{"height":16},"width":119.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-12.png","element":"img","alt":" πi(w∗)","inline":true,"padRight":true},{"text":"is strictly concave in the parame- ","element":"span"},{"text":"ters controlled by player ","element":"span"},{"style":{"height":11.38},"width":112.16,"height":28.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-13.png","element":"img","alt":" i at w∗","inline":true},{"text":". The result follows.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof of theorem ","element":"span"},{"href":"#id-71","style":{"fontWeight":"bold"},"text":"5.","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 5. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"In continuous time, for all positive learning rates ","element":"span"},{"style":{"height":14},"width":112.43,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-14.png","element":"img","alt":" η ≻ 0,","inline":true}],[{"style":{"fontStyle":"italic"},"text":"1. If ","element":"span"},{"style":{"height":10.98},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-15.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is a stable fixed point (","element":"span"},{"style":{"height":11.6},"width":103.81,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-16.png","element":"img","alt":"S ≺ 0","inline":true},{"style":{"fontStyle":"italic"},"text":"), then there is an open neighborhood ","element":"span"},{"style":{"height":11.78},"width":132.21,"height":29.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-17.png","element":"img","alt":" U ∋ w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"where","element":"span"},{"style":{"height":21.58},"width":552.08,"height":53.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-18.png","element":"img","alt":"dfηdt (w) < 0 for all w ∈ U \\ {w∗}","inline":true},{"style":{"fontStyle":"italic"},"text":", so the dynamics converge to ","element":"span"},{"style":{"height":10.99},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-19.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"from anywhere in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"2. If ","element":"span"},{"style":{"height":10.98},"width":49.74,"height":27.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-20.png","element":"img","alt":" w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is an unstable fixed point (","element":"span"},{"style":{"height":11.6},"width":101.65,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-21.png","element":"img","alt":"S ≻ 0","inline":true},{"style":{"fontStyle":"italic"},"text":"), there is an open neighborhood ","element":"span"},{"style":{"height":11.78},"width":130.05,"height":29.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-22.png","element":"img","alt":" U ∋ w∗","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"such that","element":"span"},{"style":{"height":21.58},"width":552.08,"height":53.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-23.png","element":"img","alt":"dfηdt (w) > 0 for all w ∈ U \\ {w∗}","inline":true},{"style":{"fontStyle":"italic"},"text":", so the dynamics within ","element":"span"},{"style":{"fontStyle":"italic"},"text":"U ","element":"span"},{"style":{"fontStyle":"italic"},"text":"are repelled by ","element":"span"},{"style":{"height":11.38},"width":62.06,"height":28.46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-24.png","element":"img","alt":" w∗.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"We prove the first part. The second follows by a symmetric argument. First, strict concavity implies ","element":"span"},{"style":{"height":17.53},"width":194.62,"height":43.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-25.png","element":"img","alt":" Sii = ∇2iiπi","inline":true,"padRight":true},{"text":"is negative definite for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":". Second, since ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is block-diagonal, with zeros in all ","element":"span"},{"text":"blocks ","element":"span"},{"style":{"height":15.59},"width":49.73,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-26.png","element":"img","alt":" Sij","inline":true,"padRight":true},{"text":"for pairs of players ","element":"span"},{"style":{"height":15.2},"width":83.86,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-27.png","element":"img","alt":" i ̸= j","inline":true},{"text":", it follows that ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is also negative definite. Observe that","element":"span"}],[{"style":{"width":"61%"},"width":979,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-28.png","element":"img"}],[{"text":"for all ","element":"span"},{"style":{"height":17.9},"width":119.9,"height":44.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-29.png","element":"img","alt":" ξη ̸= 0","inline":true,"padRight":true},{"text":"since ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"is negative definite. Thus, simultaneous gradient ascent on the profits acts to infinitesimally reduce the function ","element":"span"},{"style":{"height":16.79},"width":110.01,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-30.png","element":"img","alt":" fη(w).","inline":true}],[{"text":"Since ","element":"span"},{"style":{"height":17.5},"width":214.22,"height":43.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-31.png","element":"img","alt":" ξη reduces fη","inline":true},{"text":", it will converge to a stationary point satisfying ","element":"span"},{"style":{"height":15.99},"width":144.63,"height":39.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-32.png","element":"img","alt":" ∇fη = 0","inline":true},{"text":". Observe that ","element":"span"},{"style":{"height":15.99},"width":144.62,"height":39.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-33.png","element":"img","alt":" ∇fη = 0","inline":true,"padRight":true},{"text":"iff ","element":"span"},{"style":{"height":17.1},"width":119.88,"height":42.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-34.png","element":"img","alt":" ξη = 0","inline":true,"padRight":true},{"text":"since ","element":"span"},{"style":{"height":17.5},"width":205.81,"height":43.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-35.png","element":"img","alt":" ∇fη = J⊺ξη","inline":true,"padRight":true},{"text":"and the symmetric component ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"of the Jacobian is negative definite. Finally, observe that all stationary points of ","element":"span"},{"style":{"height":15.99},"width":32,"height":39.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-36.png","element":"img","alt":" fη","inline":true},{"text":", and hence ","element":"span"},{"style":{"height":17.1},"width":40.46,"height":42.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-37.png","element":"img","alt":" ξη","inline":true},{"text":", are stable fixed points of ","element":"span"},{"style":{"height":17.1},"width":213.58,"height":42.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-38.png","element":"img","alt":" ξη because S","inline":true,"padRight":true},{"text":"is negative definite, which implies that the fixed point is a Nash equilibrium.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proof of theorem ","element":"span"},{"href":"#id-66","style":{"fontWeight":"bold"},"text":"6.","element":"a"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 6. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose all firms have negative sentiment, ","element":"span"},{"style":{"height":20.55},"width":183.32,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-39.png","element":"img","alt":"dfidt (w) < 0","inline":true},{"style":{"fontStyle":"italic"},"text":", for sufficiently large values of ","element":"span"},{"style":{"height":16},"width":86.29,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-40.png","element":"img","alt":"∥wi∥","inline":true},{"style":{"fontStyle":"italic"},"text":". Then the dynamics are bounded for any learning rates ","element":"span"},{"style":{"height":14},"width":111.44,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-41.png","element":"img","alt":" η ≻ 0.","inline":true}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Fix ","element":"span"},{"style":{"height":14},"width":108.86,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-42.png","element":"img","alt":" η ≻ 0","inline":true,"padRight":true},{"text":"and also fix ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d > ","element":"span"},{"text":"0 ","element":"span"},{"text":"such that ","element":"span"},{"style":{"height":20.55},"width":190.67,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-43.png","element":"img","alt":"dfidt (w) < 0","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w ","element":"span"},{"text":"satisfying ","element":"span"},{"style":{"height":16},"width":185.58,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-44.png","element":"img","alt":" ∥wi∥2 > d","inline":true},{"text":". Let ","element":"span"},{"style":{"height":16},"width":552.87,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-45.png","element":"img","alt":"U(d) = {w : ∥wi∥2 > d for all i}","inline":true,"padRight":true},{"text":"and suppose ","element":"span"},{"style":{"fontStyle":"italic"},"text":"g > ","element":"span"},{"text":"0 ","element":"span"},{"text":"is sufficiently large such that ","element":"span"},{"style":{"height":19.72},"width":236.91,"height":49.31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-46.png","element":"img","alt":" f−1η (g) = {w :","inline":true},{"style":{"height":16.79},"width":330.44,"height":41.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-47.png","element":"img","alt":"fη(w) = g} ⊂ U(d)","inline":true},{"text":". We show that","element":"span"}],[{"style":{"width":"54%"},"width":868,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-48.png","element":"img"}],[{"text":"for the dynamical system defined by ","element":"span"},{"style":{"height":20.17},"width":141.52,"height":50.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-49.png","element":"img","alt":"dwdt = ξη","inline":true},{"text":". Since we are operating in continuous time, all that ","element":"span"},{"text":"is required is to show that ","element":"span"},{"style":{"height":18.98},"width":295.86,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-50.png","element":"img","alt":" fη(w(t)) = g′ < g","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":18.98},"width":264.26,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-51.png","element":"img","alt":" fη(w(t+ϵ)) < g′","inline":true,"padRight":true},{"text":"for all sufficiently small ","element":"span"},{"style":{"height":11.6},"width":99.23,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-52.png","element":"img","alt":"ϵ > 0.","inline":true}],[{"text":"Recall that ","element":"span"},{"style":{"height":20.55},"width":845.9,"height":51.38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-53.png","element":"img","alt":"dfidt (w)(w) := ξ⊺i · ∇2iiπi · ξi = Dξi( 12∥ξi∥22)","inline":true},{"text":". ","element":"span"},{"text":"It follows immediately that ","element":"span"},{"style":{"height":22.01},"width":625.9,"height":55.02,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-54.png","element":"img","alt":"Dξη( 12∥ξη∥22) = �i ηi · dfidt (w) < 0","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"fontWeight":"bold"},"text":"w ","element":"span"},{"text":"in a sufficiently small ball centered at ","element":"span"},{"style":{"height":14.19},"width":70.23,"height":35.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-55.png","element":"img","alt":" w(t)","inline":true},{"text":". In ","element":"span"},{"text":"other words, the dynamics ","element":"span"},{"style":{"height":20.17},"width":302.85,"height":50.43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/15-56.png","element":"img","alt":"dwdt = ξη reduce fη","inline":true,"padRight":true},{"text":"and the result follows.","element":"span"}],[{"text":"C ","element":"span"},{"text":"N","element":"span"},{"text":"EAR ","element":"span"},{"text":"SM-","element":"span"},{"text":"GAMES","element":"span"},{"text":": E","element":"span"},{"text":"XPERIENTIAL ","element":"span"},{"text":"V","element":"span"},{"text":"ALUE AND THE ","element":"span"},{"text":"E","element":"span"},{"text":"XCHANGE OF ","element":"span"},{"text":"G","element":"span"},{"text":"OODS","element":"span"}],[{"text":"Definition ","element":"span"},{"href":"#id-47","text":"3 ","element":"a"},{"text":"proposes a model of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"monetary ","element":"span"},{"text":"exchange in smooth markets. It ignores some major aspects of actual markets. For example, SM-games do not model inventories, investment, borrowing or interest rates. Moreover, in practice money is typically exchanged in return for goods or services – which are ignored by the model.","element":"span"}],[{"text":"In this section, we sketch one way to extend SM-games to model the exchange of both money and goods - although still without accounting for inventories, which would more significantly complicate the model. The proposed extension is extremely simplistic. It is provided to indicate how the model’s expressive power can be increased, and complications that results.","element":"span"}],[{"style":{"width":"78%"},"width":1246,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-0.png","element":"img"}],[{"text":"The functions ","element":"span"},{"style":{"height":11.59},"width":49.07,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-1.png","element":"img","alt":" ωij","inline":true,"padRight":true},{"text":"measure the amount of goods (say, widgets) that are exchanged between firms ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":". We assume that ","element":"span"},{"style":{"height":15.59},"width":235.16,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-2.png","element":"img","alt":" ωij + ωji ≡ 0","inline":true,"padRight":true},{"text":"since widgets are physically passed between the firms and therefore one firms increase must be the others decrease. For two firms to enter into an exchange it must be that they subjectively value the widgets differently, hence we introduce the parameters ","element":"span"},{"style":{"height":11.59},"width":49.76,"height":28.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-3.png","element":"img","alt":" αij","inline":true},{"text":". Note that if ","element":"span"},{"style":{"height":15.99},"width":273.57,"height":39.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-4.png","element":"img","alt":" αij = 1 for all ij","inline":true,"padRight":true},{"text":"then the model is equivalent to an SM-game.","element":"span"}],[{"text":"The transaction between firms ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j ","element":"span"},{"text":"is net beneficial to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"both ","element":"span"},{"text":"firms ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j ","element":"span"},{"text":"if","element":"span"}],[{"style":{"width":"32%"},"width":520,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-5.png","element":"img"}],[{"text":"and, simultaneously","element":"span"}],[{"style":{"width":"33%"},"width":532,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-6.png","element":"img"}],[{"text":"We can interpret the inequalities as follows. First suppose that ","element":"span"},{"style":{"height":11.59},"width":49.08,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-7.png","element":"img","alt":" ωij","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":11.59},"width":43.28,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-8.png","element":"img","alt":" gij","inline":true,"padRight":true},{"text":"always have the same sign. The assumption is reasonable so long as firms do not pay to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"give away ","element":"span"},{"text":"widgets. Further assume without loss of generality that ","element":"span"},{"style":{"height":15.59},"width":171.96,"height":38.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-9.png","element":"img","alt":" ωij and gij","inline":true,"padRight":true},{"text":"are both greater than zero – in other words, firm ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i ","element":"span"},{"text":"is buying widgets from firm ","element":"span"},{"style":{"fontStyle":"italic"},"text":"j","element":"span"},{"text":". The above inequalities can then be rewritten as","element":"span"}],[{"style":{"width":"69%"},"width":1109,"height":326,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Implications for dynamics. ","element":"span"},{"text":"The off-block-diagonal terms of the symmetric and anti-symmetric components of the game Jacobian are","element":"span"}],[{"style":{"width":"67%"},"width":1072,"height":214,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-11.png","element":"img"}],[{"text":"where it is easy to check that ","element":"span"},{"style":{"height":15.99},"width":484.84,"height":39.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-12.png","element":"img","alt":" Sij = Sji and Aij + Aji = 0","inline":true},{"text":". The off-block-diagonal terms of ","element":"span"},{"style":{"fontWeight":"bold"},"text":"S ","element":"span"},{"text":"has consequences for how forecasts behave:","element":"span"}],[{"id":"id-72","style":{"width":"76%"},"width":1215,"height":229,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-13.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"When are near SM-games well-behaved? ","element":"span"},{"text":"If ","element":"span"},{"style":{"height":11.59},"width":164.23,"height":28.98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-14.png","element":"img","alt":" αij = αji","inline":true,"padRight":true},{"text":"for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i, j ","element":"span"},{"text":"then the correction is zero; if ","element":"span"},{"style":{"height":11.59},"width":158,"height":28.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/16-15.png","element":"img","alt":"αij ∼ αji","inline":true,"padRight":true},{"text":"then the corrections due to different valuations of goods will be negligible, and the game should be correspondingly well-behaved.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"What can go wrong? ","element":"span"},{"text":"Eq ","element":"span"},{"href":"#id-72","text":"(5) ","element":"a"},{"text":"implies that the dynamics of near SM-games – specifically whether the dynamics are increasing or decreasing the aggregate forecast – cannot be explained in terms of the sum of sentiments of individual terms. The correction terms involve interactions between dynamics of different firms and the (second-order) quantities of goods exchanged. In principle, these terms could be arbitrarily large positive or negative numbers.","element":"span"}],[{"text":"Concretely, the correction terms involving couplings between dynamics of different firms can lead to positive feedback loops, as in example ","element":"span"},{"href":"#id-46","text":"1, ","element":"a"},{"text":"where the dynamics spiral off to infinity even though both players have strongly concave profit functions.","element":"span"}],[{"style":{"width":"48%"},"width":771,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/17-0.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Lemma 9. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Any smooth vector field can be constructed as the gradient of a function augmented with ","element":"span"},{"style":{"fontStyle":"italic"},"text":"stop gradient ","element":"span"},{"style":{"fontStyle":"italic"},"text":"operators.","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"Suppose ","element":"span"},{"style":{"height":23.32},"width":545.73,"height":58.3,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/17-1.png","element":"img","alt":" ξ = ( ∂f1(w)∂w1 , . . . , ∂fd(w)∂wd ). Define","inline":true}],[{"style":{"width":"71%"},"width":1141,"height":256,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2001.04678/images/17-2.png","element":"img"}],[{"text":"as required.","element":"span"}]]}],"_version":"3.3.2"},"paperNode":"$1b:props:children:props:children:0:props:product"}]]]}]}]