b

DiscoverSearch
About
Policies for constraining the behaviour of coalitions of agents in the context of algebraic information theory
2019·arXiv
Abstract
Abstract

This article takes an oblique sidestep from two previous papers, wherein an approach to reformulation of game theory in terms of information theory, topology, as well as a few other notions was indicated. In this document a description is provided as to how one might determine an approach for an agent to choose a policy concerning which actions to take in a game that constrains behaviour of subsidiary agents. It is then demonstrated how these results in algebraic information theory, together with previous investigations in geometric and topological information theory, can be unified into a single cohesive framework.

1.1 On the centrality of regulation in maintaining economic competitiveness

Consider the real world situation where agents compete and/or cooperate in order to maximise their share of scarce resources. Their actions are constrained by laws, the formulation and implementation of which by regulators is driven by policies.

In particular, consider the role that regulatory agencies fulfil in preventing over-fishing, or in ensuring that financial integrity in maintained in a financial market, thereby preventing anarchy and corruption (and thereby waste of economic value, at the cost of competitiveness of the society in which the regulatory agency is situated). Indeed, effective regulation of play in an economic game goes hand in hand with the ability of a society to compete with other societies effectively, for the means of economic production - be that labour or otherwise.

To drive home this point, let us expand upon the example of the tragedy of the commons. This apocryphal example is of a village with a single green, and various townsfolk with their own separate flocks of sheep. Each townsperson has the ability to allow their sheep to graze a small patch of ground on the village common. Most will not intend to deplete the resource, but eventually they will realise that it will be depleted by others if they don’t deplete it first, so they race to use it all. So the grass is depleted, the resource is gone, and the flocks of sheep have a much poorer diet. This evidently causes all players to lose in an iterated game. It also ’causes the town to lose’ as other towns with policies regarding village greens that are not so lax will attract players from the original town (as said towns will have better land for grazing sheep). Indeed, a regulator (eg the town) can issue a bounded number of permits for use of the land for a particular amount of time, and at a certain level of depletion, which solves this problem.

Some further intuition may be built by considering the role that regulatory entities fulfil in preventing agents from ’gaming’ markets, so that the tragedy of the commons can be avoided. Inefficient markets allow agents to take actions that transfer costs to other players. Regulatory entities provide forcing functions that ideally help players wear the true costs of their activity in the game. Of course, generally speaking, regulators or adjudicators provide some form of influence on the game that acts as a forcing function on payoff functions for particular actions at particular times.

More formally, we would like to study how regulators can act in a way such that their intervention is optimal, i.e. such that it prevents and/or addresses market distortions that would arise as emergent behaviours and/or exploits (violations of the efficient market hypothesis) as a consequence of the basic structure of an economic game. In a way this can be seen as congruent to establishing an ethical framework, as ethics can be seen as concerning itself in terms of how one should regulate behaviour with regards to competition for finite resources in some form of game.

A key goal of this paper will be to construct a framework for the formal consideration of issues such as these.

In particular, within the document, a model will be introduced, within which a system agent regulates the playing of a game, which may or may not be iterated, by coalitions of sub-agents who may compete, coordinate with each other and/or cooperate. The system agent may constrain the actions of the sub-agents using laws in accordance with policies.

Throughout said discussion, it will also be indicated how these ideas can be generalised, abstracted, and extended - within the broader and richer context of a holistic vantage regarding first-order considerations in Algebra, Geometry, and Topology.

1.2 Definitions, agents, constraints

Now, bear in mind the situation that governed said preceding papers on topological information theory: we have a game (i.e., a decision problem with some form of payoff function, that may or may not be iterated, and which generically involves competition, coordination, and/or cooperation between multiple entities over said payoff), and we have agents that might want to determine an optimal strategy for said game and execute on it. (Here we have the understanding that an agent is an entity playing a game that possesses its own inherent agency - hence, ”agent” - and has an objective in the context of the game with respect to some form of internally modelled utility function.)

Certainly, in the real world, agents will not have total information regarding the context of a particular game, and may not be rational. Therefore, it makes sense to find a way to encourage agents in the aggregate to follow a particular strategy. i.e., we would like our agents to be subjected to constraints which may be enforced to a greater or lesser extent.

Definition 1. (Action / Law) A (system) action or law is a construct out of data relating to some form of summary statistics for a game over a population of agents. Such an action can be used as a blunt tool to alter and/or constrain agent behaviour.

So in this sense we think of the system within which the agents are playing, as itself a form of agent - i.e., ”system as agent” - and we are interested in determining two things: one, what policies and/or constraints such a system can employ to regulate behaviour of subsidiary agents within said system; and two, determination of a process to optimise when and how much to utilise such constraints.

Definition 2. (Policy) A policy is an approach that a system agent takes to determine when and what actions to take towards particular subsidiary agents. i.e. when and how to enforce laws. The goal of a policy is to encourage subsidiary agents to behave in ways which are closer to an optimal strategy on the level of a particular set of coalitions of agents, i.e. at the level of concern of the system agent.

In particular, hearkening back to towards the end of [4], one might be interested in looking at expressions such as  g1 ◦ g0, or ◦(g2 ◦ g1 ◦ g0; h) (associated say to composition of information functionals into a more complex overarching information functional), and making these statements about groups or algebras in some logical sort of way, possibly as invariants of same. One might intuit that this should be the logical framework to angle towards in terms of figuring out how to construct policies.

Naturally, a regulator should act in different ways according to different situations. Types of situations arise in a natural emergent and combinatorial fashion according to the subtlety and nature of the game. Interestingly, in the analysis to follow we find that to zeroth order we find three natural types of situations, and to first order we find there to be eleven. Methods for formulation of optimal policy in regards to these different types of situations will be sketched in this paper in terms of solution of appropriate natural PDE arising from criticality of certain information functionals. (n.b. there is a nebulous link here to Type theory in programming, wherein to zeroth order we have String, Boolean and Float types, and to first order we have types associated to programming languages such as J or Haskell where we have function types. (The function types themselves would be ({T|T : {String, Bool, Float} → {String, Bool, Float}}, Dict, Array), makingeleven in total.))

One key goal of this paper will be to first more formally construct a description of a policy manifest for regulators / custodians / arbitrators / adjudicators in terms of an appropriate algebraic framework, and follow this with a description as to what situations are associated to regulation of situations concerning primarily 0-, 1-, and 2-agents (here a 0-agent makes decisions according to one statistical distribution (frequentist decisions), a 1-agent makes decision according to two (bayesian decisions), and a 2-agent makes decisions according to three). The aim will then be to look into description of information theoretic techniques involving construction of information inequalities and setting their first variation to zero (a fairly standard technique) in order to describe how to compute optimal policy regarding intervention in various types of situations by a regulator.

In the above, it was suggested that we should look to algebraic considerations in order to abstract away unnecessary topological structure from [3] and [4], and instead focus on the structural considerations associated to having two different layerings of agents - at player level and at system level. This section will sketch roughly how we can construct such a framework, how this can be used to build policies to zeroth, first, and second order, and also very very briefly touch on more abstract concerns. In particular, the aim of this section is to provide a rough compass for more detailed

argument later within this document.

2.1 Basics

One can think of an optimal policy as a geodesic over the space of types associated to an algebra.

More precisely, we are interested in quantifying a choice of structure over ways to apply a type to a space. Types can be things like  ◦ and ∧with actions  ◦(f, g) := f◦g,and  ∧(f, g) := f g, for instance.

We define a prototypical geometry on the space of types over a space M as follows. Let  σbe a metric over  M, and τbe a metric over the space of metrics on M. Then it makes sense that  τdefines a geometry over the types associated to  σby the following rough intuition:

1) we have multiple types available to us, and the simplest, and therefore most representative of the information of the system to first order (composition, exponentiation, addition and multiplication) form a ’type space’ of dimension 4.

2) we can chain these types together so that we can select an arbitrary point in Z4.

3) we can broadly extend this to non-integers if we broaden our definition of type to include fractional applications of same; in this way we cover a 4-manifold of potential options.

4) this manifold admits a metric,  τ, that is dependent on the metric  σ in theoriginal space that the type has been built on top of. This metric  τ musttherefore be in the space of metrics of metrics over the original space.

This leaves us with this natural expression for a prototypical ’signal function’ for such structures:

image

and, as mentioned above, this defines a geometry over the space of types. Defin-ing the notation −δa (σ(m)) := δ(σ(m) − a), we can rewrite the above as

image

Now, to pare away unnecessary structure, and push our construction more towards that of an algebraic bent: let now m be in some Lie Group M. Then TM is a Lie Algebra, and  τdefines an action from the product of the Lie Algebra with itself to the reals; so a ’metric’ on the Lie Algebra  TmM for each m ∈ M. In thissense we have a basis for discussion of Algebraic Information Theory.

2.2 Classification

This gives us a good starting point, but it is by no means the end of the matter. For it is natural within this form of abstraction to consider the classes of particular structures that can be built and/or extended from this basic structure. Degree one extensions are easy; there are three. In fact, it is a simpler than the above mentions of type theory like geometries, since we are interested in the partition of merely one nested Dirac delta, therefore:

image

where  σ(1) is a function from the space of metrics to metrics, and  φis a function from M back to itself. I claim that these are representative of potential system actions relevant to the consideration of regulating behaviour of 0-agents.

Things get more interesting when we are interested in extensions of degree 2. When we endeavour to classify the resultant topologies generated, with an emphasis to focusing on natural generalisations / extensions (more to come on this later, when we look at Ramsay theory), we determine eleven potential topologies, as demonstrated by 1, bearing in mind that the Fibonacci sequence is a useful first order sequence to use in order to inductively construct structure - therein there being a natural build of 2 extra structures at ground, 1 extra structure at level 1, and 1 extra structure at level 2.

This is not the complete picture actually, for it is natural to consider the set of all partitions of 2: {{2}, {1, 1}}, and thereby consider structures of the following kinds:

1. σ(1)(τ, φ)(m) 5. σ(1)τ(m, n) 9. στ(m, n(1))

2. σ(τ, φ(1))(m) 6. στ(m(2)) 10. στ (1)(m, n)3. σ(τ, φ)(m(1)) 7. στ (1)(m(1)) 11. στ(m, n, p)4. σ(τ, φ)(m, n) 8. σ(1)τ(m(1))

Table 1: Allowed topologies for partition {2} at depth 1

f(m|a, b) := −δa−δb (τ ◦ σ(m))

f(m, n|a, b) := −δa σ(m) −δb τ(n)

where as above there are 11 potential applicable topologies for the first. For the second, there is 1 potential applicable topology. Why one, and not three? Because since we are considering higher abstraction (interaction with the metagame), we need to pare away some complexity. In particular, it makes sense to pare away p(1) = 2 weights for the second class of structures, leaving  p(1 − p(1)) = p(−1) = 0weights to distribute, and therefore a total combinatorial complexity of 1. This makes for a total description at this level of abstraction of size 12.

2.3 Interpretation

In game theory, we will say that a policy or law is a constraint on possible moves in a game, usually with the intent on serving the interests of a group to whom all players belong.

The number of degrees of freedom, or situational topologies, are scenarios or modes of action by players in the game (simplifying from coalitions of same, since we are focusing on the algebra here, rather than CW structures for players).

Construction of optimal policies or a set/code of laws must needs consider each of these topologies individually.

An optimal policy should also take into account externalities or sources of excession, which may perturb the course of the game. To clarify,

Definition 3. (Excession) Excession is the role of agency beyond the scope of a game by unknown players, which can nonetheless exert substantial influence on the way it is played, or even potentially completely change the rules and/or scope of play at certain turning points, after the manner of same in, say, catastrophe theory. In particular, players of a game experience an excession event if/when such an action of external agency becomes more apparent.

Therefore, we have the interpretation that the eleven potential topologies above correspond to a first order classification of ways to restrict modes of action by players in a game. We expect naturally to be able to construct information invariants for same, and thereby to intuit optimal policies or laws that optimally constrain the action of players in terms of serving the interests of the group, while also giving them (the players) incentive to choose to join / remain with the group with said constraints.

The one topology for the partition {1, 1} of 2 corresponds to a classification of ways to respond to excession in a way that optimally guarantees the interests of the group.

On a higher order level, we might of course have law to law interactions. If we are interested in more sophisticated concerns, in order to understand guiding principles for adjudication of a game in the context of some suitably defined metagame, we do need to bear such in mind (we will in fact find that the 11 potential topologies above becomes 42 in said instance).

In particular, we have for particular partitions of 3, being {3}, {2, 1}, {1, 1, 1} that:

For {1, 1, 1} there corresponds structures  f(m, n, p|a, b, c) := −δa σ(m) −δb τ(n) −δcφ(p), with potential topologies of multiplicity 1.

For {2, 1} there corresponds structures  f(m, n|a, b, c) := −δa−δb (τ◦σ(m)) −δc φ(n),with potential topologies analogous to the previous section of multiplicity 11.

For the singleton set {3} there corresponds structures  f(m|a, b, c) := −δa−δb−δc(φ ◦ τ ◦ σ(m)), with potential topologies of multiplicity 42. These are covered in table 2, bearing in mind a Fibonacci progression of 1, 1, 2, 3 at various levels therein.

This makes for overall a description of total multiplicity 42 + 11 + 1, or 54 at depth / subtlety 2.

Here, 42 refers to optimal ways to regulate behaviour between players in said game, if these players are 2-agents. 11 refers to optimal ways for said players to exert custodial responsibilities over 1-agents, bearing in mind principles of excession. 1 refers to the optimal way to approach matters of excession to the domain of operability for 2-agents.

(From our previous discussion, we might ask why we only distribute 2 weights at the second level, and none at the third. This follows because we have a sequence p(2), (p(0), {p(1), p(−1)}), (◦(p, p(1))(1), {p(−1), p(−1), p(−1)}), such that the sum of the components is the same in each part of the sequence. The vague intuition here

image

Table 2: Allowed topologies for partition {3} at depth 2

is that information about the metagame should be coarser for a faithful description, and information about the metametagame should be even vaguer.)

2.4 What lies ahead

image

is the key object of study. This is the foundation of the study of torus categories, the next level of abstraction beyond what is covered in this paper. We are also interested in zero sum Ramsay theory, which concerns a combinatorial approach towards examination of meta-mathematical concepts. In particular, we expect to need to ’fold in’ the idea of meta-mathematics into studies of structures (be they higher categorical / toposes, or otherwise) of higher order subtlety as an element of their mathematics.

In this section we will flesh out the theory of optimal policies for games concerning 0-agents.

3.1 Initial formalism and intuition: A recap and a step forward

Recall that in [3] and [4] we described a game formally as a system that had players who could make potential actions during each game ’turn’. Indeed, within any game, for a given game turn, we should have potential actions or moves available to players. The impact of these actions on the game is described by a decision metric σ : T∆3 × T∆3 → R, where ∆ is a four tuple of CW complexes. The information associated to this decision metric can be thought of as an expression for the payoff g : ∆3 → R, and solving for the first variation of the information to zero (by the Cramer-Rao inequality) allows us to compute optimal strategies as geodesics of this system. So far so good.

We eventually generalised this to a point where we had multiple information functionals in terms of theory spaces etc starting to be chained together - eg  g1 ◦ g0,or  ◦(g2 ◦ g1 ◦ g0; h). These expressions almost look like group multiplication. So this lends us to ask the following question - is there a natural generalisation of this sort of structure to bring in ideas about (algebras of) types, etc? Of course there is, but rather than thinking about metagames, 1-agents and so on from the get go, let us take a few further steps back and try to think about what we are trying to describe.

In particular, our previous concerns assumed that the game had a global set of actions and game rules which were available to players, but within this state of affairs, players (or coalitions of players, described by CW complexes) were free to do anything. Now, let us drop the generalisation that players can be CW complexes, and consider them merely as points. And let us introduce the notion that players can opt to be members of a subsystem that itself has agency to dictate and enforces constraints on the actions of such players. Now, you might ask, why would a player willingly sacrifice agency to join the ranks of an organisation with such membership conditions? In analogy with how cities and societies work, we might say that the reason would be protection and also access to greater resources. And also in analogy with cities and societies, we might call these constraints laws, and their implementation and/or enforcement by said society / organisation a policy. We might also expect optimal levels of enforcement to not to be totally iron-fisted - indeed, it might be mutually advantageous to the society and its members to have some slack in said constraint. nb. also, in the real world, people tend to assume things are enforced, because the societies that couldn’t in the past lost the advantage of having laws and systems, and went into decline and/or were superseded by societies that could. So it is a reasonable assumption to make.

That is the preliminary intuition. Consequent to this, we are interested in formulation of a theory of constraints of play in a game.

So say that we have a set of allowed actions described by  σ : TG × TG → R(assuming for simplicity now that G is a four tuple of Lie Groups  Gi, i = 0, 1, 2, 3).Let  τbe a metric on the space of metrics over G. Suppose that  m = (m1, m2, m3, m4)is a point in G. Then we have the notion of a payoff function given in terms of a natural information functional, constructed from this distribution functional:

image

This payoff function which we will call P takes the form

image

Solving for  δP= 0 we extract a PDE which we can solve to determine the optimal policy for implementation / determination of the optimal amount of ’slack’ concerning a particular topological restraint associated to an understanding notion of law, given by the implicit choice of topology in construction of P.

Indeed, we have different choices of topology in construction of P(f) which manifest in the construction of the statistical distribution itself. To zeroth order the answers are different than to first order, which are given in 1. Indeed, to zeroth order, we have a very straightforward classification:

1.  f(m|a|(0, (0, 1))) = −δa (σ(m, n))

2.  f(m|a|(0, (1, 0))) = −δa (σ(φ))

3.  f(m|a|(1, (0, 0))) = −δa (σ(1)(m))

image

speaking these come from a critical graph, or critical sequence - and the set of these sequences forms a set of ’trees’, a concept which we could coin alternatively as a grove.

3.2 Sequences, subsequences, and critical graphs

Broadly speaking, we are interested in a language that allows us to describe sequences of objects  Siwhich themselves may either be numbers or in fact contain nested sequences  S(i,j), which in turn, contain sequences  S(i,j,k)ad infinitum. And we are interested in critical sequences in some way, shape or form. Let us recast this another way - we can represent a sequence as a graph between n-simplices. Leaving aside the matter of how graph multiplication is supposed to work for the time being, a graph admits a factoring if one can multiply two subgraphs together to get the graph back. The rank of a graph is, vaguely speaking, the degree to which it deviates from

being a chain of 0-simplices. Then there are three unique prime graphs such that their rank is 1. These can

be represented as (0, (0, 1)), (0, (1, 0)), and (1, (0, 0)), and there are 11 prime graphs

of rank 2.

There are 42 prime graphs of rank 3, as evidenced by 2. It is naturally of course interesting to ask how we might be able to represent

such trees in a more natural way in terms of our choice of notation. But there are

actually two choices of tree we can use to build structure: a tree of functions, or a tree of Dirac delta functions. I illustrate this as follows: Consequently, it makes sense to consider notation −δa (T) for Ta tree. More

generally, however, we might be interested in considering −δS (T) (and indeed things

like −δU−δV (T(A, B)), where U, V, A, Bare trees, and T is something slightly more exotic). I will explore matters concerning the latter in a later paper wherein I visit Ramsay theory more carefully. However, for now, it is perhaps instructive to adopt the former notation to our problem of note. In particular, we can rewrite the above in terms of  T ∈ T, the finite set of prime

trees of rank 1, as:

image

or more succinctly as

image

where S is the trivial tree with two nodes.

Let’s try to flesh out this intuition a bit more.

First, we have the idea of prime trees. Loosely speaking, we have a multiplication between trees. A tree is prime if it cannot be factored into smaller non-trivial trees. We also have a way to represent trees of structure for functions, and trees of structure for nested statistical distributions (which in our case are primarily Dirac delta functions). The nodes of a tree correspond to different types of simplices, depending on the ’dimension’ of the node, eg a 0-simplex, 1-simplex etc.

(A tree is prime if it cannot be factored into smaller trees.)

image

Indeed the above is the most natural intuitive way to view of different allowable topologies for our algebraic information theories. But we want to go further.

image

Now  δT(x) := δ(x, T), with Ta tree can therefore be defined in general per the above interpretation.

Similarly,  δa(T) can be defined with T a tree on functions.

Consequently, more generally,  δS(T) is well defined, for S, T trees. Therefore we have a ’tree of trees’, where for each node in the tree of functionals, we have in general a tree of functions for its argument.

This leads us to the following important insight, expressing said ideas per the following notation:

Suppose that  φ ∈ {{κ → κ} → κ}, where κis the set of all trees. So that  φ isa map from the power set of trees - or the set of subtrees of a general tree - to a tree. Then we can represent a treelike structure for Dirac deltas S conjoined with a correspondence to the nodes of such trees with further trees for functions, for each and every subtree of S loosely as:

image

where  args(φ) is loosely the arguments of the aforementioned mapping as resolved at a particular subtree of S. But this is unsatisfying notation. Perhaps better and more suggestive is to overload the idea of a Dirac delta taking only one argument, and allow it to take two.

Then we have vaguely  δ(S, φ). But then the second argument could itself be a statistical distribution, so we could write

image

which is highly suggestive. Indeed, S need not be a tree - it could be something else. A fractional tree? A point in a manifold? A statistical distribution itself?

image

But of course if we are allowing  δto take more than one argument, why not a countably infinite number of arguments? So by dimension criticality assume that we have four arguments for  δthat contain information. And if this is to be a four dimensional space of four-manifolds, (m0, m1, m2, m3) ∈ ¯M, then we have an idea of some sort of matrix of metrics Λ, or a 4-tensor over them.

Then we are interested vaguely in structures of the form

image

3.3 The beginnings of higher category theoretic abstraction

Ultimately, though, we are interested in computability. So let’s unwind things a bit, and recognise in fact that the information density measure

image

where  Dij := δ(Λijk( ¯mk) − aij), has the property that this is a scalar, where Λ is a 3-tensor. So we can insert this neatly into a standard (non-overloaded) Dirac delta to have something computable:

image

where ¯m ∈ ¯M := (M0, M1, M2, M3) a vector of manifolds. (The use of this information density furthermore allows us to have a form of generalised CramerRao inequality holding true for the above statistical distribution - which would not be generically true if we did not use this particular form of nested structure.)

Finally, we might in fact be rather interested in  φ ∈ {g : ¯M → ¯M}, so thatinstead we would be studying

image

But maybe it doesn’t even make sense for  σto be a 2-tensor at all, but rather a 3-tensor, in line with cybernetic considerations. So this is a bit of an open question. Nonetheless, the above can crudely be thought of as a starting point for the study of structures of deeper category theoretic complexity (in a sense, this seems logical, as said structure brings together and unifies depth two considerations with minimal geometry and topology (algebra) with maximal geometric structure, minimal depth and topology (geometry), and maximal topology, minimal geometry and depth (topology)).

Another open question is how to compute these things in a practical manner. It is straightforward to construct algorithms for solution of 2-tensors, but 3-tensors and 4-tensors are wilier and harder to construct algorithms for that are computationally efficient. This is of interest to us, because the above formulation actually is quite central to the consideration of construction of chat bots with adaptive trees.

There may well be ways around this, such as through the use of moving from the idea of normal space to the space of special functions (like Beta functions, gamma functions, etc) and thereby compress the dimensionality of the above constructs into something more computable. For instance, we might move to compress things by dealing with Meijer-G functions to move from 3 tensors to scalars, and from 4 tensors to 2 tensors - in a sense ”lifting” to the space of functions viewed as analogous to surreal numbers with first uncountable cardinality, and identifying same with scalars through such a mapping. Regardless, the detailed study of such structures as these shall be the subject of a later paper the author aspires to write on Ramsay theory, which will attempt to deal with the topic of combinatorial information theory in a way that hopefully would do it greater justice.

For now, though, it is useful to understand roughly where our different topologies are coming from. Later, in yet a different paper that shall concern itself regarding providing a map for exploration further, a sketch will be provided regarding how the ideas just mentioned herein fit in terms of the classification of structures of structure, as well as the classification of structures of structure of structure. It will also be stressed as to the importance of lifting from the notion of primes being the atoms of structure to more abstract notions of irreducibility. Such considerations may prove to be key in making progress with chat bots, as well as a number of other surprisingly practical and applied fields of endeavour.

3.4 Determination of optimal policy

Backtracking somewhat, recall that we are interested in distributions of the form

image

with an associated payoff function which we will call P:

image

We have three different choices of topology in construction of P(f) which manifest in the construction of the statistical distribution itself. To zeroth order the answers are different than to first order, which are given in 1. Indeed, to zeroth order, we have a very straightforward classification:

1.  f(m|a|(0, (0, 1))) = −δa (σ(m, n))

2.  f(m|a|(0, (1, 0))) = −δa (σ(φ))

3.  f(m|a|(1, (0, 0))) = −δa (σ(1)(m))

The sequences above are associated to a discrete variant of a deeper more general underlying subject (the study of combinatorial information theory, or Ramsay theory). In order to determine optimal policy, we can solve each of these in turn and try to infer from a situation as to which is the most important in which circumstance. However, more likely we would be interested in a constrained optimisation problem, where we attempt to optimise each and every one, subject to the others. This brings us full circle back to Ramsay theory again. Our distribution here is

image

where we can think of the inner distribution as being akin to the distributions above via vague intuition. Therefore, in order to determine the best policy to pull available policy levers associated to the relevant topologies, we would perhaps be better served if we were to instead try to optimise the information of this distribution.

Then we find that the geodesics through  { ¯M → ¯M}space (where ¯M := M ×M × M × M) associated to the geometry imposed by a Λ and  σthat solve the PDE induced by  δI(f) = 0 are paths through ”policy lever pull space” that optimise choices (and levels of enforcement of) abstract policy used to constrain behaviour of agents within an abstract society within an abstract game, that optimise the competitiveness of said society.

It seems reasonable to posit that in the real world, societies and/or organisations that tend to do well are the ones that exhibit emergent behaviours, (often formed over significant spans of time) which reflect solutions of this rather abstract problem.

Note finally that we have only dealt with the zeroth order case. The first order case and second order case are discussed briefly in the previous section (and tables provided), but the flow of the argument and the analysis remain roughly the same.

[1] Chris Goddard. Advanced topics in information dynamics. viXra preprint viXra:1002.0033, 2010.

[2] Chris Goddard. Soliton solutions to the dynamics of space filling curves. viXra preprint viXra:1711.0459, 2017.

[3] Christopher Goddard. An information theoretic approach to game theory, ı. viXra preprint viXra:1810.0289, 2018.

[4] Christopher Goddard. An information theoretic approach to game theory, ıı. viXra preprint viXra:1811.0414, 2018.

[5] Christopher John Goddard. A treatise on information geometry. arXiv preprint arXiv:1802.06178, 2018.

Within this paper the choice has largely been to err somewhat towards provision of clear and crisp descriptions of particular mathematical structures for the sake of pedagogical clarity. However, there is perhaps a risk that towards heading for polish in the above, that something might perhaps have been lost in an understanding as to underlying intuition. Therefore, let’s step back to the beginning once more and try to paint a different sort of perspective as to why one might think of what we are doing as ’algebraic information theory’.

A.1 Foundations

In order to conduct our analysis, we are interested in constructing various invariants for algebras and groups, and relating these to information invariants. The prototypical example to consider is to consider maps  f : Sn × Sm → X(generalised simplicial homology), and look at the groups  Cnmconsisting of homolopy equivalence classes of these. There are boundary maps  ∂→ that mapfrom  Cn+1,m → Cn,m → Cn−1,metc, and boundary maps  ∂↓that map from Cn,m+1 → Cn,m → Cn,m−1etc. These form a grid of boundary operators linking groups together. More generally, we can consider skip operators, that map from

Cn+1,m to Cn−1,mdirectly for instance. It is slightly unclear to the author what the

intuition there would be, though.

Regardless, the general idea is that with this construction we can construct an invariant

image

where G is the enveloping algebra consisting of all the homology groups  Gn,mdefined as the quotient groups in such a way that a square is commutative, and gnmαare the generators for  Gn,m, and gnmα relates to the cohomology generators dual to  gnmα.We can think of this roughly as an information, and it satisfies a Cramer-Rao like inequality:

image

Then  δI= 0 allows us to determine an enveloping algebra G such that the information is critical. But what does this have to do with policies?

A.2 Application to game theory

Consider as in [3] a game  g : ∆(0) × ∆(0) × ∆(0) → R. Recall that a point in ∆(0) isa 4-vector (p0, p1, p2, p3), where

 p0is a representation of a coalition

 p1is a representation of a decision for said coalition a wrt a second coalition b

 p2is the decision for coalition a with a third coalition c

 p3is the decision as to how coalition a should account for competition between coalition b and coalition c

and these are all CW complexes. So far, so good. A strategy is a choice of decision metric  σ : T∆(0) × T∆(0) → R.Suppose now that a rational strategy for a is not rational with respect to an

aggregate coalition of which a is part. Or alternatively that our coalition is not rational. Then we need the idea of a policy. Consider maps  f : Sn × Sm → ∆(0). Then as above we can construct groups  Gnmas part of an enveloping algebra G, with accompanying boundary operators. The intuition for f is that we are interested in extracting information regarding cliques amongst our players, and applying some form of forcing to counteract these cliques, which would otherwise make decisions that are suboptimal for the greater group (’society’) of coalitions. Since we are restricting degrees of freedom, instead of optimising a decision metric σ, we are interested in optimising the classes of a decision metric in relation to generators for our groups:

image

A.3 Preliminaries

Indeed, suppose m is in some Lie Group M, with metric  σ. Since M is a LieGroup, TM is a Lie Algebra. Let TM be also itself a Lie Group and action a metric  τ. In this sense we have a basis for discussion of Algebraic Information Theory. Therefore, consider our fundamental structure for consideration of policies to be a group. More than this, we would like it to be a Lie Group, so that group elements vary continuously over some space, ie, the group admits a differentiable structure with an atlas of charts. This is motivated by say tetrated group multiplication under which G should be closed, say  ⋆G(a; b), where ais a group element and b is a real number. We also naturally are interested in a lower order multiplication between elements,  a ⋆G bunder which G should also be closed. That is:

G is closed under the operation of  ⋆G, and

G is closed under iterated operation of  ⋆G via ⋆G(g; r) ∈ G, for all g ∈ G,r ∈ R.

image

eg, if elements in the algebra are generated by  x1, · · · , xk, then

image

A density function over possible P can be defined

image

where  g ∈ G, and P(g) : A × A → Rvaries smoothly with g. We can then define an information for P in terms of our density function f in a standard way:

image

This information will be non-negative. Setting the first variation to zero then gives us an expression for P which we can solve, in order to compute an optimal policy cost function  P0.An optimal policy is then a path in G which follows geodesics of  P0.This, however, is potentially naive. Let us spell out a bit more intuition. Say that  a := xi1xi2 × · · · × xin ∈ Ais a policy. Policies rather than agents are now considered to ’compete’ with one another. So  b := xj1xj2 × · · · × xjm ∈ A,c := xk1xk2 × · · · × xkl ∈ Aare competing policies say. Our task is to determine a way to determine the best strategy for a policy to take with its available ’levers’ in order for it to have the maximal chance of success. Then it seems more reasonable to define a policy cost function as a map

image

So what is a ’lever’ available to a policy? Surely this would be an action that it can take, if we consider a policy (as applied to a power set of CW complexes of agents in the overall pool of players) as an entity in its own right. We wish moreover to ignore the underlying game and focus on the game of policy lever use optimisation. But points in an algebra / in a group are trivial and have essentially no room for choice regarding levers, and are therefore not interesting. Consequently it makes more sense perhaps to consider quotient groups G/g of G (as Lie groups) and their corresponding induced algebras A/{g}. Levers, then, are the available degrees of freedom or subgroups / induced subalgebras within the quotient group viewed as an entity. Then we perhaps want to lift the concept of policy cost function to a map

image

where Pow(A) represents the set of all subalgebras of A including itself. This is the definition that shall be used as the basis for the rest of the paper going forward. So, what are the levers? Orbits of the policy describe the levers/actions available to the policy.

A.4 Iterated policy choices

Suppose we have a situation where a policy wants to decide what levers to pull, subject to the understanding that in future it will have to do the same thing again many times over, and the information regarding what levers it has pulled will propagate into the decision trees of other competing policies. Then the decision problem is different. Solving the ’lever pull problem’ for a policy in this iterated game is different to a ’one-off’ choice. We can follow the pattern of [3] here and observe that possibly 5 is the multiplicity that is important here. However, Pow(A) may not be sufficiently abstract. It is possible indeed that consideration of the set  Pow(1)(A) := {f → g|f, g ∈ Pow(A)}is a useful set to consider, and the expression

image

The first is as above. Consider  Pow(1)(A) := {f|f : Pow(A) → Pow(A)},functions from the power set of A to itself.

The second concerns  Pow(A(1)), where A(1) is the set of maps from A to itself.

The third consists of consideration of  PowPowA, or ◦(Pow; p(1))(A), wherep : N → Nis the prime number sequence.

Intuition for these three types is as follows:

For the first type, this concerns games where the levers available to a policy are subject to the iterate of the game and whatever previous lever pulls were made by constituents.

For the second type, this concerns games where we have certain policies and we wish to make several lever pulls (iterated choice).

For the third type, this concerns iteration of selection of policies, wherein each policy can only make one move, but can seed offspring policies from its choice (parent-child sequence / hereditary selection).

This emergence of three different strands of structure in terms of determination of appropriate strategies for pulling policy levers relates to the classification of situations within which policies are used. A situation is an arena wherein multiple policies are used to make decisions. To clarify this notion:

Definition 4. (Situations, arenas, and modes and resonances). A situation is an arena wherein policies operate. Such arenas have different natural choices of structure that policies can be applied to - and a finite number of said choices. These correspond to modes or resonances of the arena, which act as wrappers around a policy. Said modes (interchangeably, resonances) correspond to a choice of modelling the action of a policy in said situation. Every policy can operate in all natural modes in a situation, therein it is necessary to consider all relevant meta-structures (resonances) that potentially apply as wrappers around sets of such policies. For iterated policy choices for a policy associated to a Lie Algebra A, there are three:  Pow(1)(A), Pow(A(1)), PowPowA.

We can think of situations equivalently as instances of a universal class C which maps from the set of types to a function space. Above, we have three types J that map from a Lie Algebra to a function space. We are interested in characterisation of C in a natural way. Let T be the space of types (in this case discrete). Then we would like to understand maps  C : T × G → {f|f : T × G → G}, on some natural Lie Group G. In particular we would like to compute for instance  C(J, g) ∈ G.Maybe let’s forget Lie Algebras and Lie Groups, and just state things in terms of tangent spaces. Let’s have a function  f : M → M. Now, let’s mutate it with respect to a type J that acts on the tangent space TM. For instance, if we mutate it with respect to a function g in type T corresponding to  Pow(1)(TM), ieg ∈ {g : TM → TM}, we transform f into (in this case)  f g, or ∧(f; g), where ∧ isa representation of this type.

More generally, the mutation of f with respect to an instance g of J is some function J(f; g). Consequently, the types corresponding to mappings for first order:

image

correspond to 2-tuple pre-geometric operators  ∧, ⋆ and ◦respectively, and we have that  C(J) : M → {{M → M} → {M → M}}can be represented as

image

A.5 More on Types

We’ve considered prototypical types (Id, ◦, ⋆,+). But what about more complex types? Certainly we can have types like  C(g) := +(◦(◦(g; ); ); ⋆(g; )), but this might be too complicated. Potentially we would instead like to consider types C(g, h), too, so that we can construct something like ’a metric’ on type space, acting on the space of distributions dual to it. So say we have two general types C, D that are elements of the algebra generated by (Id, ◦, ⋆,+) above. Then we can recover numbers if we compute  C with DT assecond argument. eg if  C(g) := ◦(g; ), D(h) := +(h; ), then DT := +(; h) andCDT(g) := ◦(g; σ) + (σ; g) where σis a type metric. So we move away from types to consideration instead of this nebulous ’type metric’  σacting on two tuples of types, where  σis a function of g, our statistical distribution over metrics in physical space; so that the underlying space for which σoperates is a space of statistical distributions, and our types form a tangent space to this space of distributions. We can thereby think of a statistical distribution for this in turn, too:

image

so that this becomes our way of thinking about geometry for algebra. But we can think of this as a way to develop a handle on representing geometry for topology as well, if we think of types themselves as having geometry, with physical space as tangent, and the space of statistical distributions as the underlying space.

So in this way, we have a { rock, paper, scissors } representation for topology, geometry, and algebra, which is a natural way to think about unification of the ideas present in [5], [1], [2], [3], [4].

A.6 Meta-Policies

Extension of this idea leads one to examine situations for meta-policies (up to 11 structural variants) or meta-meta-policies (up to 31 structural variants).

For classification of situations concerning a meta-policy, we have:

0.  Pow(A(1) ⊗ A(1))

1.  Pow(1)(A ⊗ A(1))

2.  Pow(1)(A(1) ⊗ A)

3. PowPow(A ⊗ A)

4.  Pow(2)(A)

5.  Pow(1)(A(1))

6.  Pow(A(2))

7. PowPow(1)(A)

8. Pow(1)Pow(A)

9. PowPow(A(1))

10. PowPowPow(A)

which makes for 11 situations. An optimal strategy constructed to determine optimal play for a meta-policy within one of these 11 situations is a tenet. Together, these tenets form an ethical framework or code of laws.

Intuition for each strand is as follows:

The 0th situation,  Pow(A(1) ⊗ A(1)): cooperation

image

This consists of subalgebras associated to  A(1) ⊗ A(1). Here we consider in essence  A(1) a superalgebra for policies acting on policies, or the space of meta-policies. Therefore the object of interest to us are pairs of actions as a subgroup of the product superalgebra between two meta-policies.

This then indicates the optimal strategy for a meta-policy to interact with other meta-policies.

One could think of this as the approach for actions available to two-tuples of meta-policies, or interaction with other meta-policies. In terms of more

concrete intuition, this gives rules of engagement for dealing with alternative ethical frameworks, for instance.

The 1st situation,  Pow(1)(A ⊗ A(1)): response to action on meta-policy by a separate policy.

image

The 2nd situation,  Pow(1)(A ⊗ A(1)): approach to action of meta-policy on a separate policy.

image

The 3rd situation,  PowPow(A ⊗ A): approach to review of action of actions of a two-tuple of policies which are answerable to the meta-policy.

The ethics of review of the effect of the cascading effect of action due to actions between two subentities; how and when to legitimise action.

The 4th situation,  Pow(2)(A): approach to approaches to approaches concerning action of a policy answerable to the meta-policy.

The ethics of how to construct an ethics concerning review of the actions of a single subentity.

The 5th situation,  Pow(1)(A(1)): approach to approaches regarding how to act.

The ethics as to how to act as a meta-entity, i.e., as to how the ethics should behave.

The 6th situation,  Pow(A(2)): approach to externality/excession.

The ethics as to how to deal with super-entities which serve as excession relative to oneself.

The 7th situation,  PowPow(1)(A): approach to action regarding approaches concerning the actions of an entity.

The ethics as to how to respond to plans concerning action formulated by an subentity.

The 8th situation,  Pow(1)Pow(A)

The ethics as to how to plan to respond to actions undertaken by an subentity.

The 9th situation,  PowPow(A(1))

image

The 10th situation, PowPowPow(A)

The ethics as to how to respond to the cascading response caused by the cascading events due to the actions of a subentity.

Remark. A way of viewing these 11 tenets is as modes (’policies’) for optimal play for 0-agents, given that same are subject to custodial stewardship by 1-agents, with a supervisory excession represented by 2-agents.

Abstracting again, we have the idea of classes, and metaclasses, or inherited classes. In particular, following the prior analysis, we are led to the idea of a map C that maps from metatypes to maps D that map from types to functions. We are interested in 3-tuple operation at cybernetic level, and possibly 2-tuple operation at normal level.


Designed for Accessibility and to further Open Science