Discovering Symmetry Invariants and Conserved Quantities by Interpreting Siamese Neural Networks

2020·Arxiv

Abstract

Abstract

We introduce interpretable Siamese Neural Networks (SNN) for similarity detection to the field of theoretical physics. More precisely, we apply SNNs to events in special relativity, the transformation of electromagnetic fields, and the motion of particles in a central potential. In these examples, SNNs learn to identify datapoints belonging to the same event, field configuration, or trajectory of motion. We demonstrate that in the process of learning which datapoints belong to the same event or field configuration, these SNNs also learn the relevant symmetry invariants and conserved quantities. Such SNNs are highly interpretable, which enables us to reveal the symmetry invariants and conserved quantities without prior knowledge.

I. INTRODUCTION

Machine learning (ML) algorithms have experienced a surge in the physical sciences. This is based on the introduction of ML methods to fulfill tasks beyond the scope for which they were originally designed. These include finding phase transitions [1–12], simulating quantum systems [13–19] and rediscovering physical concepts [20–27].

Even though ML in theoretical physics is a young discipline, it has so far been successful in reproducing results in many complicated systems in just a few years. This success often comes at the cost of a lack of understanding of what ML algorithms intrinsically learn. Physics, as a scientific discipline, benefits from a “deeper understanding” of the underlying models used for making predictions.

The question of whether ML models can “understand” physics is a deeply philosophical one, and we don’t presume to address it in all its complexity. Assuming that a ML algorithm is successfully trained to predict the outcome of a physical experiment or calculation, it is not always clear whether the algorithm has deduced physical concepts or has merely managed to perform some basic pattern matching. However, if the ML model is “interpretable” in the sense that we can recover a compact and simple mathematical representation of a physical equation by analyzing the said model, then we take the position that such a model has indeed learned to “understand” the underlying physics.

The most successful ML algorithms are artificial neural networks (ANNs), which are famously inscrutable. Having said that, there have been many recent attempts at interpreting the learned features of a fully trained ANN. The simplest way to interpret a neural network is to examine the weights and biases of individual neurons, which can only yield successful results in shallow ANNs. In the field of explainable AI (xAI), there are different methods that determine which features of the given input are responsible for a learned model’s classification [28, 29]. Similarly, in the field of computer vision, there have been many developments to examine the contribution of the pixels in an image to the ANN prediction [30–34]. One of the most popular methods is feature visualization by enhancing learned patterns on input images [35].

In physics, one has a distinct advantage when it comes to interpreting ANNs. In the field of computer vision or natural language processing, it is very hard to come up with mathematical equations uniquely describing the “ground truth”. In contrast, physicists have worked for hundreds of years to formulate their theories and experimental measurements in terms of mathematical equations. This means that if we can recover such an equation by analyzing an ANN, we have an immediate access to its interpretation. This also opens up the possibility to check for consistency and reveal new concepts. Indeed a few recent works have presented successful interpretations of ANNs in physics [22, 36–39].

In this article, we propose a change in traditional Siamese Neural Networks (SNN) architectures that makes them easier to interpret. Specifically, the key feature is a bottleneck layer, where the SNN is forced to compress all available information from previous layers. The output of this bottleneck can be analyzed, for example, by applying known regression methods. A similar approach has been taken in [22]. While there does not exist a related interpretation procedure in computer vision, the idea of interpreting bottleneck layers is also seen in disentangling autoencoders [40].

The ANNs we are considering in this work are a variant of the previously proposed SNNs, a class of ANNs that have been applied to object tracking, face recognition, and image similarity detection [41–44]. An SNN consists of two (identical) ANNs that are applied to a pair of input data points. The two networks share their weights and biases, which are updated simultaneously during training. The goal of the network is to map the input pairs to a set of latent variables that determine the similarity

of the pair.

The general problem an SNN attempts to solve can be stated as follows: given two data points x and y related by an equivalence relation (e.g., the same event in a relativistic setting measured by two observers in different reference frames), is it possible to correctly and automatically classify them as “related”? Further, if x and y are not related, then we require the ANN to classify them as not related.

SNNs can solve an extension of a classification problem with relatively little training data per class. Instead of training a traditional neural network to distinguish between a fixed number of classes, an SNN can probe the similarity of one datapoint with another prototypical datapoint for a certain class. This reformulation bears many advantages. First, the number of classes does not need to be fixed. Further, it is no longer necessary to train on all of the classes. A successfully trained SNN might be able to share its learned representation to distinguish between classes that are not in the training set. These properties become important in the limit of many (possibly infinitely many) classes, or in the case where only a few data points are available in each class.

The contributions we make in this paper are:

1. We introduce the SNN to the field of theoretical physics.

2. We demonstrate its usage in the well known contexts of special relativity, electromagnetism, and the motion of particles in a central potential. In the case of special relativity, these SNNs learn whether or not two different observations of physical phenomena correspond to the same event. In the case of electromagnetism, these SNNs learn whether or not given two field configurations, one can be transformed into the other via a Lorentz transformation. In the case of motion of particles, these SNNs discover whether or not two observations of position and momenta describe the same particle.

3. Further, we successfully interpret the intermediate output representations of the SNN and recover the mathematical formulations of known physical conserved quantities and invariants, e.g., the spacetime interval or the angular momentum.

4. Since the interpretation of the SNN yields signatures of known physical equations, we argue that our SNN has indeed learned to “understand” physical concepts instead of merely performing basic pattern matching.

II. NEURAL NETWORK ARCHITECTURE

Figure 1: Schematic architecture of an interpretable SNN. Our SNN contains a bottleneck of only a single neuron, the output of this layer is called the intermediate output of the network. We observe that this bottleneck encodes quantities which are strongly correlated with invariants like the energy or the spacetime interval.

In this paper, we employ Siamese Neural Networks (SNN) to determine whether or not two samples belong to the same class. In this context, our input data is a pair of samples . In order to formulate a su- pervised learning problem, we associate the label 0 to pairs that correspond to the same class (i.e., and are related via an equivalence relation) and 1 to pairs that belong to different classes (i.e., the input pairs are not related). In this sense, we can reformulate a clas-sification problem with many – possibly infinitely many – classes into a traditional binary classification problem.

For this purpose, we construct our SNN consisting of several building blocks. The first building block is composed of a pair of identical neural networks. This pair of networks is applied simultaneously to each of the samples in a data point pair and . The last layer of the network pair only contains a single neuron, this layer we refer to as the bottleneck. The output of the bottleneck layer is the intermediate output of the SNN. The intermediate output is merged by performing appropriate algebraic operations. Let us denote and the output of each of the neural networks. Then the algebra layer calculates before supplying it to a sigmoid neuron such that the output of the full SNN can be written as

The SNN outputs a probability that signifies whether the two samples belong to the same class or not.

Figure 2: Schematic description of the task solved by the Siamese neural network. Left (a): In the case of special relativity and electromagnetism our network is tasked to learn if two descriptions from different perspectives correspond to the same event or the same electromagnetic field configuration. Right (b): In the context of Newtonian gravity, we train our network to detect if two observations of velocities and positions correspond to the same particle moving in a central potential.

For the purpose of training, we minimize the binary cross-entropy loss function between the SNN and the label on the training set. After training is complete, the generalization performance is measured on the test set. While training the SNN, we enforce weight sharing in the network pair to make sure these networks learn the same function. We note that, in the context of our physical examples, a natural minimization of the binary cross-entropy loss function is achieved if learns a symmetry invariant or a conserved quantity.

After having successfully trained the SNN, our goal is to answer the question of which features this neural network bases its decision. In general, there is no easy answer to this question, since analyzing even small neural networks can be extremely challenging. So far, there does not exist a comprehensive theory of what is learned by artificial neural networks.

One of our crucial insights is that in order to interpret what our SNN learns, we have designed the SNN to include a bottleneck at the output of the first building block before merging (see Fig. 1). We will see later that our SNN learns conserved quantities and invariants at the bottleneck in order to make its decision about whether two samples belong to the same class. Further, by interpreting the network, we can predict conserved quantities and invariants with no additional prior knowledge.

If the number of neurons in the bottleneck layer increases, one can achieve better accuracy at the cost of interpretability. The interpretability can, in principle, be retained if one enforces decorrelated intermediate outputs.

More details about our neural network architecture and the learning procedure can be found in Appendix B.

III. PERFORMING MACHINE LEARNING

A. Spacetime in Special Relativity

The first physical system we consider in this work is the Minkowski spacetime in special relativity. An event is a four vector that combines spatial coordinates and a moment in time. Minkowki space-time is with a scalar product induced by the metric diag,

where we have used . Thereby we define the spacetime interval s by

The Lorentz Group is the set of transformations which preserve the scalar product on Minkowksi spacetime

and thus also preserve the spacetime interval.

In this section, we discuss how to teach the neural network to identify, in special relativity, whether two observations by different observers correspond to the same event. These observers are at the same position but move with a relative velocity in some direction. For this purpose, we prepare positive training data of pairs of obser-

Figure 3: Special Relativity: Correlation between the inter- mediate output of the siamese neural network at the bottleneck layer and the spacetime interval.

vations that correspond to the same event, and negative data where a pair of measurements does not describe the same event.

More specifically, in order to train our neural networks with data we prepare a training dataset consisting of pairs of measurements of the same event in Minkowski spacetime seen from two different observers Λ . Here, Λ is a random Lorentz transformation which is sampled from all possible Lorentz transformations. More details can be found in Appendix A. We sample 50000 spacetime events and Lorentz transformations Λ to create pairs of events that form the positive dataset. We associate with each pair the label y = 0. Further, we create a negative dataset where each pair of spacetime coordinates is not related by a Lorentz transformation. In practice, we implement this by randomly permuting among all second elements of all pairs of spacetime events in the positive dataset. Each pair in the negative dataset is labeled with y = 1. In addition to this training set, we prepare a similar test dataset of 5000 positive pairs and 5000 negative pairs.

The SNN is trained to predict if a pair of observations describe the same event or not. This is done by optimizing the weights of the neural network via backpropagation to minimize the binary cross-entropy loss between network output and true label . After training, the neural network is able to correctly predict if a pair of observations belong to the same event with an accuracy of 94% on the training dataset and 92% on the test dataset. The training and testing accuracies during training can be seen in Fig. 9.

Following the successful training of our SNN, we want to understand what the neural network has learned. This can be achieved by examining the intermediate output of the neural network, which acts as an interpretable bottleneck. We perform a hierarchy of linear regressions with

Table I: Regression scores of the regression on the intermedi- ate output in the case of special relativity, measures normalized distance between regression and data. The score metric is known as the coefficient of determination or score. Best possible score is 1, the score can be negative.

polynomial features (i.e. polynomial regression) on the intermediate output with respect to the input. If we assume that the Taylor expansion of the decision function is sufficiently accurate at the decision boundary, we can hope to get insightful results.

We perform ridge regression with polynomial features of the input on the intermediate output of the SNN. We start with polynomials of degree 1 and increase the degree of the polynomial features until the regression becomes accurate. From Table I one can immediately infer that the optimal degree of the polynomial features is 2.

The result of the regression is in an ordered manner is

We can see that four nontrivial features dominate all others. If we assume that the regression includes small approximation errors, we can infer that the SNN has learned the invariant quantity . This quantity is the spacetime interval, a known invariant of the Lorentz group. In cases where the regression does not yield a clear result, one can cross-check the second-order regression result with higher orders of regression, and observe if the dominant features stay the same. Another option is to do the whole training procedure with a different random seed and see what parts of the results keep the same ratio.

To summarize, as long as the ANN is only able to use a single scalar function to decide if two events are the same, it calculates the spacetime interval. If the space-time interval is the same, the ANN predicts that both coordinates in a pair belong to the same event. While it is often difficult to decide if neural networks learn to “understand” physical concepts to make decisions, here we argue that our SNN does so. To confirm our derivation, we draw a scatter plot for a subset of our data points of the intermediate output versus the spacetime interval in Fig. 3 and observe a nearly perfect non-linear correlation between these two. Note that we have cross-checked the second-order regression result with higher orders of regression, and found that the dominant features stay the same.

Finally, we examine whether the SNN can also learn a different quantity to decide if two observations from different observers belong to the same event. For this purpose, we again prepare a training and a test dataset, as explained above. However, in the preparation of the dataset, we keep the spacetime interval fixed. We attempt to train the SNN to learn to associate corresponding observations. However, the ANN fails to train in this case. After the best training cycle, the ANN can only predict if two observations belong to the same event with an accuracy of 58% on the training set or 57% on the testing set, which is barely better than random. This fact leads to the conclusion that the SNN is unable find another invariant of the Lorentz group besides the space-time interval.

The fact that the SNN fails to distinguish observations in this reduced dataset hints that all observations with the same space-time interval can be transformed into each other by a Lorentz transformation. Further, it indicates that there is no other symmetry invariant. Both of these statements are of course known to be true. However, one needs to be careful since the same conclusions could be drawn if the neural network is not powerful enough to learn an underlying invariant.

B. Motion in a Central Potential

As a second system we consider the motion of a particle in a central potential, such as the movement of a planet in the gravitational potential of the sun. Newtonian gravity can be formulated via the Hamiltonian

Here p is the momentum, is the distance from the potential centre, m is the mass of the planet, M is the mass of the sun, and G is Newton’s constant of gravitation. Given an initial position x and velocity v one can calculate the trajectory of motion by solving Hamilton’s equations

There are conserved quantities in this system, the energy E the components of the angular momentum L and the components of the Laplace-Runge-Lenz vector A. They are related by two equations which effectively reduces the

number of scalar conserved quantities to five.

Figure 4: Particle in central potential: correlation between the intermediate output and (a) the angular momentum or (b) the energy.

When examining the motion of particles in a central potential, the SNN is tasked with determining whether two observations of the same particle correspond to the same particle trajectory.

We simulate particles of fixed mass m moving in a Newtonian static gravitational potential produced by a stationary mass M by solving the Hamiltons equations for a set of random initial positions and velocities. For simplicity we set m = 1 and GmM = 1. We measure the position and the velocity of the particle at two different times to get pairs of inputs . We generate 50000 pairs belonging to the same particle trajectories to form the

Figure 5: Particle in central potential with fixed angular mo- mentum: correlation between the intermediate output and the energy.

Table II: Regression scores of the Regression on the interme- diate output in the case of the motion of a particle in a central potential.

positive training dataset labelled by 0. By permuting the second entry in the pairs, we create a negative dataset labeled with 1. Similarly, a testing set is produced with 5000 positive and 5000 negative examples. The SNN is then trained to correctly predict if a pair of coordinates belong to the same trajectory. After being successfully trained, the network achieves an accuracy of 98% on the training set and 97% on the test set. In order to interpret on what quantity the neural network bases its decision, we again examine the bottleneck at the intermediate output. We again perform a hierarchy of linear regressions with increasing polynomial features on the intermediate output. The optimal degree of the regression is two (see Table II). The result of the regression in an ordered manner is

This quantity is an approximation to the angular momentum . A confirmation of this result is visualized in the very good correlation between the angular momentum and the intermediate output, illustrated in Fig. 4 (a). This means that the SNN learns to distinguish between pairs originating from the same trajectory and different trajectories, by calculating the angular momentum. Another conserved quantity in this system is the energy. In Fig. 4 (b), we see that the intermediate prediction is not correlated with the energy.

We now fix the angular momentum and perform the simulation again to produce 50000 positive and 50000 negative data pairs. We train the SNN again to distinguish if a pair of observations belong to the same trajectory. Even though the neural network cannot use the angular momentum to determine if the pair corresponds to the same trajectory, it still manages to perform well on this task. The SNN achieves an accuracy of 99% on both the training and the test set.

When using linear regression with polynomial features to determine what the SNN has learned in order to make its prediction we fail. On the one hand, there is no clear optimal degree of the polynomial regression. On the other hand, all the regression results do not yield a clear dominant feature. If we compare the intermediate output to the remaining invariants in the system, we find that the intermediate output is strongly correlated with the energy of the system – see Fig. 5. This means that the SNN probes the pair of observations for energy conservation. However, the energy cannot be well approximated by a polynomial function with which we perform our regression.

To circumvent this problem we extend the input features to include the term 1such that the input feature vector reads . From this point we can start the polynomial regression, go on to identify the best polynomial order (see Table III) and formulate the findings in the following equation

We see that the result includes the energy and the angular momentum. The angular momentum evaluates to a constant. Since constants can be absorbed, we conclude that the SNN has learned energy conservation. One might ask whether the SNN is able to find the Laplace-Runge-Lenz vector, which remains open for further investigation.

Table III: Regression scores of the Regression on the inter- mediate output in the case of the motion of a particle in a central potential with fixed angular momentum.

C. Electromagnetism

Finally, we consider electric E and magnetic fields B, and their behaviour under Lorentz transformations. For this purpose we incorporate the fields in the electromagnetic field strength tensor

The Lorentz transformation of the field strength tensor

implies the transformations for the electric and magnetic fields. The known Lorentz invariants of the electromagnetic fields are the determinant of the field strength tensor detF and .

In this section, we study the behaviour of electromagnetic fields under Lorentz transformations with SNNs. For this purpose, we again produce 200000 true pairs of electric and magnetic field configurations which are connected by a Lorentz transformation, and 200000 negative pairs of fields by permuting the positive pairs.

We again train the SNN to predict if the two measurements belong to the same field configuration. After having successfully trained the neural network, we find that the neural network can fulfill the task to the high accuracy of 95% on the training set and 94% on the test set.

In order to determine what the neural network has learned, we perform polynomial regression on the intermediate output of the neural network. The function

Figure 6: Electromagnetism: Correlation between the inter- mediate output and (a) the determinant of the field strength tensor or (a) a specific contraction of two field strength tensors.

Figure 7: Electromagnetism with fixed determinant of the field strength tensor: Correlation between the intermediate output and a specific contraction of two field strength tensors.

Table IV: Regression scores of the Regression on the intermediate output in the case of Electromagnetism.

Table V: Regression scores of the Regression on the inter- mediate output in the case of Electromagnetism with fixed determinant of the field strength tensor.

which approximates the output best is of degree two (see Table IV) and is given by

This function is an approximation to a known invariant, the determinant of the field strength tensor detF. A confirmation of this deduction is the correlation between detF and the intermediate output as depicted in Fig. 6.

Let us perform the same experiment again, however this time we fix the determinant of the field strength tensor when sampling the pairs of electromagnetic field configurations. The neural network still trains successfully and performs well in identifying pairs of data belonging to the same fields, with an accuracy of 93% on the training set an 92% on the test set. Performing the bottleneck regression on the intermediate output of the neural network reveals the remaining invariant to be of degree two (see Table V) and is approximated by

This function is another known invariant of the field strength tensor , confirmed in Fig. 7. To summarize, in the context of electromagnetism, we have revealed the two invariants of the electric and magnetic fields which are preserved under Lorentz transformations.

IV. CONCLUSIONS AND FUTURE DIRECTIONS

We have introduced siamese neural networks (SNNs) to the field of theoretical physics. They are successful in predicting whether two data instances are connected by a deterministic transformation. We examined space-time events and electromagnetic fields which transform under Lorentz transformations, as well as the movement of particles in a central potential. By interpreting our neural network, we found that it learns the underlying symmetry invariants and conserved quantities to perform its prediction. Most interestingly, we were able to interpret our SNNs via the use of polynomial regression. This procedure revealed an excellent approximation of the underlying symmetry invariants and conserved quantities. These invariants range from the spacetime interval over angular momentum conservation to the determinant of the field strength tensor. If the underlying system does not contain human readable invariants, the neural network could act as an approximation to such an invariant.

Future directions of this work include an upgrade of the polynomial regression to symbolic regression [45]. Another exciting direction is to combine interpretable SNNs with semi-automated mathematical reasoning tools, e.g., solvers or theorem provers. The idea is to check the physical law learned by the SNN for consistency against known laws and invariants by leveraging such reasoning tools [46]. It does not take much imagination to envision how this technology can be used in applications such as quantum error correction or in particle tracking at the LHC.

It remains to be seen if SNNs will ever find an invariant or conserved quantity unknown to modern physics. Even if this does not happen, the contribution of this work is the introduction of SNNs as a useful tool in theoretical physics. Furthermore, we challenged the black box nature of artificial neural networks by a very clear interpretation that reveals polynomial quantities without prior knowledge. The interpretation procedure might also be adopted into the field of computer science, where the interpretability of neural networks poses a major problem.

V. ACKNOWLEDGEMENTS

We thank Isaac Tamblyn for helpful discussions. RGM and JS are supported by NSERC. RGM is further supported by the Canada Research Chair program, and the Perimeter Institute for Theoretical Physics. We thank the National Research Council of Canada for their partnership with Perimeter on the PIQuIL. Research at Perimeter Institute is supported in part by the Government of Canada through the Department of Innovation, Science and Economic Development Canada and by the Province of Ontario through the Ministry of Colleges and Universities.

Appendix A: Lorentz Transformation

Let us describe the representation of the Lorentz transformations which are used to generate the data pairs in the special relativity and electromagnetism sections.

An arbitrary Lorentz transformation can be decomposed

Here Λis a Lorentz boost in x direction.

where

c is the speed of light which we conveniently set to c = 1.

The matrices perform the rotation in the three dimensional subspace

Appendix B: Neural Network Details

In this section we explain the details of the training of the SNN on pairs of data with a number of datapoints N between 50000 an 200000. For the sake of understandability we use the same architecture and hyperparameters for all learning tasks. The architecture of the SNN is depicted in Fig. 8.

The training of the neural network is the adjustment of the weights and biases of the neural network to achieve a minimum of the binary cross entropy loss function for all N training datapoints.

where denotes the true label, while is the neural network prediction. Our neural networks are trained using the Adadelta optimizer. We found that starting learning rates of lr = 100 are needed to train the neural network, this learning rate is a lot higher than normally used in traditional classification problems. Each update is performed by calculating the gradient on a batch of size 256. We employ learning rate decay callbacks which reduce the learning rate by a factor of 2 if the training loss has not improved for 50 epochs. We train our networks for 10000 epochs however, we employ an early stopping callback which aborts the training process if the training loss has not improved over 200 epochs. We do not use any kind of regularization in our neural networks. The evolution of the losses and accuracies during training are depicted in Fig. 9

Figure 8: Detailed architecture of an interpretable siamese neural network.

Appendix C: A Note on Interpreting Neural Networks

If a neural network intrinsically learns a physical observable O(x) as a function of the input data x, this observable is often encoded in an elusive manner distributed among many neurons. The bottleneck interpretation forces all information of this observable through a single neuron. In general, this observable is encoded in a deformed manner such that the output of the bottleneck neuron is h(O(x))). If we restrict ourselves to a small output range, the function h can be linearized such that . This form helps us to perform linear regression and and can be identified as adjustments to the weights and bias of the neuron.

Figure 9: Losses and Accuracies

[1] J. Carrasquilla and R. G. Melko, Nature Physics 13, 431 (2017).

[2] E. P. L. van Nieuwenburg, Y.-H. Liu, and S. D. Huber, Nature Physics 13, 435 (2017).

[3] L. Wang, Physical Review B 94 (2016). [4] S. J. Wetzel, Physical Review E 96 (2017). [5] Y. Zhang and E.-A. Kim, Physical Review Letters 118 (2017).

[6] F. Schindler, N. Regnault, and T. Neupert, Physical Review B 95 (2017).

[8] T. Ohtsuki and T. Ohtsuki, Journal of the Physical So- ciety of Japan 86, 044708 (2017).

[9] P. Broecker, J. Carrasquilla, R. G. Melko, and S. Trebst, Scientific Reports 7 (2017).

[10] D.-L. Deng, X. Li, and S. D. Sarma, Physical Review B

[11] K. Ch’ng, J. Carrasquilla, R. G. Melko, and E. Khatami, Physical Review X 7 (2017), 10.1103/physrevx.7.031038.

[12] P. Huembeli, A. Dauphin, and P. Wittek, Physical Re- view B 97 (2018).

[13] G. Torlai and R. G. Melko, Physical Review B 94 (2016). [14] G. Carleo and M. Troyer, Science 355, 602 (2017).

[15] E. M. Inack, G. E. Santoro, L. Dell’Anna, and S. Pilati, Phys. Rev. B 98, 235145 (2018).

[16] M. Hibat-Allah, M. Ganahl, L. E. Hayward, R. G. Melko, and J. Carrasquilla, http://arxiv.org/abs/2002.02973v2 .

[17] J. Carrasquilla, D. Luo, F. P´erez, A. Milsted, B. K. Clark, M. Volkovs, and L. Aolita, http://arxiv.org/abs/1912.11052v1 .

[18] F. Ferrari, F. Becca, and J. Carrasquilla, Physical Review B 100 (2019).

[19] O. Sharir, Y. Levine, N. Wies, G. Carleo, and A. Shashua, Physical Review Letters 124 (2020).

[20] M. Schmidt and H. Lipson, Science 324, 81 (2009). [21] R. Iten, T. Metger, H. Wilming, L. del Rio, and R. Renner, Physical Review Letters 124 (2020).

[22] S. J. Wetzel and M. Scherzer, Physical Review B 96 (2017).

[23] P. Ponte and R. G. Melko, Physical Review B 96 (2017). [24] J. Greitemann, K. Liu, and L. Pollet, Phys. Rev. B 99, 060404 (2019).

[25] Y. ichi Mototake, http://arxiv.org/abs/2001.00111v1 . [26] S.-M. Udrescu and M. Tegmark, http://arxiv.org/abs/1905.11481v1 .

[27] C. Wang, H. Zhai, and Y.-Z. You, Science Bulletin 64, 1228 (2019).

[28] S. M. Lundberg and S.-I. Lee, in Advances in Neural In- formation Processing Systems 30, edited by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Curran Associates, Inc., 2017) pp. 4765–4774.

[29] D. Gunning, Defense Advanced Research Projects Agency (DARPA), nd Web 2 (2017).

[30] G. Montavon, W. Samek, and K.-R. M¨uller, Digital Sig- nal Processing 73, 1 (2018).

[31] M. T. Ribeiro, S. Singh, and C. Guestrin, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '16 (ACM

[32] K. Simonyan, A. Vedaldi, and A. Zisserman, in Work-

[33] S. Bach, A. Binder, G. Montavon, F. Klauschen, K.-R. M¨uller, and W. Samek, PLOS ONE 10, e0130140 (2015).

[34] M. D. Zeiler and R. Fergus, in Computer Vision – ECCV 2014 (Springer International Publishing, 2014) pp. 818– 833.

[35] A. Mordvintsev, “Deepdream - a code example for visu- alizing neural networks,” (2015).

[36] D. Kim and D.-H. Kim, Physical Review E 98 (2018). [37] P. Suchsland and S. Wessel, Phys. Rev. B 97, 174435 (2018).

[38] Y. Zhang, P. Ginsparg, and E.-A. Kim, http://arxiv.org/abs/1912.10057v1 .

[39] S. Bluecher, L. Kades, J. M. Pawlowski, N. Strodthoff, and J. M. Urban, http://arxiv.org/abs/2003.01504v1 .

[40] C. P. Burgess, I. Higgins, A. Pal, L. Matthey, N. Watters, G. Desjardins, and A. Lerchner, http://arxiv.org/abs/1804.03599v1 .

[41] J. Bromley, J. W. Bentz, L. Bottou, I. Guyon, Y. Lecun, C. M. an Eduard S¨ackinger, and R. Shah, International Journal of Pattern Recognition and Artificial Intelligence 07, 669 (1993).

[42] S. Chopra, R. Hadsell, and Y. LeCun, in 2005 IEEE

[43] Y. Taigman, M. Yang, M. Ranzato, and L. Wolf, in

[44] S. Appalaraju and V. Chaoji, http://arxiv.org/abs/1709.08761v2 .

[46] J. Scott, M. Panju, and V. Ganesh, in Procedings of the 2020 Conference of the Association for the Advancement of Artificial Intelligence (AAAI, 2020).