Prediction and optimization of mechanical properties of composites using convolutional neural networks

2019·Arxiv

Abstract

Abstract

In this paper, we develop a convolutional neural network model to predict the mechanical properties of a two-dimensional checkerboard composite quantitatively. The checkerboard composite possesses two phases, one phase is soft and ductile while the other is stiff and brittle. The ground-truth data used in the training process are obtained from finite element analyses under the assumption of plane stress. Monte Carlo simulations and central limit theorem are used to find the size of the dataset needed. Once the training process is completed, the developed model is validated using data unseen during training. The developed neural network model captures the stiffness, strength, and toughness of checkerboard composites with high accuracy. Also, we integrate the developed model with a genetic algorithm optimizer to identify the optimal microstructural designs. The genetic algorithm optimizer adopted here has several operators, selection, crossover, mutation, and elitism. The optimizer converges to con-figurations with highly enhanced properties. For the case of the modulus and starting from randomly-initialized generation, the GA optimizer converges to the global maximum which involves no soft elements. Also, the GA optimizers, when used to maximize strength and toughness, tend towards having soft elements in the region next to the crack tip.

Keywords: Machine learning, Convolutional neural networks, Mechanical

properties, Genetic algorithm, Checkerboard composites

1. Introduction

The pursuit of materials possessing robust properties has been of high sci-entific and industrial interests to meet the requirement of modern engineering applications necessitating advanced composite materials [1, 2, 3]. There are several approaches to manufacture composite materials. For instance, the widely used laminate composites are layers sequentially stacked to achieve the desired properties under predetermined loading conditions. The difficulty in achieving a strong adhesion between base materials has confined the laminate stacking process and limited the expedition to explore new composite materials. Also, composites have been made by mixing in fillers in a molten matrix. Another prominent approach is to rely on recent advances in additive manufacturing that enables the fabrication of complex combinations of distinct materials and tuning their properties in three spatial directions. With this freedom in fabricating composite materials, researchers have more flexibility to design materials with superior properties [4, 5].

Generally, one can develop new materials with desired properties through inspiration from natural and biological systems, optimization algorithms, or by combining both. Human bone is an example of a multifunctional material which achieves excellent mechanical properties due to its several distinct hierarchical levels [6, 7, 8]. Bone consists of a hard shell and soft core, where cortical bone (stiff) embraces trabecular bone (soft). Bioinspiration and optimization provide superior material properties through the proper selection of the constituent materials and their volume fractions and identification of the optimal geometric configurations (spatial distributions of the constituents). One can combine the different approaches to optimize the performance of the materials further. In the past, researchers extensively studied the development of new materials by identifying the proper constituent materials and corresponding volume fraction, and such techniques are almost mature. Developed materials usually have taken the form of composites and cellular materials, and they yield properties not available in their bulk material counterparts [9, 10, 11, 12, 13]. Although the foci of this paper are on the prediction and optimization of composite materials’ properties, many analogies can be drawn to developing cellular materials with enhanced properties. Composite materials with randomly distributed constituents are prevalent and intensively studied by many researchers not only because of their rich physics and complexity compared to ones with deterministic distributions, but also because they are widely spread in nature [14, 15, 16].

Employing brute force (also known as exhaustive search) algorithms to obtain optimized materials is not practical in most of the cases due to the enormous design space. Brute force algorithms are based on attempting all possible designs to identify composite materials with optimal microstructural material distributions. A more efficient approach is to use topology optimization algorithms to optimize mechanical properties. Most topology optimization algorithms available in the literature are gradient-based [17, 18]. Both approaches mentioned above require solving many numerical simulations, and each simulation commonly takes from seconds to hours depending on the complexity of the problem. However, such techniques have an inherent limitation in terms of computational cost due to a large number of design variables and/or difficulties in finding the gradients in the case of gradient-based optimization. Using another platform which is faster than the finite element analysis and other available techniques such boundary element method to predict the mechanical properties of composite materials may revolutionize the field of composite materials’ optimization.

Machine learning has been proven to be a potent tool in various applications [19]. Machine learning is a statistical and predictive tool that helps to better perceive the behavior of a particular set of data and a problem. Recently, machine learning has been intensively used in spam detection, speech and image detection, search engines, and disease and drug discoveries [20, 21]. Applications of machine learning are not limited to the ones above; machine learning has been employed to predict the properties of different structural and material systems and to search for new materials with optimal designs [22, 23, 24, 25].

Deep neural networks (DNNs) are one of the most powerful machine learning techniques. DNNs are inspired by architectures of biological neural networks simulating the way we humans learn from data. DNNs usually have an input layer, hidden layers, and an output layer. Several studies used DNNs to investigate material and structural behavior. Do et al. [26] used deep neural networks to supersede the finite element analysis while solving optimization problems (buckling and free vibration with various volume constraints) of functionally graded plates. In a different work, Nguyen et al. [27] used a deep neural network model for predicting the compressive strength of foamed concrete.

Additionally, Kim et al. [28] argued that deep learning networks could be employed to capture the nonlinear hysteretic systems without compromising accuracy. Although the adopted approach is generic and is suitable for any system with hysteresis, they applied it to the prediction of structural responses under earthquake excitations. Gopalakrishnan et al. [29] utilized deep neural network models trained on a large set of images, and then they transferred their learning capability to pavement crack detection using digitized pavement surface images. Such an approach is known as transfer learning in the field of machine learning. Additionally, Lee et al. [30] used single-layered and multilayered perception networks to study the well-known ten bar truss problem, and they investigated the effect of different hyper-parameters.

Recently, Gu et al. [31] used machine learning to study two mechanical properties (strength and toughness) of two-dimensional (2D) checkerboard composites possessing two constituent materials. The authors argued that machine learning is a very promising tool for investigating the mechanical properties of composite materials, and it can be incorporated in optimization algorithms to find designs with optimal performance. The authors developed two binary classifiers using a single layer convolutional neural network and a linear model to conclude whether a composite material with a specific material distribution yields good or bad mechanical properties. They found that a small proportion of the design space can be utilized to train the machine learning models that are capable of classifying the performance of the two-dimensional (2D) composites with high accuracy.

In this paper, we extend the work of Gu et al. [31]. Their sample space has 35960 composite configurations as they fixed the volume fractions of the phases; 12.5% of the volume fraction is assigned to be a soft material, and the remaining 87.5% is assigned as the stiff material. Also, the model they developed classifies a given composite as good or bad (qualitative prediction) rather than capturing the mechanical properties quantitatively. In this work, we develop a convolutional neural network model (CNN) that is capable of quantitatively predicting the mechanical properties of the composite over the entire volume fraction space. CNNs are a class of DNNs, and they are chosen because they have proven to be very successful in image recognition tasks. In our problem, we represent our checkerboard composites as images exploiting the robustness of CNNs. Accounting for all volume fractions dramatically enlarges the sample space as we discuss in the following sections. Also, from a practical point of view, it is more useful to have a model that predicts the performance of the composites in terms of real values rather than a set of classes obtained from a classification model. Moreover, we integrate an optimization scheme based on genetic algorithm with the developed CNN model to optimize the mechanical properties with respect to the volume fractions of the stiff and soft materials and their spatial distribution in the microstructure.

2. Methodology

A convolutional neural network model is developed to quantitatively predict the stiffness, strength, and toughness of a 2D checkerboard composite system composed of two materials, one is soft and ductile, and the other is stiff and brittle. CNN models usually need a significant amount of data to provide accurate predictions. After training the CNN model, it is tested against new data unseen by the model during the training process. Our CNN model is trained on data generated from a finite element (FE) analysis. Subsection 2.1 talks about the boundary value problem (BVP) and FE analysis while subsection 2.2 discusses the sample space and training and testing datasets. Subsection 2.3 scrutinizes the architecture of the CNN model including the different layers and model parameters whilst subsection 2.4 states the loss function and metrics used to evaluate the performance of the developed CNN model. Subsection 2.5 discusses the genetic algorithm used to find the optimal composite configurations.

2.1. BVP and FE analysis

The composite of interest is a 2D checkerboard system with two materials, soft and stiff. Both materials are assumed to be linear elastic and isotropic.

The equilibrium equation in the absence of inertial and body forces is given by

where are the components of the Cauchy stress tensor while ()is the divergence operator. Under the realm of linear elasticity, the constitutive relationship

is expressed as

where represent the components of an infinitesimal strain tensor, and are the components of the fourth-order elasticity tensor. In case of threedimensional analysis, i, j, k, l = 1, 2, 3. Since small deformation is assumed,

the strain is given by,

where are displacement components. is defined by two constants when

isotropic materials are assumed,

where E is Young’s modulus, is Poisson’s ratio, is the bulk mod- ulus, and is the Kronecker delta. Assuming plane stress condition reduces

the governing equations to,

has a value of E = 1 GPa if the material is stiff and a value of E = 0.1 GPa

if the material is soft. Poisson’s ratio for both stiff and soft materials is set to be = . The failure strain of the stiff and brittle material is 10% while the failure strain of the soft and ductile material is assumed to be 100%. The composite system we are considering has a preexisting edge crack of 25% of the specimen length in the y-direction. Figure 1 depicts the crack, applied displacement boundary conditions, and plane of symmetry. Since symmetry is assumed, half of the problem is needed to be solved. We calculate three effective properties for the cracked composite system: modulus, strength, and toughness. Modulus is defined as the slope of the stress-strain curve of the cracked composite system while the strength is the maximum stress achieved by the system. The toughness is defined as the area under the stress-strain curve of the cracked composite system. In other words, it is defined as the energy needed to initiate the propagation of the crack. Classically these properties are defined in the literature when there is no preexisting crack. These quantities

are defined here differently

Finite element analysis (FEA) is done to evaluate the properties of interest. The FE simulation is stopped when the von Mises strain () at the element at the crack tip reaches the failure strain of the corresponding material. The

von Mises strain is defined as

Four-node elements are used in the FEA where each element has four quadrature points, and each node has two degrees of freedom. Two different sizes of the cracked composite system are adopted: 88 and 1616 system sizes. Figure 2 portrays examples of the randomly generated configurations. The finite element analysis is carried out using an in-house developed MATLAB code operated in the parallel computing toolbox for efficient data generation purposes. The code is validated by comparing the results obtained with those obtained from the open-source code FEAP [32].

Figure 1: Illustration of the boundary value problem. The solid line shows the edge crack, l/L = 0.25. The dashed line represents the plane of symmetry while the black arrows show the applied displacements.

Figure 2: Examples of the randomly generated composite configurations: An example of the a) 8x8 grid and b) 16x16 grid.

2.2. Data description and processing

Finite element analysis is performed to generate the required data for this study. We consider two grid sizes, 8x8 and 16x16 elements. A large number

Table 1: 8x8 grid statistics

of possible composite configurations corresponding to both grid sizes are examined by the FE method, and the resulting properties (modulus, strength, and toughness) of each configuration are stored. Configuration here refers to the microstructure which includes information about the volume fractions and spatial distributions of the phases. The sample space for the 8x8 grid with twomaterials composites has a size of 2composite configurations while the 16x16 grid results in a sample space with a size of 2possible configurations. Due to the vastness of the sample space size, we rely on the central limit theorem for sampling our data. The sampling is performed in a batch-by-batch manner where all batches have the same fixed size where each batch is drawn from the sample space using a uniform distribution. Based on the central limit theorem, as the number of batches becomes larger, the distribution of means evaluated from repeated sampling converges to a normal distribution. The normal distribution is obtained regardless of the original distribution of the data as shown in Figures 3 and 4. We end up with 4.3 million (M) data points for the 8x8 grid and 4.9M for the 16x16 grid. Tables 1 and 2 illustrate some key statistical information regarding the sampled data distributions. A perfect normal distribution has a skew and kurtosis of zero. The skews for the data corresponding to the 8x8 grid are 0.035, 0.112, and 0.085 for the modulus, strength, and toughness, respectively. The kurtoses for the data corresponding to the same grid are -0.041, -0.018, and -0.063 for the modulus, strength, and toughness, respectively. On the other hand, the skews for the data corresponding to the 16x16 grid are 0.031, 0.138, and 0.054 for the modulus, strength, and toughness, respectively. The kurtoses for the data corresponding to the same grid are 0.017, 0.035, and 0.060 for the modulus, strength, and toughness, respectively.

Table 2: 16x16 grid statistics

Figure 3: Histogram of the means obtained by repeated sampling for the 8x8 composite material.

Figure 4: Histogram of the means obtained by repeated sampling for the 16x16 composite material.

Furthermore, Monte Carlo analysis was performed to understand the data better and estimate the means (modulus, strength, and toughness means) of the entire sample spaces (populations) for the 8x8 and 16x16 grids. Figures 5 and 6 below illustrate the convergence of the population means for the 8x8 and 16x16 grids, respectively. It can be observed that after some number of batches the mean of the values stops fluctuating and stabilizes around a specific value, and this indicates that we have a good representation of the sample space.

Figure 5: Monte Carlo analysis of the population means: 8x8 grid.

Figure 6: Monte Carlo analysis of the population means: 16x16 grid.

After acquiring the data from the FEA, some preprocessing steps are required to prepare the data to be fed into the CNN model. A matrix represents the data in its raw form (as acquired from the FEA). The rows of that matrix are the training examples (distinct configurations), and the columns represent the binary distribution of the materials (0 is stiff, and 1 is soft). On the other hand, each training example has a corresponding label vector containing the three material properties: modulus, strength, and toughness.

For our CNN model, the microstructure (two-material distribution) is de-fined as the feature (x) while the properties (modulus, strength, and toughness) of that specific composite are the label (y) to that feature. To be fed into the CNN model, each feature is transformed into an image form representing the spatial distribution of each material (8x8 and 16x16 grids) on which convolutions can then be performed to train the CNN model.

2.3. Convolutional neural network model

Convolutional neural network is a widely-used class of deep neural networks that is remarkably successful in image recognition tasks. It has been widely used and incorporated in a lot of different disciplines. The main idea of a convolutional neural network is the presumption that the input data are images or can be interpreted as images. This helps in reducing the number of parameters significantly and thus results in faster processing.

A typical convolutional neural network usually consists of the following layers: convolutional layers, pooling layers, fully connected layers, and activation functions. In the convolutional layer, a filter is applied to the input image through a convolution operation which preserves the spatial relationships between pixels in the image. On the other hand, a pooling layer applies a combining operation to the input such as a maximum pooling layer which outputs the maximum value element in each window. Fully connected layers and activation functions are similar to those used in simple DNN architectures. Additionally, other layers can be added to the CNN such as dropout layers which are a method of regularization used to reduce model overfitting [33].

We developed two CNN architectures, one for each grid size, where both architectures have six composite layers, a dropout layer, and a fully connected layer. Each composite layer comprises a 2D convolutional layer, a 2D batch normalization, and a rectified linear unit (ReLU) activation function. Kernel sizes of the convolutional layers are different in the two architectures. The same hyper-parameters are used for the two models as shown in Table 3. Additionally, we use mini-batching to help the CNN models escape local minima and increase the convergence rate [34]. Figure 7 illustrates the architecture of the developed convolutional neural network.

Figure 7: Illustration of the CNN model used in the present study.

Table 3: Hyper-parameter values used to design the models.

2.4. CNN model evaluation

We develop and test our models using PyTorch framework [35]. Moreover, we use the Nvidia Pascal Titan XP GPU platform. In the training phase, we use the mean squared error cost function (MSE) to minimize the residual error between the model output and ground-truth data. The final model, which results after the training phase, uses the mean absolute percentage error (MAPE) to test the accuracy of the model when tested against new unseen data. Both the mean squared and mean absolute percentage errors are calculated as a discrepancy

measure between the actual and predicted values using Equations 7 and 8,

where ˆis the prediction obtained from the model, and is the actual (ground-truth) value.

2.5. Optimization using genetic algorithm

Genetic algorithm (GA) is a metaheuristic inspired by Darwin’s theory of evolution, and it belongs to the larger class of evolutionary algorithms. GA optimizers are often utilized to obtain optimal solutions relying on several bioinspired operators, namely selection, crossover, and mutation [36, 37, 38]. At each optimization step, the GA optimizer selects random (using the selection operator) individuals from the current generation to be parents producing children (offspring) for the next generation. The selection operator used here is based on the Roulette wheel mechanism where higher fitness values yield higher selection probabilities. Fitness is the value of the objective function in the optimization problem. The process of producing the offspring is done through the use of the crossover operator where the adopted crossover operator has two crossover points. On the other hand, the mutation operator applies random alterations to individual parents to form the children. The problem discussed here is binary-encoded since the genes have a value of either zero or one. Hence, mutated genes with an original value of zero end up with a value of one, and vice versa.

In this paper, we adopted the elitism operator in addition to the main operators mentioned earlier where the elitism operator ensures that the chromosomes with best fitness values carry over to the next generation regardless of being a parent or child. This operator is very crucial as it guarantees that the solution quality attained by the optimizer is not decreased from one generation to the next [39]. In the context of the present paper, a chromosome refers to a specific microstructure while a gene has the information about the material type at a particular element in the microstructure. The initial generation is selected randomly using a uniform distribution. The fitness evaluation of a microstructure (chromosome) is independent of the rest of the generation chromosomes. Thus, a generation can be evaluated easily in a parallel fashion. The procedure for the genetic algorithm used is portrayed in Figure 8.

Figure 8: Procedure for the genetic algorithm used.

3. Results and discussion

3.1. Linear model

We start by developing a linear model and use it as a baseline; the performance of the linear model is compared with the CNN model. The linear model developed here is used to quantitatively predict the properties of checkerboard composites, unlike the linear model developed by Gu et al. [31]. The linear

model takes the general form shown in Equation 9,

where Y is the material property in question, X is the composite material distribution in vector form, and A and B are the weights. The weights (A and B) are calibrated through the training process to fit each material property as accurately as possible.

Since we have two composite grids (8x8 and 16x16), and we calibrate each model for the three material properties, we end up with 6 linear models. For the 8x8 grid, the 4.3M data points are used. The resulting coefficient of determination () for the fitted line is 0.916. However, the mean absolute percentage errors portray the shortcomings and oversimplification of the problem by the linear model where mean absolute percentage errors exceeded 25% for the modulus and are as high as 40% and 200% for the strength and toughness, respectively. As for the 16x16 grid, the linear model results in a coefficient of determination () of 0.928, but the MAPE values remain too high with 24%, 32%, and 127% for the modulus, strength, and toughness, respectively. The high error values produced by the linear models further justify the choice to utilize convolutional neural networks for our prediction problem.

Furthermore, the linear model is used to qualitatively determine the spatial location of the most critical elements in the composite. The essential elements are defined as the elements that have the highest weights attached to them in the linear model, and thus changes to them have the highest impact on the properties of the resulting composites. Figures 9 and 10 below show the ranking of the weights extracted from the linear model. The weights are ranked in descending order, and the positive weights are assigned positive ranks and vice versa. Positive weights correspond to the soft material while negative weights correspond to the stiff material as shown from the distribution of the weights. This is shown in Figures 9 and 10 where the highest ranked weights are near the crack tip, and the one directly at the crack tip is positive (soft material) and next to it are negative weights (stiff materials), which is in line with the physics of this problem which shows that a soft material placed right at the crack tip followed by stiff materials next to it yields best results in terms of material properties. Although the linear model fails to quantitatively predict the mechanical properties of checkerboard composites with high accuracy, one can still conclude some qualitative aspects as discussed in the present study and

work of Gu et al. [31].

Figure 9: Weights ranking: 8x8 grid.

3.2. Convolutional neural network model

CNNs have been intensively used in image recognition and signal processing, and they can be utilized to extract features from datasets. However, CNN models have to be trained before the inference process. The training process is an optimization problem in which the loss function (MSE) is minimized through the selection of the optimum weights of the CNN model. To start the training process, a dataset with many examples is required, so an optimal mapping

Figure 10: Weights ranking: 16x16 grid.

approximating relationship between output and input data is determined. Here, we discuss the results obtained from the CNN model mentioned in subsection 2.3 and summarized in Figure 7. The CNN model is trained using 200 epochs where an epoch is an iteration in the training process of the CNN model; one epoch is concluded when the CNN model is trained on every training example in the training dataset. The use of 200 epochs means that the CNN model encountered each training example in the training dataset 200 times. Figures 11 and 12 depict the convergence history of the training and testing loss functions for the case of the 8x8 grid and 16x16 grid, respectively. From the displayed results, one can see that the MSE loss functions of all properties for the two grid sizes converge to very low values. Also, the difference between the training and testing losses after the 200 epochs is tiny which in turn indicates that there is no critical overfitting occurring.

The developed CNN model yields outstanding results for the defined problem, and it can predict the three material properties with very high accuracy. After the training process, we evaluate the performance of the CNN model using three parameters, the mean absolute percentage error, maximum error, and percentage of data points with an error greater than 5%. These parameters are calculated using the testing dataset which is not seen by the CNN model throughout the training process of the model. The performance of the CNN model is summarized in Tables 4 and 5. For both grid sizes, the MAPE of the modulus, strength, and toughness are less than 5%. A stricter parameter to evaluate the performance of the CNN model is the maximum error. The maximum errors for the modulus and strength are less than 5% while the maximum error for the toughness is larger than 5%. However, the number of data points that have an error larger than 5% is meager, 1.7% in the case of the 8x8 grid and 0.082% for the 16x16 grid. The results show that the CNN model outperforms the linear model discussed in subsection 3.1.

The excellent agreement between the results from the developed CNN model and ground-truth finite element results shows that machine learning algorithms in general and CNN models, in particular, have a high potential to be used in materials analyses and optimization. A possible extension to the current study is to build CNN models that capture the response of larger mechanical systems (e.g., 3D materials) and/or response of nonlinear materials. This leads us to another promising potentials of CNN models. Such CNN models can be integrated with optimization algorithms to find optimum solutions targeting various engineering applications. More details about optimization using CNN models are discussed in subsection 3.3.

Figure 11: The convergence history of loss functions: 8x8 grid.

Figure 12: The convergence history of loss functions: 16x16 grid.

3.3. Optimization using genetic algorithm

Gradient-based topology optimization comprises several algorithms such as optimality condition [40], sequential linear programming [41], and sequential quadratic programming [42]. Gradient-based optimization algorithms have fast convergence rates when used to search for an optimal solution. However, the sensitivity analyses of the objective functions and constraints are very challenging, especially if nonlinear materials are included [43, 44]. Also, most of the topology optimization problems are nonconvex. Hence, the selection of the

Table 4: Performance of the CNN model: 8x8 grid.

Table 5: Performance of the CNN model: 16x16 grid.

starting point is a critical step in the optimization process which in turn makes it not easy.

To overcome these issues, gradient-free optimization algorithms can be adopted. Generally, gradient-free optimization algorithms require many function evaluations when compared to gradient-based optimization algorithms. However, these function evaluations can be performed using parallel programming because each one is independent of the rest. Function evaluations may take a significant amount of time if the problem we are interested in solving is a nonlinear finite element problem. Therefore, developing CNN models that are capable of predicting the performance of materials and then integrating it with a gradient-free optimizer accelerate the optimization process significantly as the evaluation using a CNN model is way faster than the evaluation obtained from FE analysis [26].

Here, we integrate the CNN model discussed earlier with a GA optimizer to find the optimum composite configurations yielding maximum mechanical properties: modulus, strength, and toughness. Genetic algorithms are a compromise between strong and weak search methods [45]. Strong methods progress with the search in an informed manner by using gradient information while weak methods (e.g., random and exhaustive search) perform the search in an uninformed manner through the extensive sampling of the design space [46]. In contrast, GA operates with (a) a strong progression toward designs with optimal performance and (b) weak operations of probabilistic pairing, crossover, and mutation. Generally, GA progresses toward regions in the design space with

Table 6: Parameters used in the optimization process.

optimal performance without getting stuck in local optima (maxima/minima) in case of multimodal design space, a design space with multiple local optima.

Any successful optimization algorithm should have the right balance between the exploration and exploitation where exploration is related to global search (search throughout the design space for regions with good solutions), and exploitation is related to local search (solution refinement with the avoidance of big jumps) [47]. Going too far with exploitation yields solutions with local optima, not necessarily global ones, while going too far with exploration results in a very slow convergence rate and a tremendous number of function evaluations. In GA, one can balance between exploration and exploitation through the selection of the different parameters such as crossover probability, mutation probability, and elitism ratio.

In this paper, we discuss the optimization problem for the case of the 16x16 grid since the 8x8 grid is relatively simple. Hence, for the 16x16 grid, we have 128 genes (optimization parameters) after applying the symmetry boundary condition. Table 6 summarizes the parameters we adopted in solving the optimization problem. The parameters reported in Table 6 are used after tuning them based on a parametric study investigating the effect of different parameters on the optimal solutions obtained and convergence rate.

Figures 13-15 portray the top five composite configurations obtained from the GA optimizer when we optimize for modulus, strength, and toughness, respectively. The stopping criteria of the optimization problems are: (1) there is no significant change in the fitness function and (2) the maximum number of generations is met. Although the GA optimizer does not guarantee that we obtain the global optimum, it progresses toward the global optimum and approaches it with an appropriate tolerance. For the case of the modulus (see Figure 13) one concludes that the global optimum is met. From physics, the highest modulus is obtained when the composite is entirely composed of the stiff material. The configurations yielding top five modulus values have a volume fraction ranging from 0.00% to 2.34% of the soft material where the volume fraction of 0.00% of the soft material corresponds to the global maximum.

Figure 13: Genetic algorithm results: Modulus optimization. The volume fractions of the soft materials are: a) 0.0%, b) 0.78%, c) 0.78%, d) 1.56%, and e) 2.34%.

On the other hand, it is not straightforward to predict the optimum composite configuration for the case of strength and toughness. However, it is intuitive that we need a soft material in the region near the crack tip, and this is what Figures 14 and 15 are showing. Although the volume fractions of the phases

Figure 14: Genetic algorithm results: Strength optimization. The volume fractions of the soft materials are: a) 26.6%, b) 27.3%, c) 26.6%, d) 27.3%, and e) 25.8%.

are fixed in the work of Gu et al. [31], similar conclusion, the need for soft elements next to the crack tip, is obtained. They did not use GA for the optimization; they introduced an optimization scheme based on the weights of the linear model. We discuss the drawbacks of the linear model in subsection 3.1. The configurations yielding top five strength values have a volume fraction ranging from 25.8% to 27.3% of the soft material. For the case of toughness, the volume fraction of the soft material ranges from 39.8% to 41.4%.

Also, we have considered the case of multi-objective optimization. There are four possible combinations to simultaneously optimize for: a) modulus and strength, b) modulus and toughness, c) strength and toughness, d) modulus, strength, and toughness. The aggregate objective function (AOF), the function combining the different objectives into a scalar function, based on simple weighted sum of the objective functions performs poorly in case of non-convex

Figure 15: Genetic algorithm results: Toughness optimization. The volume fractions of the soft materials are: a) 40.6%, b) 39.8%, c) 41.4%, d) 41.4%, and e) 39.8%.

Pareto frontier [48]. One possible solution is to use compromise programming,

where n is the exponent of the objective functions, and it is selected to be n = 4 in this study. , and are the weights of modulus, strength, and toughness, respectively. In the case of optimizing for two properties, the weights corresponding to the two properties we are optimizing for are assigned a value of 0.5 while the weight of the third property is assigned a value of 0.0. In the case of optimizing for three properties, the three weights have a value of 0.333. Also, the different objectives (modulus, strength, and toughness) have different scales which may cause some biases to objectives possessing higher values. This issue is resolved by normalizing each objective (property) by its maximum value (obtained from single-objective optimization (see Equation 10).

Figure 16 and Table 7 show the results obtained from GA optimizer when multi-objective cases are considered. As concluded from the single-objective optimization, the modulus is maximum when there is no soft material. However, when we optimize for the strength and/or toughness in addition to the modulus, the optimizer tends to balance between the different objectives. Hence, the modulus obtained from the multi-objective optimization is significantly reduced due to the addition of soft materials, and the volume fraction of the soft material ranges from 23.4% to 32.8% depending on the properties we are optimizing. On the other hand, configurations with high values of strength and toughness have better harmony; optimizer tends to possess soft material in the region next to the crack tip. Consequently, less compromising is needed.

Figure 16: Multi-objective optimization using genetic algorithm: The volume fractions of the soft materials are: a) 23.4% (modulus and strength), b) 27.3% (modulus and toughness), c) 28.1% (strength and toughness), and d) 32.8% (modulus, strength, and toughness).

Table 7: Results of the multi-objective optimization using GA.

4. Conclusions

We develop a convolutional neural network model that is capable of quantitatively predicting the mechanical properties (modulus, strength, and toughness) of 2D checkerboard composites. The model is trained using finite element results (ground-truth data), and then it is tested on another dataset which is not seen by the model throughout the training process to ensure the validity of the model. The model shows very promising capabilities; it illustrates the potential of utilizing CNN models in structural and materials analysis. The developed CNN model is integrated with a genetic algorithm optimizer to obtain the composite configurations (material distribution and volume fraction) leading to materials with improved performance. CNN models have the potential of accelerating the current optimization techniques, and it might revolutionize the field of structural and materials design.

Acknowledgment

This research was partially supported by the NSF Center for Novel High Voltage/Temperature Materials and Structures [NSF I/UCRC (IIP-1362146)]. R.A. would like to acknowledge the support of the Beckman Institute at the University of Illinois to carry out this work during his visiting scholar residence. The authors thank Dr. Grace X. Gu for helpful discussions.

References

[1] K. K. Chawla, Composite materials: Science and engineering, Springer Science & Business Media, 2012.

[2] C.-T. Chen, F. J. Martin-Martinez, S. Ling, Z. Qin, M. J. Buehler, Nacre- inspired design of graphene oxide–polydopamine nanocomposites for enhanced mechanical properties and multi-functionalities, Nano Futures 1 (1) (2017) 011003.

[3] A. R. Studart, Biological and bioinspired composites with spatially tunable heterogeneous architectures, Advanced Functional Materials 23 (36) (2013) 4423–4436.

[4] X. Wang, M. Jiang, Z. Zhou, J. Gou, D. Hui, 3d printing of polymer matrix composites: A review and prospective, Composites Part B: Engineering 110 (2017) 442–458.

[5] I. Jasiuk, D. W. Abueidda, C. Kozuch, S. Pang, F. Y. Su, J. McKittrick, An overview on additive manufacturing of polymers, JOM 70 (3) (2018) 275–283.

[6] N. Reznikov, R. Shahar, S. Weiner, Bone hierarchical structure in three dimensions, Acta Biomaterialia 10 (9) (2014) 3815–3826.

[7] D. W. Abueidda, F. A. Sabet, I. M. Jasiuk, Modeling of stiffness and strength of bone at nanoscale, Journal of Biomechanical Engineering 139 (5) (2017) 051006.

[8] H. P. Schwarcz, D. Abueidda, I. Jasiuk, The ultrastructure of bone and its relevance to mechanical properties, Frontiers in Physics 5 (2017) 39.

[9] R. W. Hertzberg, R. P. Vinci, J. L. Hertzberg, Deformation and fracture mechanics of engineering materials, Vol. 89, Wiley New York, 1996.

[10] S. Khaderi, V. Deshpande, N. Fleck, The stiffness and strength of the gyroid lattice, International Journal of Solids and Structures 51 (23-24) (2014) 3866–3877.

[11] K. K. Chawla, Ceramic matrix composites, in: Composite Materials, Springer, 1998, pp. 212–251.

[12] O. Al-Ketan, A. Soliman, A. M. AlQubaisi, R. K. Abu Al-Rub, Nature- inspired lightweight cellular co-continuous composites with architected periodic gyroidal structures, Advanced Engineering Materials 20 (2) (2018) 1700549.

[13] T. Tancogne-Dejean, M. Diamantopoulou, M. B. Gorji, C. Bonatti, D. Mohr, 3d plate-lattices: An emerging class of low-density metamaterial exhibiting optimal isotropic stiffness, Advanced Materials 30 (45) (2018) 1803334.

[14] M. Ostoja-Starzewski, Microstructural randomness and scaling in mechan- ics of materials, Chapman and Hall/CRC, 2007.

[15] Z. R. Khokhar, I. A. Ashcroft, V. V. Silberschmidt, Simulations of delam- ination in cfrp laminates: effect of microstructural randomness, Computational Materials Science 46 (3) (2009) 607–613.

[16] A. S. Dalaq, S. I. Ranganathan, M. Ostoja-Starzewski, Scaling function in conductivity of planar random checkerboards, Computational Materials Science 79 (2013) 252–261.

[17] M. P. Bendsøe, Topology optimization, in: Encyclopedia of Optimization, Springer, 2001, pp. 2636–2638.

[18] O. Sigmund, A 99 line topology optimization code written in matlab, Struc- tural and Multidisciplinary Optimization 21 (2) (2001) 120–127.

[19] R. S. Michalski, J. G. Carbonell, T. M. Mitchell, Machine learning: An artificial intelligence approach, Springer Science & Business Media, 2013.

[20] I. H. Witten, E. Frank, M. A. Hall, C. J. Pal, Data mining: Practical machine learning tools and techniques, Morgan Kaufmann, 2016.

[21] S. Lafon, A. B. Lee, Diffusion maps and coarse-graining: A unified frame- work for dimensionality reduction, graph partitioning, and data set parameterization, IEEE Transactions on Pattern Analysis and Machine Intelligence 28 (9) (2006) 1393–1403.

[22] A. Agrawal, A. Choudhary, Perspective: Materials informatics and big data: Realization of the fourth paradigm of science in materials science, Apl Materials 4 (5) (2016) 053208.

[23] G. B. Goh, N. O. Hodas, A. Vishnu, Deep learning for computational chem- istry, Journal of Computational Chemistry 38 (16) (2017) 1291–1307.

[24] C. Bisagni, L. Lanzi, Post-buckling optimisation of composite stiffened pan- els using neural networks, Composite Structures 58 (2) (2002) 237–247.

[25] G. X. Gu, C.-T. Chen, D. J. Richmond, M. J. Buehler, Bioinspired hi- erarchical composite design using machine learning: simulation, additive manufacturing, and experiment, Materials Horizons 5 (5) (2018) 939–945.

[26] D. T. Do, D. Lee, J. Lee, Material optimization of functionally graded plates using deep neural network and modified symbiotic organisms search for eigenvalue problems, Composites Part B: Engineering 159 (2019) 300– 326.

[27] T. Nguyen, A. Kashani, T. Ngo, S. Bordas, Deep neural network with high- order neuron for the prediction of foamed concrete strength, ComputerAided Civil and Infrastructure Engineering 34 (4) (2019) 316–332.

[28] T. Kim, O.-S. Kwon, J. Song, Response prediction of nonlinear hysteretic systems by deep neural networks, Neural Networks 111 (2019) 1 – 10.

[29] K. Gopalakrishnan, S. K. Khaitan, A. Choudhary, A. Agrawal, Deep con- volutional neural networks with transfer learning for computer vision-based

data-driven pavement distress detection, Construction and Building Materials 157 (2017) 322–330.

[30] S. Lee, J. Ha, M. Zokhirova, H. Moon, J. Lee, Background information of deep learning for structural engineering, Archives of Computational Methods in Engineering 25 (1) (2018) 121–129.

[31] G. X. Gu, C.-T. Chen, M. J. Buehler, De novo composite design based on machine learning algorithm, Extreme Mechanics Letters 18 (2018) 19–28.

[32] R. L. Taylor, Feap-a finite element analysis program (2014).

[33] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, Dropout: a simple way to prevent neural networks from overfitting, The Journal of Machine Learning Research 15 (1) (2014) 1929–1958.

[34] G. E. Hinton, A practical guide to training restricted Boltzmann machines (2012) 599–619.

[35] A. Paszke, S. Gross, S. Chintala, G. Chanan, Pytorch (2017).

[36] M. Mitchell, An Introduction to Genetic Algorithms, MIT Press, 1996.

[37] J. H. Holland, et al., Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence, MIT Press, 1992.

[38] J. H. Holland, Adaptation in natural and artificial systems, The University of Michigan Press 1 (1975) 975.

[39] S. Baluja, R. Caruana, Removing the genetics from the standard genetic algorithm, in: Machine Learning Proceedings 1995, Elsevier, 1995, pp. 38– 46.

[40] M. Khan, K. Willmert, W. Thornton, An optimality criterion method for large-scale structures, AIAA Journal 17 (7) (1979) 753–761.

[41] L. Lamberti, C. Pappalettere, Comparison of the numerical efficiency of different sequential linear programming based algorithms for structural optimisation problems, Computers & Structures 76 (6) (2000) 713–728.

[42] R. Sedaghati, Benchmark case studies in structural design optimization us- ing the force method, International Journal of Solids and Structures 42 (21-22) (2005) 5848–5871.

[43] K. A. James, H. Waisman, Failure mitigation in optimal topology design using a coupled nonlinear continuum damage model, Computer Methods in Applied Mechanics and Engineering 268 (2014) 614–631.

[44] K. A. James, H. Waisman, Topology optimization of viscoelastic structures using a time-dependent adjoint method, Computer Methods in Applied Mechanics and Engineering 285 (2015) 166–187.

[45] D. E. Golberg, Genetic algorithms in search, optimization, and machine learning, Addion Wesley 1989 (102) (1989) 36.

[46] C. D. Chapman, K. Saitou, M. J. Jakiela, Genetic algorithms as an ap- proach to configuration and topology design, Journal of Mechanical Design 116 (4) (1994) 1005–1012.

[47] J. Chen, B. Xin, Z. Peng, L. Dou, J. Zhang, Optimal contraction theo- rem for exploration–exploitation tradeoff in search and optimization, IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 39 (3) (2009) 680–691.

[48] A. Messac, Optimization in Practice with MATLAB : For Engineering Students and Professionals, Cambridge University Press, 2015.

designed for accessibility and to further open science