b

DiscoverModelsSearch
About
Deep Learning Detection Networks in MIMO Decode-Forward Relay Channels
2018
·
arXiv
Abstract

In this paper, we consider signal detection algorithms in a multiple-input multiple-output (MIMO) decode-forward (DF) relay channel with one source, one relay, and one destination. The existing suboptimal near maximum likelihood (NML) detector and the NML with two-level pair-wise error probability (NMLw2PEP) detector achieve excellent performance with instantaneous channel state information (CSI) of the source-relay (SR) link and with statistical CSI of the SR link, respectively. However, the NML detectors require an exponentially increasing complexity as the number of transmit antennas increases. Using deep learning algorithms, NML-based detection networks (NMLDNs) are proposed with and without the CSI of the SR link at the destination. The NMLDNs detect signals in changing channels after a single training using a large number of randomly distributed channels. The detection networks require much lower detection complexity than the exhaustive search NML detectors while exhibiting good performance. To evaluate the performance, we introduce semidefinite relaxation detectors with polynomial complexity based on the NML detectors. Additionally, new linear detectors based on the zero gradient of the NML metrics are proposed. Applying various detection algorithms at the relay (DetR) and detection algorithms at the destination (DetD), we present some DetR-DetD methods in MIMO DF relay channels. An appropriate DetR-DetD method can be employed according to the required error probability and detection complexity. The complexity analysis and simulation results validate the arguments of this paper.

Index Terms—Channel state information, machine learning, maximum likelihood, neural network, TensorFlow.

In wireless communications, deep fading often causes a failure in reliable data transmission. Relays help increase the transmission reliability between a source and a destination, and extend the network coverage by providing an additional link. The relay channel model, introduced by van der Meulen [1], is a basic channel model in network communications. This relay channel has been studied extensively in the literature [2]-[10]. Among various relaying operations, amplify-forward (AF) and decode-forward (DF) are the two most common methods [3].

Unlike a multiple-input multiple-output (MIMO) system, a linear relationship does not exist between the input and the output in the DF relay channel due to the hard decision at the relay. As the received signal of the relay is not known at the destination, the maximum likelihood (ML) detection in the DF relay system requires more steps than that in the

X. Jin and H.-N. Kim are with the Department of Electronics Engineering, Pusan National University, Busan 46241, Republic of Korea, (e-mails: jinxl77@gmail.com, hnkim@pusan.ac.kr, Phone number: +82-51-510-2394.)

This research has been supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education under Grant NRF-2017R1D1A1A09000565.

MIMO system [4], [5], [6]. Due to the complexity of the ML detection and the difficulty in analysis, a near-ML (NML) detector was proposed in [6], [7] under instantaneous channel state informations (CSIs) of the source-relay (SR), sourcedestination (SD), and relay-destination (RD) links. However, forwarding the instantaneous CSI of the SR link from the relay to the destination requires additional work and reduces the data rate. With the statistical CSI of the SR link at the destination, an NML with two-level pair-wise error probability (PEP) (NMLw2PEP) detection was proposed in [8] that achieves good performance with relatively low complexity. Without any knowledge of the SR link at the destination, the minimum distance (MD) detection1 ignores detection error at the relay and shows very poor performance [5], [6]. The above mentioned detection algorithms detect signals simultaneously by exhaustively searching all the possible signal sets so that their complexities increase exponentially as the number of transmit antennas increases.

A method to reduce the detection complexity is to separate the signals by a linear operation and detect them individually. The typical linear detectors in MIMO channels are the zero forcing (ZF) and minimum mean square estimation (MMSE) detectors [11]. Referring to the ZF detector, a linear detector of ZF with maximum ratio combining (MRC) (ZFwMRC) was proposed in MIMO relay channels when the relay detects signals correctly [9]. However, this algorithm cannot achieve good performance for the relay with errors similar to the MD detection. A new detection method should be introduced, and a potential solution is to use the powerful tools in machine learning.

A. Machine Learning and Detection

Machine learning is a subset of artificial intelligence that learns to solve a specific problem by themselves [12]. Supervised learning, a basic machine learning algorithm, trains a learning algorithm, g, which is an approximate of a target function f such that x = f(y) using the known training data samples including the observation data y and reference data x. Meanwhile, traditional signal detection obtains an estimation of  ˆxdirectly from the observation y using a mathematical optimization method without reference signals (training data). However, it is not easy to theoretically find a detector with reasonable performance and complexity. Applying machine learning, a learning algorithm, g, that approximates the existing detection algorithm is trained to minimize a loss function l(ˆx, x)that measures the cost of estimating  ˆxwhen the actual answer is x. After training, the observation data y undergoes the final learning algorithm g; and subsequently, the desired data  ˆxis detected in real time. This testing phase is called a detection stage in this paper.

Advances in computer technology and big data processing have significantly reduced the cost and time of training deep learning algorithms. This has significantly improved the development of computer vision [13] and natural language processing [14]. In communication networks, deep learning has begun to receive much attention [15]. To reduce complexity, the detection and channel decoding problems have been investigated using powerful deep learning tools in the channel decoding [16], [17], signal detection in MIMO systems [18], [19], and signal detection in chemical communications [20], [21]. A MIMO deep detection network in the MIMO channel is noteworthy [19]. This detection network applies a deep unfolding approach that transforms a computationally intractable probabilistic model into a deep neural network by unfolding iterative calculations into neural-network layers (NNLs) [22]. Embedding the existing mathematical methods into black-box-like deep neural networks improves the accuracy and reduces complexity. In this paper, we adopt the deep unfolding approach in the detection networks of the MIMO DF relay channel under three scenarios related to the knowledge of the SR channel.

B. Contributions

The primary contributions are summarized as follows:

• With the instantaneous CSI of the SR link, a detection network with SR channel (DNwSRC) is proposed applying deep unfolding approach when the ML detector is applied at the relay. The DNwSRC is trained using large numbers of randomly distributed channels. This detection network detects signals on changing channels and requires much lower complexity compared to the suboptimal NML detector while exhibiting a fine performance.

• Applying an equivalent SR channel with the average error probability at the relay to the DNwSRC, a detection network with relay error probability (DNwREP) is proposed, which only requires the statistical CSI of the SR link.

• Without any knowledge of the SR channel, a suboptimal exhaustive search detection algorithm called NML with relay signal distance (NMLwRSD) is proposed by considering the squared signal distance at the relay (Proposition 3). To the best of our knowledge, the optimal or suboptimal detector in this case has not been proposed in the literature. This algorithm exhibits much better performance compared to the existing MD detector. Moreover, based on the NMLwRSD detector, a detection network with relay signal distance (DNwRSD) and a simplified DNwRSD (sDNwRSD) are proposed. The DNwRSD and sDNwRSD achieve excellent performance without any knowledge of the SR channel.

• To evaluate the performance of the NML-based detection networks (NMLDNs), detection algorithms of semidefi-nite relaxation (SDR) with SR channel (SDRwSRC), SDR

with relay error probability (SDRwREP), and SDR with relay signal distance (SDRwRSD) are introduced as the SDR versions of the NML, NMLw2PEP, and NMLwRSD detectors, respectively. The NML-based SDR (NMLSDR) detectors exhibit relatively good performance with polynomial complexity, and are thus suitable choices for the MIMO DF relay channel without enough training data.

• Additionally, new linear detectors based on the zero gradient (ZG) of the metrics in the NML, NMLw2PEP, and NMLwRSD detectors are proposed (Propositions 1, 2, and 4). The NML-based ZG (NMLZG) detectors achieve much better performance than the existing linear ZFwMRC detector.

• For various detection algorithms at the relay (DetR), we present the corresponding equivalent SR channel matrix such that the above-mentioned detection algorithms at the destination (DetD) can be implemented for any DetR.

• We present and compare some DetR-DetD methods based on the characteristics in error probability and detection complexity in the MIMO relay channel. This provides the directions for designing the system configuration.

The remainder of the paper is organized as follows. In the next section, we formally introduce the MIMO DF relay channel and its equivalent real model. The main part of this paper is presented in Section III. The NML-based detection networks (NMLDNs) such as the DNwSRC, DNwREP, DNwRSD, and sDNwRSD are proposed with various conditions of the knowledge of the SR channel. In Section IV, the training and detection details for the proposed NMLDNs are introduced. For comparison, the SDR detectors based on the NML detection algorithms are proposed in Section V. Various DetR-DetD methods are presented based on the required error probability and complexity after introducing the DetR and their corresponding equivalent SR channels in Section VI. The main results are evaluated using TensorFlow and Matlab, and are detailed in Section VII. Finally, the conclusions are given in Section VIII.

C. Notations

Throughout the paper, we use the following notations. The superscript  (·)Tdenotes the transpose of a matrix;  tr(·)denotes the trace of a matrix;  Re(·)and  Im(·)denote the real and imaginary parts of a complex number, respectively; Indenotes the  n × nidentity matrix (where the subscript n is omitted when it is irrelevant or clear from the context);  Cn×mdenotes a set of  n × mcomplex matrices; for  A ∈ Cn×m, A ∼ CN(0, σ2Inm)denotes that the elements of A are i.i.d. circularly symmetric complex Gaussian random variables with zero mean and variance  σ2, and  B ∼ N(0, σ2Inm)denotes that  B ∈ Rn×mis a real Gaussian random matrix with zero mean and covariance matrix  σ2Inm; diag(·)denotes a block diagonal matrix with the entries on its main diagonal;  [·]i:j,k:lmeans a matrix consisting of the entries from the ith row to the jth row, and from the kth column to the lth column in the original matrix.

A half-duplex DF relay channel with one source, one destination, and one relay is considered. It is assumed that the relay knows the CSI of the SR channel, and the destination knows the CSIs of the SD and RD links. In the first phase, the source with N transmit antennas broadcasts N independently and uniformly distributed complex signals  xC = [xC1 , . . . , xCN]Tto the relay and the destination, where  Re{xCi } ∈ A, Im{xCi } ∈A, i = 1, . . . , N, and  A ∈ {+1, −1}. Subsequently, the received signals at the relay with  NRreceiving antennas can be written as

image

where  HCSR ∈ CNR×Nis the channel coefficient matrix of the SR link and  zCSR ∼ CN(0, σ2INR)is the noise term at the relay. Simultaneously, the destination receives the signal transmitted from the source as

image

where  HCSD ∈ CND×Nis the channel coefficient matrix of the SD link and  zCSD ∼ CN(0, σ2IND)is the noise term at the destination in the first phase. In the second phase, the relay decodes the received signal and forwards the decoded signal  xCR = [xC1R, . . . , xCNR]T , Re{xCiR} ∈ A, Im{xCiR} ∈A, i = 1, . . . , N to the destination. The received signal at the destination with  NDreceiving antennas in the second phase is

image

where  HCRD ∈ CND×Nis the channel coefficient matrix of the RD link and  zCRD ∼ CN(0, σ2IND)is the noise term at the destination in the second phase. To simplify the expressions, we convert the complex system model to a real system model. Let  x =�Re{xC}Im{xC}�, xR =

image

equivalent real system model is written as

image

where  x, xR ∈ A2N, zSR ∼ N(0, 12σ2I2NR), and  zkl ∼N(0, 12σ2I2ND)for kl = {SD, RD}. Further, the signal-to- noise ratios (SNRs) at the relay and the destination are linearly proportional to  ρ = 2Nσ2. The equivalent real system model is depicted in Fig. 1, where “DetR” and “DetD” represent the detection algorithms at the relay and the destination, respectively. Various types of DetD and DetR will be introduced in the following sections.

Consider the MIMO relay channel in Fig. 1. Due to the probabilistic distribution of the noise terms  zSR, zSD, and  zRD,

image

Fig. 1. The equivalent real MIMO DF relay system model.

the optimal detection method is the ML detector that finds a ˆx ∈ A2Nthat maximizes  p(ySD, yRD|x, HSD, HRD, HSR)for the uniformly distributed x.

As detection errors may exist in the relay, the detection method for the DF relay channel is different from that of the MIMO channel. Error probabilities in the relay should be considered when the CSIs for the SR, SD, and RD links are known. This means that both signals possibly transmitted from the source and from the relay must be considered in the DetD. This results in a much higher detection complexity in the MIMO DF relay channel, approximately the square of the computational complexity of the point-to-point MIMO channel detection. The exhaustive search algorithms such as the ML and NML detectors cannot be applied in the MIMO DF relay channel with large numbers of antennas. Moreover, without the CSI of the SR link, there is no existing optimal or suboptimal detection algorithm that we can refer to. To address these problems, this section explores deep learning detection algorithms for three cases of the knowledge of the SR channel in the destination: 1) with the instantaneous CSI of the SR link; 2) with the statistical CSI of the SR link; 3) without the CSI of the SR link.

A. With Instantaneous CSI of SR Link

With the full CSIs of the SR, SD, and RD links, the ML detection that maximizes the probability p(ySD, yRD|x, HSD, HRD, HSR)for the real system model in (4)-(6) can be written as

image

where  PSR(xR|x, HSR)is the probability that the relay detects the received signal to  xRwhen the source transmits x. Since it is highly difficult to derive  PSR(xR|x, HSR)in MIMO systems [23], the pair-wise error probability (PEP) between x and  xR, PSR(x → xR|HSR)is used. Moreover, applying the widelyused max-log approximation  ln �i exp(xi) ≈ maxi xi[24]- [26], the near-ML (NML) detector was proposed in [6] as

image

where  PSR(x → xR|HSR) = 1for  xR = x; otherwise, PSR(x → xR|HSR)is the PEP between x and  xRfor the ML detector at the relay (MLaR) [6] written as

image

To detect the transmitted signal, the NML detector requires |A|4Ntimes of the calculation for the metric in (8). This exhaustive search detection algorithm cannot be used in practice. By unfolding the iterations of the projected gradient descent method to the layered neural networks as the deep MIMO detection in [19], a deep learning algorithm is proposed to approximate the NML detector. This is described in detail in the following steps.

• Projected gradient descent method

The projected gradient descent method is based on the gradient of the metric in the original exhaustive search detection algorithm. However, the metric in (8) itself is unsuitable for the gradient descent method due to the complicated gradient of the function ln Q(x). Instead, we use an approximation given in [6]

image

Then the detection metric in (8) becomes a quadratic function of x:

image

diag�HSD, HRD�, and  HR = 12�HSR −HSR�. An optimization problem is established as

image

Applying the projected gradient descent method to the nonconvex optimization problem in (13), an update in the kth iteration is written as

image

image

Fig. 2. A single NNL in the detection network.

where  φ(·)is a nonlinear projection operator, e.g., φ(x) = sign(x)for  x ∈ A = {+1, −1}, ˆxk−1is the estimate in the  (k − 1)th iteration,  δkis a step size in the kth iteration for  k = 1, . . . , L, ∇m(x) =

image

• Unfolding iterations

The L iterations are unfolded to the L neural-network layers (NNLs). The update in the kth iteration in (14) is reflected in the input of the kth NNL, i.e.,

image

Due to the structure of the NNL in Fig. 2, the repeated part and the constant values in (14) are ignored in (15). Unlike the deep MIMO detection [19], the output of each layer includes an auxiliary vector v and the desired signal x as well as a help signal  xR. In detail, the main parts in the kth NNL include

image

where Relu(x) = max(0, x) and  ψtkis a elementwise soft decision operator. We adopt  ψtk(x) = −1 +Relu(x+tk)

|tk| − Relu(x−tk)|tk|in [19] for  x ∈ {1, −1}. As shown in Fig. 3, the entire detection network includes L NNLs, where the outputs of the previous layer,  v, xR, and x, combined with the observation and side information HTSDySD, HTRDyRD, HTSDHSD, HTRDHRD, and  HTSRHSR,

image

Fig. 3. The DNwSRC in the MIMO DF relay channel.

enter the next layer. In the last layer, the final decision is made as  ˆx = φ(ˆxL). To improve performance, we adopt the residual learning [27], i.e., applying a weighted average of the previous output and the current output to the current output [19].

• Learning algorithm The learning algorithm of the detection network at the destination in Fig. 3 is denoted as

image

is a set of parameters that is trained during the training phase.

• Loss function

To train the learning algorithm,  gθ, in (16), we can use the reference signal x and the observation  (ySD, yRD)with the side information (HSD, HRD, HSR) as the training data. From the projected gradient descent method in (14) and the NNL in Fig. 2, we can find that x and  xRaffect each other in each iteration or layer. Thus, setting  xRas another reference signal helps to improve the accuracy. Since the training phase is a preprocessing step,  xRcan be known as a reference signal before training.

Combining all outputs of the NNLs [19], two possible loss functions can be used.

1) When both x and  xRare known as the reference signal, the loss function  l1can be employed as

image

image

2) When only x is known as the reference signal, the transmitted signal from the relay,  xR, cannot be used for training, and the loss function  l2is used instead:

image

This detection method is called a detection network with SR channel (DNwSRC) in the MIMO DF relay channel.

Proposition 1. Obtaining the zero gradient point for the convex function in (12), i.e.,

image

a linear receiver in the MIMO DF relay channel can be obtained as

image

sequently, a new linear detector for the desired signal x is derived as

image

where  ˜x = [ ˜x]1:2N. This is called a detector of zero gradient with SR channel (ZGwSRC).

B. With Statistical CSI of SR Link

In this section, we handle the case where only the statistical CSI of the SR link is known at the destination for the Rayleigh fading SR channel  HCSR ∼ CN(0, σ2SRINRN). Since the instantaneous CSI of the SR link,  HSR, is unknown at the destination, the exact PEP,  PSR(x → xR|HSR), could not be applied in (8). Instead, the average PEP, ¯PSR, can be used. Subsequently, a NML detector with two-level-PEP (NMLw2PEP) [8] is written as

image

where  x=�xT xTR�T.

Substituting  HTSRHSR = σ2ln P −1e ·Iinto (15) and training the detection network in Fig. 3 to minimize the loss function in (18) or (19), a new deep learning detection algorithm namely detection network with relay error probability (DNwREP) is achieved. The loss functions are normalized using a linear receiver in the following proposition.

Proposition 2. A linear receiver  ˜xis obtained by substituting HTSRHSR = σ2ln P −1e · Iinto (20). Making decisions to the desired part, i.e.,  ˆx = φ�[ ˜x]1:2N�, a detector of zero gradient with relay error probability (ZGwREP) is derived.

C. Without CSI of SR Link

Without the CSI of the SR link, the minimum distance (MD) detector [6] can first be considered. The MD detector ignores the detection errors at the relay although the error may occur. Setting  xR = xin (8), the MD detection algorithm is written as

image

and its linear detection version is

image

which was proposed in [9]. We call it a detector of ZF with the MRC (ZFwMRC).

Since the error probability at the relay is not considered in the MD and the ZFwMRC detectors, they exhibit poor performance [6]. From the metrics for the NML and NMLw2PEP detectors in (11) and (23), we can find that the influence of the SR link in the metrics exists only when  xR ̸= x. Due to the similar reason presented in Section III-B, 14∥x − xR∥2can represent the influence of the SR link well regardless of the CSI of the SR link. Meanwhile, adding 14∥x − xR∥2can be regarded as a regularization method in convex optimization problems [28]. Subsequently, we propose a suboptimal detection algorithm in the following proposition.

Proposition 3. Without any knowledge of the SR link, an NML with relay signal distance (NMLwRSD) detector is proposed as

image

This NMLwRSD detector will achieve improved performance by considering the influence of the SR channel, however, it requires exhaustive search for all possible signal sets. Based on the NMLwRSD detector, we propose a new deep learning detection algorithm called detection network with relay signal distance (DNwRSD), by substituting  HTSRHSR = Iinto (15). This detection network is trained using the loss function in (18) or (19) normalized by a linear receiver proposed in Proposition 4 at the end of this section.

Furthermore, observing (15), when  HTSRHSR = I, the last term becomes  ˆxk−1 − ˆxR,k−1and can be represented by  ˆxk−1and  ˆxR,k−1. Thus, the input dimension can be reduced, i.e., ikbecomes

image

Substituting  HTSRHSR = 0into the detection network in Fig.3, a new deep learning detection algorithm, namely simplified DNwRSD (sDNwRSD) is obtained.

Proposition 4. A linear receiver  ˜xis obtained by substituting HTSRHSR = Iinto (20). Making decisions for the desired part, i.e.,  ˆx = φ�[ ˜x]1:2N�, a detector of zero gradient with relay signal distance (ZGwRSD) is derived.

The training for the NMLDNs such as DNwSRC, DNwREP, DNwRSD, and sDNwRSD is implemented on the TensorFlow frameworks [29] by applying the Adam optimizer, a variation of the stochastic gradient descent method [30]. The equivalent real system in (4)-(6) with independently and uniformly distributed input  x ∈ A2Nis applied. Using a 4N dimensional vector  vk, we have a 18N dimensional input vector  ikin (15).

image

The dimension of the input vector  ikis reduced to 16N for the sDNwRSD.

To train the detection networks,  Nbatch = 5000data samples of  x, ySR, ySD, yRD, HSR, HSD, and  HRDare randomly generated according to their distributions in each iteration, and Niteration = 5 × 104iterations are implemented. The training process of the NMLDNs with the loss function  l1in (18) is shown in Algorithm 1, where  HSRis the equivalent SR channel matrix depending on the type of NMLDN in Section III, and  Pb = 1Neval�Nevalj=1 Pb,n−Neval+jis the average bit error rate (BER) during  Neval = 50iterations. A total of  2N NevalNbatchbits can be used to evaluate the performance to well reflect the performance of BER> 1002N NevalNbatch = 2N × 10−4. Through the offline training process of Algorithm 1, the final parameter set θ∗is determined. Once  θ∗is determined, the transmitted signal x can be detected in real time using Algorithm 2. The training for the loss function  l2in (19) can be performed similarly.

In this section, the SDR technique [31], [32] with polynomial complexity is applied in the MIMO DF relay channels as a performance comparison to the proposed NMLDNs. We begin from the optimization problem in (13) that is a revised version of the NML detector in (8). The optimization problem for  A = {1, −1}can equivalently be rewritten as

image

where

image

constraint  X = ssTwith  X ⪰ 0in the nonconvex optimization problem in (28), an SDR problem is obtained as

image

which can be solved by standard convex optimization techniques [28] or the CVX packages in MATLAB, e.g., [33]. Subsequently, the desired signal can be detected as

image

which is called a detector of SDR with SR channel (SDR-

wSRC). Since the relationship of  X = ssT =�ˆx

is not established in the problem of (30), this SDR detector cannot achieve the same performance as the one in (28). However, we can expect a fine performance similar to the case of the MIMO channel [31], [32], where the SDR detector achieves the same diversity order with the ML detector.

Substituting  HTSRHSR = σ2 ln P −1e · Iand  HTSRHSR = Iinto (29), and plugging (29) into (30), two SDR detectors, an SDR with relay error probability (SDRwREP) and an SDR with relay signal distance (SDRwRSD), are obtained.

In the previous sections, four types of detection algorithms at the destination (DetD) are introduced when the ML detector is applied at the relay. They are the NML detectors such as NML, NMLw2PEP, and NMLwRSD; the NML-based ZG (NMLZG) detectors such as ZGwSRC, ZGwREP, and ZGwRSD; the NML-based detection networks (NMLDNs) such as DNwSRC, DNwREP, DNwRSD, and sDNwRSD; and the NML-based SDR detectors such as SDRwSRC, SDRwREP, and SDRwRSD. Regardless of the DetD, various types of detectors can be used at the relay. For different detection algorithms at the relay (DetR), the equivalent channel matrix of the SR link applied in the DetD in Sections III-A, III-B, and V should be different. In the following subsections, we introduce some DetR and their equivalent SR channels, discuss the detection complexities of the DetR and DetD, and subsequently present various DetR-DetD methods according to the error performance and detection complexity.

A. Detection Algorithms at the Relay (DetR)

In this section, we briefly introduce some representative DetR and handle the corresponding equivalent SR channel ˜HSRapplied in the DetD. 1) The ML detector at the relay (MLaR) The optimal MLaR is written as

image

and the PEP between x and  xRis in (9). Thus, the original  HSRis used in the detections at the destination. 2) The ZF detector at the relay (ZFaR)2

image

The SNR for  xiis equal to  SNRi = 112 σ2[(HTSRHSR)−1]ii. Thus, an equivalent channel model can be written as

image

where ˜z ∼ N�0, 12σ2I�and HZFRSR =diag� 1√[(HTSRHSR)−1]11 , . . . , 1√[(HTSRHSR)−1]2N,2N

Substituting  HZFRSRinto (9) instead of  HSR, we obtain the PEP for the ZFaR. Thus, we have ˜HSR = HZFRSRfor the detection algorithms of the NML, DNwSRC, and ZGwSRC in Section III-A, and the SDRwSRC in Section V. Since 2√[(HTSRHSR)−1]ii ∼ χ22(NR−N+1), the chi-squared distribution with  2(NR − N + 1)degree of freedom, the diversity order of the ZFaR is dR = NR −N +1[34]. Subsequently,  dR = NR −N +1is used to determine the average error probability,  Pe, in the detection algorithms of the NMLw2PEP, DNwREP,

and ZGwREP in Section III-B, and in the SDRwREP detector in Section V. 3) The SDR detector at the relay (SDRaR) The SDRaR [31], [32] with polynomial complexity is written as

image

achieves the same diversity order as the ML detector [31]. Thus, we use the original parameter  HSRfor the SDRaR in the detection algorithms in Sections III-A, III-B, and V.

image

A MIMO detection network [19] contains L estimation layers, and each single layer is similar to Fig. 2 except that the part related to the helping signal  ˆxR,kdoes not exist. The input vector in the kth layer is

image

where  ˜x = (HTSRHSR)−1HTSRySRis the ZF receiver in (34).

As shown in [19] and in Section VII, the MIMO detection network shows the similar performance with the SDR detector; thus, we use the original parameter  HSRfor the DNaR case.

B. Detection Complexity

For the complexity measure of the detection algorithms, we apply the naive calculation method, i.e., the complexity is O(nmp) for the multiplication of matrices of  n × mand m × p, and  O(n3)for the  n × nmatrix inversion. The detection complexities for the DetR and DetD are discussed for constants  βR = NRNand  βD = NDN.

Based on the rules above regarding the computational complexity, the following results are obtained for the detection algorithms of the NML, NMLZG, and NMLDNs.

• Because ��ySD − HSDx��2, ��yRD − HRDxR��2, and∥HSR(x − xR)∥2require the complexity of  O(N 2)for each  x=�xT xTR�T ∈ A4N, the complexities of the NML detectors are at least  O(N 2 · |A|4N).

• Due to the matrix multiplication and the matrix inversion, the NMLZG detectors require the complexity of  O(N 3). For a quasi-static fading channel (fixed CSI), the NMLZG detectors only need to do a multiplication of a  4N ×4NDmatrix,  (HTDHD +HTRHR)−1HTD, and a  4ND ×1vector, y, and thus, the complexity is  O(N 2).

• For the NMLDNs, the computation of the channel information  HTSDHSD, HTRDHRD, and  HTSRHSRrequires the complexity of  O(N 3), and the pre-computed values of HTSDySDand  HTRDyRDrequire the complexity of  O(N 2). The multiplication of  ikand  W1kand the multiplications of  ukand  W2k, WR,2k, W3kin each NNL require the complexity of  O(N 2); thus, the overall detection complexity for the NMLDN is  O(N 3) + O(LN 2). For the quasi-static fading channels, the channel information HTSDHSD, HTRDHRD, and  HTSRHSRdoes not need to be computed again; therefore, the complexity of  O(LN 2)is obtained. Setting the number of layers as a multiple of N, the complexity becomes  O(N 3). The detection complexity of the NMLDNs can be lowered by reducing L appropriately according to the required BER. The NMLSDR detectors possess the complexity of O((4N)3.5 log(1/ǫ)) = O(N 3.5 log(1/ǫ))given a solution accuracy  ǫ > 0from [32], [35]. We summarize the detection complexity for the DetD as

• NML:  O(N 2 · |A|4N)• NMLZG:  O(N 2)• NMLDN:  O(N 3)• NMLSDR:  O(N 3.5 log(1/ǫ)). Similarly, the detection complexities for the DetR mentioned in Section VI-A are given as

image

C. System Configuration

Using various DetR and DetD, the DF relay system in Fig. 1 exhibits different characteristics in error performance and detection complexity. We introduce and compare several types of system methods depending on the applied detection algorithms at the relay and the destination. The complexity is based on the discussion in Section VI-B, and the error performance will be demonstrated in Section VII.

• MLaR-NML:

Exhaustive search detection algorithms are implemented at the relay and the destination, i.e., the optimal ML is used at the relay, and the suboptimal NML detectors including the NML, NMLw2PEP, and NMLwRSD are applied at the destination. This system achieves excellent performance, but cannot be used in large-scale antenna systems due to its high complexity.

image

4 6 8 10 12 14 ρ (dB)

Fig. 4. BER comparisons of various DetR over the SR channel with  NR =N = 10.

• ZFaR-NMLZG:

A ZF detector is used at the relay, and NMLZG detectors such as the ZGwSRC, ZGwREP, and ZGwRSD are applied at the destination. This method has a simple detection complexity at both the relay and the destination, but exhibits poor performance.

• SDRaR-NMLSDR: The SDR versions of the exhaustive search detectors are implemented at the relay and the destination. The NMLSDR detectors include the SDRwSRC, SDRwREP, and SDRwRSD detectors in Section V. This method exhibits a fair performance with polynomial complexity.

• DNaR-NMLDN:

The deep learning detection networks are employed at both the relay and the destination. The applied NMLDN methods include the DNwSRC, DNwREP, DNwRSD, and sDNwRSD proposed in Section III. This method achieves a reasonable performance with lower complexity through a pre-process of training.

In this section, we evaluate the performance of vari- ous system methods and compare the proposed detection algorithms. We consider MIMO DF relay channel with HCSR ∼ CN(0, INRN), HCSD ∼ CN(0, INDN), and  HCRD ∼CN(0, INDN)in (1)-(3).

First of all, we compare the error performance of the DetR mentioned in Section VI-A for  NR = N = 10. As shown in Fig. 4, the MLaR achieves the best performance, and the ZFaR exhibits the poorest performance. The DNaR achieves the error performance similar to the SDRaR over a fairly wide SNR range. This supports the argument in Session VI-A well. Note that the DNaR in Fig. 4 has undergone the structure in Session VI-A - 4) and has been trained by  Niteration = 106iterations with  Nbatch = 104batches in each iteration.

Since the MLaR-NML methods could not be implemented in real time due to their high complexity, we compare the

image

8 9 10 11 12 ρ (dB)

Fig. 5. BER comparisons of the DNaR-DNwSRC method trained by the loss function  l1in (18) for various numbers of layers over the MIMO DF relay channel with  NR = ND = N = 10.

image

8 9 10 11 12 ρ (dB)

Fig. 6. BER comparisons of the DNaR-DNwSRC method for various training iterations over the MIMO DF relay channel with  NR = ND = N = 10.

DNaR-NMLDN and SDRaR-NMLSDR methods for  ND =NR = N = 10. The similar performance of the DNaR and the SDRaR shown in Fig. 4 as well as in [19] renders a fair comparison. For the best-performance NML detectors and the poorest-performance NMLZG detectors, we evaluate later with a smaller number of antennas. We first evaluate the BERs of the DNaR-DNwSRC method with various numbers of layers trained  Niteration = 5 × 104iterations as shown in Fig. 5. As the number of layers increases, the BER improves and converges to a certain level when  L ≥ 60 = 3 ∗ 2N. Hence, L = 6N is applied in the following simulations. On the other hand, the BERs of the DNaR-DNwSRC method with different numbers of training iterations are shown in Fig. 6. It can be seen that as the number of the training iterations increases, the error performance improves. It is possible to obtain better performance by increasing the number of training iterations, but considering the time required for training, we

image

0 2 4 6 8 10 12 ρ (dB)

image

0 2 4 6 8 10 12 ρ (dB)

image

Fig. 7. BER comparisons of the DNaR-NMLDN and SDRaR-NMLSDR methods over the MIMO DF relay channel with  NR = ND = N = 10. (a)The NMLDNs are trained using  l1; (b) The NMLDNs are trained using  l2.

train the NMLDNs with  Niteration = 5 × 104iterations when comparing with the NMLSDR methods. Fig. 7 compares BERs for various DNaR-NMLDN and SDRaR-NMLSDR methods over the relay channel. From the curves, one can observe as the following:

• The DNaR-NMLDN methods trained using the loss function  l1achieve better performance than those trained by the loss function  l2and the SDRaR-NMLSDR methods, under the same knowledge of the SR link in wide ranges of SNR.

• The DNwSRC achieves the best error performance, and the DNwREP, DNwRSD, and sDNwRSD show similar performance for the DNaR.

• Without knowledge of the instantaneous CSI of the SR link, the DNaR-DNwREP, DNaR-DNwRSD, and sDNaRDNwRSD methods achieve better performance than the SDRaR-SDRwREP and SDRaR-SDRwRSD methods. Particularly, without any knowledge of the SR link at the destination, the DNaR-DNwRSD and DNaR-sDNwRSD show significant SNR improvements compared to the

image

4 8 12 16 20 24 28 ρ (dB)

Fig. 8. BER comparisons of various DetR and DetD methods over the MIMO DF relay channel with  NR = ND = N = 2.

SDRaR-SDRwRSD, i.e., approximately 2 dB at BER= 10−2. Additionally, we evaluate the MLaR-NML, SDRaRNMLSDR, and ZFaR-NMLZG methods in the DF relay channel with  ND = NR = N = 2. The DNaR-NMLDNs are not compared in this case since the deep learning detection does not have any advantages in both performance and complexity in small antenna systems. Fig. 8 shows that the MLaR-NML methods obtain the best performance, and the ZFaR-NMLZG methods exhibit the worst performance, while the SDRaRNMLSDR methods show a nice performance with slopes similar to the corresponding MLaR-NML methods. Moreover, the NMLwRSD, ZGwSRC, ZGwREP, and ZGwRSD detectors proposed in Propositions 3, 1, 2, 4 exhibit good performance compared to the existing detectors under the same DetR. In detail, the MLaR-NMLwRSD method obtains approximately 3.6 dB SNR improvement compared with the MLaR-MD method at BER= 10−3without any knowledge of the SR link. The ZGwSRC, ZGwREP, and ZGwRSD detectors yield approximately 9.6 dB, 9 dB, and 6.8 dB SNR improvements, respectively, compared to the ZFwMRC detector at BER= 10−3when the ZF detector is used at the relay.

Based on the exhaustive search suboptimal NML detectors, the deep learning detection networks are proposed by unfolding iterative calculations into neural-network layers in the MIMO DF relay channel with the instantaneous or statistical CSI of the SR link. Without any knowledge of the SR channel, the suboptimal NMLwRSD detector is proposed by reflecting the influence of the SR channel using the squared relay signal distance. Based on the NMLwRSD detector, two deep learning detection networks, the DNwRSD and sDNwRSD, are also proposed that do not take into account the CSI of the SR link. The deep learning detection networks exhibit a fair performance with less complexity compared to the suboptimal NML detectors and the NMLSDR detectors. The proposed detection algorithms of the NMLwRSD, SDRwSRC, SDRwREP, SDRwRSD, ZGwSRC, ZGwREP, ZGwRSD, DNwSRC, DNwREP, DNwRSD, and sDNwRSD can be applied in more complex communication networks such as multi-relay channels and multi-way relay channels. Furthermore, the discussion on the performance and complexity for various types of DetR-DetD methods provides a basic idea and direction for the system configuration.

[1] E. C. van der Meulen, “The discrete memoryless channel with two senders and one receiver,” in Proc. 2nd Int. Symp. Inf. Theory, Tsahkadsor, Armenian SSR, 1971, pp. 103–135.

[2] T. M. Cover and A. El Gamal, “Capacity theorems for the relay channel,” IEEE Trans. Inf. Theory, vol. 25, no. 5, pp. 572–584, Sep. 1979.

[3] J. N. Laneman, D. N. C. Tse and G. W. Wornell, “Cooperative diversity in wireless networks: Efficient protocols and outage behavior,” IEEE Trans. Inf. Theory, vol. 50, no. 12, pp. 3062–3080, Dec. 2004.

[4] A. Sendonaris, E. Erkip, and B. Aazhang, “User cooperation diversity- Part II: Implementation aspects and performance analysis,” IEEE Trans. Commun., vol. 51, no. 11, pp. 1939–1948, Nov. 2003.

[5] T. Wang, A. Cano, G. B. Giannakis, and J. N. Laneman, “High per- formance cooperative demodulation with decode-and-forward relays,” IEEE Trans. Commun., vol. 55, no. 7, pp. 1427–1438, Jul. 2007.

[6] X. Jin, D.-S. Jin, J.-S. No, and D.-J. Shin, “Diversity analysis of MIMO decode-and-forward relay network by using near-ML decoder,” IEICE Trans. Commun., vol. E94-B, no. 10, pp. - Oct. 2011.

[7] X. Jin, J.-S. No, and D.-J. Shin, “Relay selection for decode-and-forward cooperative network with multiple antennas,” IEEE Trans. Wireless Commun., vol. 10, no 12, pp. 4068–4079, Dec. 2011.

[8] X. Jin, E. J. Kum and D. W. Lim, “Maximum diversity achieving decoders in MIMO decode-and-forward relay systems with partial CSI,” J. Commun. Networks, vol. 16, no. 1, pp. 26–35, Feb. 2014.

[9] B. K. Chalise and L. Vandendorpe, “Performance analysis of linear receivers in a MIMO relaying system,” IEEE Commun. Letters, vol. 13, no. 5, pp. 330–332, May 2009.

[10] X. Jin and Y. Kim, “The approximate capacity of the MIMO relay channel,” IEEE Trans. Inf. Theory, vol. 63, no. 2, pp. 1167–1176, Feb 2017.

[11] R. Lupas and S. Verdu, “Linear multiuser detectors for synchronous code-division multiple-access channels,” IEEE Trans. Inf. Theory, vol. 35, no. 1, pp. 123–136, Jan 1989.

[12] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, Nov. 2016.

[13] A. Ioannidou, E. Chatzilari, S. Nikolopoulos, and I. Kompatsiaris, “Deep learning advances in computer vision with 3d data: A survey,” ACM Computing Surveys (CSUR), 50, no. 2, 2017.

[14] O. Abdel-Hamid, A. r. Mohamed, H. Jiang, L. Deng, G. Penn and D. Yu, “Convolutional neural networks for speech recognition,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 10, pp. 1533–1545, Oct. 2014.

[15] C. Zhang, P. Patras, and H. Haddadi, “Deep learning in mobile and wireless networking: A survey,” arXiv preprint arXiv:1803.04311, 2018.

[16] E. Nachmani, E. Marciano, L. Lugosch, W. J. Gross, D. Burshtein and Y. Beery, “Deep learning methods for improved decoding of linear codes,” IEEE Journal of Selected Topics in Signal Processing, vol. 12, no. 1, pp. 119–131, Feb. 2018.

[17] F. Liang, C. Shen, and F. Wu, “An iterative BP-CNN architecture for channel decoding,” IEEE Journal of Selected Topics in Signal Processing, vol. 12, no. 1, pp. 144–159, Feb. 2018.

[18] X. Yan, F. Long, J. Wang, N. Fu, W. Ou, and B. Liu, “Signal detection of MIMO-OFDM system based on auto encoder and extreme learning machine,” in International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, May 14-19, 2017, pp. 1602-1606.

[19] N. Samuel, T. Diskin and A. Wiesel, “Deep MIMO detection,” in IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Sapporo, Jul. 3-6, 2017, pp. 1–5.

[20] N. Farsad and A. Goldsmith,“Detection algorithms for communication systems using deep learning,” arXiv preprint arXiv:1705.08044, 2017.

[21] N. Farsad and A. Goldsmith,“Neural network detection of data se- quences in communication systems,” arXiv preprint arXiv:1802.02046, 2018.

[22] J. R Hershey, J. L. Roux, and F. Weninger, “Deep unfolding: Model-based inspiration of novel deep architectures,” arXiv preprint arXiv:1409.2574, 2014.

[23] H. Art´es, D. Seethaler, and F. Hlawatsch, “Effcient detection algorithms for MIMO channels: A geometrical approach to approximate ML detection,” IEEE Trans. Signal Processing, vol. 51, no. 11, pp. 2808– 2820, Nov. 2003.

[24] B. M. Hochwald and S. T. Brink, “Achieving near-capacity on a multiple-antenna channel,” IEEE Trans. Commun., vol. 51, pp. 389-399, Mar. 2003.

[25] S. Lin and D. J. Costello, Jr., Error Control Coding: Fundamentals and Applications, 2nd ed. Upper Saddle River, NJ: Pearson Prentice Hall, 2004.

[26] M. C. Ju and I. M. Kim, “ML performance analysis of the decode- and-forward protocol in cooperative diversity networks,” IEEE Trans. Wireless Commun., vol. 8, no. 7, pp. 3855-3867, Jul. 2009.

[27] K. He, X. Zhang, S. Ren and J. Sun, “Deep residual learning for image recognition,” 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016, pp. 770–778.

[28] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge: Cambridge University Press, 2004.

[29] M. Abadi, A. Agarwal, P. Barham, E. Brevdo, Z. Chen, C. Citro, G. S. Corrado, A. Davis, J. Dean, M. Devin, et al., “Tensorflow: Large-scale machine learning on heterogeneous distributed systems,” arXiv preprint, 2016. url = http://arxiv.org/abs/1603.04467

[30] D. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.

[31] J. Jald´en and B. Ottersten, “The diversity order of the semidefinite relaxation detector,” IEEE Trans. Inf. Theory, vol. 54, no. 4, pp. 1406– 1422, 2008.

[32] Z. Q. Luo, W. K. Ma, A. M. So, Y. Ye, and S. Zhang, “Semidefinite relaxation of quadratic optimization problems,” IEEE Signal Processing Magazine, vol. 27, no. 3, pp. 20-34, 2010.

[33] M. Grant and S. Boyd, “CVX: Matlab software for disciplined convex programming, version 2.1 beta,” Dec. 2017.

[34] Y. Jiang, M. K. Varanasi and J. Li, “Performance analysis of ZF and MMSE equalizers for MIMO systems: An in-depth study of the high SNR regime,” IEEE Trans. Inf. Theory, vol. 57, no. 4, pp. 2008–2026, April 2011.

[35] C. Helmberg, F. Rendl, R. Vanderbei, and H. Wolkowicz, “An interior- point method for semidefinite programming,” SIAM J. Optim., vol. 6, pp. 342-361, 1996.

Designed for Accessibility and to further Open Science

Thank you Xianglan Jin, Hyoung-Nam Kim, who authored Deep Learning Detection Networks in MIMO Decode-Forward Relay Channels 🙏 This page is the html of their arXiv pdf, with no changes made other than format. Please cite their work