1. Introduction
In recent years, hybrid beamformer (BF)/precoder has been regarded as a prospective technology for millimeter wave (mmWave) communications [1], since the hardware overhead and power cost for radio frequency (RF) circuit and analog-digital (A/D) converters in fully digital system will be extremely large in mmWave system [2].
Two structures have been proposed for hybrid BF/precoder, e.g., the fully-connected and the sub-connected structures [3], where a manifold optimization (MO) approach was proposed to design the precoder. In particular, hybrid BF/precoder design is generally non-convex and hard to handle, mainly caused by the constant modulus constraint (CMC) of the analog BF/precoder [4]. One effective method is to use the matrix factorization, then minimize the Euclidean distance [5]. For example, [6] exploited the spatial sparse structure of the mmWave channels and proposed an orthogonal matching pursuit (OMP) method. And in [7], a MO based hybrid BF was investigated in the presence of beam-misalignment.
Another approach is to handle the original problem directly. To be specific, in [8] and [9], the precoder in multiple-input multiple-output (MIMO) systems was designed via the majorization-minimization (MM) method. Also, [10] proposed a joint design method based on the matrix monotonic optimization. Furthermore, [11] presented an alternating minimization algorithm. In [12], the authors developed a bisection search method to design the hybrid precoder for mmWave systems. In [13], a hybrid BF algorithm was proposed based on the penalty constraint convex concave procedure (CCCP) method. In addition, a neural network scheme was studied to design the hybrid BF [14].
Moreover, [15] investigated a robust hybrid BF technique to resist directions of arrival (DOAs) estimation error. The key idea is to employ null space projection (NSP) in the analog BF. The simulation results showed that the proposed scheme performs more robustly than other baselines. [16] studied the secure hybrid BF scheme in a wiretap channel, where a leakage-based low-complexity algorithm was proposed to design the hybrid BF. Then, in [17], the authors investigated the secure hybrid BF for a mmWave extended reality communications system, where a gradient descent algorithm was proposed to obtain the digital BF and a MM-based scheme is proposed to optimize the analog BF.
These above works mainly focus on the multiple-input single-output (MISO) channel or single user MIMO channel, for more complex mmWave multi-user MIMO (MU-MIMO) channel, [18] proposed a hybrid precoder scheme with quality of service (QoS) constraint. Recently, [19] studied a hybrid precoder scheme for mmWave wiretap channel with the aid of an intelligent reflecting surface (IRS). While in [20], the authors proposed a hybrid BF scheme for sub-connected hybrid precoder structures by using the alternating optimization.
Overall, the optimization methods in these works are computational complex, and some approaches can only suitable for fully-connected scheme or sub-connected scheme. To make the hybrid (BF)/precoder technique more practical, it is important to design a unified method for the two schemes with acceptable computational complexity. Motivated by this fact, we study the hybrid precoder scheme in mmWave multi-user MIMO system in the work, where a weighted sum rate (WSR) objective is solved. Specifically, to obtain the hybrid precoder in the fully-connected scheme, we transform the nonconvex objective function by the MM and block coordinate descent (BCD) methods. Then, the digital precoder is solved via the bisection search and the analog counterpart is solved via the penalty dual decomposition (PDD) technique. Besides, we extend the proposed algorithm to the sub-connected scheme by using the idea of block design. The main contributions are summarized as follows:
a) To tackle the non-convex WSR problem, we recast the objective function into a convex reformulation by applying the MM technique. Then, we decouple the reformulated problem into several subproblems and propose a BCD method to solve each subproblem iteratively.
b) The digital precoder is obtained via the bisection search. While for the analog precoder, we first propose a PDD algorithm for the fully-connected case. Then, by utilizing the block-diagonal feature, we extend the PDD algorithm to the sub-connected case, where several individual subproblems are solved by the PDD method. In fact, a unified optimization method for the hybrid precoder design is developed.
c) We analyze and compare the computational complexity of the proposed algorithm with other benchmarks, and the result indicates that the proposed design is more computational efficiently. Simulation results demonstrate the performance of the proposed scheme and provide some meaningful insights: 1) the proposed method obtains close performance with other baselines; 2) hybrid precoder obtains closed performance of the fully-digital schemes; 3) fully-connected scheme is super than the sub-connected case.
Notations: Throughout the paper, ≥ means positive semi-definite. AT, A∗, AH, A−1, vec(A), Tr(A) and |A| denote the transpose, the conjugate, the conjugate transpose, the inverse, the vectorization, the trace, the determinant of A, respectively. The block-diagonal matrix is denoted as BLKDiag (A1,..., AN), with diagonal elements A1,..., AN. [A]m1:m2,n1:n2 denotes the submatrix consisting of the m1 to m2 rows and n1 to n2 columns of A, and [a]m:n a is the sub-vector contains the m-th to n-th elements of a.
2. System Model and Problem Formulation
2.1 System Model
We study a MU-MIMO downlink network with one Tx and L users as Fig. 1. The set for these users is denoted by 𝒰 = {U1,..., UL} . We denote Hl ∈ CNU,l x NTX as the Tx-to-l-th user channel, where NTX and NU,l denote the antenna numbers of the Tx and the l-th user, respectively.
Fig. 1. The system model.
Tx uses NRF(NRF ≤ NTX) RF chains to achieve hybrid precoder with two commonly used structures, e.g., the fully-connected structure, where every RF chain is linked to all antennas by NTX phase shifters, and the sub-connected one, where each RF chain is linked to a disjoint subset of antennas by some phase shifters [3].
To improve the system performance, linear precoder is utilized at Tx to transmit the information to all users concurrently. Let sl ∈ CNd,lx1 denote the symbols for the l-th user. Here, sl and sl′ are independent for l ≠ l′. Then, the transmit signal is
\(\begin{align}\mathbf{x}=\mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{s}_{l}\right)\end{align}\), (1)
where Fd,l ∈ CNRFxNd,l represents the digital precoder for the l-th user, while Fa ∈ CNTXxNRF is the common analog precoder for all the users. Since Fa only alters the phase of the signal, Fa should satisfy the CMC, which is given by |[Fa]m,n| = 1, ∀m, ∀n. In addition, for the sub-connected scheme, except the CMC, Fa also need to satisfy the block-diagonal structure Fa = BLkDiag[f1,..., fNRF], where fr ∈ CNTX/NRF, ∀r ∈ [1,..., NRF] is a complex vector.
Then, the signal received of the l-th user is
\(\begin{align}\mathbf{y}_{l}=\mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{s}_{l}\right)+\mathbf{n}_{l}, \forall_{l}\end{align}\), (2)
where nl means the noise with nl ~ CN(0, σ2lI).
The l-th user uses Rl ∈ CNd,lxNU,l to recover sl. Thus, the rate for the l-th user is
Rl = ln|I + RlHlFaFd,lFHd,lFHaHHlRHl(RlWlRHl)-1|, (3)
where \(\begin{align}\mathbf{W}_{l}=\mathbf{H}_{l} \mathbf{F}_{a} \sum_{j \neq l}^{L} \mathbf{F}_{d, j} \mathbf{F}_{d, j}^{H} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}+\sigma_{l}^{2} \mathbf{I}\end{align}\) is the noise plus interference covariance.
Here, we assume that the minimum mean-square error (MMSE) decoder is used by each user, e.g., Rl is given by
\(\begin{align}\mathbf{R}_{l}=\mathbf{F}_{d, l}^{H} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\left(\sum_{j=1}^{L} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, j} \mathbf{F}_{d, j}^{H} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}+\sigma_{l}^{2} \mathbf{I}\right)^{-1}\end{align}\). (4)
Then, (3) is equivalent to
Rl = ln|I + HlFaFd,lW-1lFHd,lFHaHHl|, (5)
where the proof of (5) can be found in [16, Appendix A].
2.2 Channel Model
According to the Saleh-Valenzuela model [6], [21], [24], H1 is given by
\(\begin{align}\mathbf{H}_{1}=\sum_{p=1}^{N_{\text {path }}} \alpha_{p} \mathbf{a}_{\mathrm{r}}\left(\psi_{p}^{\mathrm{r}}, \beta_{p}^{\mathrm{r}}\right) \mathbf{a}_{\mathrm{t}}\left(\psi_{p}^{\mathrm{t}}, \beta_{p}^{\mathrm{t}}\right)^{H}\end{align}\), (6)
where Npath is the path number in H1, αp ~ CN(0, κ210−0.1PL(D)) is the scale of the p-th path in H1, \(\begin{align}\kappa=\sqrt{M_{1} N_{\mathrm{TX}} / N_{\mathrm{path}}}\end{align}\) and PL(D) denotes the loss [21]. Besides, ar(ψrp, βrp)/at(ψtp, βtp) denote the array responses of the p-th path, where ψrp(βrp)/ψtp(βtp) is the azimuth (elevation) angles of arrival/departure.
For uniform planar arrays, at(ψtp, βtp) is given by
\(\begin{align}\mathbf{a}_{\mathrm{t}}\left(\psi_{p}, \beta_{p}\right)=\frac{1}{\sqrt{N_{\mathrm{TX}}}}\left[1, \ldots, e^{j \frac{2 \pi d}{\lambda}\left(w \sin \left(\psi_{p}\right) \sin \left(\beta_{p}\right)+h \cos \left(\beta_{p}\right)\right)}, \ldots, e^{j \frac{2 \pi d}{\lambda}\left((W-1) \sin \left(\psi_{p}\right) \sin \left(\beta_{p}\right)+(H-1) \cos \left(\beta_{p}\right)\right)}\right]^{T}\end{align}\), (7)
where λ means the wave length, d denotes the entry space, and 0 ≤ w < W, 0 ≤ h < H are the entry indices. Thus, the array size is NTX = WH.
2.3 Problem Formulation
Mathematically, the WSR design is given as
\(\begin{align}\max _{\left\{\mathbf{F}_{a, l}\right\}_{l=1}^{L}, \mathbf{F}_{a}} \sum_{l=1}^{L} \varpi_{l} R_{l}\end{align}\) (8a)
\(\begin{align}\sum_{l=1}^{L} \operatorname{Tr}\left(\mathbf{F}_{a} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H} \mathbf{F}_{a}^{H}\right) \leq P_{s}\end{align}\), (8b)
|[Fa]m,n| = 1, ∀m, ∀n, (8c)
where \(\begin{align}\varpi_{l}\left(0 \leq \varpi_{l} \leq 1, \sum_{l=1}^{L} \varpi_{l}=1\right)\end{align}\) means the weight for Rl, (8b) is the transmit power constraint with Ps being the maximum transmit power, and (8c) is the unit modules constraint of the analog precoder.
3. Joint Digital and Analog Precoder Design
3.1 Problem Transformation
Firstly, due to the Sylvester’s determinant property, i.e., |I + MN| = |I + NM|, we have Rl = ln|I + HlFaFd,lFHd,lFHaHHlW-1l|.
Then, we aim to find a lower bound of Rl. According to [22], we denote
\(\begin{align}\mathbf{D}_{l}=\left[\begin{array}{cc}\mathbf{I}_{M} & \mathbf{F}_{d, l}^{H} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H} \\ \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l} & \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}+\mathbf{W}_{l}\end{array}\right]\end{align}\). (9)
Via the method in [22], we obtain the following lower bound of
\(\begin{align}R_{l} \geq \ln \left|\mathbf{E} \overline{\mathbf{D}}_{l}^{-1} \mathbf{E}\right|-\operatorname{Tr}\left(\overline{\mathbf{C}}_{l}\left(\mathbf{D}_{l}-\overline{\mathbf{D}}_{l}\right)\right)\end{align}\), (10)
where E = [IM 0]T, \(\begin{align}\overline{\mathbf{C}}_{l}\end{align}\) is the obtained Dl in the previous iteration, and \(\begin{align}\overline {C}_l\end{align}\) is given by \(\begin{align}\overline{\mathbf{C}}_{l}=\overline{\mathbf{D}}_{l}^{-1} \mathbf{E}\left(\mathbf{E}^{H} \overline{\mathbf{D}}_{l}^{-1} \mathbf{E}\right)^{-1} \mathbf{E}^{H} \overline{\mathbf{D}}_{l}^{-1}\end{align}\).
Then, by denoting \(\begin{align}\bar{c}_{l}=\ln \left|\mathbf{E}^{H} \overline{\mathbf{D}}_{l}^{-1} \mathbf{E}\right|+\operatorname{Tr}\left(\overline{\mathbf{C}}_{l} \overline{\mathbf{D}}_{l}\right)\end{align}\), we attain \(\begin{align}R_{l} \geq \bar{c}_{l}-\operatorname{Tr}\left(\overline{\mathbf{C}}_{l} \mathbf{D}_{l}\right)\end{align}\) and solve the following problem
\(\begin{align}\min _{\left\{\mathbf{F}_{d, l}\right\}_{l=1}^{L}, \mathbf{F}_{a}} \varpi_{l} \operatorname{Tr}\left(\overline{\mathbf{C}}_{l} \mathbf{D}_{l}\right)\end{align}\) (11a)
s.t. (8b), (8c). (11b)
Moreover, we denote \(\begin{align}\overline{\mathbf{F}}_{a}\end{align}\) and \(\begin{align}\overline{\mathbf{F}}_{d,l}\end{align}\) as the obtained Fa and Fd,l in the previous iteration, and \(\begin{align}\overline{\mathbf{W}}_{l}=\mathbf{H}_{l} \sum_{j \neq l}^{L} \overline{\mathbf{F}}_{a} \overline{\mathbf{F}}_{d, j} \overline{\mathbf{F}}_{d, \mathbf{j}}^{H} \overline{\mathbf{F}}_{a}^{H} \mathbf{H}_{l}^{H}+\sigma_{l}^{2} \mathbf{I}\end{align}\). Then, according to [22], \(\begin{align}\overline{\mathbf{C}}_{l}\end{align}\) can be decomposed as
\(\begin{align}\overline{\mathbf{C}}_{l}=\left[\begin{array}{ll}\mathbf{C}_{l}^{11} & \mathbf{C}_{l}^{12} \\ \mathbf{C}_{l}^{21} & \mathbf{C}_{l}^{22}\end{array}\right]=\left[\begin{array}{cc}\overline{\mathbf{P}}_{l}^{H} & \overline{\mathbf{Q}}_{l} \\ \overline{\mathbf{Q}}_{l}^{H} & \overline{\mathbf{Q}}_{l}^{H} \overline{\mathbf{P}}_{l}^{-1} \overline{\mathbf{Q}}_{l}\end{array}\right]\end{align}\). (12)
where \(\begin{align}\overline{\mathbf{P}}_{l}\end{align}\) and \(\begin{align}\overline{\mathbf{Q}}_{l}\end{align}\) are, respectively, given by
\(\begin{align}\overline{\mathbf{C}}_{l} \mathbf{D}_{l}=\mathbf{C}_{l}^{11}+2 \mathfrak{R}\left\{\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right\}+\mathbf{C}_{l}^{22}+\sigma_{l}^{2} \mathbf{C}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\end{align}\), (13a)
\(\begin{align}\min _{\left\{\left\{_{a},\right\}_{l=1}^{l} \mathbf{F}_{a}\right.} \sum_{l=1}^{L} 2 \varpi_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\mathbf{C}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right)\right\}+\sum_{l=1}^{L} \varpi_{l} \operatorname{Tr}\left(\mathbf{C}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\right)\end{align}\). (13b)
Thus, we have
\(\begin{align}\overline{\mathbf{C}}_{l} \mathbf{D}_{l}=\mathbf{C}_{l}^{11}+2 \Re\left\{\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right\}+\mathbf{C}_{l}^{22}+\sigma_{l}^{2} \mathbf{C}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\end{align}\). (14)
Via the above procedure, the WSR design is transformed to
\(\begin{align}\min _{\left\{\mathbf{F}_{d, l},\right\}_{l=1}^{l} \mathbf{F}_{a}} \sum_{l=1}^{L} 2 \sigma_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\mathbf{C}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right)\right\}+\sum_{l=1}^{L} \omega_{l} \operatorname{Tr}\left(\mathbf{C}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d,} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\right)\end{align}\) (15a)
s.t. (8b), (8c). (15b)
When fixing other variables, (15a) is quadratic with respect to (w.r.t.) the specific variable.
3.2 Digital Precoder Optimization
By denoting fd,l = vec(Fd,l) and utilizing the vectorization operation, i.e., Tr(ST) = vec(SH)H vec(T) and Tr(UH VWX) = vec(U)H (XT ⊗ V) vec(W), (15a) is recast as
\(\begin{align}\min _{\left\{\mathbf{f}_{d, l}^{L}\right\}_{l=1}^{L}} \sum_{l=1}^{L}\left(\mathbf{f}_{d, l}^{H} \mathbf{V} \mathbf{f}_{d, l}+2 \mathfrak{R}\left\{\mathbf{v}_{l}^{H} \mathbf{f}_{d, l}\right\}\right)\end{align}\) (16)
where \(\begin{align}\mathbf{V}=\mathbf{I} \otimes\left(\sum_{l=1}^{L} \varpi_{l} \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H} \mathbf{A}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\right)\end{align}\) and \(\begin{align}\mathbf{v}_{l}=\varpi_{l} \operatorname{vec}\left(\left(\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a}\right)^{H}\right)\\\end{align}\), respectively.
Thus, we have
\(\begin{align}\min _{\left\{\left\{_{d, l}\right\}_{l=1}^{L}\right.} \sum_{l=1}^{L}\left(\mathbf{f}_{d, l}^{H} \mathbf{V}_{d, l}+2 \mathfrak{R}\left\{\mathbf{v}_{l}^{H} \mathbf{f}_{d, l}\right\}\right)\end{align}\) (17a)
\(\begin{align}\text {s.t. } \sum_{l=1}^{L} \mathbf{f}_{d, l}^{H}\left(\mathbf{F}_{a}^{H} \mathbf{F}_{a} \otimes \mathbf{I}\right) \mathbf{f}_{d, l} \leq P_{s}\end{align}\). (17b)
Then, the Lagrange function is
\(\begin{align}\mathcal{L}\left(\left\{\mathbf{f}_{d, l}\right\}_{l=1}^{L}, u\right)=\sum_{l=1}^{L}\left(\mathbf{f}_{d, l}^{H} \mathbf{V} \mathbf{f}_{d, l}+2 \Re\left\{\mathbf{v}_{l}^{H} \mathbf{f}_{d, l}\right\}\right)+u\left(\sum_{l=1}^{L} \mathbf{f}_{d, l}^{H}\left(\mathbf{F}_{a}^{H} \mathbf{F}_{a} \otimes \mathbf{I}\right) \mathbf{f}_{d, l}-P_{s}\right)\end{align}\), (18)
where u ≥ 0 is the dual variable of (17b).
Therefore, we obtain
\(\begin{align}g(u)=\min _{\left\{f_{d},\right\}_{l=1}^{L}} \mathcal{L}\left(\left\{\mathbf{f}_{d, l}\right\}_{l=1}^{L}, u\right)\end{align}\), (19)
and the dual problem is
\(\begin{align}\max _{u} g(u) \text { s.t. }u \geq 0\end{align}\). (20)
Given u, we denote the optimal solution to (19) as {fd,l(u)}Ll=1. Then, based on the first-order optimization condition w.r.t. fd,l, the optimal fd,l is given by
fd,l(u) = -(uFHaFa ⊗ I + v)-1 vl. (21)
The optimal u should satisfy the complementary slackness constraint
\(\begin{align}u\left(\sum_{l=1}^{L} \mathbf{f}_{d, l}^{H}(u)\left(\mathbf{F}_{a}^{H} \mathbf{F}_{a} \otimes \mathbf{I}\right) \mathbf{f}_{d, l}(u)-P_{s}\right)=0\end{align}\). (22)
From (22), the optimal u can be determined by the following procedure: if u = 0 and the constraint
\(\begin{align}\sum_{l=1}^{L} \mathbf{f}_{d, l}^{H}(0)\left(\mathbf{F}_{a}^{H} \mathbf{F}_{a} \otimes \mathbf{I}\right) \mathbf{f}_{d, l}(0) \leq P_{s}\end{align}\), (23)
is satisfied, then the optimal u is u = 0. Otherwise, we need to find the optimal u by solving
\(\begin{align}P(u)=\sum_{l=1}^{L} \mathbf{f}_{d, l}^{H}(u)\left(\mathbf{F}_{a}^{H} \mathbf{F}_{a} \otimes \mathbf{I}\right) \mathbf{f}_{d, l}(u)=P_{s}\end{align}\). (24)
In fact, [23] have proved that P(u) is decreasing w.r.t. u monotonically, thus u can be obtained by bisection search. The whole bisection search is summarized as Algorithm 1.
Algorithm 1: Bisection search algorithm
3.3 Analog Precoder Optimization
Here, we optimize the analog precoder Fa, where the method is mainly based on the PDD algorithm.
3.3.1 The Fully-Connected Analog Precoder Design
This problem is given by
\(\begin{align}\min _{\mathbf{F}_{a}} \sum_{l=1}^{L} 2 \varpi_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right)\right\}+\sum_{l=1}^{L} \varpi_{l} \operatorname{Tr}\left(\mathbf{A}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\right)\end{align}\) (25a)
s.t. (8b), |[Fa]m,n| = 1, ∀m, ∀n. (25b)
Via denoting vec fa = vec(Fa), (25) is equivalent to
\(\begin{align}\min _{\mathbf{f}_{a}} f\left(\mathbf{f}_{a}\right)=\mathbf{f}_{a}^{H} \boldsymbol{\Omega} \mathbf{f}_{a}+2 \Re\left\{\boldsymbol{\omega}^{H} \mathbf{f}_{a}\right\}\end{align}\) (26a)
\(\begin{align}\mathbf{f}_{a}^{H}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l}^{H} \mathbf{F}_{d, l} \otimes \mathbf{I}\right) \mathbf{f}_{a} \leq P_{s}\end{align}\) (26b)
|[fa]n| = 1, ∀n, (26c)
where \(\begin{align}\boldsymbol{\Omega}=\sum_{l=1}^{L}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right)^{T} \otimes \omega_{l} \mathbf{H}_{l}^{H} \mathbf{A}_{l}^{22} \mathbf{H}_{l}\end{align}\) and \(\begin{align}\boldsymbol{\omega}=\sum_{l=1}^{L} \varpi_{l} \operatorname{vec}\left(\left(\mathbf{F}_{d, l} \mathbf{A}_{l}^{12} \mathbf{H}_{l}\right)^{H}\right)\end{align}\), respectively.
Next, we handle (26). At first, we introduce z ∈ CNTXNRFx1 and recast (26) as
\(\begin{align}\min _{\mathbf{f}_{a}, \mathbf{z}} \mathbf{f}_{a}^{H} \boldsymbol{\Omega} \mathbf{f}_{a}+2 \mathfrak{R}\left\{\boldsymbol{\omega}^{H} \mathbf{f}_{a}\right\}\end{align}\) (27a)
\(\begin{align}\text {s.t. }\mathbf{f}_{a}^{H}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l}^{H} \mathbf{F}_{d, l} \otimes \mathbf{I}\right) \mathbf{f}_{a} \leq P_{s}\end{align}\), (27b)
fa = z, (27c)
|[z]n| = 1, ∀n. (27d)
Then, by penalizing (27c), we obtain
\(\begin{align}\min _{\mathbf{f}_{a}, \mathbf{z}, \mathbf{t}} \mathbf{f}_{a}^{H} \boldsymbol{\Omega} \mathbf{f}_{a}+2 \Re\left\{\boldsymbol{\omega}^{H} \mathbf{f}_{a}\right\}+\frac{1}{2 \lambda}\left\|\mathbf{f}_{a}-\mathbf{z}\right\|_{2}^{2}+\mathfrak{R}\left\{\mathbf{t}^{H}\left(\mathbf{f}_{a}-\mathbf{z}\right)\right\}\end{align}\) (28a)
s.t. (27b), (27d), (28b)
where λ ≥ 0 and t ∈ CNTXNRFx1 are the penalty factor and the dual variable of (27c).
The PDD consists of a two-layer optimization. Specifically, for the inner layer iteration, when z is fixed, (28) is recast as
\(\begin{align}\min _{\mathbf{f}_{a}} \mathbf{f}_{a}^{H} \boldsymbol{\Omega} \mathbf{f}_{a}+2 \mathfrak{R}\left\{\boldsymbol{\omega}^{H} \mathbf{f}_{a}\right\}+\frac{1}{2 \lambda}\left\|\mathbf{f}_{a}-\mathbf{z}\right\|_{2}^{2}+\mathfrak{R}\left\{\mathbf{t}^{H}\left(\mathbf{f}_{a}-\mathbf{z}\right)\right\}\end{align}\) (29a)
s.t. (27b), (29b)
which is convex and can be solved by bisection search.
Then, when fa is given, we have
\(\begin{align}\min _{\mathbf{z}} \frac{1}{2 \lambda}\left\|\mathbf{f}_{a}-\mathbf{z}\right\|_{2}^{2}+\mathfrak{R}\left\{\mathbf{t}^{H}\left(\mathbf{f}_{a}-\mathbf{z}\right)\right\}\end{align}\) (30a)
s.t. |[z]n| = 1, ∀n. (30b)
Due to (30b), we have \(\begin{align}\frac{1}{2 \lambda}\|\mathbf{z}\|_{2}^{2}=\frac{N_{\mathrm{TX}} N_{\mathrm{RF}}}{2 \lambda}\end{align}\). Hence, (30) is simplified to
\(\begin{align}\max _{\left[[\mathbf{z}]_{n}=1\right.} \mathfrak{R}\left\{\left(\mathbf{f}_{a}+\lambda \mathbf{t}\right)^{H} \mathbf{z}\right\}\end{align}\), (31)
and the optimal solution is z = ej∠(fa+λt).
The inner layer updates fa and z until convergence alternatively. As for the outer layer, t and λ are updated by t ← t + λ-1(fa - z) and λ ← ρλ, where ρ ≤ 1 is the scaling factor to control λ.
Algorithm 2: PDD algorithm
Since the convergence of PDD have been proved in [18, Appendix A], we omit this for brevity.
3.3.2 The Sub-Connected Analog Precoder Design
Firstly, we define b = NTX/NRF. Due to the block diagonal structures of Fa, we have
\(\begin{align}2 \varpi_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right)\right\}=\sum_{r=1}^{N_{\mathrm{NF}}}\left\{2 \varpi_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\left[\mathbf{A}_{l}^{12} \mathbf{H}_{l}\right]_{:,(r-1) b+1: r b} \mathbf{f}_{r}\left[\mathbf{F}_{d, l}\right]_{r, n}\right)\right\}\right\}\end{align}\), (32)
and
\(\begin{align}\begin{array}{l}\varpi_{l} \operatorname{Tr}\left(\mathbf{A}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\right) \\ =\sum_{r=1}^{N_{\text {RF }}}\left\{\varpi_{l} \operatorname{Tr}\left(\left[\mathbf{A}_{l}^{22} \mathbf{H}_{l}\right]_{;,(r-1) b+1: r b} \mathbf{f}_{r}\left[\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right)\right]_{r, r} \times \mathbf{f}_{r}^{H}\left[\mathbf{H}_{l}^{H}\right]_{(r-1) b+1: r b, ;}\right)\right\}\end{array}\end{align}\). (33)
Thus, the following equations hold
\(\begin{align}\sum_{l=1}^{L} 2 \sigma_{l} \mathfrak{R}\left\{\operatorname{Tr}\left(\mathbf{A}_{l}^{12} \mathbf{H}_{l} \mathbf{F}_{a} \mathbf{F}_{d, l}\right)\right\}=\sum_{r=1}^{N_{\mathrm{RF}}} 2 \mathfrak{R}\left\{\mathbf{w}_{r}^{H} \mathbf{f}_{r}\right\}\end{align}\), (34a)
\(\begin{align}\sum_{l=1}^{L} \varpi_{l} \operatorname{Tr}\left(\mathbf{A}_{l}^{22} \mathbf{H}_{l} \mathbf{F}_{a}\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right) \mathbf{F}_{a}^{H} \mathbf{H}_{l}^{H}\right)=\sum_{r=1}^{N_{\mathrm{RF}}} \mathbf{f}_{r}^{H} \mathbf{W}_{r} \mathbf{f}_{r}\end{align}\), (34b)
where
\(\begin{align}\mathbf{w}_{r}^{H}=\sum_{l=1}^{L} \varpi_{l}\left[\mathbf{F}_{d, l}\right]_{r,:}\left[\mathbf{A}_{l}^{12} \mathbf{H}_{l}\right]_{:,(r-1) s+1: r s}\end{align}\), (35a)
\(\begin{align}\mathbf{W}_{r}=\sum_{l=1}^{L}\left\{\varpi_{l}\left[\left(\sum_{l=1}^{L} \mathbf{F}_{d, l} \mathbf{F}_{d, l}^{H}\right)\right]_{r, r} \times\left[\mathbf{H}_{l}^{H}\right]_{(r-1) s+1: r s, ;}\left[\mathbf{A}_{l}^{22} \mathbf{H}_{l}\right]_{;,(r-1) s s+1: r s}\right\}\end{align}\). (35b)
Thus, by neglecting (8b), we obtain
\(\begin{align}\min _{\mathbf{f}_{r}} \sum_{r=1}^{N_{\mathrm{BF}}}\left\{\mathbf{f}_{r}^{H} \mathbf{W}_{r} \mathbf{f}_{r}+2 \mathfrak{R}\left\{\mathbf{w}_{r}^{H} \mathbf{f}_{r}\right\}\right\}\end{align}\) (36a)
s.t. |[fr]n| = 1, ∀n ∈ [1,…, s]. (36b)
Since for different r ≠ r′, fr and fr′ are independent, (36) can be treated as NRF minimization subproblems. Then, the previous proposed PDD algorithm can be applied to the sub-connected case, from r = 1 to r = NRF until all the columns in Fa are optimized.
3.4 Overall Algorithm and Complexity Analysis
Algorithm 3: BCD algorithm
The BCD algorithm is summarized as Algorithm 3. Besides, the complexity to optimize {Fd,l}Ll=1 is given by \(\begin{align}\mathcal{O}\left(\log _{2}\left(\frac{u_{u}-u_{l}}{\varepsilon}\right) \sum_{l=1}^{L} N_{d, l}^{2} N_{\mathrm{RF}}^{2}\right)\end{align}\). As for the PDD algorithm to update fa, the main complexity is given by 𝒪(T(NTXNRF + 1)2), where T is the iteration number. Thus, the overall complexity is
\(\begin{align}\mathcal{C}=O\left(\max \left\{\log _{2}\left(\frac{u_{u}-u_{l}}{\varepsilon}\right) \sum_{l=1}^{L} N_{d, l}^{2} N_{\mathrm{RF}}^{2}, T\left(N_{\mathrm{TX}} N_{\mathrm{RF}}+1\right)^{2}\right\}\right)\end{align}\), (37)
which is polynomial time.
In addition, we have mentioned that one can use the bisection method to handle (29). For (29), if the bisection method is used to obtain fa, the computational complexity is given by
\(\begin{align}\mathcal{C}=O\left(\log _{2}\left(\frac{u_{u}-u_{l}}{\varepsilon}\right) N_{\mathrm{TX}}^{2} N_{\mathrm{RF}}^{2}\right)\end{align}\), (38)
which is a quadratic function w.r.t. the dimension of fa. Therefore, it is suitable for a practical scenario when the dimension of fa is large.
Moreover, we noted that the complexity of the MMSE-based method in [11] is
C = O(NoutNin(4N2TXNRF + 13NTXN2RF + 4N3RF)), (39)
where Nout and Nin are the iteration numbers.
While the complexity of the local-optimal method in [20] is
\(\begin{align}\mathcal{C}=O\left(I_{\mathrm{P}}\left(I_{\mathrm{A}}\left(\frac{5}{3} N_{\mathrm{TX}}^{3}+(L+1) N_{\mathrm{TX}}^{2}\right)\right)+(L+2)\left(N_{\mathrm{TX}}^{2}+N_{\mathrm{RF}}^{2}\right)\right)\end{align}\), (40)
where IP and IA are the iteration numbers.
From the comparison, we can see that the proposed method is more computational efficiently than these benchmarks since Nd,l and NRF are commonly much smaller than these iteration numbers. Readers can refer [11] and [20] for more detail about {Nout, Nin, IP, IA}.
4. Simulation Results
We set the coordinates of Tx as (10 m, 0 m, 20 m), and set the 5 users in a circle with radius 5 m and centered at (10 m, 50 m, 1.5 m) randomly. Unless specified, we set NTX = 4 × 4, Nd,l = 8, NU,l = 2 x 2, \(\begin{align}\varpi_{l}=1 / L\end{align}\), ∀l, and Ps = 30 dBm, σ2l = -80 dBm, ∀l. In addition, we assume Npath = 8, and PL(D) is [24]:
PL(D)[dB] = a + 10blog10(D) + ζ, (41)
where ζ ~ N(0, 𝜚2).
4.1 Convergence
Specifically, the PDD method is referred as the inner layer iteration and the BCD method is referred as the outer layer iteration.
Fig. 2 provides the convergence of the former, where the solid and dotted lines denote the fully-and-sub-connected cases, respectively. From this figure, we observe that the WSR increase with the iteration numbers for different NTX and NRF, and converge within 40 iterations, which demonstrates the convergence.
Fig. 2. PDD algorithm convergence.
Then, we test the convergence of the latter in Fig. 3 and 4, for the fully-and sub-connected cases, respectively. From the two figures, we can see that the WSR increases and finally converges within 20 iterations for various NTX and NRF. Besides, by comparing the two figures, we can see that in the same channel condition, the fully-connected scheme overwhelms the sub-connected case.
Fig. 3. BCD algorithm convergence: Fully-connected case.
Fig. 4. BCD algorithm convergence: Sub-connected case.
4.2 Performance Evaluation
We evaluate the performance of these schemes, which are labeled as “Fully-connected case”, “Sub-connected case”, and “Fully digital precoder”, respectively. In addition, we compare the proposed method with some other benchmarks: 1) the MMSE method in [11], which is used for the fully-connected case and is labeled as “MMSE method”; 2) the local-optimal method in [20], which is used for the sub-connected case and is labeled as “Local-optimal method”, respectively.
At first, Fig. 5 plots the WSR and Ps, where we can see that the fully digital precoder obtain a performance improvement than that of the hybrid precoder, at the cost of high implementation complexity. Since we assume NRF = 4 RF chains in this simulation, e.g., only a quarter of NTX, thus the hybrid precoder inevitable suffer a certain performance loss, especially for the sub-connected hybrid precoder. Besides, we can see that the proposed method obtains close performance with the methods in [11] and [20], both for the fully-connected case and sub-connected case.
Fig. 5. WSR versus Ps.
Then, we plot the WSR versus NTX in Fig. 6. As we can see, with larger NTX, a more spatial degrees of freedom (DoF) can be obtained by these methods, thus is beneficial to improve the WSR. Besides, the gap among the fully digital case and the hybrid scheme is slightly enlarged with NTX, since the performance of the latter is not only affected by NTX, but also constrained by NRF.
Fig. 6. WSR versus NTX.
Next, we plot the WSR versus NRF in Fig. 7, where we can see that the WSR increases with NRF for all these hybrid methods, since the extra RF chain can facilitate the exploration of the spatial DoF. In addition, when NRF = NTX, the fully-connected hybrid precoder obtains closed performance as the fully digital scheme, which shows the availability of the former.
Fig. 7. The WSR versus NRF.
Lastly, as pointed by [12], energy efficiency is an important criterion of wireless communication. Motivated by this, we generate the method to the energy efficiency design by adopting the Dinkelbach’s method [25]. From Fig. 8, one observes that for all these schemes, the efficiency tends to converge to a constant when Ps becomes large. Since there exists a unique Ps for the energy efficiency maximization. Besides, we observe that the sub-connect method outperforms others schemes due to its simplest structure, while the fully digital precoder method leads to the worst performance due to the fully digital structure, which lead to the highest circuit power consumption, thus reduces the energy efficiency.
Fig. 8. The energy efficiency versus Ps.
5. Conclusion
This work developed a unified hybrid precoder design in a mmWave multi-user MIMO system. Specifically, we studied the WSR design and proposed a BCD method, where the digital precoding is handled by the dual method and the analog precoding is addressed by the PDD technique. Then, we extended the proposed algorithm to the sub-connected schemes. In fact, the proposed design was more computational efficiently than other benchmarks. Simulation results verified the performance of the proposed scheme and provide some meaningful insights: 1) the proposed method obtains close performance with other baselines; 2) hybrid precoder obtains closed performance of the fully-digital schemes; 3) fully-connected scheme is super than the sub-connected case.
References
- J. Zhang, E. Bjornson, M. Matthaiou, D. W. K. Ng, H. Yang, and D. J. Love, "Prospective multiple antenna technologies for beyond 5G," IEEE Journal on Selected Areas in Communications, vol.38, no.8, pp.1637-1660, 2020. https://doi.org/10.1109/JSAC.2020.3000826
- Z. Wang, M. Li, Q. Liu and A. L. Swindlehurst, "Hybrid Precoder and Combiner Design With LowResolution Phase Shifters in mmWave MIMO Systems," IEEE Journal of Selected Topics in Signal Processing, vol.12, no.2, pp.256-269, 2018. https://doi.org/10.1109/JSTSP.2018.2819129
- X. Yu, J. Shen, J. Zhang, and K. B. Letaief, "Alternating Minimization Algorithms for Hybrid Precoding in Millimeter Wave MIMO Systems," IEEE Journal of Selected Topics in Signal Processing, vol.10, no.3, pp.485-500, 2016. https://doi.org/10.1109/JSTSP.2016.2523903
- L. Dai, B. Wang, M. Peng, and S. Chen, "Hybrid Precoding-Based Millimeter-Wave Massive MIMO-NOMA With Simultaneous Wireless Information and Power Transfer," IEEE Journal on Selected Areas in Communications, vol.37, no.1, pp.131-141, 2019. https://doi.org/10.1109/JSAC.2018.2872364
- S. S. Ioushua and Y. C. Eldar, "A family of Hybrid Analog-Digital Beamforming Methods for Massive MIMO Systems," IEEE Transactions on Signal Processing, vol.67, no.12, pp. 3243-3257, 2019. https://doi.org/10.1109/TSP.2019.2911255
- O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, "Spatially Sparse Precoding in Millimeter Wave MIMO Systems," IEEE Transactions on Wireless Communications, vol.13, no.3, pp.1499-1513, 2014. https://doi.org/10.1109/TWC.2014.011714.130846
- C. Pradhan, A. Li, L. Zhuo, Y. Li, and B. Vucetic, "Hybrid-Precoding for mmWave Multi-User Communications in the Presence of Beam-Misalignment," IEEE Transactions on Wireless Communications, vol.19, no.9, pp.6083-6099, 2020. https://doi.org/10.1109/TWC.2020.2999897
- A. Arora, C. G. Tsinos, B. S. M. R. Rao, S. Chatzinotas, and B. Ottersten, "Hybrid Transceivers Design for Large-Scale Antenna Arrays Using Majorization-Minimization Algorithms," IEEE Transactions on Signal Processing, vol.68, pp.701-714, 2020. https://doi.org/10.1109/TSP.2019.2957611
- S. Gong, C. Xing, V. K. N. Lau, S. Chen, and L. Hanzo, "Majorization-Minimization Aided Hybrid Transceivers for MIMO Interference Channels," IEEE Transactions on Signal Processing, vol.68, pp.4903-4918, 2020. https://doi.org/10.1109/TSP.2020.3018548
- C. Xing, X. Zhao, W. Xu, X. Dong, and G. Y. Li, "A Framework on Hybrid MIMO Transceiver Design Based on Matrix-Monotonic Optimization," IEEE Transactions on Signal Processing, vol.67, no.13, pp.3531-3546, 2019. https://doi.org/10.1109/TSP.2019.2912833
- T. Lin, J. Cong, Y. Zhu, J. Zhang, and K. B. Letaief, "Hybrid Beamforming for Millimeter Wave Systems Using the MMSE Criterion," IEEE Transactions on Communications, vol.67, no.5, pp.3693-3708, 2019. https://doi.org/10.1109/TCOMM.2019.2893632
- O. Alluhaibi, Q. Z. Ahmed, E. Kampert, M. D. Higgins and J. Wang, "Revisiting the EnergyEfficient Hybrid D-A Precoding and Combining Design for mm-Wave Systems," IEEE Transactions on Green Communications and Networking, vol.4, no.2, pp.340-354, 2020. https://doi.org/10.1109/TGCN.2020.2972267
- Y. Cai, F. Cui, Q. Shi, Y. Wu, B. Champagne, and L. Hanzo, "Secure Hybrid A/D Beamforming for Hardware-Efficient Large-Scale Multiple-Antenna SWIPT Systems," IEEE Transactions on Communications, vol.68, no.10, pp.6141-6156, 2020. https://doi.org/10.1109/TCOMM.2020.3006913
- T. Peken, S. Adiga, R. Tandon, and T. Bose, "Deep Learning for SVD and Hybrid Beamforming," IEEE Transactions on Wireless Communications, vol.19, no.10, pp.6621-6642, 2020. https://doi.org/10.1109/TWC.2020.3004386
- L. Sun et al., "A Robust Secure Hybrid Analog and Digital Receive Beamforming Scheme for Efficient Interference Reduction," IEEE Access, vol.7, pp 22227-22234, 2019. https://doi.org/10.1109/ACCESS.2019.2899154
- T. Shen, Y. Lin, J. Zou, Y. Wu, F. Shu and J. Wang, "Low-Complexity Leakage-Based Secure Precise Wireless Transmission With Hybrid Beamforming," IEEE Wireless Communications Letters, vol.9, no.10, pp.1687-1691, 2020. https://doi.org/10.1109/LWC.2020.3001150
- D. Wu et al., "Secure Hybrid Analog and Digital Beamforming for mmWave XR Communications With Mixed-DAC," IEEE Journal of Selected Topics in Signal Processing, vol.17, no.5, pp.995- 1006, 2023. https://ieeexplore.ieee.org/document/10192330 10192330
- X. He and J. Wang, "QCQP With Extra Constant Modulus Constraints: Theory and Application to SINR Constrained Mmwave Hybrid Beamforming," IEEE Transactions on Signal Processing, vol.70, pp.5237-5250, 2022. https://doi.org/10.1109/TSP.2022.3217672
- L. Yang, J. Wang, X. Xue, J. Shi and Y. Wang, "Secure Hybrid Beamforming for IRS-Assisted Millimeter Wave Systems," IEEE Transactions on Wireless Communications, vol.22, no.8, pp.5111-5128, 2023. https://doi.org/10.1109/TWC.2022.3231724
- Y. Hu, H. Qian, K. Kang, X. Luo and H. Zhu, "Joint Precoding Design for Sub-Connected Hybrid Beamforming System," IEEE Transactions on Wireless Communications, vol.23, no.2, pp.1199-1212, 2024. https://ieeexplore.ieee.org/document/10161727 10161727
- M. R. Akdeniz, Y. Liu, M. K. Samimi, S. Sun, S. Rangan, T. S. Rappaport, and E. Erkip, "Millimeter Wave Channel Modeling and Cellular Capacity Evaluation," IEEE Journal on Selected Areas in Communications, vol.32, no.6, pp.1164-1179, 2014. https://doi.org/10.1109/JSAC.2014.2328154
- M. M. Naghsh, M. Masjedi, A. Adibi, and P. Stoica, "Max-Min Fairness Design for MIMO Interference Channels: A Minorization-Maximization Approach," IEEE Transactions on Signal Processing, vol.67, no.18, pp.4707-4719, 2019. https://doi.org/10.1109/TSP.2019.2929470
- L. Zhang, Y. Wang, W. Tao, Z. Jia, T. Song, and C. Pan, "Intelligent Reflecting Surface Aided MIMO Cognitive Radio Systems," IEEE Transactions on Vehicular Technology, vol.69, no.10, pp.11445-11457, 2020. https://doi.org/10.1109/TVT.2020.3011308
- S. H. Hong, J. Park, S-J. Kim, and J. Choi, "Hybrid Beamforming for Intelligent Reflecting Surface Aided Millimeter Wave MIMO Systems," IEEE Transactions on Wireless Communications, vol.21, no.9, pp.7343-7357, 2022. https://doi.org/10.1109/TWC.2022.3157880
- A. Kaushik, J. Thompson, E. Vlachos, C. Tsinos, and S. Chatzinotas, "Dynamic RF Chain Selection for Energy Efficient and Low Complexity Hybrid Beamforming in Millimeter Wave MIMO Systems," IEEE Transactions on Green Communications and Networking, vol.3, no.4, pp.886-900, 2019. https://doi.org/10.1109/TGCN.2019.2931613