Frequency-channelized Mismatch-shaped Quadrature Data Converters for Carrier Aggregation in MU-MIMO LTE-A

Sandipan Kundu¹, Member, IEEE, Subhanshu Gupta², Senior Member, IEEE, David J. Allstot³, Life Fellow, IEEE and Jeyanandh Paramesh⁴, Senior Member, IEEE

Abstract—Emerging wireless standards aggregate information by selecting combinations of contiguous or non-contiguous channels, thereby enabling wider transmission bandwidths, and hence, higher data rates. Frequency-interleaved analog-to-digital conversion (FI-ADC) is an attractive emerging technique for carrier aggregation receivers because it facilitates an efficient way to dynamically vary the receiver bandwidth in order to address the many possible channel combinations. Compared to their time-interleaved counterparts, the specifications of the samplers in the parallel channels in FI-ADCs are significantly relaxed, thereby resulting in lower overall power consumption in the receiver. This work extends the FI-ADC concept to the quadrature frequency-interleaved oversampled data converter (QFI-ADC) to achieve greater aggregate data rates. Previously, digital-to-analog converter (DAC) and other inter-channel mismatches have limited the performance of QFI-ADCs. In this paper, we propose a low-complexity element rotation algorithm (ERA) to mitigate DAC mismatches. The ERA is synthesized from the corresponding mismatch transfer function using a rigorous mathematical procedure which is shown to be applicable generally to low-pass, high-pass, band-pass and quadrature ERAs. Simulations confirm that the resulting low-complexity quadrature ERAs have advantages over previously proposed approaches in both performance and hardware complexity. An additional gain calibration technique alleviates image folding due to gain and timing mismatches between the quadrature DAC elements, which yields higher SNDR.

Index Terms—Quadrature mismatch shaping, sigma-delta ADC, dynamic element matching, element rotation algorithm, gain calibration.

I. INTRODUCTION

Cloud technologies, the internet-of-things (IoT) and emerging multi-user multi-input multi-output (MU-MIMO) standards, such as LTE-A and IEEE 802.16m, have increased data rate demands, which exacerbates crowding of the radio spectrum and complicates physical layer design. Spectrum usage efficiency has improved, but this alone cannot provide the targeted data rates. Carrier aggregation (CA) is proposed to overcome this limitation; it enables the simultaneous use of more than one carrier frequency (i.e., more than one channel), which increases the overall bandwidth. The channels may be selected from among contiguous or non-contiguous intra- or inter-band options. Current state-of-the-art implementations require multiple transceivers, which adds cost and complexity, particularly to user terminals where space and power dissipation are key considerations.

Wide-band ADCs have been identified as the primary performance limiters, especially in non-contiguous inter-band scenarios. Parallel channel sampling based on the generalized sampling theory is an effective method for high-speed A/D conversion. Parallel architectures including time-interleaved ADCs (TI-ADC) [1]-[6], quadrature mirror filter bank (QMFB) [7] and hybrid filter bank (HFB) structures [8]-[10], and hybrid frequency conversion architectures [11], have demonstrated impressive performance because of their simplicity in achieving parallelism and equalization. Many of these architectures (e.g., Fig. 1(a)) require high-speed sample-and-hold (S/H) stages, wherein gain and offset mismatches between parallel time-interleaved channels cause distortion products, which necessitate spectral reconstruction using digital calibration techniques [12], [13].

A frequency-interleaved analog-to-digital converter (FI-ADC) relaxes the requirements on each S/H stage by sampling the wideband input signal after it is decomposed into sub-bands [8], [11], [14] (Fig. 1(a)). This approach increases flexibility in the frequency domain and reduces sensitivity to timing mismatches between parallel S/H stages. In the time-interleaved approach, the Nyquist-rate ADCs must exhibit maximum precision at all frequencies even though it is required at only the highest signal frequency. On the other hand, noise-shaping Σ-Δ converters feature an inherent trade-off between sampling frequency and resolution, which enables energy optimization of sampling rate versus quantization accuracy; i.e., only the frequencies of interest have suppressed quantization noise [15], [16]. A quadrature Σ-Δ modulator enables digitization of the in-phase and quadrature-phase components using two identical ADCs. The resulting complex converter architecture features several performance advantages over its real counterpart including immunity to DC offsets and flicker noise [17]-[19]. A parallel bank of quadrature ADCs enables the frequency-agile reception of narrowband signals in adjacent (or non-contiguous) concurrent bands (Fig. 1(a)) [8], [11], [14]. A drawback of complex-valued signal processing is I/Q imbalance caused by coefficient mismatches between the in-phase and quadrature-phase channels. This imbalance causes

¹ S. Kundu was with the Dept. of ECE, Carnegie Mellon Univ., Pittsburgh, PA. He is now with Intel Corp., Hillsboro, OR.
² S. Gupta is with the School of EECS, Washington State Univ., Pullman, WA.
³ D.J. Allstot is with the Dept. of EECS, Univ. of California, Berkeley, CA.
⁴ J. Paramesh is with the Dept. of ECE, Carnegie Mellon Univ., Pittsburgh, PA. This work was supported in part by the National Science Foundation under grant CCF-1314876.
in-band image-frequency interference which degrades the overall signal-to-noise ratio (SNR) of the receiver. In addition, process, voltage, and temperature (PVT) variations between the analog and the digital filter coefficients of the QFI-ADC limit the reconstruction accuracy, which introduces aliasing artifacts and reduces its advantages over TI-ADC architectures. Recent works have described a digital post-compensation scheme to mitigate mirror-frequency interference and gain/offset mismatches for the synthesis filter bank for quadrature ADCs [11], [20]. Although impressive results are achieved, digital calibration is necessary and the higher throughput rates foreseen for wideband next-generation architectures may require high-\(f_T\) CMOS processes. In this paper, a low-latency mismatch-shaping algorithm and a low-power multi-rate least-mean-squares (LMS) digital calibration scheme [21] are combined to achieve greater efficiency. The mismatch-shaping algorithm, which features low hardware complexity, is embedded in the quadrature ADC to mitigate gain and offset \(I/Q\) mismatches as well as the intra- and inter-channel mismatches between frequency-interleaved channels. The low-power LMS-based digital calibration scheme alleviates aliasing effects from the mirror frequencies.

II. FREQUENCY CHANNELIZATION ARCHITECTURE AND QUADRATURE BAND-PASS \(\Sigma\Delta\) ADC

A quadrature frequency-channelized receiver with several ADC options is shown in Fig. 1(a). This work aims at reducing the effects of mismatches on the QFI-ADC of Fig. 1(b), wherein a wideband signal is input to \(M = 4\) channels centered symmetrically about \(f_B\). Thus, the aggregate input bandwidth, \(\Omega_B\), is the sum of the individual channel bandwidths.

Each of the four channels comprises two paths that frequency translate the input signal centered at carrier frequency \(f_{I1}, f_{I2}, f_{I3}\), or \(f_4\) to corresponding in-phase and quadrature-phase components. The quadrature signals are analog band-pass filtered by \(H(s)\) and digitized using a quadrature \(\Sigma\Delta\) converter. The oversampled digital outputs are down converted to baseband, bandpass filtered, decimated by \(M\), and recombined to reconstruct a digital representation of the wideband analog input signal.

When fully deployed, the system functions as an ADC digitizing a wideband signal of bandwidth \(\Omega_B\); alternatively, some of the parallel channels may be turned off when less carrier aggregation is used. The \(M\)-channel architecture of Fig. 1(b) has \(2M\) identical band-pass filters and \(M\) identical quadrature band-pass oversampled \(\Sigma\Delta\) converters. Each converter operates at a sampling frequency of \(2\text{OSR}(\Omega_B/M)\), where \(\text{OSR}\) is the oversampling ratio.

The analysis of complex data converters was treated by Tang, et al. [18] wherein a third-order complex band-pass modulator Fig. 2(a)) was synthesized from a third-order real band-pass modulator with each real integrator, \(H_N(z) = z^{-1}/(1 - z^{-1})\) replaced by its complex counterpart, \(H_C(z) = Pz^{-1}/(1 - Pz^{-1})\), with \(P = \exp(2\pi f_{fB}/f_k)\) as shown. Thus,

\[
Y = Y_I + jY_Q = (X_I + jX_Q)z^{-3} + (1 - Pz^{-1})^{3}(Q_I + jQ_Q)
\]  

(1)
has a real signal transfer function, \( STF = 1 \), and a complex noise transfer function, \( NTF = N_r + jN_i = \left(1 - Pz^{-1}\right)^{\delta} \).

The performance of complex modulators is sensitive to \( I/Q \) mismatches [18], which introduce differential-mode errors in the signal and noise transfer functions, which, in turn, corrupt the signal band and degrade SNR. Although SNR degradation can be reduced using a DSP solution [22], the concomitant folding of quantization noise into the signal band is problematic. This effect is mitigated by cascading an oversampled complex modulator in the first stage with a Nyquist-rate pipeline converter in the second stage followed by adaptive noise cancellation digital logic [18].

This paper demonstrates techniques that shape DAC integral non-linearity errors similarly as the NTF of a \( \Sigma\Delta \) loop shapes quantization errors [23]-[27]. The mismatch transfer function (MTF) is made similar to the NTF to maximize SNR. One mismatch shaping method is the element rotation algorithm (ERA), implemented conceptually in Fig. 2(b). Other algorithms for high-order mismatch shaping are vector-based [26], [28] and tree-structured/butterfly-shuffler DAC-based methods [23], [27], [29], [30]. The hardware requirements for some of these techniques are compared below.

Section III gives a brief survey of mismatch shaping techniques for quadrature band-pass and frequency-interleaved ADCs. Section IV introduces a general approach for deriving an ERA, and extends it to quadrature applications. Section V describes mismatch shaping in an FI-ADC. Simulation results for a single-channel quadrature ADC and a two-channel FI-ADC are presented in Section VI, and hardware versus performance trade-offs for state-of-the-art ERAs versus the proposed algorithm are given in Section VII. Section VIII considers the inverse objective of finding the MTF for a given ERA, and Section IX concludes this contribution.

III. MISMATCH SHAPING

The proposed MTF is evaluated first for a single-channel quadrature band-pass \( \Sigma\Delta \) converter [19] and then for a dual-channel FI-ADC. A conceptual model of the mismatch-shaping ERA is depicted in Fig. 2(b). The ADC output, \( c_i(n) \), is pre-filtered by the inverse of MTF with digital output, \( c_d(n) \). The DAC output, \( c_d(n) \), is then post filtered by MTF, with analog output, \( c_f(n) \). Ideally, the input signal is unaltered, but errors introduced by the DAC, \( \delta \), are shaped by the MTF. A literal implementation of Fig. 2(b) is problematic because the input to the DAC, \( c_f(n) \), may grow without bound for certain MTFs.

Several approaches are functionally equivalent to the conceptual implementation including ERAs for low-pass [31], [32], real band-pass [33] and quadrature modulators [34], [35]-[36]. Maurino and Papavassiliou [35] used quadrature mixers to frequency translate the input/output signals into low-pass mismatch shaping DACs with good results. But, that approach exhibits high latency and is relatively complex. Prior-art quadrature ERAs require less hardware but do not faithfully represent the quadrature MTF [34], [36].

The systematic derivation of an ERA for a desired MTF starting from Fig. 2(b), or from known knowledge, has not been developed for advanced architectures such as a quadrature FI-ADC. This paper extends the approach in [19] to synthesize an optimum quadrature ERA with less hardware than more general mismatch shaping schemes [24]. Behavioral simulations confirm the advantages of the proposed ERA over the previous implementations used for single- or multi-channel frequency-channelized quadrature receivers [20], [34]. The general approach is also applicable to low-pass, high-pass and real band-pass architectures [19].

Energy efficiency is increased using a gain calibration technique that works with the multi-band quadrature ERA to mitigate image-frequency interference; specifically, a multi-rate implementation of a sign-sign LMS algorithm is employed [21]. The extra DSP circuitry adds latency in the \( \Sigma\Delta \) loop, which reduces the available amplifier settling time. The additional switches in the unit cells reduce the bandwidth and increase layout challenges. These issues are critical in the design of high-speed wideband multi-band \( \Sigma\Delta \) converters for LTE-Advanced and other future WiFi standards. The combination of analog mismatch shaping and digital calibration enables complex quadrature shaping with relatively low hardware complexity.

IV. ELEMENT ROTATION ALGORITHMS

A. General Approach

An MTF\((z) = 1 + bz^{-k}\), where \( k \) is an integer and \( b \in \{\pm1, \pm j\} \), is chosen so that the DAC can be realized using two-level elements. The output in Fig. 2(b) is [19]:

Fig. 2. (a) The third-order \( \Sigma\Delta \) modulator used in this work, and (b) a conceptual and (c) a detailed realization of the mismatch-shaping ERA; (d) signal distribution before down conversion.
Table I: Quadrature ERA calculation: Step 1

<table>
<thead>
<tr>
<th>Cycle</th>
<th>$s_j + jt_l$</th>
<th>$c_j = s_j + jt_l$</th>
</tr>
</thead>
<tbody>
<tr>
<td>$n$</td>
<td>$u_1 + jv_1$</td>
<td>$(V - u_1) + j(U + v_1)$</td>
</tr>
<tr>
<td>$n+1$</td>
<td>$u_2 + jv_2$</td>
<td>$-(U + v_1 - u_2) - j(V - u_1 - v_2)$</td>
</tr>
<tr>
<td>$n+2$</td>
<td>$u_3 + jv_3$</td>
<td>$(V - u_1 - v_2 + u_3) - j(U + v_1 - u_2 - v_3)$</td>
</tr>
<tr>
<td>$n+3$</td>
<td>$u_4 + jv_4$</td>
<td>$(U + v_1 - u_2 - v_3 + u_4) + j(V - u_1 - v_2 + u_1 + v_4)$</td>
</tr>
</tbody>
</table>

Quadrature ERA calculation: Step 2

<table>
<thead>
<tr>
<th>$s_j$</th>
<th>$I_d$</th>
</tr>
</thead>
<tbody>
<tr>
<td>$-\sum_{i=1}^{n} (1+\delta_i) + \sum_{i=1}^{n+1} (1+\delta_i)$</td>
<td>$\sum_{i=1}^{n} (1+\delta_i) + \sum_{i=1}^{n+1} (1+\delta_i)$</td>
</tr>
<tr>
<td>$=u_1 + \sum_{i=1}^{n} \delta_i$</td>
<td>$=v_1 + \sum_{i=1}^{n} \delta_i$</td>
</tr>
<tr>
<td>DAC1-index moves from $V$ to $V-u_1+1$; i.e.</td>
<td>DAC2-index moves from $U+1$ to $U+v_1+1$; i.e.</td>
</tr>
<tr>
<td>$-\sum_{i=1}^{n} (1+\delta_i) + \sum_{i=1}^{n+1} (1+\delta_i)$</td>
<td>$-\sum_{i=1}^{n} (1+\delta_i) + \sum_{i=1}^{n+1} (1+\delta_i)$</td>
</tr>
<tr>
<td>$=u_2 + \sum_{i=1}^{n} \delta_i$</td>
<td>$=v_2 + \sum_{i=1}^{n} \delta_i$</td>
</tr>
<tr>
<td>DAC2-index moves from $U+v_1$ to $U+v_1-u_2+1$; i.e.</td>
<td>DAC1-index moves from $V-u_1$ to $V-u_1-v_2+1$; i.e.</td>
</tr>
<tr>
<td>$-\sum_{i=1}^{n} (1+\delta_i) - \sum_{i=1}^{n+1} (1+\delta_i)$</td>
<td>$-\sum_{i=1}^{n} (1+\delta_i) - \sum_{i=1}^{n+1} (1+\delta_i)$</td>
</tr>
<tr>
<td>$=u_3 + \sum_{i=1}^{n} \delta_i$</td>
<td>$=v_3 + \sum_{i=1}^{n} \delta_i$</td>
</tr>
<tr>
<td>DAC1-index moves from $V-u_1-v_2+1$ to $V-u_1-v_2+u_3$; i.e.</td>
<td>DAC2-index moves from $U+v_1-u_2-v_3+u_1$ to $U+v_1-u_2-v_3+u_3$; i.e.</td>
</tr>
<tr>
<td>$\sum_{i=1}^{n} (1+\delta_i) - \sum_{i=1}^{n+1} (1+\delta_i)$</td>
<td>$\sum_{i=1}^{n} (1+\delta_i) - \sum_{i=1}^{n+1} (1+\delta_i)$</td>
</tr>
<tr>
<td>$=u_4 + \sum_{i=1}^{n} \delta_i$</td>
<td>$=v_4 + \sum_{i=1}^{n} \delta_i$</td>
</tr>
<tr>
<td>DAC2-index moves from $U+v_1-u_2-v_3+u_1$ to $U+v_1-u_2-v_3+u_3$; i.e.</td>
<td>DAC1-index moves from $V-u_1-v_2+u_3$ to $V-u_1-v_2+u_3+v_3$; i.e.</td>
</tr>
</tbody>
</table>

$c_4(n) = mtf(n) + c_3(n)$

A corresponding ERA is deduced using a two-step process:

- **Step 1**: Ideally, $c_d(n)$ is equal to $c_f(n)$. Thus, (2) is solved for $c_f(n)$ terms of the input sequence:

  \[ c_f(n) = c_f(n) - bc_f(n-k) \rightarrow c_f(n) - bc_f(n-k) \]  

- **Step 2**: Eqn. (3) is used to calculate the pointer, $c_f(n)$, at each time step using the input values, $c_f$. Assume an initial value of $c_f(n) = U$. Based on the value of $c_f(n)$ (e.g., $u_1$), the selected DAC elements start from $U \pm u_1$ where each selected cell adds $1$ to the output for the $(n+1)^{th}$ time step. Assuming a random deviation, $\delta$, from the ideal for each cell, an additional error term, $+\text{sgn}(u_1)(\delta)$, is also added which eventually limits the performance of the mismatch-shaping algorithm. By evaluating (2) for several time steps, the usage pattern of the DAC elements is determined, which yields the element usage algorithm. The rotational part of the ERA arises from realizing the linear array in a circular form with a limited number of DAC cells. After the element usage is determined, (2) is used to find the error introduced by DAC mismatches at each time step.

**B. Mismatch-shaping ERA for Quadrature Band-pass ADC**

The approach described above is now extended to an ERA for the quadrature MTF(z) = 1 - $z^{-1}$ which has a zero only at $z^{-1}$ (and not at $z^{-1}$). Eqsns. (2) and (3) are re-written for the quadrature case of Fig. 2(c) as [19]:

\[ s_f(n) = s_f(n) + t_f(n-1) \]

\[ t_f(n) = t_f(n) - s_f(n-1) \]

(4) (5)

where $s_f(n) = s_f(n), t_f(n) = t_f(n), s_f(n) = t_f(n)$, and $s_f(n) = s_f(n)$. 

- **Step 1**: Values of the complex-valued pointer, $c_f = s_f + jt_f$, are calculated using (5) for the complex-valued inputs, $c_f = s_f + jt_f$ (Table 1).

- **Step 2**: The real and imaginary parts of $c_f(n)$ are first evaluated using Table 1 and then (5) to sum the contributions of the DAC elements using the indices given by $s_f$ and $t_f$. Note that there are two DACs (DAC1 and DAC2), and, hence, two different sets of unit cells with mismatches, $\delta_f$ and $\delta_f$. An additional assumption, $\delta_f = \delta_f$ (for all i), has been made in Table 2 in Table 1. This assumption states that the DAC1 and DAC2 paths are identical and only inter-element mismatches within each DAC are addressed by the quadrature ERA. This assumption is necessary to avoid contention—the simultaneous use of a single DAC element by both DAC1 and DAC2. It is essential to note that this assumption is needed only to derive the proposed quadrature ERA; it also works in realistic cases where $\delta_f \neq \delta_f$ so long as digital calibration is added to equalize the random DAC gain mismatches. This issue is addressed later.

Step 2 in Table 1 yields the required element usage patterns as exemplified in Fig. 3(a). Note that the DAC indices are assumed to be increasing (decreasing) to the right (left). For example, when the DAC1 index moves from $V$ in cycle $n$ to $V-u_1+1$ in the next cycle, the index moves to the left (since $V-u_1+1 \leq V$) as indicated by the left-pointing arrow.

Two essential aspects of the proposed ERA should be emphasized: The element usage in each DAC changes direction after every two cycles, and, thus repeats after four cycles. And, there is a one cycle time difference (i.e., a 90° phase difference).

between the direction changes in the two DACs. This is a key difference from previous quadrature ERAs [35]-[36]. Another key aspect of this ERA is the per-cycle DAC swapping; i.e., if DAC1 (DAC2) is used in the $f$-path ($Q$-path) in the current cycle,
DAC2 (DAC1) will be used in the I-path (Q-path) in the next cycle. Further insight is gained from Fig. 3(b) where a critical frequency is identified from the DAC cell selection direction of movement for two traditional ERAs and the proposed quadrature ERA. Specifically, a low-pass ERA has a zero at DC (no direction changes in dom) and a high-pass ERA has a zero at f/2 (dom changes every clock cycle). In the proposed ERA algorithm, dom alternates after every two clock cycles which corresponds to a zero in the MTF at f/4.

C. Proposed Mismatch Shaping ERA for FI-ADC

The quadrature ERA algorithm is now extended to FI-ADCs. Analog mismatch shaping mitigates intra-channel mismatches because the ERA swaps the roles of the DACs from one cycle to the next. However, this creates a cycle-to-cycle gain mismatch; i.e., if G_{DAC1} and G_{DAC2} are the DAC gains with G_{DAC1} ≠ G_{DAC2}, then G_{DAC1} + jG_{DAC2} in one cycle is not equal to G_{DAC2} + jG_{DAC1} in the next. Thus, LMS digital gain calibration is used so that G_{DAC1} ≈ G_{DAC2}.

For a multi-channel FI-ADC, it is necessary to extend these techniques for more than two DACs. For example, in a two-channel implementation (at F_C1 and F_C2), there are four DACs with independent mismatches; i.e., G_{DAC1,F1} ≠ G_{DAC2,F1} ≠ G_{DAC1,F2} ≠ G_{DAC2,F2}. The proposed ERA shapes these intra-DAC mismatches for both channels with an MTF = (1 - j^-4).

As a consequence of the extended ERA, and before digital calibration, the two-channel FI-ADC has different DAC gains in alternate odd/even cycles (Table II). Digital LMS gain calibration makes G_{DAC1,F1} ≈ G_{DAC2,F1} ≈ G_{DAC1,F2} ≈ G_{DAC2,F2}.

<table>
<thead>
<tr>
<th>Clock Cycle</th>
<th>Effective Gain</th>
</tr>
</thead>
<tbody>
<tr>
<td>Even</td>
<td>G_{DAC1,F1} + jG_{DAC2,F1}</td>
</tr>
<tr>
<td></td>
<td>G_{DAC1,F2} + jG_{DAC2,F2}</td>
</tr>
<tr>
<td>Odd</td>
<td>G_{DAC1,F1} + jG_{DAC1,F2}</td>
</tr>
<tr>
<td></td>
<td>G_{DAC2,F2} + jG_{DAC1,F2}</td>
</tr>
</tbody>
</table>

V. CLOCK JITTER AND INTER-CHANNEL MISMATCHES

All of the frequency-interleaved channels are sampled using clock signals derived from the same reference; thus, there is less sensitivity to clock jitter than in a TI-ADC [11]. However, the local oscillator used to perform the down-conversion (Fig. 1(b))
must generate multi-band quadrature clock signals with low jitter. Such LO jitter can be modeled as sinusoidal tones as described by Ding, et al. [37]

\[ \text{LO}_0(t) = e^{-j \frac{2 \pi t}{T} \cdot \text{jitter}(t)} \]  

(6)

An analysis of the effects of jitter for square-wave-like LO generators used with harmonic-reject mixers in frequency-channelized receivers is detailed by Forbes and Gharpurey [38].

The mismatch noise shaping due to the odd/even cycle swapping of the DACs along with the digital LMS gain calibration techniques described above also can be used to correct input I/Q timing errors. That is, the effects of mismatches at the inputs to the quadrature ADC are analyzed with the same mismatch errors moved into the feedback DACs.

Suppose the even samples (I-phase) in Fig. 1(b) occur at the correct times while the odd samples (Q-phase) have a timing error, \( \theta \). Begin the analysis of the effects of this timing error when multiplexing the even and odd streams by considering an input signal sampled at \( f_0 \):

\[ S(nT_s) = \cos(2\pi n f_0 / f_s) \text{ where } f_s = 1 / T_s \]  

(7)

Next, extract the odd and even components of (7) and up sample both by 2X. Only the terms between 0 and \( 2\pi \) are considered in the extracted terms assuming \( f_0 < f_s / 2 \); it is important to note that two of the terms are due to aliasing. With the assumed phase error between even and odd samples, \( \theta \), where \( \theta = 2\pi f_0 \Delta t \), the even and odd parts are

\[ S_e(nT_s) = 0.5 \cdot [e^{j 2\pi n f_0 / f_s} + e^{j 2\pi n f_0 / f_s + \theta}] \]  

(8)

\[ S_o(nT_s) = 0.5 \cdot [e^{j 2\pi n f_0 / f_s} + e^{j 2\pi n f_0 / f_s - \theta}] \]  

Thus, the image terms are aliases of the desired signal. Given \( S_e(n-1/2T_s) = S_o(n-1/2T_s) \), the equivalent sampled output at \( t = nT_s \) is

\[ S(nT_s) = S_e(nT_s) + S_o[(n-1)T_s] \]

where

\[ S_o[(n-1)T_s] = 0.5 \cdot [e^{j 2\pi n f_0 / f_s + \theta} + e^{j 2\pi n f_0 / f_s - \theta}] \]

\[ = 0.5 \cdot [e^{j 2\pi n f_0 / f_s + \theta} + e^{j 2\pi n f_0 / f_s - \theta}] - \pi + \theta \]

Combining \( S_e(nT_s) \) and \( S_o(n-1)T_s \) with \( e^{j \theta} \sim (1 + j\theta) \) (for small \( \theta \)) yields

\[ S(nT_s) = 0.5 \cdot e^{j 2\pi n f_0 / f_s} [2 + j\theta] + 0.5 \cdot e^{j 2\pi n f_0 / f_s} [j\theta] \]

(9)

The second term in (9) dominates the image signal error owing to the phase mismatch between the even and odd sampling instants. The resulting total distortion is \( \theta \cdot 0.5 = \pi \cdot f_0 \cdot \Delta t \). Also note that the image is rotated by \( \pi / 2 \) relative to the image present in the odd-even sequences. Thus, this error can be corrected using either a narrow-band I/Q digital calibration technique [11], [20], or the same digital LMS gain calibration as shown above in Section VI.B.

\[ \begin{array}{c}
0.5 \cdot e^{j 2\pi n f_0 / f_s} [2 + j\theta] + 0.5 \cdot e^{j 2\pi n f_0 / f_s} [j\theta] \\
\end{array} \]

(9)

\[ \text{Fig. 4. (a) Comparison of the reconstructed noise floors: no ERA (brown), ERA [36] (blue), and proposed ERA (black); (b) combined output of the two-channel FI-ADC with 0.5}\% mismatch. An 8192-point FFT was performed to capture the entire baseband bandwidth of 50 MHz.} \]

VI. SIMULATION RESULTS

Experimental results are obtained using Matlab/Simulink® simulators with the proposed ERA applied to the 3rd-order, 4-bit, quadrature band-pass \( \Sigma - \Delta \) ADC of Fig. 2(a). The same architecture is then extended to a two-channel FI-ADC as in Fig. 1(b). Detailed simulations are performed to illustrate the effects of I/Q gain and phase mismatches along with the improved performance obtained using digital gain calibration. An example frequency allocation of intra-band contiguous channels is shown in Fig. 2(d). For this work, \( f_1 \) and \( f_2 \) represent the center frequencies of two adjacent 25 MHz bands at the output of the first RF down-conversion mixer. Two CW tones are chosen at arbitrary frequencies within these bands given by \( f_{\text{in}} = (k / 2^N) f_c \), where \( N \) is the \( N \)-point Fast Fourier Transform (FFT), and \( k \) is chosen to be a prime number (where \( k < 2^N \)) to avoid repetitive patterns in the output, and \( f_c \) is the ADC sampling frequency.

A. Proposed ERA for Two-channel Quadrature ADC

Behavioral simulations for the 3rd-order 4-bit two-channel FI-ADC based on the architecture in Fig. 1(b) are shown in Fig. 4. The digital reconstruction comprises digital multipliers, adders, up-samplers and FIR filters. Selecting the FI-ADC
center frequency as \( f/4 \) enables multiplication coefficients for the digital mixers of either 0 or \( \pm 1 \). Thus, no multipliers are required for the digital mixers. Quadrature bandpass filters are the main source of implementation complexity in the digital block. An 8th-order infinite-impulse-response reconstruction filter is used for power-efficiency in MATLAB’s Filter Design Analysis and automatic synthesis tools.

For simulations, two contiguous 20 MHz bands with an aggregate bandwidth of 40 MHz and a transition band of 5 MHz were chosen. The chosen sub-bands are centered at \( f_1 = 212.5 \) MHz and \( f_2 = 237.5 \) MHz. Again, the third-order quadrature band-pass \( \Sigma \Delta \) modulator of Fig. 2(a) is used. Each 20 MHz input band is mixed down to the same IF frequency, \( f_{IB} = 100 \) MHz, which enables the use of the same quadrature converter design for each. Fourth-order band-pass channel-select filters are used to provide > 40 dB adjacent channel attenuation to negate any in-band image artifacts. Note that a mirror image implementation would allow even lower filter orders due to the intrinsic band-pass nature of its signal transfer function as shown by Schreier and Temes [28].

The power spectral density (PSD) plots show the simulated noise floor (Fig. 4(a)) and the reconstructed output of the two-channel QFI-ADC (Fig. 4(b)). The image frequencies are intentionally notched out by the DSP to allow comparison of the noise floors for the three cases: no ERA, ERA in [36] and the proposed ERA. It can be seen that for the same mismatch (1%), the noise floor after decimation for the proposed ERA has a remarkable improvement over the other techniques.

A significant decrease in the noise floor and a corresponding increase in \( SNDR \) is achieved using the proposed ERA for a quadrature band-pass \( \Sigma \Delta \) ADC assuming equal inter-element mismatches (i.e., \( \delta_{ii} = \delta_{ji} \) for all \( i \)). The \( SNDR \) improvement is maintained for a wide range of \( OSR \) values [19]. With \( \delta_{ij} \neq \delta_{ji} \) for all \( i \), an image tone appears at \( [f_0 - f_i] \) due to the DAC1/DAC2 swapping. Its magnitude depends on the gain imbalance, so it can degrade the overall \( SNDR \) even though it and the noise floor are substantially reduced using the proposed ERA along with the digital LMS gain calibration presented in the next section. Note that this image tone also appears in a similar manner in previous ERAs [35]-[36]. Herein, mismatch alters the effective gains of DAC1 and DAC2 as highlighted by the analysis in the previous section.

Based on the results of Fig. 4(b), it is important to use layout techniques that maximize the matching of the DACs in an FI-ADC; such techniques are highlighted by Kundu, et al. [19]. Perfect matching is statistically impossible so the resulting residual image tone is removed using auxiliary \( m \)-bit DACs to equalize the gains of DAC1 and DAC2, as shown schematically in Fig. 3(c). The implementation of the LMS algorithm based on a multi-rate polyphase filter for \( \Sigma \Delta \) converters is presented in [21].

**B. Gain Mismatch Calibration**

For a multi-channel FI-ADC, it is necessary to calibrate the gains of each DAC against a common value to maximize overall \( SNDR \) and reduce/eliminate image tones from the signal bands. It is important to note that the calibration method mentioned here also corrects for \( I/Q \) phase mismatch (Eqn. (9)) and intra- and inter-channel mismatches in the signal path in addition to DAC gain mismatches by applying a gain correction only to the feedback DACs. However, for simplicity, we will refer to all of these mismatches collectively as “DAC gain mismatches” in this section since they all produce a tone at the same image frequency. An offline version of the gain calibration is implemented here, while an online version is equally feasible albeit with more hardware.

The calibration method requires choosing any one DAC as a reference DAC (\( DAC_{REF} \)) from among all of the 2M DACs available in the \( M \) FI-ADC channels. The calibration method sets the gains of all the other DACs equal to that of the reference DAC; i.e., \( G_{DACx,Fci} = G_{DACREF} \) for \( x = 1:2 \) and \( i = 1:M \). Without loss of generality, DAC1,F1 is chosen as the reference DAC. The calibration steps are outlined in Fig. 5.

The calibration method is demonstrated first through the simulation of a one-channel FI-ADC. Fig. 6(a) shows how \( |Gain_{DAC1,F1}(k) - Gain_{DACREF}| \) varies with \( G_{CORR,DAC1}(k) \) where \( G_{CORR,DAC1}(k) \) is the gain correction applied to DAC1 and \( k \) denotes the LMS iteration number. It also shows the settling behaviors of the gain correction and the error terms. Next, \( G_{CORR,DAC1} \) is fixed at 0.999654, the optimum value found, and the calibration continues to determine the optimum value for \( G_{CORR,DAC2} \). As \( G_{CORR,DAC2} \) moves towards its optimum value, the \( SNR \) (single channel) of the \( i \)-th channel increases as the image tone decreases. This behavior is illustrated in Fig. 6(b).

![Fig. 5. Proposed gain calibration algorithm steps.](image-url)
The LMS-optimized $SNR$ (96.27 dB at $OSR = 32$) is close to the single-channel maximum $SNR$ (97.71 dB) as determined by sweeping $G_{CORR,DAC1}$ (Fig. 6(b)), which verifies the efficacy of the proposed approach.

Fig. 7(a) shows simulated $SQNR$ in the presence of intra-DAC ($DAC_1$/$DAC_0$) mismatches for a two-channel FI-ADC. The effect of the image due to mismatch is studied for the three ERA cases as shown; generally, the proposed ERA performs better showing about a 15 dB increase in $SQNR$. For large mismatches, the $SQNR$ is only 8 bits, but an extra bit is recovered using the digital LMS gain calibration algorithm as shown in Fig. 7(b). The proposed scheme operates with mismatches up to 10% but the increase in the quantization noise floor and incomplete image cancellation limits the $SQNR$ with digital calibration at higher mismatches. Techniques for improvement are under investigation including the use of a multi-rate LMS algorithm to correct mismatches between the analysis and synthesis filters [20].

VII. HARDWARE OVERHEAD VS. PERFORMANCE

The extra digital signal processing in the DAC adds latency in the $\Sigma\Delta$ loop, which reduces the available settling time. Also, the additional switches within the DAC unit cells required in a complex DAC reduce bandwidth and create layout difficulties. These issues are critical in high-speed $\Sigma\Delta$ converters [39].

Table III compares the proposed ERA for a 4-bit $\Sigma\Delta$ ADC (including calibration hardware with $M = 8$) to a first-order reduced-complexity butterfly shuffler and a first-order tree-structured complex DAC [23], [30]. Note that hardware reduction techniques [40] have been applied to the latter, where appropriate, in order to make a fair comparison. The proposed scheme is superior in terms of hardware complexity, and, therefore, better suited to high-speed modulators while achieving similar first-order mismatch shaping performance. [35]-[36] have similar hardware complexity and latency, but their performance is inferior to the proposed scheme. Although it employs digital calibration to mitigate mismatches between the $I$ and $Q$ path gains, the significant performance advantages outweigh the additional hardware in many practical cases.

VIII. REVERSE PROBLEM: DEDUCING MTF FROM ERA

The significance of the previous systematic approach is enhanced by solving the reverse problem, i.e., deriving the $MTF$ from a given ERA. Note that it is not always possible to obtain
For a capacitive DAC, individual capacitors can no longer abut because the top plate is not a common node. To reduce cross element parasitic capacitance, they are spaced apart in the layout. For a current mode DAC, headroom is an issue because of the additional series switches. Due to another series switch in the main charging path, a closed-form expression from a given ERA. The approach is applied here to the quadrature ERA of Kurosawa, et al. [36] to derive its $MTF$, which shows its sub-optimal nature compared to the quadrature ERA proposed herein.

By retraicing the two-step process in reverse, one set of difference equations for every alternate step can be written for the quadrature ERA of [36]:

$$s_i(n) = s_i(n) + t_i(n-1)$$
$$t_i(n) = t_i(n) - s_i(n-1)$$
$$s_o(n) = s_o(n) - s_o(n-1) + 1$$
$$t_o(n) = t_o(n) + s_o(n-1) - 1$$

Eqn. (10) represents $MTF = 1 - j z^{-1}$ (like (4)) whereas (11) approximates $MTF = 1 + j z^{-1}$. Since the ERA uses the $MTFs$ on alternate cycles, it appears to realize a real band-pass $MTF$ (i.e., product of the two) albeit sub-optimally as shown by the results in Fig. 8; SNDR for an ADC with $NTF = 1 - q z^{-1}$ is about equal to that with $NTF = 1 + q z^{-1}$.

However, the proposed ERA truly implements $MTF = 1 - q z^{-1}$ (i.e., with a zero at $+f/4$) to achieve an increase of almost 25 dB in SNDR. However, it does not perform well, as shown when used with $NTF = 1 + q z^{-1}$ (i.e., with a zero at $-f/4$).

IX. CONCLUSIONS AND FUTURE WORK

This paper describes mathematical procedures for deriving DAC element rotation algorithms from given general mismatch transfer functions for quadrature $\Sigma$-$\Delta$ data converters in frequency channelized receivers. An optimal quadrature ERA is derived and applied in multi-channel frequency-channelized data converters for wide-bandwidth applications such as LTE-Advanced. Gain mismatches between $I$ and $Q$ DACs are mitigated by combining analog mismatch shaping and digital LMS gain calibration algorithms with minimal hardware overhead. The proposed system shows significant SNDR improvements over earlier approaches for both intra- and inter-DAC element mismatches. Future work will investigate reducing the image artifacts, executing the calibration loops with separate localized coefficients for each channel, the impact of convergence of the LMS calibration loops on the overall

### Table III. Performance versus Hardware Overhead

<table>
<thead>
<tr>
<th></th>
<th>Digital Transistor count</th>
<th>Digital Latency (FO4 inv. delays)</th>
<th>Additional Analog Complexity</th>
<th>Relative Analog Bandwidth</th>
</tr>
</thead>
<tbody>
<tr>
<td>This work</td>
<td>2592T (1X)</td>
<td>9 (1X)</td>
<td>Two 8-bit gain calibration DACs</td>
<td>1X</td>
</tr>
<tr>
<td>Butterfly [23]</td>
<td>9472T (3.6X)</td>
<td>18 (2X)</td>
<td>Two additional switches in each DAC unit cell(^a)</td>
<td>~0.5X(^b)</td>
</tr>
<tr>
<td>Tree-structured [23]</td>
<td>6992T (2.7X)</td>
<td>20 (2.2X)</td>
<td>Two additional switches in each DAC unit cell(^a)</td>
<td>~0.5X(^b)</td>
</tr>
</tbody>
</table>

\(^a\) For a capacitive DAC, individual capacitors can no longer abut because the top plate is not a common node. To reduce cross element parasitic capacitance, they are spaced apart in the layout. For a current mode DAC, headroom is an issue because of the additional series switches.

\(^b\) Due to another series switch in the main charging path, plus additional poles.

[a] The SNDR using the proposed approach and implementation in silicon. A detailed comparison of the hardware and performance trade-offs compared to conventional approaches confirms the reduced area and latency advantages that are attractive for wide-bandwidth low-latency closed-loop architectures.

### REFERENCES


Sandipan Kundu (M’13) received the B.Tech. Degree in Electronics and Electrical Communication Engineering in 2007 from the Indian Institute of Technology (IIT), Kharagpur, India and the Ph.D. Degree in Electrical and Computer Engineering in 2013 from Carnegie Mellon University, Pittsburgh, PA. He is currently with the Intel Corporation working on I/O circuit research. His research interests include RF, high-speed I/O and mixed-signal IC design. He was a recipient of the Analog Devices Outstanding Student Designer Award (2011).
Subhanshu Gupta (S’03-M’11-SM’16) received the B.E. Degree from the National Institute of Technology (NIT), Trichy, India, in 2002, and the M.S. and Ph.D. degrees from the University of Washington in 2006 and 2010, respectively. He is currently an Assistant Professor at Washington State University in the School of Electrical Engineering and Computer Sciences. He has worked at Maxim, National Semiconductor and Novell Netware in research and development positions. He was a recipient of the Analog Devices Outstanding Student Designer Award in 2008 and the IEEE RFIC Symposium Best Student Paper Award in 2011. His current research interests include broadband reconfigurable architectures for wireless backhaul and millimeter-wave applications, signal processing paradigms for heterogeneous networks and sensors and the design of RF and mixed-signal electronic circuits.

David J. Allstot (S’72-M’72-SM’83-F’92-LF’12) received the B.S., M.S., and Ph.D. degrees from the Univ. of Portland, Oregon State Univ., and the Univ. of California, Berkeley, respectively. He has held several industrial and academic positions. He was the Boeing-Egtvedt Chair Professor of Engineering at the Univ. of Washington from 1999 to 2012 and Chair of the Dept. of Electrical Engineering from 2004 to 2007. In 2012 he was a Visiting Professor of Electrical Engineering at Stanford University and since 2013 he is the MacKay Professor in Residence in the EECS Dept. at UC Berkeley.

Dr. Allstot has advised approximately 65 M.S. and 40 Ph.D. graduates, published more than 300 papers, and received several awards for outstanding teaching and research including the 1980 IEEE W.R.G. Baker Award, 1995 and 2010 IEEE Circuits and Systems Society (CASS) Darlington Award, 1998 IEEE International Solid-State Circuits Conference Beatrice Winner Award, 2004 IEEE CASS Charles A. Desoer Technical Achievement Award, 2005 Semiconductor Research Corporation Aristotle Award, 2008 Semiconductor Industries Association University Research Award, 2011 IEEE CASS Mac Van Valkenburg Award, and 2015 IEEE Trans. on Biomedical Circuits and Systems Best Paper Award. He has been active in service to the IEEE Circuits and Systems and Solid-State Circuits Societies throughout his career.

Jeyanandh Paramesh (SM’10) received the B.Tech. degree from IIT, Madras, the M.S degree from Oregon State University and the Ph.D. degrees from the University of Washington, all in Electrical Engineering. He is currently an Associate Professor of Electrical and Computer Engineering at Carnegie Mellon University. He has been employed at AKM Semiconductor (Analog Devices), Motorola and Intel in product development and research positions. His research interests include the design of RF and mixed-signal integrated circuits and systems for a wide variety of applications.