Study Notebook

OFDM: From Foundations to 6G

A Complete Study Notebook — 4G LTE · 5G NR · 6G Horizon

Featured Equation — OFDM Baseband Signal
s(t)  =  k  Sk  ·  ejk Δf · t
where  Sk = complex symbol on subcarrier k,  Δf = subcarrier spacing
Abstract

This notebook provides a rigorous end-to-end treatment of OFDM — from DFT/IDFT mathematical foundations through CP-OFDM, DFT-s-OFDM, multipath resilience, synchronization, channel estimation, LTE/5G NR air interface design, and the 6G outlook. Covers FFT/IFFT signal processing fundamentals, PAPR, frequency/timing offsets, flexible 5G numerology, and OFDM variants (FBMC, OTFS, f-OFDM), alongside a comprehensive pros/cons analysis. Each section builds systematically on the last, balancing mathematical rigour with practical implementation insight drawn from 3GPP specifications. Embedded interactive charts and worked numerical examples reinforce every major concept for active, hands-on learning.

11
Sections
20+
Interactive Charts
60+
Key Equations
4G→6G
Coverage
Table of Contents
Section Section
§1 Mathematical Foundations (DFT/IDFT/FFT/IFFT) §7 LTE (4G) OFDM & SC-FDMA
§2 CP-OFDM Signal Model & Spectrum §8 5G NR Flexible Numerology
§3 Cyclic Prefix & Multipath Robustness §9 OFDM Variants (FBMC/OTFS/f-OFDM)
§4 PAPR Problem & Reduction Techniques §10 Pros & Cons Analysis
§5 Frequency & Timing Synchronization §11 6G Outlook (THz / AI / ISAC)
§6 Channel Estimation & Equalization
Prerequisites

Core: Linear algebra (FFT, matrix operations), probability & statistics, basic DSP (sampling, convolution, z-transforms).

Recommended: 5G NR frame structure before §7 onward. Familiarity with complex baseband notation helpful throughout.

How to Use This Notebook
  • Sequential or jump reading: each section is self-contained; cross-references are provided for dependencies.
  • Offline mathematics: all equations render without an internet connection.
  • Interactive charts: hover, zoom, and toggle traces to explore signal and spectral plots.
  • Study questions: appear at each section end — attempt them before checking the worked solutions.
Version: 1.0 | Standards: 3GPP TS 38.211  /  36.211  /  38.300 | Date: June 2026 | Audience: RF / PHY Engineers
§1

Mathematical Foundations — DFT, IDFT, FFT, IFFT

1.1 From Fourier Series to the Discrete Fourier Transform

The journey toward the DFT begins with the observation that any sufficiently well-behaved periodic function can be decomposed into a weighted sum of harmonically related sinusoids — a fact first systematized by Joseph Fourier in 1822. Understanding the conceptual chain Fourier Series → Fourier Transform → DTFT → DFT shows that the DFT is not an ad-hoc engineering trick but the natural endpoint of discretizing and periodizing the continuous transform.

1.1.1 Fourier Series (continuous-time, periodic)

Let $x(t)$ be $T_0$-periodic. The Fourier Series (FS) representation is:

$$x(t) = \sum_{k=-\infty}^{\infty} c_k \, e^{\,j 2\pi k f_0 t}, \qquad f_0 = \frac{1}{T_0}$$

where the coefficients $c_k$ are obtained by the analysis integral:

$$c_k = \frac{1}{T_0}\int_{0}^{T_0} x(t)\, e^{-j 2\pi k f_0 t}\, dt$$

The set $\{e^{j 2\pi k f_0 t}\}_{k \in \mathbb{Z}}$ forms a complete orthonormal basis for $L^2([0,T_0])$:

$$\frac{1}{T_0}\int_{0}^{T_0} e^{j 2\pi k f_0 t} e^{-j 2\pi m f_0 t}\, dt = \delta[k-m]$$

The frequency spectrum is discrete (harmonics at $kf_0$) but the time signal is continuous.

1.1.2 Continuous-Time Fourier Transform (CTFT)

When $T_0 \to \infty$ (the signal is no longer constrained to be periodic, or equivalently the fundamental frequency $f_0 \to df$ becomes infinitesimal), the harmonic sum becomes an integral — the Fourier Transform:

$$X(f) = \int_{-\infty}^{\infty} x(t)\, e^{-j 2\pi f t}\, dt, \qquad x(t) = \int_{-\infty}^{\infty} X(f)\, e^{j 2\pi f t}\, df$$

Both time and frequency are now continuous. Infinite support in time implies a continuous frequency spectrum. The spectrum is generally complex-valued; $|X(f)|$ is the amplitude spectrum and $\angle X(f)$ is the phase spectrum.

1.1.3 Discrete-Time Fourier Transform (DTFT)

Practical digital systems work with sequences $x[n]$ obtained by sampling $x(t)$ at interval $T_s$ (rate $f_s = 1/T_s$). The DTFT maps a discrete-time sequence to a continuous, periodic frequency function:

$$X\!\left(e^{j\omega}\right) = \sum_{n=-\infty}^{\infty} x[n]\, e^{-j\omega n}, \qquad \omega \in [-\pi, \pi]$$

The DTFT is $2\pi$-periodic in $\omega$ (equivalently, $f_s$-periodic in Hz). It is still a continuous function of $\omega$ and requires an infinite sequence — not computable directly on a machine.

1.1.4 Arriving at the DFT

If we additionally limit the sequence to $N$ samples and evaluate the DTFT at $N$ equally-spaced frequencies $\omega_k = 2\pi k / N$ for $k = 0, 1, \ldots, N-1$, we obtain the Discrete Fourier Transform:

$$\boxed{X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j 2\pi kn/N}, \qquad k = 0, 1, \ldots, N-1}$$

Both the time-domain sequence $x[n]$ and the frequency-domain sequence $X[k]$ are now finite and discrete — perfectly suited for digital computation. The DFT implicitly assumes both sequences are periodic with period $N$.

Key insight — the four Fourier transforms:
TransformTime domainFrequency domainComputable?
Fourier SeriesContinuous, periodicDiscrete, aperiodicAnalytically
CTFTContinuous, aperiodicContinuous, aperiodicAnalytically
DTFTDiscrete, aperiodicContinuous, periodicNo (infinite sum)
DFTDiscrete, periodic (N)Discrete, periodic (N)Yes — O(N²) or O(N log N)

1.2 DFT and IDFT — Formal Definitions

Let $x[n]$, $n = 0, 1, \ldots, N-1$, be a finite sequence of (possibly complex) numbers. Define the N-th root of unity twiddle factor:

$$W_N \triangleq e^{-j 2\pi / N}$$

so that $W_N^{kn} = e^{-j 2\pi kn/N}$. The DFT pair is then written compactly as:

$$\text{(DFT)}\quad X[k] = \sum_{n=0}^{N-1} x[n]\, W_N^{kn}, \qquad k = 0, 1, \ldots, N-1$$
$$\text{(IDFT)}\quad x[n] = \frac{1}{N}\sum_{k=0}^{N-1} X[k]\, W_N^{-kn}, \qquad n = 0, 1, \ldots, N-1$$

Physical interpretation of DFT bins:

Normalization convention: Different textbooks (and software libraries) use different normalization factors. The convention above (no $1/N$ in DFT, $1/N$ in IDFT) is the engineering standard. MATLAB uses this convention. NumPy uses the same convention by default (numpy.fft.fft/numpy.fft.ifft). Some mathematics texts place $1/\sqrt{N}$ on both transforms for a unitary operator. Always verify which convention applies before comparing results across sources.

1.3 Orthogonality of Complex Exponentials — Proof

The correctness of the IDFT formula — that it inverts the DFT exactly — rests entirely on the discrete orthogonality of the complex exponential basis functions. We now prove this fundamental identity.

Theorem (Discrete Orthogonality). For integers $k, m$ and any $N \geq 1$: $$\sum_{n=0}^{N-1} e^{j 2\pi kn/N} e^{-j 2\pi mn/N} = \sum_{n=0}^{N-1} e^{j 2\pi (k-m)n/N} = N \,\delta[k-m]_N$$ where $\delta[k-m]_N = 1$ if $(k-m)$ is a multiple of $N$, and $0$ otherwise.

Proof. Let $r = k - m$. Consider two cases:

Case 1: $r \equiv 0 \pmod{N}$ (i.e., $k=m$ mod $N$).

Then $e^{j 2\pi r n/N} = e^{0} = 1$ for all $n$. The sum equals $\sum_{n=0}^{N-1} 1 = N$.  ■

Case 2: $r \not\equiv 0 \pmod{N}$.

Let $\alpha = e^{j 2\pi r/N}$. Since $r$ is not a multiple of $N$, $\alpha \neq 1$. The sum is a finite geometric series:

$$S = \sum_{n=0}^{N-1} \alpha^n = \frac{1 - \alpha^N}{1 - \alpha}$$

Now $\alpha^N = e^{j 2\pi r N/N} = e^{j 2\pi r} = 1$ (since $r$ is an integer and Euler's formula gives $e^{j 2\pi r} = \cos(2\pi r) + j\sin(2\pi r) = 1$). Therefore the numerator $1 - \alpha^N = 1 - 1 = 0$, while the denominator $1 - \alpha \neq 0$. Hence $S = 0$.  ■

Consequence — IDFT inverts DFT. Start from the DFT:

$$X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j 2\pi kn/N}$$

Substitute into the IDFT candidate formula and exchange the order of summation:

$$\frac{1}{N}\sum_{k=0}^{N-1} X[k]\, e^{j 2\pi km/N} = \frac{1}{N}\sum_{k=0}^{N-1} \left[\sum_{n=0}^{N-1} x[n]\, e^{-j 2\pi kn/N}\right] e^{j 2\pi km/N}$$
$$= \frac{1}{N}\sum_{n=0}^{N-1} x[n] \underbrace{\sum_{k=0}^{N-1} e^{j 2\pi k(m-n)/N}}_{= N\,\delta[m-n]} = \sum_{n=0}^{N-1} x[n]\,\delta[m-n] = x[m]$$

The IDFT exactly recovers $x[n]$ for each $n = m$.  ■

Subcarrier orthogonality in OFDM. This is the same mathematical property that makes OFDM work. In OFDM, the subcarriers are the basis sequences $e^{j 2\pi kn/N}$ for $k = 0, \ldots, N-1$. Because they are mutually orthogonal over the DFT window of $N$ samples, each subcarrier can be demodulated independently at the receiver with zero inter-carrier interference (ICI) — provided the channel does not destroy the orthogonality (i.e., the cyclic prefix is long enough).

1.4 DFT as Matrix Multiplication

Writing the DFT in full for all $k$ simultaneously reveals that it is simply a linear transformation representable by a single DFT matrix $\mathbf{W} \in \mathbb{C}^{N \times N}$.

$$\mathbf{X} = \mathbf{W}\,\mathbf{x}$$

where the vectors are $\mathbf{x} = [x[0], x[1], \ldots, x[N-1]]^T$ and $\mathbf{X} = [X[0], X[1], \ldots, X[N-1]]^T$, and the $(k, n)$ entry of $\mathbf{W}$ (using 0-based indexing) is:

$$[\mathbf{W}]_{k,n} = W_N^{kn} = e^{-j 2\pi kn/N}, \qquad k, n = 0, 1, \ldots, N-1$$

For $N = 4$, the DFT matrix expands explicitly as (using $j = \sqrt{-1}$):

$$\mathbf{W}_4 = \begin{bmatrix} 1 & 1 & 1 & 1 \\ 1 & -j & -1 & j \\ 1 & -1 & 1 & -1 \\ 1 & j & -1 & -j \end{bmatrix}$$

Note $W_4^0 = 1$, $W_4^1 = e^{-j\pi/2} = -j$, $W_4^2 = e^{-j\pi} = -1$, $W_4^3 = e^{-j3\pi/2} = j$. Each row $k$ of $\mathbf{W}_N$ is a complex sinusoid at frequency $k/N$ cycles/sample. The DFT is the inner product of the input vector with each row — i.e., the projection of $x[n]$ onto each basis sinusoid.

Inverse DFT matrix:

$$\mathbf{x} = \mathbf{W}^{-1}\mathbf{X} = \frac{1}{N}\mathbf{W}^{H}\mathbf{X}$$

where $\mathbf{W}^H$ denotes the conjugate transpose. This follows directly from the orthogonality theorem: $\mathbf{W}^H \mathbf{W} = N \mathbf{I}$, so $\mathbf{W}^{-1} = \frac{1}{N}\mathbf{W}^H$.

Unitary form. Defining $\tilde{\mathbf{W}} = \mathbf{W}/\sqrt{N}$ gives a unitary matrix ($\tilde{\mathbf{W}}^H \tilde{\mathbf{W}} = \mathbf{I}$), so the normalized DFT is a rotation in $\mathbb{C}^N$ — it preserves all inner products and energy (Parseval's theorem). The $1/\sqrt{N}$ convention is preferred in theoretical analysis; the engineering convention places $1/N$ in the IDFT.

Computational cost of direct matrix multiplication: Each of the $N$ output bins requires $N$ complex multiplications and $N-1$ complex additions. Total: $O(N^2)$ complex multiplications. For $N = 1024$: roughly $10^6$ multiplications. For $N = 4096$: $\sim 1.7 \times 10^7$. This is why direct computation becomes prohibitive for large $N$, motivating the FFT.

1.5 The FFT Algorithm — Cooley-Tukey Radix-2 DIT

The Fast Fourier Transform (FFT) is not a different mathematical transform — it computes the exact same DFT, but exploits the special structure of the DFT matrix $\mathbf{W}$ to reduce the arithmetic to $O(N \log_2 N)$ operations. The most common variant is the Cooley-Tukey radix-2 decimation-in-time (DIT) algorithm, published in 1965 (though the core idea was known to Gauss in 1805).

1.5.1 Divide and Conquer — The Danielson-Lanczos Lemma

Assume $N = 2^m$ (a power of 2). Split the input sequence into two $N/2$-point sub-sequences — the even-indexed samples and the odd-indexed samples:

$$x_{\text{even}}[r] = x[2r], \quad x_{\text{odd}}[r] = x[2r+1], \qquad r = 0, 1, \ldots, \frac{N}{2}-1$$

The DFT can be rewritten as:

$$X[k] = \sum_{n=0}^{N-1} x[n]\,W_N^{kn} = \sum_{r=0}^{N/2-1} x[2r]\,W_N^{2rk} + \sum_{r=0}^{N/2-1} x[2r+1]\,W_N^{(2r+1)k}$$

Using $W_N^{2rk} = e^{-j 2\pi (2r)k/N} = e^{-j 2\pi rk/(N/2)} = W_{N/2}^{rk}$:

$$X[k] = \underbrace{\sum_{r=0}^{N/2-1} x_{\text{even}}[r]\,W_{N/2}^{rk}}_{E[k]} + W_N^k \underbrace{\sum_{r=0}^{N/2-1} x_{\text{odd}}[r]\,W_{N/2}^{rk}}_{O[k]}$$
$$\boxed{X[k] = E[k] + W_N^k \cdot O[k]}$$

where $E[k]$ and $O[k]$ are the $N/2$-point DFTs of the even and odd sub-sequences, respectively. The factor $W_N^k$ is called the twiddle factor.

1.5.2 The Butterfly and Periodicity

Since $E[k]$ and $O[k]$ are $N/2$-periodic, we have:

$$E\!\left[k + \tfrac{N}{2}\right] = E[k], \qquad O\!\left[k + \tfrac{N}{2}\right] = O[k]$$

and $W_N^{k+N/2} = W_N^k \cdot W_N^{N/2} = -W_N^k$. Therefore:

$$X[k] = E[k] + W_N^k \cdot O[k]$$ $$X\!\left[k + \tfrac{N}{2}\right] = E[k] - W_N^k \cdot O[k]$$

These two equations computed together for each $k = 0, \ldots, N/2 - 1$ form a butterfly operation: two inputs $(E[k], O[k])$, one complex multiplication by $W_N^k$, one addition, one subtraction — yielding two outputs.

Radix-2 DIT FFT Butterfly Diagram — N=8 (3 stages, 4 butterflies each)
Stage 1 2-pt DFTs Stage 2 4-pt DFTs Stage 3 8-pt DFT Input (bit-reversed) Output (natural order) x[0] x[4] x[2] x[6] x[1] x[5] x[3] x[7] + W₈⁰ + W₈⁰ + W₈⁰ + W₈⁰ + W₈⁰ + W₈² + W₈⁰ + W₈² + W₈⁰ + W₈¹ + W₈² + W₈³ X[0] X[1] X[2] X[3] X[4] X[5] X[6] X[7] Stage 1 Stage 2 Stage 3 Butterfly: a, b → (a+Wₙᵏ·b), (a−Wₙᵏ·b) │ W₈ᵏ = e⁻ʲ²2πk/8 │ N=8: 3×4 = 12 mults vs 64 direct
Input in bit-reversed order: [0,4,2,6,1,5,3,7]. Filled circles = inputs; open circles = outputs. W₈ᵏ = e^(−j2πk/8) is the twiddle factor. Stage 1: W⁸⁰=1 (no multiply). Stage 2: W⁸⁰=1, W⁸²=−j. Stage 3: W⁸⁰=1, W⁸¹=e^(−jπ/4), W⁸²=−j, W⁸³=e^(−j3π/4).

1.5.3 Bit-Reversal Permutation

The DIT FFT requires the input to be in bit-reversed order. For $N=8$, the natural indices $0{-}7$ in binary are $000, 001, 010, 011, 100, 101, 110, 111$. Reversing the 3-bit binary representations gives: $000, 100, 010, 110, 001, 101, 011, 111$ → indices $0, 4, 2, 6, 1, 5, 3, 7$. The butterfly structure then processes interleaved sub-sequences at each stage.

1.5.4 Complexity Analysis

$N$ (FFT size) Direct DFT ($N^2$ mults) FFT ($\frac{N}{2}\log_2 N$ mults) Speedup factor
644,09619221×
25665,5361,02464×
1,0241,048,5765,120205×
4,09616,777,21624,576683×
65,536$4.3 \times 10^9$524,2888,192×
$$\text{Speedup} \approx \frac{N^2}{\frac{N}{2}\log_2 N} = \frac{2N}{\log_2 N}$$

The speedup grows without bound as $N$ increases. For 5G NR OFDM with $N = 4096$ (FR1 maximum), the FFT is $\sim 683\times$ faster than direct computation — making real-time OFDM processing practical on embedded hardware.

1.5.5 Why $N = 2^m$ is Preferred

The radix-2 Cooley-Tukey algorithm applies only when $N$ is a power of 2, enabling $\log_2 N$ recursion levels each with $N/2$ butterflies. For other $N$:

1.6 IFFT — The Inverse FFT and Its Role in OFDM

The IDFT is:

$$x[n] = \frac{1}{N}\sum_{k=0}^{N-1} X[k]\, e^{j 2\pi kn/N}$$

Observe that this is the same computational structure as the DFT, but with the twiddle factors conjugated ($e^{+j\ldots}$ vs $e^{-j\ldots}$) and with a $1/N$ scaling. The IFFT is therefore computed by:

  1. Conjugate the input: $X^*[k]$
  2. Apply the standard FFT to get $Y[n] = \text{FFT}(X^*[k])$
  3. Conjugate the output and scale: $x[n] = Y^*[n] / N$

Alternatively, IFFT = complex-conjugate the twiddle factors in the FFT butterfly and divide by $N$ at the end. Most hardware and software implement the FFT and IFFT with a shared butterfly core, differing only in the sign of the twiddle factor angle and the scaling.

IFFT as the OFDM modulator. In an OFDM transmitter, frequency-domain symbols $X[k]$ (e.g., 16-QAM constellation points mapped to subcarrier $k$) are the input to the IFFT. The output $x[n]$ is the time-domain OFDM symbol: $$x[n] = \frac{1}{N}\sum_{k=0}^{N-1} X[k]\, e^{j 2\pi kn/N}$$ This is a superposition of $N$ complex sinusoids — one per subcarrier — all generated simultaneously by a single $N$-point IFFT. At the receiver, the FFT separates all subcarriers at once.

1.7 Key Properties of the DFT / FFT

All properties below are stated for the $N$-point DFT and use the notation $x[n] \xleftrightarrow{\text{DFT}} X[k]$. Index arithmetic is modulo $N$.

Property 1 — Linearity

$$a\,x_1[n] + b\,x_2[n] \;\xleftrightarrow{\text{DFT}}\; a\,X_1[k] + b\,X_2[k]$$

The DFT is a linear operator. This follows immediately from the matrix form $\mathbf{W}(a\mathbf{x}_1 + b\mathbf{x}_2) = a\mathbf{W}\mathbf{x}_1 + b\mathbf{W}\mathbf{x}_2$. Consequence: OFDM subcarriers are modulated and demodulated independently using superposition.

Property 2 — Time-Shift $\Rightarrow$ Phase Rotation in Frequency

$$x\!\left[\langle n - n_0 \rangle_N\right] \;\xleftrightarrow{\text{DFT}}\; e^{-j 2\pi k n_0/N}\,X[k]$$

A circular shift of $n_0$ samples in time multiplies every DFT bin by a linear phase ramp $e^{-j 2\pi k n_0/N}$. The magnitude spectrum $|X[k]|$ is unchanged — only the phase rotates. In OFDM, the cyclic prefix exploits this property: a channel delay $\tau$ introduces a phase shift $e^{-j 2\pi k \tau/N}$ per subcarrier, which is corrected by a per-subcarrier complex division (one-tap equalizer) after the FFT at the receiver.

Property 3 — Frequency-Shift $\Rightarrow$ Phase Rotation in Time (Modulation)

$$e^{j 2\pi k_0 n/N}\,x[n] \;\xleftrightarrow{\text{DFT}}\; X\!\left[\langle k - k_0 \rangle_N\right]$$

Multiplying by a complex exponential in time circularly shifts the spectrum. This is the discrete-time analog of the AM modulation theorem and is used in frequency-domain equalization and carrier offset estimation.

Property 4 — Circular Convolution Theorem (most important for OFDM)

$$x_1[n] \circledast x_2[n] \;\xleftrightarrow{\text{DFT}}\; X_1[k] \cdot X_2[k]$$

where $\circledast$ denotes circular (cyclic) convolution of length $N$:

$$(x_1 \circledast x_2)[n] = \sum_{m=0}^{N-1} x_1[m]\, x_2\!\left[\langle n-m \rangle_N\right]$$

Key consequence for OFDM: A multipath channel of length $L$ (impulse response $h[n]$) performs linear convolution on the transmitted signal. By appending a cyclic prefix of length $\geq L-1$ to each OFDM symbol, the linear convolution becomes effectively circular over the FFT window. The FFT at the receiver then converts this circular convolution to a pointwise multiplication: $Y[k] = H[k] \cdot X[k]$. Channel equalization reduces to $N$ independent scalar divisions — one per subcarrier — instead of matrix inversion. This is the core computational advantage of OFDM in frequency-selective channels.

Property 5 — Parseval's Theorem (Energy Conservation)

$$\sum_{n=0}^{N-1} |x[n]|^2 = \frac{1}{N} \sum_{k=0}^{N-1} |X[k]|^2$$

The total energy of the time-domain signal equals the total energy of the frequency-domain signal divided by $N$ (due to the $1/N$ convention). In the unitary DFT convention, the $1/N$ factor is absent and energy is exactly preserved: $\|\tilde{\mathbf{W}}\mathbf{x}\|^2 = \|\mathbf{x}\|^2$. Parseval's theorem is used in OFDM PAPR (Peak-to-Average Power Ratio) analysis and signal-to-noise ratio calculations.

Property 6 — Conjugate Symmetry (Real Inputs)

If $x[n]$ is real-valued, then:

$$X[N-k] = X^*[k], \qquad k = 1, 2, \ldots, N/2 - 1$$

The DFT is Hermitian symmetric: bins $k$ and $N-k$ are complex conjugates. Only the first $N/2 + 1$ bins are independent. This is exploited in the real FFT (rfft) which computes only bins $0$ through $N/2$ in roughly half the operations of a complex FFT. In 5G NR downlink, the OFDM waveform is real-valued after RF upconversion (mixing with carrier), though the baseband model uses complex I+Q representation where this symmetry does not apply.

Property 7 — Circular Shift in Frequency → Multiplication in Time

$$X\!\left[\langle k - k_0 \rangle_N\right] \;\xleftrightarrow{\text{IDFT}}\; e^{j 2\pi k_0 n/N}\,x[n]$$
Summary of DFT Properties
PropertyTime domainFrequency domain
Linearity$ax_1[n]+bx_2[n]$$aX_1[k]+bX_2[k]$
Time shift$x[\langle n-n_0\rangle_N]$$e^{-j2\pi kn_0/N}X[k]$
Freq shift$e^{j2\pi k_0 n/N}x[n]$$X[\langle k-k_0\rangle_N]$
Circ. conv.$x_1\circledast x_2$$X_1[k]\cdot X_2[k]$
Parseval$\sum|x[n]|^2$$\frac{1}{N}\sum|X[k]|^2$
Conjugate sym.$x[n]$ real$X[N-k]=X^*[k]$
Time reversal$x[\langle{-n}\rangle_N]$$X[\langle{-k}\rangle_N]$

1.8 OFDM Connection — How IFFT Creates and FFT Demodulates Subcarriers

OFDM (Orthogonal Frequency-Division Multiplexing) is a direct engineering application of the IDFT/DFT pair. The insight is elegant: the IDFT formula for OFDM modulation and the DFT formula for demodulation are computationally identical to the IFFT/FFT algorithms already described.

1.8.1 OFDM Transmitter (IFFT-based)

Given $N$ complex data symbols $\{X[k]\}_{k=0}^{N-1}$ (one per subcarrier, e.g., QAM points), the continuous-time OFDM signal over one symbol period $T = N \cdot T_s$ would ideally be:

$$s(t) = \sum_{k=0}^{N-1} X[k]\, e^{j 2\pi k \Delta f \, t}, \qquad 0 \leq t < T$$

where $\Delta f = 1/T$ is the subcarrier spacing (in Hz). At the $n$-th sample time $t = nT_s = n/(N\Delta f)$:

$$s[n] = s(nT_s) = \sum_{k=0}^{N-1} X[k]\, e^{j 2\pi k n / N} = N \cdot \text{IDFT}\{X[k]\}$$

This is exactly the IDFT (up to the $1/N$ normalization). The IFFT simultaneously generates all $N$ subcarrier sinusoids and sums them — creating the composite time-domain waveform in a single $O(N \log N)$ operation.

1.8.2 Cyclic Prefix Insertion

After the IFFT, the last $N_{CP}$ samples of $s[n]$ are prepended to the symbol:

$$\tilde{s}[n] = s[n + N - N_{CP}], \quad n = 0, \ldots, N_{CP}-1 \quad \text{(cyclic prefix)}$$

followed by the original $N$ samples. Total transmitted length: $N + N_{CP}$. This converts the linear channel convolution into circular convolution (as explained in §1.7, Property 4), enabling simple one-tap frequency-domain equalization.

1.8.3 OFDM Receiver (FFT-based)

After removing the cyclic prefix, the received signal (assuming AWGN channel with frequency-selective fading $H[k]$) in the frequency domain is:

$$Y[k] = \text{FFT}\{y[n]\} = H[k] \cdot X[k] + Z[k]$$

where $H[k] = \text{FFT}\{h[n]\}$ is the channel frequency response at subcarrier $k$ and $Z[k]$ is the additive noise. The FFT at the receiver demodulates all $N$ subcarriers simultaneously in $O(N \log N)$ operations.

Equalization then recovers the data symbol on each subcarrier independently:

$$\hat{X}[k] = \frac{Y[k]}{H[k]} = X[k] + \frac{Z[k]}{H[k]}$$
Why OFDM uses IFFT/FFT — the key insight.
  • Without OFDM: a wideband channel with $L$ taps requires a complex equalizer with $\sim L$ taps (up to hundreds of coefficients in mobile channels) operating at the full sample rate.
  • With OFDM + FFT: the FFT converts the wideband problem into $N$ independent narrowband problems. Each subcarrier sees a flat fading channel (single complex coefficient). Equalization costs $N$ complex divisions — independent of $L$.
  • The FFT converts $O(N \cdot L)$ equalization complexity to $O(N \log N + N)$.
  • For 5G NR with $N = 4096$ subcarriers and channel delay spread $L \approx 100$ taps, this is a $\sim 28\times$ reduction in computational complexity, on top of the modulation efficiency from packing $N$ symbols per FFT window.

1.9 Numerical Example — $N = 8$, Rectangular Pulse

Let $x[n] = 1$ for $n = 0, 1, 2, 3$ and $x[n] = 0$ for $n = 4, 5, 6, 7$ (a length-4 rectangular window in an $N = 8$ DFT). Compute $X[k]$ analytically.

$$X[k] = \sum_{n=0}^{3} e^{-j 2\pi kn/8} = \sum_{n=0}^{3} \left(e^{-j\pi k/4}\right)^n$$

This is a geometric series with ratio $\alpha = e^{-j\pi k/4}$:

$$X[k] = \frac{1 - e^{-j\pi k}}{1 - e^{-j\pi k/4}} = e^{-j3\pi k/8} \cdot \frac{\sin\!\left(\frac{4\pi k}{8}\right)}{\sin\!\left(\frac{\pi k}{8}\right)} \quad (k \neq 0)$$

using the identity $\sum_{n=0}^{M-1} \alpha^n = e^{j(M-1)\phi/2} \cdot \sin(M\phi/2)/\sin(\phi/2)$ where $\phi = -\pi k/4$.

$N=8$, $x[n]=\{1,1,1,1,0,0,0,0\}$ — DFT values
$k$$X[k]$ (exact)$|X[k]|$$\angle X[k]$ (rad)
0$4 + 0j$4.0000
1$1 - 2.414j$2.613$-1.176$
2$0 - 1j$1.000$-\pi/2$
3$1 - 0.414j$1.082$-0.392$
4$0 + 0j$0.000
5$1 + 0.414j$1.082$+0.392$
6$0 + 1j$1.000$+\pi/2$
7$1 + 2.414j$2.613$+1.176$

Observations: (1) $X[0] = 4$ is the sum of all samples (DC component). (2) $X[4] = 0$ exactly — the Nyquist bin is zero because the rectangular pulse has a null at $f = f_s/2$. (3) The spectrum is Hermitian: $X[k] = X^*[N-k]$ since $x[n]$ is real. (4) The $|X[k]|$ envelope follows a Dirichlet kernel (discrete sinc) shape.

1.10 Interactive DFT Magnitude Spectrum

The chart below shows the DFT of $x[n] = \cos\!\left(\frac{2\pi \cdot 3 \cdot n}{N}\right) + 0.5\cos\!\left(\frac{2\pi \cdot 1 \cdot n}{N}\right)$ for $N = 16$ samples. Analytically, we expect spectral peaks at bins $k = 1, 3$ (positive frequencies) and $k = 13, 15$ (corresponding negative frequencies), with magnitudes $N/2 = 8$ for the $k=3$ tone and $N/4 = 4$ for the $k=1$ tone.

Figure 1.1 — DFT Magnitude Spectrum: $x[n] = \cos(2\pi\cdot3n/16) + 0.5\cos(2\pi\cdot1n/16)$, $N=16$
Left panel: time-domain signal $x[n]$ (16 samples). Right panel: DFT magnitude $|X[k]|$ for $k = 0, \ldots, 15$. Peaks at $k=1$ (magnitude 4.0) and $k=3$ (magnitude 8.0) confirm the two cosine components. Mirror peaks at $k=13$ and $k=15$ are the negative-frequency counterparts (Hermitian symmetry for real input).

1.11 Spectral Leakage, the Dirichlet Kernel, and Windowing

The DFT implicitly assumes the $N$-sample block is one period of a periodic signal. When the signal is not an integer number of cycles within the DFT window (i.e., the frequency does not fall exactly on a bin), energy "leaks" into adjacent bins — a phenomenon called spectral leakage.

The mathematical origin: the DFT of a finite-length signal is equivalent to the DTFT of the signal multiplied by a rectangular window $w[n] = 1$ for $0 \leq n \leq N-1$. In frequency, this multiplication becomes circular convolution of the ideal spectrum with the DFT of the rectangle — the Dirichlet kernel:

$$D_N(\omega) = \sum_{n=0}^{N-1} e^{-j\omega n} = e^{-j\omega(N-1)/2} \cdot \frac{\sin(N\omega/2)}{\sin(\omega/2)}$$

$|D_N|$ has a main lobe of width $4\pi/N$ and sidelobes at approximately $-13\,\text{dB}$ below the main lobe peak. Spectral leakage is the convolution of any sharp spectral feature with these sidelobes.

Windowing applies a time-domain taper $w[n]$ to the signal before the DFT, trading main-lobe width (frequency resolution) for reduced sidelobe level (reduced leakage):

WindowMain lobe widthPeak sidelobe (dB)Use case
Rectangular$4\pi/N$$-13$Best resolution, worst leakage
Hann (Hanning)$8\pi/N$$-31.5$General-purpose audio/spectral
Hamming$8\pi/N$$-42.7$Filter design
Blackman$12\pi/N$$-58.1$Low leakage needed
Kaiser ($\beta=8$)Variable$-69$Flexible tradeoff
OFDM and windowing: Standard OFDM (LTE, NR) does NOT apply a window before the FFT at the receiver. The cyclic prefix already ensures integer-cycle alignment for each subcarrier within the FFT window (assuming perfect synchronization), so there is no leakage between subcarriers — the orthogonality holds exactly. Windowing is applied only in some OFDM variants (e.g., W-OFDM, f-OFDM) to reduce out-of-band emissions at the transmitter.

1.12 Zero-Padding and DFT Interpolation

Appending $M$ zeros to $x[n]$ before computing an $(N+M)$-point DFT does not add new information but produces a denser sampling of the DTFT — effectively interpolating the frequency spectrum:

$$X_{\text{zp}}[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j 2\pi kn/(N+M)}, \quad k = 0, \ldots, N+M-1$$

The frequency resolution (bin spacing) becomes $f_s/(N+M)$ instead of $f_s/N$. Zero-padding is used to: (1) make $N$ a power of 2 for the FFT, (2) improve visual interpolation of spectra, (3) compute circular cross-correlation via the overlap-add method for linear convolution.

Zero-padding in OFDM resource mapping: In 5G NR, the IFFT size is larger than the number of active subcarriers. For example, with a 50 MHz bandwidth (270 active subcarriers for 30 kHz SCS) and $N_{FFT} = 1024$, the remaining $1024 - 270 = 754$ frequency bins are set to zero. This zero-padding in the frequency domain is equivalent to spectral upsampling by the ratio $N_{FFT}/N_{active}$, producing a higher sample-rate time-domain waveform — exactly what is needed to match the DAC sample rate while keeping the subcarrier spacing fixed.
Study Questions — §1 Mathematical Foundations
  1. DFT inversion proof. Starting only from the DFT definition $X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j2\pi kn/N}$ and the discrete orthogonality property, prove that the IDFT formula $x[m] = \frac{1}{N}\sum_{k=0}^{N-1} X[k]\, e^{j2\pi km/N}$ recovers every sample $x[m]$ exactly. Identify the step where the geometric series formula is applied, and explain why $r \not\equiv 0 \pmod{N}$ is required for the sum to vanish.
  2. FFT complexity and OFDM feasibility. A 5G NR base station processes $K = 100$ OFDM symbols per 1 ms subframe at $N_{FFT} = 4096$. (a) Compute the total number of complex multiplications required per subframe using (i) direct DFT and (ii) radix-2 FFT ($\frac{N}{2}\log_2 N$ multiplications per transform). (b) If each complex multiplication takes 10 ns on a DSP, estimate the processing time for each approach. (c) Explain why the cyclic prefix — even though it wastes bandwidth — is nevertheless essential for the one-tap equalizer derivation based on the circular convolution theorem.
  3. Subcarrier orthogonality and ICI. In OFDM, a carrier frequency offset (CFO) of $\epsilon$ (in units of subcarrier spacing $\Delta f$) causes the subcarrier spacing $\Delta f$ to shift to $(1+\epsilon)\Delta f$ for subcarrier $k$. Using the discrete orthogonality sum, show that for $\epsilon \neq 0$ the inner product between subcarrier $k$ and subcarrier $m \neq k$ over the DFT window is no longer zero — i.e., inter-carrier interference (ICI) is introduced. Sketch how the ICI power depends on $\epsilon$ for small offsets and state the consequence for OFDM synchronization requirements.
§2

CP-OFDM Signal Model & Spectrum

2.1  CP-OFDM Baseband Signal Model

The continuous-time CP-OFDM baseband signal for a single OFDM symbol is the superposition of N complex exponentials, one per subcarrier:

$$s(t) = \sum_{k=0}^{N-1} S_k \, e^{\,j 2\pi k \,\Delta f \, t}, \qquad 0 \le t < T_u$$
SymbolMeaningTypical value (5G NR, μ=1)
$$N$$Number of active subcarriersup to 3276 (FR1, 100 MHz)
$$S_k \in \mathbb{C}$$QAM symbol on subcarrier kQPSK / 16-QAM / 256-QAM
$$\Delta f$$Subcarrier spacing (SCS)30 kHz
$$T_u = 1/\Delta f$$Useful OFDM symbol duration33.33 µs
Key idea: Each subcarrier k is a single tone at frequency \(f_k = k \,\Delta f\) relative to the carrier. The bandwidth of the entire waveform is \(B \approx N \,\Delta f\) (ignoring guard bands). All tones are summed in the time domain before transmission — this is precisely what the IFFT computes.

2.2  Multicarrier Structure — N Parallel Narrow-Band Channels

OFDM is a multicarrier modulation scheme. Rather than transmitting one symbol per use of the channel at the full bandwidth B (as in single-carrier BPSK/QAM), OFDM divides B into N narrow sub-bands, each of width ≈ Δf, and transmits one QAM symbol per sub-band simultaneously:

One OFDM symbol (duration T_u) carries N QAM symbols simultaneously

Frequency →
┌────────┬────────┬────────┬────────┬────────┬────────┬────────┬────────┐
│ S₀ │ S₁ │ S₂ │ S₃ │ S₄ │ S₅ │ S₆ │ S₇ │ ← N=8 subcarriers
│ f=0 │ f=Δf │ f=2Δf │ f=3Δf │ f=4Δf │ f=5Δf │ f=6Δf │ f=7Δf │
└────────┴────────┴────────┴────────┴────────┴────────┴────────┴────────┘
←————————————————— bandwidth B = N·Δf ——————————————————→

Each sub-band sees an effectively flat channel (if Δf ≪ coherence BW)
→ single complex multiply suffices for equalization per subcarrier

The serial-to-parallel (S/P) converter at the transmitter maps a stream of QAM symbols onto the N frequency-domain inputs of the IFFT. After the IFFT and DAC, all N tones are present in the transmitted signal simultaneously — the channel sees a wideband signal, but the DSP treats it as N independent narrowband channels.

Narrowband approximation: A multipath channel with maximum excess delay \(\tau_{\max}\) has coherence bandwidth \(B_c \approx 1/(5\tau_{\max})\). OFDM works well when \(\Delta f \ll B_c\), i.e., each subcarrier experiences flat fading. In 5G NR with \(\Delta f = 15\,\text{kHz}\) and urban \(\tau_{\max} \approx 4\,\mu\text{s}\), \(B_c \approx 50\,\text{kHz} \gg \Delta f\) — the condition is well satisfied.

2.3  Orthogonality Proof

The fundamental reason CP-OFDM works without inter-subcarrier interference (ICI) is that the set of complex exponentials \(\{\phi_k(t) = e^{j2\pi k \Delta f t}\}\) forms an orthogonal basis over the interval \([0, T_u]\).

$$\langle \phi_m, \phi_n \rangle \;=\; \int_0^{T_u} e^{\,j2\pi m \Delta f t} \cdot \left(e^{\,j2\pi n \Delta f t}\right)^* \mathrm{d}t \;=\; \int_0^{T_u} e^{\,j2\pi (m-n) \Delta f t} \,\mathrm{d}t$$

Case 1: m = n

$$\int_0^{T_u} e^{\,j \cdot 0} \,\mathrm{d}t = \int_0^{T_u} 1 \,\mathrm{d}t = T_u$$

Case 2: m ≠ n  (let \(p = m - n \in \mathbb{Z} \setminus \{0\}\))

$$\int_0^{T_u} e^{\,j2\pi p \Delta f t} \,\mathrm{d}t = \left[\frac{e^{\,j2\pi p \Delta f t}}{j2\pi p \Delta f}\right]_0^{T_u} = \frac{e^{\,j2\pi p \Delta f T_u} - 1}{j2\pi p \Delta f}$$

Substituting \(\Delta f \cdot T_u = 1\):

$$= \frac{e^{\,j2\pi p} - 1}{j2\pi p \Delta f} = \frac{1 - 1}{j2\pi p \Delta f} = 0$$

Combining both cases with the Kronecker delta \(\delta_{mn}\):

$$\boxed{\int_0^{T_u} e^{\,j2\pi m \Delta f t} \cdot \left(e^{\,j2\pi n \Delta f t}\right)^* \mathrm{d}t = T_u \,\delta_{mn}}$$
Physical interpretation: The subcarrier spacing condition \(\Delta f = 1/T_u\) guarantees exactly an integer number of cycles difference between any two distinct subcarriers over the integration window \(T_u\). Exactly one complete extra cycle → zero integral → zero ICI. Violation of this condition (e.g., due to carrier frequency offset) breaks orthogonality and causes ICI proportional to the offset.
Consequence for demodulation: To recover symbol \(S_m\), correlate \(r(t)\) against \(\phi_m^*(t) = e^{-j2\pi m \Delta f t}\): $$\hat{S}_m = \frac{1}{T_u}\int_0^{T_u} r(t)\,e^{-j2\pi m \Delta f t}\,\mathrm{d}t = \frac{1}{T_u}\sum_{k=0}^{N-1} S_k \underbrace{\int_0^{T_u} e^{j2\pi(k-m)\Delta f t}\,\mathrm{d}t}_{T_u\,\delta_{km}} = S_m$$ Zero contribution from all \(k \neq m\) — no ICI.

2.4  Subcarrier Spacing & Symbol Duration — 5G NR Numerologies

The fundamental constraint is the time-frequency uncertainty relation for OFDM:

$$\Delta f \cdot T_u = 1$$

5G NR defines a set of numerologies indexed by \(\mu \in \{0,1,2,3,4\}\), where \(\Delta f = 2^\mu \times 15\,\text{kHz}\). Higher \(\mu\) gives larger spacing (shorter symbols), trading time resolution for Doppler robustness.

Numerology μ \(\Delta f\) (kHz) \(T_u\) (µs) CP (normal, µs) \(T_s = T_u + T_{CP}\) (µs) Use case
01566.674.6971.35 FR1 sub-6 GHz, eMBB
13033.332.3435.68 FR1 sub-6 GHz, eMBB/URLLC
26016.671.1717.84 FR1 (no SRS) / FR2 mmWave
31208.330.598.92 FR2 mmWave
42404.170.294.46 FR2 reference signal only
Note on μ=2 in FR1: 60 kHz SCS is not supported for PDSCH/PUSCH in FR1 in most deployments (only for reference signals). For 5G NR FR1 data channels, only μ=0 and μ=1 are used.
5G NR Numerologies: Subcarrier Spacing & Symbol Duration
Bar chart of \(\Delta f\) (kHz, left axis) and \(T_u\) (µs, right axis) for \(\mu = 0\ldots4\). Note the inverse relationship \(\Delta f \cdot T_u = 1\).

2.5  OFDM Symbol with Cyclic Prefix (CP)

After the IFFT produces N time-domain samples \(\{s_0, s_1, \ldots, s_{N-1}\}\), a cyclic prefix of length \(N_{CP}\) samples is prepended:

IFFT output: [ s₀ s₁ s₂ … s_{N-N_CP-1} | s_{N-N_CP} … s_{N-1} ]
                                                             ↑ last N_CP samples

CP symbol:    [ s_{N-N_CP} … s_{N-1} | s₀ s₁ s₂ … s_{N-1} ]
                 ←——— CP ———→ ←——————— useful part ——————→
                   N_CP samples            N samples
                 ←————————— total: N + N_CP samples ——————————→

The total symbol duration is:

$$T_s = T_u + T_{CP} = \frac{N + N_{CP}}{f_s}$$

where \(f_s = N \cdot \Delta f\) is the sampling rate.

Why CP converts linear convolution to circular convolution:

Let the channel impulse response be \(h[n]\), length \(L_{h} \le N_{CP} + 1\). The received signal (ignoring noise) is the linear convolution:
  \(r[n] = (h * x)[n] = \sum_{l=0}^{L_h-1} h[l]\,x[n-l]\)

At the receiver, after removing the CP (discarding the first \(N_{CP}\) samples), the remaining \(N\) samples satisfy:
  \(r[n] = \sum_{l=0}^{L_h-1} h[l]\,x[(n-l) \bmod N]\) for \(n = 0, \ldots, N-1\)

This is the circular convolution \(r = h \circledast x\) of length \(N\). Taking the DFT of both sides:
  \(R[k] = H[k] \cdot S[k]\)

The channel acts as a scalar multiplier per subcarrier — one complex divide for equalization. This is the entire reason OFDM is efficient in multipath channels.
CP length constraint: The CP must be longer than the channel's maximum excess delay: \(T_{CP} > \tau_{\max}\). If violated, the circular convolution property breaks and ISI from the previous symbol "leaks" into the useful window, causing both ISI and ICI simultaneously. In 5G NR normal CP at \(\mu=0\): \(T_{CP} = 4.69\,\mu\text{s}\), covering up to \(\sim\!2\,\text{km}\) round-trip delay spread.

2.6  Discrete-Time CP-OFDM — the IFFT Implementation

In practice, the continuous-time model is implemented entirely in discrete time. The N-point IFFT of the frequency-domain symbols \(\{S_0, \ldots, S_{N-1}\}\) yields the time-domain samples:

$$s_n = \frac{1}{N}\sum_{k=0}^{N-1} S_k \, e^{\,j2\pi kn/N}, \qquad n = 0, 1, \ldots, N-1$$

This is the N-point IDFT. The factor \(1/N\) is an implementation convention (some implementations absorb it into the FFT scaling). Comparing to the continuous-time model with \(t = n T_s / N = n / f_s\):

$$e^{j2\pi k \Delta f \cdot (n/f_s)} = e^{j2\pi k \Delta f n / (N\Delta f)} = e^{j2\pi kn/N} \quad \checkmark$$

The discrete and continuous models are identical at the sample points.

Transmitter block diagram (discrete time):

QAM bits → [QAM Mapper] → S[0..N-1]
                       ↓
                   [S/P Converter] → N parallel symbols
                       ↓
             [N-point IFFT] → N time samples {s_n}
                       ↓
          [Add CP: prepend last N_CP samples]
                       ↓
                   [P/S Converter] → serial stream (N + N_CP samples)
                       ↓
                [DAC + RF Upconverter] → transmitted signal x(t)

Receiver (reverse path):
RF → ADC → Remove CP (discard first N_CP samples) → N-point FFT → divide by H[k] → QAM demapper → bits

The complexity advantage over a bank of N matched filters is enormous: \(\mathcal{O}(N \log_2 N)\) for the FFT versus \(\mathcal{O}(N^2)\) for direct DFT computation. For \(N = 4096\) (5G NR, 100 MHz FR1), this is a factor of \(\sim\!341\times\) reduction in multiply-accumulate operations.

2.7  CP-OFDM Spectrum & Spectral Efficiency

Each subcarrier k, viewed in isolation, is a finite-duration complex tone of duration \(T_u\). Its Fourier transform is a sinc function:

$$\Phi_k(f) = S_k \cdot T_u \cdot \operatorname{sinc}\!\left[(f - k\Delta f)\,T_u\right] \cdot e^{-j\pi(f - k\Delta f)T_u}$$

where \(\operatorname{sinc}(x) = \sin(\pi x)/(\pi x)\). The main lobe of subcarrier k is centred at \(f = k \Delta f\) with first nulls at \(f = k\Delta f \pm \Delta f\) — precisely at the centres of adjacent subcarriers. This is the geometric reason for orthogonality: each subcarrier's sinc passes through zero at all other subcarrier frequencies.

CP-OFDM Spectrum — 8 Subcarriers (Individual Sinc + Composite PSD)
Individual sinc-shaped spectra for 8 subcarriers (dashed, coloured) and the composite PSD (solid white). Each subcarrier's sinc passes through zero at all other subcarrier centres, confirming frequency-domain orthogonality. Composite PSD is relatively flat over the occupied bandwidth \(N \Delta f\).

Spectral Efficiency

The CP wastes \(N_{CP}/(N + N_{CP})\) of the time resource. The spectral efficiency (fraction of time used for useful data) is:

$$\eta_{CP} = \frac{N}{N + N_{CP}} = \frac{T_u}{T_u + T_{CP}}$$
Numerology μ \(N_{CP}/N\) (normal CP) \(\eta_{CP}\)
0144/2048 ≈ 7.0%93.4%
1144/2048 ≈ 7.0%93.4%
2144/4096 ≈ 3.5% (FR2)96.6%
3144/8192 ≈ 1.8% (FR2)98.3%
Spectral flatness: The composite PSD of CP-OFDM with uniform symbol power is approximately flat over \([0, N\Delta f]\) because the individual sinc side-lobes cancel statistically (they add incoherently for independent QAM symbols). In practice, windowing (raised-cosine transitions) is applied at the OFDM symbol boundaries to further reduce out-of-band emissions.

2.8  Resource Grid — Time-Frequency Structure

The transmitted signal is organized into a resource grid in the time-frequency plane:

$$\text{Resource Element (RE)} = 1\text{ subcarrier} \times 1\text{ OFDM symbol}$$
$$\text{Resource Block (RB)} = 12\text{ subcarriers} \times 14\text{ OFDM symbols (1 slot)}$$
Entity Time dimension Frequency dimension Size
Resource Element (RE) 1 OFDM symbol 1 subcarrier 1 complex QAM symbol
Resource Block (RB) 14 OFDM symbols (1 slot) 12 subcarriers 168 RE = 168 QAM symbols
Slot (normal CP, \(\mu=0\)) 0.5 ms (1 ms subframe / 2) 14 OFDM symbols
Subframe 1 ms \(2^\mu\) slots
Radio Frame 10 ms 10 subframes = \(10 \cdot 2^\mu\) slots
Max RBs (FR1 100 MHz, \(\mu=1\)) 273 RBs = 3276 subcarriers 546 MHz? No: \(273 \times 12 \times 30\,\text{kHz} = 98.28\,\text{MHz}\)
Slot duration scales with μ: A slot always contains 14 OFDM symbols (normal CP), but since symbol duration halves with each step of \(\mu\), the slot duration is \(0.5/2^{\mu-1}\,\text{ms}\) for \(\mu \ge 1\). At \(\mu=3\) (120 kHz), a slot is only \(0.0625\,\text{ms} = 62.5\,\mu\text{s}\) — enabling ultra-low latency URLLC scheduling.

2.9  CP-OFDM vs Plain OFDM — Why CP is Essential

Without the CP, the OFDM signal would suffer from two simultaneous impairments in a multipath channel:

Impairment Without CP With CP (length ≥ channel delay spread)
Inter-Symbol Interference (ISI) Last samples of symbol n corrupt first samples of symbol n+1 Delayed copies fall inside the CP; CP is discarded → no ISI
Inter-Carrier Interference (ICI) Orthogonality broken: linear conv ≠ circular conv → mixing between subcarriers Circular convolution property holds → \(R[k] = H[k]S[k]\) → no ICI
Equalization complexity Requires N×N matrix inversion or MLSE with exponential complexity Single complex divide per subcarrier: \(\hat{S}[k] = R[k]/H[k]\)
Channel estimation Must estimate full N×N channel matrix Estimate scalar \(H[k]\) at pilot subcarriers and interpolate
Single-tap frequency-domain equalization (FDE):

After FFT at the receiver:
  \(R[k] = H[k] \cdot S[k] + W[k]\),   where \(W[k] \sim \mathcal{CN}(0, \sigma^2)\)

Zero-forcing (ZF) equalizer:
  \(\hat{S}_{ZF}[k] = \dfrac{R[k]}{H[k]} = S[k] + \dfrac{W[k]}{H[k]}\)

MMSE equalizer:
  \(\hat{S}_{MMSE}[k] = \dfrac{H^*[k]}{|H[k]|^2 + \sigma^2/\sigma_s^2} \cdot R[k]\)

Both are \(\mathcal{O}(1)\) per subcarrier — the CP is what makes this possible.
CP overhead trade-off: A longer CP covers more delay spread but wastes more spectral efficiency. The 5G NR extended CP (available only at \(\mu=2\)) uses \(N_{CP} = 512\) samples instead of 144, increasing robustness for scenarios with \(\tau_{\max} > 4.17\,\mu\text{s}\) (e.g., rural macro, DAS, or HAPS scenarios) at the cost of \(\sim\!11\%\) overhead instead of \(\sim\!7\%\).
Study Questions — §2
  1. Orthogonality under CFO: Suppose there is a carrier frequency offset (CFO) of \(\varepsilon \cdot \Delta f\) (i.e., \(\varepsilon\) is a normalized CFO in units of subcarrier spacing). Derive the ICI power from subcarrier m onto subcarrier n as a function of \(\varepsilon\) and \(|m-n|\). At what value of \(\varepsilon\) does the ICI become comparable to the signal power? (Hint: redo the orthogonality integral with the additional phase offset \(e^{j2\pi\varepsilon\Delta f t}\) and evaluate at small \(\varepsilon\).)
  2. CP design for a given channel: You are designing a 5G NR deployment for an industrial hall with a measured RMS delay spread of \(\tau_{rms} = 300\,\text{ns}\) and maximum excess delay \(\tau_{\max} = 1.5\,\mu\text{s}\). Which numerology (\(\mu\)) satisfies the CP constraint \(T_{CP} > \tau_{\max}\)? What is the resulting spectral efficiency \(\eta_{CP}\)? Would \(\mu=1\) be feasible here?
  3. PAPR analysis: The CP-OFDM signal \(s(t) = \sum_{k=0}^{N-1} S_k e^{j2\pi k\Delta f t}\) has high Peak-to-Average Power Ratio (PAPR). Show that when all \(N\) subcarriers are co-phased (worst case), the peak power is \(N^2 \sigma_s^2\) while the average power is \(N\sigma_s^2\), yielding a maximum PAPR = \(N\) (i.e., \(10\log_{10}N\) dB). For \(N=1024\), what is this PAPR in dB? Why is PAPR a critical design constraint for the power amplifier at the transmitter?
3

Cyclic Prefix & Multipath Robustness

ISI/ICI suppression · circular convolution · single-tap equalization · CP overhead in 5G NR

3.1 Multipath Propagation Model

In a wireless channel the transmitted signal arrives at the receiver via multiple paths — reflections, diffractions and scattering off buildings, terrain and vehicles. Each path adds a scaled, delayed copy of the signal. The baseband-equivalent received signal is modelled as a tapped-delay line:

$$y(t) = \sum_{l=0}^{L-1} h_l \, s(t - \tau_l) + n(t)$$

where \(h_l \in \mathbb{C}\) is the complex gain of the \(l\)-th path, \(\tau_l\) is its propagation delay, \(L\) is the total number of resolvable paths, and \(n(t)\) is additive white Gaussian noise (AWGN). In the continuous-time domain the channel is fully described by its channel impulse response (CIR):

$$h(\tau) = \sum_{l=0}^{L-1} h_l \,\delta(\tau - \tau_l)$$

The key channel parameter for OFDM design is the maximum delay spread \(\tau_{\max} = \max_l \tau_l - \min_l \tau_l\). From it we derive the coherence bandwidth — the frequency range over which the channel response is approximately flat:

$$B_c \approx \frac{1}{5\,\tau_{\max}}$$
Insight: If the OFDM subcarrier spacing \(\Delta f \ll B_c\), each subcarrier experiences flat (frequency-non-selective) fading — a single complex scalar \(H[k]\). This is the fundamental reason OFDM handles frequency-selective channels so elegantly.

3.2 Inter-Symbol Interference (ISI) — Without CP

Consider two consecutive OFDM symbols, each of duration \(T_u = N/f_s\) seconds (where \(N\) is the FFT size and \(f_s\) is the sample rate). Without a guard interval, the \(m\)-th received sample of symbol \(i\) is:

$$r_i[n] = \sum_{l=0}^{L-1} h_l \cdot s\!\left[\bigl(n - \lfloor\tau_l f_s\rfloor\bigr) \bmod N_{\text{sym}}\right] + \text{tail of }s_{i-1}$$

If the channel length \(L > 1\) (i.e. \(\tau_{\max} > 0\)), delayed copies of the previous symbol \(s_{i-1}\) spill into the current symbol's observation window. This Inter-Symbol Interference destroys the orthogonality of the DFT basis and cannot be removed with a simple single-tap equalizer.

Without CP: The first \(L-1\) samples of every received OFDM symbol are contaminated by the tail of the previous symbol. For a 5 µs delay spread at 30.72 MHz sampling, that is \(\approx\) 154 corrupted samples — roughly 7.5 % of a 2048-point FFT.

3.3 Inter-Carrier Interference (ICI) — Linear vs. Circular Convolution

Even if ISI is ignored, linear convolution in the time domain breaks subcarrier orthogonality. The DFT of a linear convolution is not the pointwise product of the individual DFTs:

$$\mathrm{DFT}\{h * s\}_{\text{linear}} \neq H[k] \cdot S[k]$$

As a result, subcarrier \(k\) receives energy from all other subcarriers \(k' \neq k\) — this is ICI. Each received DFT bin becomes:

$$R[k] = \underbrace{H[k]\,S[k]}_{\text{desired}} \;+\; \underbrace{\sum_{k' \neq k} I_{kk'}\,S[k']}_{\text{ICI}} + N[k]$$

where \(I_{kk'}\) are inter-carrier leakage coefficients whose magnitude depends on how far the received window is misaligned with the circular structure of the channel.

Key insight: The DFT does diagonalise circular convolution: if \(r = h \circledast s\) (circular), then \(R[k] = H[k]\cdot S[k]\) exactly, with zero ICI. The cyclic prefix is the mechanism that enforces this circular structure.

3.4 CP Insertion and Removal — Proof of Circularity

The cyclic prefix is formed by copying the last \(N_{CP}\) samples of the IFFT output and prepending them. The transmitted block of length \(N + N_{CP}\) is:

$$\tilde{s} = \bigl[\underbrace{s[N-N_{CP}],\,\ldots,\,s[N-1]}_{\text{CP}},\; \underbrace{s[0],\,s[1],\,\ldots,\,s[N-1]}_{\text{payload}}\bigr]$$
CP Insertion / Removal (pseudocode)
Transmitter:
  s[0..N-1]  ← IFFT(S[0..N-1])           // N-point IFFT
  s_tx[0..N_CP-1]   ← s[N-N_CP .. N-1]   // copy last N_CP samples
  s_tx[N_CP..N+N_CP-1] ← s[0..N-1]       // append payload
  transmit s_tx  (length N + N_CP)

Receiver:
  r_rx[0..N+N_CP-1] ← received block
  discard r_rx[0..N_CP-1]                 // remove CP
  r[0..N-1] ← r_rx[N_CP..N+N_CP-1]       // N samples for DFT
  R[0..N-1] ← FFT(r[0..N-1])

Proof: CP Enforces Circular Convolution

Assume the channel has \(L\) taps: \(h[0], h[1], \ldots, h[L-1]\) and \(N_{CP} \geq L - 1\). After the CP is stripped, the \(n\)-th received sample is:

$$r[n] = \sum_{l=0}^{L-1} h[l]\,\tilde{s}[n + N_{CP} - l] + w[n], \quad n = 0,\ldots,N-1$$

For \(n - l \geq 0\): \(\tilde{s}[n + N_{CP} - l] = s[n - l]\). For \(n - l < 0\): the sample falls in the CP region, but because \(N_{CP} \geq L - 1\), it equals \(s[N + n - l]\) — exactly what circular indexing \(s[(n-l) \bmod N]\) gives. Therefore:

$$r[n] = \sum_{l=0}^{L-1} h[l]\,s\!\bigl[(n-l) \bmod N\bigr] + w[n] = (h \circledast s)[n] + w[n]$$

Taking the N-point DFT of both sides, and using the circular-convolution theorem:

$$\boxed{R[k] = H[k]\,S[k] + W[k], \qquad k = 0, 1, \ldots, N-1}$$

The channel is perfectly diagonalised: each subcarrier \(k\) sees only a single complex multiplication by \(H[k]\) — no ISI, no ICI.

Why "cyclic"? The CP makes the finite-length received block look as if it came from an infinite periodic signal passed through an LTI channel, so the DFT's inherent periodicity assumption is satisfied exactly.

3.5 Single-Tap Frequency-Domain Equalization

Because \(R[k] = H[k]\,S[k] + W[k]\), recovering \(S[k]\) requires only a scalar division (or multiplication by a weight) per subcarrier. Two common equalizer choices:

$$\text{Zero-Forcing (ZF):}\quad \hat{S}[k] = \frac{R[k]}{H[k]} = S[k] + \frac{W[k]}{H[k]}$$
$$\text{MMSE:}\quad \hat{S}[k] = \frac{H^*[k]}{|H[k]|^2 + \sigma_w^2/\sigma_s^2}\,R[k]$$

ZF perfectly cancels the channel but amplifies noise at deep fades (\(|H[k]| \approx 0\)). MMSE adds a noise-regularisation term \(\sigma_w^2/\sigma_s^2\) in the denominator, trading a small residual bias for reduced noise enhancement. MMSE reduces to ZF at high SNR.

Massive advantage of CP-OFDM: A frequency-selective channel that would require a length-\(L\) time-domain equalizer (tapped delay line, \(O(L^2)\) complexity per symbol) is reduced to \(N\) independent scalar operations — one per subcarrier. For \(N = 4096\) and \(L = 512\) this is a factor of ~256 reduction in equalizer complexity.

3.6 CP Overhead and 5G NR Numerology

The CP occupies \(N_{CP}\) out of every \(N + N_{CP}\) transmitted samples. The CP overhead fraction is:

$$\eta_{CP} = \frac{N_{CP}}{N + N_{CP}}$$

In 5G NR the FFT size and CP length scale with the subcarrier spacing \(\Delta f = 2^\mu \times 15\text{ kHz}\) (numerology index \(\mu\)). Normal CP values for the reference sample rate of 30.72 MHz (\(N = 2048\) at 15 kHz):

\(\mu\) \(\Delta f\) (kHz) \(N_u\) (FFT) \(N_{CP,\text{normal}}\) CP duration (\(\mu\)s) Overhead \(\eta_{CP}\)
0152048144 (first: 160)4.69 (5.21)~7.0 %
130102472 (first: 80)2.34 (2.60)~7.0 %
26051236 (first: 40)1.17 (1.30)~7.0 %
312025618 (first: 20)0.59 (0.65)~7.0 %
260512128 (Extended CP)4.17~20.0 %
5G NR normal CP overhead ≈ 7 % across all numerologies. The ratio \(N_{CP}/N\) is kept constant by design, so overhead stays fixed as \(\Delta f\) scales up. The first symbol in each half-subframe gets a slightly longer CP (160 vs 144 samples at \(\mu=0\)) to handle timing alignment and make the slot duration exactly \(0.5\) ms.

3.7 CP Length Design: Matching Delay Spread to Numerology

The fundamental constraint is \(T_{CP} \geq \tau_{\max}\). Choosing a larger \(\Delta f\) (higher numerology) reduces \(T_{CP}\) and requires a lower \(\tau_{\max}\) — fine for indoor / mmWave deployments but problematic for rural macro cells.

Deployment Typical \(\tau_{\max}\) Recommended \(\mu\) / SCS Note
Rural macro LTE/5G 3–5 µs \(\mu=0\), 15 kHz CP = 4.7 µs barely covers 5 µs; extended CP if needed
Urban macro 1–3 µs \(\mu=0\) or \(\mu=1\) Normal CP adequate; 30 kHz viable
Dense urban / indoor 0.1–0.5 µs \(\mu=2\), 60 kHz Short delay spread; lower latency with wider SCS
Indoor hotspot / FR2 mmWave < 0.1 µs \(\mu=3\), 120 kHz Very short propagation distances; 0.59 µs CP is ample
High-speed rail (Doppler) 0.5–2 µs \(\mu=1\) or \(\mu=2\) Doppler drives toward wider SCS; not delay spread
Design trap — rural with 30 kHz SCS: At \(\mu=1\), \(T_{CP} \approx 2.34\,\mu\text{s}\). A rural environment with \(\tau_{\max} = 3\,\mu\text{s}\) would have the CP shorter than the delay spread, reintroducing ISI. Always verify \(T_{CP} \geq \tau_{\max}\) before selecting numerology.

3.8 OFDM Channel Model: Per-Subcarrier Fading

With CP in place the channel is fully described by \(H[k]\) — the channel frequency response sampled at each subcarrier. In a Rayleigh fading environment, each \(H[k] \sim \mathcal{CN}(0,1)\) independently when subcarrier spacing \(\Delta f \gg B_c^{-1}\). In practice, adjacent subcarriers are correlated over a coherence bandwidth of roughly \(B_c \approx 1/(5\tau_{\max})\):

$$\mathbb{E}\bigl[H[k]\,H^*[k']\bigr] \approx e^{-j2\pi(k-k')\Delta f \cdot \tau_{\text{rms}}} \cdot R_H\!\bigl((k-k')\Delta f\bigr)$$

where \(R_H(f)\) is the channel's frequency correlation function and \(\tau_{\text{rms}}\) is the RMS delay spread. This frequency correlation is exploited by:

3.9 Guard Interval Alternatives: ZP-OFDM and Others

The CP is not the only guard interval strategy. Zero-Padding OFDM (ZP-OFDM) appends \(N_{ZP}\) zeros after the IFFT block instead of a cyclic prefix:

$$\tilde{s}_{ZP} = \bigl[\underbrace{s[0],\,\ldots,\,s[N-1]}_{\text{payload}},\; \underbrace{0,\,\ldots,\,0}_{N_{ZP}}\bigr]$$
Property CP-OFDM ZP-OFDM
Guard interval Cyclic copy (N_CP samples) Zero padding (N_ZP zeros)
TX power on guard Full power (CP carries signal) Zero — no TX power wasted
ISI suppression Complete if \(N_{CP} \geq L-1\) Complete if \(N_{ZP} \geq L-1\)
Circular convolution Yes — exact DFT diagonalisation No — requires overlap-add or special equalizer
Equalization complexity \(O(N)\) scalars \(O(N \cdot L)\) or iterative
Spectral efficiency \(N/(N+N_{CP})\) \(N/(N+N_{ZP})\) — same overhead
Used in LTE, 5G NR, Wi-Fi (802.11a/g/n/ac/ax) Some 802.11 modes, UWB, research
Why does CP-OFDM dominate? The power saving of ZP is negligible at the system level (7 % overhead → 0.3 dB TX power increase), while the equalization complexity advantage of CP is enormous. The single-tap per-subcarrier equalizer is trivially parallelisable on DSP/ASIC/FPGA, making CP-OFDM the universal choice for high-throughput wireless standards.
Study Questions — §3
  1. Q3.1 — CP length selection: A 5G NR deployment in a dense urban environment measures a maximum excess delay of \(\tau_{\max} = 2.1\,\mu\text{s}\) and an RMS delay spread of \(\tau_{\text{rms}} = 0.7\,\mu\text{s}\). (a) What is the minimum numerology \(\mu\) that guarantees no ISI with the normal CP? (b) If the operator uses \(\mu = 2\) (60 kHz SCS), what fraction of the CP is "wasted" (i.e., not needed to cover the delay spread)? (c) Calculate the coherence bandwidth and verify that a single 12-subcarrier PRB at \(\mu = 1\) is within it.
  2. Q3.2 — Circular convolution proof: Consider a 3-tap channel \(h = [0.8, 0.4j, -0.2]\) and an OFDM symbol with \(N = 8\) subcarriers and \(N_{CP} = 2\). The frequency-domain symbol is \(S[k] = e^{j\pi k/4}\) for \(k = 0,\ldots,7\). (a) Compute the IFFT output \(s[n]\), form the CP-padded transmission, apply the channel, strip the CP and take the FFT. (b) Verify that \(R[k] = H[k]\cdot S[k]\) where \(H[k]\) is the 8-point DFT of \(h\) zero-padded to length 8. (c) What would \(R[k]\) look like if \(N_{CP} = 1\) (too short)?
  3. Q3.3 — MMSE vs ZF tradeoff: In a frequency-selective channel, subcarrier \(k_0\) undergoes a deep fade: \(|H[k_0]|^2 = 0.01\) (−20 dB). The operating SNR is 20 dB (\(\sigma_s^2/\sigma_w^2 = 100\)). (a) Compute the post-equalization SNR for ZF and MMSE equalizers on subcarrier \(k_0\). (b) On a non-faded subcarrier with \(|H[k]|^2 = 1\), compare the two equalizers again. (c) Explain qualitatively why a modern 5G receiver uses MMSE-IRC rather than ZF, especially in interference-limited scenarios.
§4

PAPR — The Peak-to-Average Power Ratio Problem

Why OFDM's greatest strength creates its most painful hardware constraint

4.1 PAPR Definition

The Peak-to-Average Power Ratio quantifies how much the instantaneous power of a transmitted signal can exceed its average power. For a continuous-time OFDM signal s(t):

$$\mathrm{PAPR} = \frac{\max_{0 \le t \le T_s} |s(t)|^2}{\mathbb{E}\!\left[|s(t)|^2\right]}$$

An OFDM symbol with N subcarriers, each bearing unit-power:

$$s(t) = \frac{1}{\sqrt{N}}\sum_{k=0}^{N-1} X_k \, e^{j2\pi k \Delta f \, t}, \quad 0 \le t \le T_s$$

Worst case: all N subcarriers align in phase at the same instant, so the instantaneous amplitude is N times the per-subcarrier amplitude:

$$\mathrm{PAPR}_{\max} = N \quad \Longrightarrow \quad 10\log_{10}(N) \text{ dB}$$
N (subcarriers)PAPRmax (linear)PAPRmax (dB)
646418.1 dB
25625624.1 dB
51251227.1 dB
1024102430.1 dB
4096409636.1 dB
Insight: The theoretical maximum PAPR grows with the number of subcarriers, but coherent alignment of all subcarriers at a single point in time is astronomically improbable for random data. The relevant metric for hardware design is the statistical PAPR at a given outage probability.

4.2 Statistical Analysis of PAPR

For large N, the Central Limit Theorem (CLT) applies: the sum of many independent random variables approaches a Gaussian distribution. The real and imaginary parts of s(t) each become approximately i.i.d. Gaussian, so the envelope |s(t)| follows a Rayleigh distribution.

$$|s(t)| \sim \mathrm{Rayleigh}\!\left(\sigma\right), \quad \sigma^2 = \frac{P_{\mathrm{avg}}}{2}$$

The instantaneous power |s(t)|² follows an exponential distribution with mean Pavg. Treating the N sub-samples as independent (Nyquist rate), the Complementary CDF (CCDF) of PAPR is:

$$\Pr(\mathrm{PAPR} > \gamma) \approx 1 - \left(1 - e^{-\gamma}\right)^N$$
Note on independence: The N Nyquist samples of an OFDM symbol are not truly independent (the IFFT introduces correlation), so this formula slightly underestimates the true CCDF at high PAPR. More accurate expressions use the oversampling factor L and correction terms. For engineering intuition, the formula above is widely used.

Operating PAPR at 0.1% outage (CCDF = 10−3): solving 1 − (1 − e−γ)N = 10−3:

$$\gamma \approx \ln(N) + \ln(10^3) = \ln\!\left(N \times 10^3\right)$$
Nγ0.1% (linear)γ0.1% (dB)
64≈ 2.23≈ 3.5 dB
256≈ 3.61≈ 5.6 dB
512≈ 4.30≈ 6.3 dB
1024≈ 5.00≈ 7.0 dB
Practical rule of thumb: Real OFDM systems with oversampling (L = 4–8) and actual random QAM data exhibit PAPR values of roughly 10–12 dB at 0.1% outage for N = 512–2048. The formula above gives a lower bound; simulations add ~2–4 dB.

4.3 Why PAPR Matters — The HPA Problem

A High Power Amplifier (HPA) is characterised by a linear region up to its saturation point Psat, beyond which the output clips and severe nonlinear distortion is introduced. The transmit chain must operate with sufficient back-off to keep the signal within the linear region.

$$\mathrm{OBO} = P_{\mathrm{sat}} - P_{\mathrm{avg,out}} \quad [\mathrm{dB}]$$

where OBO is the Output Back-Off. For reliable operation, OBO must equal (or exceed) the PAPR at the required outage level.

Analogy: Imagine driving a sports car capable of 200 mph but you must keep it below 40 mph (20% throttle) to avoid engine damage. The car's average fuel efficiency at 40 mph is a fraction of its peak efficiency — you are paying for a powerful engine you can barely use. PAPR forces amplifier designers into exactly this position.

Quantifying Efficiency Loss

For a Class A amplifier, the theoretical drain efficiency is:

$$\eta_A = \frac{1}{2} \cdot \frac{P_{\mathrm{out}}}{P_{\mathrm{DC}}} = \frac{1}{2} \cdot 10^{-\mathrm{OBO}/10}$$

At saturation (OBO = 0 dB): ηA = 50%. With a 10 dB back-off:

$$\eta_A\big|_{\mathrm{OBO}=10\,\mathrm{dB}} = 0.5 \times 10^{-1} = 5\%$$
Critical impact: A 10 dB PAPR forces a 10 dB output back-off. Class A amplifier efficiency collapses from 50% to 5% — a 10× efficiency penalty. In a base-station with kilowatts of RF output power, this translates to massive cooling and energy costs.

Class B amplifiers have a more favourable back-off characteristic:

$$\eta_B = \frac{\pi}{4} \cdot 10^{-\mathrm{OBO}/20}$$

Peak efficiency ≈ 78.5% at saturation, dropping to ≈ 24.8% at 10 dB OBO — still painful, but significantly better than Class A.

4.4 PAPR Reduction Techniques

Six major categories of PAPR reduction exist, each with distinct trade-offs in complexity, spectral efficiency, and performance.

(a) Clipping and Filtering

The simplest approach: hard-clip the signal when its envelope exceeds a threshold Aclip, then apply a bandpass filter to restore the out-of-band spectrum.

$$\tilde{s}(t) = \begin{cases} s(t) & |s(t)| \le A_{\mathrm{clip}} \\ A_{\mathrm{clip}}\,e^{j\angle s(t)} & |s(t)| > A_{\mathrm{clip}} \end{cases}$$

The Clipping Ratio (CR) is defined as:

$$\mathrm{CR} = \frac{A_{\mathrm{clip}}}{\sqrt{P_{\mathrm{avg}}}}$$

Trade-off: Smaller CR → more PAPR reduction, but more clipping noise → higher BER and in-band distortion. Filtering removes out-of-band regrowth but may re-introduce small peaks. Iterative clipping and filtering (ICF) can approach CR = 1.4 with acceptable BER degradation (<1 dB at 10−3 BER for QPSK).

(b) Selected Mapping (SLM)

Generate U candidate OFDM symbols by multiplying the frequency-domain data vector X by U different phase rotation vectors P(u). Transmit the candidate with the lowest PAPR.

$$\mathbf{X}^{(u)} = \mathbf{X} \odot \mathbf{P}^{(u)}, \quad u = 1, \ldots, U$$
$$u^* = \arg\min_u \mathrm{PAPR}\!\left(\mathbf{x}^{(u)}\right)$$

Side information: The receiver must be informed which rotation vector was used to recover the data. This costs ⌈log₂U⌉ bits per OFDM symbol. U = 4 reduces PAPR by ≈3 dB; U = 8 by ≈4 dB, but complexity grows linearly in U.

(c) Partial Transmit Sequences (PTS)

Partition the N subcarriers into V disjoint sub-blocks X(v). Apply phase factors b(v) ∈ {±1, ±j} to each sub-block, then sum:

$$s = \sum_{v=1}^{V} b^{(v)} \, \mathcal{F}^{-1}\!\left\{\mathbf{X}^{(v)}\right\}$$

Optimise over all combinations of b(v). With 4 phase values and V sub-blocks: 4V candidates (minus one degree of freedom = 4V−1 IFFT evaluations). Total IFFT count: V × 4V−1. V = 4 → 16 IFFTs; V = 8 → 8192 IFFTs — exponential complexity growth.

(d) Tone Reservation (TR)

Reserve a small subset R of subcarriers (the "peak-reduction tones") that carry no data. Use these tones to construct a cancelling signal c(t) that reduces peaks in the time domain:

$$\tilde{s}(t) = s(t) + c(t), \quad C_k = 0 \text{ for } k \notin \mathcal{R}$$

No side information needed — the receiver simply ignores the reserved tones. The iterative kernel algorithm converges in 5–10 iterations to 1–2 dB PAPR reduction. TR is specified in LTE Release 8 (PDSCH PAPR reduction) and DVB-T2. Typically |R| = 1–2% of total subcarriers.

(e) Active Constellation Extension (ACE)

Extend outer constellation points further outward (away from the decision boundary) to create destructive interference with peak-contributing subcarriers. Inner points are left unchanged to preserve minimum Euclidean distance.

$$\tilde{X}_k = X_k + \Delta_k, \quad |\Delta_k| \text{ constrained s.t. BER unchanged}$$

Advantages: No rate loss, no side information, compatible with adaptive modulation. Typical PAPR reduction: 1.5–3 dB. Works best with QAM constellations of order 16 or higher (more outer points available for extension).

(f) DFT-s-OFDM (SC-FDMA)

The most impactful PAPR reduction: use a DFT precoding stage before the IFFT to convert the signal into a single-carrier-like waveform while retaining OFDM's multi-carrier framework. Covered in detail in §4.5.

4.5 DFT-s-OFDM (SC-FDMA) — PAPR Advantage in Detail

Transmitter Architecture

The DFT-s-OFDM transmitter adds a single DFT precoding stage before the conventional OFDM IFFT:

StageOperationOutput Size
1. DFT Precoding M-point DFT of M data symbols: X̃ = FM · d M frequency bins
2. Subcarrier Mapping Map M DFT outputs onto M contiguous subcarriers of N-point grid (N ≫ M) N subcarrier values
3. N-point IFFT Standard OFDM IFFT N time samples
4. CP insertion Add cyclic prefix of length NCP N + NCP samples

The cascade of M-point DFT followed by the N-point IFFT is equivalent to transmitting M data symbols as a single-carrier signal occupying a bandwidth of M · Δf. Mathematically, the time-domain output is:

$$s[n] = \frac{1}{\sqrt{N}} \sum_{k=0}^{N-1} \tilde{X}_k \, e^{j 2\pi kn/N}$$
$$\tilde{X}_k = \frac{1}{\sqrt{M}} \sum_{m=0}^{M-1} d[m] \, e^{-j 2\pi mk/M}, \quad k \in \mathcal{S}$$

Substituting, s[n] reduces to a linearly-modulated single-carrier signal (with rectangular pulse-shaping), explaining the dramatically lower PAPR.

Why PAPR is lower: In CP-OFDM, each subcarrier is independently modulated — they can align constructively. In DFT-s-OFDM, each "subcarrier" carries a mixture of all M data symbols (spread by the DFT), so a single large symbol cannot dominate the peak. The signal behaves statistically like a single-carrier waveform.

PAPR Numbers

ParameterCP-OFDMDFT-s-OFDM
PAPR at 0.1% CCDF (QPSK)~10–11 dB~5–6 dB
PAPR at 0.1% CCDF (16QAM)~11–12 dB~6–7 dB
Reduction vs CP-OFDM4–6 dB

Why 5G NR Uses Both Waveforms

DirectionWaveformPrimary Reason
Downlink (gNB → UE) CP-OFDM Base station has abundant power; MIMO spatial multiplexing requires independent per-subcarrier precoding; complexity/power at gNB is manageable.
Uplink (UE → gNB) DFT-s-OFDM (default) + CP-OFDM (optional) UE battery and PA efficiency are critical; 4–6 dB PAPR reduction → ~3× better PA efficiency → longer battery life. UE uses simpler SISO or small MIMO.
5G NR specification detail: 3GPP TS 38.211 §5.3 specifies DFT-s-OFDM for PUSCH when transform precoding is enabled (transformPrecoder = enabled). The UE signals capability and the network configures it. CP-OFDM is also supported in UL for MIMO rank > 1, since DFT-s-OFDM limits the UL to single-layer transmission.

4.6 Waveform Comparison: CP-OFDM vs DFT-s-OFDM

Property CP-OFDM DFT-s-OFDM (SC-FDMA)
PAPR (0.1% outage) ~10–12 dB ~5–7 dB (4–6 dB gain)
PA efficiency Low (~5% Class A at 10 dB OBO) Higher (~15% Class A at 5 dB OBO)
Spectral efficiency High — independent per-subcarrier modulation Slightly lower — DFT precoding adds constraint
Receiver complexity One-tap FEQ per subcarrier (simple) FEQ + IDFT at receiver (slightly higher)
MIMO spatial multiplexing Fully supported — arbitrary rank Limited to single-layer (rank-1) in standard use
Frequency diversity Limited (localized scheduling) Inherent via DFT spreading
Subcarrier allocation Any subset (distributed or localized) Must be contiguous (LFDMA/DFDMA special cases)
Standards (DL) LTE, 5G NR, Wi-Fi, WiMAX, DVB-T2 Not used in DL (no PA efficiency advantage)
Standards (UL) 5G NR UL (optional), Wi-Fi LTE UL (mandatory), 5G NR UL (default)
Sensitivity to frequency offset High (ICI across all subcarriers) High (same OFDM ICI mechanism)
Study Questions — §4 PAPR
  1. Q1 — CCDF calculation: An OFDM system uses N = 512 subcarriers. Using the approximation Pr(PAPR > γ) ≈ 1 − (1 − e−γ)N, calculate the PAPR threshold γ (in dB) at a 0.01% outage probability (CCDF = 10−4). Then compare with N = 2048 (5G NR numerology μ=0, 20 MHz BW). How much additional back-off is required for the larger FFT? Hint: solve e−γ ≈ ln(N)/N at small outage values.
  2. Q2 — Efficiency comparison: A 5G NR base station transmits 40 W average output power. The HPA is Class B. Compare the DC power consumption when: (a) transmitting CP-OFDM DL at 10 dB OBO, and (b) transmitting DFT-s-OFDM UL at 5 dB OBO. Using Class B efficiency formula ηB = (π/4) · 10−OBO/20, calculate the wasted power in each case. What is the CO₂ implication if 10,000 base stations operate continuously?
  3. Q3 — SLM vs PTS trade-off: You are designing a PAPR reduction scheme for an N = 256 OFDM system targeting 3 dB PAPR reduction. Compare SLM with U = 4 versus PTS with V = 4 sub-blocks and 4 phase values. (a) How many IFFT operations does each method require per OFDM symbol? (b) How many bits of side information does SLM require per symbol? (c) Which method is preferable for a latency-constrained real-time system, and why might PTS be preferred for offline coding?
§5

Frequency & Timing Synchronization

CFO, ICI, and Recovery — Why a 1% offset breaks OFDM orthogonality

5.1 Why Synchronization is Critical for OFDM

Single-carrier systems are relatively robust to small frequency and timing errors: a slight offset merely shifts the constellation slightly. OFDM is fundamentally different. Its entire performance rests on orthogonality between subcarriers — the property that every subcarrier integrates to zero over every other subcarrier's tone. That orthogonality is fragile.

Orthogonality condition: for two OFDM subcarriers at frequencies \(f_k = k \Delta f\) and \(f_l = l \Delta f\), orthogonality holds exactly only when the receiver DFT integration window is perfectly aligned and the carrier reference frequency is identical at TX and RX. Any departure immediately causes energy from every subcarrier to spill into every other — Inter-Carrier Interference (ICI).

A normalized CFO of just \(\varepsilon = 0.1\) (10% of subcarrier spacing) degrades SIR by roughly 10–12 dB. At \(\varepsilon = 0.2\) the system is effectively unusable without correction. This makes synchronization a first-order design requirement in any practical OFDM transceiver.

Key insight: In single-carrier systems, a frequency offset is a nuisance. In OFDM, it is catastrophic — it converts a carefully designed orthogonal multicarrier waveform into a mutually interfering mess.

5.2 Carrier Frequency Offset (CFO) — Model and Effect

Sources of CFO

  • Oscillator mismatch: TX and RX crystal oscillators run at nominally the same frequency but with small tolerances (ppm-level). At 28 GHz, even 1 ppm mismatch = 28 kHz offset — nearly two 15 kHz subcarriers.
  • Doppler shift: Relative motion between TX and RX causes \(f_D = v \cdot f_c / c\). At vehicular speeds (100 km/h) and millimeter-wave, Doppler can be several kHz.
  • Phase noise: Local oscillator phase noise acts as a time-varying instantaneous frequency offset, producing both common phase error and ICI.

Signal Model with CFO

Let the transmitted baseband OFDM symbol be \(s(t)\). After passing through a channel with carrier frequency offset \(\Delta f\), the received signal before the DFT is:

\[ r(t) \;=\; s(t)\, e^{\,j 2\pi \Delta f\, t} + w(t) \]

Normalizing to subcarrier spacing \(\Delta f_{\rm sc}\), define the normalized CFO:

\[ \varepsilon \;=\; \frac{\Delta f}{\Delta f_{\rm sc}} \]

With an \(N\)-point DFT and sample index \(n = 0, 1, \ldots, N-1\), the received time-domain samples are:

\[ r[n] \;=\; s[n]\, e^{\,j 2\pi \varepsilon n / N} + w[n] \]

ICI Power Formula

After the DFT, the output on subcarrier \(k\) is:

\[ Y[k] \;=\; X[k] \cdot \underbrace{\frac{\sin(\pi \varepsilon)}{N \sin(\pi \varepsilon / N)} e^{\,j\pi\varepsilon(N-1)/N}}_{\text{desired signal, attenuated}} \;+\; \underbrace{\sum_{l \neq k} X[l] \cdot I_{l-k}}_{\text{ICI}} \;+\; W[k] \]

where \(I_m = \frac{\sin(\pi(\varepsilon - m))}{N \sin(\pi(\varepsilon-m)/N)} e^{j\pi(\varepsilon-m)(N-1)/N}\) is the ICI coefficient from subcarrier \(l = k+m\). For small \(\varepsilon\), the total ICI power relative to signal power is:

\[ \frac{P_{\rm ICI}}{P_s} \;\approx\; \frac{\pi^2 \varepsilon^2}{3} \qquad (\varepsilon \ll 1) \]

The resulting Signal-to-ICI Ratio (SIR) due to CFO alone is:

\[ \boxed{ \mathrm{SIR}_{\rm CFO} \;\approx\; \frac{3}{\pi^2 \varepsilon^2} } \]
Numerical example: At \(\varepsilon = 0.1\), \(\mathrm{SIR} \approx 3/(\pi^2 \cdot 0.01) \approx 30.4 \approx 14.8 \text{ dB}\). This caps the maximum achievable SNR regardless of transmit power — a hard ceiling.

5.3 Integer vs Fractional CFO

CFO can be decomposed as \(\varepsilon = \varepsilon_{\rm int} + \varepsilon_{\rm frac}\) where \(\varepsilon_{\rm int} \in \mathbb{Z}\) and \(|\varepsilon_{\rm frac}| \leq 0.5\).

Integer CFO (\(\varepsilon_{\rm int}\))

  • Shifts all subcarriers by a whole number of positions in the DFT output.
  • No ICI — orthogonality is preserved since the subcarrier grid merely shifts cyclically.
  • Data is decoded from wrong subcarrier indices → symbol errors, but no spectral spreading.
  • Can be detected via known pilots at fixed subcarrier positions (ambiguity from cyclic shift).
  • Correction: apply a frequency shift of \(-\varepsilon_{\rm int} \cdot \Delta f_{\rm sc}\) before the DFT.

Fractional CFO (\(\varepsilon_{\rm frac}\))

  • Breaks orthogonality — every subcarrier receives interference from all others.
  • ICI proportional to \(\varepsilon_{\rm frac}^2\) as derived above.
  • Even \(|\varepsilon_{\rm frac}| = 0.01\) causes \(\sim\)–35 dB SIR ceiling — problematic for high-order QAM.
  • Must be corrected in time domain (before DFT) by multiplying by \(e^{-j 2\pi \hat{\varepsilon}_{\rm frac} n / N}\).
  • Primary target of all fine frequency synchronization algorithms.
Practical rule: Coarse sync (PSS, training) eliminates integer CFO. Fine sync (CP correlation, pilot phase) corrects fractional CFO to sub-percent precision. Both steps are mandatory before DFT in any OFDM receiver.

Chart: ICI Power vs Normalized CFO

The plot below shows the ICI-to-signal ratio (in dB) as a function of normalized CFO \(\varepsilon\). Both the exact expression (computed numerically from the DFT output formula) and the small-angle approximation \(P_{\rm ICI}/P_s \approx \pi^2\varepsilon^2/3\) are shown. The rapid degradation with even modest CFO is the defining challenge of OFDM synchronization.

Figure 5.1 — ICI-to-signal ratio vs normalized CFO. At \(\varepsilon=0.1\) the SIR floor is ~14.8 dB; at \(\varepsilon=0.3\) it collapses to ~5 dB, making 16-QAM or higher impractical without correction.

5.4 CFO Estimation Algorithms

Time-Domain: CP-Based Estimation

The cyclic prefix is a copy of the last \(N_{\rm CP}\) samples of the OFDM symbol. Without CFO, \(r[n] = r[n + N]\) for \(n\) in the CP window. With CFO, this relationship becomes:

\[ r[n] \;=\; r[n+N] \cdot e^{-j 2\pi \varepsilon} \cdot (\text{channel factor}) \]

The cross-correlation between the CP and its copy is:

\[ \gamma \;=\; \sum_{n=0}^{N_{\rm CP}-1} r^*[n] \cdot r[n + N] \]

The CFO estimate is extracted from the phase:

\[ \hat{\varepsilon} \;=\; \frac{1}{2\pi} \angle(\gamma) \quad\Longrightarrow\quad \hat{\Delta f} \;=\; \hat{\varepsilon} \cdot \Delta f_{\rm sc} \]
Range limitation: Since \(\angle(\gamma) \in [-\pi, \pi)\), the CP-based estimator is unambiguous only for \(|\varepsilon| \leq 0.5\) (half a subcarrier spacing). For larger offsets, integer CFO correction must precede fine estimation.

Frequency-Domain: Pilot-Based Estimation

If pilot subcarriers transmit known symbols \(P[k]\), the received pilot after DFT is:

\[ Y_p[k] \;=\; H[k] \cdot P[k] \cdot e^{j\phi_{\rm CFO}[k]} + W[k] \]

where the CFO-induced phase on pilot \(k\) in symbol \(m\) is approximately \(\phi \approx 2\pi\varepsilon m\) (linear in symbol index). Comparing received to expected pilot phases across successive symbols:

\[ \hat{\varepsilon} \;=\; \frac{1}{2\pi} \cdot \frac{\angle(Y_p^{(m+1)}[k]\, (Y_p^{(m)}[k])^*)}{1} \]

Schmidl-Cox Algorithm

The Schmidl-Cox algorithm uses a special training symbol where the first half is identical to the second half in the time domain (achieved by transmitting non-zero data only on even subcarriers). Define the timing metric:

\[ M(d) \;=\; \frac{\bigl|P(d)\bigr|^2}{R(d)^2} \qquad P(d) = \sum_{m=0}^{N/2-1} r^*[d+m]\, r[d + m + N/2] \]

where \(R(d) = \sum_{m=0}^{N/2-1} |r[d+m+N/2]|^2\) is a normalization term. The peak of \(M(d)\) locates the symbol start, and the phase of \(P(d)\) at the peak gives the fractional CFO estimate:

\[ \hat{\varepsilon} \;=\; \frac{N}{2\pi \cdot N/2} \cdot \frac{1}{2} \angle(P(d_{\rm peak})) \;=\; \frac{\angle(P(d_{\rm peak}))}{\pi} \]
Schmidl-Cox strengths: joint timing + frequency estimation in one pass; metric \(M(d)\) is normalized (0 to 1) and channel-independent; used extensively in IEEE 802.11 (Wi-Fi) preamble design.

5.5 Symbol Timing Offset (STO)

Even with perfect frequency synchronization, incorrect DFT window placement causes its own impairments. Define the symbol timing offset \(\delta = \delta_{\rm int} + \delta_{\rm frac}\) in samples.

Integer STO

An integer timing offset of \(\delta_{\rm int}\) samples corresponds to a cyclic shift of the DFT input. By the DFT shift theorem, this appears as a linear phase ramp across subcarriers:

\[ Y[k] \;=\; X[k] \cdot H[k] \cdot e^{-j 2\pi k \delta_{\rm int} / N} \]

This is a multiplicative phase that varies linearly with subcarrier index \(k\). It can be absorbed into the channel estimate during equalization — so integer STO within the CP window causes no ICI and no ISI, only a correctable phase ramp.

CP tolerance window: As long as \(0 \leq \delta_{\rm int} \leq N_{\rm CP} - L_{\rm ch}\) (where \(L_{\rm ch}\) is channel delay spread in samples), integer timing offset causes no ISI/ICI — the CP absorbs it. This is one of the most valuable properties of the CP.

Fractional STO and Beyond-CP Offset

If the timing offset causes the DFT window to extend beyond the CP boundary (i.e., it captures part of the previous or next symbol), ISI and ICI result:

  • ISI: energy from adjacent OFDM symbol enters the DFT window.
  • ICI: the channel impulse response is no longer circular with respect to the DFT window, destroying the diagonal channel model in frequency domain.

Timing Estimation via CP Correlation

Exploit the CP structure: the correlation between the received signal and a delayed version of itself peaks when the delay equals the DFT length \(N\):

\[ \Lambda(d) \;=\; \sum_{m=0}^{N_{\rm CP}-1} r^*[d + m] \cdot r[d + m + N] \]

The index \(\hat{d} = \arg\max_d |\Lambda(d)|^2 / E(d)^2\) (where \(E(d)\) normalizes energy) gives the estimated symbol start position. The correlation magnitude forms a plateau of width \(N_{\rm CP} - L_{\rm ch} + 1\) — the safe timing window.

Chart: CP-Based Timing & Frequency Sync Correlation

The chart below simulates the normalized CP correlation metric \(M(d)\) as a function of timing offset from the true symbol start. The plateau region corresponds to the safe CP window; the peak marks the optimal DFT start. A multipath channel (4-tap, delay spread 4 samples) is included to show real-world behavior.

Figure 5.2 — Normalized CP correlation metric vs timing offset (samples). \(N=64\), \(N_{\rm CP}=16\), SNR = 10 dB, 4-tap Rayleigh channel with max delay 4 samples. The plateau region has width \(\approx N_{\rm CP} - L_{\rm ch} = 12\) samples. True symbol start is at offset = 0.

5.6 Phase Noise

Model

Local oscillator phase noise introduces a time-varying phase \(\phi(t)\) that modulates the received signal. In discrete time, the received sample is:

\[ r[n] \;=\; s[n] \cdot e^{j\phi[n]} + w[n] \]

After the DFT, phase noise has two effects that can be separated via the Fourier expansion of \(e^{j\phi[n]}\):

Common Phase Error (CPE)

  • DC component of phase noise: \(\Phi_0 = \frac{1}{N}\sum_n \phi[n]\).
  • Rotates all subcarriers by the same angle \(\Phi_0\).
  • Equivalent to a single complex multiplier on the entire symbol.
  • Correctable using pilot symbols distributed across frequency: estimate \(\hat{\Phi}_0\) from pilot phase, then de-rotate.
  • This is why 5G NR defines DMRS and PTRS for CPE tracking.

ICI from Phase Noise

  • Non-DC components of phase noise cause ICI between subcarriers.
  • For Wiener (random walk) phase noise with variance \(\sigma_\phi^2\) per sample:
\[ \mathrm{SIR}_{\rm PN} \;\approx\; \frac{1}{2\pi^2 \sigma_\phi^2 f_{\rm max}^2 T_u^2} \]
  • Increases with symbol duration \(T_u\) — larger OFDM symbols (smaller SCS) suffer more from phase noise.
  • Particularly severe at mmWave (FR2): oscillator stability is worse, and 5G NR mmWave uses PTRS to track and correct residual ICI.

5.7 Doppler Effect and High-Mobility Scenarios

Relative motion between TX and RX at velocity \(v\) causes a Doppler frequency shift:

\[ f_D \;=\; \frac{v \cdot f_c}{c} \qquad \text{(max Doppler for one-way motion)} \]

Worked Example: 28 GHz at 100 km/h

Parameters: \(f_c = 28 \text{ GHz}\), \(v = 100 \text{ km/h} = 27.78 \text{ m/s}\), \(c = 3 \times 10^8 \text{ m/s}\)

\(f_D = 27.78 \times 28 \times 10^9 / (3 \times 10^8) = 2593 \text{ Hz}\)

With 15 kHz SCS (\(\mu=0\)): \(\varepsilon = f_D / \Delta f_{\rm sc} = 2593/15000 = 0.173\)

\(\mathrm{SIR} \approx 3/(\pi^2 \times 0.173^2) \approx 10.1 \text{ dB}\) — severely limits 64-QAM or 256-QAM without correction.

With 60 kHz SCS (\(\mu=2\)): \(\varepsilon = 2593/60000 = 0.043\)\(\mathrm{SIR} \approx 164 \approx 22.1 \text{ dB}\) — acceptable.

This is the fundamental reason 5G NR defines multiple numerologies (subcarrier spacings). Mobility determines the appropriate SCS:

\(\mu\) SCS (kHz) Slot duration (ms) Max Doppler @ 28 GHz, 120 km/h Normalized CFO \(\varepsilon\) SIR (dB)
0151.0 3111 Hz0.2078.6
1300.5 3111 Hz0.10414.5
2600.25 3111 Hz0.05220.5
31200.125 3111 Hz0.02626.5

Table 5.1 — Normalized Doppler CFO and resulting SIR vs 5G NR numerology at 28 GHz, 120 km/h.

5.8 Synchronization in 5G NR

5G NR implements a hierarchical synchronization architecture. The UE acquires synchronization in stages, from coarse to fine, using dedicated reference signals.

Synchronization Signal Block (SSB)

The SSB consists of PSS + SSS + PBCH DMRS. It is transmitted periodically (default 20 ms period) and enables initial cell search.

PSS
Primary Synchronization Signal
  • Zadoff-Chu (ZC) sequence of length 127 on subcarriers −63 to +63.
  • Three possible ZC root indices: \(u \in \{25, 29, 34\}\) (indicating 3 physical layer cell IDs modulo 3).
  • ZC sequences have ideal autocorrelation → correlation peak gives symbol timing.
  • PSS detection gives: coarse timing + integer CFO (frequency hypothesis search) + \(N_{\rm ID}^{(2)} \in \{0,1,2\}\).
SSS
Secondary Synchronization Signal
  • m-sequence based sequence of length 127, also on 127 subcarriers.
  • 336 possible sequences → encodes \(N_{\rm ID}^{(1)} \in \{0,\ldots,335\}\).
  • Full Physical Cell ID: \(N_{\rm ID}^{\rm cell} = 3 N_{\rm ID}^{(1)} + N_{\rm ID}^{(2)}\), giving 1008 unique cell IDs.
  • Also provides frame timing (half-frame detection).
DMRS
Demodulation Reference Signal
  • Embedded within PDSCH/PUSCH data regions at known time-frequency positions.
  • Enables channel estimation at the receiver for equalization.
  • Also corrects Common Phase Error (CPE) from residual CFO and phase noise within each slot.
  • Density and pattern depend on channel conditions (Type A: front-loaded, Type B: additional).
PTRS
Phase Tracking Reference Signal
  • Used only at FR2 (mmWave, above 24.25 GHz) where phase noise is most severe.
  • Sparse in frequency (1 or 2 subcarriers per PRB block) but dense in time (every or every-other symbol).
  • Tracks residual CPE after DMRS correction, symbol by symbol.
  • Particularly important for 256-QAM at FR2 where phase noise margin is very tight.
5G NR sync sequence: PSS (timing + coarse CFO + cell ID mod 3) → SSS (cell ID group + frame timing) → PBCH DMRS (fine channel est.) → PDCCH/PDSCH DMRS (per-slot channel + CPE) → PTRS (per-symbol phase tracking at FR2)

Study Questions

  1. A 5G NR system operates at 3.5 GHz with 30 kHz SCS (\(\mu=1\)). A UE moves at 200 km/h. (a) Calculate the maximum Doppler shift \(f_D\). (b) Compute the normalized CFO \(\varepsilon\). (c) Using the SIR approximation \(\mathrm{SIR} \approx 3/(\pi^2\varepsilon^2)\), find the maximum achievable SNR ceiling due to ICI. (d) Would this SIR be acceptable for 64-QAM (which requires approximately 22 dB SNR)?
  2. The CP-based CFO estimator computes \(\hat{\varepsilon} = \frac{1}{2\pi}\angle(\gamma)\) where \(\gamma = \sum_{n=0}^{N_{\rm CP}-1} r^*[n] r[n+N]\). (a) What is the unambiguous estimation range of this estimator in terms of normalized CFO? (b) If the true CFO is \(\varepsilon = 1.3\) subcarrier spacings, what would the CP estimator report? (c) Describe a two-stage approach (combining CP-based and pilot-based estimation) to handle CFOs larger than ±0.5 subcarrier spacings.
  3. Explain why 5G NR defines Phase Tracking RS (PTRS) for FR2 (mmWave) but not for FR1 (sub-6 GHz). Your answer should address: (a) how oscillator phase noise power spectral density scales with carrier frequency, (b) the difference between CPE correction via DMRS and per-symbol CPE tracking via PTRS, and (c) why larger subcarrier spacings (used at FR2 for Doppler robustness) actually help with phase noise even though the symbol duration \(T_u\) is shorter.
§6

Channel Estimation & Equalization

Single-tap equalization · ZF & MMSE · Pilot grids · DFT smoothing · 5G NR DMRS

6.1 Channel Model for OFDM

After cyclic prefix (CP) removal and the N-point FFT, the multipath channel reduces to a set of independent, parallel scalar channels — one per subcarrier. This is the fundamental reason OFDM is so amenable to simple equalisation. If the channel impulse response (CIR) is h[l], \; l=0,\ldots,L-1 with L \le N_{CP}, the received sample on subcarrier k is:

$$Y[k] \;=\; H[k]\,X[k] \;+\; N[k], \qquad k = 0,1,\ldots,N-1$$

where H[k] = \sum_{l=0}^{L-1} h[l]\,e^{-j2\pi kl/N} is the DFT of the CIR evaluated at bin k, X[k] is the transmitted QAM symbol, and N[k] \sim \mathcal{CN}(0,\sigma_n^2) is complex AWGN.

Key insight: Because the CP converts linear convolution into circular convolution, the entire channel matrix in the time domain becomes diagonal in the DFT domain. Each subcarrier sees only a scalar complex multiplication — the DFT coefficient H[k]. Equalization then requires only a single complex division (or multiplication) per subcarrier, instead of a full matrix inversion.

The single-tap equalizer produces the estimate:

$$\hat{X}[k] \;=\; W[k]\,Y[k] \;=\; W[k]\,H[k]\,X[k] \;+\; W[k]\,N[k]$$

The design of the equalizer coefficient W[k] trades off between residual channel distortion (if W[k]H[k] \ne 1) and noise amplification (large |W[k]| boosts W[k]\,N[k]). The two canonical solutions are the Zero-Forcing and MMSE equalizers.

6.2 Zero-Forcing (ZF) Equalizer

The Zero-Forcing equalizer forces the inter-symbol interference to zero by perfectly inverting the channel:

$$W_{\mathrm{ZF}}[k] \;=\; \frac{1}{H[k]}$$

This gives \hat{X}_{\mathrm{ZF}}[k] = X[k] + N[k]/H[k], so the residual noise power on subcarrier k is:

$$\sigma_{\mathrm{ZF},k}^2 \;=\; \frac{\sigma_n^2}{|H[k]|^2}$$
Noise enhancement: When the channel has a deep fade at subcarrier k (i.e., |H[k]| \ll 1), the coefficient |W_{\mathrm{ZF}}[k]| = 1/|H[k]| becomes very large, catastrophically amplifying the noise. In the extreme case of a channel null (H[k] = 0), the ZF equalizer is undefined — the subcarrier carries no useful information. This is particularly damaging at low SNR.

The post-equalization SNR at each subcarrier is:

$$\mathrm{SNR}_{\mathrm{ZF}}[k] \;=\; \frac{|H[k]|^2}{\sigma_n^2} \;=\; \mathrm{SNR}_{\mathrm{pre}}[k]$$

Achieves optimal performance only when the channel is flat (|H[k]|=\text{const}). In practice, diversity techniques (coding, frequency hopping) are needed to protect against nulls.

6.3 MMSE Equalizer

The Minimum Mean Square Error (MMSE) equalizer minimizes \mathbb{E}\left[|\hat{X}[k]-X[k]|^2\right]. Assuming X[k] \sim \mathcal{CN}(0,\sigma_s^2):

$$W_{\mathrm{MMSE}}[k] \;=\; \frac{H^*[k]}{|H[k]|^2 + \dfrac{\sigma_n^2}{\sigma_s^2}} \;=\; \frac{H^*[k]}{|H[k]|^2 + \dfrac{1}{\mathrm{SNR}}}$$

where \mathrm{SNR} = \sigma_s^2/\sigma_n^2 is the average signal-to-noise ratio. The post-equalization SINR is:

$$\mathrm{SINR}_{\mathrm{MMSE}}[k] \;=\; \frac{|H[k]|^2\,\sigma_s^2}{\sigma_n^2} \cdot \frac{1}{1 + |H[k]|^2\,\sigma_s^2/\sigma_n^2} \;=\; \frac{|H[k]|^2\,\mathrm{SNR}}{1 + |H[k]|^2\,\mathrm{SNR}}$$

Limiting cases

  • High SNR: 1/\mathrm{SNR} \to 0W_{\mathrm{MMSE}} \to H^*[k]/|H[k]|^2 = 1/H[k] = W_{\mathrm{ZF}}
  • Low SNR / channel null: |H[k]|^2 \ll 1/\mathrm{SNR}W_{\mathrm{MMSE}} \approx H^*[k] \cdot \mathrm{SNR} \to 0 (gracefully suppress noisy subcarrier)
  • Very low SNR: MMSE → matched filter W_{\mathrm{MF}}[k] = H^*[k]/|H[k]|

Residual bias

MMSE is biased: it introduces a scaling error on X[k]. The estimate must be corrected:

$$\hat{X}_{\mathrm{MMSE}}[k] = \frac{W_{\mathrm{MMSE}}[k]\,Y[k]}{W_{\mathrm{MMSE}}[k]\,H[k]}$$

Alternatively, the bias is absorbed into the soft demapper's LLR computation in modern receivers.

ZF vs MMSE Equalizer — Noise Enhancement at Channel Nulls

A frequency-selective channel with a deep fade near subcarrier 40. The ZF response |W_{\mathrm{ZF}}[k]| = 1/|H[k]| blows up at the null, while the MMSE response |W_{\mathrm{MMSE}}[k]| remains bounded by trading off noise against residual distortion.

6.4 Pilot-Based Channel Estimation

To apply the equalizer, the receiver must first estimate H[k] across all subcarriers. The standard approach inserts known pilot symbols at selected time-frequency positions and interpolates between them.

Pilot Insertion Patterns

Comb pilots

Pilots placed every N_f subcarriers, across all or many OFDM symbols. Suited for rapidly time-varying channels (high Doppler) because the channel can be tracked continuously in time. Used in LTE CRS and 5G NR DMRS Type 1.

  • Pilot spacing in frequency: \Delta_f \le B_c (coherence BW)
  • Overhead proportional to 1/N_f

Block pilots

Entire OFDM symbols are dedicated as pilot symbols. Suited for slow fading channels with long coherence time. Simpler interpolation in frequency; extrapolation required at symbol boundaries in time.

  • Pilot symbol spacing in time: \Delta_t \le T_c (coherence time)
  • One pilot symbol every N_t data symbols

Least-Squares Estimation at Pilot Positions

At pilot subcarrier index k_p, where the transmitted symbol X_p[k_p] is known:

$$\hat{H}_{\mathrm{LS}}[k_p] \;=\; \frac{Y[k_p]}{X_p[k_p]} \;=\; H[k_p] \;+\; \frac{W[k_p]}{X_p[k_p]}$$

The noise term on the estimate has variance \sigma_n^2/|X_p|^2. Using unit-power pilots (|X_p|^2=1) gives the minimum-variance unbiased LS estimator. Full-power pilots (boosted above data power) improve estimate quality at the cost of increased PAPR and interference to adjacent subcarriers.

Interpolation Methods

MethodComplexityMSENotes
LinearLowModerateFast, suitable for slowly varying channels
Spline (cubic)MediumLowSmooth estimate; can overshoot at edges
DFT-basedMediumLow–Very LowLeverages CIR sparsity; see §6.5
LMMSE (Wiener)HighOptimalRequires channel statistics; see §6.8

2D Nyquist Pilot Density Requirements

To avoid aliasing in the estimated channel, the pilot grid must satisfy the 2D Nyquist sampling criterion:

$$\Delta f_{\mathrm{pilot}} \;\le\; \frac{B_c}{2}, \qquad \Delta t_{\mathrm{pilot}} \;\le\; \frac{T_c}{2}$$

where B_c \approx 1/(2\pi\tau_{\max}) is the coherence bandwidth and T_c \approx 1/(2\pi B_D) is the coherence time, \tau_{\max} is the RMS delay spread, and B_D is the Doppler spread. Equivalently in discrete terms:

$$N_f \;\le\; \left\lfloor\frac{B_c}{\Delta f_{\mathrm{sc}}}\right\rfloor, \qquad N_t \;\le\; \left\lfloor\frac{T_c}{T_{\mathrm{OFDM}}}\right\rfloor$$

6.5 DFT-Based Channel Estimation

The CIR has at most L taps, where L \ll N for typical channels (e.g., L=16 taps in a 2048-point FFT). This sparsity can be exploited to dramatically reduce estimation noise:

1
LS estimate at N pilot positions
Obtain \hat{H}_{\mathrm{LS}}[k_p] for all N_p pilots across the OFDM symbol.
2
IDFT → time domain CIR estimate
\hat{h}[l] = \mathrm{IDFT}\{\hat{H}_{\mathrm{LS}}[k]\}. The true CIR occupies taps l=0,\ldots,L-1; higher lags contain only noise.
3
Zero-pad / windowing beyond L taps
Set \hat{h}[l] = 0 for l \ge L (rectangular window) or apply a smooth taper to avoid Gibbs ringing.
4
DFT back → smoothed frequency-domain estimate
\hat{H}_{\mathrm{DFT}}[k] = \mathrm{DFT}\{\hat{h}_{\mathrm{trunc}}[l]\}.

The noise reduction factor is N_p/L. For N_p = 512 pilots and L = 16 taps, the noise power in the estimate is reduced by 15 dB. The MSE of the DFT estimator is:

$$\mathrm{MSE}_{\mathrm{DFT}} \;\approx\; \frac{L}{N_p} \cdot \frac{\sigma_n^2}{\sigma_s^2} \;=\; \frac{L}{N_p \cdot \mathrm{SNR}}$$
Practical note: The truncation window length L must be chosen conservatively to include the full channel delay spread. If L is underestimated, some channel energy is zeroed out, introducing bias. In 5G NR, the CP length provides an upper bound: the channel cannot have delays beyond N_{CP} without causing ISI.

6.6 DMRS in 5G NR

The Demodulation Reference Signal (DMRS) is the primary channel estimation pilot in 5G NR for PDSCH (downlink data) and PUSCH (uplink data). Its design reflects lessons from LTE and the demand for ultra-low-latency channel estimation.

DMRS Type 1

  • Comb-2 pattern: pilots on alternate subcarriers within each RB
  • Supports up to 4 orthogonal ports (OCC × frequency comb)
  • Lower pilot density → lower overhead for single-layer transmissions
  • Default for most NR deployments

DMRS Type 2

  • Comb-3 pattern: pilots grouped in sets of 2 consecutive subcarriers
  • Supports up to 6 orthogonal ports
  • Better suited for massive MIMO with many spatial layers
  • Higher pilot density in each port's allocation

Front-Loaded DMRS

NR places DMRS in the first 1 or 2 OFDM symbols of each slot (or mini-slot). This allows the UE to start channel estimation and data demodulation without waiting for the full slot — critical for low-latency URLLC. In LTE, CRS was spread across the entire subframe, incurring higher decoding latency.

Additional DMRS Positions (High Mobility)

For high-speed scenarios (vehicular, HST), the channel varies significantly within a slot. NR supports additional DMRS positions (up to 4 per slot), densifying the pilot grid in the time direction to track faster fading. This is configured via maxLength and additionalPosition in the DMRS configuration IE.

DMRS Sequence Generation

NR DMRS symbols are generated from a length-31 Gold sequence (same pseudorandom generator used for scrambling), initialized per cell ID, slot, and symbol index. The complex symbols are:

$$d(m) \;=\; \frac{1}{\sqrt{2}}\bigl(1 - 2c(2m)\bigr) \;+\; j\,\frac{1}{\sqrt{2}}\bigl(1 - 2c(2m+1)\bigr)$$

where c(n) is the Gold sequence output. This BPSK-like construction gives |d(m)|=1 — constant modulus — which minimises PAPR and ensures consistent pilot power across all positions.

SignalUseDirectionPattern
DMRSData demodulation (per-layer CE)DL+ULComb (Type 1/2) in time-first symbols
PTRSPhase noise trackingDL+ULSparse in frequency, dense in time
CSI-RSCQI/PMI/RI feedback, beam managementDLConfigurable grid, up to 32 ports
SRSUL channel sounding, reciprocityULComb in last symbols of slot
CRS (LTE)DL CE, RRM measurementsDLComb, 4 ports, all subframes

6.7 Time-Varying Channels & the Scattering Function

Real wireless channels are non-stationary: scatterers move, causing the channel to vary in both delay and Doppler. The Wide-Sense Stationary Uncorrelated Scattering (WSSUS) model characterises the channel by its scattering function S(\tau,\nu), the power spectral density in delay-Doppler space.

Delay spread & coherence bandwidth

$$B_c \;\approx\; \frac{1}{5\,\sigma_\tau}, \quad B_c^{(0.9)} \approx \frac{1}{50\,\sigma_\tau}$$

\sigma_\tau = RMS delay spread. Subcarrier spacing \Delta f \ll B_c is needed to ensure flat fading per subcarrier.

Doppler spread & coherence time

$$T_c \;\approx\; \frac{0.423}{f_D}, \quad f_D = \frac{v}{\lambda} = \frac{v\,f_c}{c}$$

f_D = maximum Doppler frequency, v = UE velocity, f_c = carrier frequency.

The 2D Nyquist pilot density requirements in discrete terms (subcarrier spacing \Delta f_{\mathrm{sc}}, OFDM symbol duration T_s = T_u + T_{CP}):

$$N_f \;\le\; \left\lfloor\frac{B_c}{2\,\Delta f_{\mathrm{sc}}}\right\rfloor, \qquad N_t \;\le\; \left\lfloor\frac{T_c}{2\,T_s}\right\rfloor$$

Violating these bounds causes aliasing in the estimated channel — the interpolated H[k] will contain errors that persist regardless of SNR, creating an estimation error floor.

ICI in high-Doppler channels: When the Doppler spread B_D is non-negligible compared to the subcarrier spacing \Delta f, the channel varies within one OFDM symbol. The DFT then no longer diagonalises the channel matrix; off-diagonal terms introduce Inter-Carrier Interference (ICI). ICI cannot be removed by single-tap equalisation — multi-tap (matrix) equalisation or OTFS modulation is required.

6.8 LMMSE (Wiener) Channel Estimation

The Linear MMSE (also called Wiener filter) estimator exploits the statistical correlation of the channel across frequency and time — the channel does not change independently at each subcarrier; nearby subcarriers are correlated with correlation length \sim B_c/\Delta f_{\mathrm{sc}}.

2D Wiener Filter

Let \mathbf{y}_p = [Y[k_{p,1}],\ldots,Y[k_{p,N_p}]]^T be the received pilots and \mathbf{h}_p the channel at pilot positions. The LMMSE estimate of the full channel vector \mathbf{h} is:

$$\hat{\mathbf{H}}_{\mathrm{LMMSE}} \;=\; \mathbf{R}_{H H_p} \left(\mathbf{R}_{H_p H_p} + \frac{\sigma_n^2}{\sigma_s^2}\mathbf{I}\right)^{-1} \hat{\mathbf{H}}_{\mathrm{LS}}$$

where \mathbf{R}_{H H_p} is the cross-correlation matrix between all subcarriers and pilot subcarriers, and \mathbf{R}_{H_p H_p} is the autocorrelation at pilot positions. The SNR-scaled identity accounts for the noise in the LS estimate.

Complexity challenge

Full LMMSE requires inversion of an N_p \times N_p matrix — complexity \mathcal{O}(N_p^3). For N_p = 128, that is ~2M operations per OFDM symbol — too expensive for real-time receivers.

Separable 1D approximation

Approximate the 2D filter as two cascaded 1D Wiener filters (frequency domain, then time domain). Reduces complexity to \mathcal{O}(N_f^2 + N_t^2). Most 4G/5G baseband implementations use this approach with pre-computed filter taps.

The 1D frequency-domain Wiener filter coefficients at pilot spacing \Delta k are derived from the Clarke/Jakes power delay profile. For an exponential PDP with RMS delay spread \sigma_\tau:

$$R_H[\Delta k] \;=\; \frac{1}{1 + j2\pi\,\Delta k\,\Delta f_{\mathrm{sc}}\,\sigma_\tau}$$

Pilot Grid & Interpolated Channel Estimate

Left: time-frequency resource grid showing pilot positions (yellow dots) vs data subcarriers (blue background). Right: the interpolated channel magnitude |\hat{H}(t,f)| surface reconstructed from the pilot estimates — a smoothly varying response reflecting the channel's coherence in both dimensions.

6.9 Practical Channel Estimation in LTE / 5G NR

Real deployments layer multiple reference signals, each optimised for a specific estimation purpose:

LTE Downlink

  • CRS (Cell-specific Reference Signals): Legacy DL pilots, present in every subframe regardless of data scheduling. Comb pattern with 6-subcarrier frequency separation and 7-symbol time separation per antenna port (4 ports: port 0–3). Used for channel estimation, RSRP/RSRQ measurement, and PDSCH demodulation in non-precoded modes.
  • DMRS (UE-specific RS): Precoded pilot for PDSCH in transmission modes 7–9. Located in symbol 4 (normal CP) of the subframe. Same precoding as data → transparent to the UE's equalizer.
  • CSI-RS: Introduced in LTE-A Release 10 for large-scale MIMO feedback. Configurable density, up to 8 ports.

5G NR — Full Reference Signal Suite

  • DMRS (PDSCH/PUSCH): Primary CE pilot; front-loaded (symbols 2–3 of slot); Types 1/2 with 1 or 2 CDM groups; mapped to up to 12 orthogonal ports via OCC/CDM. Supports both single-symbol and double-symbol DMRS.
  • PTRS (Phase Tracking RS): Extremely sparse in frequency (1 per PRG of 2/4 RBs), dense in time (every symbol). Corrects CPE (common phase error) from local oscillator phase noise — dominant impairment at mmWave (FR2).
  • CSI-RS: Flexible configuration: 1–32 ports, various densities (3, 1, 1/2 RE per RB per port). Multiple uses: NZP-CSI-RS for CSI measurement (CQI/PMI/RI), TRS (tracking RS) for timing/frequency tracking, CSI-RS for beam management (L1-RSRP).
  • SRS (Sounding RS): UL; wideband channel sounding for UL scheduling, antenna selection, and TDD DL/UL reciprocity-based precoding. Transmitted in last symbols of slot; configurable bandwidth, comb factor (2 or 4), and periodicity.
NR vs LTE — key CE improvements: (1) NR DMRS is front-loaded → 2–3 OFDM symbol latency reduction vs LTE CRS; (2) NR DMRS is UE-specific (always precoded) → no pilot contamination in MU-MIMO; (3) NR separates CE from measurement (DMRS vs CSI-RS) → avoids CRS blanking overhead of LTE; (4) PTRS is mmWave-aware — LTE had no equivalent.

Study Questions

  1. A 5G NR PDSCH allocation has 52 RBs (624 subcarriers), SCS = 30 kHz, and the channel has RMS delay spread \sigma_\tau = 100\,\mathrm{ns} and UE velocity v = 120\,\mathrm{km/h} at f_c = 3.5\,\mathrm{GHz}. Calculate the maximum allowable DMRS subcarrier spacing \Delta f_{\mathrm{pilot}} (in subcarriers) and the maximum pilot symbol spacing \Delta t_{\mathrm{pilot}} (in OFDM symbols) to satisfy the 2D Nyquist criterion. How many pilot RE per slot are required, and what percentage overhead does this represent for a 14-symbol slot?
  2. A ZF equalizer is applied to a 5G NR channel where subcarrier k^* has a deep fade of |H[k^*]| = -20\,\mathrm{dB} (relative to the mean channel gain). If the operating SNR is 20 dB, what is the post-equalization SNR on subcarrier k^* for (a) ZF and (b) MMSE equalizers? By how many dB does MMSE outperform ZF on this subcarrier? What does this imply for the required coding rate when frequency-selective fading is present?
  3. In the DFT-based channel estimator (§6.5), explain the bias-variance trade-off when choosing the truncation window length L. If the true CIR has 10 significant taps but you set L = 8 (underestimate) vs L = 32 (overestimate), quantify the impact on (a) estimation MSE and (b) channel reconstruction bias, assuming AWGN with SNR = 15 dB, N_p = 256 pilots. What practical rule of thumb should guide the choice of L in a 5G NR receiver?
§7

LTE (4G) OFDM — OFDMA, SC-FDMA, Resource Grid & Air Interface

Long Term Evolution (LTE), standardised in 3GPP Release 8 (2008) and refined through Releases 9–15, brought OFDM from Wi-Fi into wide-area cellular networks for the first time. The downlink uses OFDMA (Orthogonal Frequency Division Multiple Access) while the uplink uses SC-FDMA (Single-Carrier FDMA, also called DFT-s-OFDM) — a deliberate asymmetry driven by the need to conserve UE battery life. Understanding LTE's parameter choices is the essential prerequisite for 5G NR's flexible numerology (§8).

7.1 LTE OFDM Parameters

LTE fixes the subcarrier spacing at 15 kHz for all channel bandwidths. This was chosen to balance Doppler tolerance (wider spacing is better) against multipath tolerance (wider spacing shortens the useful symbol time, reducing CP efficiency). 15 kHz maps to a useful symbol duration of exactly \(T_u = 1/\Delta f = 66.7\;\mu\text{s}\).

LTE subcarrier spacing \[ \Delta f = 15\;\text{kHz} \quad \Longrightarrow \quad T_u = \frac{1}{\Delta f} = 66.7\;\mu\text{s} \]
Parameter 1.4 MHz 3 MHz 5 MHz 10 MHz 15 MHz 20 MHz
Subcarrier spacing \(\Delta f\) 15 kHz15 kHz15 kHz 15 kHz15 kHz15 kHz
FFT size \(N_{FFT}\) 128256512 102415362048
Occupied subcarriers 72180300 6009001200
Resource Blocks (RBs) 61525 5075100
Useful symbol duration \(T_u\) 66.7 µs (= 1/15 kHz) — identical for all BW
Normal CP length 4.7 µs first symbol / 5.2 µs first symbol of slot · 144 or 160 samples @ \(f_s = 30.72\;\text{MHz}\) (20 MHz case) — ratio 144/2048
Extended CP length 16.67 µs · 512 samples @ 30.72 MHz — 6 symbols/slot instead of 7
Slot duration 0.5 ms — 7 symbols (normal CP) or 6 symbols (extended CP)
Subframe duration 1 ms = 2 slots (Transmission Time Interval, TTI)
Frame duration 10 ms = 10 subframes = 20 slots
Sampling rate \(f_s\) (20 MHz) 30.72 MHz = 2048 × 15 kHz — exact integer relationship between FFT size and sampling clock
Normal CP derivation \[ T_{CP,\text{normal}} = \frac{144}{2048} \cdot T_u = \frac{144}{2048} \times 66.7\;\mu\text{s} \approx 4.69\;\mu\text{s} \quad \left(\frac{160}{2048} \times 66.7 \approx 5.21\;\mu\text{s} \;\text{for symbol 0 of each slot}\right) \]
Why is the first symbol of each slot slightly longer? The first symbol uses a 160-sample CP (5.21 µs) to ensure that an exact 0.5 ms slot boundary is hit. With 7 symbols, you would have 7 × 144 + 2048 × 7 = 1008 + 14336 = 15344 samples. Adding 16 extra samples (160 − 144 = 16) gives 15360 = 0.5 ms × 30.72 MHz. This rounding trick keeps timing perfectly aligned to the subframe grid.
Extended CP (16.67 µs = 512 samples) exists for high-delay-spread scenarios: MBSFN (multicast), some relay deployments, and coverage-limited cells. Channels with excess delay spread beyond 4.7 µs — e.g., mountainous terrain with path length differences > 1.4 km — require extended CP. The price is one fewer symbol per slot, reducing capacity by ~14%.

7.2 LTE OFDMA — Downlink Architecture

In the LTE downlink, the eNodeB (base station) generates all subcarriers simultaneously via a single N-point IFFT. Multiple UEs are multiplexed by assigning each a non-overlapping set of Resource Blocks (RBs) within each 1 ms subframe — this is Orthogonal Frequency Division Multiple Access.

Resource Block Definition

A Physical Resource Block (PRB) in LTE occupies 12 consecutive subcarriers (180 kHz) × 7 OFDM symbols (one 0.5 ms slot) = 84 Resource Elements (REs). One subframe = 2 PRBs in time = 168 REs per RB-pair. At 20 MHz, 100 PRBs are available = 1200 occupied subcarriers.
PRB bandwidth \[ \text{PRB BW} = 12 \times \Delta f = 12 \times 15\;\text{kHz} = 180\;\text{kHz} \]
Total occupied bandwidth \[ \text{Occupied BW} = N_{RB} \times 12 \times \Delta f = 100 \times 180\;\text{kHz} = 18\;\text{MHz} \quad \text{(20 MHz channel, with guard bands)} \]

Frequency-Selective Scheduling

At each TTI (1 ms), the eNodeB scheduler assigns RBs to UEs based on Channel Quality Indicator (CQI) feedback. A UE reports the CQI it measures on each subband (a group of RBs). The scheduler exploits multi-user diversity: different UEs experience fading peaks at different frequencies, so the aggregate system capacity is maximised by assigning each subband to whichever UE sees it best — the proportional-fair or maximum C/I criterion.

Spectral efficiency vs FDMA: In classical FDMA each user is permanently assigned a frequency slice with guard bands on either side to prevent inter-user interference. In OFDMA, subcarriers are mathematically orthogonal, so no guard bands are needed between users assigned to adjacent subcarriers. This eliminates the "wasted spectrum" of guard bands, directly improving spectral efficiency.

7.3 SC-FDMA (DFT-s-OFDM) — Uplink Architecture

Why Not OFDMA in the Uplink?

OFDMA's chief drawback is a high Peak-to-Average Power Ratio (PAPR) — typically 8–12 dB (see §4). A high PAPR forces the UE power amplifier to operate far below its saturation point (large backoff), drastically reducing power efficiency. For a battery-powered UE, this directly reduces talk time. The eNodeB is a mains-powered infrastructure node where PAPR is a manageable engineering problem; the UE is not.

SC-FDMA achieves low PAPR (2–4 dB) — comparable to QPSK or 16-QAM single-carrier — while preserving all the OFDM advantages: cyclic prefix ISI elimination and one-tap frequency-domain equalisation at the receiver.

DFT-s-OFDM Signal Processing Chain

SC-FDMA / DFT-s-OFDM Transmitter — Step by Step
Input bits
  → QAM mapper (QPSK / 16-QAM / 64-QAM)  →  M data symbols: d[0..M-1]
  → M-point DFT  →  D[k] = Σ_{m=0}^{M-1} d[m] · e^{-j2πmk/M},  k=0..M-1
  → Subcarrier mapping (M occupied out of N total, N > M)
        LFDMA: map D[k] to contiguous subcarriers [k_0 .. k_0+M-1]
        IFDMA: map D[k] to every (N/M)-th subcarrier (not used in LTE)
  → N-point IFFT  →  time-domain single-carrier-like waveform  s[n]
  → Insert cyclic prefix (same structure as OFDM)
  → DAC + RF → air interface
    

The key insight: the DFT "pre-spreads" the M QAM symbols across M subcarriers so that the final IFFT output looks like a single-carrier signal. The time-domain signal has the envelope statistics of single-carrier (low PAPR) rather than a sum of many independent sinusoids (high PAPR).

DFT-s-OFDM transmit signal \[ s[n] = \frac{1}{\sqrt{N}} \sum_{k=0}^{N-1} \tilde{D}[k] \cdot e^{j2\pi nk/N}, \quad n = 0,\ldots,N+N_{CP}-1 \] where \(\tilde{D}[k]\) is the subcarrier-mapped DFT output (zero on unoccupied subcarriers, \(D[k-k_0]\) on the assigned M subcarriers).
PAPR definition (reminder) \[ \text{PAPR} = \frac{\max_n |s[n]|^2}{\mathbb{E}[|s[n]|^2]} \quad \text{(instantaneous peak power / average power)} \]

M Must Be "Highly Composite"

The M-point DFT is implemented as an FFT. For computational efficiency, M must factor into small primes. LTE mandates that M (the number of assigned subcarriers, always a multiple of 12) must be expressible as \(M = 2^a \cdot 3^b \cdot 5^c\) with \(a,b,c \geq 0\). This is called a "good" or highly-composite number constraint. Valid values in LTE include 12, 24, 36, 48, 60, 72, 96, 120, 144, 180, … up to 1200. Any M that requires a prime factor > 5 is forbidden, which means RB allocations of 7, 11, 13 RBs (= 84, 132, 156 subcarriers) are not schedulable.

LFDMA vs IFDMA: LTE uses only Localised FDMA (contiguous RB allocation). Interleaved FDMA would yield slightly better frequency diversity but requires the entire bandwidth to be allocated to one UE, preventing multi-user multiplexing — impractical for a shared uplink channel.

7.4 LTE Reference Signals

Reference Signal Full Name Direction Purpose Density / Periodicity
CRS Cell-specific RS DL Channel estimation (non-precoded), cell search, RSRP measurement, handover, legacy UE support Every subframe; RE positions shift by cell ID mod 6. 4 REs per RB per slot for 2-port CRS (ports 0–1)
DMRS Demodulation RS (UE-specific) DL (Rel-10+) & UL Channel estimation for precoded transmission; transparent to beamforming (precoded with data) DL DMRS in symbols 4 and/or 7 of RB. UL DMRS in symbol 4 of each slot (Zadoff-Chu sequence)
CSI-RS Channel State Information RS DL CQI/PMI/RI feedback; beamforming; interference measurement (ZP-CSI-RS nulls interferers) 1, 2, 4, or 8 ports; configurable periodicity 5/10/20/40/80 ms; 1 RE per RB per port
SRS Sounding Reference Signal UL Frequency-selective channel sounding; allows eNodeB to estimate UL channel and perform frequency-selective scheduling + link adaptation Last symbol of subframe; configurable periodicity 2–320 ms; configurable bandwidth (wideband or partial-band)
PSS / SSS Primary / Secondary Synchronisation Signal DL Cell search and synchronisation; PSS carries physical layer ID group (0–2); SSS carries cell ID within group (0–167) PSS in subframe 0 and 5 (FDD); occupies central 62 subcarriers; Zadoff-Chu (PSS) and length-31 m-sequence (SSS)
CRS overhead cost: With 2-antenna CRS, roughly 4 REs per RB-pair are consumed as pilot overhead. Over a 14-symbol subframe and 12 subcarriers per RB, that is 4/168 ≈ 2.4% per antenna port. With 4-port CRS (4×2 MIMO), overhead reaches ~9.5%. This "always-on" pilot became one of the motivations for 5G NR's reference-signal-lean design using DMRS-only with on-demand CSI-RS.

7.5 LTE MIMO and OFDM Interaction

OFDM converts a wideband frequency-selective MIMO channel into a bank of narrowband flat-fading MIMO subchannels — one per subcarrier. This is the most elegant property of OFDM for MIMO: the spatial processing (precoding, detection) can be performed independently per subcarrier.

Per-subcarrier MIMO channel \[ \mathbf{y}[k] = \mathbf{H}[k]\,\mathbf{x}[k] + \mathbf{n}[k] \quad k = 0, 1, \ldots, N_{FFT}-1 \] where \(\mathbf{H}[k] \in \mathbb{C}^{N_r \times N_t}\) is the \(N_r \times N_t\) flat-fading MIMO channel matrix at subcarrier \(k\), \(\mathbf{x}[k]\) is the transmitted vector, and \(\mathbf{n}[k] \sim \mathcal{CN}(\mathbf{0},\sigma^2\mathbf{I})\).
MIMO Mode LTE Transmission Mode (TM) Max Layers Precoding Feedback Required
Single AntennaTM11 NoneCQI
Transmit Diversity (SFBC)TM21 SFBC (Space-Frequency Block Coding)CQI
Open-Loop Spatial MuxTM34 Cyclic Delay Diversity + codebookCQI, RI
Closed-Loop Spatial MuxTM44 DL / 2 UL Codebook PMI (3GPP 36.213 tables)CQI, PMI, RI
MU-MIMOTM51 (per UE) Single-layer PMI, orthogonal UE pairingCQI, PMI
Beamforming (TDD)TM7–TM98 (TM9) UE-specific DMRS — eNodeB estimates UL channel for DL precoder (TDD reciprocity) CQI, PMI, RI (subband)

Space-Frequency Block Coding (SFBC) in LTE TM2 encodes pairs of symbols across two antenna ports and two adjacent subcarriers, achieving 2nd-order diversity without CSI at the transmitter. It is the LTE equivalent of Alamouti coding but mapped to the frequency dimension (rather than time) to avoid the time variation between symbols.

SFBC (Alamouti in frequency) \[ \begin{pmatrix} x_0[k] \\ x_0[k+1] \end{pmatrix}_{\text{port 0}} = \begin{pmatrix} s_0 \\ s_1 \end{pmatrix}, \quad \begin{pmatrix} x_1[k] \\ x_1[k+1] \end{pmatrix}_{\text{port 1}} = \begin{pmatrix} -s_1^* \\ s_0^* \end{pmatrix} \] providing 2nd-order diversity over 2 subcarriers × 2 antennas with no rate loss.

7.6 LTE Capacity — Shannon Analysis

Shannon capacity \[ C = B \cdot \log_2\!\left(1 + \text{SINR}\right) \]

For LTE 20 MHz with 2×2 MIMO (2 independent spatial streams):

LTE peak theoretical (20 MHz 2×2 MIMO) \[ C_{\text{peak}} = 2 \times 20\;\text{MHz} \times \log_2\!\left(1 + 10^{20/10}\right) = 40\;\text{MHz} \times \log_2(101) \approx 40 \times 6.66 \approx 266\;\text{Mbps} \]
Shannon capacity assumes ideal AWGN and infinite modulation granularity. At 20 dB SINR the maximum LTE modulation is 64-QAM with rate 9/10 turbo coding (spectral efficiency ≈ 5.55 bps/Hz per layer), so the real constraint is the modulation/coding scheme (MCS), not Shannon.

Practical LTE Peak vs Theoretical

Overhead Source Approximate Loss Explanation
Cyclic Prefix (Normal) ~7% capacity loss CP = 144/2048 ≈ 6.7% of symbol duration is guard; carries no data
CRS Pilot Overhead (2 ports) ~4.8% 4 REs per RB-pair occupied by CRS; 4/168 × 2 ports in 2×2 system
PDCCH / Control Region ~14–21% First 1–3 OFDM symbols per subframe reserved for DL control (PDCCH, PCFICH, PHICH)
Guard Subcarriers ~10% 1200 occupied out of 1536 (for 15 MHz; 1200 out of ~1365 for 20 MHz) — spectral shaping
MCS Granularity & BLER Target ~10–15% Finite MCS set forces operating below Shannon bound; 10% BLER target adds ~1 dB margin
Practical Peak DL (20 MHz 2×2) ~75 Mbps 3GPP baseline; operator measurements typically 60–80 Mbps in good conditions
LTE peak data rate formula (3GPP 36.213) \[ R_{\text{peak}} = N_{\text{layers}} \times \frac{N_{RB} \times 12 \times N_{\text{sym}} \times \eta_{MCS}} {T_{\text{subframe}}} \times \eta_{\text{OH}} \] where \(\eta_{MCS}\) is the MCS spectral efficiency (bps/Hz) and \(\eta_{\text{OH}}\) accounts for reference signals and control channels. For 2 layers, 100 RBs, 12 subcarriers, ~8.4 data symbols per slot (14 − 3 control − 2 CRS ≈ 9), and 64-QAM rate-0.926: \[ R \approx 2 \times \frac{100 \times 12 \times 9 \times 6 \times 0.926} {10^{-3}} \approx 150\;\text{Mbps (theoretical MCS peak)} \] Adding all overheads above gives the practical ~75 Mbps figure.

7.7 LTE vs 5G NR — Comparison Preview

Full 5G NR treatment follows in §8. This teaser table highlights the key differences.

Parameter LTE (4G) 5G NR (FR1 + FR2)
Subcarrier Spacing Fixed 15 kHz Flexible: 15 / 30 / 60 / 120 / 240 kHz (numerology µ = 0–4)
DL Waveform CP-OFDM (OFDMA) CP-OFDM (OFDMA only)
UL Waveform DFT-s-OFDM (SC-FDMA) only CP-OFDM or DFT-s-OFDM (UE-configurable)
Max BW (single CC) 20 MHz 100 MHz (FR1 sub-6 GHz), 400 MHz (FR2 mmWave)
Max MIMO Ports (DL) 4 ports (TM9: 8 CSI-RS ports) 32 ports (FR1 massive MIMO), 256 in O-RAN AAU deployments
Reference Signals Always-on CRS (overhead ~5–10%) No always-on CRS; DMRS + on-demand CSI-RS (lean design)
TTI Fixed 1 ms subframe Mini-slot (2–7 symbols) to full 14-symbol slot; scalable latency
Peak DL Rate ~75 Mbps (20 MHz 2×2) >1 Gbps (100 MHz 4×4, 256-QAM)
Carrier Aggregation Up to 5 CC (100 MHz) Up to 16 CC (can span FR1+FR2)
The philosophical shift from LTE to NR: LTE was designed for a single use case (broadband data) with fixed parameters. NR's flexible numerology means the same standard can serve eMBB (enhanced Mobile Broadband, µ=0–1), URLLC (Ultra-Reliable Low-Latency, µ=2–3), and mmWave backhaul (µ=3–4) — all from one air interface specification.

7.8 Interactive Charts

Chart 7-A — LTE Resource Grid (One Subframe: 2 Slots, 14 Symbols)
One 1 ms subframe (14 OFDM symbols, normal CP) for a 3-RB slice (36 subcarriers). Colour coding: blue = PDCCH control region (symbols 0–2), green = PDSCH data REs, red = CRS pilot REs (ports 0+1, offset by cell ID). Hover over any RE for details. Slot boundary is between symbols 6 and 7.
Chart 7-B — SC-FDMA vs OFDMA PAPR CCDF (20 MHz, 300 Active Subcarriers)
Complementary Cumulative Distribution Function (CCDF) of PAPR for OFDMA (CP-OFDM) and SC-FDMA (DFT-s-OFDM), both with 300 active subcarriers out of 2048 total. Waveforms carry 16-QAM symbols; CCDF estimated over 10 000 random symbols blocks. SC-FDMA shows ~4–5 dB lower PAPR at Pr(PAPR > x) = 0.1%, directly translating to UE power amplifier backoff savings and battery life improvement.
  1. CP Efficiency Trade-off: LTE normal CP is 4.7 µs (6.7% overhead) while extended CP is 16.7 µs (20% overhead). Given that LTE uses 15 kHz subcarrier spacing with \(T_u = 66.7\;\mu\text{s}\), calculate the maximum excess multipath delay (in meters of path-length difference) that each CP can protect against. Under what real-world propagation scenario — urban canyon, suburban LOS, rural mountainous, or maritime — would you expect to need extended CP, and why?
  2. SC-FDMA PAPR vs Capacity: A UE is allocated M = 60 subcarriers (5 RBs) for uplink transmission using SC-FDMA (LFDMA). The total FFT size is N = 2048 (20 MHz). (a) Show that 60 is a "good" number (express as \(2^a \cdot 3^b \cdot 5^c\)). (b) If the UE's power amplifier has a 1 dB compression point of 23 dBm and the required power backoff for OFDMA is 8 dB but only 3 dB for SC-FDMA, what is the maximum output power the UE can deliver for each waveform, and what is the coverage-area ratio (assume path loss \(\propto d^{3.5}\))?
  3. MIMO Overhead and Practical Capacity: A 4×4 LTE system (TM9) uses 8-port CSI-RS. At 20 MHz with 4 spatial layers, the theoretical peak (Shannon, 4 layers × 20 MHz × log₂(1 + 20 dB SINR)) is approximately 532 Mbps. (a) Accounting for CP overhead (6.7%), PDCCH control region (2 symbols out of 14), CRS overhead for 4 antenna ports (~9.5%), and CSI-RS overhead for 8 ports (1 RE per port per RB, once every 5 ms), estimate the net spectral efficiency per layer. (b) Compare this to the LTE Category 6 UE peak rate of 300 Mbps specified in 3GPP TS 36.306. What additional practical factors account for the remaining gap?
§8
5G NR Flexible Numerology
mu=0..4 · FR1/FR2 · BWP · DFT-s-OFDM

8.1 Numerology Framework: Δf = 2μ × 15 kHz

3GPP Release 15 introduced flexible numerology as the foundation of 5G NR air interface design. Unlike LTE's single fixed 15 kHz subcarrier spacing, NR defines five numerologies indexed by μ ∈ {0,1,2,3,4}, each doubling the subcarrier spacing and halving the OFDM symbol duration relative to the previous one. This single design parameter simultaneously controls SCS, slot duration, cyclic prefix length, and FFT size — making the entire air interface scalable from coverage- optimised macro cells at 700 MHz up to low-latency mmWave links at 39 GHz.

Subcarrier spacing (SCS)
$$\Delta f_\mu = 2^\mu \times 15\ \text{kHz}$$
OFDM useful symbol duration
$$T_u^\mu = \frac{1}{\Delta f_\mu} = \frac{1}{2^\mu \times 15 \times 10^3}\ \text{s}$$
Normal CP length (fraction of symbol)
$$T_{CP}^{\text{normal}} \approx \frac{T_u^\mu}{14} \approx 7.14\%\ T_u^\mu \quad (\text{144 samples at 30.72 MHz for } \mu=0)$$
Extended CP (μ=2 only — for special subframes)
$$T_{CP}^{\text{ext}} = \frac{T_u^{\mu=2}}{4} = 4.17\ \mu\text{s} \quad (\approx 25\%\ T_u)$$
Key insight: Every numerology keeps the slot = 14 symbols × (T_u + T_CP) structure constant in sample count but the absolute slot duration halves with each μ step: slot duration = 1 ms / 2μ. A 10 ms radio frame always contains 10 subframes of 1 ms each, with 2μ slots per subframe.
Table 8.1 — 5G NR Numerology Reference (3GPP TS 38.211, Table 4.2-1)
μ SCS Δf T_u (useful) N-CP (normal) [samples @30.72 MHz·2μ] T_CP normal T_CP extended Slot dur. Slots/subframe Slots/frame Typical FFT size @common Fs Use case
0 15 kHz 66.67 μs 144 / 160 (1st) 4.69 μs 1.0 ms 1 10 2048 @ 30.72 MHz LTE-compatible; FR1 macro; coverage
1 30 kHz 33.33 μs 144 / 160 (1st) 2.34 μs 0.5 ms 2 20 2048 @ 61.44 MHz Most common FR1 sub-6 GHz; data
2 60 kHz 16.67 μs 144 / 160 (1st) 1.17 μs 4.17 μs (ext.) 0.25 ms 4 40 2048 @ 122.88 MHz FR1 & FR2; low-latency; unlicensed
3 120 kHz 8.33 μs 144 / 160 (1st) 0.586 μs 0.125 ms 8 80 2048 @ 245.76 MHz FR2 mmWave; data; <100 m range
4 240 kHz 4.17 μs 144 / 160 (1st) 0.293 μs 0.0625 ms 16 160 2048 @ 491.52 MHz FR2 reference signals (SSB) only

Normal CP sample counts: For μ=0 at Fs=30.72 MHz: CP = 144 samples (≈4.69 μs) for symbols 1-6 and 8-13; CP = 160 samples (≈5.21 μs) for symbols 0 and 7 (first symbol of each half-slot gets a slightly longer CP to fill the 1 ms subframe boundary exactly). Extended CP (μ=2) uses CP = 512 samples at 245.76 MHz → 4.17 μs, for special downlink subframes in unpaired spectrum only.

8.2 Why Flexible Numerology? — The Delay-Doppler Trade-off

The two fundamental channel impairments that numerology must balance are delay spread (multipath) and Doppler spread (mobility). They pull the design in opposite directions.

Delay Spread → Lower SCS preferred

Multipath echoes arrive over a delay spread τmax. The CP must exceed τmax to prevent ISI. Since T_CP is proportional to T_u (fixed ratio ~7%), using lower SCS → longer T_u → longer T_CP in absolute time → more multipath protection.

$$T_{CP} > \tau_{\max} \quad \Rightarrow \quad \frac{T_u}{14} > \tau_{\max} \quad \Rightarrow \quad \Delta f < \frac{1}{14\,\tau_{\max}}$$

Urban macro: τmax ≈ 1–5 μs → SCS ≤ 30 kHz preferred.
Indoor: τmax ≈ 100 ns → SCS up to 240 kHz viable.

Doppler Spread → Higher SCS preferred

Mobile UEs create Doppler shift fD = v·f_c/c. The channel must remain quasi-static over one OFDM symbol for flat-fading ICI-free reception. Higher SCS → shorter symbol → less time for channel to decorrelate.

$$f_D \ll \Delta f \quad \Rightarrow \quad \frac{v \cdot f_c}{c} \ll \Delta f$$

120 km/h @28 GHz: fD = 3.1 kHz → SCS ≥ 30 kHz needed (fD/Δf < 0.1).
500 km/h (HSR) @3.5 GHz: fD = 1.62 kHz → SCS ≥ 15 kHz.

Design Criterion — simultaneous flat-fading and quasi-static conditions
$$\underbrace{\Delta f \cdot \tau_{\max} \ll 1}_{\text{flat fading per subcarrier}} \quad \text{AND} \quad \underbrace{\frac{f_D}{\Delta f} \ll 1}_{\text{channel quasi-static over one symbol}}$$
Combining both constraints
$$f_D \cdot \tau_{\max} \ll 1 \quad \Leftrightarrow \quad \frac{v \cdot f_c}{c} \cdot \tau_{\max} \ll 1$$

The product fD · τmax is a channel-specific constant; when it approaches 1, no single numerology satisfies both constraints simultaneously — requiring techniques like OTFS or channel estimation with ICI mitigation. In practice, 5G NR allows different BWPs on the same carrier to use different numerologies, and mixed-numerology operation within a carrier is possible (with guard bands to limit ICI between BWPs).

CP overhead cost: Normal CP is ~7.14% of T_u at all numerologies (constant ratio). Extended CP at μ=2 is 25% — a 3.5× overhead hit taken only when long delay spread and high SCS are both needed (e.g., outdoor-to-indoor at 5 GHz unlicensed). Total CP overhead in a 14-symbol slot: 2 long CPs + 12 normal CPs = 10.4% of slot time.
Table 8.2 — Numerology Selection Guide by Scenario
Scenariof_cτ_max (typ.)f_D (typ.)Recommended μRationale
Rural macro, coverage layer700 MHz5–10 μs<100 Hz0 (15 kHz)Long CP; low Doppler; LTE coexistence
Urban macro, sub-3 GHz2.1 GHz1–3 μs200–500 Hz1 (30 kHz)CP ≥ 2.34 μs clears multipath; most common
Urban sub-7 GHz / dense3.5 GHz0.5–1 μs300–800 Hz1–2 (30/60 kHz)Low latency; short CP OK for small cells
mmWave indoor/outdoor28 GHz50–200 ns1–3 kHz3 (120 kHz)Very short τ_max; high f_D; 0.586 μs CP sufficient
mmWave dense indoor39 GHz10–50 ns500 Hz3–4 (120/240 kHz)Tiny delay spread; μ=4 for beam mgmt RS only
High-speed rail3.5 GHz0.5–2 μs1–2 kHz1–2 (30/60 kHz)Doppler dominates; shorter symbol needed

8.2-A Interactive Chart: Numerology Metrics Comparison (μ=0..4)

8.3 5G NR Resource Grid and Scheduling

The 5G NR physical resource grid preserves the LTE concept of Resource Blocks (12 subcarriers each) but extends it with flexible time-domain scheduling.

Resource Element (RE)

One subcarrier × one OFDM symbol. Carries one complex modulation symbol (QPSK/16QAM/64QAM/256QAM/1024QAM).

$$\text{RE capacity} = \log_2(M)\ \text{bits} \quad (M = \text{modulation order})$$
Resource Block (RB)

12 subcarriers × 1 slot (14 symbols). Same as LTE. Minimum scheduling granularity is 1 RB in frequency.

$$N_{RE}^{RB} = 12 \times 14 = 168\ \text{RE/slot}$$
Slot (Normal CP)

14 OFDM symbols. Duration = 1 ms / 2μ. Two half-slots of 7 symbols. Normal scheduling granularity.

$$T_{\text{slot}} = \frac{1\ \text{ms}}{2^\mu}$$
Mini-Slot

2, 4, or 7 symbols for ultra-low latency (URLLC). Can start at any symbol boundary. Pre-emption indicator signals release of resources.

$$T_{\text{mini-slot}} \in \{2,4,7\}\ \text{symbols} \approx \{0.14, 0.29, 0.5\}\frac{\text{ms}}{2^\mu}$$
Maximum data rate estimate (single layer, one carrier)
$$R_{\text{peak}} = N_{RB} \times 12 \times \frac{14 \times (1-\text{OH})}{T_{\text{slot}}} \times \log_2(M) \times R_c \times \nu_{\text{layers}}\ \text{bps}$$
OH = overhead fraction (DMRS, PDCCH, CSI-RS ~14%); R_c = code rate; ν_layers = MIMO layers
Example: μ=1, 100 MHz BW, 273 RBs, 256QAM, R_c=0.93, 4 layers
$$R \approx 273 \times 12 \times \frac{14 \times 0.86}{0.5\ \text{ms}} \times 8 \times 0.93 \times 4 \approx 3.9\ \text{Gbps}$$
Table 8.3 — Maximum RBs by Bandwidth and Numerology (FR1)
BWμ=0 (15 kHz)μ=1 (30 kHz)μ=2 (60 kHz)
5 MHz25 RB11 RB
10 MHz52 RB24 RB11 RB
20 MHz106 RB51 RB24 RB
50 MHz270 RB133 RB66 RB
100 MHz273 RB135 RB
Synchronisation Signal Block (SSB): The 5G NR SSB consists of 4 consecutive OFDM symbols carrying PSS (1 sym), SSS (1 sym), PBCH (2 sym) over 240 subcarriers (20 RBs). SSB numerology follows the carrier numerology, and up to 8 SSB beams are swept per half-frame in FR1, up to 64 in FR2. The SSB periodicity is 20 ms by default (configurable 5–160 ms).

8.4 CP-OFDM in 5G NR — UL MIMO Enabled

A key departure from LTE is that 5G NR mandates CP-OFDM for both downlink and uplink, whereas LTE restricted the UL to SC-FDMA (DFT-spread OFDM). This change has profound implications for uplink MIMO.

LTE UL: SC-FDMA only
  • Single-carrier DFT precoding forces frequency-contiguous resource allocation
  • PAPR ≈ 7–8 dB (vs 10–12 dB for CP-OFDM)
  • No UL spatial multiplexing (MIMO layers) — single antenna transmission per UE
  • Power amplifier efficiency advantage for coverage-limited UEs
5G NR UL: CP-OFDM primary
  • Non-contiguous resource allocation possible (frequency hopping, multi-cluster)
  • Up to 4 UL MIMO layers per UE (Category 3 UE)
  • Consistent waveform with DL → simplified channel estimation and precoding
  • Massive MIMO: gNB can transmit up to 32 DL layers (multi-user MIMO)
  • DFT-s-OFDM still available optionally for coverage (see §8.5)
CP-OFDM UL signal model (μ-th subcarrier, l-th symbol)
$$s[n] = \sum_{k=0}^{N-1} X[k]\, e^{j2\pi kn/N}, \quad n = -N_{CP},\ldots,N-1$$
Received signal after channel h[l] (l = 0..L-1, L ≤ N_CP)
$$y[n] = \sum_{l=0}^{L-1} h[l]\, s[n-l] + w[n] \xrightarrow{\text{CP removal + DFT}} Y[k] = H[k]\,X[k] + W[k]$$
CP ensures circular convolution → one-tap equalisation per subcarrier in ideal case
UL MIMO layers in NR: UE Category defines the max UL MIMO layers: Cat 1 = 1 layer, Cat 2 = 2 layers, Cat 3 = 4 layers. The gNB uses PUSCH DMRS ports to separate layers. CP-OFDM is mandatory for 2+ layer UL transmission; DFT-s-OFDM is restricted to rank-1 (single layer) only.

8.5 DFT-s-OFDM — Coverage Maximisation via PAPR Reduction

DFT-spread OFDM is an optional UL waveform in 5G NR, retained from LTE's SC-FDMA for scenarios where UE transmit power is the bottleneck. A UE reports its DFT-s-OFDM capability; the gNB configures it via higher-layer signalling when the UE is coverage-limited (e.g., cell edge, deep indoor).

DFT-s-OFDM transmit chain
$$x[n] \xrightarrow{M\text{-pt DFT}} X[k] \xrightarrow{\text{subcarrier mapping}} \tilde{X}[k] \xrightarrow{N\text{-pt IDFT}} s[n] \xrightarrow{+\text{CP}} \text{transmit}$$
M = number of allocated subcarriers (must be product of 2, 3, 5 only — 3GPP constraint for DFT efficiency); N = FFT size
Effective waveform: single-carrier in time domain
$$s[n] = \frac{1}{\sqrt{M}} \sum_{m=0}^{M-1} x[m]\, e^{j2\pi mn/M} \quad \Rightarrow \quad \text{PAPR} \approx \frac{M \cdot P_{\text{peak}}}{P_{\text{avg}}} \approx 8\text{ dB (CP-OFDM vs ≈3–4 dB DFT-s)}$$
Table 8.4 — PAPR Comparison: Waveform × Modulation
WaveformModulationPAPR (99.9% CCDF)Notes
CP-OFDMQPSK~10.5 dBHigh peak due to multicarrier summation
CP-OFDM16QAM~11 dBSlightly higher due to non-constant envelope
CP-OFDM256QAM~12 dBEven higher amplitude variation
DFT-s-OFDMQPSK~6 dBSignificant PAPR reduction
DFT-s-OFDMπ/2-BPSK~3 dBNear-optimal; best coverage waveform
DFT-s-OFDMπ/2-BPSK + spectral shaping~1–2 dB3GPP Rel-16 enhancement; used in RedCap
Coverage gain from PAPR reduction (link budget improvement)
$$\Delta_{\text{coverage}} = \underbrace{(PAPR_{\text{CP-OFDM}} - PAPR_{\text{DFT-s}})}_{\approx 7\ \text{dB for QPSK}} \Rightarrow P_{TX,\text{eff}} \uparrow 7\ \text{dB} \Rightarrow d_{\text{max}} \times 2.24\ (\text{at PL} \propto d^3)$$
π/2-BPSK DFT-s-OFDM: PAPR ~3 dB vs CP-OFDM QPSK ~10.5 dB → ~7.5 dB link budget gain
π/2-BPSK mechanism: Symbols alternate phase by π/2 between consecutive symbols: x[n] ∈ {±1, ±j} with x[n] = d[n] · ejnπ/2. This ensures the signal constellation rotates on the unit circle between adjacent symbols, minimising zero-crossings and dramatically reducing PAPR when combined with DFT spreading. 3GPP Rel-15 includes this for NR PUSCH and PRACH formats 0–3.
Table 8.5 — DFT-s-OFDM vs CP-OFDM Feature Matrix
FeatureDFT-s-OFDMCP-OFDM
PAPR3–7 dB (waveform dep.)10–12 dB
CoverageBetter (cell edge)Standard
UL MIMO layers1 only (rank-1)Up to 4
Resource allocationContiguous RBs onlyAny allocation
Frequency hoppingLimitedFlexible
Spectral efficiencyLower at high SNRHigher at high SNR
UE capability req.Optional (reported)Mandatory
UsageCoverage-limited ULDefault UL and all DL

8.6 Frequency Ranges: FR1 (Sub-7 GHz) vs FR2 (mmWave)

Table 8.6 — FR1 vs FR2 Specification Comparison (3GPP TS 38.101-1/2)
ParameterFR1FR2
Frequency range450 MHz – 7.125 GHz24.25 GHz – 52.6 GHz
Common designationSub-6 GHz / Sub-7 GHzmmWave
Maximum channel BW100 MHz400 MHz
Maximum aggregated BW1 GHz (CA)2 GHz (CA)
Supported SCS15, 30, 60 kHz60, 120, 240 kHz
SCS for SSB15, 30 kHz120, 240 kHz
Max SSB beams/half-frame8 (L_max=8, >3 GHz)64 (L_max=64)
Path loss exponent (NLOS)3.5–4.5 (PL ∝ f²·d³·⁵)4–6 (heavy shadowing)
Free space path loss @1 km~105 dB @3.5 GHz~131 dB @28 GHz
Typical cell radius100 m – 5 km10 – 200 m
UE antenna config2–4 antennas (pattern diversity)Up to 8 panels (phased arrays)
gNB antenna configUp to 256 TRX (Massive MIMO)Up to 1024 elements (beamforming)
Duplex modeFDD or TDDTDD only
Phase noise (PN)Manageable, minimal PTRSSevere (PN ∝ f²); PTRS mandatory
Oxygen absorptionNegligibleHigh @60 GHz (15 dB/km)
Rain attenuationNegligible below 10 GHz10–20 dB/km @28 GHz heavy rain
Penetration loss (glass)1–5 dB30–50 dB (mmWave reflects off glass)
Key challengeInterference management, spectral efficiencyBeam management, phase noise, blockage
Free space path loss (Friis)
$$PL_{\text{FS}} = 20\log_{10}(d) + 20\log_{10}(f_c) + 20\log_{10}\!\left(\frac{4\pi}{c}\right) \approx 20\log_{10}(d) + 20\log_{10}(f_c) - 147.55\ \text{dB}$$
mmWave additional path loss vs 3.5 GHz at same distance d
$$\Delta PL = 20\log_{10}\!\left(\frac{f_{FR2}}{f_{FR1}}\right) = 20\log_{10}\!\left(\frac{28\ \text{GHz}}{3.5\ \text{GHz}}\right) = 18\ \text{dB}$$
This 18 dB is compensated by: larger bandwidth gain (×114 capacity), beamforming gain (up to 30 dBi for phased array), and shorter link distances
mmWave Beam Management (3GPP TS 38.214): FR2 requires continuous beam tracking due to narrow beams and blockage. The procedure involves: (1) Initial beam acquisition via SSB sweep (64 beams in Rel-15); (2) Beam refinement via CSI-RS (up to 64 CSI-RS ports per burst); (3) Beam failure recovery (BFR): UE detects beam failure, sends PRACH on candidate beam, gNB responds on new beam within T_BFR timeout. BFR adds latency (10–50 ms) and is a key reliability concern for mmWave.

8.7 Phase Tracking Reference Signal (PTRS) — Combating LO Phase Noise

At mmWave frequencies, local oscillator (LO) phase noise becomes a dominant impairment. Unlike frequency-selective fading (corrected per-subcarrier by channel equalisation), phase noise manifests as Common Phase Error (CPE) — a phase rotation that is identical across all subcarriers within one OFDM symbol — plus Inter-Carrier Interference (ICI) from the tails of the phase noise spectrum.

LO Phase Noise Power Spectral Density (simplified Leeson model)
$$S_{\phi}(f) \approx \frac{f_0^2}{f^2} \cdot S_{\phi,0} \quad [dBc/Hz]$$
Phase noise PSD scales as f₀² — doubling carrier frequency quadruples phase noise power at fixed offset frequency
Common Phase Error (CPE) per OFDM symbol
$$\text{CPE} = \frac{1}{N} \sum_{k=0}^{N-1} \text{ICI}_k \approx e^{j\phi_0}$$
CPE-induced SNR degradation (first-order approximation)
$$\text{SNR}_{\text{eff}} = \frac{\text{SNR}}{1 + \text{SNR} \cdot \sigma_\phi^2} \approx \frac{1}{\sigma_\phi^2} \quad \text{(CPE-limited regime)}$$
σ_φ² = phase noise variance integrated over OFDM symbol bandwidth
Table 8.7 — PTRS Configuration Rules (3GPP TS 38.214, Tables 6.2.3.1-1/2)
ParameterDL (PDSCH)UL (PUSCH)
Time density (symbols)Every 1 or 2 symbols (depends on SCS and MCS)Same as DL
Frequency density1 port per 2, 4, or 16 RBs (ptrsFrequencyDensity)Same; adapts to scheduled BW
Modulation threshold (time density)MCS ≥ 29 → every symbol; MCS 10–28 → every 2 symbolsSame thresholds
BW threshold (freq density)BW > N_RB0: 2 RB; > N_RB1: 4 RB; else: 16 RBSame
PTRS port associationAssociated with DMRS antenna port (not standalone)Same
Required at low MCS?No (disabled for QPSK/low 16QAM)No
Phase noise benefitCPE estimation + correction per symbolCPE correction for UL transmission
PTRS CPE estimation procedure: (1) Receiver interpolates channel H[k] at PTRS subcarriers using adjacent DMRS. (2) Computes phase error: θ_l = angle(Y_PTRS[k] / (H[k] × X_PTRS[k])) averaged over PTRS subcarriers in symbol l. (3) Applies phase de-rotation: Ŷ[k,l] = Y[k,l] × e−jθ_l before data equalisation. (4) Residual ICI is treated as noise. PTRS density trades spectral efficiency vs phase noise suppression.

8.8 Bandwidth Parts (BWP) — Adaptive UE Bandwidth Operation

A Bandwidth Part (BWP) is a contiguous set of PRBs on a carrier, characterised by a numerology (SCS + CP type) and a physical resource block offset. Each UE has up to 4 configured BWPs per carrier direction (DL/UL), with exactly one active BWP at any time. BWPs enable UEs to operate on a bandwidth narrower than the full carrier, reducing RF processing and power consumption during low-activity periods.

Initial BWP

Used during random access (RACH) and initial system information reception. Same SCS as SSB or CORESET#0. Width = CORESET#0 bandwidth. UE does not need to receive full carrier during RA.

Default BWP

Fallback BWP when inactivity timer (bwp-InactivityTimer) expires. Can be narrower than data BWP to save power. Configured in RRCReconfiguration. If not configured, BWP#0 is default.

Active BWP

Current operational BWP. Switched via DCI field (2-bit BWP indicator in DCI 1_1/0_1) or RRC reconfiguration. UE only needs to receive signals within active BWP bandwidth.

Dormant BWP (Rel-16+)

Secondary cell (SCell) can be configured with a dormant BWP — UE measures channel on narrow BWP, reducing power. Activation triggered by MAC-CE or DCI. Faster than full SCell activation.

BWP definition in 3GPP (TS 38.211, Section 4.4.5)
$$\text{BWP}_i: \quad \{N_{start,i}^{BWP},\ N_{size,i}^{BWP},\ \mu_i,\ \text{CP type}_i\}$$
N_start: starting CRB (Common Resource Block) offset; N_size: number of PRBs in BWP; both in units of the BWP numerology
BWP bandwidth constraint
$$N_{size,i}^{BWP} \leq \min(N_{BW}^{\mu},\ 275)\ \text{PRBs} \quad \text{(max 275 PRBs per carrier, any numerology)}$$
Table 8.8 — BWP Switching Triggers and Behaviour
TriggerMechanismSwitching LatencyUse Case
DCI-based (downlink)DCI 1_1 BWP indicator field (2 bits → BWP ID 0-3)3–4 slotsDynamic bandwidth adaptation, service change
DCI-based (uplink)DCI 0_1 BWP indicator field3–4 slotsUL rate adaptation
Inactivity timerbwp-InactivityTimer (1–2560 ms)Timer expiryPower saving: switch to narrow default BWP after silence
RRC reconfigurationRRCReconfiguration message~10 ms (RRC delay)Service reconfiguration, handover
MAC-CE activationMAC control element (SCell)2–3 slotsDormant BWP activation on SCells
Random accessAutomatic switch to initial BWPImmediateRACH procedure, beam failure recovery
Power saving benefit of BWP: UE RF chain power consumption scales roughly with processed bandwidth. A UE on a 100 MHz carrier but with only 10 MHz active BWP can switch off ~90% of the ADC/DAC bandwidth, saving ~30–50% modem power. Combined with DRX (Discontinuous Reception), BWP is a key Rel-15/16 power saving mechanism for enhanced Mobile Broadband (eMBB) and IoT devices.

8.9 5G NR vs LTE — Comprehensive Air Interface Comparison

Table 8.9 — 5G NR vs LTE Air Interface Feature Comparison
FeatureLTE (Rel-8)LTE-A (Rel-12)5G NR (Rel-15)5G NR (Rel-17+)
SCS options15 kHz only15 kHz only15/30/60/120/240 kHzSame + 480/960 kHz (NTN)
Max channel BW20 MHz100 MHz (CA)100 MHz (FR1) / 400 MHz (FR2)Same + 1.6 GHz (mmWave)
DL waveformCP-OFDMCP-OFDMCP-OFDMCP-OFDM
UL waveformSC-FDMA (DFT-s-OFDM) onlySC-FDMA onlyCP-OFDM (primary) + DFT-s-OFDMSame
DL MIMO layers4 (Rel-8)8 (Rel-10)8 per UE (up to 32 MU-MIMO)Same (Type-II CSI)
UL MIMO layers1 (no UL MIMO)4 (Rel-10)4 (CP-OFDM) / 1 (DFT-s)Same
Slot duration1 ms (fixed)1 ms (fixed)0.5 ms / 2μSame
Mini-slotNo (sub-frame only)No2/4/7 symbolsSame
Latency (user-plane)~4–8 ms~3–6 ms<1 ms (URLLC)<0.5 ms (XR/IIoT)
CP overhead/slot~6.7% (144/2048)Same~10.4% (14 symbols incl. long CP)Same
Reference signalsCRS (always-on, 4-port)CSI-RS addedDMRS (on-demand), CSI-RS, SSB, PTRS, TRSSame + PRS (positioning)
Always-on RSCRS (all RBs all time)CRS + CSI-RSSSB only (periodic, configurable)Same
Carrier aggregationNo (Rel-8)Up to 5 CCUp to 16 CCUp to 16 CC
Frequency range700 MHz – 3.5 GHzUp to 5.9 GHz450 MHz – 52.6 GHzSame + 71 GHz (Rel-17)
Duplex flexibilityFDD & TDD (separate bands)SameFDD, TDD, SDL, SUL, dynamic TDDSame + FDR (full duplex research)
Numerology flexibilityNoneNoneFull (per BWP, per carrier)Same
Phase noise compensationNot neededNot neededPTRS (FR2)Enhanced PTRS
Beam managementNot supportedLimited (CoMP)Full (SSB/CSI-RS beam mgmt)BFR enhancements

8.9-A Interactive Chart: FR1 vs FR2 — Coherence Bandwidth & Coherence Time

Study Questions — §8

  1. Numerology trade-off analysis: A vehicular UE operates at 120 km/h on a 28 GHz mmWave link (τ_max ≈ 100 ns, carrier freq = 28 GHz). Calculate (a) the Doppler spread f_D, (b) the coherence time T_c ≈ 0.423/f_D, and (c) which NR numerology μ best satisfies both f_D / Δf < 0.1 and T_CP > τ_max. Then explain why the same scenario at 700 MHz would use a different μ despite identical vehicle speed.
  2. DFT-s-OFDM coverage calculation: A cell-edge UE at 2.1 GHz is 8 km from the gNB. Using CP-OFDM QPSK (PAPR ≈ 10.5 dB) the UE cannot close the link budget. Switching to DFT-s-OFDM with π/2-BPSK (PAPR ≈ 3 dB) effectively increases transmit power. Assuming path loss PL ∝ d3.5 and the UE's PA is running 7.5 dB below saturation with CP-OFDM, calculate the new maximum range after switching to DFT-s-OFDM with π/2-BPSK. Explain why DFT-s-OFDM is restricted to rank-1 UL MIMO.
  3. BWP power saving & switching latency: A gNB configures a UE with two BWPs: BWP#1 (100 MHz, μ=1, active during data burst) and BWP#2 (10 MHz, μ=0, default dormant BWP). The bwp-InactivityTimer is set to 10 ms. (a) After a 5 Mbps data burst of 500 ms ends, what event triggers BWP switching and when does it occur? (b) A new DCI arrives 2 ms after the timer expires — what is the total interruption time? (c) How does mixed-numerology operation between BWP#1 and a neighbouring UE's BWP (both on the same carrier) generate ICI, and what guard band is required between them?
§9

OFDM Variants & Alternatives — FBMC, f-OFDM, UFMC, OTFS

9.1 CP-OFDM Limitations That Motivate Alternatives

CP-OFDM has dominated 4G and 5G NR because of its elegant frequency-domain equalization (one complex multiply per subcarrier), but four structural weaknesses motivate the search for alternatives.

9.1.1 Cyclic-Prefix Overhead

The CP of length \(N_{CP}\) is a copy of the last \(N_{CP}\) samples of each OFDM symbol of length \(N\). The spectral efficiency penalty is:

\[ \eta_{CP} = \frac{N}{N + N_{CP}} \] (9.1)

In 5G NR numerology \(\mu=0\) (SCS = 15 kHz) the normal CP is 144 samples over a 2048-sample FFT, giving \(\eta = 2048/(2048+144) \approx 93.4\%\). At \(\mu=4\) (SCS = 240 kHz) the extended CP reduces efficiency further. For ultra-dense small-cell deployments, even this 6–7% loss translates to meaningful capacity reduction.

9.1.2 High Out-of-Band Emissions (Sinc Sidelobe Problem)

The rectangular window applied to each OFDM symbol has a frequency-domain response of the form \(\mathrm{sinc}(fT)\). The first sidelobe is only \(-13.3\) dB below the main lobe. For a multi-carrier signal the composite power spectral density decays as:

\[ S(f) \propto \sum_{k=0}^{N-1} \left| \frac{\sin\!\bigl(\pi(f-k\Delta f)T\bigr)} {\pi(f-k\Delta f)T} \right|^2 \] (9.2)

This slow \(\sim 1/f^2\) rolloff forces regulators to mandate wide guard bands around the occupied spectrum — 0.5–1 MHz in LTE/5G NR — wasting radio resources. Adjacent-channel interference into other operators or asynchronous users (unlicensed, NB-IoT) is a direct consequence.

9.1.3 Peak-to-Average Power Ratio (PAPR)

An OFDM signal with \(N\) independently modulated subcarriers can constructively superpose. In the worst case:

\[ \mathrm{PAPR}_{\max} = 10\log_{10}(N) \;\text{[dB]} \] (9.3)

For \(N=1024\) this is 30 dB — far in excess of what any practical power amplifier can tolerate linearly. Statistical measures (complementary CDF at \(10^{-3}\)) give PAPR \(\approx 10\text{–}11\) dB for QPSK/16-QAM, still requiring significant power back-off that degrades energy efficiency.

9.1.4 Sensitivity to Doubly-Dispersive Channels

CP-OFDM restores orthogonality only when the channel delay spread \(\tau_{\max} \le T_{CP}\) and the Doppler spread \(\nu_{\max} \ll \Delta f\). In vehicular or LEO-satellite channels both conditions can be violated simultaneously. Inter-carrier interference (ICI) from Doppler shifts destroys the diagonal structure of the channel matrix, requiring costly equalizers and reducing throughput.

\[ \text{ICI condition: } \nu_{\max} T_s N \ll 1 \quad\Rightarrow\quad \nu_{\max} \ll \frac{1}{N T_s} = \Delta f \] (9.4)
At 5 GHz carrier and 250 km/h vehicle speed the Doppler shift is \(\nu_d = v f_c/c = (250/3.6)\times5\times10^9/(3\times10^8) \approx 1157\) Hz. For 5G NR \(\mu=0\) (\(\Delta f = 15\) kHz) this gives \(\nu_d/\Delta f \approx 7.7\%\), already noticeable. For \(\mu=0\) with large \(N\) (e.g., 2048-point FFT) the ICI can degrade coded BER by several dB.

9.2 FBMC — Filter Bank Multi-Carrier

FBMC replaces the rectangular pulse of OFDM with a well-designed prototype filter \(p(t)\) applied independently to each subcarrier. The transmitted signal is:

\[ s(t) = \mathrm{Re}\!\left\{ \sum_{k=0}^{K-1}\sum_{m=-\infty}^{\infty} d_{k,m}\, g_{k,m}(t) \right\}, \quad g_{k,m}(t) = p(t - mT/2)\,e^{j2\pi k \Delta f t}\,e^{j\varphi_{k,m}} \] (9.5)

where \(d_{k,m}\) are real-valued OQAM symbols, \(T\) is the symbol period, \(\Delta f = 1/T\) is subcarrier spacing, and \(\varphi_{k,m} = \tfrac{\pi}{2}(k+m)\) is the phase offset used in Offset-QAM.

9.2.1 Prototype Filter Design — PHYDYAS Filter

The most widely studied prototype filter for FBMC is the PHYDYAS filter, specified by its frequency-domain coefficients. For overlapping factor \(K=4\) (filter spans 4 multicarrier symbol periods) the frequency samples are:

\[ H[0]=1,\quad H[1]=0.971960,\quad H[2]=\frac{1}{\sqrt{2}},\quad H[3]=0.235147 \] (9.6)

The prototype filter impulse response in continuous time has length \(L_p = K\cdot N\) samples (where \(N\) is the FFT size). The resulting power spectral density sidelobe is suppressed to below \(-40\) dB, compared to only \(-13\) dB for the rectangular window (OFDM).

The PHYDYAS sidelobe suppression of >40 dB allows FBMC subcarriers to coexist with legacy systems in adjacent spectrum with negligible mutual interference — a key advantage for cognitive radio and fragmented-spectrum deployments.

9.2.2 OQAM — Offset QAM

The fundamental constraint of FBMC is that perfect reconstruction without a CP requires the transmit basis functions to satisfy the Nyquist condition only in the real field. Complex QAM symbols produce unavoidable intrinsic imaginary interference from neighboring subcarriers. OQAM circumvents this by transmitting the in-phase (I) and quadrature (Q) components of each QAM symbol on staggered time grids separated by \(T/2\):

\[ d_{k,m} = \begin{cases} \mathrm{Re}[c_{k,m/2}] & m \text{ even} \\ \mathrm{Im}[c_{k,(m-1)/2}] & m \text{ odd} \end{cases} \] (9.7)

where \(c_{k,n}\) is the complex QAM symbol on subcarrier \(k\) at time index \(n\). The real orthogonality condition for FBMC-OQAM is:

\[ \mathrm{Re}\!\bigl\{\langle g_{k,m},\, g_{k',m'}\rangle\bigr\} = \delta_{k,k'}\,\delta_{m,m'} \] (9.8)

The imaginary interference \(\mathrm{Im}\{\langle g_{k,m}, g_{k',m'}\rangle\}\) from the nearest time-frequency neighbors is non-zero but known — it can be cancelled with auxiliary pilots or by treating it as structured noise in MMSE equalization.

FBMC-OQAM Advantages
  • No CP — 100% spectral efficiency in time
  • Sidelobe suppression >40 dB vs ~13 dB (OFDM)
  • No guard bands needed between coexisting users
  • Ideal for cognitive radio and fragmented spectrum
  • Per-subcarrier filter design enables flexible resource allocation
FBMC Disadvantages
  • Only real-valued symbols per subcarrier (OQAM constraint)
  • Latency: filter length \(= K \times N\) (4× symbol duration for \(K=4\))
  • Channel estimation requires special pilot structures (scattered pilots contaminated by imaginary interference)
  • MIMO extension is non-trivial (imaginary cross-talk between antennas)
  • Not standardized in 3GPP; implementation complexity higher than OFDM

9.3 f-OFDM — Filtered OFDM

Filtered OFDM (f-OFDM) is a pragmatic variant that applies a single time-domain filter to each sub-band of contiguous subcarriers, rather than filtering at the individual-subcarrier level. The baseband model for sub-band \(b\) is:

\[ s_b(t) = \bigl[x_b(t)\bigr] * h_b(t) \] (9.9)

where \(x_b(t)\) is the conventional CP-OFDM waveform for sub-band \(b\) and \(h_b(t)\) is a raised-cosine or windowed-sinc sub-band filter with a rolloff factor \(\beta\) and passband equal to the sub-band bandwidth.

9.3.1 Mixed Numerology in 5G NR

A key motivation for f-OFDM is the ability to multiplex different subcarrier spacings (\(\mu = 0, 1, 2, \ldots\) corresponding to SCS 15, 30, 60 kHz, …) in the same wideband carrier without mutual interference. Each numerology occupies its own sub-band and is filtered independently:

\[ s(t) = \sum_{b=0}^{B-1} s_b(t) \, e^{j2\pi f_b t}, \quad f_b = \text{center frequency of sub-band } b \] (9.10)

At the filter transition band (a few subcarriers wide), inter-sub-band interference (ISBI) exists. In 5G NR this is managed by the guard sub-band mechanism — typically 1–2 subcarriers are left unused at sub-band edges.

9.3.2 f-OFDM vs UFMC

Property f-OFDM UFMC
Filtering granularity Per sub-band (many RBs) Per RB or small RB group
Filter length Moderate (several symbols) Short (\(L-1\) samples, \(L\ll N_{CP}\))
CP retained? Yes (within each sub-band) No (filter provides delay tolerance)
Mixed numerology Primary use case Limited support
PAPR Similar to OFDM Similar to OFDM
Complexity Low (B sub-band filters) Higher (per-RB filters)

9.4 UFMC — Universal Filtered Multi-Carrier

UFMC filters each resource block (RB) or small group of RBs individually using a Dolph-Chebyshev filter. The transmitted signal is the sum of per-RB filtered OFDM bursts:

\[ \mathbf{s} = \sum_{i=1}^{B} \mathbf{F}_i \mathbf{V}_i \mathbf{d}_i \] (9.11)

where \(\mathbf{d}_i \in \mathbb{C}^{N_i}\) is the QAM symbol vector for RB \(i\), \(\mathbf{V}_i\) is the \(N \times N_i\) IDFT submatrix for the \(N_i\) active subcarriers of RB \(i\), and \(\mathbf{F}_i\) is the convolution (Toeplitz) matrix of the Dolph-Chebyshev filter of length \(L\). The total transmitted block length is \(N + L - 1\) samples — no CP is inserted.

9.4.1 Dolph-Chebyshev Filter Properties

The Dolph-Chebyshev filter minimizes the main lobe width for a given maximum sidelobe level (or vice versa). For a prescribed sidelobe attenuation \(A\) dB, the filter order \(L\) and window coefficients are determined analytically via:

\[ w[n] = \frac{1}{L} \sum_{k=0}^{L-1} W(k)\, e^{j2\pi kn/L}, \quad W(k) = \frac{T_{L-1}\!\left(x_0 \cos\!\tfrac{\pi k}{L}\right)}{T_{L-1}(x_0)} \] (9.12)

where \(T_n(\cdot)\) is the Chebyshev polynomial of the first kind and \(x_0 = \cosh\!\bigl(\tfrac{1}{L-1}\cosh^{-1}(10^{A/20})\bigr)\). Typical UFMC parameters: \(L = 73\) (for a 5G NR 12-subcarrier RB at SCS 15 kHz), sidelobe attenuation 40 dB.

UFMC Receiver (1-shot DFT processing)
  1. Receive \(N + L - 1\) samples (no CP removal).
  2. Zero-pad to \(2N\) samples.
  3. Apply \(2N\)-point DFT.
  4. Take every other DFT output (decimation by 2): recovers the \(N\) subcarrier symbols on the desired RBs.
  5. Apply one-tap frequency-domain equalization per subcarrier (channel estimation needed; standard LS/MMSE pilots).

9.4.2 UFMC Target Scenarios

UFMC was proposed in 5GNOW (EU FP7 project) primarily for:

9.5 OTFS — Orthogonal Time-Frequency Space

OTFS is a modulation scheme proposed by Hadani et al. (2017) that places information symbols in the delay-Doppler (DD) domain rather than the time-frequency (TF) plane. The fundamental insight is that for typical wireless channels, the DD domain representation is sparse: each physical scatterer corresponds to a single (delay, Doppler) tap, making the channel extremely compact regardless of mobility.

9.5.1 Delay-Doppler Signal Model

Consider an \(M \times N\) OTFS frame. Symbols \(x[l,k]\) are placed on a DD grid with delay resolution \(\Delta\tau = 1/(M\Delta f)\) and Doppler resolution \(\Delta\nu = 1/(NT)\). The modulation chain is:

\[ X[n,m] = \frac{1}{\sqrt{MN}} \sum_{k=0}^{N-1}\sum_{l=0}^{M-1} x[l,k]\, e^{j2\pi\left(\frac{nk}{N} - \frac{ml}{M}\right)} \quad \xleftarrow{\text{2D-ISFFT}} \] (9.13)

\(X[n,m]\) is the time-frequency domain representation, which is then converted to a time-domain waveform via the Heisenberg transform (a standard OFDM modulator):

\[ s(t) = \sum_{n=0}^{N-1}\sum_{m=0}^{M-1} X[n,m]\, g_{tx}(t - nT)\, e^{j2\pi m \Delta f (t-nT)} \] (9.14)

where \(g_{tx}(t)\) is the transmit pulse (rectangular for OTFS-CP). At the receiver, the Wigner transform demodulates back to TF, and a 2D-SFFT transforms to DD:

\[ y[l,k] = \sum_{l'=0}^{M-1}\sum_{k'=0}^{N-1} h_w[l',k']\, x[\langle l-l'\rangle_M,\langle k-k'\rangle_N] + \eta[l,k] \] (9.15)

This is a 2D circular convolution with the DD domain channel \(h_w[l',k']\) — a sparse matrix with at most \(P\) non-zero entries (one per scattering path). This sparsity underpins OTFS's efficiency advantage for channel estimation and equalization.

9.5.2 Full Diversity Property

Every OTFS symbol \(x[l,k]\) spreads its energy across the entire time-frequency plane via the 2D-ISFFT. As a result, each symbol experiences the full diversity of the channel — all \(P\) delay-Doppler paths contribute to each received DD-domain sample. The diversity order is:

\[ d_{\mathrm{OTFS}} = P \cdot MN \quad \text{(ideal case)} \] (9.16)

In contrast, a single CP-OFDM subcarrier experiencing fading has diversity order 1 (per subcarrier) without coding or interleaving. This is why OTFS shows dramatically better BER at high mobility — it converts fading dips into a manageable average-SNR problem.

9.5.3 Relationship to CP-OFDM and PAPR

OTFS is mathematically equivalent to CP-OFDM preceded by a 2D-ISFFT precoding matrix. This means:

9.5.4 DD Channel Estimation

Since the channel is sparse in DD domain (\(P \ll MN\)), pilot-based estimation requires far fewer pilots than OFDM. A single impulse pilot at DD coordinate \((l_p, k_p)\) with guard region produces:

\[ y_{\text{pilot}}[l,k] = h_w[l-l_p, k-k_p] \cdot x_p + \eta \] (9.17)

The guard region of size \((2l_{\max}+1)\times(2k_{\max}+1)\) around the pilot isolates the channel response from data interference. The pilot overhead scales with the channel spread, not the frame size, giving significant savings for sparse doubly-dispersive channels.

3GPP and 6G Outlook: OTFS has not been included in 3GPP Rel-18 or Rel-19 (which focus on 5G Advanced). However, the delay-Doppler framework has been adopted into the academic 6G roadmap and appears in 3GPP Study Item discussions for enhanced NTN (non-terrestrial networks) at FR1 and FR2 bands, where LEO satellite Doppler (\(\nu \sim 20\text{–}50\) kHz) exceeds all current NR numerologies.

9.6 Waveform Comparison

Waveform CP overhead OOB emissions PAPR High-mobility Complexity Standard
CP-OFDM ~7% (\(\mu=0\)) Poor (~−13 dB sidelobe) High (~10–11 dB) Poor (ICI at high \(\nu\)) Low (DFT) LTE, 5G NR, Wi-Fi
FBMC-OQAM 0% (no CP) Excellent (>40 dB) High (same as OFDM) Poor (long filter latency) High (per-SC filter bank) None (research/cognitive radio)
f-OFDM ~7% (CP retained) Good (>30 dB, sub-band) High Moderate Low–Medium (B filters) 5G NR research; not standardized
UFMC 0% (no CP) Good (>30–35 dB) High Moderate Medium (per-RB filters) 5GNOW proposal; NB-IoT studies
OTFS ~1 CP/frame (<1%) Same as CP-OFDM High (same as OFDM) Excellent (full diversity) Medium (2D-ISFFT + OFDM) 6G research; 3GPP NTN study

9.7 OQAM Nyquist Condition for the Prototype Filter

The prototype filter \(p(t)\) must satisfy the real Nyquist condition to guarantee zero inter-symbol and inter-carrier interference in FBMC-OQAM. In the Zak (or Wigner-Ville) domain, this condition is:

\[ \sum_{k \in \mathbb{Z}} p\!\left(t - k\frac{T}{2}\right) \overline{p\!\left(t - k\frac{T}{2} - \tau\right)} = \delta(\tau), \quad \forall t \] (9.18)

Equivalently in the frequency domain (Balian-Low constraint):

\[ |P(f)|^2 + |P(f - \Delta f)|^2 = T, \quad \forall f \] (9.19)

This is the half-Nyquist (power complementary) condition. The PHYDYAS filter satisfies (9.19) up to a very good approximation for \(K \geq 4\). The ISOTROPE filter satisfies it exactly with a closed-form expression.

OQAM carrier trick (even/odd indexing): On even-indexed subcarriers (\(k\) even), the in-phase symbol is transmitted at time \(mT\) and the quadrature symbol at \((m+\tfrac{1}{2})T\). On odd-indexed subcarriers the roles are swapped. The phase offset \(\varphi_{k,m} = \tfrac{\pi}{2}(k+m)\) ensures that neighboring symbols (in time and frequency) are in imaginary phase relative to the desired symbol. Since the Nyquist filter guarantees real orthogonality, these imaginary neighbors do not contaminate the decision variable after taking \(\mathrm{Re}\{\cdot\}\) at the receiver.

9.8 Visualizations

OOB Spectral Emissions — CP-OFDM vs f-OFDM vs FBMC (PSD comparison)
Normalized power spectral density (dB) around a 12-subcarrier block. CP-OFDM (rectangular window) shows slow sinc-like rolloff with first sidelobe at ~−13 dB. f-OFDM (raised-cosine sub-band filter, rolloff 0.1) suppresses OOB by >30 dB beyond 2 subcarriers. FBMC with the PHYDYAS prototype filter (\(K=4\)) achieves >40 dB suppression — enabling guard-band-free coexistence.
OTFS vs CP-OFDM — BER vs SNR at 250 km/h, 5 GHz (Doppler ~1157 Hz)
Uncoded BER vs \(E_b/N_0\) for 16-QAM over a doubly-dispersive channel (3-path, delay spread 1 µs, Doppler spread ±1157 Hz corresponding to 250 km/h at 5 GHz). CP-OFDM suffers an irreducible error floor from ICI; OTFS exploits full delay-Doppler diversity and achieves a steep BER slope with no floor. OTFS gain over CP-OFDM is ~8 dB at BER = 10⁻³.

9.9 Study Questions

  1. FBMC vs CP-OFDM guard bands: A cognitive radio system must coexist with a licensed user 2 subcarriers away. CP-OFDM sidelobes at 2 subcarrier offset are approximately \(-20\log_{10}(2\pi \times 2) \approx -22\) dB below the main lobe. The PHYDYAS filter (\(K=4\)) achieves >40 dB suppression at the same offset. If the licensed user's interference tolerance is −30 dBc, explain why CP-OFDM requires a guard band of at least 4–5 subcarriers while FBMC-OQAM can operate with zero guard subcarriers. What constraint does this place on the FBMC frame latency if SCS = 15 kHz and \(K=4\)?
  2. OTFS diversity order: A channel has \(P=3\) scattering paths. An OTFS frame uses \(M=16\) delay bins and \(N=8\) Doppler bins. Using equation (9.16), what is the maximum diversity order available to each symbol? Contrast this with coded CP-OFDM (coding gain but not modulation diversity) and explain why OTFS BER curve slope is steeper. Under what condition does OTFS diversity collapse to match OFDM (hint: consider the rectangular pulse guard region in the DD domain)?
  3. UFMC vs FBMC latency trade-off: Both UFMC and FBMC remove the CP and reduce OOB emissions. UFMC uses a Dolph-Chebyshev filter of length \(L=73\) samples at SCS 15 kHz (FFT size \(N=2048\)). FBMC-OQAM with \(K=4\) has a prototype filter of length \(L_p = K \times N = 8192\) samples. Calculate the additional filter-induced latency in microseconds for each scheme. For a URLLC application requiring end-to-end latency < 1 ms, which waveform is disqualified and why? What is the minimum practical overlapping factor \(K_{\min}\) for FBMC to remain within this latency budget at SCS = 15 kHz?
§10

OFDM Pros & Cons — Comprehensive Analysis

This section provides a rigorous, balanced assessment of CP-OFDM — the waveform that underpins 4G LTE, 5G NR, Wi-Fi 4–7, and most modern broadband standards. Every advantage carries a quantified engineering justification; every disadvantage is paired with its real-world impact and the mitigations that have been standardised or researched. The section closes with a deployment decision matrix and two interactive charts.

1   Advantages of CP-OFDM

1a   Multipath Resilience via Single-Tap Equalization

The cyclic prefix converts the linear channel convolution into a circular one. In the frequency domain this means that the received subcarrier k sees a multiplicative scalar channel coefficient rather than inter-symbol interference:

CP-OFDM equalization \[ Y[k] = H[k] \cdot X[k] + W[k] \]

Equalising each subcarrier therefore costs exactly one complex multiplication per subcarrier per OFDM symbol, regardless of the channel delay spread. Compare this with the brute-force single-carrier (SC) alternative:

SC Viterbi complexity \[ \text{SC Viterbi states} = M^J \]

where \(M\) is the constellation order and \(J\) is the number of channel taps. For a 100-tap LTE Urban-Macro channel with 64-QAM (\(M=64\)):

Even a modest 16-QAM, 10-tap SC channel requires 1610 ≈ 1.1 × 1012 Viterbi states, while OFDM needs exactly 10 multiplications. This is the primary engineering reason OFDM was chosen for ADSL, LTE, and 802.11a/g/n.

1b   Efficient FFT-Based Implementation

The modulator/demodulator is a single IFFT/FFT pair. The Cooley–Tukey radix-2 FFT reduces the multiply count from \(N^2\) (direct DFT) to \(\frac{N}{2}\log_2 N\) complex multiplications:

FFT complexity \[ C_\text{FFT} = \tfrac{N}{2}\log_2 N \qquad \text{vs} \qquad C_\text{DFT} = N^2 \]

For N = 2048 (LTE 20 MHz / 5G NR μ=0 at 30.72 Msps):

MethodOperationsRatio
Direct DFT (N²)4,194,304
Radix-2 FFT22,528186× fewer

For N = 4096 (400 MHz NR, μ=3) the savings are even larger: 8,192 vs 16,777,216 — a 2048× reduction. This makes real-time 100+ MHz OFDM feasible on a single embedded DSP or FPGA slice.

1c   Flexible Frequency-Domain Resource Allocation (OFDMA)

Because each subcarrier is modulated independently, the scheduler can allocate different subsets of subcarriers to different users, power levels, and MCS in each OFDM symbol. This enables three powerful capabilities:

  1. Water-filling capacity: allocate power inversely proportional to noise level per subcarrier. Theoretical capacity approaches the Shannon limit: \[C = \sum_{k} B_\text{sc} \log_2\!\left(1 + \frac{P_k |H[k]|^2}{\sigma^2}\right)\]
  2. Frequency-selective multi-user scheduling: assign subcarrier group \(k\) to the user whose channel \(|H_u[k]|\) is largest — multiuser diversity gain scales as \(\log\log K\) for \(K\) users (Knopp–Humblet theorem).
  3. Non-contiguous spectrum access: subcarriers in a fragmented licensed band or cognitive-radio secondary band can be individually toggled, something impossible with a single-carrier system without a full redesign.
LTE's Resource Block (RB) = 12 subcarriers × 180 kHz = 2.16 MHz granularity. A 20 MHz carrier has 100 RBs, any subset of which can be allocated per TTI — achieving scheduling gains of 2–3× over round-robin in typical urban deployments.

1d   Easy MIMO Implementation

At each subcarrier the channel is flat-fading: the MIMO channel matrix \(\mathbf{H}[k] \in \mathbb{C}^{N_r \times N_t}\) is constant across that subcarrier's bandwidth. Spatial multiplexing therefore reduces to a simple matrix equation per subcarrier:

MIMO-OFDM per subcarrier \[ \mathbf{y}[k] = \mathbf{H}[k]\,\mathbf{x}[k] + \mathbf{w}[k] \]

For massive MIMO with \(N_t = 64\) transmit antennas and a single-stream UE (\(N_r = 1\)), the per-subcarrier equalizer is 64 scalar multiplications — one weight per antenna. Contrast this with a wideband single-carrier massive MIMO equalizer which requires a full \(64 \times J\)-tap matrix filter per receive antenna.

In the flat-fading (single-path) limit each \(\mathbf{H}[k] = h_k \mathbf{I}\), reducing MIMO-OFDM to a trivial scalar problem. Real channels are more complex, but the per-subcarrier decomposition always holds as long as the CP absorbs the delay spread.

1e   Scalable Bandwidth and Numerology

The OFDM framework parameterises cleanly: subcarrier spacing \(\Delta f = 1/T_u\), FFT size \(N\), CP length \(N_\text{CP}\), and sampling rate \(f_s = N \cdot \Delta f\). 5G NR defines five numerologies sharing the same baseband architecture:

μΔf (kHz)Nominal FFT (FR1)Max BW (MHz)Use case
015204850LTE-compat, eMBB sub-6
1304096100eMBB sub-6 primary
2604096200FR1 high-capacity / mmWave FR2
31204096400FR2 eMBB / V2X low-latency
42404096400FR2 reference signal only

Only the FFT size, CP tap count, and sampling clock change between numerologies. The IFFT/FFT hardware IP block is identical — a major silicon cost saving.

1f   Spectral Efficiency

Orthogonality allows subcarriers to overlap in frequency while remaining inter-symbol interference-free — eliminating the guard bands required between single-carrier channels:

Spectral efficiency vs SC-FDM \[ \eta_\text{OFDM} = \frac{N \cdot \Delta f}{N \cdot \Delta f} = 1 \quad\text{(no guard between SCs)} \]

With adaptive modulation and coding (AMC) per subcarrier, OFDM approaches the Shannon bound. Practical LTE/NR peak spectral efficiencies (downlink):

The 3GPP IMT-2020 peak spectral efficiency target of 30 bit/s/Hz is achieved via massive MIMO OFDM — not single-carrier — underscoring OFDM's spectral advantage.

2   Disadvantages of CP-OFDM

2a   High Peak-to-Average Power Ratio (PAPR)

An OFDM signal is the sum of \(N\) independently modulated sinusoids. By the central limit theorem the real and imaginary parts approach Gaussian distributions as \(N \to \infty\), giving a CCDF tail that decays slowly. The theoretical maximum PAPR is:

Maximum PAPR \[ \text{PAPR}_\text{max} = 10\log_{10}(N) \quad\text{[dB]} \]

For N = 2048 this is 33 dB, but the practical 10-4 CCDF value is 10–12 dB. The consequences for the power amplifier (PA) are severe:

PA back-offPA efficiency (typical class-AB)Comment
0 dB (at 1-dB comp.)~50%Heavy clipping / distortion
3 dB~25%Marginal linearity
10 dB (required for OFDM)~5%Standard LTE downlink PA

A 10 dB back-off reduces PA drain efficiency from ~50% to ~5% — a 10× increase in power consumption for the same useful RF output. This is why 3GPP chose DFT-s-OFDM (SC-FDMA) for the LTE/NR uplink: the single-carrier nature reduces PAPR by 4–6 dB, cutting UE battery drain significantly.

Mitigation techniques:
  • DFT-s-OFDM: spreads data across all subcarriers via DFT pre-coding → 4–6 dB PAPR reduction; used in NR UL.
  • Clipping & filtering: clip at 8–9 dB PAPR threshold → ≈2–3 dB gain at cost of EVM floor ≈−30 dB.
  • SLM / PTS: Selected Mapping / Partial Transmit Sequences → 3–4 dB gain; requires side information overhead (<2 bits/symbol).
  • Tone reservation: reserve ≈2% subcarriers for PAPR cancellation tones (IEEE 802.11af approach).

2b   Cyclic Prefix Overhead

The CP carries no new data — it is a copy of the tail of the OFDM symbol prepended to guard against ISI. Every CP sample is "wasted" bandwidth:

CP overhead ratio \[ \eta_\text{CP} = \frac{N_\text{CP}}{N + N_\text{CP}} \]

Concrete values:

StandardNNCP (normal)Overhead per symbolEffective average overhead
LTE normal CP2048144 (6 of 7 symbols) / 160 (1st symbol)144/2192 = 6.57%≈8.3% (slot average)
LTE extended CP2048512512/2560 = 20%20% (all symbols)
5G NR μ=02048144/160same as LTE≈7.1%
5G NR μ=351236/4036/548 = 6.6%≈7.0%

The overhead cannot be reduced below the channel's maximum excess delay divided by the sampling period without incurring ISI. For LTE, the normal CP covers \(N_\text{CP} \times T_s = 144 / 30.72\text{ MHz} = 4.69\;\mu\text{s}\) — sufficient for macro-cell delay spreads up to ≈1.4 km path-length difference.

2c   Sensitivity to CFO and Phase Noise

A carrier frequency offset (CFO) \(\varepsilon\) normalised to subcarrier spacing (\(\varepsilon = \delta f / \Delta f\)) breaks orthogonality, causing inter-carrier interference (ICI). The SIR from ICI is approximately:

ICI SIR due to CFO \[ \text{SIR}_\text{ICI} \approx \frac{3}{\pi^2 \varepsilon^2} \]

Example: a 750 Hz CFO on a 15 kHz subcarrier grid (\(\varepsilon = 0.05\)):

Numerical example \[ \text{SIR} \approx \frac{3}{\pi^2 \times 0.0025} \approx 121 \approx 20.8\;\text{dB} \]

This limits the useful SNR to 20.8 dB — insufficient for 256-QAM (requires >30 dB SIR). Real oscillator specs must achieve sub-10 Hz residual CFO after AFC for 256-QAM links.

Phase noise at mmWave: LO phase noise power spectral density scales as \(f_c^2\) (for a free-running VCO). At 60 GHz vs 30 GHz the phase noise is 4× worse in power (6 dB) — and the wider 120 kHz / 240 kHz subcarrier spacing of 5G NR FR2 only partially compensates. 3GPP introduced Phase Tracking Reference Signals (PTRS) in NR FR2 specifically to track and cancel low-frequency phase noise.

Mitigations:
  • AFC loops: track residual CFO to <1% of Δf.
  • PTRS (5G NR TS 38.211 §7.4.1.2): dense phase-error pilots in time; spans 1 in every 4 OFDM symbols at high MCS.
  • Wider SCS: larger Δf → larger tolerable absolute CFO at same normalised ϵ.
  • Phase-locked oscillators: tighter PLL bandwidth reduces integrated phase noise.

2d   Out-of-Band Emissions (OOB)

The OFDM time-domain symbol has a rectangular (abrupt) window. Its frequency spectrum is a sum of sinc functions, with the first sidelobe at only −13 dB relative to the main lobe:

Rectangular window PSD sidelobe \[ \left|\frac{\sin(\pi f T)}{\pi f T}\right|^2 \xrightarrow{fT=1.5} -13.3\;\text{dB} \]

To protect adjacent channels, guard bands must be inserted. LTE allocates approximately 10% of the channel bandwidth as guard subcarriers (e.g., 72 unused subcarriers at each edge of the 2048-point FFT in 20 MHz mode). These are in addition to any regulatory spectral mask requirements.

Mitigations:
  • Windowing (raised-cosine / Hann): roll-off at symbol boundary → sidelobe ≈ −40 dB at cost of ≈0.5% CP length for ramp.
  • f-OFDM (filtered OFDM): per-subband bandpass filter; used in some 5G NR mixed-numerology proposals.
  • FBMC (Filter Bank Multicarrier): per-subcarrier Nyquist filter; achieves −80 dB OOB but requires twice the prototype filter length (see §9).
  • DFT-spread OFDM + spectrum shaping: the single-carrier envelope reduces OOB by ≈6–8 dB vs CP-OFDM for equivalent output power.

2e   Poor Performance in Doubly Dispersive Channels

CP-OFDM handles delay spread (frequency selectivity) via the CP, but it handles Doppler spread (time selectivity) only if the channel is quasi-static within one OFDM symbol duration \(T_u\). The ICI caused by Doppler shift \(f_D\) is approximately:

ICI from Doppler \[ \text{SIR}_\text{Doppler} \approx \frac{3}{(\pi f_D T_u)^2} \]

For a 5G NR μ=0 symbol at 250 km/h and 3.5 GHz: \(f_D = 250 \times 3.5 \times 10^9 / (3 \times 10^8 \times 3.6) \approx 810\;\text{Hz}\), and \(T_u = 66.7\;\mu\text{s}\), giving \(f_D T_u \approx 0.054\) and SIR ≈ 20.4 dB — borderline for 256-QAM.

When both delay spread and Doppler are significant (doubly dispersive), the CP may be insufficient (ISI) and the symbol duration may be too long (ICI) simultaneously. Increasing \(\Delta f\) (shorter symbol) reduces ICI but also reduces the maximum tolerable delay spread.

Mitigations:
  • Higher SCS: μ=3 (120 kHz) at mmWave → shorter symbol → more Doppler robust.
  • OTFS (Orthogonal Time Frequency Space): modulates in the delay-Doppler domain → full diversity for doubly dispersive channels (see §9).
  • Robust pilot patterns: denser pilot grids for high-mobility channels (e.g., NR PDSCH DMRS type-B with additional CDM groups).
  • V2X slot design: 3GPP Rel-17 NR-V2X uses the 60 kHz SCS specifically to handle high Doppler in vehicular scenarios.

2f   High ADC/DAC Complexity for Wide Bandwidth

OFDM exploits bandwidth by increasing the number of subcarriers, which requires proportionally higher sampling rates. The Nyquist sampling rate for a 400 MHz NR carrier (μ=3, FFT=4096) with standard oversampling is:

NR 400 MHz sampling rate \[ f_s = N \cdot \Delta f = 4096 \times 120\;\text{kHz} = 491.52\;\text{Msps} \]

After 2× oversampling for interpolation filtering: 983.04 Msps. ADC power consumption scales roughly as \(f_s \times 2^\text{ENOB}\), so a 12-bit ADC at 1 Gsps consumes ≈500 mW — substantial in a battery-powered UE. Contrast with NB-IoT (180 kHz, μ=0): ADC at 240 ksps draws <1 mW.

This is an infrastructure cost concern more than a PHY layer design flaw. The large ADC/DAC power budgets are manageable in gNB basebands (where power is from the grid) but constrain future 6G UE designs at THz frequencies.

3   Net Verdict: Why OFDM Won — and Where It Falls Short

CP-OFDM won the 4G/5G standardisation process for a combination of reasons that no single-carrier alternative could simultaneously match in the mid-2000s engineering environment:

  1. Equalizer simplicity: one multiplication per subcarrier vs exponential SC complexity enabled real-time 20 MHz 2×2 MIMO on the <65 nm silicon of 2008.
  2. OFDMA multi-user flexibility: sub-carrier granularity enabled the LTE scheduler architecture that drives the 3–5× throughput gains of LTE over HSPA.
  3. MIMO synergy: per-subcarrier flat-fading made 4×4 MIMO SIC/SU-MIMO tractable without a joint space-time processor.
  4. Ecosystem momentum: ADSL, 802.11a/g, DVB-T all used OFDM; VLSI toolchains, test equipment, and channel models were already OFDM-native.

Where it falls short: OFDM is sub-optimal for (a) high-mobility vehicular/aerial channels (>500 km/h), (b) UE uplink power efficiency, (c) uncoordinated adjacent-channel coexistence due to OOB, and (d) future THz channels with extreme phase noise. The industry has addressed these with DFT-s-OFDM (UL), PTRS (mmWave), windowed/filtered variants (adjacent-channel), and is actively researching OTFS and AFDM for 6G high-mobility scenarios.

The 2030 ITU-R IMT-2030 (6G) framework explicitly lists support for high-mobility and integrated sensing as new requirements — both of which stress CP-OFDM. OTFS and ISAC-native waveforms are the leading candidates for FR3 (7–24 GHz) and above.

4   Deployment Decision Matrix

Scenario Recommended Waveform Key Reason Standard
High-mobility V2X (>250 km/h) OTFS or NR μ=3 Doppler > 1 kHz → ICI floor in CP-OFDM; OTFS spreads energy over full delay-Doppler grid 3GPP Rel-17 NR-V2X; OTFS research (IEEE 2017+)
UE uplink (battery-constrained) DFT-s-OFDM 4–6 dB PAPR reduction → PA back-off ≈6 dB → 2× better PA efficiency LTE SC-FDMA; 5G NR PUSCH DFT-s (TS 38.211)
mmWave gNB downlink CP-OFDM + PTRS Massive MIMO flat-fading per SC; PTRS corrects LO phase noise at 28/39 GHz 5G NR FR2 (TS 38.211 §7.4.1.2)
Cognitive / secondary radio FBMC-OQAM OOB −80 dB via prototype filter; no CP needed for intra-band coexistence IEEE 802.22; 5GNOW project
6G THz / ISAC OTFS + ISAC (TBD) Extreme Doppler; radar ambiguity function requirements; delay-Doppler native sensing ITU-R IMT-2030; ongoing 3GPP study item
IoT narrowband (NB-IoT) Narrow-CP-OFDM Simplicity; reuses LTE numerology (μ=0, 15 kHz); single-subcarrier option for lowest power 3GPP TS 36.211 (NB-IoT), Rel-13+
Wi-Fi 7 (IEEE 802.11be) CP-OFDM + 16384-FFT 4096 subcarriers, 320 MHz BW; OFDMA multi-link operation; indoor → low mobility OK IEEE 802.11be (2024)
Chart 10-1   Waveform Performance Radar — CP-OFDM vs DFT-s-OFDM vs FBMC vs OTFS
Scores are normalised 0–10 (higher = better for each axis). Axes: PAPR performance (low PAPR = high score), OOB suppression, Doppler robustness, ease of implementation (low complexity = high score), and spectral efficiency. Values reflect engineering consensus from 3GPP/IEEE literature; ±1 point estimation uncertainty.
Chart 10-2   OFDM Spectral Efficiency Overhead Breakdown — LTE 20 MHz vs 5G NR 100 MHz (μ=1)
Stacked bar showing the fraction of total time-frequency resources consumed by each overhead category. Remaining bar (top, green) = net data capacity. LTE figures from TS 36.211/213; NR from TS 38.211/213. Pilot overhead assumes moderate channel (PDSCH DMRS type-A, 1 symbol mapping).
  1. PAPR and PA efficiency: A 5G NR gNB uses a class-AB PA with peak efficiency 48% at saturation. The OFDM signal requires 8 dB back-off. Using the quadratic back-off model \(\eta(p) = \eta_\text{sat}\,(p/p_\text{sat})\), calculate the operating PA efficiency and the wasted power fraction. How much does DFT-s-OFDM (4 dB PAPR reduction) improve this?
  2. CFO budget allocation: A 5G NR NR-DC link operates at 3.5 GHz with 30 kHz SCS. The system requires SIR ≥ 30 dB on 256-QAM. Using the ICI SIR formula \(\text{SIR} \approx 3/(\pi^2\varepsilon^2)\), derive the maximum tolerable normalised CFO \(\varepsilon\). Convert this to an absolute frequency error in Hz and state whether a typical TCXO (±0.5 ppm) satisfies this budget.
  3. Waveform design trade-off: You are designing the PHY for a 6G high-altitude platform station (HAPS) link at 26 GHz with UE speed up to 1000 km/h and a delay spread of 2 μs. (a) Compute the coherence bandwidth and minimum subcarrier count for 100 MHz system bandwidth. (b) Compute the maximum Doppler frequency. (c) Evaluate whether CP-OFDM with μ=3 can handle this channel, or whether OTFS is required. Justify using both the ICI SIR formula and the Zadoff criterion \(f_D T_u \ll 1\).
§11

6G Outlook — THz, AI-Native Waveforms, ISAC, and Beyond OFDM

The fifth generation of cellular communications is not yet fully deployed, yet the research community and standards bodies are already defining its successor. The ITU-R IMT-2030 framework sets requirements so demanding that they challenge the fundamental assumptions behind CP-OFDM. This section surveys the key technology pillars of 6G: the physics of THz propagation, the delay-Doppler domain waveform OTFS designed for extreme mobility, AI/ML-native waveform design, the integrated sensing-and-communications paradigm, Reconfigurable Intelligent Surfaces, and the green-communications imperative — and asks where, precisely, OFDM and its successors stand in each.

11.1 6G Vision and IMT-2030 Targets (ITU-R WP5D)

ITU-R Working Party 5D published the IMT-2030 Framework Recommendation (M.2160, 2023) establishing the key performance indicators (KPIs) that candidate technologies must meet by 2030. The jump from 5G to 6G is more radical than the 4G-to-5G transition: several KPIs improve by two orders of magnitude, and entirely new capability classes — native sensing and native AI/ML — are added to the requirement set.

KPI 5G NR (IMT-2020) 6G (IMT-2030 Target) Improvement
Peak data rate 20 Gbps (DL) 1 Tbps (DL) 50×
User experienced rate 100 Mbps 1 Gbps 10×
U-plane latency 1 ms (URLLC) 0.1 ms 10×
Reliability (BLER) 10−5 10−7 100×
Connection density 106 devices/km² 107 devices/km² 10×
Area traffic capacity 10 Mbps/m² 1 Gbps/m² 100×
Sensing range resolution N/A Sub-centimetre New capability
Sensing velocity resolution N/A Sub-mm/s New capability
Energy efficiency Baseline 100× bits/Joule 100×
AI/ML integration Study item (Rel-18) Native, normative New paradigm

The candidate frequency bands reflect the dual imperative of coverage and capacity:

Chart 11-A — 6G vs 5G Key Metrics: Radar (Spider) Chart
Normalised to 5G baseline = 1.0. Values represent the IMT-2030 improvement factor over IMT-2020 on a log10 scale for display purposes. Peak rate: 50×; User rate: 10×; Latency improvement (inverse): 10×; Reliability improvement: 100×; Device density: 10×; Energy efficiency: 100×. Sensing accuracy is a new native capability with no 5G baseline (shown as 10× relative to positioning accuracy from Rel-16/17).

11.2 THz Band Challenges for OFDM

The THz band offers extreme bandwidth, but its propagation physics stress every assumption that makes CP-OFDM efficient. Understanding these constraints quantitatively is essential for any system designer working on 6G air-interface selection.

Molecular Absorption

Unlike lower-frequency bands where path loss follows a smooth power law, the THz band has sharply frequency-selective molecular absorption windows. Atmospheric O2 produces a strong absorption peak near 60 GHz (used deliberately for backhaul isolation in WiGig), while H2O vapour dominates at 183 GHz and 325 GHz. The implication for OFDM is a non-flat channel at the system level — even before multipath — requiring either careful subband selection or absorption-aware water-filling power allocation.

Phase Noise Scaling

Oscillator phase noise power spectral density grows as the square of the carrier frequency. A 300 GHz oscillator with state-of-the-art design exhibits approximately 100× worse phase noise than a comparable 30 GHz oscillator. In OFDM, phase noise causes two degradation modes:

(11.1)
Phase noise ICI variance (first-order approximation):
$$\sigma^2_{\text{ICI}} \approx \frac{2\pi^2}{3} \cdot f_c^2 \cdot c_{\phi} \cdot T_s^2$$
where \(f_c\) is carrier frequency, \(c_\phi\) is the oscillator phase noise coefficient (Lorentzian model), and \(T_s\) is the OFDM symbol duration (excluding CP). The \(f_c^2\) dependence confirms the 100× penalty at 10× carrier frequency.

Ultra-Wideband Delay Spread

At 100 GHz of instantaneous bandwidth, a modest channel delay spread of \(\tau_{\max} = 50\,\text{ns}\) translates to 5,000 channel taps at the Nyquist rate. The CP duration must cover this delay spread, incurring significant overhead, and the equaliser must handle a very long channel — erasing the simplicity of one-tap per-subcarrier equalisation unless a grouped-subcarrier or OFDM-with-DFT-spreading approach is used.

(11.2) $$N_{\text{CP}} = \lceil \tau_{\max} \cdot B \rceil = \lceil 50 \times 10^{-9} \times 100 \times 10^{9} \rceil = 5000 \text{ samples}$$
At \(B = 100\,\text{GHz}\): CP overhead = 5000 samples with total symbol \(N + N_{\text{CP}}\) samples. CP efficiency \(\eta = N/(N + N_{\text{CP}})\) drops sharply unless \(N\) is chosen very large — which in turn worsens ICI from Doppler and phase noise.

Extreme Doppler at THz

At a carrier of \(f_c = 300\,\text{GHz}\) and pedestrian speed \(v = 100\,\text{km/h}\), the Doppler shift is:

(11.3) $$f_D = \frac{v}{c} \cdot f_c = \frac{100/3.6}{3 \times 10^8} \times 300 \times 10^9 \approx 27{,}800\,\text{Hz} \approx 27.8\,\text{kHz}$$
Compare: at 30 GHz (mmWave), \(f_D \approx 2{,}780\,\text{Hz}\). The 300 GHz Doppler is 10× larger. With a 15 kHz OFDM subcarrier spacing (5G NR numerology 0), \(f_D/\Delta f \approx 1.85\) — Doppler is nearly two subcarrier widths, producing severe ICI that cannot be handled by simple one-tap equalisation.

Path Loss and Pencil Beamforming

Free-space path loss at 300 GHz over 10 m is approximately 102 dB — roughly 20 dB worse than at 30 GHz over the same distance. Atmospheric absorption adds further loss. Compensation requires extremely narrow beams: with \(N_{\text{ant}} = 1024\) elements, the array gain is \(10\log_{10}(1024) \approx 30\,\text{dB}\) — barely sufficient to close the link at short range. Beam management at THz must operate at microsecond timescales, far faster than 5G mmWave beam management.

OFDM suitability at THz is genuinely questionable. The combination of severe phase noise ICI, extreme Doppler ICI, and the very large CP overhead required by wideband delay spread pushes OFDM toward its breaking point. Single-carrier (DFT-s-OFDM) or delay-Doppler domain waveforms (OTFS) may be preferred above approximately 100 GHz. The 6G air interface selection for THz bands remains an open research question.

11.3 OTFS for 6G High-Mobility Scenarios

Orthogonal Time Frequency Space (OTFS), proposed by Hadani et al. (2017), modulates information in the delay-Doppler (DD) domain rather than the time-frequency domain used by OFDM. In channels with extreme Doppler — LEO satellite communications being the canonical example — OTFS achieves near-ideal performance while OFDM's subcarrier orthogonality is destroyed.

LEO Satellite Channel: Motivating Numbers

(11.4)
LEO Doppler at 30 GHz (orbital altitude 600 km, orbital speed 7.8 km/s):
$$f_{D,\text{LEO}} = \frac{v_{\text{sat}}}{c} \cdot f_c = \frac{7800}{3 \times 10^8} \times 30 \times 10^9 = 780\,000\,\text{Hz} = 780\,\text{kHz}$$
780 kHz Doppler / 15 kHz SCS = 52 subcarrier widths. No CP-OFDM subcarrier spacing in the 5G NR numerology table can make this channel "flat in frequency" within one OFDM symbol duration. This is the fundamental motivation for OTFS in NTN contexts.

OTFS Signal Model

OTFS uses a 2D transform — the Symplectic Finite Fourier Transform (SFFT) — to convert the time-frequency plane into the delay-Doppler plane, where the mobile channel appears as a sparse, separable (delay, Doppler) impulse response:

(11.5)
DD-domain input-output relation (OTFS):
$$y[\ell, k] = \sum_{i=1}^{P} h_i \cdot x[\ell - \ell_i, k - k_i] + z[\ell, k]$$
\(\ell\) = delay bin index, \(k\) = Doppler bin index, \(h_i\) = complex gain of the \(i\)-th path, \((\ell_i, k_i)\) = integer delay and Doppler shift of path \(i\). The channel is a 2D circular convolution with a sparse kernel — only \(P\) non-zero taps (one per resolvable path) regardless of total signal bandwidth or Doppler spread. Channel estimation reduces to locating \(P\) taps in a \(\ell_{\max} \times k_{\max}\) grid.
(11.6)
OTFS modulation via ISFFT + Heisenberg transform:
$$s(t) = \sum_{\ell=0}^{N-1}\sum_{k=0}^{M-1} X[\ell,k]\, g_{tx}(t - \ell T)\, e^{j2\pi k\Delta f(t - \ell T)}$$
where \(N\) = number of delay bins, \(M\) = number of Doppler bins, \(T\) = OTFS frame duration, \(\Delta f = 1/(NT)\). With rectangular pulse shaping, OTFS reduces to a block of CP-OFDM symbols; with ideal pulse shaping (ISSFFT), OTFS achieves full diversity order \(NM\) (vs OFDM's diversity order 1 in ergodic Rayleigh fading).

OTFS vs OFDM: Capacity and Diversity

Chart 11-B — OTFS vs CP-OFDM BER in LEO Satellite Channel (30 GHz, Extreme Doppler)
Simulated uncoded BER vs Eb/N0 for a LEO channel at 30 GHz with 4 paths, maximum Doppler 780 kHz (52×SCS), delay spread 4 bins. QPSK modulation, \(N = 32\) delay bins, \(M = 32\) Doppler bins. CP-OFDM (same spectral efficiency) collapses to a BER floor due to ICI; OTFS maintains a steep diversity slope. At BER = 10−3, OTFS advantage exceeds 15 dB. Data based on Hadani et al. (IEEE ICC 2017) and Raviteja et al. (IEEE Trans. Wireless Comm. 2018, doi:10.1109/TWC.2018.2876189).

11.4 AI/ML-Native Waveform Design

The conventional communications system design paradigm — derive an analytical channel model, engineer each block (modulation, channel coding, equalization, detection) separately to match the model — faces fundamental limitations in the complex, non-stationary channels of THz and high-mobility 6G. The emerging alternative is end-to-end learning: treat the transmitter and receiver jointly as an autoencoder and optimise them together via stochastic gradient descent.

Autoencoder Approach (Simonyan/Gruber Framework)

(11.7)
End-to-end learned communication system:
$$\hat{m} = f_{\theta_{\text{RX}}}\!\left(h \star f_{\theta_{\text{TX}}}(m) + n\right)$$
\(m \in \{1,\ldots,M\}\) = transmitted message index, \(f_{\theta_{\text{TX}}}\) = transmitter neural network (encoder), \(f_{\theta_{\text{RX}}}\) = receiver neural network (decoder), \(h\) = channel response, \(n\) = AWGN. The objective is to minimise block error rate by jointly optimising \(\theta_{\text{TX}}\) and \(\theta_{\text{RX}}\). Unlike OFDM, there is no explicit subcarrier structure — the TX learns its own waveform basis functions tuned to the target channel and SNR operating point.

Key empirical findings from DeepSIG, OpenAirInterface (OAI), and the academic literature:

3GPP AI/ML Channel Estimation — Rel-18 Study Item

3GPP Release 18 includes a study item (RP-213599) on AI/ML for NR physical layer enhancements, covering three use cases: (1) CSI feedback compression, (2) beam management, (3) channel estimation and equalisation. The Rel-18 study is not normative — it defines the evaluation framework and assesses feasibility. Normative AI/ML PHY enhancements are targeted for Rel-19/20.

Semantic Communication

Beyond bit-level optimisation, semantic communication transmits meaning rather than bits. For structured data (images, speech, sensor readings), joint source-channel coding at the semantic level achieves 10–100× compression over separate source coding + channel coding pipelines at the same reconstruction quality. This requires a shared semantic model between transmitter and receiver — a major departure from the bit-pipe model underlying all existing standards.

The AI/ML-native paradigm does not necessarily replace OFDM: many proposals retain OFDM as the air interface but replace specific PHY blocks (pilot design, channel estimator, equaliser, detector) with neural networks trained offline. This hybrid approach is the pragmatic path toward standardisation in Rel-19/20, where backward compatibility and interpretability requirements constrain how radically the waveform can change.

11.5 ISAC — Integrated Sensing and Communications

ISAC (also called Dual-Function Radar-Communication, DFRC) uses a single waveform, transceiver hardware, and spectrum for both wireless communications and radar-like environmental sensing. OFDM's structure — known pilots on a regular time-frequency grid — makes it a natural dual-purpose waveform.

OFDM Radar Processing

(11.8)
OFDM radar range-velocity estimation (2D periodogram):
$$Y[n,m] = \sum_{\ell=0}^{P-1} \alpha_\ell \, e^{-j2\pi n \ell / N} \, e^{j2\pi m k_\ell / M} + W[n,m]$$
\(n\) = subcarrier index (range domain), \(m\) = OFDM symbol index (Doppler domain), \(N\) = number of subcarriers, \(M\) = number of symbols in radar CPI (coherent processing interval), \(\ell\) = range bin of \(\ell\)-th target, \(k_\ell\) = Doppler bin. Range resolution: \(\Delta R = c/(2B)\). Doppler resolution: \(\Delta v = c\lambda / (2MT_s)\). Range-velocity estimation reduces to a 2D FFT of the channel transfer matrix \(Y[n,m]\) — exploiting OFDM's existing subcarrier structure at no additional hardware cost.

Unified ISAC Signal Model

(11.9)
Communications mode: \(\mathbf{y}_c = \mathbf{H}_c \mathbf{x} + \mathbf{z}_c\)
Sensing mode: \(\mathbf{y}_s = \mathbf{H}_s(\tau, f_D) \mathbf{x} + \mathbf{z}_s\)
\(\mathbf{x}\) = transmitted OFDM block (same waveform for both functions). \(\mathbf{H}_c\) = communications channel (UE-specific multipath). \(\mathbf{H}_s(\tau, f_D)\) = sensing channel (target range \(\tau\), velocity \(f_D\)). The fundamental tension: communications performance favours randomised data symbols (maximise information entropy); sensing performance favours deterministic known symbols (maximise waveform ambiguity function control). This defines the ISAC Pareto frontier.

ISAC Pareto Frontier and Trade-off

The achievable region for simultaneous communications (bit rate \(R\)) and sensing (estimation SINR \(\Gamma_s\)) forms a Pareto frontier parameterised by the power split between data and reference (pilot/sensing) symbols. Key results:

3GPP ISAC in 5G NR and 6G

11.6 RIS — Reconfigurable Intelligent Surfaces

A Reconfigurable Intelligent Surface (RIS) is a planar array of passive electromagnetic elements, each with a controllable reflection phase shift. Unlike an active relay, a RIS does not amplify or decode the signal — it passively steers reflected wavefronts. This enables coverage extension without additional RF chains, power amplifiers, or noise amplification.

OFDM + RIS Channel Model

(11.10)
Per-subcarrier received signal with RIS:
$$Y[k] = \left(\mathbf{h}_d^H[k] + \mathbf{h}_r^H[k]\,\text{diag}(\boldsymbol{\theta}[k])\,\mathbf{G}[k]\right)\mathbf{x}[k] + Z[k]$$
\(\mathbf{h}_d[k]\) = direct channel vector (gNB to UE) at subcarrier \(k\), \(\mathbf{h}_r[k]\) = RIS-to-UE channel vector, \(\mathbf{G}[k]\) = gNB-to-RIS channel matrix, \(\boldsymbol{\theta}[k] = [e^{j\phi_1[k]}, \ldots, e^{j\phi_L[k]}]^T\) = complex reflection coefficients of \(L\) RIS elements at subcarrier \(k\). Frequency-selective RIS: full degrees of freedom require per-subcarrier phase control — an area of active research (most current RIS hardware provides single-frequency or wideband-average phase control).

The passive beamforming gain of an \(L\)-element RIS scales as \(L^2\) (coherent combination), compared to \(L\) for active arrays. This quadratic scaling is the primary motivation: a 256-element RIS provides 48 dB of passive beamforming gain, compensating for the 20 dB additional path loss of mmWave / sub-THz bands over microwave. Practical constraints include:

11.7 Energy Efficiency in 6G

The 100× bits-per-Joule target relative to 5G is the most challenging KPI in the IMT-2030 framework. The global ICT sector currently consumes approximately 2–3% of world electricity; with the projected 100× traffic growth from 5G to 6G, aggressive energy efficiency improvement is essential for sustainability.

OFDM Energy Overhead

CP-OFDM has inherent energy inefficiencies compared to narrowband alternatives:

Green 6G Alternatives

The energy efficiency imperative creates a spectrum of 6G air-interface requirements that a single waveform cannot optimally serve: Tbps links demand wideband OFDM/OTFS with massive MIMO; billion-device IoT demands near-zero-energy UNB alternatives. 6G must be a multi-waveform, multi-numerology framework even more heterogeneous than 5G NR.

11.8 Summary: OFDM Evolution Across Generations

Generation Primary Waveform(s) Key OFDM Innovation Peak Rate Bands New Capability
4G LTE CP-OFDM (DL), DFT-s-OFDM (UL) OFDMA multiple access; SC-FDMA for UL PAPR reduction 1 Gbps Sub-3 GHz Mobile broadband at scale
5G NR CP-OFDM + DFT-s-OFDM Flexible numerology (SCS 15–240 kHz); mmWave OFDM; massive MIMO 20 Gbps Sub-6 GHz, FR2 (24–52 GHz) URLLC, eMBB, mMTC
5G Advanced CP-OFDM + ISAC extensions Sensing RS (SRS/PRS-based); dual-function pilot patterns; AI/ML PHY (study) 20+ Gbps Sub-6 GHz, FR2, FR3 (7–24 GHz, study) Native positioning; sensing feasibility; AI/ML study
6G (IMT-2030) OTFS / CP-OFDM / AI-native / UNB-IoT Native ISAC waveform; DD-domain modulation; end-to-end learned waveforms; RIS integration 1 Tbps FR3, sub-THz (100–300 GHz), THz (>300 GHz) Native sensing; native AI/ML; native energy efficiency; NTN integration

Study Questions

  1. THz OFDM design trade-off: At a carrier frequency of 300 GHz with 100 GHz bandwidth, derive the required CP length (in samples) if \(\tau_{\max} = 30\,\text{ns}\), and compute the resulting CP overhead fraction for subcarrier spacings of 480 kHz and 960 kHz (5G NR numerologies 5 and 6). Given that phase noise ICI power scales as \(f_c^2 \cdot T_s^2\), how does doubling the SCS affect the ICI-vs-CP-overhead trade-off, and at what SCS does ICI become the dominant impairment rather than CP overhead?
  2. OTFS diversity advantage: A LEO satellite channel at 30 GHz has 4 paths with delays \(\ell \in \{0, 1, 3, 7\}\) delay bins and Doppler shifts \(k \in \{0, 5, -3, 12\}\) Doppler bins on a \(32 \times 32\) DD grid. Sketch the 2D channel response in the delay-Doppler plane. What is the full diversity order achievable by OTFS (assuming ideal pulse shaping and optimal coding)? Why does CP-OFDM with a one-tap equaliser achieve diversity order 1 in this channel, and what coding/equalisation approach would be needed to recover full diversity with OFDM?
  3. ISAC Pareto frontier: Consider an OFDM ISAC system with \(N = 1024\) subcarriers, \(M = 14\) symbols per slot, and total power \(P_T\). Let a fraction \(\alpha \in [0,1]\) of resource elements carry data and \((1-\alpha)\) carry sensing reference signals (known waveform). Express the communications spectral efficiency \(R(\alpha)\) and the sensing Fisher information \(\mathcal{I}(\alpha)\) as functions of \(\alpha\). Show that \(R(\alpha)\) is concave and \(\mathcal{I}(\alpha)\) is monotonically decreasing in \(\alpha\), and determine the \(\alpha^*\) that maximises \(R + \lambda \mathcal{I}\) for a given Lagrange multiplier \(\lambda\) (sensing priority weight). What does the optimal \(\alpha^*\) imply for the design of 6G dual-function pilot patterns?
OFDM is not the end of the story — it is the platform on which 6G innovation is being built. Every advanced technique surveyed in this section — OTFS, AI/ML waveform design, ISAC, RIS, THz communications — either extends OFDM (ISAC, RIS), augments it (AI/ML channel estimation), or defines itself in relation to it (OTFS as a 2D generalisation of the time-frequency grid). The transition to 6G will not be a clean break from OFDM: it will be an evolution toward a richer, more capable multi-waveform framework in which CP-OFDM remains the reference air interface while specialist waveforms — delay-Doppler for extreme mobility, AI-native for THz non-stationarity, ultra-narrowband for zero-energy IoT — coexist in a heterogeneous network. The unifying theme is the time-frequency plane: OFDM's mathematical foundations in harmonic analysis, uncertainty principles, and sampling theory remain the bedrock on which every 6G air interface is analytically described.
§12

3GPP OFDM Signal Generation — 4G LTE & 5G NR

This section walks through the normative OFDM baseband signal models defined in 3GPP TS 36.211 (LTE) and TS 38.211 (NR), covering downlink CP-OFDM, uplink DFT-s-OFDM (SC-FDMA in LTE), and the Sounding Reference Signal (SRS) for both generations. Each model is accompanied by an interactive chart.

12.1 LTE Physical Layer — Parametric Overview TS 36.211 §5–6

LTE uses CP-OFDM on the downlink (PDSCH, PDCCH, etc.) and DFT-s-OFDM (SC-FDMA) on the uplink (PUSCH) to keep the UE power-amplifier back-off low. Key parameters are fixed across all bandwidths:

BW (MHz)NFFTNused NRBCPnormal (μs)CPext (μs) Tsymbol (μs)
1.41287664.6916.6771.35
3256152124.6916.6771.35
5512300254.6916.6771.35
101024600504.6916.6771.35
151536900754.6916.6771.35
20204812001004.6916.6771.35

Subcarrier spacing is fixed at \(\Delta f = 15\,\text{kHz}\), yielding a useful symbol duration \(T_u = 1/\Delta f = 66.67\,\mu\text{s}\). The first CP of each 0.5 ms slot is slightly longer (5.21 μs) to maintain the 0.5 ms frame alignment.

12.2 LTE Downlink — CP-OFDM Signal Model TS 36.211 §6.12

The complex baseband OFDM time-domain signal for OFDM symbol \(\ell\) in subframe \(s\) is (TS 36.211 Eq. 6.12-1):

\[ s_{\ell}(t) = \sum_{k=-N_{\rm FFT}/2}^{N_{\rm FFT}/2 - 1} a_{k,\ell}\,e^{j2\pi k\Delta f\,(t - N_{\rm CP}T_s)}, \quad 0 \le t < (N_{\rm CP}+N_{\rm FFT})\,T_s \]

where \(a_{k,\ell}\) is the complex modulation symbol on subcarrier \(k\), symbol \(\ell\); \(T_s = 1/(15000 \times 2048) \approx 32.55\,\text{ns}\) is the basic LTE time unit; \(N_{\rm CP}\) is the CP length in samples (160 for the first symbol, 144 for symbols 1–6 in a 0.5 ms slot). Implemented as IFFT + CP prepend:

\[ \mathbf{s}_\ell = \underbrace{\text{CP}}_{\text{last }N_{\rm CP}\text{ samples}} \| \; \text{IFFT}_{N_{\rm FFT}}\!\left(\mathbf{a}_\ell\right) \]

The resource grid spans \(N_{\rm RB}\times 12\) subcarriers in frequency and 14 OFDM symbols per 1 ms subframe (normal CP).

LTE DL Resource Grid — Normal CP, 10 MHz (50 RBs, 14 symbols/subframe)
Each column = 1 OFDM symbol; each row = 1 subcarrier. Colour key: data (blue), CRS ports 0/1 (orange/red), PDCCH (purple), DC guard (grey). TS 36.211 Table 6.10.1.2-1 pilot pattern (2 CRS ports shown).
LTE DL Spectrum — Used vs Guard Subcarriers for 1.4 / 5 / 10 / 20 MHz Configurations
Each bar shows the fraction of NFFT carrying data vs DC/guard. DC subcarrier is always nulled. Guard band ≈ 10% each side. TS 36.211 Table 5.6-1 / 6.12-1.

12.3 LTE Uplink — DFT-s-OFDM (SC-FDMA) TS 36.211 §5.3–5.4

To reduce PAPR and relax the UE power amplifier, the LTE uplink uses DFT-spread OFDM. The transmit chain (TS 36.211 §5.3) is:

\[ \underbrace{d_0,\ldots,d_{M-1}}_{\text{QAM symbols}} \xrightarrow{\;\text{M-pt DFT}\;} \tilde{d}_0,\ldots,\tilde{d}_{M-1} \xrightarrow{\;\text{subcarrier map}\;} \underbrace{a_0,\ldots,a_{N_{\rm FFT}-1}}_{\text{spread over }N_{\rm FFT}} \xrightarrow{\;\text{IFFT}_{N_{\rm FFT}} + \text{CP}\;} s(t) \]

The DFT step spreads each QAM symbol across all \(M\) subcarriers allocated to the UE, creating a single-carrier signal in the time domain with PAPR comparable to QPSK even for higher-order modulations. \(M\) must be a product of 2, 3, and 5 (e.g.\ 12, 24, 36, …, 1200) — TS 36.211 Table 5.3-1.

Localised (LFDMA) and interleaved (IFDMA) subcarrier mappings are supported. LTE uses exclusively LFDMA (contiguous allocation):

\[ a_{k} = \begin{cases} \tilde{d}_{k - k_0} & k_0 \le k < k_0 + M \\ 0 & \text{otherwise} \end{cases} \]

The received signal after flat-fading channel \(H_k\) and OFDM demodulation:

\[ Y_k = H_k\,\tilde{d}_{k-k_0} + N_k, \quad k_0 \le k < k_0+M \]

A single-tap frequency-domain equalizer is then applied before the M-pt IDFT to recover \(\hat{d}_m\).

PAPR CCDF — CP-OFDM (DL) vs DFT-s-OFDM (UL) for LTE, N=1024, M=300
Complementary CDF of instantaneous PAPR. DFT-s-OFDM achieves ~4 dB lower PAPR at CCDF = 0.1%, directly relaxing UE power-amplifier back-off requirements. QPSK DFT-s-OFDM approaches single-carrier performance. TS 36.101 §6.5.2.

12.4 LTE Sounding Reference Signal (SRS) TS 36.211 §5.5.3

The SRS is a UL reference signal used by the eNB for uplink channel sounding (wideband CSI acquisition, scheduling, beamforming). Key properties (TS 36.211 §5.5.3):

\[ a_{k_0 + k} = r^{(\alpha)}_u(k), \quad k = 0, 1, \ldots, M_{\rm SRS} - 1 \]
LTE SRS Frequency Hopping — BSRS=0 (40 RBs), 8 transmission instances
Each bar shows the subcarrier allocation for one SRS instance. Hopping covers the full sounding bandwidth while each instance occupies a narrow sub-band. TS 36.211 Table 5.5.3.3-1 (illustrative).

12.5 5G NR Physical Layer — Flexible Numerology TS 38.211 §4–5

5G NR uses CP-OFDM for both DL and UL, with DFT-s-OFDM as an optional UL mode. The key flexibility is numerology \(\mu\): subcarrier spacing scales as \(\Delta f = 2^\mu \times 15\,\text{kHz}\), TS 38.211 Table 4.3.2-1.

μΔf (kHz)Tu (μs) CPnormal (μs)Slots / subframe Symbols / slotBands
01566.674.69114FR1 (<1 GHz)
13033.332.34214FR1 sub-6 GHz
26016.671.17414FR1 / FR2
31208.330.59814FR2 (mmWave)
42404.170.291614FR2 reference only

All numerologies share the same 10 ms frame and 1 ms subframe boundary, enabling mixed-numerology operation within a single carrier (BWP switching). The slot duration is \(T_{\rm slot} = 14 \times (T_u + T_{\rm CP})\) for normal CP.

12.6 5G NR Downlink — CP-OFDM Signal Model TS 38.211 §7.4

The NR DL baseband signal is (TS 38.211 Eq. 7.4-1):

\[ s_{\mu,\ell}(t) = \sum_{k=0}^{N^{\mu}_{\rm FFT}-1} a^{(\mu)}_{k,\ell}\, e^{j2\pi\left(k - \tfrac{N^{\mu}_{\rm FFT}}{2}\right)\Delta f_\mu\,(t - N^{\mu}_{\rm CP}T_c)}, \quad 0 \le t < (N^{\mu}_{\rm CP}+N^{\mu}_{\rm FFT})\,T_c \]

where \(T_c = 1/(\Delta f_{\rm ref}\cdot N^{\rm ref}_{\rm FFT}) = 1/(480\,\text{kHz}\times 4096) \approx 0.509\,\text{ns}\) is the NR basic time unit; \(\Delta f_\mu = 2^\mu \times 15\,\text{kHz}\); and \(N^\mu_{\rm FFT}\) scales with bandwidth class and numerology.

Typical NR FFT sizes (100 MHz FR1, μ=1, Δf=30 kHz):

\[N_{\rm FFT}^{\mu=1} = 4096 \;\text{(for 100 MHz FR1, SCS 30 kHz)}\]

The NR resource grid spans up to 3300 subcarriers (275 RBs × 12) for 100 MHz / SCS 30 kHz — significantly wider than LTE, reflecting NR's flexible bandwidth part (BWP) architecture.

5G NR Numerology — OFDM Symbol Duration & CP vs μ (0 to 4)
Symbol useful duration Tu = 1/Δf halves with each μ step. Normal CP overhead ≈ 7% for all μ. Extended CP (μ=2 only) = 25% overhead. TS 38.211 Table 4.3.2-1.
5G NR Channel Bandwidth vs Numerology — Max RBs and Subcarriers
Max RBs per TS 38.101 Table 5.3.2-1. Wider bandwidths require larger μ in FR2. FR1: up to 100 MHz (μ=1/2); FR2: up to 400 MHz (μ=3).

12.7 5G NR Uplink — CP-OFDM and DFT-s-OFDM TS 38.211 §6.3

Unlike LTE, NR supports both CP-OFDM and DFT-s-OFDM on the uplink (TS 38.211 §6.3). The UE capability bit transformPrecoding (TS 38.214 §6.1.3) selects between the two:

The NR DFT-s-OFDM chain (TS 38.211 §6.3.1.4):

\[ y(n) = \text{IFFT}_{N_{\rm FFT}}\!\Bigl[ \underbrace{\text{FFT}_{M}\bigl(d_0,\ldots,d_{M-1}\bigr)}_{\text{DFT precoding}} \text{mapped to }[k_0, k_0+M) \Bigr] + \text{CP} \]

NR DFT-s-OFDM adds a key enhancement: spectral shaping (π/2-BPSK) reduces PAPR further to ~0 dB overhead, enabling 5G to operate in extreme coverage scenarios (TS 38.211 §6.3.1.1, NR PUSCH enhancement Rel-16).

NR UL Waveform — Time-Domain Envelope: CP-OFDM vs DFT-s-OFDM (N=512, M=60)
Normalised |s(n)| for a single OFDM symbol. CP-OFDM envelope fluctuates widely; DFT-s-OFDM envelope is compressed toward a near-constant amplitude, explaining its lower PAPR. TS 38.211 §6.3. Simulated with random QPSK symbols.

12.8 5G NR Sounding Reference Signal (SRS) TS 38.211 §6.4.1.4

NR SRS is significantly enhanced over LTE SRS (TS 38.211 §6.4.1.4):

FeatureLTE SRS (TS 36.211)NR SRS (TS 38.211)
Base sequenceZC, length MSRSZC or low-PAPR type 1/2, length MSRS,b
Antenna ports11–4 (Rel-15), up to 8 (Rel-16+)
Symbol duration1 symbol (last in subframe)1–4 consecutive symbols (configurable)
Periodicity2–160 ms1–5120 slots (any numerology)
Bandwidth4 hopping levels (mSRS,0..3)4 hopping levels (BSRS∈{0,1,2,3})
UseUL channel sounding, schedulingUL CSI, beamforming, beam management, codebook
Combcomb-2 only (every other SC)comb-2 or comb-4

The NR SRS sequence (TS 38.211 §6.4.1.4.2):

\[ r^{(p)}_{\rm SRS}(n) = e^{j\alpha_p n}\,\bar{r}_{u,v}(n\bmod M_{\rm ZC}), \quad n = 0, 1, \ldots, M_{\rm SRS,b}-1 \]

where \(\alpha_p = 2\pi n^{(p)}_{\rm cs}/N_{\rm cs}\) is the cyclic shift for antenna port \(p\); \(\bar{r}_{u,v}\) is a ZC base sequence of group \(u\), base sequence \(v\); and \(N_{\rm cs} \in \{8, 12\}\) (comb 2/4). Mapped to every \(K_{\rm TC}\)-th subcarrier (\(K_{\rm TC} = 2\) or \(4\) for comb-2/4):

\[ a^{(p)}_{k_0 + 2k} = r^{(p)}_{\rm SRS}(k), \quad k = 0, 1, \ldots, \frac{M_{\rm SRS,b}}{K_{\rm TC}}-1 \]
NR SRS Allocation — BSRS=2, comb-4, 4 antenna ports, 275 RBs (μ=1)
Each row = 1 SRS port; x-axis = frequency (RBs). Comb-4 occupies every 4th subcarrier within the allocation. Cyclic shifts separate the 4 ports in code domain. TS 38.211 Table 6.4.1.4.3-1.

12.9 LTE vs NR OFDM Signal Generation — Head-to-Head

LTE vs 5G NR — Key OFDM Signal Parameters at a Glance
Normalised scores across 6 dimensions (higher = more capable/flexible). NR gains on all axes except backward-compatibility simplicity. Ref: TS 36.211 vs TS 38.211.
Key takeaway: NR DL and UL both use CP-OFDM by default, making the air interface symmetric and enabling multi-layer MIMO on the UL. The optional DFT-s-OFDM UL mode preserves coverage for cell-edge UEs with limited PA power. The SRS enhancements (multi-port, multi-symbol, comb-4) in NR directly enable massive MIMO beamforming with accurate per-port channel estimates — this is the critical enabler of NR mmWave performance.
  1. LTE fixes NFFT = 2048 for 20 MHz while NR uses NFFT = 4096 for the same bandwidth. Why does the larger FFT improve performance in NR, and what is the cost in latency and processing complexity?
  2. An NR gNB serves a cell-edge UE in 5 MHz at μ=0. The UE selects DFT-s-OFDM UL with M=60 subcarriers. Derive the exact CP duration in nanoseconds and the slot duration in μs using TS 38.211 Table 4.3.2-1 values.
  3. NR SRS uses comb-4 with 4 antenna ports. How many SRS subcarriers per RB does each port occupy? Show that the 4 ports can be orthogonally separated in frequency using cyclic shifts alone, and identify the condition on Ncs for this to work.