Study Notebook — Signal Processing Series

OFDM-ISAC: Integrated Sensing & Communications

A Complete Study Notebook — 5G NR Rel-17 through Rel-20

y(t) = ∑n=0N−1m=0M−1 X(n,m) · ej2πnΔf(t−mT) · rect(t−mT)

This notebook provides a rigorous, end-to-end treatment of OFDM-based Integrated Sensing and Communications (ISAC) as standardized in 3GPP 5G NR Releases 17–20 and prototyped toward 6G. Starting from first principles of the OFDM waveform and progressing through channel models, reference-signal exploitation, MIMO beamforming, and Cramér–Rao bound analysis, each section combines rigorous mathematics with interactive Plotly visualizations and worked numerical examples. The target audience is signal processing engineers, 5G PHY developers, and researchers who need both theoretical depth and practical implementation insight for dual-function radar–communications systems.

5G NR 3GPP Rel-18/19
10
Sections
17
Interactive Charts
90+
3GPP Citations
50+
Key Equations

Table of Contents

§1 ISAC Fundamentals §6 Downlink Sensing & SIC
§2 OFDM Signal Model §7 PHY Sensing Chain
§3 Channel Models §8 MIMO-ISAC Beamforming
§4 SRS/DMRS Sensing §9 CRB & Pareto Analysis
§5 PRACH/PUCCH Sensing §10 3GPP Roadmap & 6G
Prerequisites

Readers should be comfortable with linear algebra (matrix operations, eigendecomposition) and probability theory (estimation theory, Fisher information). Basic DSP knowledge (FFT, convolution, windowing) is assumed throughout. Familiarity with the 5G NR air interface — including frame structure, numerologies, and reference signals (SRS, DMRS, PRACH) — is strongly recommended before Section 4 onward.

How to Use This Notebook

  1. Read sequentially or jump to any topic — each section is self-contained with internal cross-references where prior results are reused.
  2. All mathematics rendered via KaTeX — fully offline — no internet connection required; all fonts and the KaTeX library are bundled inline.
  3. Charts are interactive — hover over data points for exact values, click legend entries to isolate traces, and scroll-zoom or box-select any Plotly figure to explore fine detail.
  4. Study questions at the end of each section — work through these to consolidate understanding; selected answers are provided in collapsible panels to support self-assessment.
Version  2.0 | Standards  3GPP TS 38.211 / 38.300 / 22.837 / 23.700-97 | Date  June 2026 | Audience  5G PHY / Signal Processing Engineers
§1

ISAC Fundamentals

§1.1 Dual-Function Insight

Integrated Sensing and Communications (ISAC) unifies radar sensing and wireless data delivery onto a single platform — sharing hardware (antennas, RF chains, ADCs/DACs), spectrum, and waveform. Rather than operating two separate systems in adjacent bands, an ISAC transceiver transmits one signal that simultaneously illuminates targets in the environment and delivers information bits to communication receivers.

The fundamental enabler is that any radio channel carries implicit geometric information: path delays encode range, Doppler shifts encode velocity, and angle-of-arrival encodes spatial position. A communication receiver discards most of this as nuisance; an ISAC receiver harvests it.

Key Insight: Every communication waveform carries radar information — the channel itself is the sensing medium. The question is not whether sensing information is present, but whether the receiver is designed to extract it.
System Bandwidth Range Res. Vel. Res. Spectral Eff.
Dedicated Radar (LFM) Up to 2 GHz 7.5 cm @ 2 GHz Depends on CPI 0 bit/s/Hz (no comms)
Dedicated Comms (OFDM) 100–400 MHz N/A (no sensing) N/A 4–8 bit/s/Hz typical
ISAC (OFDM-ISAC) 100–400 MHz 1.5 m @ 100 MHz ≈0.1 m/s @ 10 ms \(R_c + \eta_s\) jointly

The aggregate spectral efficiency of an ISAC system can be written as:

\[ \eta_{\text{ISAC}} = R_c + \eta_s \] Eq. 1.0

where \(R_c\) (bit/s/Hz) is the Shannon communication rate and \(\eta_s\) is the sensing information rate — a measure of how much geometric/environmental state information is recovered per unit bandwidth per second. In practice, there is a sensing-communications tradeoff: waveform designs optimised for high \(\eta_s\) (e.g., flat power spectral density) are also near-optimal for \(R_c\), making OFDM a natural dual-function waveform.

§1.2 Core ISAC Signal Model

The canonical baseband ISAC signal model over a single OFDM frame aggregates all subcarriers and symbols into matrix form. For a monostatic or bistatic configuration with \(K\) point targets, the dominant single-target received signal is:

\[ \mathbf{Y} = \alpha_k \, \mathbf{H}_s \, \mathbf{X} + \mathbf{W} \] Eq. 1.1
Symbol Dimension Description
\(\mathbf{Y}\) \(\mathbb{C}^{N \times M}\) Received signal matrix; \(N\) subcarriers, \(M\) OFDM symbols
\(\alpha_k\) \(\mathbb{C}\) Complex reflection coefficient of the \(k\)-th target (encodes RCS and phase)
\(\mathbf{H}_s\) \(\mathbb{C}^{N \times N}\) Sensing channel matrix; diagonal in frequency domain with entries \(e^{-j2\pi n \Delta f \tau_k}\)
\(\mathbf{X}\) \(\mathbb{C}^{N \times M}\) Transmitted OFDM resource grid (pilot + data symbols)
\(\mathbf{W}\) \(\mathbb{C}^{N \times M}\) Additive complex Gaussian noise, \(\mathbf{W} \sim \mathcal{CN}(0, \sigma^2 \mathbf{I})\)

The sensing channel \(\mathbf{H}_s\) encodes the target's delay \(\tau_k\) (range) and Doppler shift \(\nu_k\) (velocity). In the frequency-time 2D grid, the target manifests as a phase progression: across subcarriers (dimension \(n\)) it is \(e^{-j2\pi n \Delta f \tau_k}\), and across symbols (dimension \(m\)) it is \(e^{j2\pi m T_s \nu_k}\), where \(T_s\) is the OFDM symbol duration including cyclic prefix.

Note: \(\alpha_k\) is complex and time-varying due to target motion and Swerling fluctuation. For sensing processing, pilot symbols with known \(\mathbf{X}\) allow direct channel estimation; data symbols require joint detection unless a decision-directed approach is used.

For \(K\) simultaneous targets, the model extends by superposition: \(\mathbf{Y} = \sum_{k=1}^{K} \alpha_k \mathbf{H}_s^{(k)} \mathbf{X} + \mathbf{W}\), and target separation is performed via 2D-CFAR or sparse recovery in the delay-Doppler domain.

§1.3 Range and Doppler Resolution

Four fundamental performance limits govern OFDM-ISAC sensing geometry, all derivable from the time-frequency structure of the waveform:

Range Resolution

Determined by the total occupied bandwidth \(B = N \cdot \Delta f\). The minimum resolvable distance between two targets is:

\[ \Delta R = \frac{c}{2B} \] Eq. 1.2

Velocity (Doppler) Resolution

Determined by the total coherent observation time \(T_{\text{obs}} = M \cdot T_s\). The minimum resolvable relative velocity is:

\[ \Delta v = \frac{\lambda}{2 T_{\text{obs}}} \] Eq. 1.3

Unambiguous Range

The maximum unambiguous range is set by the inter-subcarrier spacing \(\Delta f\), because a target delay of \(\tau = 1/\Delta f\) completes exactly one full phase cycle and is aliased back to zero delay:

\[ R_{\max} = \frac{c}{2 \, \Delta f} \] Eq. 1.4

Unambiguous Velocity

The maximum unambiguous velocity is set by the OFDM symbol rate (inverse of symbol duration \(T_s\)). In terms of subcarrier spacing and wavelength:

\[ v_{\max} = \frac{\lambda \, \Delta f}{2} \] Eq. 1.5
5G NR n77 Numerical Example (100 MHz BW, 30 kHz SCS, \(f_c\) = 3.5 GHz, \(\lambda\) = 85.7 mm):
  • \(\Delta R = c/(2 \times 100\,\text{MHz}) = 1.5\,\text{m}\)
  • \(\Delta v = 0.0857/(2 T_{\text{obs}})\) — at \(T_{\text{obs}} = 0.5\,\text{ms}\) (14 symbols): \(\approx 85.7\,\text{m/s}\); at \(T_{\text{obs}} = 400\,\text{ms}\) (1 frame): \(\approx 0.107\,\text{m/s}\)
  • \(R_{\max} = c/(2 \times 30\,\text{kHz}) = 5\,\text{km}\)
  • \(v_{\max} = 0.0857 \times 30\,\text{kHz}/2 = 1285\,\text{m/s}\)
mmWave (n257, 28 GHz, 400 MHz, \(\lambda\) = 10.7 mm): \(\Delta R = 0.375\,\text{m}\), \(R_{\max} = 625\,\text{m}\) (120 kHz SCS), \(v_{\max} = 642\,\text{m/s}\).

§1.4 Range Resolution vs. Bandwidth

Figure 1.1 — Range Resolution vs. Bandwidth (5G NR Band Markers)
Log-scale y-axis. Vertical dashed lines mark 5G NR band bandwidths. \(\Delta R = c/(2B)\). Range resolution improves (decreases) linearly with bandwidth; mmWave bands (n257, n261) achieve sub-metre resolution.

§1.5 Velocity Resolution vs. Observation Time

Figure 1.2 — Velocity Resolution vs. Observation Time
Log-scale y-axis. Higher carrier frequency (n257, 28 GHz) yields finer velocity resolution for the same observation window due to shorter wavelength. \(\Delta v = \lambda / (2 T_{\text{obs}})\).

§1.6 ISAC vs. Separate Systems

Three architectural approaches exist for providing both sensing and communications from a base station or access point:

Architecture Spectral Efficiency Hardware Cost SIC Requirement Range–Comm Tradeoff
ISAC (shared waveform) Highest — spectrum used once for both functions One RF chain, one PA, shared antennas High — self-interference cancellation (SIC) ≥ 100 dB needed in monostatic Fundamental; beamforming weights balance sensing gain vs. user throughput
Co-located Separate Low — two separate bands, no sharing 2× RF chains, 2× PA, possible antenna sharing Moderate — cross-system interference via antenna coupling only Independent optimisation but no joint gain
Spectrum Sharing (no HW sharing) Medium — same band, time/frequency-multiplexed 2× RF chains, scheduling coordination required Low to moderate — guard bands or TDD slot allocation reduces interference Soft tradeoff via slot allocation; no waveform-level integration
Pilot Reuse — Zero Overhead Sensing: In 5G NR, sounding reference signals (SRS) and demodulation reference signals (DMRS) are already transmitted with known sequences on a dense time-frequency grid. An ISAC-capable gNB can process these as radar waveforms at zero additional spectral overhead — effectively turning every UL/DL pilot into a free range-Doppler measurement. This is the single most compelling argument for NR-native ISAC.

3GPP ISAC Standardisation Timeline

Rel-17 (2022) → Study Item: ISAC use cases and requirements (TR 22.837) Identified: positioning, environmental mapping, gesture recognition Rel-18 (2024) → TS 22.837 normative KPIs defined KPIs: range accuracy ≤ 1 m, velocity accuracy ≤ 0.1 m/s, latency ≤ 100 ms Study Item on NR sensing (TR 22.837 complete) Rel-19 (2025–) → Work Item 1080070 ACTIVE: NR-based sensing air-interface enhancements Scope: waveform design, SRS extension, sensing-specific reference signals, unified positioning+sensing framework Rel-20 (2027+) → Full ISAC air-interface: joint waveform standardisation, beamforming codebooks for sensing, E2E ISAC architecture in NG-RAN

Study Questions

Q1. Why can OFDM waveforms achieve both communication and sensing simultaneously?

Show answer

OFDM transmits known (or estimable) symbols on a dense, regular time-frequency grid. Each received symbol \(Y[n,m]\) is the product of the transmitted symbol \(X[n,m]\) and the frequency-domain channel \(H[n,m]\), plus noise. For sensing, the receiver divides out \(X[n,m]\) (using pilots, or decisions from decoded data) to obtain \(\hat{H}[n,m]\), then applies a 2D-DFT to convert the channel response into the delay-Doppler domain — directly yielding range-velocity maps. Simultaneously, the communication receiver performs standard OFDM equalisation to recover bits. The two operations share the same ADC samples; no additional transmission resources are consumed.

Q2. What limits ISAC range resolution in 5G NR sub-6 GHz bands vs. mmWave?

Show answer

Range resolution \(\Delta R = c/(2B)\) depends only on occupied bandwidth \(B\), not carrier frequency. In sub-6 GHz (e.g., n77), the maximum channel bandwidth per carrier is 100 MHz, giving \(\Delta R = 1.5\,\text{m}\). In mmWave (n257/n258/n261), channels of 400 MHz or 2 GHz are available, yielding 0.375 m and 0.075 m respectively. The fundamental limit is thus regulatory/spectrum allocation: sub-6 GHz bands are narrower. Additionally, at sub-6 GHz, wideband LFM-style sensing is not possible within the NR numerology without aggregating many carriers, whereas a single mmWave carrier inherently spans hundreds of MHz.

Q3. Derive the unambiguous range formula from subcarrier spacing.

Show answer

Consider a target at delay \(\tau\). In the OFDM frequency-domain channel, the phase on subcarrier \(n\) is \(\phi_n = -2\pi n \Delta f \tau\). The phase between adjacent subcarriers is \(\Delta\phi = -2\pi \Delta f \tau\).

Ambiguity arises when \(\Delta\phi = -2\pi\), i.e., when \(\Delta f \, \tau = 1\), because the phase wraps and a delay of \(\tau + 1/\Delta f\) is indistinguishable from delay \(\tau\). Therefore the unambiguous delay range is: \[ \tau_{\max} = \frac{1}{\Delta f} \] Converting to range via \(R = c\tau/2\) (two-way): \[ R_{\max} = \frac{c \, \tau_{\max}}{2} = \frac{c}{2 \, \Delta f} \] For 5G NR with \(\Delta f = 30\,\text{kHz}\): \(R_{\max} = 3\times10^8 / (2 \times 30\times10^3) = 5000\,\text{m}\). Note this is the CP length analogue — the CP must be \(\geq \tau_{\max}\) for ISI-free communications, but sensing ambiguity is resolved by the full \(1/\Delta f\) window.

§2

OFDM Signal Model for ISAC

§2.1 OFDM Transmit Signal

An OFDM waveform carrying \(N\) subcarriers over \(M\) symbols is written as a double sum over frequency indices \(n\) and symbol indices \(m\). Each subcarrier is modulated by a complex data/pilot symbol \(x_{n,m}\), shifted in frequency by the subcarrier spacing \(\Delta f\), and windowed to one OFDM symbol period \(T_s = 1/\Delta f + T_{cp}\):

\[ s(t) = \frac{1}{\sqrt{N}} \sum_{n=0}^{N-1} \sum_{m=0}^{M-1} x_{n,m} \, e^{j2\pi n\Delta f (t - mT_s)} \,\text{rect}\!\left(\frac{t - mT_s}{T_s}\right) \] (2.1)
  • \(N\) — number of subcarriers; \(M\) — number of OFDM symbols in one burst.
  • \(\Delta f\) — subcarrier spacing (SCS); \(T_s = 1/\Delta f + T_{cp}\) — total symbol duration including cyclic prefix.
  • \(x_{n,m} \in \mathbb{C}\) carries QAM data on data subcarriers and known pilots on pilot subcarriers.

In ISAC operation the resource grid is partitioned into pilot and data sub-regions. Pilot positions — typically arranged as a regular comb or as 3GPP DMRS — carry known symbols \(x_{n,m} = p_{n,m}\) with \(|p_{n,m}|^2 = P_p\). The remaining resource elements carry QAM payload. The pilot pattern simultaneously enables coherent channel estimation for communications and provides the reference signal needed for range-Doppler sensing.

§2.2 Sensing Channel Matrix

For a scene containing \(K\) point targets, the received frequency-domain channel on the \((n,m)\)-th resource element is the superposition of delayed and Doppler-shifted replicas:

\[ H_s[n,m] = \sum_{k=1}^{K} \alpha_k \, e^{-j2\pi n\Delta f \,\tau_k} \, e^{j2\pi m T_s \nu_k} \] (2.2)

where \(\tau_k = 2R_k/c\) is the round-trip delay to target \(k\) at range \(R_k\), and \(\nu_k = 2v_k/\lambda\) is the Doppler frequency induced by radial velocity \(v_k\). The phase across subcarriers encodes delay; the phase across symbols encodes Doppler.

After pilot-based channel division the per-resource-element channel estimate is:

\[ \hat{H}_s[n,m] = \frac{Y[n,m]}{X[n,m]} = H_s[n,m] + \tilde{W}[n,m] \] (2.2b)

where \(Y[n,m]\) is the received symbol, \(X[n,m]\) the transmitted pilot, and \(\tilde{W}[n,m] = W[n,m]/X[n,m]\) is noise scaled by the inverse pilot amplitude.

Insight. The 2D DFT of \(\hat{H}_s\) directly gives the range-Doppler map — no additional waveform design needed. The OFDM resource grid is already a 2D sample of the delay-Doppler domain.

§2.3 Pilot Allocation Strategies

The choice of pilot pattern trades off sensing SNR (determined by pilot density) against communication throughput (overhead fraction consumed by pilots). Four canonical strategies:

Strategy Coverage Sensing SNR Comm overhead 3GPP reference
Full-grid pilots All \(N \times M\) REs Maximum (0 dB loss) 100 % (no data) Non-standard / radar-only
Comb-4 Every 4th subcarrier, all symbols \(-6\) dB vs full-grid 25 % NR SRS (TS 38.211 §6.4.1.4)
Comb-12 Every 12th subcarrier, all symbols \(-10.8\) dB vs full-grid 8.3 % NR SRS large comb (TS 38.211)
DMRS (3GPP) Freq-comb + 1–4 symbol rows Lower; symbol-limited Doppler 3–14 % (config-dep.) NR PDSCH DMRS Type 1/2 (TS 38.211 §7.4.1.1)
Algorithm 2.1 — OFDM Sensing Pipeline
  1. Extract pilots from received grid \(Y[n,m]\) at known pilot positions \((n,m) \in \mathcal{P}\).
  2. Divide by known pilot symbols: \(\hat{H}_s[n,m] = Y[n,m] / X[n,m]\) for \((n,m) \in \mathcal{P}\).
  3. Optionally interpolate \(\hat{H}_s\) to the full \(N \times M\) grid (improves ambiguity sidelobes).
  4. Apply 2D-FFT with windowing: \(Z[l,p] = \text{IFFT}_l\{\text{FFT}_p\{\hat{H}_s[n,m]\}\}\) → range-Doppler map.
  5. Apply CFAR detector (CA-CFAR or OS-CFAR) to \(|Z[l,p]|^2\) → target list \(\{(\hat{l}_i, \hat{p}_i)\}\).
  6. Map indices to physical quantities: \(\hat{R}_i = \hat{l}_i \cdot c / (2 N \Delta f)\), \(\hat{v}_i = \hat{p}_i \cdot \lambda / (2 M T_s)\).

§2.4 Range-Doppler via 2D-FFT

The range-Doppler map is the 2D DFT of the channel estimate matrix:

\[ Z[l,p] = \sum_{n=0}^{N-1} \sum_{m=0}^{M-1} \hat{H}_s[n,m] \, e^{-j2\pi nl/N} \, e^{j2\pi mp/M} \] (2.3)

Substituting Eq. (2.2) into (2.3), the peak for target \(k\) appears at bin indices:

  • Range bin: \(l^* = \tau_k N \Delta f = \frac{2R_k}{c / (N\Delta f)} = \frac{2R_k}{c/B}\), where \(B = N\Delta f\) is the signal bandwidth. Range resolution \(\delta R = c/(2B)\).
  • Doppler bin: \(p^* = \nu_k M T_s = \frac{2v_k M T_s}{\lambda}\), where \(T_{obs} = M T_s\) is the coherent processing interval. Velocity resolution \(\delta v = \lambda/(2 T_{obs})\).
Caution. The cyclic prefix length \(T_{cp}\) is included in \(T_s\), so the Doppler bin spacing depends on \(T_s = 1/\Delta f + T_{cp}\), not just \(1/\Delta f\). Ignoring this causes a systematic velocity bias of \(\approx \Delta f \cdot T_{cp} \cdot v\). For NR 30 kHz SCS with normal CP (\(T_{cp} \approx 2.34\;\mu\text{s}\)), the relative bias is \(\approx 6.6\%\) per target velocity.

§2.5 OFDM Ambiguity Function

The OFDM ambiguity function \(\chi(\tau,\nu)\) characterises the delay-Doppler resolution and sidelobe structure of the waveform. For a rectangular pilot-loaded OFDM burst it factorises as a product of sinc functions: \[ |\chi(\tau,\nu)|^2 \approx \left|\operatorname{sinc}(\tau B)\right|^2 \left|\operatorname{sinc}(\nu T_{obs})\right|^2 \] The \(-3\;\text{dB}\) delay mainlobe width is \(1/B\) and Doppler mainlobe width is \(1/T_{obs}\), confirming the resolution formulas above.

Figure 2.1 — OFDM Ambiguity Function χ(τ,ν)
OFDM sinc² × sinc² separable ambiguity function (dB scale, clipped at −40 dB). N = 64 subcarriers, M = 14 symbols, SCS = 30 kHz, B = 1.92 MHz, Tobs ≈ 0.5 ms.

§2.6 Range-Doppler Map Simulation

A Monte-Carlo realisation of the range-Doppler map is obtained by constructing \(\hat{H}_s[n,m]\) from three point targets, adding AWGN at SNR = 15 dB, and applying the 2D-DFT of Eq. (2.3). Target parameters:

  • Target 1: \(R=150\;\text{m}\), \(v=+12.9\;\text{m/s}\), amplitude 1.0 (reference).
  • Target 2: \(R=300\;\text{m}\), \(v=-5\;\text{m/s}\), amplitude 0.6 (\(-4.4\;\text{dB}\)).
  • Target 3: \(R=80\;\text{m}\), \(v=+30\;\text{m/s}\), amplitude 0.8 (\(-1.9\;\text{dB}\)).
Figure 2.2 — Simulated Range-Doppler Map (3 Targets + Noise, SNR = 15 dB)
2D-FFT of sensing channel estimate \(\hat{H}_s[n,m]\). NR FR1 parameters: N = 64, M = 14, SCS = 30 kHz, fc = 3.5 GHz. Range resolution \(\delta R = c/(2B) \approx 78\;\text{m}\); velocity resolution \(\delta v = \lambda/(2T_{obs}) \approx 3.1\;\text{m/s}\).

§2.7 Study Questions

  1. OFDM sensing SNR. Show that for a single target with round-trip SNR \(\rho\), the post-2D-FFT peak SNR is \(\rho_{RD} = \rho \cdot N \cdot M_p\), where \(M_p \leq M\) is the number of pilot symbols used. What design choice maximises sensing SNR without reducing communication throughput, and what is the fundamental trade-off?
  2. Cyclic prefix and unambiguous range. The CP duration \(T_{cp}\) sets the maximum unambiguous range \(R_{\max} = c\,T_{cp}/2\). For NR numerology \(\mu=1\) (SCS 30 kHz, normal CP \(T_{cp}\approx 2.34\;\mu\text{s}\)), compute \(R_{\max}\). If a target at \(R > R_{\max}\) reflects energy back, explain what artefact appears in the range-Doppler map and how it can be mitigated.
  3. Why 2D-FFT works for ISAC. Starting from Eq. (2.2), show algebraically that \(Z[l,p]\) defined by Eq. (2.3) yields a peak at \((l^*, p^*) = (\tau_k N\Delta f,\; \nu_k M T_s)\). Under what conditions on \(\tau_k\) and \(\nu_k\) does the 2D-DFT approximation break down (i.e., when does cross-term leakage become significant)?
§3

Channel Models for ISAC

§3.1 Channel Impulse Response and Jakes Doppler Spectrum

In the delay-Doppler domain, a wideband multipath channel is fully characterised by its spreading function. For a scene with \(K\) discrete scatterers the channel impulse response is a sum of weighted Dirac masses:

\[ h(\tau, \nu) = \sum_{k=1}^{K} \alpha_k \, \delta(\tau - \tau_k) \, \delta(\nu - \nu_k) \] (3.1)

where \(\alpha_k \in \mathbb{C}\) is the complex path gain (incorporating free-space loss, reflection coefficient, and antenna patterns), \(\tau_k\) is the propagation delay, and \(\nu_k\) is the Doppler shift. In a monostatic sensing geometry \(\tau_k = 2R_k/c\) and \(\nu_k = 2v_k f_c/c\).

For a communication link with a large number of unresolved scatterers, the Doppler spectrum of a single cluster converges (by the central-limit argument) to the classical Jakes U-shaped spectrum. If the scatterers are uniformly distributed in azimuth and the maximum Doppler shift is \(f_D = v f_c / c\), then:

\[ S(\nu) = \frac{1}{\pi f_D \sqrt{1 - (\nu/f_D)^2}}, \qquad |\nu| \leq f_D \] (3.2)

The \(1/\sqrt{\cdot}\) singularity at \(\nu = \pm f_D\) reflects the larger probability of scatterers near broadside. The Jakes spectrum is fundamental to the level-crossing rate and average fade duration statistics used to dimension HARQ retransmission timers.

ISAC relevance. In sensing mode each target contributes a single spike \(\delta(\nu - \nu_k)\) — completely unlike the continuous Jakes spectrum assumed by communications channel estimators. ISAC algorithms must be designed to exploit (or at least tolerate) this dichotomy.

§3.2 CDL-A Power Delay Profile

3GPP TR 38.901 defines the Clustered Delay Line A (CDL-A) model as the canonical NLOS sub-urban macro channel for 5G NR simulations. Each cluster \(l\) has a tabulated relative power \(P_{dB,l}\) in dB; the linear power is:

\[ P(\tau_l) = 10^{P_{dB,l}/10} \] (3.2a)

The first 8 clusters (of 23 total) are listed below. The full profile has an RMS delay spread \(\sigma_\tau \approx 40\;\text{ns}\) (at the reference DS = 40 ns scaling).

Cluster Delay \(\tau_l\) (ns) Rel. power \(P_{dB,l}\) (dB) AoD spread (°) AoA spread (°)
100.05.011.0
210−2.25.99.0
320−3.56.310.2
430−5.07.812.6
540−6.15.68.5
650−7.39.713.1
760−8.18.411.9
870−9.06.210.7
Figure 3.1 — CDL-A Power Delay Profile (3GPP TR 38.901)
Insight. 3GPP TR 38.901 CDL-A models a NLOS suburban macro environment. With \(\sigma_\tau \approx 40\;\text{ns}\), the coherence bandwidth is \(B_c \approx 1/(2\pi\cdot 40\;\text{ns}) \approx 4\;\text{MHz}\). NR FR1 subcarrier spacings of 15–120 kHz are far below \(B_c\), so each subcarrier sees a nearly flat fade — the key assumption behind Eq. (2.2). For ISAC the 1000 ns tail sets \(R_{\max}^{clutter} = c\tau_{\max}/2 = 150\;\text{m}\) of multipath clutter that must be suppressed before target detection.

§3.3 Coherence Bandwidth and Coherence Time

Two reciprocal coherence parameters govern OFDM system design and set the sensing resolution ceilings:

\[ B_c \approx \frac{1}{2\pi \sigma_\tau} \] (3.3a)
\[ T_c \approx \frac{0.423}{f_D} \] (3.3b)

where \(\sigma_\tau\) is the RMS delay spread and \(f_D = v f_c / c\) is the maximum Doppler shift. The factor 0.423 is derived from the Clarke autocorrelation \(J_0(2\pi f_D T_c) = 0.5\) (\(-3\;\text{dB}\) correlation). Coherent ISAC processing requires that the waveform bandwidth \(B \leq B_c\) (flat fading per subcarrier) and CPI duration \(T_{obs} \leq T_c\) (channel stationarity across symbols).

Figure 3.2 — Coherence Bandwidth vs. RMS Delay Spread & Coherence Time vs. Doppler
Blue (left axis): \(B_c\) vs \(\sigma_\tau\) (ns). Red (right axis): \(T_c\) vs \(f_D\) (Hz). Both axes share the same horizontal sweep variable (1–100 units).

§3.4 Scattering Environment Classifications

The five canonical propagation environments span the range from sparse-scattering indoor to dense urban and high-mobility vehicular scenarios. Key ISAC implications follow from \(B_c\) and \(T_c\):

Environment \(\sigma_\tau\) (ns) \(B_c\) (kHz) \(f_D\) (Hz) @ 60 km/h \(T_c\) (ms) ISAC implication
Urban macro (NLOS) 300–500 320–530 194 (3.5 GHz) 2.2 Rich clutter; \(B_c\) limits useful bandwidth; short CPI needed
Suburban macro (CDL-A) 40–100 1,600–4,000 194 2.2 Moderate clutter; 20 MHz NR BWP typically flat per sub-carrier
Indoor office (CDL-D) 10–30 5,300–16,000 8 (pedestrian) 53 Long \(T_c\) → long CPI viable; close-range, dense clutter
Highway / V2X 100–300 530–1,600 972 (5.9 GHz, 180 km/h) 0.43 High Doppler dominates; very short CPI; velocity resolution critical
mmWave outdoor (28 GHz) 5–20 8,000–32,000 1,556 (60 km/h, 28 GHz) 0.27 Quasi-optical, sparse; high \(B_c\) enables cm-level ranging; tiny \(T_c\)
Design warning. For V2X at 5.9 GHz, \(T_c \approx 0.43\;\text{ms}\) corresponds to fewer than one NR slot at 30 kHz SCS (\(T_{slot}=0.5\;\text{ms}\)). Standard OFDM channel tracking will stale within a single subframe — dedicated fast channel prediction or OTFS-based waveforms are required.

§3.5 ISAC Sensing Channel vs. Communication Channel

The fundamental duality in ISAC arises because the same physical electromagnetic channel is simultaneously used for two structurally different operations:

Property Sensing channel (monostatic) Communication channel (downlink)
Geometry Same TX/RX site (co-located) Separated TX (gNB) and RX (UE)
Reflection type Specular / quasi-deterministic Diffuse (rich scattering)
Delay-Doppler Discrete spikes at \((\tau_k, \nu_k)\) Continuous spread; ergodic
Channel model Point-scatterer model, Eq. (3.1) CDL/TDL statistical model
Estimation goal Detect \((\tau_k, \nu_k)\) with high resolution Estimate \(H[n,m]\) for coherent demodulation
Self-interference Strong TX-to-RX leakage (needs SIC) Absent (half-duplex NR)

Both channels share the same physical medium; the total received channel matrix decomposes as:

\[ \mathbf{H}_{\text{comm}} = \mathbf{H}_{s} + \mathbf{H}_{\text{scatter}} \] (3.3)

where \(\mathbf{H}_{s}\) is the structured (specular, deterministic) component exploited for sensing, and \(\mathbf{H}_{\text{scatter}}\) is the diffuse (random, rich-scattering) component that dominates in the communication receiver. The ISAC challenge is to simultaneously:

  • Reliably demodulate QAM data using the total \(\mathbf{H}_{\text{comm}}\).
  • Estimate target parameters from the deterministic component \(\mathbf{H}_s\) in the presence of \(\mathbf{H}_{\text{scatter}}\) acting as clutter.
Insight. ISAC exploits the deterministic component of \(\mathbf{H}\) for sensing while communication treats the entire channel as a random process to be estimated and equalised. The separation \(\mathbf{H}_{\text{comm}} = \mathbf{H}_{s} + \mathbf{H}_{\text{scatter}}\) is not observable from a single snapshot — it requires either prior knowledge of the radar cross-section (for clutter cancellation) or statistical averaging over many OFDM bursts to distinguish the coherent component from the incoherent background.

§3.6 Study Questions

  1. CDL-A vs ITU-R models for 5G ISAC. Older ITU-R channel models (e.g., ITU-R M.1225 pedestrian/vehicular) use a small number of taps with fixed delays and powers. CDL-A uses 23 clusters with angle-of-departure and angle-of-arrival statistics. Explain: (a) why CDL-A is preferred for massive-MIMO ISAC where spatial filtering of clutter is critical; (b) what information present in CDL-A is absent from ITU-R pedestrian-A; and (c) how the angular spread of CDL-A clusters affects the spatial covariance matrix \(\mathbf{R} = \mathbb{E}[\mathbf{h}\mathbf{h}^H]\) used in beamforming design.
  2. Delay spread limiting radar range. In a bistatic or monostatic ISAC scenario, the clutter-free unambiguous range window is bounded by the channel delay spread: targets at \(R_k > c(\tau_{\max} - \sigma_\tau)/2\) are obscured by clutter from the last resolvable cluster. Using CDL-A parameters, compute the effective clutter-free range window for NR FR1 with SCS 30 kHz (normal CP, \(T_{cp} = 2.34\;\mu\text{s}\)). How does increasing \(\Delta f\) to 60 kHz help?
  3. Deriving \(B_c\) from the frequency correlation function. The frequency correlation function of the channel is \(R_H(\Delta f) = \mathbb{E}[H(f) H^*(f+\Delta f)]\). For a single-cluster exponential PDP \(P(\tau) = \frac{1}{\sigma_\tau}e^{-\tau/\sigma_\tau}\), show that \(R_H(\Delta f) = (1 + j2\pi\Delta f\,\sigma_\tau)^{-1}\). Hence derive that \(B_c = 1/(2\pi\sigma_\tau)\) as the \(-3\;\text{dB}\) bandwidth of \(|R_H(\Delta f)|^2\), confirming Eq. (3.3a).
§4 SRS / DMRS Sensing — Pilot Extraction, CRB & Range-Doppler Processing

4.1  SRS Pilot Extraction

In NR, the Sounding Reference Signal (SRS) is transmitted by the UE on the uplink in a configurable set of OFDM symbols and subcarriers. Because the gNB already knows the exact time-frequency resource grid allocation and the SRS sequence, it can treat each SRS transmission as a known pilot burst and extract a monostatic or bistatic channel estimate purely from the received signal — giving a zero-cost sensing observation piggy-backed on the communication waveform.

SRS Resource Mapping (NR)

SRS occupies the last \(N_{SRS}^{symb} \in \{1,2,4\}\) OFDM symbols of a slot. The comb structure (KTC \(\in \{2,4,8\}\)) interleaves multiple UEs in frequency. For a single UE with comb offset \(k_0\) and starting physical resource block \(n_{PRB,start}\), the set of occupied subcarriers is

\[ \mathcal{K}_{SRS} = \bigl\{\, k_0 + K_{TC}\cdot m \;\Big|\; m = 0,1,\ldots,M_{SRS}-1 \bigr\}, \quad M_{SRS} = \frac{N_{RB}^{SRS}\cdot 12}{K_{TC}} \] (4.1)

where \(N_{RB}^{SRS}\) is the SRS bandwidth in resource blocks and \(K_{TC}\) is the comb size. After reception the gNB extracts the pilot vector \(\mathbf{y}_p \in \mathbb{C}^{M_{SRS}}\) and forms the per-subcarrier channel estimate:

\[ \hat{H}_{sens}[k] = \frac{Y[k]}{X_p[k]}, \quad k \in \mathcal{K}_{SRS} \] (4.2)

In matrix form, collecting \(L\) consecutive SRS OFDM symbols:

\[ \mathbf{Y} = \mathbf{H} \cdot \mathbf{X}_p + \mathbf{N}, \qquad \mathbf{Y},\mathbf{N} \in \mathbb{C}^{M_{SRS}\times L},\; \mathbf{X}_p \in \mathbb{C}^{M_{SRS}\times L}\;\text{(diagonal per symbol)} \] (4.3)

The least-squares channel estimate across all pilot resources is then \(\hat{\mathbf{H}} = \mathbf{Y}\,\mathbf{X}_p^{-1}\). Because \(\mathbf{X}_p\) is diagonal (single-carrier pilots), inversion is trivially element-wise division. The resulting \(\hat{\mathbf{H}}\) represents the delay-Doppler transfer function sampled on the SRS grid — the raw input to all downstream sensing algorithms.

Key insight — sensing channel estimate. \(\hat{H}_{sens}[k,l]\) is the 2D sampled channel function in (subcarrier, OFDM symbol) space. Its 2D-IFFT yields the range-Doppler map directly. Every target appears as a peak at \((n_\tau, n_\nu)\) corresponding to its round-trip delay and Doppler shift. The SRS grid spacing \(\Delta f = 15\,\text{kHz}\cdot 2^\mu\) in frequency and \(T_{sym}\) in time set the unambiguous range and velocity windows.

SRS vs DMRS — Sensing Capability Comparison

Parameter SRS (Uplink) DMRS (Uplink/Downlink)
Max bandwidth Up to full UE BW (e.g., 100 MHz for FR1) Allocated PUSCH/PDSCH BW only (often narrower)
Periodicity Configurable: 1, 2, 4, 5, 8, 10, 16, 20 ms Every scheduled slot (1 ms @ SCS 15 kHz)
Sequence length 12–272 RBs × 12/KTC subcarriers DMRS pattern: 2–4 symbols, reduced density
Range resolution \(\delta R = c / (2B) \approx 1.5\,\text{m}\) @ 100 MHz Limited by allocated BW; typically worse
Velocity resolution \(\delta v = \lambda / (2T_{CPI})\); CPI set by SRS period Better per-slot availability; limited CPI accumulation
Multi-UE orthogonality Comb + cyclic shift (up to KTC×8 UEs) CDM groups per DMRS port
Sensing mode fit Monostatic / bi-static range + velocity Bistatic Doppler (good), range (limited BW)
3GPP reference TS 38.211 §6.4.1.4 TS 38.211 §6.4.1.1 / §7.4.1.1

4.2  Cramér–Rao Bound for Range & Velocity

The Cramér–Rao Bound (CRB) gives the theoretical minimum variance achievable by any unbiased estimator of a deterministic parameter. For OFDM-ISAC the two primary target parameters are the round-trip delay \(\tau\) (mapped to range via \(R = c\tau/2\)) and the Doppler shift \(\nu\) (mapped to velocity via \(v = \lambda\nu/2\)).

Fisher Information Matrix

For a complex Gaussian observation model \(\mathbf{y} \sim \mathcal{CN}(\boldsymbol{\mu}(\boldsymbol{\theta}),\,\sigma_n^2 \mathbf{I})\), where \(\boldsymbol{\theta} = [\tau,\nu]^T\), the Fisher Information Matrix (FIM) is

\[ \mathbf{J}(\boldsymbol{\theta}) = \frac{2}{\sigma_n^2} \,\text{Re}\!\left\{ \frac{\partial \boldsymbol{\mu}^H}{\partial \boldsymbol{\theta}} \frac{\partial \boldsymbol{\mu}}{\partial \boldsymbol{\theta}^T} \right\} \] (4.4)

For an OFDM waveform with \(N\) subcarriers (subcarrier spacing \(\Delta f\), total bandwidth \(B = N\Delta f\)) and \(L\) OFDM symbols (observation time \(T_{obs} = L \cdot T_{sym}\)), the FIM is diagonal (delay and Doppler are asymptotically decoupled) and the CRB simplifies to:

\[ \text{CRB}(\tau) \;=\; \frac{3}{8\pi^2\cdot\text{SNR}\cdot B^2} \quad\Longrightarrow\quad \sigma_{R,\min} = \frac{c}{2}\sqrt{\frac{3}{8\pi^2\cdot\text{SNR}\cdot B^2}} \] (4.5)
\[ \text{CRB}(\nu) \;=\; \frac{3}{8\pi^2\cdot\text{SNR}\cdot T_{obs}^2} \quad\Longrightarrow\quad \sigma_{v,\min} = \frac{\lambda}{2}\sqrt{\frac{3}{8\pi^2\cdot\text{SNR}\cdot T_{obs}^2}} \] (4.6)

The factor of 3 arises from the second moment of the rectangular spectral / temporal support: for a uniform distribution over \([-B/2, B/2]\), \(\mathbb{E}[f^2] = B^2/12\), so \(8\pi^2 \mathbb{E}[f^2] = 8\pi^2 B^2/12 = 2\pi^2 B^2/3\), and its inverse gives the factor of \(3/(2\pi^2 B^2)\) — which absorbs the 2-way propagation factor to yield (4.5).

Fundamental limit. The CRB sets the minimum variance — no unbiased estimator can do better regardless of algorithm. It is achieved asymptotically by the Maximum Likelihood Estimator (MLE) at high SNR. In practice, matched-filter / 2D-FFT estimators approach the CRB in the asymptotic regime (SNR > ~10 dB) but diverge below the threshold SNR due to sidelobe interference and noise-induced outliers.
Watch out — SNR definition. In equations (4.5)–(4.6), SNR is per subcarrier, not total received SNR. If the total received signal power is \(P_r\) and noise PSD is \(N_0\), then \(\text{SNR}_{sc} = P_r / (N_0 \cdot \Delta f)\). Confusing total and per-subcarrier SNR leads to optimistic CRB estimates by a factor of \(N\) (number of subcarriers).

CRB vs SNR — Chart

CRB: Range & Velocity Estimation Lower Bound vs SNR (B = 100 MHz, Tobs = 5 ms)

Log-scale CRB curves computed from (4.5) and (4.6). The shaded region marks the NR SRS typical operating SNR (10–25 dB). At 15 dB SNR, range 1-σ bound ≈ 0.12 m and velocity 1-σ bound ≈ 0.06 m/s with these parameters.

4.3  2D-FFT CFAR Detection Pipeline

The canonical NR-ISAC sensing pipeline converts the extracted pilot channel estimates into a Range-Doppler (RD) map via a 2D-IFFT and then applies Constant False Alarm Rate (CFAR) detection to localise targets. The steps below assume \(M\) frequency-domain pilots per symbol and \(L\) consecutive SRS symbols.

Algorithm 4.1 — 2D-FFT CFAR Sensing Pipeline

Input: Received OFDM grid \(Y[k,l]\), known pilot matrix \(X_p[k,l]\), \(k \in \mathcal{K}_{SRS}\), \(l = 0,\ldots,L-1\)
Output: Detected target list \(\{(\hat{R}_i, \hat{v}_i, \hat{\alpha}_i)\}\)

Step 1 — Extract pilot OFDM symbols.
 Identify slot indices carrying SRS (from RRC config), demodulate (remove CP), apply FFT, extract subcarriers \(\mathcal{K}_{SRS}\):
 \(Y[k,l] \leftarrow \text{FFT}\bigl(r[n,l]\bigr)\big|_{k \in \mathcal{K}_{SRS}}\)

Step 2 — Divide by known pilots → channel estimates.
 Element-wise LS channel estimate per resource element:
 \(\hat{H}[k,l] = Y[k,l]\;/\;X_p[k,l], \quad \forall\,(k,l)\)
 Optional: apply 2D Wiener filter \(\hat{H}_{MMSE} = \mathbf{W}_{MMSE}\,\hat{H}\) for SNR > 0 dB.

Step 3 — Apply 2D-IFFT → Range-Doppler map.
 Zero-pad \(\hat{H}\) to \((N_{FFT,R} \times N_{FFT,D})\) and apply windowing (e.g., Hann in both dims to suppress sidelobes):
 \(\mathbf{G}[n_\tau, n_\nu] = \text{IDFT}_M\bigl\{\text{DFT}_L\bigl\{\hat{H}[k,l]\cdot w[k]\cdot w[l]\bigr\}\bigr\}\)
 Range axis: \(R = n_\tau \cdot \frac{c}{2B}\);  Velocity axis: \(v = n_\nu \cdot \frac{\lambda}{2\,T_{obs}}\)

Step 4 — CFAR thresholding.
 For each cell \((n_\tau, n_\nu)\), estimate local noise power from a sliding window (Cell-Averaging CA-CFAR):
 \(\hat{\sigma}^2[n_\tau,n_\nu] = \frac{1}{N_{ref}}\sum_{(i,j)\in\mathcal{W}_{ref}}|\mathbf{G}[i,j]|^2\)
 Detection threshold: \(T[n_\tau,n_\nu] = \alpha_{CFAR}\cdot\hat{\sigma}^2\) where \(\alpha_{CFAR} = N_{ref}\bigl(P_{fa}^{-1/N_{ref}}-1\bigr)\) for CA-CFAR.
 Declare detection if \(|\mathbf{G}[n_\tau,n_\nu]|^2 > T[n_\tau,n_\nu]\).

Step 5 — Peak detection and clustering.
 Apply non-maximum suppression (NMS) to the detection map.
 Cluster adjacent detections (e.g., DBSCAN or simple connected-components).
 Interpolate peak position for sub-bin accuracy:
 \(\hat{n}_\tau = n_{\tau,peak} - \frac{1}{2}\cdot\frac{|\mathbf{G}[n_\tau{+}1]|^2 - |\mathbf{G}[n_\tau{-}1]|^2}{2|\mathbf{G}[n_\tau]|^2 - |\mathbf{G}[n_\tau{+}1]|^2 - |\mathbf{G}[n_\tau{-}1]|^2}\)
 Output target parameters: \(\hat{R}_i = \hat{n}_{\tau,i}\cdot\frac{c}{2B}\), \(\hat{v}_i = \hat{n}_{\nu,i}\cdot\frac{\lambda}{2T_{obs}}\), \(\hat{\alpha}_i = |\mathbf{G}[\hat{n}_{\tau,i},\hat{n}_{\nu,i}]|\).
CFAR false-alarm control. CA-CFAR maintains a constant \(P_{fa}\) independently of the local clutter power level by adapting the threshold cell-by-cell. Typical operating points: \(P_{fa} = 10^{-4}\) with \(N_{ref} = 32\) guard+reference cells. In NR-ISAC the guard cells must span at least the range/Doppler extent of one target spread (2–4 bins each dimension with Hann windowing).

Simulated 2D Range-Doppler Map

The heatmap below simulates the output of Step 3 for a 3-target scene: Target 1 at (50 m, +5 m/s), Target 2 at (120 m, −12 m/s), Target 3 at (200 m, +20 m/s). A Gaussian spread models the sinc-like peak after windowing, and a −40 dB noise floor is applied.

Simulated 2D Range-Doppler Map (NR SRS, 100 MHz BW)

Colour scale in dB. Three targets are clearly visible above the −40 dB noise floor. In practice, CFAR detection (Step 4) would threshold this map before peak extraction. Range resolution ≈ 1.5 m; Doppler resolution ≈ 0.43 m/s (with Tobs = 5 ms and λ ≈ 85.7 mm @ 3.5 GHz).

Section 4 — Key Design Takeaways

Range limited by BW
Range resolution \(\delta R = c/(2B)\) depends only on bandwidth. Doubling B from 50 to 100 MHz halves the range resolution from 3 m to 1.5 m. SRS bandwidth config directly controls sensing acuity.
Velocity limited by Tobs
Doppler resolution \(\delta v = \lambda/(2T_{obs})\) scales with coherent processing interval. Longer SRS periodicity chains (multi-slot CPI) are needed for fine velocity discrimination.
2D decoupling property
For rectangular time-frequency support, delay and Doppler FIM entries are asymptotically zero — range and velocity can be estimated independently at moderate-to-high SNR.
CFAR + NMS essential
Raw 2D-FFT output includes sidelobes −13 dB below mainlobe (no window) or −32 dB (Hann). CFAR + non-maximum suppression prevents sidelobe false alarms from contaminating the target list.

Study Questions

Q4.1 — Pilot density and range vs velocity.
Why does increasing pilot density in the frequency domain (more subcarriers per OFDM symbol) improve range estimation but not velocity estimation?
Show hint

Range is determined by the delay axis of the 2D-IFFT, which is the IFFT over the frequency dimension. A denser frequency grid (larger effective bandwidth or more frequency samples) improves the DFT resolution in the delay domain → finer range bins, lower CRB(\(\tau\)). Velocity is determined by the Doppler axis, which is the DFT over the time (symbol) dimension. Frequency-domain density has no effect on time-domain extent or sampling — hence no improvement in Doppler resolution. The two axes of the 2D-IFFT are orthogonal.

Q4.2 — Numerical CRB calculation.
A target at 150 m is observed with SNR = 15 dB. Using B = 100 MHz, compute the 1-σ range error bound from (4.5). Is this bound consistent with the CRB chart above?
Show solution

SNR\(_{lin}\) = \(10^{15/10} \approx 31.62\).
\(\text{CRB}(\tau) = \dfrac{3}{8\pi^2 \times 31.62 \times (10^8)^2} = \dfrac{3}{8 \times 9.8696 \times 31.62 \times 10^{16}} \approx 1.20 \times 10^{-19}\;\text{s}^2\)
\(\sigma_R = \dfrac{c}{2}\sqrt{\text{CRB}(\tau)} = \dfrac{3\times10^8}{2}\times\sqrt{1.20\times10^{-19}} \approx 1.5\times10^8 \times 1.095\times10^{-9.5} \approx \mathbf{0.12\;\text{m}}\).
The target range (150 m) is irrelevant to the bound — CRB depends on SNR and bandwidth, not target distance. Reading off the chart at SNR = 15 dB, the range curve is ≈ 0.12 m — consistent.

Q4.3 — SRS vs DMRS for bistatic sensing.
Compare SRS and DMRS for bistatic sensing. Which reference signal offers better Doppler resolution and why? Consider a 5G NR system with 15 kHz SCS and SRS periodicity = 10 ms.
Show hint

Doppler resolution is \(\delta\nu = 1/T_{CPI}\) (frequency-domain dual). For SRS with 10 ms periodicity, accumulating a CPI of 10 symbols gives \(T_{CPI} = 10 \times 10\,\text{ms} = 100\,\text{ms}\) → \(\delta v = \lambda/(2\times0.1) = 5\lambda\) m/s. DMRS is present every scheduled slot (1 ms @ 15 kHz SCS), so a continuous 100 ms observation window is naturally available, giving the same Doppler resolution. However DMRS bandwidth is allocation-dependent and typically narrower, hurting range resolution. SRS offers more controllable, dedicated bandwidth with wider BW configurability. For bistatic Doppler sensing specifically, DMRS availability every slot enables continuous Doppler tracking without gaps, an advantage over periodic SRS.

5 PRACH / PUCCH Sensing

5.1 Zadoff-Chu Sequences

Zadoff-Chu (ZC) sequences are the mathematical backbone of NR PRACH. Their constant-envelope and perfect periodic autocorrelation properties make them ideal both for random access and — as exploited in ISAC — for unambiguous range and Doppler estimation.

\[ x_u(n) \;=\; e^{-j\pi u n(n+1)/N_{\mathrm{ZC}}}, \quad n = 0, 1, \ldots, N_{\mathrm{ZC}}-1 \] (5.1)

Here \(u\) is the root index (coprime with \(N_{\mathrm{ZC}}\)) and \(N_{\mathrm{ZC}}\) is the sequence length. Key properties:

ISAC relevance: The perfect autocorrelation of ZC sequences means the PRACH signal itself acts as a nearly ideal pulse-compression waveform. The range sidelobe level is 0 dB below noise for an ideal channel — far better than a random QPSK sequence.

NR PRACH Format Summary

Format \(N_{\mathrm{ZC}}\) Bandwidth (PRBs) Duration Max Unambiguous Range Notes
Format 0 839 6 1 ms (1 OFDM sym equiv.) ~14.5 km (CP=3168 κ) FR1, long sequence; best for sensing
Format 1 839 6 3 ms ~29 km (CP=21024 κ) Extended CP; large-cell coverage
Format B4 139 12 ~0.29 ms ~1.5 km Short format; FR1 & FR2; limited range
Format C0 139 12 ~0.14 ms ~0.94 km Short format; mmWave (FR2)
ZC Sequence Autocorrelation (NZC=839, PRACH Format 0)
Normalised periodic autocorrelation \(|R_{uu}(k)|/N_{\mathrm{ZC}}\) for three ZC roots. The auto-correlation (same root, k=0) equals 1; cross-correlations between different roots flatten at \(1/\sqrt{839}\approx 0.034\). A small AWGN noise floor (SNR=30 dB) is included to show realistic conditions.

5.2 ZC-Based Range Estimation

When the gNB receives a reflected PRACH preamble, it cross-correlates the received signal \(y(n)\) with a local replica of the known ZC sequence to form a range profile. The delay bin with maximum correlation energy corresponds to the target round-trip delay.

\[ \hat{\tau} \;=\; \arg\max_{k} \left| \sum_{n=0}^{N_{\mathrm{ZC}}-1} y(n)\, x_u^*(n - k) \right|^2 \] (5.2)

The corresponding range estimate is simply:

\[ \hat{d} \;=\; \frac{c\,\hat{\tau}}{2} \] (5.3)

where \(c = 3\times10^8\) m/s. The factor of 2 accounts for the two-way propagation (monostatic sensing scenario).

Cramér-Rao Bound for ZC Ranging

For a ZC preamble of duration \(T_{\mathrm{ZC}}\) with bandwidth \(B\) observed at SNR \(\gamma\), the CRB on delay estimation variance is:

\[ \mathrm{var}(\hat{\tau}) \;\geq\; \frac{1}{8\pi^2 \gamma\, \bar{f}^2\, T_{\mathrm{ZC}}} \] (5.4)

where \(\bar{f}^2 = \int f^2 |X(f)|^2\,df / \int |X(f)|^2\,df\) is the mean-square bandwidth (second spectral moment). For a flat-spectrum ZC preamble, \(\bar{f}^2 \approx B^2/12\). The range CRB is then:

\[ \mathrm{CRB}(d) \;=\; \frac{c^2}{8\pi^2 \gamma\, (B^2/12)\, T_{\mathrm{ZC}}} \;=\; \frac{3c^2}{2\pi^2\, \gamma\, B^2\, T_{\mathrm{ZC}}} \] (5.5)

Cyclic Prefix and Unambiguous Range

The PRACH cyclic prefix \(T_{\mathrm{CP}}\) sets the maximum unambiguous round-trip delay: \(\tau_{\max} = T_{\mathrm{CP}}\), giving:

\[ d_{\max} \;=\; \frac{c\,T_{\mathrm{CP}}}{2} \] (5.6)

Additionally, the NCS (Ncs — number of cyclic shifts reserved per root) determines the separation between UE preambles in delay space. The unambiguous range from NCS is:

\[ d_{\mathrm{NCS}} \;=\; \frac{c \cdot N_{\mathrm{CS}} \cdot T_s}{2} \] (5.7)

For Format 0 with NCS=13, \(T_s = 1/(15000\times2048)\) s, giving \(d_{\mathrm{NCS}} \approx 22\) km. Any target beyond this distance folds into an ambiguous range bin.

ISAC Constraint: The NCS value chosen by the network operator for random-access efficiency simultaneously hard-limits the unambiguous sensing range. To maximise sensing range, operators should use larger NCS values or restrict to the CP-limited range. For dense urban deployments (NCS=13), 22 km is more than adequate; for vehicular sensing this is acceptable.
PRACH Range Profile (Format 0, NCS=13)
Simulated cross-correlation range profile. Two targets are visible at 2.5 km and 8.7 km. The noise floor is ~25 dB below peak. The vertical dashed line marks the NCS=13 unambiguous range boundary (~22 km); energy beyond this point folds into ambiguous bins.

5.3 Multi-Preamble Doppler Estimation

A single PRACH preamble resolves range but not velocity. By observing multiple PRACH transmissions in successive slots, the gNB can extract Doppler information from the phase evolution of the matched-filter output peak across repetitions.

Phase Difference Method

For a target at constant radial velocity \(v\), the round-trip delay changes as \(\tau_m = \tau_0 + 2v\,m\,T_{\mathrm{rep}}/c\), where \(T_{\mathrm{rep}}\) is the preamble repetition interval (e.g. 20 ms for a 50 Hz PRACH occasion rate) and \(m\) is the preamble index. The phase of the peak in the \(m\)-th correlation is:

\[ \phi_m \;=\; \phi_0 \;+\; 2\pi\,f_D\,m\,T_{\mathrm{rep}} \] (5.8)

The Doppler frequency \(f_D\) is estimated from the phase difference between consecutive preambles:

\[ \Delta\phi \;=\; \phi_{m+1} - \phi_m \;=\; 2\pi\,f_D\,T_{\mathrm{rep}} \] (5.9)

Velocity Estimation

Once \(\hat{f}_D\) is estimated (e.g. via FFT across the preamble sequence or simple phase unwrapping), the radial velocity follows from the standard Doppler-velocity relation:

\[ \hat{v} \;=\; \frac{\hat{f}_D\,\lambda}{2} \] (5.10)

At \(f_c = 3.5\) GHz (\(\lambda \approx 8.6\) cm), a Doppler of \(f_D = 500\) Hz corresponds to \(v \approx 21.4\) m/s (77 km/h).

The maximum unambiguous Doppler is limited by the Nyquist criterion:

\[ |f_D|_{\max} \;=\; \frac{1}{2\,T_{\mathrm{rep}}} \] (5.11)
Algorithm 5.1 — Multi-Preamble Doppler Estimation

Input: M received preambles y_0, y_1, ..., y_{M-1}; root u; N_ZC; T_rep; lambda
Output: range estimate d_hat, velocity estimate v_hat

1. FOR m = 0 TO M-1:
    a. Compute correlation: R_m(k) = sum_n y_m(n) * x_u*(n-k)
    b. Find peak delay: k_hat_m = argmax_k |R_m(k)|^2
    c. Extract peak phase: phi_m = angle(R_m(k_hat_m))
2. Estimate range: d_hat = c * mean(k_hat_m) * T_s / 2
3. Estimate Doppler via FFT of {exp(j*phi_m)} across m:
    f_D_hat = argmax_f |FFT({exp(j*phi_m)})(f)|^2 / (M * T_rep)
4. Estimate velocity: v_hat = f_D_hat * lambda / 2
5. RETURN (d_hat, v_hat)
Practical note: For M=4 preamble repetitions at T_rep=20 ms, the Doppler resolution is \(\Delta f_D = 1/(M\,T_{\mathrm{rep}}) = 12.5\) Hz, corresponding to a velocity resolution of ~0.54 m/s at 3.5 GHz. This is competitive with automotive radar, which typically targets <0.5 m/s resolution. The main bottleneck is the low duty cycle of PRACH transmissions.

5.4 PUCCH Periodic Sensing

While PRACH is a contention-based transmission, Physical Uplink Control Channel (PUCCH) carries scheduled control information (ACK/NACK, CSI, SR) with a known, regular cadence. For ISAC, PUCCH Formats 2 and 3 are particularly attractive because they use DMRS pilots and span multiple OFDM symbols.

Why Periodic PUCCH Enables Sensing

PUCCH Format Comparison for Sensing

FormatSymbolsUCI bitsSensing utility
01–2≤2Poor — no DMRS, sequence-modulated
14–14≤2Moderate — long but ZC-based, 1 bit payload
21–23–11Good — DMRS in symbol 0; wide bandwidth
34–14>2Best — DMRS in symbols 0 & 4; long coherence

Key Limitations

Despite its advantages, PUCCH-based sensing faces several challenges:

  • Scheduling dependency: If the UE has no UCI to send, the PUCCH occasion is dropped. Semi-persistent scheduling mitigates but does not eliminate gaps.
  • UCI payload corruption: ACK/NACK modulation sits on the same resource as the reference; the data symbols are corrupted by unknown payload bits and cannot be used directly as a sensing reference (only DMRS symbols are clean).
  • Lower effective SNR than SRS: PUCCH DMRS occupies only 1–2 of the 4–14 total symbols (typically ~14% duty cycle for Format 3), whereas SRS is a pure pilot with 100% pilot density. The sensing SNR loss relative to SRS is approximately \(10\log_{10}(N_{\mathrm{total}}/N_{\mathrm{DMRS}})\) dB.
  • Frequency hopping: PUCCH Format 1/3 with intra-slot frequency hopping changes the resource allocation mid-slot, complicating coherent range-Doppler processing.
SNR gap vs SRS: For PUCCH Format 3 with 10 symbols and 2 DMRS symbols, the pilot fraction is 20%. Compared to a full-band SRS of equal duration, the sensing SNR is ~7 dB lower. For SRS configured with full sweep bandwidth, SRS-based sensing outperforms PUCCH by 7–12 dB in typical deployments — which is why 3GPP Rel-17/18 ISAC standardisation focuses primarily on SRS as the sensing reference.
Study Questions — Section 5
  1. Why is \(N_{\mathrm{ZC}} = 839\) for long PRACH formats?
    839 is a prime number, which ensures that all \(u\) with \(1 \leq u \leq N_{\mathrm{ZC}}-1\) are coprime with \(N_{\mathrm{ZC}}\), maximising the number of available roots (838 distinct roots). For a prime \(N_{\mathrm{ZC}}\), every non-zero \(u\) generates a distinct sequence with the same cross-correlation floor \(1/\sqrt{N_{\mathrm{ZC}}}\). The choice of 839 specifically balances: (a) fitting within 6 PRBs at 15 kHz SCS (72 subcarriers \(\times\) oversampling factor ≈ 839), (b) providing enough roots for large deployments, and (c) keeping the CP overhead manageable. Verify: 6 PRBs \(\times\) 12 subcarriers = 72 subcarriers; DFT size = 839, zero-padded to the next FFT size.
  2. Two UEs transmit PRACH simultaneously with different root sequences. Can the gNB distinguish their range estimates?
    Yes — this is a key feature of ZC-based PRACH. Because different roots have cross-correlation magnitude \(1/\sqrt{N_{\mathrm{ZC}}}\) (≈ −29.2 dB for N_ZC=839), the matched filter for root \(u_1\) is orthogonal to the signal from root \(u_2\). The gNB runs independent correlators for each configured root; the output for root \(u_1\) shows the range profile of UE 1, and vice versa. The residual cross-correlation term appears as a noise floor at −29 dB, which is below typical sensing thresholds. Multi-UE PRACH is therefore inherently multi-target capable.
  3. Why does PUCCH-based sensing have lower SNR than SRS-based sensing?
    Three compounding reasons: (1) Pilot fraction: PUCCH DMRS occupies only a subset of OFDM symbols (typically 1–2 out of 4–14), so less energy is available for coherent sensing integration compared to a full-pilot SRS burst. (2) Bandwidth: PUCCH occupies narrow bandwidth (1–16 PRBs depending on format and payload), while SRS can be configured to sweep the full carrier bandwidth — wider bandwidth means finer range resolution and higher effective SNR from processing gain. (3) Power allocation: PUCCH power is optimised for control reliability (SINR margin for ACK/NACK), whereas SRS transmit power can be set specifically for sensing coverage requirements. Combined, SRS-based sensing typically achieves 7–12 dB better sensing SNR for the same UE transmit power budget.
6 Downlink Sensing & Self-Interference Cancellation

6.1 CSI-RS and PRS Sensing

In monostatic ISAC, the gNB transmits reference signals and listens to its own echoes. CSI-RS (Channel State Information Reference Signal) is the workhorse for monostatic sensing: its structured time-frequency allocation enables coherent channel estimation at the TX itself. PRS (Positioning Reference Signal, defined in 3GPP Rel-16/17) extends this to bistatic/passive scenarios where a separate RX node estimates geometry from the DL transmission.

Channel estimation from echoes: Given pilot symbols \(X_{CSI}(\ell,k)\) at OFDM symbol \(\ell\), subcarrier \(k\), the monostatic channel estimate is simply \[ \hat{H}(\ell,k) = \frac{Y(\ell,k)}{X_{CSI}(\ell,k)} \] where \(Y(\ell,k)\) is the received (echo) signal. Least-squares interpolation then fills non-pilot subcarriers to obtain a full range-Doppler map.

The PRS sequence is a QPSK-modulated pseudo-random sequence ensuring low PAPR and good autocorrelation properties. For positioning resource \(m\):

\[ r(m) = \frac{1}{\sqrt{2}}\bigl(1 - 2c(2m)\bigr) + j\,\frac{1}{\sqrt{2}}\bigl(1 - 2c(2m+1)\bigr) \] (PRS QPSK sequence)

where \(c(n)\) is the length-31 Gold code defined in TS 38.211. The constant \(1/\sqrt{2}\) normalises each PRS symbol to unit power.

Reference Signal Comparison

Signal Overhead Periodicity Primary Measurement Typical SNR req. Sensing Use Case
CSI-RS 1–4 ports, configurable density (up to 3 RE/RB/port) 4–640 ms CQI, PMI, RI, beam management −5 to +20 dB Monostatic range-Doppler (echo at gNB)
SRS 1–4 symbols per occasion, UL only Aperiodic or 1–2560 ms UL channel sounding, reciprocity 0 to +15 dB UL-based (see Section 7); gNB reconstructs DL channel
PRS Dedicated bandwidth part (up to 24 PRBs to full BW) 160–10240 ms (positioning burst) RSTD, RTOA, Rx-Tx time diff −13 dB (NR Rel-17) Bistatic / passive sensing; UE-side sensing
Study note — CSI-RS density vs range resolution: A CSI-RS pattern covering \(N_{sub}\) subcarriers yields range resolution \(\Delta R = c/(2 \cdot N_{sub} \cdot \Delta f)\). For \(\mu=1\) (SCS = 30 kHz) with 100 PRBs (1200 subcarriers, BW ≈ 36 MHz): \(\Delta R \approx 4.2\) m. Wider bandwidth or channel-stitching is needed for sub-metre sensing.

6.2 Full-Duplex Problem & Self-Interference

Monostatic ISAC requires simultaneous TX and RX at the same node — the defining challenge of in-band full-duplex (IBFD). The transmitter leaks directly into the receiver through antenna coupling, PCB paths, and near-field scattering:

\[ y_{rx}(t) = \underbrace{h_{SI}(t) * x_{tx}(t)}_{\text{self-interference}} + \underbrace{h_{tgt}(t) * x_{tx}(t)}_{\text{target echo}} + n(t) \] (IBFD received signal)
Scale of the problem: A gNB transmitting at +43 dBm (20 W) receives its own leakage at roughly −40 to −80 dBm after antenna isolation, while a target echo 200 m away may arrive at −100 to −130 dBm. Self-interference therefore sits 80–120 dB above the target echo — it must be suppressed before the ADC saturates and before any sensing processing can proceed.

The SI cancellation budget is typically decomposed into three domains, each with characteristic limits set by hardware constraints and non-linearities:

Self-Interference Cancellation Budget
SIC waterfall: SI enters at +120 dBm equivalent (at receiver input). Three cancellation stages progressively reduce the residue toward the thermal noise floor. Each bar shows the post-stage power level; cancellation amounts are annotated.

6.3 SIC Stage-by-Stage Analysis

Each cancellation stage must be budgeted carefully. Denote the SI cancellation gains as \(G_{pass}\), \(G_{ana}\), \(G_{dig}\) (linear power ratios). The residual SI variance after all three stages is:

\[ \sigma_{SI,res}^2 = \frac{\sigma_{SI}^2}{G_{pass} \cdot G_{ana} \cdot G_{dig}} \] (total SI residue)

The three-stage pipeline proceeds as follows:

Algorithm 6.1 — Three-Stage SIC Pipeline
  1. Stage 1 — Passive Isolation \(L_{ant}\):
    Physical antenna design (dual-polarised array, absorptive shielding, circulator). Achievable: 30–50 dB. No power consumption beyond hardware cost. Residual after stage 1: \(P_{SI,1} = P_{SI,0} - L_{ant}\) dB.
  2. Stage 2 — Analog Cancellation:
    A tapped delay line samples the TX signal and subtracts a weighted copy in the RF/IF domain before the ADC: \[ y_{ana}(t) = y(t) - \hat{h}_{SI,ana}(t) * x_{tx}(t) \] Residual power is dominated by TX non-linearities (PA harmonics, IQ imbalance). Achievable: 20–40 dB. Critical to keep the ADC from clipping.
  3. Stage 3 — Digital Cancellation (LMS/RLS):
    Adaptive filter on quantised digital samples. LMS update rule: \[ \mathbf{w}_{n+1} = \mathbf{w}_n + \mu \, e_n^* \, \mathbf{x}_n, \quad e_n = y_n - \mathbf{w}_n^H \mathbf{x}_n \] Convergence limited by ADC quantisation noise floor and residual non-linear SI. Achievable: 20–40 dB additional suppression.
Key insight — why three stages? Digital-only cancellation is limited to roughly 6 dB × (ADC bits) of dynamic range. For a 14-bit ADC that is ~84 dB — insufficient to cover the full 100+ dB SI problem. Analog cancellation protects the ADC from saturation, creating headroom for digital processing. Passive isolation reduces the burden on the analog circuit, lowering its complexity and power.
LMS convergence: Step size \(\mu\) trades convergence speed against steady-state misadjustment: \[ \mu_{opt} \approx \frac{1}{N \cdot P_{SI,2}} \] where \(N\) is filter length and \(P_{SI,2}\) is the post-analog SI power.
RLS (recursive least squares) converges in \(\sim 2N\) samples regardless of eigenvalue spread, at cost of \(\mathcal{O}(N^2)\) complexity. Preferred for fast time-varying SI channels (moving platforms). LMS sufficient for quasi-static indoor scenarios.

6.4 TDD Guard Period Sensing

In TDD NR, the monostatic sensing window is restricted to the Guard Period (GP) — the brief transition between DL and UL bursts during which no data is scheduled. This avoids the self-interference problem entirely at the cost of limited observation time.

The maximum unambiguous range is set by the round-trip propagation budget during the GP:

\[ R_{max} = \frac{c \cdot T_{GP}}{2} \] (monostatic R_max in TDD GP)

For NR \(\mu=1\) (SCS = 30 kHz), one OFDM symbol duration (including CP) is \(T_{sym} \approx 35.7\,\mu s\). A 2-symbol GP gives \(T_{GP} \approx 71.4\,\mu s\), hence \(R_{max} \approx 10.7\) km. Typical deployments use 1–3 GP symbols, yielding 5–16 km range.

Velocity ambiguity in GP sensing: Because sensing is restricted to one short burst per TDD period, coherent integration across multiple bursts is needed for Doppler resolution. With burst repetition interval \(T_{rep}\), max unambiguous velocity is \(v_{max} = \lambda / (4 T_{rep})\). For 3.5 GHz, \(T_{rep} = 5\) ms: \(v_{max} \approx 4.3\) m/s — enough for pedestrian but not vehicular sensing without Doppler unfolding.

TDD Slot Structure — Timing Diagram

Frame (10 ms) Slot / Symbol allocation (DDDSU pattern, \(\mu=1\))
Slot 0 DL — 14 symbols (data + CSI-RS)
Slot 1 DL — 14 symbols
Slot 2 DL — 14 symbols
Slot 3 (S-slot) DL (6 sym) GP (2 sym)
→ sensing window
UL (6 sym)
Slot 4 UL — 14 symbols (SRS, PUSCH)

DDDSU pattern (3GPP typical). The Special slot (S) contains the GP where monostatic echo reception is possible. GP duration configurable: 1–10 symbols depending on cell size and operator choice.

6.5 Bistatic Downlink Sensing

In bistatic sensing, the transmitter (gNB) and receiver are physically separated — the RX may be a UE in passive-listening mode, a dedicated sensor node, or another base station. There is no self-interference problem because TX and RX are at different locations.

A target at position \(\mathbf{p}\) creates a two-leg propagation path. The bistatic excess delay relative to the direct TX–RX path is:

\[ \tau_{bi} = \frac{R_{TX} + R_{RX}}{c} - \frac{R_{direct}}{c} = \frac{\|\mathbf{p} - \mathbf{p}_{TX}\| + \|\mathbf{p} - \mathbf{p}_{RX}\| - \|\mathbf{p}_{TX} - \mathbf{p}_{RX}\|}{c} \] (bistatic excess delay)

Loci of constant \(\tau_{bi}\) are ellipses with TX and RX at the foci. The bistatic range (semi-major axis \(a\)) satisfies: \[ 2a = R_{direct} + c\,\tau_{bi}, \qquad R_{bi} = c\,\tau_{bi} \]

Advantages of bistatic:
  • No self-interference — the receiver is never exposed to the direct TX power.
  • Can reuse existing NR DL transmissions (PRS, SSB, CSI-RS) without waveform modification.
  • RX node can be lightweight (no PA, simplified front-end).
  • Forward-scatter geometry (target crossing the TX-RX baseline) provides strong RCS.
Challenges:
  • Synchronisation: TX and RX must share a common time and frequency reference. Timing error \(\Delta\tau\) maps directly to range error \(\Delta R = c\,\Delta\tau\). For 1 m accuracy, \(\Delta\tau < 3.3\) ns is required.
  • Direct path leakage: The strong direct TX-RX signal dominates the ADC; requires notching or subtraction.
  • Geometry-dependent resolution: Bistatic range resolution depends on aspect angle \(\beta\): \(\Delta R_{bi} = \Delta R_{mono} / \cos(\beta/2)\).
Synchronisation requirement for 1 m bistatic range resolution:
Range resolution \(\Delta R_{bi} = c/(2B\cos(\beta/2))\) for bandwidth \(B\). The synchronisation accuracy needed is independent of range resolution — it sets absolute range accuracy. For \(\Delta R_{abs} = 1\) m: \[ \Delta\tau < \frac{1\,\text{m}}{c} = 3.33\,\text{ns} \] NR PRS provides timing accuracy of ~1–5 ns in good SNR conditions (3GPP TR 38.857). Sub-nanosecond synchronisation requires IEEE 1588 PTP with hardware timestamping or GNSS-disciplined oscillators at both nodes.
Bistatic Sensing Geometry
Bistatic sensing geometry: TX (gNB) at origin, RX at (1000 m, 0). Two targets are shown with their associated constant-delay ellipses. The direct TX–RX link is the bistatic baseline. Ellipse semi-axes are determined by \(2a = R_{direct} + R_{TX} + R_{RX}\).

Section 6 — Study Questions

  1. TDD R_max: In a TDD NR system with SCS = 30 kHz (\(\mu=1\)) and a 1-symbol guard period, what is \(R_{max}\) for monostatic sensing? (Recall: one NR symbol at \(\mu=1\) has duration \(T_{sym} = 1/(30000) \approx 33.3\,\mu\text{s}\) plus ~20% CP.) Hint: use \(T_{GP} = T_{sym}\) and \(R_{max} = cT_{GP}/2\).
  2. Analog SIC necessity: Why is analog SIC required before digital SIC? What fundamental hardware limitation prevents digital-only cancellation from being sufficient? Hint: consider the ADC input power budget and what happens when the SI power exceeds the ADC full-scale range.
  3. Bistatic synchronisation: For bistatic sensing, what timing synchronisation accuracy is needed to achieve 1 m absolute range accuracy? How does this compare to the NR frame timing accuracy achievable with PRS (typically ~5 ns at SNR = 0 dB)? At what SNR is PRS timing error < 3.33 ns achievable per 3GPP Rel-17 specs?
7

PHY Sensing Chain

7.1 TX → Channel → RX Processing Chain

The OFDM-ISAC sensing chain reuses the communications PHY layer end-to-end. Complex baseband symbols are modulated, pulse-shaped, transmitted, reflected off targets, and then coherently processed at the receiver to extract range, Doppler, and angular information. The received discrete-time signal is modelled as:

\[ y[n] = \sum_p \alpha_p \, x[n - \tau_p] \, e^{j2\pi\nu_p n T_s} + w[n] \] (7.1)

where \(\alpha_p\) is the complex reflection coefficient of the \(p\)-th target, \(\tau_p = 2R_p/c\) is the round-trip delay (samples), \(\nu_p = 2v_p f_c/c\) is the Doppler shift (Hz), \(T_s\) is the sampling period, and \(w[n]\sim\mathcal{CN}(0,\sigma_w^2)\) is additive white Gaussian noise. The summation is over all \(P\) scattering paths.

Full Signal Flow: Monostatic OFDM-ISAC Sensing Chain
TX
Modulator
QAM / QPSK
\(s[k,l]\in\mathbb{C}\)
N-IFFT
\(x[n]=\tfrac{1}{N}\sum_k s[k]e^{j2\pi kn/N}\)
CP Insert
Prepend \(N_{cp}\) samples
\(\tilde{x}[n]\)
DAC
D/A, \(f_s\) → RF
\(x_{RF}(t)\)
TX Filter
Pulse shape
\(h_{tx}(t)\)
CHANNEL
Propagation
Free-space path
\(r(t)=h(t)*x(t)+w(t)\)
Target
Reflect: \(\alpha_p, \tau_p, \nu_p\)
RX
RX Filter
Matched / LPF
\(h_{rx}(t)\)
ADC
A/D at \(f_s\)
\(y[n]\)
CP Remove
Discard \(N_{cp}\) samp.
\(\hat{y}[n]\)
N-FFT
\(Y[k,l]=\sum_n \hat{y}[n]e^{-j2\pi kn/N}\)
SENSING DSP
Pilot Division
\(H[k,l]=Y[k,l]/S[k,l]\)
2D-IFFT
Range-Doppler
\(h[\tau,\nu]\)
CFAR
CA / OS CFAR
Detection
Track
Kalman Filter
\(\hat{x}_k, \hat{P}_k\)

After the 2D-IFFT, the range-Doppler map is:

\[ h[\tau, \nu] = \sum_p \alpha_p \, \delta(\tau - \tau_p)\,\delta(\nu - \nu_p) \] (7.2)

Each target appears as a 2D impulse at \((\tau_p, \nu_p)\), which after CFAR thresholding yields discrete detections fed into the tracking stage. The pilot-division step normalises out the data symbols, exploiting the known transmit waveform to achieve coherent sensing.

Key stage equations:

  • IFFT output: \(x[n] = \frac{1}{N}\sum_{k=0}^{N-1} S[k]\,e^{j2\pi kn/N}\)
  • CP-extended symbol: \(\tilde{x}[n] = x[((n))_N],\; -N_{cp}\le n < N\)
  • Received freq-domain: \(Y[k,l] = H[k,l]\,S[k,l] + W[k,l]\)
  • Channel estimate: \(\hat{H}[k,l] = Y[k,l]/S[k,l]\)
  • Range bin: \(\Delta R = c/(2B)\), Doppler bin: \(\Delta v = \lambda/(2T_{CPI})\)
CP as multipath guard: The cyclic prefix of length \(N_{cp} \ge \tau_{max}\) converts the linear convolution with the channel into circular convolution, enabling simple one-tap equalisation per subcarrier. In sensing, this ensures that the delay of each target is fully captured within the guard interval.

7.2 CFAR Detection

Constant False Alarm Rate (CFAR) detectors adaptively set the detection threshold based on local noise estimates, maintaining a fixed \(P_{FA}\) regardless of clutter level. The cell under test (CUT) is compared against a threshold derived from surrounding reference cells.

Cell-Averaging CFAR (CA-CFAR)

CA-CFAR estimates the noise power from the \(2L\) reference cells (excluding \(G\) guard cells on each side):

\[ T_{\text{CFAR}} = \alpha \cdot \frac{1}{2L} \sum_{i \in \text{ref cells}} |z_i|^2 \] (7.3)

where the threshold multiplier \(\alpha\) is chosen to achieve a target \(P_{FA}\):

\[ \alpha = L\!\left(P_{FA}^{-1/L} - 1\right) \] (7.4)

The false alarm probability for CA-CFAR with \(L\) reference cells is exactly:

\[ P_{FA} = \left(1 + \frac{\alpha}{L}\right)^{-L} \] (7.5)

A detection is declared when \(|z_{CUT}|^2 > T_{\text{CFAR}}\).

Algorithm 7.1 — CA-CFAR Sliding Window
  1. Set reference window: \(L\) cells on each side, \(G\) guard cells each side.
  2. For each cell under test (CUT) at index \(m\):
  3. Collect reference set \(\mathcal{R} = \{z_i : m-L-G \le i \le m+L+G,\; |i-m|>G\}\)
  4. Estimate noise: \(\hat{\sigma}^2 = \frac{1}{2L}\sum_{i\in\mathcal{R}}|z_i|^2\)
  5. Compute threshold: \(T = \alpha \cdot \hat{\sigma}^2\)
  6. If \(|z_m|^2 > T\): declare detection, record \((m, \text{power})\)
  7. Slide window; handle boundary conditions with cell-averaging only over available cells.

Ordered-Statistics CFAR (OS-CFAR)

OS-CFAR sorts the \(2L\) reference cell samples in ascending order and uses the \(k\)-th order statistic as the noise estimate:

\[ T_{\text{OS}} = \alpha_{OS} \cdot z_{(k)}, \qquad z_{(1)} \le z_{(2)} \le \cdots \le z_{(2L)} \] (7.6)

Typical choice: \(k = \lfloor 0.75 \cdot 2L \rfloor\). OS-CFAR is robust to clutter edges and interference spikes that inflate the CA-CFAR noise estimate, at the cost of slightly reduced \(P_D\) in homogeneous clutter.

VariantNoise EstimateStrengthWeakness
CA-CFARCell averageOptimal in homogeneous noiseFails at clutter edges
OS-CFAR\(k\)-th order statRobust to outliers & edgesLower \(P_D\), slower (\(O(n\log n)\))
GO-CFARGreater-Of leading/laggingGood at clutter boundariesMasking in multi-target
SO-CFARSmaller-Of leading/laggingMulti-target sensitivityElevated \(P_{FA}\) at boundary
Guard cell sizing: Guard cells must span the expected target response width (range sidelobe region). Too few guard cells causes self-masking — the target leaks into reference cells, inflating the threshold and suppressing detection. For OFDM sensing, guard width \(\approx 2\times\) range sidelobe width in bins.

7.3 MUSIC Direction Finding

After CFAR yields range-Doppler detections, a co-located antenna array estimates the Direction-of-Arrival (DoA) for each detection via the MUSIC (MUltiple SIgnal Classification) algorithm. With an \(M\)-element Uniform Linear Array (ULA) receiving signals from \(K\) point targets, the array output covariance is:

\[ R_{xx} = \mathbf{A}\,S\,\mathbf{A}^H + \sigma_n^2 I = U_s \Sigma_s U_s^H + \sigma_n^2 U_n U_n^H \] (7.7)

where \(U_s \in \mathbb{C}^{M\times K}\) spans the signal subspace (eigenvalues \(\Sigma_s\)) and \(U_n \in \mathbb{C}^{M\times(M-K)}\) spans the orthogonal noise subspace.

Steering Vector and MUSIC Pseudo-Spectrum

For a ULA with half-wavelength spacing \(d = \lambda/2\), the steering vector for angle \(\theta\) is:

\[ \mathbf{a}(\theta) = \Bigl[1,\; e^{j\pi\sin\theta},\; e^{j2\pi\sin\theta},\; \ldots,\; e^{j\pi(M-1)\sin\theta}\Bigr]^T \] (7.8)

The MUSIC pseudo-spectrum is formed by projecting the steering vector onto the noise subspace:

\[ P_{\text{MUSIC}}(\theta) = \frac{1}{\mathbf{a}^H(\theta)\,U_n\,U_n^H\,\mathbf{a}(\theta)} \] (7.9)

Sharp peaks in \(P_{\text{MUSIC}}(\theta)\) indicate target angles. The theoretical angular resolution for an \(M\)-element ULA is:

\[ \Delta\theta \approx \frac{0.886\,\lambda}{M\,d} = \frac{1.772}{M} \quad \text{(radians, for } d=\lambda/2\text{)} \] (7.10)
Algorithm 7.2 — MUSIC DoA Estimation
  1. Collect \(N_s\) snapshots: form \(\hat{R}_{xx} = \frac{1}{N_s} X X^H\).
  2. Optional — Spatial Smoothing: For coherent sources, divide array into \(L\) overlapping sub-arrays of size \(M'\) and average their covariance matrices: \(\tilde{R} = \frac{1}{L}\sum_{l=1}^{L} R_l\)
  3. Eigendecompose \(\hat{R}_{xx}\): sort eigenvalues \(\lambda_1\ge\lambda_2\ge\cdots\ge\lambda_M\).
  4. Estimate number of sources \(\hat{K}\) via MDL/AIC or known from CFAR count.
  5. Extract noise subspace: \(U_n = [\mathbf{u}_{K+1}, \ldots, \mathbf{u}_M]\).
  6. Sweep \(\theta \in [-90°, 90°]\), compute \(P_{\text{MUSIC}}(\theta)\) at each angle.
  7. Find \(\hat{K}\) largest peaks; report \(\hat{\theta}_1, \ldots, \hat{\theta}_{\hat{K}}\).
MUSIC DoA Spectrum (8-element ULA, SNR = 20 dB, 3 targets)
MUSIC pseudo-spectrum sweep from −90° to +90°. Three sharp peaks appear at the true target angles of −20°, +5°, and +35°. Dashed vertical lines mark the true directions. Noise floor is approximately −30 dB. 8-element ULA with half-wavelength spacing, 20 dB SNR.

7.4 Kalman Tracking

CFAR detections are discrete, noisy, and possibly missing (misdetections) or spurious (false alarms). A Kalman filter provides a principled recursive estimator that maintains a state estimate and its uncertainty over time, suppressing measurement noise while predicting target motion.

State Model

The tracking state vector for a 2D range-angle target is:

\[ \mathbf{x}_k = \bigl[r_k,\; \dot{r}_k,\; \theta_k,\; \dot{\theta}_k\bigr]^T \] (7.11)

State transition (constant-velocity model) with sampling interval \(T\):

\[ F = \begin{bmatrix} 1 & T & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & T \\ 0 & 0 & 0 & 1 \end{bmatrix}, \qquad Q = q\begin{bmatrix} T^4/4 & T^3/2 & 0 & 0 \\ T^3/2 & T^2 & 0 & 0 \\ 0 & 0 & T^4/4 & T^3/2 \\ 0 & 0 & T^3/2 & T^2 \end{bmatrix} \] (7.12)

where \(q\) is the process noise spectral density (units: m²/s³ or rad²/s³), controlling how quickly the filter adapts to manoeuvres.

Predict Step

\[ \mathbf{x}_{k|k-1} = F\,\mathbf{x}_{k-1|k-1}, \qquad P_{k|k-1} = F\,P_{k-1|k-1}\,F^T + Q \] (7.13)

Update Step

Given measurement \(\mathbf{z}_k = H\mathbf{x}_k + \mathbf{v}_k\) where \(\mathbf{v}_k\sim\mathcal{N}(0,R)\):

\[ K_k = P_{k|k-1}\,H^T \!\left(H\,P_{k|k-1}\,H^T + R\right)^{-1} \] (7.14)
\[ \mathbf{x}_{k|k} = \mathbf{x}_{k|k-1} + K_k\!\left(\mathbf{z}_k - H\,\mathbf{x}_{k|k-1}\right), \qquad P_{k|k} = (I - K_k H)\,P_{k|k-1} \] (7.15)

The innovation \(\tilde{\mathbf{z}}_k = \mathbf{z}_k - H\,\mathbf{x}_{k|k-1}\) measures the prediction error; \(K_k\) blends prediction and measurement optimally. The measurement noise covariance is:

\[ R = \begin{bmatrix} \sigma_r^2 & 0 \\ 0 & \sigma_\theta^2 \end{bmatrix}, \qquad H = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \] (7.16)

where \(\sigma_r\) and \(\sigma_\theta\) are the CFAR range and angle measurement standard deviations, respectively (typically \(\sigma_r \approx \Delta R / \sqrt{2\cdot\text{SNR}}\) and \(\sigma_\theta \approx \Delta\theta_{MUSIC} / \sqrt{2\cdot\text{SNR}}\)).

Kalman Filter Track vs Raw CFAR Detections
Simulated 2D tracking scenario: a target follows a curved arc from (50, 50) m to (150, 120) m over 20 time steps. Red circles are noisy CFAR detections (σ = 5 m); the green dashed line is the true trajectory; the blue line is the Kalman-filtered estimate. The filter visibly smooths measurement noise while following the curved path.

Study Questions

  1. OS-CFAR vs CA-CFAR \(P_D\) tradeoff: OS-CFAR has lower \(P_D\) than CA-CFAR at the same \(P_{FA}\) in homogeneous clutter. When would you choose OS-CFAR?
    Show answer

    Choose OS-CFAR when the environment has clutter edges (transitions between high and low clutter regions), interfering targets in the reference window, or non-homogeneous backgrounds (sea clutter, urban). In these cases CA-CFAR over-estimates the noise level (a few large reference cells dominate the average), raising the threshold and degrading \(P_D\) far more than OS-CFAR's mild \(P_D\) loss. The \(k\)-th order statistic ignores the \(2L-k\) largest outliers, providing a robust noise estimate. In practice OS-CFAR is preferred in automotive radar and surveillance radar operating over terrain with varying reflectivity.

  2. MUSIC with underestimated \(K\): MUSIC requires knowing the number of targets \(K\). What happens if \(K\) is underestimated (i.e., \(\hat{K} < K_{\text{true}}\))?
    Show answer

    The noise subspace \(U_n\) is allocated too many dimensions, absorbing some signal subspace eigenvectors. Specifically, if \(\hat{K} = K-1\), then one true signal eigenvector is placed in \(U_n\), making \(\mathbf{a}(\theta_p)^H U_n U_n^H \mathbf{a}(\theta_p) \ne 0\) for one target. The corresponding MUSIC peak disappears — that target is not detected. With severely underestimated \(K\), multiple peaks can vanish. Overestimating \(K\) is less catastrophic: the extra signal subspace columns slightly contaminate the noise subspace (reducing peak sharpness) but do not eliminate true peaks. In practice, use MDL or AIC to estimate \(K\) from the eigenvalue profile before running MUSIC.

  3. Kalman process noise \(Q\) tradeoffs: The process noise spectral density \(q\) in \(Q\) controls track responsiveness. What are the tradeoffs of large vs small \(q\)?
    Show answer

    Large \(q\): The filter trusts measurements more than the model prediction. The Kalman gain \(K_k\) is large, so the track responds quickly to manoeuvres and heading changes. However, measurement noise passes through — the track is jittery and sensitive to false alarms.

    Small \(q\): The filter trusts the constant-velocity model strongly. The Kalman gain is small; the track is smooth and noise-resistant, but lags behind actual manoeuvres (track divergence during turns or accelerations).

    In ISAC sensing, \(q\) is typically tuned to match the maximum expected target acceleration. For cooperative UE tracking, small \(q\) suffices; for fast vehicles or pedestrians with frequent direction changes, adaptive \(q\) (IMM — Interacting Multiple Model filter) or a Singer model with exponential manoeuvre autocorrelation is preferred.

8 MIMO-ISAC Beamforming

8.1 MIMO Sensing Channel

In a MIMO-ISAC system the sensing channel is characterized by a matrix \(H_s \in \mathbb{C}^{N_R \times N_T}\) where \(N_T\) is the number of transmit antennas and \(N_R\) is the number of receive antennas. The received signal block is:

\[ Y = H_s X + N, \quad X \in \mathbb{C}^{N_T \times L} \] (8.1)

where \(L\) is the number of snapshots (OFDM symbols) and \(N\) is additive noise. For a point target at angle \(\theta\) and range \(r\), \(H_s = \sigma_t \, \mathbf{b}(\theta)\mathbf{a}^T(\theta)\) where \(\mathbf{a}(\theta)\) and \(\mathbf{b}(\theta)\) are the transmit and receive steering vectors, respectively.

Virtual aperture and spatial resolution. Coherent MIMO radar synthesizes a virtual array of length:

\[ N_{\mathrm{virt}} = N_T \cdot N_R \] (8.2)

Angular resolution (3 dB beamwidth of the virtual aperture, ULA with inter-element spacing \(d\)):

\[ \Delta\theta = \frac{0.886\,\lambda}{N_{\mathrm{virt}}\, d} \] (8.3)
A 4 TX × 4 RX MIMO array achieves the same angular resolution as a 16-element phased array, while simultaneously enabling waveform diversity across transmit channels.
Property Phased Array (16 ant.) MIMO (4 TX × 4 RX)
Virtual aperture 16 \(4 \times 4 = 16\)
Angular resolution \(\Delta\theta\) \(0.886\lambda/(16d)\) \(0.886\lambda/(16d)\) (same)
Waveform diversity None (coherent) Full (independent TX waveforms)
Transmit beamforming gain \(N_T^2 = 256\) 1 (omnidirectional TX) — or designed
Beampattern control Fixed steering Arbitrary via \(R_{XX}\)
ISAC compatibility Moderate High (flexible DoF allocation)

8.2 Beampattern Design

The transmit beampattern is the spatial power distribution radiated by the array. Given signal covariance \(R_{XX} = \mathbb{E}[XX^H] \in \mathbb{C}^{N_T \times N_T}\) and the transmit steering vector \(\mathbf{a}(\theta) = \frac{1}{\sqrt{N_T}}[1, e^{j\pi\sin\theta}, \ldots, e^{j(N_T-1)\pi\sin\theta}]^T\), the power pattern is:

\[ P_{TX}(\theta) = \mathbf{a}^H(\theta)\, R_{XX}\, \mathbf{a}(\theta) \] (8.4)

A desired (sensing) beampattern \(p_d(\theta)\) is specified (e.g., uniform coverage of a surveillance sector). The covariance design problem is:

\[ \min_{R_{XX} \succeq 0} \int_{-\pi/2}^{\pi/2} \bigl|P_{TX}(\theta) - p_d(\theta)\bigr|^2 d\theta \quad\text{s.t.}\quad \mathrm{diag}(R_{XX}) = \mathbf{p},\quad R_{XX} \succeq 0 \] (8.5)

The per-antenna power constraint \(\mathrm{diag}(R_{XX}) = \mathbf{p}\) ensures hardware amplifier limits are respected. This is a semidefinite program (SDP) and can be solved efficiently with CVX or similar convex solvers.

ISAC Balance: For a dual-function system, the beampattern must simultaneously shape a sensing lobe toward the surveillance zone and direct data streams toward communication users. The SDP formulation extends naturally by adding per-user SINR constraints to (8.5).

8.3 DFRC Precoder Optimization

A Dual-Function Radar-Communications (DFRC) precoder \(W \in \mathbb{C}^{N_T \times K}\) simultaneously serves \(K\) single-antenna downlink users and generates a sensing beam. Let \(W_s \in \mathbb{C}^{N_T \times K}\) be the purely sensing-optimal precoder (from beampattern design). The joint optimization minimizes deviation from the sensing solution while enforcing comms quality:

\[ \min_{W}\; \|W_s - W\|_F^2 \quad\text{s.t.}\quad \mathrm{SINR}_k \geq \gamma_k\;\forall k,\quad \|W\|_F^2 \leq P_{\max} \] (8.6)

where the SINR of user \(k\) under zero-forcing / beamforming is:

\[ \mathrm{SINR}_k = \frac{|\mathbf{h}_k^H \mathbf{w}_k|^2} {\sum_{j \neq k}|\mathbf{h}_k^H \mathbf{w}_j|^2 + \sigma_n^2} \] (8.7)
SCA/SDR Solution Outline
  1. Lift: define \(\tilde{W} = WW^H\) (rank-\(K\) matrix variable).
  2. Relax rank constraint → SDP. SINR becomes linear in \(\tilde{W}\).
  3. Solve SDP; if rank(\(\tilde{W}\)) > K, apply Gaussian randomization or SCA to recover a rank-\(K\) feasible point.
  4. Successive Convex Approximation: at iteration \(t\), linearize non-convex terms around \(W^{(t)}\) and solve the resulting QP.
  5. Convergence: typically 10–30 outer SCA iterations; inner SDP via MOSEK/SCS.
The SINR constraints act as a hard floor: if all \(\gamma_k\) are set to zero, the precoder collapses to the pure sensing solution. The minimum SINR requirement is precisely what prevents trivially ignoring communications. See the trade-off chart (8.5.2) below.

8.4 Massive MIMO Degrees of Freedom

As \(N_T \to \infty\) with fixed \(K\) users, the law of large numbers gives channel hardening and favorable propagation. Key asymptotic results:

\[ \text{Sensing DoF} = N_T - K \xrightarrow{N_T \gg K} N_T \] (8.8)

Near-field sensing at mmWave (XL-MIMO). When the target distance \(r \lesssim 2 D^2/\lambda\) (Fraunhofer distance, \(D = N_T d\)), the wavefront curvature across the array is non-negligible. The near-field steering vector encodes both angle and range:

\[ a_n^{NF}(\theta, r) = e^{-j\frac{2\pi}{\lambda} \left(\sqrt{r^2 + (nd)^2 - 2r\,nd\sin\theta} - r\right)} \] (8.9)

This enables 3D (angle + range) sensing from a single aperture — a key capability of XL-MIMO ISAC at mmWave frequencies.

At 28 GHz with \(N_T = 256\) antennas and \(d = \lambda/2 = 5.36\) mm, the Fraunhofer distance is \(\approx 37\) m — meaning many indoor and urban targets are in the near field, enabling joint angle-range imaging.

8.5 MUSIC AoA Estimation with ULA

MUSIC (MUltiple SIgnal Classification) exploits the eigenstructure of the spatial covariance matrix. Given \(L\) snapshots at the \(M\)-antenna receiver:

\[ \hat{R} = \frac{1}{L} \sum_{l=1}^{L} \mathbf{y}_l \mathbf{y}_l^H \;\in\; \mathbb{C}^{M \times M} \] (8.10)

EVD: \(\hat{R} = E_s \Lambda_s E_s^H + E_n \Lambda_n E_n^H\) where \(E_s \in \mathbb{C}^{M \times P}\) spans the signal subspace (\(P\) targets) and \(E_n\) spans the noise subspace. MUSIC pseudo-spectrum:

\[ P_{\mathrm{MUSIC}}(\theta) = \frac{1}{\mathbf{a}^H(\theta)\, E_n E_n^H\, \mathbf{a}(\theta)} \] (8.11)

Peaks of \(P_{\mathrm{MUSIC}}(\theta)\) give AoA estimates. With \(M = 16\) antennas and sufficient SNR, MUSIC resolves targets separated by as little as 3°, well below the Rayleigh limit of \(\approx 6.4°\) for a 16-element ULA.

MUSIC Algorithm Steps
  1. Collect \(L\) snapshots \(\{\mathbf{y}_l\}\).
  2. Form sample covariance \(\hat{R}\).
  3. Compute EVD; select noise eigenvectors \(E_n\) (eigenvalues near \(\sigma_n^2\)).
  4. Sweep \(\theta \in [-90°, 90°]\); evaluate \(P_{\mathrm{MUSIC}}(\theta)\).
  5. Detect peaks → AoA estimates.
Resolution Limits
MethodMin sep. (\(M=16\))
Rayleigh (DFT)~6.4°
MUSIC (SNR=20 dB)~1–2°
ESPRIT~1–2°
Capon (MVDR)~3°

8.6 Visualizations

DFRC vs Phased Array Beampattern (NT=16)
Transmit beampattern (dB) vs angle for three scenarios: (1) phased array steered to 20°, (2) DFRC precoder with comms beam at 20° and sensing sidelobe coverage around −30°, and (3) omnidirectional reference. Shaded region indicates the sensing surveillance zone.
ISAC Sensing-Comms Trade-off (DFRC Precoder, K=2 users)
Pareto frontier of the DFRC precoder optimization (8.6). Moving left (better sensing pattern) requires relaxing comms SINR constraints. Three operating points are marked: comms-first, ISAC balanced, and sensing-first.

8.7 Study Questions

  1. Virtual aperture comparison: A MIMO radar has \(N_T = 4\) TX and \(N_R = 4\) RX antennas. The virtual aperture is \(N_{\mathrm{virt}} = 4 \times 4 = 16\), identical to a 16-antenna phased array. Hence angular resolution \(\Delta\theta = 0.886\lambda/(16d)\) is the same. However, the MIMO system gains waveform diversity (4 independent TX waveforms) enabling arbitrary beampattern shaping via \(R_{XX}\), whereas the phased array is restricted to a single steered beam.
  2. Pencil beams and ISAC sensing: Massive MIMO narrows the beamwidth to \(\approx \lambda/(N_T d)\). Narrow pencil beams mean (a) high directional gain improves target detection SNR, (b) low sidelobe energy reduces ambiguities in AoA estimation, and (c) the large number of orthogonal beams allows simultaneous coverage of many sensing directions without mutual interference — improving both angular resolution and multi-target separation.
  3. SINR floor in DFRC: The constraints \(\mathrm{SINR}_k \geq \gamma_k\) guarantee a minimum quality of service for each communication user. Setting \(\gamma_k = 0\) removes the comms requirement entirely and yields the pure sensing precoder. In practice, network SLAs, regulatory requirements, or safety-critical downlink traffic impose strictly positive \(\gamma_k\), forcing a non-trivial allocation of power and spatial DoF to communications. Additionally, the total power constraint \(\|W\|_F^2 \leq P_{\max}\) prevents infinite sensing gain.
9 Cramér-Rao Bounds & Pareto Frontiers

9.1 Fisher Information Matrix

The Fisher Information Matrix (FIM) provides the fundamental lower bound on the variance of any unbiased estimator. For a parameter vector \(\boldsymbol{\xi} = [\xi_1, \xi_2, \ldots, \xi_p]^T\), the \((i,j)\) entry of the FIM is defined as the negative expected curvature of the log-likelihood:

\[ [I(\boldsymbol{\xi})]_{ij} = -\mathbb{E}\!\left[\frac{\partial^2 \ln p(\mathbf{y};\boldsymbol{\xi})}{\partial \xi_i \,\partial \xi_j}\right] \] (9.1)

For complex Gaussian observations \(\mathbf{y} \sim \mathcal{CN}(\boldsymbol{\mu}(\boldsymbol{\xi}), C)\) with signal-dependent mean and known covariance \(C\), the FIM takes the compact form:

\[ I(\boldsymbol{\xi}) = 2\,\mathrm{Re}\!\left[ \frac{\partial \boldsymbol{\mu}^H}{\partial \boldsymbol{\xi}}\, C^{-1}\, \frac{\partial \boldsymbol{\mu}}{\partial \boldsymbol{\xi}^T} \right] \] (9.2)

The Cramér-Rao Bound (CRB) states that the variance of any unbiased estimator \(\hat{\xi}_i\) satisfies:

\[ \mathrm{Var}(\hat{\xi}_i) \;\geq\; \bigl[I^{-1}(\boldsymbol{\xi})\bigr]_{ii} \] (9.3)

Joint Delay-Doppler FIM for OFDM Sensing

For an OFDM waveform with \(N\) subcarriers (spacing \(\Delta f\)) and \(M\) symbols (PRI \(T_s\)), the received signal from a target at delay \(\tau\) and Doppler shift \(\nu\) is:

\[ y_{k,m} = \alpha \, s_{k,m} \, e^{j2\pi(k \Delta f \cdot (-\tau) + m T_s \cdot \nu)} + n_{k,m} \] (9.4)

where \(\alpha\) is the complex target reflectivity, \(s_{k,m}\) is the transmitted symbol on subcarrier \(k\), symbol \(m\), and \(n_{k,m} \sim \mathcal{CN}(0, \sigma_n^2)\). Stacking into the joint parameter vector \(\boldsymbol{\xi} = [\tau, \nu]^T\) and computing the partial derivatives yields the \(2\times2\) FIM:

\[ I(\tau,\nu) = \frac{2|\alpha|^2}{\sigma_n^2} \begin{bmatrix} 4\pi^2 \Delta f^2 \sum_{k,m} k^2 |s_{k,m}|^2 & -4\pi^2 \Delta f T_s \sum_{k,m} km \,|s_{k,m}|^2 \\[4pt] -4\pi^2 \Delta f T_s \sum_{k,m} km \,|s_{k,m}|^2 & 4\pi^2 T_s^2 \sum_{k,m} m^2 |s_{k,m}|^2 \end{bmatrix} \] (9.5)
Key insight — FIM coupling: The off-diagonal term \(I_{\tau\nu} = -4\pi^2 \Delta f T_s \sum_{k,m} km\,|s_{k,m}|^2\) is generally nonzero when the subcarrier index \(k\) is not zero-mean (i.e., the pilot grid is not symmetric about DC). This coupling means the delay and Doppler estimates are statistically correlated: an error in one parameter inflates the variance of the other. The off-diagonal entries appear in the inverse \(I^{-1}\), inflating both diagonal CRB entries. Symmetric pilot placement (equal power above and below DC) zeros this cross-term and decouples the bounds.

9.2 CRB vs SNR Analysis

For an OFDM system with \(N\) subcarriers carrying pilots at frequencies \(\{f_k\}\) with powers \(\{p_k\}\), define the effective mean-square bandwidth and observation duration:

\[ B_{\mathrm{eff}}^2 = \frac{\sum_k f_k^2\, p_k}{\sum_k p_k}, \qquad T_{\mathrm{eff}}^2 = \frac{\sum_m t_m^2\, p_m}{\sum_m p_m} \] (9.6)

The one-sided range (delay) and velocity (Doppler) CRBs become:

\[ \sigma_r^2 \;\geq\; \frac{c^2}{8\pi^2 \cdot \mathrm{SNR} \cdot B_{\mathrm{eff}}^2} \] (9.7)
\[ \sigma_v^2 \;\geq\; \frac{\lambda^2}{8\pi^2 \cdot \mathrm{SNR} \cdot T_{\mathrm{eff}}^2} \] (9.8)

Pilot Power Allocation: Uniform vs Edge-Weighted

Uniform pilots (\(p_k = P/N\) for all \(k\)):
\[B_{\mathrm{eff,unif}}^2 = \frac{1}{N}\sum_k f_k^2\] For \(N\) equally spaced carriers spanning \([-B/2, B/2]\): \[B_{\mathrm{eff,unif}}^2 \approx \frac{B^2}{12}\]
Edge-weighted pilots (all power on \(f = \pm B/2\)):
\[B_{\mathrm{eff,edge}}^2 = \frac{(B/2)^2 + (B/2)^2}{2} = \frac{B^2}{4}\] Edge-weighting gives a 3 dB lower CRB (factor of 3 improvement in \(B_{\mathrm{eff}}^2\)): \[\sigma_{r,\mathrm{edge}}^2 = \frac{1}{3}\,\sigma_{r,\mathrm{unif}}^2\]
Practical trade-off: Edge-weighted (two-tone) pilots maximize sensing CRB performance but leave interior subcarriers unlit, reducing comms throughput and raising PAPR. A practical compromise concentrates extra power at band edges while maintaining a baseline uniform pilot grid for channel estimation.
CRB Performance vs SNR — Range and Velocity Estimation
Range 1-sigma CRB (left axis) for B = 100 MHz and B = 400 MHz; velocity 1-sigma CRB (right axis) for Tobs = 1 ms and Tobs = 5 ms. Vertical dashed line marks the typical 5G NR operating point at SNR = 15 dB. All curves use \(\sigma_r = \sqrt{c^2/(8\pi^2 \cdot \mathrm{SNR} \cdot B^2)}\) and \(\sigma_v = \sqrt{\lambda^2/(8\pi^2 \cdot \mathrm{SNR} \cdot T_{\mathrm{obs}}^2)}\) with \(\lambda = c/f_c\), \(f_c = 28\) GHz.

9.3 Sensing-Comms Pareto Frontier

Consider a total transmit power budget \(P\) split between communication and sensing waveforms with splitting factor \(\alpha \in [0,1]\): \(P_c = \alpha P\) and \(P_s = (1-\alpha)P\). The joint ISAC optimization problem can be stated as a bi-objective program:

\[ \max_{\alpha,\,\mathbf{s}} \;\Bigl\{ C_{\mathrm{comms}},\; -\sigma_r^2 \Bigr\} \quad \text{s.t.} \quad P_c + P_s = P,\; 0 \leq \alpha \leq 1 \] (9.9)

For the scalarized single-parameter sweep, comms spectral efficiency (Shannon capacity with interference-free AWGN channel) and range CRB are:

\[ C(\alpha) = \log_2\!\left(1 + \frac{\alpha \cdot P \cdot |h|^2}{N_0 W}\right) = \log_2(1 + \alpha \cdot \gamma_0) \] (9.10)
\[ \sigma_r^2(\alpha) = \frac{c^2}{8\pi^2 \cdot (1-\alpha)\,\gamma_0 \cdot B_{\mathrm{eff}}^2} \;\propto\; \frac{1}{(1-\alpha)\,\gamma_0} \] (9.11)

The Pareto frontier is the set of operating points \((C(\alpha), \sigma_r^2(\alpha))\) as \(\alpha\) sweeps \([0,1]\). Three canonical operating regimes are identified:

Operating Point \(\alpha\) Comms SE (bps/Hz) Range CRB Application
Sensing-only 0 0 Minimum Radar / detection only
ISAC Balanced ≈ 0.5 Moderate Moderate V2X, NR with positioning
Comms-only 1 Maximum (\(\log_2(1+\gamma_0)\)) \(\rightarrow \infty\) eMBB, data-centric NR
The Pareto "knee": The frontier is convex in the \((C, \sigma_r^2)\) space. Near \(\alpha \to 1\), even a small increment \(\delta\alpha\) yields a tiny comms gain \(\delta C \approx \delta\alpha / (\ln 2 \cdot (1 + \gamma_0))\) while the sensing CRB diverges as \(\sigma_r^2 \propto (1-\alpha)^{-1}\). The "knee" is where the gradient \(d\sigma_r^2 / dC\) changes from gradual to steep — the marginal sensing cost per unit comms gain becomes unbounded.
Sensing-Comms Pareto Frontier (OFDM-ISAC)
Parametric Pareto curve as comms power fraction \(\alpha\) sweeps from 0 (sensing-only, left) to \(\alpha_{\max} = 0.98\) (comms-dominated, right). Range CRB plotted on log scale. Shaded region shows the feasible set (achievable but sub-optimal operating points). Marked points: sensing-only (\(\alpha=0\)), ISAC balanced (\(\alpha=0.5\)), and comms-dominant (\(\alpha=0.9\)). Reference SNR \(\gamma_0 = 20\) dB, \(B = 100\) MHz, \(f_c = 28\) GHz.

9.4 Practical Considerations

Coherent vs Incoherent Integration

When multiple OFDM symbols (pulses) illuminate the same target, SNR can be accumulated. The improvement depends on whether phase coherence is maintained across pulses:

\[ \mathrm{SNR}_{\mathrm{coh}} = N \cdot \mathrm{SNR}_1 \quad \text{(coherent, phase-aligned)} \] (9.12)
\[ \mathrm{SNR}_{\mathrm{incoh}} \approx \sqrt{N} \cdot \mathrm{SNR}_1 \quad \text{(incoherent, non-coherent energy detection)} \] (9.13)

Coherent integration provides a linear (10 dB/decade) gain vs. the \(\sqrt{N}\) (5 dB/decade) gain of incoherent integration. However, coherence requires:

Mismatched Filter and Sidelobe Masking

In practice, the sensing matched filter is implemented via 2D-IFFT over the delay-Doppler grid. Imperfect windowing produces range/velocity sidelobes. For a rectangular window:

\[ \mathrm{ISLR}_{\mathrm{rect}} \approx -13.3\;\mathrm{dB} \quad\text{(integrated sidelobe ratio)} \] (9.14)

Sidelobe masking occurs when a weak target falls within the sidelobe zone of a strong nearby target. The masking threshold in range is approximately:

\[ \Delta R_{\mathrm{mask}} \approx \frac{c}{2B} \quad\text{(one range resolution cell)} \] (9.15)
Summary table — Integration gain comparison:
Method SNR gain (\(N\) pulses) Phase knowledge required Doppler tolerance OFDM feasibility
Coherent \(N\) (linear) Yes (full phase) Low (short CPI) High (known pilots)
Non-coherent \(\sqrt{N}\) No High Always feasible
Post-detection \(\ll \sqrt{N}\) No Very high Fallback only

When is Coherent Integration Feasible?

For vehicular targets at \(f_c = 28\) GHz with radial velocity \(v\), the Doppler shift is \(f_D = 2v f_c/c\). The coherence time is \(T_c \approx 1/(2B_D)\). For a single point target at known velocity, the CPI can span several milliseconds. For extended or fluctuating targets (Swerling models), envelope fluctuation limits effective CPI to \(T_c\), beyond which non-coherent combining is preferred. OFDM has the advantage that the pilot phase is deterministic, enabling phase-coherent accumulation across symbols up to the channel coherence time.

Study Questions — Section 9
  1. FIM off-diagonal coupling: The off-diagonal FIM term \(I_{\tau\nu}\) is nonzero when the pilot subcarrier indices are not symmetric about DC (non-zero-mean \(k\)). What does this imply for a joint \((\hat\tau, \hat\nu)\) estimator? In particular, will the individual CRBs \([I^{-1}]_{\tau\tau}\) and \([I^{-1}]_{\nu\nu}\) be larger or smaller than their decoupled counterparts \(1/I_{\tau\tau}\) and \(1/I_{\nu\nu}\)? Hint: consider the Schur complement formula for \(2\times2\) matrix inversion.
  2. Uniform vs edge-weighted pilots: For \(N = 64\) uniformly spaced subcarriers spanning a bandwidth \(B\), show that \(B_{\mathrm{eff,unif}}^2 = B^2/12\) and \(B_{\mathrm{eff,edge}}^2 = B^2/4\). Conclude that edge-weighted pilots yield a CRB that is 3× lower (i.e., \(\approx 4.8\) dB better) in range estimation. What is the comms throughput penalty?
  3. Pareto "knee" explanation: For the parametric Pareto curve \((C(\alpha), \sigma_r^2(\alpha))\), compute \(d\sigma_r^2/dC\) and show it diverges as \(\alpha \to 1\). The "knee" exists because the Shannon capacity function \(\log_2(1+\alpha\gamma_0)\) saturates (logarithmic growth) while the sensing CRB \(\propto (1-\alpha)^{-1}\) diverges algebraically — small comms gains near saturation come at catastrophically increasing sensing cost.
10

3GPP Standardization Roadmap & Future Directions

10.1 TS 22.837 Service Requirements

3GPP TS 22.837 defines the service requirements for Integrated Sensing and Communication (ISAC) in 5G NR. These KPIs set hard targets that physical-layer designs must meet across heterogeneous deployment scenarios.

\[ P_D \;\geq\; 90\%,\quad P_{FA} \;\leq\; 10^{-4},\quad \sigma_r \;\leq\; 1\,\text{m (indoor)},\quad \sigma_v \;\leq\; 0.1\,\text{m/s},\quad \sigma_\theta \;\leq\; 1° \] (10.1)
KPI Hierarchy: TS 22.837 treats range accuracy as the primary discriminator. The 1 m indoor requirement implies a minimum signal bandwidth of \(B \geq c / (2 \cdot \Delta r) \approx 150\,\text{MHz}\), which is at the upper edge of FR1 NR bands and motivates FR2 (mmWave) deployments for precise sensing.
Table 10.1 — TS 22.837 ISAC Use Cases and Key Performance Requirements
Use Case Scenario Range Accuracy Velocity Accuracy Angular Accuracy PD Update Rate
Automotive — Pedestrian Detection Urban V2X ≤ 0.5 m ≤ 0.1 m/s ≤ 1° ≥ 99% ≥ 25 Hz
Smart Factory — Asset Tracking Indoor Industrial ≤ 1 m ≤ 0.5 m/s ≤ 3° ≥ 95% ≥ 1 Hz
Weather Sensing Suburban / Outdoor ≤ 3 m ≤ 0.05 m/s ≤ 5° ≥ 90% ≥ 0.1 Hz
Indoor Navigation Shopping Mall / Office ≤ 1 m ≤ 0.2 m/s ≤ 2° ≥ 90% ≥ 5 Hz
Gesture Recognition Smart Home / XR ≤ 0.1 m ≤ 0.01 m/s ≤ 1° ≥ 85% ≥ 100 Hz
Bandwidth Constraint: At FR1 (sub-6 GHz) with 100 MHz NR bandwidth, the Cramér–Rao lower bound for range resolution is \(\sigma_r \geq c/(2\sqrt{8\pi^2}\,\sigma_B)\). For \(B = 100\,\text{MHz}\), \(\sigma_B \approx 28.9\,\text{MHz}\) (uniform spectrum), giving \(\sigma_r \approx 0.83\,\text{m}\). The 1 m indoor requirement is marginally achievable only at high SNR; automotive ≤0.5 m demands FR2 bands.
TS 22.837 ISAC KPI Comparison by Use Case
Radar chart of normalised ISAC KPIs (0 = worst, 1 = best) across four 3GPP use cases. Gesture recognition leads in range & velocity accuracy; automotive dominates detection probability.

10.2 TR 38.837 Findings (Rel-18 Study Item)

3GPP TR 38.837 documents the Rel-18 ISAC Study Item outcomes, establishing the feasibility baseline for NR-based sensing and identifying the priority issues for the subsequent Rel-19 Work Item.

Key Findings — What Was Confirmed

  • Monostatic ISAC feasibility: Confirmed for NR gNB with self-interference cancellation in guard period (GP) symbols of TDD frames. Effective sensing window per half-frame \(\approx 2\text{–}4\) GP symbols.
  • SRS-based sensing baseline: Sounding Reference Signal repurposed as probing waveform. Existing SRS sequence design (ZC-based) provides adequate ambiguity function properties for monostatic radar.
  • No new RS for Phase 1: Study concluded existing NR reference signals (SRS, CSI-RS, SSB) are sufficient as sensing waveforms in Phase 1, avoiding spec disruption.
  • Bistatic uplink sensing: UL SRS received at neighbour gNB enables bistatic sensing with no UE modification — identified as a key low-disruption path.
  • Cross-link interference (CLI) management: Multi-cell sensing creates new CLI patterns; existing CLI mitigation from Rel-16 is a candidate foundation.

Gap Analysis — What Rel-19 Must Address

  • Sensing-specific signal design: Optimised waveforms (e.g., OCDM, DFRC precoding) not yet standardised; Phase 2 may need new RS.
  • Multi-target resolution: CFAR and MUSIC/ESPRIT parameter estimation procedures not part of the standard; left to implementation.
  • Measurement reporting: No standard interface for the gNB to report sensing results (range/Doppler/angle estimates) to core network or application layer.
  • Privacy framework: Sensing-as-surveillance risk; consent, anonymisation, and regulatory hooks missing from existing stage-1/2 specs.
  • Energy efficiency: Continuous sensing increases duty cycle and power consumption; sleep-mode interaction with sensing not yet defined.
  • Handover during sensing: Target continuity across cell boundaries requires sensing-aware handover triggers — a significant RRC-layer gap.
Half-Duplex Constraint on Monostatic Range: In TDD NR, the gNB transmits a sensing waveform in DL slots and must receive the echo in UL or GP. The minimum round-trip delay to avoid TX/RX overlap is one GP symbol duration \(T_{GP}\). For 30 kHz SCS, \(T_{GP} \approx 35.7\,\mu\text{s}\), giving a minimum range: \[ R_{min} = \frac{c \cdot T_{GP}}{2} \approx \frac{3\times10^8 \times 35.7\times10^{-6}}{2} \approx 5.4\,\text{km} \] This means short-range targets (\(R < 5.4\,\text{km}\) for 30 kHz SCS) fall inside the blind zone — a fundamental half-duplex constraint that bistatic or full-duplex (FD) architectures must overcome.

10.3 3GPP Release Timeline

ISAC standardisation follows a phased incremental approach within 3GPP, building on positioning enhancements from Rel-17 and maturing through Rel-19/20 work items.

Table 10.2 — 3GPP Release Timeline: Positioning & ISAC Milestones
Release Freeze Date Positioning / Sensing Milestone Key Specs / TRs ISAC Readiness
Rel-16 Q2 2020 NR Positioning baseline: DL-TDOA, UL-TDOA, E-CID TS 38.215, TR 38.855 Foundation layer (timing & angle refs)
Rel-17 Q3 2022 Enhanced NR positioning: Multi-RTT, DL-AoD, UL-AoA, SRS-based ranging TS 38.215 v17, TR 38.857 SRS sensing precursor; <1 m positioning demonstrated
Rel-18 Q1 2024 ISAC Study Item (SI): feasibility evaluation, use-case catalogue, KPI framework TR 38.837 SI complete; monostatic & bistatic feasibility confirmed
Rel-19 Q2 2025 ISAC Work Item (WI): Stage-2/3 specs, sensing signal design, measurement reporting TS 38.xxx (in progress) First normative ISAC specifications; SRS sensing procedures
Rel-20 Q3 2026 Advanced ISAC: multi-static coordination, RIS-aided sensing, AI/ML integration TR 38.8xx (planned) Multi-cell sensing; full DFRC waveform support expected
IMT-2030 (6G) 2030+ Native ISAC: sub-cm range, THz bands, XL-MIMO, cell-free architecture ITU-R IMT-2030 framework ISAC as first-class service; not a retrofit
3GPP ISAC Standardization Timeline
Gantt chart of 3GPP releases with positioning and ISAC activity windows (2021–2030). SI = Study Item, WI = Work Item. Bars indicate the active standardisation period per track.

10.4 6G Outlook — ISAC as a Native Service

The IMT-2030 framework (6G) elevates ISAC from an add-on capability to a first-class design goal. Sensing performance targets are orders of magnitude beyond current 5G NR.

\[ \underbrace{\sigma_r \leq 1\,\text{cm}}_{\text{sub-cm range}}, \quad \underbrace{\sigma_v \leq 0.1\,\text{mm/s}}_{\text{sub-mm/s velocity}}, \quad \underbrace{f_c \in [100\,\text{GHz},\,10\,\text{THz}]}_{\text{THz band}}, \quad \underbrace{N_{ant} \sim 10^3\text{–}10^4}_{\text{XL-MIMO}} \] (10.2)

6G ISAC Key Enablers

  • Extremely Large Antenna Arrays (XL-MIMO): Arrays of \(N \sim 10^3\) elements operate in the near-field regime (\(r \leq 2D^2/\lambda\)). Near-field beam focusing enables 3D sensing with spatial resolution \(\sim \lambda/D\) in both azimuth and range simultaneously.
  • Reconfigurable Intelligent Surfaces (RIS): Passive reflectors with programmable phase shift \(\phi_{mn}\) create virtual MIMO apertures without additional RF chains. Sensing coverage extended to NLOS targets. Effective bistatic geometry reconfigurable in software.
  • THz Communication & Sensing: Molecular absorption provides natural range gating in THz bands. Bandwidth \(B \sim 10\,\text{GHz}\) yields \(\Delta r \sim 1.5\,\text{cm}\) range resolution natively. Tradeoff: severe path loss limits operational range to \(\sim 10\text{–}100\,\text{m}\).
  • Cell-Free Architecture: Distributed access points (APs) connected via fronthaul act as a virtual multistatic radar network. Joint processing at CPU provides aperture diversity, eliminating monostatic blind zones entirely.
  • Integrated AI/ML Inference: On-device neural networks perform simultaneous channel estimation and target parameter extraction. Learned dictionaries replace MUSIC/ESPRIT in cluttered environments. Real-time semantic sensing (object classification, not just localisation).

5G ISAC vs 6G ISAC — Design Philosophy

Aspect5G NR (Rel-18/19)6G IMT-2030
Design approachRetrofit (comms-first)Native co-design
WaveformOFDM (CP-OFDM repurposed)OCDM / OTFS / custom
Range accuracy0.5–3 m< 1 cm
Antenna scale32–256 (mMIMO)10³–10⁴ (XL-MIMO)
Freq. bandFR1 + FR2 (≤100 GHz)Sub-THz / THz
DuplexHalf-duplex (TDD)Full-duplex (FD)
InterferenceManaged post-hocDesigned-in isolation
PrivacyNot addressedBuilt-in consent framework
Sensing as serviceSecondary / optionalTier-1 requirement

10.5 Open Research Challenges

Despite rapid progress, several fundamental challenges remain open for the research community before ISAC can be deployed at scale.

1. Multi-Cell Interference Coordination
Simultaneous sensing transmissions from adjacent gNBs create correlated clutter. Existing ICIC/eICIC frameworks are designed for communications SIR, not for sensing ambiguity-function sidelobe suppression. New coordination protocols are needed.
2. Privacy & Ethical Concerns
Passive sensing can track individuals without consent — a fundamental tension with GDPR and analogous regulations. Technical mitigations (anonymisation, resolution limiting, consent gating) are not yet standardised in any 3GPP spec.
3. Hardware Impairments at mmWave/THz
Phase noise \(\mathcal{L}(f)\), IQ imbalance, and non-linear PA distortion increase with carrier frequency. At THz bands, these introduce range-dependent sensing floors that cannot be calibrated out with existing NR procedures.
4. Beam Management for Sensing
NR beam management (BM) optimises SINR for communications beams. Sensing requires beams that illuminate the surveillance region with uniform sidelobe control — conflicting objectives. Dual-function beamforming under both constraints is NP-hard in general.
5. Integration with AI Inference
End-to-end neural network approaches (joint channel + target estimation) show promise but require massive labelled datasets and lack interpretability guarantees. Deployment in safety-critical contexts (automotive) demands certification pathways that do not yet exist.
6. Standardised Sensing Metrics
No consensus on a single figure of merit analogous to spectral efficiency for communications. Candidates — SINRsensing, CRB, ambiguity volume, sensing capacity — have different dependencies on waveform and scenario, making cross-paper comparison unreliable.

Study Questions

  1. CRB vs TS 22.837 indoor range requirement: TS 22.837 requires \(\sigma_r \leq 1\,\text{m}\) indoors. For NR at \(B = 100\,\text{MHz}\) with flat spectrum, the effective RMS bandwidth is \(\sigma_B = B/\sqrt{12} \approx 28.9\,\text{MHz}\). The range CRB is \(\sigma^{CRB}_r = c / (4\pi\sigma_B\sqrt{2\,\text{SNR}})\). Substituting: \(\sigma^{CRB}_r \approx 0.83 / \sqrt{2\,\text{SNR}}\,\text{m}\). For the 1 m requirement, \(\sqrt{2\,\text{SNR}} \geq 0.83\), i.e., \(\text{SNR} \geq -1.6\,\text{dB}\). This appears achievable, but practical NR pilots occupy only a fraction of the bandwidth (every 4th subcarrier for SRS), reducing \(\sigma_B\) by \(\sim 2\times\) and requiring SNR > 4 dB — tight but feasible for indoor small-cell deployments.
  2. Half-duplex constraint on monostatic range: A gNB transmitting sensing pulses in DL cannot simultaneously receive echoes. The receiver switches on only after the TX guard period. For 30 kHz SCS, GP \(\approx 35.7\,\mu\text{s}\), giving blind range \(R_{blind} = c \cdot T_{GP}/2 \approx 5.4\,\text{km}\). Targets closer than \(R_{blind}\) produce echoes that arrive while the gNB is still transmitting — these are completely masked. For urban macro cells (ISD \(\sim 500\,\text{m}\)), this is not an issue; for short-range applications (factory, gesture), a full-duplex or bistatic architecture is mandatory.
  3. Incremental (3GPP) vs native (6G) ISAC tradeoffs: The incremental 3GPP approach reuses existing OFDM waveforms and reference signals, enabling ISAC deployment on installed base without hardware changes — low cost, fast time-to-market, backward compatible. The native 6G approach co-designs waveform, multiple access, and beamforming from scratch for joint optimality, achieving superior performance (sub-cm range, full-duplex) but requires a clean-slate deployment with new devices and infrastructure. The incremental approach incurs a persistent performance penalty from CP overhead, half-duplex constraint, and suboptimal sensing waveforms; the native approach risks fragmentation if 6G standardisation timelines slip or market adoption is slow.

Notebook Summary — Key Formulas by Section

Quick-reference table covering all ten sections of this OFDM-ISAC notebook. Each row gives the section topic, the single most important formula, and a brief interpretation.

Table 10.3 — OFDM-ISAC Notebook: Key Takeaway Formula per Section
# Section Topic Key Formula Interpretation
1 OFDM Signal Model \(s(t) = \sum_{k=0}^{N-1} d_k \, e^{j2\pi k\Delta f\, t}\,\text{rect}(t/T_u)\) Multicarrier TX signal; \(\Delta f = 1/T_u\) ensures subcarrier orthogonality
2 Range-Doppler Processing \(\chi(\tau,\nu) = \int s(t)\,s^*(t-\tau)\,e^{-j2\pi\nu t}\,dt\) Ambiguity function; thumbtack shape for OFDM — range & Doppler decoupled
3 ISAC Waveform Design \(\min_{\mathbf{w}}\;\|\mathbf{R}_{xx}-\mathbf{R}_d\|_F^2 \;\text{s.t.}\; \text{SINR}_k \geq \gamma_k\) Dual-function beamforming: sensing beam pattern vs per-user communications SINR
4 Cramér–Rao Bounds \(\sigma^{CRB}_r = \frac{c}{4\pi\sigma_B\sqrt{2\,\text{SNR}}}\) Minimum achievable range std dev; set by RMS bandwidth, not resolution bandwidth
5 MIMO-ISAC \(\mathbf{Y} = \mathbf{H}_{SI}\mathbf{X} + \sum_{k}\alpha_k\,\mathbf{a}_r(\theta_k)\mathbf{a}_t^H(\theta_k)\mathbf{X} + \mathbf{N}\) MIMO receive model: comms channel + radar returns; need SIC to separate them
6 Sensing Channel Models \(h(\tau,\nu) = \sum_k \alpha_k\,\delta(\tau-\tau_k)\,\delta(\nu-\nu_k)\) Delay-Doppler channel; OTFS exploits this sparsity; OFDM degrades in high Doppler
7 Clutter & CFAR \(\Lambda(\mathbf{y}) = \frac{|\mathbf{a}^H\mathbf{y}|^2}{\mathbf{a}^H\hat{\mathbf{K}}_{cl}^{-1}\mathbf{a}} \underset{H_0}{\overset{H_1}{\gtrless}} \eta\) Whitened matched filter; \(\hat{\mathbf{K}}_{cl}\) estimated from reference cells (CFAR)
8 5G NR Sensing Signals \(r_{u,v}(n) = \frac{1}{\sqrt{N_{ZC}}}e^{-j\pi u n(n+1)/N_{ZC}}\,e^{j2\pi vn/N_{ZC}}\) SRS Zadoff–Chu base sequence; low PAPR, flat spectrum, ideal for monostatic sensing
9 Sensing Capacity & Trade-offs \(C_s + C_c \leq \log_2\det\!\left(\mathbf{I} + \frac{P}{N_0}\mathbf{H}_c\mathbf{H}_c^H\right)\) Sensing–communications capacity region bound; power split & beamforming set the Pareto front
10 3GPP Standardisation \(P_D \geq 90\%,\;\sigma_r \leq 1\,\text{m},\;\sigma_v \leq 0.1\,\text{m/s},\;\sigma_\theta \leq 1°\) TS 22.837 hard KPI targets that drive FR2/mmWave deployments for indoor ISAC
§A

3GPP ISAC Standardization Timeline

This appendix traces the complete standardization journey of Integrated Sensing & Communications through 3GPP — from the first workshop discussions to the active Rel-19 Work Item and the 6G horizon. Each milestone links to the responsible working group and the key output document.

2020 — Inception
First 3GPP Workshop Discussions on ISAC
SA1 · RAN
Industry players (Ericsson, Huawei, Nokia, ZTE) submit first contributions proposing ISAC as a 5G-Advanced study area. Initial use cases: automotive radar, indoor positioning, environmental sensing.
2021 Q1 — SA1#93-e
SA1 Study Item “FS_ISAC” Approved
SA1 FS_ISAC
SA WG1 formally approves feasibility study on ISAC. Scope: identify use cases, define KPIs (range, velocity, angular accuracy), assess network architecture impacts.
2021 Q3 — SA2 Study
SA2 Architecture Study Initiated
SA2 TR 23.700-97
SA WG2 studies 5GC architecture impacts: new sensing service function, exposure APIs (NEF), QoS for sensing data flows, multi-cell coordination procedures.
2022 Q1 — Rel-18 Stage 1 Freeze
TR 22.837 & TS 22.837 Completed — ISAC Service Requirements
SA1 TR 22.837 TS 22.837
Key KPIs finalized: range ≤1 m (indoor), velocity ≤0.1 m/s, AoA ≤1°, PD ≥90%, PFA ≤10−4. Eight priority use cases: automotive, smart factory, weather, gesture, asset tracking, indoor navigation, healthcare, UAV control. TS 22.837 normative requirements published as Rel-18 Stage 1.
2022 Q2 — RAN#96-e
RAN Study Item on NR ISAC Approved for Rel-18
RAN · RAN1 · RAN2 · RAN4 SI: NR_ISAC
RAN plenary approves the 18-month study. RAN1: signal design, sensing RS evaluation. RAN2: RRC/MAC procedures for sensing sessions. RAN4: SIC RF requirements, minimum performance.
2022–2023 — Study Phase
RAN1 / RAN2 / RAN4 Technical Evaluation
RAN1 · RAN2 · RAN4
Key conclusions: (1) SRS-based monostatic sensing feasible without new RS design; (2) PRS enables bistatic sensing with UE as passive receiver; (3) half-duplex constraint limits monostatic to TDD guard period (→ Rmax ~2–6 km); (4) no new UE sensing capability needed for Phase 1; (5) multi-cell interference coordination deferred to Rel-20.
2023 Q3 — RAN#101
TR 38.837 Completed — NR ISAC Feasibility Study
RAN1 TR 38.837
Technical report documents: ISAC channel models, RS performance comparison, monostatic / bistatic / passive feasibility results, SIC budget analysis, CRB performance benchmarks. Confirms sub-meter range accuracy achievable with 100 MHz SRS bandwidth at typical NR SNR.
2023 Q4 — RAN#102
Rel-19 Work Item Approved — NR ISAC Phase 1
RAN · RAN1 · RAN2 WI: NR_ISAC_Ph1
First normative ISAC specification work. Phase 1 scope: SRS enhancements for monostatic sensing, sensing measurement reporting framework, PRS-based bistatic sensing procedures, network-controlled sensing session management. Target specs: TS 38.211, 38.213, 38.300.
2024 — Rel-19 Active
Normative Specification Work in Progress
RAN1 · RAN2 · RAN4 TS 38.211 TS 38.213 TS 38.300
RAN1 defines sensing RS enhancements and measurement procedures. RAN2 specifies RRC signaling for sensing session establishment and reporting. RAN4 defines minimum performance requirements for sensing-capable gNBs. First sensing-specific IEs appear in 3GPP stage 2/3 ASN.1.
2025 (Expected) — Rel-19 Freeze
First Published ISAC Specifications
RAN · SA
Rel-19 ASN.1 freeze. First commercially deployable ISAC specifications covering monostatic (gNB self-sensing via SRS), bistatic (gNB-TX / UE-RX), and basic sensing session management. Implementations expected in 5G-Advanced radios 2026+.
2026+ — Rel-20 / 5G-Advanced Phase 2
Advanced ISAC — Multi-Cell, Multi-Static, AI-Assisted
RAN · SA1 · SA2
Expected scope: coordinated multi-cell sensing networks, AI/ML-based sensing processing (RAN Intelligent Controller integration), RIS-assisted sensing, advanced dual-function beam management, FR3 / sub-THz band sensing.
2028–2030 — 6G / IMT-2030
Native ISAC in 6G Air Interface
ITU-R WP5D · 3GPP SA1
ITU-R IMT-2030 framework includes sensing as a native 6G capability (not retrofit). Targets: sub-cm range accuracy, sub-mm/s velocity, 3D environmental mapping, joint comms–sensing waveform design from the ground up.

Key 3GPP Documents

DocumentWGReleaseStatusTopic
TR 22.837SA1Rel-18CompleteFeasibility study on ISAC service requirements & use cases
TS 22.837SA1Rel-18PublishedNormative ISAC service requirements — KPIs & use case categories
TR 23.700-97SA2Rel-18Complete5GC architecture study — sensing function, NEF exposure APIs
TR 38.837RAN1Rel-18CompleteNR ISAC feasibility — channel models, RS evaluation, SIC requirements
TS 38.211RAN1Rel-19In ProgressPhysical channels — SRS/PRS enhancements for sensing
TS 38.213RAN1Rel-19In ProgressPhysical layer procedures — sensing scheduling & measurement reporting
TS 38.300RAN2Rel-19In ProgressNR overall description — sensing session management & RRC procedures
TR 38.8xxRAN1Rel-20PlannedAdvanced ISAC study — multi-cell coordination, AI/ML sensing processing
IMT-2030 FrameworkITU-R WP5D6GIn Study6G vision — sensing as native IMT-2030 radio capability

Standardization Gantt

3GPP ISAC Work Items & Releases (2020–2030)
Blue = completed • Orange = in progress • Purple = planned. Dashed red line = approximate current date.
The Rel-18 study (TR 38.837) confirmed NR ISAC feasibility without new physical-layer channels — existing SRS and PRS are sufficient for Phase 1. The key innovation in Rel-19 is not new waveforms but new procedures: how the network configures, schedules, and reports sensing measurements within the existing NR framework.
  1. Why did 3GPP choose to reuse SRS/PRS for Rel-19 Phase 1 rather than defining a dedicated sensing signal? What are the performance trade-offs versus a purpose-built sensing RS?
  2. TR 38.837 deferred multi-cell sensing coordination to Rel-20. What technical challenges make this harder than single-cell monostatic sensing?
  3. SA1 requires ≤1 m range accuracy indoors. With NR FR1 limited to 100 MHz per carrier, how might carrier aggregation be used to meet this target under the CRB?