Modulation Classification

See discussions, stats, and author profiles for this publication at: https://www.researchgate.
net/publication/3479904
Survey of automatic modulation classification techniques: Classical

approaches and new trends
Article in IET Communications · May 2007

DOI: 10.1049/iet-com:20050176 · Source: IEEE Xplore
CITATIONS READS
662 2,340
4 authors, including:
Octavia A. Dobre Ali Abdi

Memorial University of Newfoundland New Jersey Institute of Technology
316 PUBLICATIONS 4,362 CITATIONS 144 PUBLICATIONS 5,132 CITATIONS
SEE PROFILE SEE PROFILE
Wei Su
US Army
67 PUBLICATIONS 2,669 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Spectrally-efficient transmission for long-haul optical networks View project
NOMA for 5G and Beyond View project
All content following this page was uploaded by Octavia A. Dobre on 08 October 2014.
The user has requested enhancement of the downloaded file.

Survey of automatic modulation classification
techniques: classical approaches and new trends
O.A. Dobre, A. Abdi, Y. Bar-Ness and W. Su
Abstract: The automatic recognition of the modulation format of a detected signal, the inter-
mediate step between signal detection and demodulation, is a major task of an intelligent receiver,
with various civilian and military applications. Obviously, with no knowledge of the transmitted
data and many unknown parameters at the receiver, such as the signal power, carrier frequency
and phase offsets, timing information and so on, blind identification of the modulation is a difficult
task. This becomes even more challenging in real-world scenarios with multipath fading,
frequency-selective and time-varying channels. With this in mind, the authors provide a compre-
hensive survey of different modulation recognition techniques in a systematic way. A unified
notation is used to bring in together, under the same umbrella, the vast amount of results and
classifiers, developed for different modulations. The two general classes of automatic modulation
identification algorithms are discussed in detail, which rely on the likelihood function and features
of the received signal, respectively. The contributions of numerous articles are summarised in
compact forms. This helps the reader to see the main characteristics of each technique.
However, in many cases, the results reported in the literature have been obtained under different
conditions. So, we have also simulated some major techniques under the same conditions, which
allows a fair comparison among different methodologies. Furthermore, new problems that have
appeared as a result of emerging wireless technologies are outlined. Finally, open problems and
possible directions for future research are briefly discussed.
List of symbols 1 timing offset

hl threshold
The principal symbols used are defined below.
F(i) theoretical feature-vector which corresp-
ai amplitude corresponding to the ith onds to the ith modulation
modulation
F^ sample extimate of the feature vector
a channel amplitude
fd frequency deviation
BW signal bandwidth
g(t) pulse shape, including the channel and the
cs (i),n,q nth-order/q-conjugate cumulant of the ith transmit filter
modulation
fgpgPp¼1 channel coefficients
c(i)
r,n,q,(g; tn21) nth order/q-conjugate cyclic cumulant of
g cycle frequency
r(t) under hypothesis Hi, at cycle
frequency g and delay vector tn21 gs signal-to-noise ratio
ĉr,n,q(tn21) sample estimate of the nth-order/ Hi ith hypothesis in a decision problem
q-conjugate cumulant at delay vector tn21 h(t) channel impulse response
D number of levels to which the timing ^ı decision on the modulation type of the
offset is quantised intercepted signal
d(., .) Euclidean distance K number of symbols
Df carrier frequency offset kn,q set of cycle frequencies
Dck phase difference between the phase L number of antennas at the receiver
sample at t ¼ (k þ 1)T and t ¼ kT L[r(t)jvi , Hi] conditional likelihood function of the
Es baseband signal energy noisy received signal r(t) under Hi , con-
Ep pulse energy ditioned on the unknown vector vi
LiA[r(t)] likelihood funtion under hypothesis Hi ,
computed by averaging over the
# The Institution of Engineering and Technology 2007
unknown quantities
doi:10.1049/iet-com:20050176
LiG[r(t)] likelihood funtion under hypothesis Hi ,
Paper first received 24th April 2005
computed using maximum likelihood
O.A. Dobre is with the Faculty of Engineering and Applied Science, Memorial
University of Newfoundland, St. John’s, NL, Canada A1B 3X5 estimates of the unknown quantities
A. Abdi and Y. Bar-Ness are with the Department of Electrical and Computer
Engineering, New Jersey Institute of Technology, Newark, NJ 07102, USA
LiH[r(t)] likelihood funtion under the hypothesis
W. Su is with US Army RDECOM Fort Monmouth, NJ 07703, USA
Hi , computed using maximum likelihood
E-mail: dobre@engr.mun.ca
IET Commun., 2007, 1, (2), pp. 137 –156 137

estimates of some unknown quantities, 1 Introduction
and averaging over others.
Automatic modulation classification (AMC) is an inter-
Mi number of equi-probable points in the ith mediate step between signal detection and demodulation,
signal constellation and plays a key role in various civilian and military appli-
ms (i),n,q nth-order/q-conjugate moment of the ith cations. Implementation of advanced information services
modulation and systems for military applications, in a crowded electro-
^ r,n,q(tn21)
m sample estimate of nth-order/q-conjugate magnetic spectrum, is a challenging task for communication
moment at delay vector tn21 engineers. Friendly signals should be securely transmitted
and received, whereas hostile signals must be located, ident-
m Hi mean of a feature estimate under
ified and jammed. The spectrum of these signals may range
hypothesis Hi
from high frequency (HF) to millimetre frequency band and
Nmod number of candidate modulations their format can vary from simple narrowband modulations
n order of the statistic to wideband schemes. Under such conditions, advanced
n(t) baseband noise techniques are required for real-time signal interception
and processing, which are vital for decisions involving elec-
N(., .) Gaussian distribution tronic warfare operations and other tactical actions.
vM,PSK statistic used to distinguish between Furthermore, blind recognition of the modulation format
M-ary and M 0 -ary phase shift keying of the received signal is an important problem in commer-
modulation, M 0 . M cial systems, especially in software defined radio (SDR),
v4,QAM statistic used to distinguish between two which copes with the variety of communication systems.
quadrature amplitude modulations signals Usually, supplementary information is transmitted to recon-
V average fading power figure the SDR system. Blind techniques can be used with
an intelligent receiver, yielding an increase in the trans-
P number of paths mission efficiency by reducing the overhead. Such appli-
0
P(ic ji) probability to declare that the i 0 th cations have emerged the need for flexible intelligent
modulation format has been sent, when communication systems, where the automatic recognition
the modulation format of the incoming of the modulation of a detected signal is a major task. A
signal is i simplified block diagram of the system model is shown in
Pcc average probability of correct Fig. 1. The design of a modulation classifier essentially
classification involves two steps: signal preprocessing and proper selec-
tion of the classification algorithm. Preprocessing tasks
Pe average probability of error
may include, but not limited to perform some or all of,
P(i)
e probability of error under hypothesis Hi noise reduction, estimation of carrier frequency, symbol
p(vijHi) a priori probability density function of vi period, and signal power, equalisation etc. Depending on
under Hi the classification algorithm chosen in the second step, pre-
pTX (t) transmit pulse shape processing tasks with different levels of accuracy are
required; some classification methods require precise esti-
w channel phase mates, whereas others are less sensitive to the unknown
K
ffkgk¼1 phase jitter parameters.
ck phase sample at t ¼ kT Regarding the second step, two general classes of AMC
q phase shift between two adjacent antenna algorithms can be crystallised, likelihood-based (LB)
elements [1 – 26] and feature-based (FB) [27 – 88] methods,
respectively. The former is based on the likelihood function
q number of conjugated terms of the received signal and the decision is made comparing
r(t) baseband received complex envelope the likelihood ratio against a threshold. A solution offered
rk output of the receive matched filter at by the LB algorithms is optimal in the Bayesian sense,
t ¼ kT viz., it minimises the probability of false classification.
The optimal solution suffers from computational complex-
S signal power
ity, which in many cases of interest naturally gives rise to
fs(i) K
k gk¼1 K complex transmitted data symbols suboptimal classifiers. In the FB approach, on the other
taken from the ith finite-alphabet modu- hand, several features are usually employed and a decision
lation format is made based on their observed values. These features are
s(t; ui) noise-free baseband complex envelope of normally chosen in an ad hoc way. Although an FB-based
the received signal method may not be optimal, it is usually simple to
sH2i variance of a feature estimate under the implement, with near-optimal performance, when designed
hypothesis Hi properly. Once the modulation format is correctly ident-
ified, other operations, such as signal demodulation and
ss2(i ) variance of the ith signal constellation information extraction, can be subsequently performed.
T symbol period In general, AMC is a challenging task, especially in a non-
tn21 delay vector cooperative environment, where in addition to multipath
u time-invariant carrier phase propagation, frequency-selectivity and time-varying nature
of the channel, no prior knowledge of the incoming signal
ui vector of unknown quantities on which is available.
the noise-free baseband complex envel- In recent years, new technologies for wireless communi-
ope depends cations have emerged. The wireless industry has shown
vi vector of the unknown quantities great interest in orthogonal frequency division multiplexing
138 IET Commun., Vol. 1, No. 2, April 2007

Interference Receiver
the number of equi-probable
Ð points in the ith signal
and jamming noise constellation, Ep ¼ 1 2
21jpTX(t)j dt as the pulse energy,
Input Output
symbols Modulator Channel Preprocessor Demodulator symbols with pTX (t) as the transmit pulse shape, Df is the
+ +
carrier frequency offset, u is the time-invariant carrier
Modulation
format phase, ffkgKk¼1 represent the phase jitter, fs(i) K
k gk¼1 are K
Classification
algorithm complex transmitted data symbols taken from the ith
Receiver
finite-alphabet modulation format, T is the symbol period,
1 denotes the timing offset with respect to the receiver refer-
Fig. 1 System block diagram ence clock, such that 0 1 , 1, and g(t) ¼ pTX(t) h(t),
with h(t) as the channel impulse response and as the con-
volution. For example, for a slowly-varyingP fading channel
(OFDM) systems, due to the efficiency of OFDM schemes with P independent paths, h(t) ¼ Pp¼1ap e jwpd(t 2 tp),
to transmit information in frequency selective fading chan- with d(.) as the Dirac delta function, such that fapgPp¼1,
nels, without complex equalisers [89, 90]. Multiple-input fwpgPp¼1
multiple-output (MIMO) systems have also received con- and ftpgPp¼1 are the amplitudes, phases and delays of the
siderable attention, owing to the significant capacity paths, respectively. We adopt the notation s(t; ui) to stress
increase they offer. Such emerging technologies in wireless the signal dependence on the unknown quantities, that is
communications have raised new challenges for the ui ¼ [ai Df u T 1 g(t) ffkgKk¼1 fs(i) K †
k gk¼1] , with † as the
designers of signal intelligence and SDR systems, such as, transpose. The term ‘unknown quantities’ refers to
discriminating between OFDM and single carrier modu- unknown parameters, such as the carrier frequency offset,
lations [91], identification of signals transmitted from mul- as well as unknown data symbols. Without loss of general-
tiple antenna systems, and so on. ity, unit variance constellations will be considered in the
Research on automatic classification of both digital and sequel, obtained by normalising the signal constellations.
analogue modulations has been carried out for at least two For example, for M-ary amplitude-shift keying
decades [1– 88]. Partial surveys of algorithms for identify- (ASK) the symbols are given by s(M-ASK) ¼ s(M-ASK) ,
k k,I
ing digitally modulated signals are given in [92] and [93]. (M-ASK)
sk,I [ f(2m 2 1 2 M )/ss (M-ASK), m ¼ 1, . . . , Mg,
Of course, many techniques have been developed, which k ¼ 1, . . . , K, for rectangular M-ary quadrature amplitude
are different from each other when it comes to details. modulation (QAM) s(M-QAM) (M-QAM)
¼ sk,I (M-QAM)
þ jsk,Q ,
k
However, general structures that connect a variety of appar- (M-QAM)
sk,I (M-QAM)
, sk,Q 1/2
[ f(2m 2 1 2 M )/ss (M-QAM), m ¼
ently different techniques can be identified. In this paper, we 1, . . . , M 1/2g, k ¼ 1, . . . , K, for M-ary phase-shift-
provide a unified comprehensive overview of what has been keying (PSK) s(M-PSK) ¼ ejum, um [ f2pm/M, m ¼ 0, . . . ,
k
accomplished so far in this area, highlighting the bottle- M 2 1g, k ¼ 1, . . . , K and for M-ary frequency-shift-
necks and challenging issues which need to be addressed keying (FSK) s(M-FSK) ¼ ej2pfmt, fm [ f(2m 2 1 2 M )fd ,
k
by further research. A comparison among the performance m ¼ 1, . . . , Mg, where the subscripts I and Q represent
of different LB and FB algorithms is also carried out, the in-phase (real) and quadrature (imaginary) parts,
emphasising the advantages and disadvantages of diverse respectively, ss2(i ) is the variance of the constellation
techniques. before normalisation, M is a power of 2, and fd is the fre-
The rest of the paper is organised as follows. In Section 2 quency deviation or spacing between any two adjacent
the signal model and classifier performance measures are FSK constellation points, which for orthogonal signalling
discussed. Sections 3 and 4 are devoted to LB and FB is a multiple of 1/2T (see, for example, [94] Chap. 4).
methods, respectively. Numerical performance assessments Note that the data symbols depend on t for FSK. For
and comparisons are provided in Section 5, and some con- others, such as ASK, PSK, QAM, s(i) k is constant for each
cluding remarks are given in Section 6. period (k 2 1)T to kT. To simplify the notation, sub-
sequently we use s(i) k without t also for FSK signals,
2 Signal model and performance measures unless otherwise mentioned. Note that n(t) in (1) is the
aggregate baseband complex noise, that is receiver noise,
AMC algorithms proposed in the literature employ infor- in addition to co-channel interferences and jammers.
mation extracted from either the received baseband wave- A classifier is supposed to correctly choose the modu-
form [1 – 9, 14 – 27, 41 –54, 58 – 64] or intermediate lation format of the incoming signal from a pool of Nmod
frequency [32 – 38, 55, 88]. A general expression for the candidate modulations, denoted by the integers
baseband received complex envelope is given by i ¼ 1, . . . , Nmod , or to decide that the modulation format
cannot be recognised. The latter case is not discussed
here, as not addressed in the literature. Owing to the lack
rðtÞ ¼ sðt; ui Þ þ nðtÞ ð1Þ
of space, we focus on algorithms for ASK, PSK, QAM,
and FSK classification. 0
where As a basic performance measure, let P(ic ji) denote the
(classification) probability to declare that the i 0 th
X
K modulation format has been sent, when the modulation
sðt; ui Þ ¼ ai e j2pDft e ju e jfk sðiÞ
k gðt ðk 1ÞT 1T Þ; format of the incoming signal is i. For i, i 0 ¼ 1, . . . , Nmod ,
k¼1 these probabilities can be arranged as a Nmod Nmod con-
0 t KT ð2Þ fusion matrix, where the diagonal element P(iji) c is the prob-
ability of correct classification for the ith modulation. In
classifying Nmod equi-probable modulations, the average
is the noise-free baseband complex envelope of the received probability of correct classification is defined by
signal. p
In (2) we have j 2 ¼ 21, ai ¼ (Es/ss2(i ) Ep ), with Es as N
X
PMi (i) 2 1
mod
the baseband signal energy, s2s (i ) ¼ M21

i m¼1jsm j as the Pcc ¼ Nmod PðijiÞ
c ð3Þ
variance of the ith zero-mean signal constellation, Mi as i¼1
IET Commun., Vol. 1, No. 2, April 2007 139

Obviously, one can use the complementary probabilities as quantities, assuming Hi is true, and then use these estimates
a performance measure, i.e. the probability of error under in a likelihood ratio test, as if they were correct. If
Hi , defined as P(i) (iji)
e ¼ 1 2 Pc , and the average probability maximum likelihood (ML) is used for estimates, the test
of error, defined as Pe ¼ 1 2 Pcc . Most of the AMC work is called GLRT. Obviously, GLRT treats the unknown
used Pcc , or equivalently, Pe , as a performance measure. quantities (including both the parameters and data
However, by using the confusion matrix, one gains more symbols) as deterministic unknowns, and the LF under Hi
insight into the classifier behaviour. is given by
Clearly, a desirable classifier should provide a high prob-
ability of correct classification in a short observation inter- LðiÞ
G ½rðtÞ ¼ max L½rðtÞjvi ; Hi ð6Þ
vi
val, particularly for a large range of signal-to-noise ratio
(SNR) range. In addition, it should satisfy these require- HLRT: This is a combination of the aforementioned
ments: capability to recognise many different modulations approaches, for which the LF under Hi is given by
in environments with diverse propagation characteristics, ð
robustness to model mismatches, real-time functionality,
and low computational complexity. LðiÞ
H ½rðtÞ ¼ max L½rðtÞjvi1 ; vi2 ; Hi pðvi2 jHi Þ dvi2 ð7Þ
vi1
3 Likelihood-based approach to AMC where vi ¼ [vi†1 vi†2]† and, vi1 and vi2 are vectors of unknown
quantities modelled as unknown deterministics and r.v.s,
Within the LB framework, AMC is a multiple composite respectively. Usually, vi1 and vi2 consist of parameters and
hypothesis-testing problem. The idea behind the LB-AMC data symbols, respectively.
is that the probability density function (PDF) of the Note that ALRT requires a multidimensional integration,
observed waveform, conditioned on the embedded modu- whereas GLRT requires a multidimensional maximisation.
lated signal, contains all information for classification. The difficulty of performing a multidimensional integration
Depending on the model chosen for the unknown quantities, for a large number of unknown quantities and the need for
three LB-AMC techniques are proposed in the literature: knowing the prior PDFs may render the ALRT impractical.
average likelihood ratio test (ALRT) [1 – 13, 22], general- Alternatively, maximisation over the unknown data
ised likelihood ratio test (GLRT) [14, 19, 20] and hybrid symbols in GLRT can lead to the same value of the LF
likelihood ratio test (HLRT) [14 –18, 21, 22]. Quasi for nested signal constellations, for example BPSK and
ALRT [3 – 5 7, 9 – 13] and quasi HLRT [22] are also QPSK, 16-QAM and 64-QAM, [14, 96 Chap. 6], which in
proposed in the literature. turn yields incorrect classification. Averaging over the
unknown data symbols in HLRT, however, removes the
ALRT: This approach treats the unknown quantities as nested constellations problem of GLRT. Finally we empha-
random variables (r.v.s) with certain PDFs. So, the likeli- sise that the estimates of the unknown quantities, as
hood function (LF) under the hypothesis Hi , representative by-products of GLRT and HLRT, are of interest for data
of the ith modulation, i ¼ 1, . . . , Nmod , is given by demodulation.
ð In a two-hypothesis classification problem, the decision is
LðiÞ
A ½rðtÞ ¼ L½rðtÞjvi ; Hi pðvi jHi Þ dvi ð4Þ made according to
H1
where L[r(t)jvi , Hi] is the conditional LF of the noisy Lð1Þ ð2Þ .

l ½rðtÞ=Ll ½rðtÞ , hl ; l ¼ AðALRTÞ; GðGLRTÞ; HðHLRTÞ
H2
received signal r(t) under Hi , conditioned on the unknown
vector vi , and p(vijHi) is the a priori PDF of vi under Hi . ð8Þ
The known PDF of vi enabled us to reduce the problem to
a simple hypothesis-testing problem by integrating over vi . where hl is a threshold. The left-hand side is referred to as
For a baseband complex additive white Gaussian noise the likelihood ratio and the test is called average likelihood
(AWGN) in (1), the conditional LF is given by (see, for ratio test (ALRT), generalised likelihood ratio test (GLRT)
example, [94] Chap. 6) and hybrid likelihood ratio test (HLRT), respectively,
ð KT depending on the method employed to compute the LF.
L½rðtÞjvi ; Hi ¼ exp 2N01 Re rðtÞs ðt; ui Þ dt Extension of (8) to multiple classes is straightforward
0 (see, for example, [95] Chap. 2 and [96] Chap. 3 and 6).
ð KT Equivalently, the log function can be applied to both
N01 jsðt; ui Þj2 dt ð5Þ members of the inequality (8). Accordingly, the terms
0 log-likelihood ratio and log-likelihood ratio test are used.
Table 1 lists several LB-AMC algorithms proposed in the
where N0 is the two-sided power spectral density (PSD) literature, emphasising the type of modulations, unknown
of AWGN in W/Hz, with the autocorrelation parameters, and channel used. A simplified signal model
Efn(t)n (t þ t)g ¼ N0d(t) such that Ef.g is expectation of that given in (1) was considered in the literature, as
and denotes the complex conjugate. Furthermore, here follows. The transmit pulse shape was assumed rectangular,
vi ¼ [u†i N0]†, and Ref.g stands for the real part. If the that is pTX (t) ¼ uT (t), where uT (t) ¼ 1 for 0 t , T and
chosen p(vijHi) is the same as the true PDF, ALRT results zero potherwise. So, Ep ¼ T and with s2s (i ) ¼ 1, one obtains
in an optimal classifier in the Bayesian sense. a ¼ S, in which S ¼ Es/T is the signal power. With all
GLRT: In this approach the unknown parameters are treated this and h(t) ¼ aejwd(t), with a and w constant over the K
as unknown deterministics. The best performance is symbol interval, (2) can be written as
achieved by the so-called uniformly most powerful sðt; ui Þ
(UMP) test [95]. For a necessary and sufficient condition
for the existence of an UMP test see, for example, [95] pffiffiffi X
K
(Chap. 2). When an UMP test does not exist or hard to ¼ a S ejðuþwÞ ej2pDft ejfk sðiÞ
k uT ðt ðk 1ÞT 1T Þ ð9Þ
derive, a logical procedure is to estimate the unknown k¼1

Table 1: Summary of likelihood-based classifiers
Author(s) Classifier(s) Modulations Unknown parameter(s) Channel
Sills [1] ALRT BPSK, QPSK, 16QAM, V29, carrier phase u AWGN
32QAM, 64QAM
Wei and Mendel [2] ALRT 16QAM, V29 – AWGN
Kim and Polydoros [3, 4] quasi-ALRT BPSK, QPSK carrier phase u AWGN
Huang and Polydoros [5] quasi-ALRT UW, BPSK, QPSK, 8PSK, 16PSK carrier phase u and timing offset 1 AWGN
Sapiano and Martin [6] ALRT UW, BPSK, QPSK, 8PSK – AWGN
Long et al. [7] quasi-ALRT 16PSK, 16QAM, V29 carrier phase u AWGN
Hong and Ho [8] ALRT BPSK, QPSK signal level a AWGN
Beidas and Weber [9] ALRT quasi-ALRT 32FSK, 64FSK phase jitter ffkgKk¼1 AWGN
Beidas and Weber [10, 12] ALRT quasi-ALRT 32FSK, 64FSK phase jitter ffkgKk¼2 and timing AWGN
offset 1
Panagiotu et al. [14] GLRT HLRT 16PSK, 16QAM, V29 carrier phase u AWGN
Chugg et al. [15] HLRT BPSK, QPSK, OQPSK carrier phase u, signal power S and AWGN
PSD N0
Hong and Ho [16] HLRT BPSK, QPSK signal level a AWGN
Hong and Ho [17, 18] HLRT BPSK, QPSK angle of arrival w AWGN
Dobre et al. [21] HLRT BPSK, QPSK, 8PSK, 16PSK, channel amplitude a and phase q flat fading
16QAM, 64QAM
Abdi et al. [22] ALRT quasi-HLRT 16QAM, 32QAM, 64QAM channel amplitude a and phase w flat fading

These modulations have been used in the simulations in the original papers
where a ¼ 1 and w ¼ 0 when there is no fading. We define computed by averaging over the constellation points
the observed per-symbol SNR gs ¼ VST/N0 , in which corresponding to the ith modulation format. This is done
V ¼ 1 when there is no fading. Otherwise, V ¼ Efa2g is by substituting (5) into (4) (see (37), in the Appendix)
the average fading power, with a as the channel amplitude.
Under fading, S is set equal to one. Y
K pffiffiffi
LðiÞ
A ½rðtÞ ¼ EsðiÞ fexp½2a S N01 RefRðiÞ
k g
k
k¼1
3.1 ALRT-based algorithms
a2 STN01 jsðiÞ 2
k j g ð10Þ
In this section, optimal and suboptimal ALRT-based algor-
where Es(ik )f.g is nothing but a finite summation over all the
ithms applicable to identify both linearly modulated and
Mi possible constellation points of the ith modulation,
FSK signals under various conditions will be presented
divided by Mi , for the kth interval. Furthermore
[Note 1], in addition to ALRT-based classifiers specific to
linear modulation and FSK, respectively. Suboptimal classi- ð kT

fiers are obtained based on the approximations of the LFs at RðiÞ
k ¼ rðtÞsðiÞ
k ðtÞuT ðt ðk 1ÞT Þ dt
low SNR. Interestingly, several FB classifiers are shown to ðk1ÞT
be simplified versions of such suboptimal structures [5]. ð kT

Hence, decision theory can be perceived as a rigorous ¼ rðtÞsðiÞ
k ðtÞ dt; k ¼ 1; . . . ; K ð11Þ
framework that justifies the selection of features in some ðk1ÞT
FB methods. Note that for linear modulations s(i) k (t) is constant over
(i)
the period
Ð kT (k 2 1)T to kT and thus, R(i)
k ¼ sk rk , with
3.1.1 ALRT-AMC for linearly modulated and FSK rk ¼ (k21)T r(t) dt the output of the receive matched
signals: filter at t ¼ kT.
ALRT-based classifiers: With all parameters perfectly With multiple antennas at the receiver, AWGN and block
known [Note 2], that is vi ¼ [fs(i) K †
k gk¼1] , ALRT leads to a fading, and all parameters perfectly known, that is
structure whose performance is better than all the others, vi ¼ [fs(i) K †
k gk¼1] , the LF is given by [22]
which have to deal with some unknown
parameters. Therefore, the performance of this classifier Y
L Y
K n h pffiffiffi
can be considered as a benchmark. The data symbols LðiÞ
A ½rðtÞ ¼ EsðiÞ exp 2a‘ S N01 Refejw‘ RðiÞ
k;‘ g
k
fs(i) K
k gk¼1 are treated as independent and identically
‘¼1 k¼1
io
distributed (i.i.d.) r.v. s. The LF under hypothesis Hi is a2‘ STN01 jsðiÞ 2
k j ð12Þ
Note 1: Owing to the lack of space, we give details of some of the algorithms,
especially those used in the comparative study of the AMC algorithms in
where L is the number of antennas at the receive-side, a‘ e jw‘
Section 5, whereas we only mention others.
(i)
Ð kT process(i)on each branch, ‘ ¼ 1, . . . , L, and
is the fading
Note 2: From now on, we set the known parameters to some fixed numerical Rk,‘ ¼ (k21)T r‘(t)sk (t) dt, k ¼ 1, . . . , K, ‘ ¼ 1, . . . , L,
values. In AWGN channel, with all parameters perfectly known, u ¼ Df ¼ with r‘(t) ¼ s‘ (t; ui) þ n‘(t) as the received signal on the
1 ¼ w ¼ ffkgKk¼1¼0 and a ¼ 1, whereas in a block fading channel Df ¼
1 ¼ ffkgKk¼1 ¼ 0, S ¼ 1, and u is included into w. Of course, the unknowns ‘ th branch, s‘ (t; ui) as the noise free envelope, and
will be put into the vector vi . n‘(t) as the zero-mean AWGN, with the PSD N0 .
The expression for the noise free envelope on the ‘ th branch, [Note 2]. The probability of error under Hi is shown to be [2]
s‘ (t; ui), can be easily written similar to (9) [Note 3]. Note
L ð1 ð1 ð1 ð1
that both the noise fn‘(t)g‘¼1 and fading processes
jw‘ L
fa‘ e g‘¼1 among the L diversity branches are assumed to PðiÞ
e ¼1 pffiffiffi pffiffiffi pffiffiffi pffiffiffi
be independent. Actually, a maximal ratio combining K mi;1 K mi;i1 K mi;iþ1 K mi;Nmod
(MRC) was used here to combine the received signals. In |fflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflfflffl}
Nmod 1
fading channels such a structure takes advantage of the
array gain, as well as diversity gain (see, for example, [97] pðbi jHi Þdbi ; i ¼ 1; . . . ; Nmod ð16Þ
Chap. 5), and thus, performance improvement is expected
when compared with a single antenna classifier. However,
as one can easily notice, when in addition to the unknown where
p
data symbols, there are other unknown parameters, e.g. bi ¼ (ai 2 Kmi)/ K, ai ¼ [ai,1 . . . ai,i21ai,iþ1 . . . ai,Nmod]†,
PK
fa‘gL‘¼1 and fw‘gL‘¼1, integration over these parameters ai,j ¼ k¼1 [ln(L(i) ( j)
A (rk)) 2 ln(LA (rk))], j = i, L
becomes more difficult and, the implementation of a multi- (i) p 21 (i) 21 (i) 2
A (rk) ¼ Es(ik )fexp[2 (S)N0 Refrksk g 2 STN0 jsk j ]g,
antenna ALRT-based classifier turns out to be even more †
mi ¼ [mi,1 mi,i21 mi,iþ1 . . . mi,Nmod] , and
complex.
In AWGN, with vi ¼ [u fs(i) K † mi,j ¼ Efln(L(i) ( j)
A (rk)) 2 ln(LA (rk))jHjg, j = i.
k gk¼1] and uniform distri-
bution for u over [2 p, p), representing no prior knowledge The (Nmod 2 1) 1 vector bi is shown to be the sum of K
of the time-invariant phase, the LF can be shown to be [21] i.i.d. random vectors which satisfies the multivariate
(see (38), in the Appendix) [Note 2] central limit theorem if K is large. Therefore, p(bijHi) was
considered as a multivariate Gaussian density, with zero
n 1 ðiÞ pffiffiffi o mean and covariance matrix, Efbib†i jHig, depending on the
LðiÞ
A ½rðtÞ ¼ EfsðiÞ gK eSTN0 hK
I0 ð2 S N01 jjðiÞ
K jÞ ð13Þ first and second order statistics of ln(L(i) ( j)
A (rk)) 2 ln(L A (rk))
k¼1
k
[2]. The integral in (16) was numerically calculated for
V. 29, 16-QAM, 32-QAM and 64-QAM, for K ¼ 100, 200
where the notation EfsðiÞk gKk¼1 emphasises that
PKthe averaging and 1000 [2]. Note that V.29 is a special QAM modulation,
is performed over K data symbols, h(i) K ¼
(i) 2
k¼1jsk j , I0(.) with 16 points in the signal constellation [2].
is the zero-orderPmodified Bessel function of the first
kind, and jK(i) ¼ k¼1 K
R(i)
k . Obviously, such a classifier is Quasi-ALRT-based classifier: A synchronous classifier
difficult to implement, as it requires MKi data sequences to (1 ¼ 0) can be simply transformed into an asynchronous
compute the LF under the hypothesis Hi . one, with the timing offset 1 as a uniformly distributed
In AWGN, under the assumption of per-symbol r.v. over [0, 1), using the following approximation of the
phase-incoherence owing to phase jitter, that is u ¼ 0 and LF [9, 10, 13]
vi ¼ [ffkgKk¼1 fs(i) K † K
k gk¼1] , with ffkgk¼1 as i.i.d. uniform
r.v.s, it can be easily shown that the LF is given by
X
D1
(see (39), in the Appendix) [Note 2] LðiÞ 1
A ½rðtÞ ’ D L½rðtÞj1d ; Hi ð17Þ
d¼0
Y
K n 1 ðiÞ 2 pffiffiffi o
LðiÞ
A ½rðtÞ ¼ EsðiÞ eSTN0 jsk j I0 ð2 S N01 jRðiÞ
k jÞ ð14Þ
k¼1
k where D is the number of levels to which the timing offset is
quantised and 1d ¼ d/D, d ¼ 0, . . . , D 2 1. For such a
scenario, R(i)
k Ð, as defined in (11), needs to be replaced
In a slowly-varying flat Rayleigh fading channel [Note 2], kT þ1 T
by Rk(1d) ¼ ðk1ÞTd þ1 T r(t)s(i)
k (t)uT(t 2 (k 2 1)T 2 1dT ) dt.
characterised via a Rayleigh-distributed a and uniform d
This approximation improves as D ! 1, since the sum-
w, such that vi ¼ [a w fs(i) K †
k gk¼1] , the LF is given by [21] mation converges to an integral. The value of D directly
(see (42), in the Appendix) determines the classifier complexity, as introduces more
terms in (17). Note that a similar approximation can be
LðiÞ also used when the carrier phase u is unknown, by discretis-
A ½rðtÞ
( " #) ing the range of its values.
1 VN02 jjðiÞ
Kj
2
¼ EfsðiÞ gK exp ð15Þ
k k¼1
1 þ VTN01 hðiÞ
K 1 þ VTN01 hðiÞ
K 3.1.2 ALRT-AMC for linearly modulated signals: In
the sequel we present various classifiers for linear modu-
lation classification under different conditions, such as a
One can notice that the LF depends on the average fading differential ALRT algorithm designed for unknown carrier
power V, assumed perfectly known. phase, quasi ALRT classifiers designed also for unknown
The ALRT-based classifiers are implemented by repla- carrier phase, as well as unknown carrier phase/timing
cing the expression of the LF given in (10), (12), (13), offset, and so on.
(14) and (15), respectively, in (8), with hA ¼ 1; such a clas-
sifier is called the ML classifier. Differential ALRT with unknown carrier phase: A
differential data solution was proposed in [1] to
Performance analysis of the ML classifier: Theoretical per- classify linearly modulated signals in AWGN channel,
formance analysis of the ML classifier was performed in [2] with the unknown carrier phase uniformly distributed
for no unknown parameters, when identifying linearly over [2 p, p), that is vi ¼ [u fs(i) K †
k gk¼1] . The joint
modulated signals in AWGN, with the LF given in (10) PDF of the magnitude jrkj, and phase difference
Dck ¼ jckþ1 2 ckjmod2p , such that tan ck ¼ rk,Q/rk,I
Note 3: With a multi-antenna classifier, in AWGN channel and all parameters and rk ¼ rk,I þ jrk,Q , was used to derive the LF. Under the
L
perfectly known, we set u ¼ Df ¼ f1‘g‘¼1 ¼ fw‘gL‘¼1 ¼ ffkgKk¼1¼0 and
L
fa‘g‘¼1 ¼ 1, whereas with a flat block-fading channel Df ¼ f1‘gL‘¼1 ¼ assumption that the signal amplitude and phase difference
ffkgKk¼1 ¼ 0, S ¼ 1 and u is included into w‘, ‘ ¼ 1, . . . , L. are approximately independent r.v.s at large SNRs, the LF
Table 2: Theoretical moments for several unit-variance signal constellations with equiprobable symbols
BPSK QPSK 8-PSK 16-PSK V.29 16-QAM 64-QAM
ms (i ),2,0 1 0 0 0 0 0 0
ms (i ),2,1 1 1 1 1 1 1 1
ms (i ),4,0 1 1 0 0 0.5158 20.68 20.619
ms (i ),4,1 1 0 0 0 0 0 0
ms (i ),4,2 1 1 1 1 20.5816 1.32 1.38
ms (i ),6,0 1 0 0 0 0 0 0
ms (i ),6,1 1 1 0 0 21.5243 21.32 21.298
ms (i ),6,2 1 1 0 0 0 0 0
ms (i ),6,3 1 1 1 1 1.4897 1.96 2.22
ms (i ),8,0 1 1 1 0 25.6304 2.2 1.91
ms (i ),8,1 1 0 0 0 0 0 0
ms (i ),8,2 1 1 0 0 9.4585 22.48 22.75
ms (i ),8,3 1 0 0 0 0 0 0
ms (i ),8,4 1 1 1 1 28.5370 3.12 3.96
is given by [1] finally y n22q is 1 if q ¼ n/2 and 2 if q , n/2. Equation

(19) can be further simplified, as for symmetric constella-
n o tions the odd order moments are equal to zero [63]. The right-
Y
K 1
ðiÞ ðiÞ
LðiÞ EsðiÞ ;sðiÞ pðjrk j sðiÞ hand side of (19) is actually a measure of the correlation
A ½rðtÞ ¼ k ;Hi ÞpðDck sk ; skþ1 ; Hi Þ
k¼1 k kþ1 between the theoretical moments of the ith constellation
and sample estimates. Theoretical values of the nth order/
ð18Þ q-conjugate moment ms (i ),n,q , with n ¼ 2, 4, 6, 8 and
q ¼ 0, ... , bn/2c, are given in Table 2, for different signal
in which jrkj was taken as Ricean-distributed and, for constellations. These values were computed as ensemble
large SNRs, Dck was approximated by a Gaussian PDF. average over the ideal-noise free constellation under the con-
Owing to the lack of space we omit here the expressions straint of unity variance and the assumption of equiprobable
for the PDFs of jrkj and Dck . For details see [1], p. 219. symbols. As one can notice from Table 2, the lowest order
The advantage of using the phase difference instead of statistic to distinguish between M-PSK and M 0 -PSK
phase itself is that the effect of a time-invariant phase (M 0 . M ) is the Mth-order/zero-conjugate moment,
offset will be mitigated. However, the classifier perform- ms (i ),M,0 . Alternatively, also from Table 2, this property
ance can still be degraded owing to the phase jitter. The does not hold for QAM signals. A general rule for QAM
differential ALRT-based classifier was implemented with signals is that ms (i ),n,q ¼ 0 when n is a multiple of four
the decision rule given in (8) (hA ¼ 1) and the expression (n ¼ 4 and 8 in Table 2) and q is odd or n is not a multiple
of the LF given in (18). of four (n ¼ 6 in Table 2) and q is even.
By resorting to only the lowest order statistics, small n s,
Quasi-ALRT with unknown carrier phase [Note 4]: With suboptimal but implementationally manageable classifiers
vi ¼ [u fs(i) K †
k gk¼1] , where u is uniformly distributed over were proposed to discriminate PSK and QAM in [3– 5, 7],
[2 p, p), approximations of the LF were developed in respectively. Based on the aforementioned property of
[3– 5, 7] for AWGN channel, leading to suboptimal classi- moments for PSK signals, Polydoros et al. proposed a
fication structures. A low SNR approximation of the LF for binary decision tree classifier for PSK signals [3– 5],
PSK and QAM signals is given by [Note 5] where the decision at each node was made by comparing
X 1
pffiffiffi
the following metric
n bn=2c
X
LðiÞ ½rðtÞ ’ exp S N 1
K ½yn2q ðq!ðn qÞ!Þ1
A 0 X K
M
n¼1 q¼0 vM;PSK ¼ rk ð20Þ
k¼1
^ r;n;q ð0n1 Þj
jmsðiÞ ;n;q jjm ð19Þ 0
against a threshold, denoted here by h(M-PSK,M
vM
-PSK)
. As an
example, see the Appendix for the derivation of (20) for
where ms (i ),n,q ¼ Es (i )f(s (i))n2q(s (i) )qg is the nth-order/ BPSK/QPSK classification. The decision rules involving
q-conjugate
P moment of the ith constellation, m ^ r,n,q(0n21) ¼ the approximations of the LFs require appropriate thresholds.
K 21 Kk¼1 rn2q
k (rk)q is the sample estimate of the In order to maximise the probability of correct classification
nth-order/q-conjugate moment at zero-delay vector 0n21 when discriminating between the M-PSK and M 0 -PSK
[Note 6], b.c denotes rounding up to the nearest integer, and modulations, the threshold hvM (M-PSK,M 0 -PSK)
was chosen to
satisfy
Note 4: The quasi ALRT classifier was originally derived as an LB method.

However, it can be considered as an FB technique as well. 0 0 0
-PSKÞ
Note 5: Equation (19) can be easily obtained from (5) given in [7], by using the pðM-PSKÞ hðM-PSK;M
vM ¼ pðM -PSKÞ hðM-PSK;M
vM
-PSKÞ
signal moments.
Note 6: For the definition of the nth-order, q-conjugate moment, mr,n,q(tn21) and ð21Þ
the cumulent, cr,n,q(tn21) of a stationary random process, as well the relations
between the moments and cumulants, see, e.g. [98] and [99] and Chap. 2. A 0
tn210n21 delay vector is an (n 2 1)1 vector, with all the elements equals to where p (M-PSK)(vM) and p (M -PSK)(vM) are the PDFs of the
zero. metric vM under the hypothesis that M-PSK and M 0 -PSK
PSK modulated
signal
Yes ν 2, PSK > η(νBPSK

2
, M ' PSK )
No
M'>2
BPSK Yes No
ν 4, PSK > η(νQPSK
4
, M ' PSK )
M'>4
QPSK
PSK , M ' PSK )
Yes ν8, PSK > η(8
ν8 No
M '>8
8-PSK 16-PSK,
32-PSK, etc.
Fig. 2 Binary decision tree for PSK signals using the quasi-ALRT classifier
are the modulation formats of the incoming signal, respect- By comparing (20) and (23) with (13), one can say that
ively. A closed-form solution was derived in [5] for classify- the complexity of a quasi-ALRT classifier is much less
ing PSK signals with unknown carrier phase. Under the than that of the ALRT classifier, as it needs neither an
assumption of a large number of available symbols K, this averaging operation nor the computation of the Bessel
threshold was approximated by [Note 7] function.
0
-PSKÞ Quasi-ALRT with unknown carrier phase and timing offset:
hðM-PSK;M ¼ KS M=2 T M =2 ð22Þ
vM In AWGN, with vi ¼ [u 1 fs(i) K †
k gk¼1] , where the carrier phase
u and timing offset 1 are uniformly distributed over [2 p, p)
As one can notice, the threshold depends on the signal power, and [0, 1), respectively, the following statistic was used to
S. In the sequel we denote it by hQA,TE , where QA and TE distinguish between M-PSK and M 0 -PSK [5]
stand for quasi-ALRT and theoretical, respectively. An
example of a binary decision tree classifier used for PSK D1 X
X K
M
signals is shown in Fig. 2. nM;PSK ¼ rk ðd=DÞ ð24Þ

As previously explained, the nth order/q-conjugate d¼0 k¼1
moments for QAM signals do not have the attractive prop- Ð
erty used to devise a binary decision tree for PSK signal with rk(1d) ¼ kTþ1 dT
(k21)T þ 1dT r(t)uT(t 2 (k 2 1)T 2 1d T ) dt.
identification. Using (19) and the results given in Table 2, Similar to Fig. 2, a binary decision tree classifier was
it can be easily noticed that the lowest order statistics employed for PSK signal classification, with (24) compared
which can be used for QAM signals classification is of against a threshold. The threshold was empirically set,
order n ¼ 4 (q ¼ 0, 2). Following the same procedure as following the histogram method.
in the example given in the Appendix for BPSK and ALRT with unknown signal level: An ALRT algorithm was
QPSK signals, it can be easily shown that the lowest developed in [8] to identify PSK signals in AWGN, with
order metric which can be used to distinguish between vi ¼ [a fs(i) K †
k gk¼1] , where the signal level a is a
any two QAM signals is given by Rayleigh-distributed r.v. The decision rule given in (8)
was employed, with the threshold hA set to one.
X K X K
4 4
n4;QAM ¼ A rk þ B jrk j ð23Þ Miscellaneous algorithms: Numerical calculation of the
k¼1 k¼1
integrals in ALRT using a Markov chain Monte Carlo
method was performed in [24]. In the algorithms previously
where the coefficients A and B depend on the theoreti- described, AMC was treated as a hypothesis testing problem
cal values of the fourth-order/zero- and two-conjugate with a fixed number of received symbols. By formulating
moments of the QAM signals, respectively. Such a metric the AMC problem as a variable sample size hypothesis
was used in [7] to discriminate between 16-QAM and testing problem, an algorithm based on the sequential prob-
V.29, with A ¼ 0.0135 and B ¼ 20.0246. The decision ability ratio test was proposed in [25, 26].
was made by comparing the metric against a threshold,
which was empirically set. This threshold is set to maximise
the average probability of correct classification over a large 3.1.3 ALRT-AMC for FSK signals
number of data and noise realisations. It is assumed that ALRT-based classifier under the assumption of per-symbol
such simulations can be run off-line and the threshold can phase-incoherence: In AWGN, under the assumption of
be stored as a function of the noise and signal parameters. per-symbol phase-incoherence, that is vi ¼ [ffkgKk¼1
In a practical implementation, the threshold is therefore fs(i) K † K
k gk¼1] , with ffkgk¼1 as i.i.d. uniform r.v.s, the LF of
(i)
obtained from a look-up table. We denote this threshold (14), with jsk j ¼ 1, was derived in [9]. As one can easily
by hQA,E , where QA and E stand for quasi-ALRT and notice from (11) and (14), the implementation of such
empirical, respectively. classifier requires the explicit calculation of the Fourier spec-
trum of the received waveform at a set of Mi candidate fre-
p quencies [9]. The signal bandwidth of an Mi-FSK signal is
Note 7: The original presentation in [4] used 2rk/ (N0T) instead of rk in the
metric inp(20). To account for this difference, the threshold we give in (22) in defined as BWi ¼ MiTi21, with Ti as the symbol period
indeed ( (N0T )/2)M times the threshold of [4]. under the hypothesis Hi . Optimal structures were developed
for the three possible cases: the same signal bandwidth and respectively given by [14]
distinct symbol period (equal MiTi21, with different Ti), the ( )
same symbol period and distinct bandwidth (different XK
pffiffiffi ðiÞ 2
ðiÞ ðiÞ ju 1
MiTi21, with equal Ti), and both the symbol period and LG ½rðtÞ ¼ max max Refsk rk e g 2 S T jsk j
u skðiÞ
signal bandwidth different (different MiTi21, with different k¼1
Ti) [9]. The decision was made based on the rule given in ð26Þ
(8), with hA ¼ 1.
Quasi-ALRT-based classifier under the assumption of per-

symbol phase-incoherence: On the other hand, with (
YK n h pffiffiffi
vi ¼ [ffkgKk¼1 fs(i) K †
k gk¼1] , by using a power series expansion LðiÞ
H ½rðtÞ ¼ max EsðiÞ exp 2 S N01 RefsðiÞ
k rk e
ju
g
of the modified Bessel function in (14), an approximation u k
k¼1
of the LF based on higher-order correlations (HOCs) was )
given in [9– 12]. For a single symbol interval this is given by io
STN01 jsðiÞ
k j
2
ð27Þ
ðT
ðiÞ
LA ½rðtÞ ’ 1 þ C1 c1;r ð0Þ þ C2 jc1;r ðtÞj2 dt
T These relations can be easily derived according to (6) and
ðT ðT (7), respectively, after substituting (9) [Note 2] into (5).
þ C3 c1;r ðtÞc2;r ðtÞ dt þ C4 jc2;r ðtÞj2 dt The empirical histogram method was used to set the
T T thresholds with the GLRT and HLRT tests [14]. Here we
ðT denote these thresholds as hG,E and hH,E , respectively.
þ C5 c1;r ðtÞc3;r ðtÞ dt þ ð25Þ GLRT displays some implementation advantages over
T
ALRT and HLRT, as it avoids the calculation of exponen-
where cr,1 (t) is the signal autocorrelation, defined as tial functions and does not require the knowledge of noise
8ð power to compute the LF. However, it suffers from the
T t
>
> nested constellation problem discussed earlier. Note that
>
< r ðtÞrðt þ tÞ dt; 0t,T HLRT does not have this problem.
0
cr;1 ðtÞ ¼ ð T jtj Other GLRT- and HLRT-based classifiers investigated in
>
>
>
: r ðt þ jtjÞrðtÞ dt; T t , 0 the literature are as follows. The AMC problem was exam-
0 ined in an intersymbol interference (ISI) environment,
where the signal was considered to be degraded by
and cr,n(t) is the nth-order (n 2) correlation, defined as the AWGN and ISI [19, 20]. The LF was computed using the
autocorrelation of cr,n21(t) ML estimates of the data sequence and channel coefficients
8 ð n2 fgpgPp¼1, with the per-survivor processing technique
2 T t
>
> employed for estimation [20]. Obviously, this is GLRT
>
< cr;n1 ðtÞcr;n1 ðtþ tÞ dt; 0 t ,2n1 T;
2n2 T with vi ¼ [fgpgPp¼1 fs(i) K †
k gk¼1] . The threshold used for
cr;n ðtÞ¼ ð n2
>
> 2 T jtj decision was empirically set. With ui ¼ [u S fs(i) K
k gk¼1]
†
>
: c ðtþjtjÞc ðtÞdt; 2n1 T t ,0 (i) K
and unknown PSD N0 , that is vi ¼ [u S fsk gk¼1 N0] , an †
r;n1 r;n1
2n2 T HLRT classifier was explored in [15], where ML estimates
The coefficients C1 , C2 , C3 , C4 and C5 are given by of S and N0 were used, together with the approximate LF in
gs(2N0T BW )21, gs(16N0T BW )21, gs(288N0T BW )21, (19), obtained by averaging over u and fs(i) K
k gk¼1. In other
gs(9216N0T BW )21 and gs(460 800N0T BW )21, words, the LF is computed based on (7), with
respectively. When the received signal includes K i.i.d. vi2 ¼ [S N0]† and vi1 ¼ [u fs(i) K †
k gk¼1] . Both empirical and
symbols, the LF is simply the product of K LFs, each one theoretical thresholds were used. In the same line, an
given by (25). To increase the accuracy of the LF approxi- HLRT structure was investigated in [16] by considering
mation in (25), one needs to use higher-order correlations. vi ¼ [a fs(i) K †
k gk¼1] , with the signal power given by a S.
2
This results in a better but more complex classifier. The The decision rule given in (8) was used, with the threshold
decision was made by comparing the metric against an set to one. An HLRT-based multi-antenna classifier was
empirical threshold, hHOC,E , where HOC and E denotes developed for BPSK/QPSK in AWGN [17, 18], with
higher-order correlation and empirical, respectively. vi ¼ [q fs(i) K †
k gk¼1] , where q is an unknown phase shift
An FSK signal classifier based also on an approximation between two adjacent antenna elements, which appears
of the LF was proposed in [13] for flat Rayleigh fading due to their spatial separation. The decision rule given in
channels. Usually, in the literature only one incoming (8) was employed, with the threshold set to one.
signal is assumed at the receiver. The case of multiple An HLRT-based classifier was developed for linear
FSK signals at the receiver was discussed in [13]. Using modulations identification in flat block fading channels in
(17), the previously discussed ALRT-based and [21], with vi ¼ [a w fs(i) K †
k gk¼1] . Similar to (27), the LF was
quasi-ALRT synchronous algorithms were transformed computed by averaging over the data symbols and using
into asynchronous classifiers, where the timing offset 1 the ML estimates of the channel phase w and amplitude
was assumed uniformly distributed over [0, 1) [10, 12, 13]. a. In [21] the threshold hH used for decision was set to
one. HLRT has the advantage over ALRT that no prior
PDFs of the channel parameters are needed, and therefore,
3.2 GLRT- and HLRT-based algorithms it is applicable to different environments, e.g. Rician and
Rayleigh fading [21].
As the ALRT algorithm suffers from high computational
complexity in most practical cases, GLRT and HLRT algor- Quasi HLRT classifier: HLRT does not seem to be a good
ithms have been investigated as possible solutions to ident- solution with an increased number of unknown parameters,
ify linear modulations [14 – 22]. In AWGN and with as finding their ML estimates can be very time consuming.
vi ¼ [u fs(i) K †
k gk¼1] , the LF for GLRT and HLRT are Quasi-HLRT classifiers, which use low-complexity yet
accurate estimates, can be used instead. Such classifier were 4.1 FB algorithms to distinguish between different
proposed in [22] for QAM signal identification in block classes
fading channels. The estimators used for the channel ampli-
tude and phase are given, respectively, by 4.1.1 Instantaneous amplitude, phase and frequency-
based algorithms: The most intuitive way to identify the
^ 2r;2;1 ð0Þ m
a^ 4i T 4 ¼ ½2m ^ r;4;2 ð03 Þð2m2sðiÞ ;2;1 msðiÞ ;4;2 Þ1 ð28Þ modulation class of the incoming signal is to use the infor-
mation contained in its instantaneous amplitude, phase and
frequency. To extract such information, different methods
and were applied in the literature [27– 40]. The following differ-
! ences between signal classes were employed for classifi-
X
K
cation in [27– 31]:
w^ ¼ 4 1
angle rk4 ð29Þ
k¼1 † FSK signals are characterised by constant instantaneous
amplitude, whereas ASK signals have amplitude fluctu-
Then, using these estimators, the LF was calculated accord- ations, and PSK signals have information in the phase.
ing to [Note 2] The maximum of the discrete Fourier transform (DFT) of
centred (the term ‘centred’ specifies that the average is
Y
K n h removed from the data set) normalised instantaneous ampli-
LðiÞ
quasi-H ½rðtÞ ¼ EsðiÞ exp 2a^ i N01 RefsðiÞ
k rk e
jw^
g tude was used as a feature to distinguish between FSK and
k
k¼1 ASK/ PSK classes
io † ASK and BPSK signals have no information in the absol-
a^ 2i TN01 jsðiÞ
k j
2
ð30Þ ute phase, whereas M-PSK (M . 2) has. The variance of
absolute centred normalised phase was used to distinguish
The decision rule is given in (8), with the threshold set to between M-PSK (M . 2) and real-valued constellation,
one. A multi-antenna quasi-HLRT classifier was also pro- BPSK and ASK
posed in [22], with (28) and (29) used to estimate the † ASK signals have no phase information by their nature,
channel phase and amplitude on each branch, respectively. whereas BPSK has. Variance of direct (not absolute)
The threshold used for decision was set to one. centred normalised phase was used to distinguish between
BPSK and ASK classes. A binary decision tree structure
was employed to discriminate between classes, and further-
4 Feature-based approach to AMC more, within each class, as we will briefly mention later. At
each node of the tree, the decision was made by comparing a
The design of a FB algorithm first needs some features for statistic against a threshold.
data representation and then decision making [100].
Examples of features are the variance of the centred normal- In [32] and [33], the variance of the zero-crossing interval
ised signal amplitude, phase and frequency [27], the var- was used as a feature to distinguish FSK from PSK and the
iance of the zero-crossing interval [32, 33], the variance unmodulated waveform (UW). The zero-crossing interval is
of the magnitude of the signal wavelet transform (WT) a measure of the instantaneous frequency, and it is a stair-
after peak removal [36– 38], the phase PDF [44– 46] and case function for FSK signals, whereas a constant for UW
its statistical moments [47 – 49], moments, cumulants, and and PSK signals. The AMC is treated as a two hypothesis
cyclic cumulants of the signal itself [41 – 43, 53, 54, 58 – testing problem: H1 for FSK and H2 for UW and PSK.
66] etc. The entropy [67, 68], fuzzy logic [69, 70], a The hypotheses are formulated based on the Gaussian
moment matrix technique [71, 72] and a constellation assumption for the estimated feature, that is N(mHi , sH2i),
shape recovery method [73] were also used for AMC. i ¼ 1, 2, with the hypothesis-dependent mean mHi and var-
Different methods were employed for decision making, iance sH2i (the mean is actually the theoretical value of the
such as PDF-based [41 – 53], the Hellinger distance [74, feature under Hi whereas the variance is estimated under
75], the Euclidian distance [60 – 65] and unsupervised clus- each hypothesis.). An LRT is used for decision, which
tering techniques [76, 77]. Table 3 summarises most of the due to the Gaussian assumption is simplified to
FB-AMC work, emphasising the selected features, type of the comparison of the feature with a threshold h, derived
modulations, channel and the unknown parameters. In its from the LRT. For any two class problem, assuming equal
first part, algorithms which employ information extracted priors, the average probability of error is then given by
from the instantaneous amplitude, phase and frequency of
Pe ¼ ½erfcððh mH1 Þ=sH1 Þ þ erfcððmH2 hÞ=sH2 Þ=2 ð31Þ
the received signal are presented. The second and third
parts include classifiers based on the wavelet transform
and signal statistics, respectively. Finally, a classifier where erfc(.) is the complementary
Ð error function defined as
based on spectral properties of FSK signals is mentioned. erfc(x) ¼ (2p)21/2 1 2
x exp(2u /2) du.
In what follows, these FB-AMC algorithms are presented The variance of the instantaneous frequency was also
[Note 1] with a hierarchical approach in mind, that is the employed in [34, 35] to discriminate FSK from UW and
modulation class of the incoming signal is first identified PSK. In fact, the autoregressive spectrum modelling was
(e.g. ASK, PSK, QAM, FSK), and then the modulation used to extract the instantaneous frequency. The decision
order M within the recognised class. A single signal in was made by comparing the feature against a threshold.
AWGN, with the parameters perfectly known, and a rec-
tangular pulse shape uT(t) were assumed, except otherwise 4.1.2 Wavelet transform-based algorithms: The
mentioned. In addition to ASK, PSK, QAM and FSK, utility of the wavelet transform to localise the changes in
the identification of other modulations was examined the instantaneous frequency, amplitude and phase of the
in the literature, for example minimum shift keying received signal was also studied for AMC. The distinct
(MSK) [66, 78 – 80], offset QPSK (OQPSK) [79], behaviour of the Haar WT (HWT) magnitude for PSK,
continuous-phase FSK (CPFSK) [81]. QAM and FSK signals was employed for class
Table 3: Summary of feature-based classifiers
Author(s) Features Modulations Unknown parameters Channel(s)
Azzouz and Nandi [27, 28] maximum power spectral density of 2ASK, 4ASK, BPSK, QPSK, 2FSK, 4FSK – AWGN
normalised centred amplitude, standard
deviations of normalised centred amplitude,
phase and frequency
Soliman and Hsue [32, 33] variance of the zero-crossing interval UW, BPSK, QPSK, 8PSK, BFSK, 4FSK, – AWGN
sequence, phase difference and 8FSK
zero-crossing interval histograms
IET Commun., Vol. 1, No. 2, April 2007

Soliman and Hsue [44–46] PDF of phase UW, BPSK, QPSK, 8PSK – AWGN
Soliman and Hsue [47–49] statistical moments of phase UW, BPSK,QPSK, 8PSK – AWGN
Sapiano et al. [51] DFT of phase PDF UW, BPSK,QPSK, 8PSK – AWGN
Ho et al. [36, 37] variance of HWT magnitude, HWT magnitude BPSK, QPSK, 8PSK, FSK, 4FSK, 8FSK, – AWGN
and peak magnitude histograms CP2FSK, CP4FSK, CP8FSK, MSK
Hong and Ho [38] variance of HWT magnitude and normalised QPSK, 4FSK, 16QAM – AWGN
HWT magnitude
Swami and Sadler [41] normalised fourth-order cumulants of the BPSK, 4ASK, 16QAM, 8PSK, V32, V29, carrier phase u, frequency offset Df AWGN, impulsive noise, cochannel
received signal V29c and timing offset 1 interference
Swami et al. [53] normalised fourth-order cumulants of the BPSK, 4ASK, QPSK, 16QAM, V29, V32, – Frequency selective channel
received signal and the cost function of the 64QAM
alphabet-matched equalisation algorithm
Martret and Boiteau [54] fourth- and second-order moments of the QPSK, 16QAM – AWGN
received signal
Marchand et al. [58, 59] fourth- and second-order cyclic cumulants of QPSK, 16QAM, 64QAM – AWGN
the received signal
Spooner et al. [60–62] sixth-, fourth-, and second-order cyclic MSK, QPSK, BPSK, 8PSK, 8QAM, frequency offset Df, excess bandwidth, AWGN, cochannel interference
cumulants of the received signal QPSK, 16QAM, 64QAM, V29 symbol period T, signal amplitude a
Dobre et al. [63] eighth-order cyclic cumulants of the received BPSK, QPSK, 8PSK, 4ASK, 8ASK, – AWGN
signal 16QAM, 64QAM, 256QAM
Dobre et al. [64] eighth-, sixth-, and fourth-order cyclic 4QAM, 16QAM carrier phase u, phase jitter AWGN, impulsive noise
cumulants of the received signal fwkgKk¼1, frequency offset Df
Dobre et al. [65] eighth-order cyclic cumulants of the signal at 4-ASK, 8-ASK, BPSK,QPSK, 16QAM, – Rayleigh and Ricean fading channels
the output of a selection combiner 32QAM, 64QAM
Yu et al. [88] DFT of the received signal 2FSK, 4FSK, 8FSK, 16FSK, 32FSK – AWGN

These modulations have been used in the simulations in the original papers
147
identification in [36 –38]. For a PSK signal this is a con- identification. In the high-SNR region, M-PSK exibits M
stant, with peaks occurring at phase changes. On the other distinct modes, while when the SNR decreases or M
hand, because of the frequency and amplitude variations increases, the peaks smear off and finally the PDF con-
in FSK and QAM, respectively, the HWT magnitude is a verges to a uniform PDF [48]. Specifically for PSK signal
staircase function with peaks at phase changes. These classification, an approximation using the Tickhonov PDF
peaks do not provide useful information for non-continuous and a Fourier series expansion of the phase PDF were
phase FSK signals. If only the phase is retained for a QAM employed in [44– 46], with a log-likelihood ratio test for
signal, it behaves like a PSK signal and thus, the HWT mag- decision. By using these methods to compute the phase
nitude is constant. On the other hand, as PSK and FSK PDF, closed-form expressions for the phase statistical
signals are of constant amplitude, amplitude normalisation moments were derived, and the PDFs of the sample esti-
has no effect on their HWT magnitude. After peak mates of the moments were used for decision making
removal, the variance of the HWT magnitude with ampli- [47 – 50]. The distribution of the sample estimate of the
tude normalisation was used to discriminate FSK from nth-order moment was assumed to be Gaussian,
PSK and QAM. Furthermore, the variance of the HWT N(mn,Hi ,sn 2,Hi), where the mean mn,Hi and variance s2n,Hi
magnitude without amplitude normalisation was employed depend on the hypothesis Hi and n. The decision criterion
to distinguish between QAM and PSK. The decisions was further reduced to comparing the sample estimates of
were made by comparing the features against some the phase moments with a threshold. The histogram of the
thresholds, chosen based on the statistical analysis of the phase difference between two adjacent symbols was used
features, to minimise the probability of error for PSK in [32, 33, 39] for PSK order identification, with the
signals [36 – 38]. decision made based on the comparison of the histogram
against particular patterns. The periodic components of
4.1.3 Signal statistics-based algorithms: To discrimi- the phase PDF were analysed for PSK order identification
nate among BPSK, ASK, M-PSK (M . 2) and QAM, the in [51], using the DFT of the phase histogram. In other
cumulant-based feature cr,4,0(03)/c2r,2,1(0) was proposed words, the empirical characteristic function of the phase
in [41], where cr,n,q(0n21) [Note 6] is the nth-order/ was exploited for classification in this work. Furthermore,
q-conjugate cumulant of the output of the matched filter in [52] the algorithm was extended to QAM signal classifi-
frkgKk¼1, at the zero delay vector. For decision, an LRT cation, by exploiting the additional information provided by
based on the PDF of the sample estimate of the feature the magnitude of the received signal. Other features
was formulated to achieve minimum probability of error. extracted from the instantaneous amplitude and phase
The moment-based feature mr,6,3(05)/m3r,2,1(0) was used in were investigated for PSK and QAM identification in [40,
[42], where mr,n,q(0n21) is the nth-order/q-conjugate 78, 83, 84], such as the kurtosis of the amplitude.
moment of the output of the matched filter frkgKk¼1, at the
zero delay vector. The goal was to distinguish between 4.2.2 Wavelet transform-based algorithm: Different
PSK and QAM. A joint power estimation and classification PSK signals give rise to different sets of peak values in
was performed in [42]. The decision was made based on the the magnitude of the Haar wavelet transform. The histo-
minimum absolute value of the difference between the gram of the peak magnitudes was employed to identify
sample estimate and prescribed values of the feature. the order of a PSK signal in [37], with the decision made
Reference [43] combined several normalised moments by comparing the histogram with the theoretical PDFs
and cumulants for training a neural network to identify corresponding to different orders.
FSK, PSK and QAM in multipath environments.
4.2.3 Signal statistics-based algorithms: Cumulant-
4.1.4 Miscellaneous algorithms: Neural networks based features were proposed in [41] to identify the order
(NNs) were also used for classification in [27, 29 – 31, 35]. of ASK, PSK, and QAM modulations, as follows: the
The Wigner – Ville distribution was used in [82] to dis- normalised cumulant of fourth-order/two-conjugate,
tinguish between PSK and FSK signals. cr,4,2(03)/c2r,2,1(0), for ASK, the magnitude of the normalised
cumulant of fourth-order/zero-conjugate, jcr,4,0(03)j/
4.2 FB algorithms for linearly modulated signals jc2r,2,1(0)j, for PSK (M . 2), and the normalised cumulant
of fourth-order/zero-conjugate, cr,4,0(03)/c2r,2,1(0), for
Classifiers summarised in Table 3, which can be applied to QAM. The theoretical values of the nth-order/q-conjugate
identify the modulation order M of linear modulations, are cumulant cs (i ),n,q , q ¼ 0, . . . , n/2, n even, for several linear
discussed in the sequel. modulations are given in Table 4. These values were com-
puted using the moment to cumulant formula [Note 6] in
4.2.1 Instantaneous amplitude and phase-based which the nth-order moments were calculated as ensemble
algorithms: Information extracted from the instantaneous averages over the noise-free unit-variance constellations
amplitude and phase of the received signal was exploited for with equiprobable symbols. Note that owing to the symmetry
linear modulation recognition, as follows. The variance of of the signal constellations considered, the nth-order
the absolute value of the normalised centred instantaneous moments for n odd are zero and hence, using the moment
amplitude was used to distinguish between 2-ASK and to cumulant formula, it is easy to show that the nth-order
4-ASK, as for the former the amplitude changes between cumulants for n odd are also zero. However, for n even we
two levels, equal in magnitude and opposite in sign, so, it have cs (i ),n,q ¼ cs (i ),n,n2q . An LRT was formulated based on
has no information in the absolute amplitude, whereas it the PDFs of the sample estimates of features, which are
has for the latter [27 – 31]. The statistic was compared Gaussian, that is N(mHi , sHi2). With a simplifying approxi-
against a threshold for decision making at a tree node, as mation, that is equal variances under all the hypotheses,
part of the binary decision tree classifier mentioned in the decision was further reduced to comparing the sample
Subsection 4.1.1. The phase PDF and its statistical estimate of the chosen feature v̂ against a threshold, with
moments were investigated for PSK signal recognition in v as any of the cumulant-based features previously men-
[44– 50]. The phase PDF is multimodal, and the number tioned. For an Nmod hypothesis testing problem, with the
of modes provides information for the PSK order hypotheses ordered such that mH1 , mH2 , , mHNmod ,
Table 4: Theoretical cumulants for several unit 2 variance signal constellations with equiprobable symbols
BPSK QPSK 8-PSK 16-PSK V.29 16-QAM 64-QAM
cs (i ),2,0 1 0 0 0 0 0 0
cs (i ),2,1 1 1 1 1 1 1 1
cs (i ),4,0 22 1 0 0 0.5158 20.68 20.619
cs (i ),4,1 22 0 0 0 0 0 0
cs (i ),4,2 22 21 21 21 1.4148 20.68 20.619
cs (i ),6,0 16 0 0 0 0 0 0
cs (i ),6,1 16 24 0 0 1.0683 2.08 1.7972
cs (i ),6,2 16 0 0 0 0 0 0
cs (i ),6,3 16 4 4 4 2.2551 2.08 1.7972
cs (i ),8,0 2272 234 1 0 3.7797 213.9808 211.5022
cs (i ),8,1 2272 0 0 0 0 0 0
cs (i ),8,2 2272 34 0 0 2.1993 213.9808 211.5022
cs (i ),8,3 2272 0 0 0 0 0 0
cs (i ),8,4 2272 234 233 233 3.7797 213.9808 211.5022
the decision rule is to choose Hi if received signal, respectively, were used in [55] to distinguish
between BPSK and QPSK. In [56, 57] the same property was
ðmHi1 þ mHi Þ=2 , v^ , ðmHi þ mHiþ1 Þ=2 ð32Þ explored for a baseband signal. By increasing the order of the
nonlinear signal transformation beyond fourth powers, this
where mH0 ¼ 21 and mHNmodþ1 ¼ 1. argument can be extended to identify modulations of order
Note that the cumulant-based features cr,4,2(03)/c2r,2,1(0) higher than QPSK. Note that the quasi-optimal algorithm
and jcr,4,0(03)j/jc2r,2,1(0)j do not depend on a fixed carrier derived within the LB framework for PSK signal classifi-
phase u, as for q ¼ n/2 the exponential factors which cation also exploits such a property, by using the information
depend on u cancel each other, whereas for q = n/2 the extracted in time domain [3–5]. However, the signal cyclo-
phase dependency is dropped by taking the magnitude. stationarity is not exploited in this work, as the sampling is
This work was extended in [53] to classify linear modu- performed at the symbol rate T 21.
lations in frequency-selective channels. The blind alphabet- Cyclic-cumulant (CC) based features of different orders
matched equalisation algorithm [101], which was used for were investigated for modulation classification in [58–65].
equalisation, was also employed for classification. Some A feature based on fourth-order/two-conjugate and
other cumulant-based features were added [30] to the set second-order/one-conjugate CCs at the CF equal to the
of features extracted from the instantaneous amplitude, symbol rate, similar to the one that used moments [54],
phase and frequency [27– 29], to include QAM signals in was proposed in [58] and [59], to identify the order of
the set of candidate modulations to be recognised. QAM modulations. The same decision criterion as in [54]
Signal moments were applied to distinguish between was employed. In [61] a generic algorithm was proposed to
QPSK and 16-QAM in [54]. Specifically, a linear exploit signal cyclostationarity for classification. A feature
combination of the fourth-order/two-conjugate moment vector was proposed, whose components were the magni-
and the squared second-order/one-conjugate moment tudes of the CCs up to the nth-order, raised to the power of
were employed, with the coefficients and the delay vector 2/n, when n goes to infinity, and computed at all possible
optimised to maximise the probability of correct classifi- CFs and delay vectors. Apparently, such a classifier is hard
cation. A set of features was chosen for certain values of to implement. Note that raising the nth-order CC magnitudes
the delay vector, and classification was made based on the to the power of 2/n forces the features to take values within
correlation between the sample estimate and theoretical the same order of magnitude. Therefore, the classical
feature vectors. The signal-moment feature mr,6,3(05)/ Euclidian distance can be used for decision.
m3r,2,1(0) was employed to identify the order of QAM For linear modulations, the nth order/q-conjugate CC of
signals in [42], with the decision made based on the r(t), where r(t) is given in (1), with fk ¼ 0, k ¼ 1, . . . , K,
minimum absolute value of the difference between the g(t) as a raised cosine pulse shape, and n(t) as the
sample estimate and prescribed values of the feature. AWGN, and the set of CFs are given by [60, 63]
Signal cyclostationarity was also exploited for linear cðiÞ n 1 j2pb1T jðn2qÞu
r;n;q ðg; tn1 Þ ¼ a csðiÞ ;n;q T e e
modulation identification [55–64], via two approaches: spec-
tral line generation when passing the signal through different Pn1 ð1 Y
n1
nonlinearities [55–57], and periodic fluctuations with time of e j2pDf u¼1 ðÞu tu g ðÞu ðtÞ
cumulants up to the nth-order [58–65]. We note that the 1 u¼1
nth-order cycle frequencies (CFs) are given by ðÞn j2ptb
g ðt þ tu Þe dt; q ¼ 0; . . . ; n;
(n 2 2q)Df þ m/T, with m an integer [60, 63]. The nth-order
CF formula also holds for an IF signal, where Df is replaced ð33Þ
by the IF frequency, fIF . With this property, the cyclostatio-
narity of the received signal was exploited for AMC kn;q ¼ fg : g ¼ b þ ðn 2qÞDf ; b ¼ k=T ;
through a pattern of sine-wave frequencies in signal poly-
nomial transformations. For example, the 2fIF and 4fIF sinu- k integer; cðiÞ
r;n;q ðg; tn1 Þ = 0g ð34Þ
soids that appear in the second and fourth powers of the
where g is a CF, ( )u represents a possible conjugation of the a smaller number of symbols is needed to attain a specific
uth term, u ¼ 1, . . . , n, such that the total number of performance. In [63 – 65] the minimum Euclidian distance
conjugations is q, and (2)u is the minus sign associated between the sample estimate and prescribed feature-vectors
with the possible conjugation ( )u , u ¼ 1, . . . , n 2 1. was also used for decision making. A raised-cosine pulse
Since n(t) is a stationary, zero-mean Gaussian process, its shape was considered in [60– 65].
cumulants are time independent and non-zero only for the 4.2.4 Miscellaneous algorithms: Finally, the Radon
second order. Therefore, AWGN does not have any contri- transform was investigated for QAM classification in
bution to the higher-order (n 3) CCs of r(t). One can [85 – 87].
easily notice that by taking the magnitude of the nth order
CC, a feature robust to the carrier phase and timing offset 4.3 FB algorithms for FSK signals
is obtained.
In [60 – 61, 62], the magnitudes of the CCs up to the Similar to using the information contained in the instan-
fourth- and sixth-order, at a CF equals to taneous phase to identify the order of the PSK modulation,
g ¼ (n 2 2q)Df þ 1/T and a delay vector for which a the information extracted from the instantaneous frequency
maximum is reached (i.e. tn21 ¼ 0n21 [63]), were investi- is exploited to recognise the order of the FSK modulation.
gated as features, respectively. Based on these features, a In [27– 31], the variance of the absolute value of the nor-
feature vector was proposed in [62], as malised centred instantaneous frequency was used to dis-
tinguish between 2-FSK and 4-FSK. The feature was
F ðiÞ ¼ ½jcðiÞ ðiÞ ðiÞ
r;2;0 ðg; 0Þj jcr;2;2 ðg; 0Þkcr;4;0 ðg; 03 Þj compared against a threshold for decision, at a tree node,
as part of the binary decision tree classifier mentioned in
j cðiÞ ðiÞ ðiÞ y
r;4;4 ðg; 03 Þkcr;6;0 ðg; 05 Þj jcr;6;6 ðg; 05 Þj ; Subsection 4.1.1. In [34], the instantaneous frequency
i ¼ 1; . . . ; Nmod ð35Þ derivative was used to distinguish between 2-FSK and
4-FSK, under the assumption of the same bandwidths of
The CC-based features are estimated from Kr samples, the signals. The height of the peaks which occur in the dif-
taken over the observed K symbol interval [102]. Note ferentiated instantaneous frequency is proportional to the
that the received signal is oversampled in order to exploit frequency deviation, and thus, for 4-FSK this is expected
signal cyclostationarity. The sampling frequency is equal to be two times lower than for 2-FSK. If the peak average
to r/T, with r a positive integer, called the oversampl- falls below a certain threshold, 4-FSK is chosen, otherwise
ing factor. The decision is made by comparing the 2-FSK. As for PSK order identification, the number of
sample estimate with prescribed feature vectors from a modes in the instantaneous frequency histogram was
look-up table employed to determine the order of the FSK modulation
in [32, 33]. The number of modes in the histogram of the
^
^ı ¼ arg min dðF ðiÞ ; FÞ ð36Þ Haar wavelet transform magnitude was investigated for
1iNmod the FSK order identification in [36, 37]. If M/2þ1 to M
modes appear in the histogram, the input is identified as
where ^ı represents the decision on the modulation type of M-FSK. Finally, spectral properties of FSK signals were
the intercepted signal, F(i) is the theoretical feature-vector explored for classification in [88].
which corresponds to the ith modulation in the look-up
table, F^ is the estimated feature vector, and d(., .) is the 5 Numerical results
Euclidean distance. Joint detection and classification was
investigated in [62], with single and multiple incoming After scanning the literature to answer the question ‘which
signals present at the receiver, respectively. Multiple AMC technique is the best in terms of performance under
signals, which overlap in time and frequency but have dis- realistic conditions?’ it turned out that performance compari-
tinct symbol rates and hence, different cycle frequencies, son of published classifiers is not straightforward. There are a
can be distinguished using CCs (selectivity property of number of reasons for this. First, performance of different
CCs) [103]. Moreover, it is claimed that all the parameters classifiers cannot be compared, unless the candidate modu-
necessary for identification, i.e. symbol period, carrier lations are the same. Second, most of the classifiers are
frequency offset, excess bandwidth and signal amplitude designed to handle specific unknown parameters. So, one
are estimated. However, no details are given about the cannot really compare their performance, unless the uncer-
estimation methods. Eight-order CC-based features, that is tainties the classifiers take into account are the same.
the magnitude of the eighth-order (q ¼ 0, . . . , 8) CCs at Nevertheless, to obtain some insight, we make some com-
the CF g ¼ 1/T þ (n 2 2q)Df and zero delay vector were parisons in Subsections 5.1–5.3. We rely on some numerical
investigated in [63] for classifying real- and complex- results reported in the literature. However, we have also
valued constellations, respectively. The nth-order (n ¼ 4, simulated some of the algorithms under the same conditions,
6, 8) CC-based features were shown to be robust to a in Subsections 5.1–5.3 to make the comparison possible.
carrier frequency offset and phase jitter for q ¼ n/2 and
used for QAM classification in [64]. Features similar to 5.1. Comparative study of classifiers for PSK
those proposed in [63] were investigated in [65] for classify- signals
ing linear modulations in block fading channels. In this
case, the features were extracted from the signal at the Performance achieved with several algorithms for classify-
output of a selection combiner. Such a CC-based multi- ing PSK signals in AWGN is presented in Table 5. We
antenna classifier takes advantage of the robustness of the consider here the ideal scenario, that is no unknown par-
CC-based features to phase, timing errors and stationary ameters, as well as the scenarios with unknown carrier
noise, as well as of the ability of the selection combiner phase, and unknown carrier phase/timing offset, respect-
to mitigate the impact of fading via spatial diversity. ively. We have examined the ALRT, quasi-ALRT, HLRT,
Furthermore, the classifier is robust to the variations of and cumulant-based algorithms. For illustration, BPSK
the Ricean factor and a possible correlation among the and QPSK are considered as candidate modulations. Of
antennas. By increasing the number of antenna elements, course, when higher order modulations are included in the
Table 5: Performance and details of several classifiers for BPSK against QPSK
Classifier Unknown parameter Model mismatch SNR (dB) Pcc K
1 ALRT, L ¼ 1, (10) and (8), with – – 23 0.975 100

hA ¼ 1
2 ALRT, L ¼ 2, (12) [Note 3] and (8), – – 26 0.975 100
with hA ¼ 1
3 ALRT, L ¼ 1, (10) and (8), with – carrier phase u, U[2 p, p) 23 0.63 100
hA ¼ 1
4 quasi-ALRT, (20) and (22), with carrier phase u – 22 0.96 100
M¼2
5 HLRT, (27)† and (8), with the carrier phase u – 22 0.968 100
threshold set to 1
6 Cumulant-based, (32) with carrier phase u – 4 0.96 100
v̂ ¼ jc^ r,4,0(03)/c^ 2r,2,1(0)j, Nmod ¼ 2,
mH1 ¼ 1, and mH2 ¼ 2
7 quasi-ALRT, (20) and (22), with carrier phase u timing offset 1 ¼ 0.15 11 0.96 100
M¼2
8 Cumulant-based (see sixth row in carrier phase u timing offset 1 ¼ 0.15 6 0.96 100
this table)

These are not included when designing the classifier, and their impacts on the performance were studied to evaluate the classifier
robustness.

These numerical results are not given in the papers where the classifiers are proposed, and we have simulated them to perform a
comparison with other methods.
†
The maximisation procedure with respect to the carrier phase u was carried out by discrete search with a 1 degree resolution. Owing to
the symmetry of the investigated constellations, the maximisation procedure can be performed over [0, p/2], instead of [0, 2p] [14, 21, 22].
modulation pool, higher SNRs and/or a larger number of As expected, the ALRT-based classifier designed for an
symbols are needed to achieve the same performance ideal scenario fails under an unknown carrier phase offset.
[2, 5, 21]. We have simulated these modulations to draw Investigated solutions are the HLRT, quasi-ALRT and
some basic yet insightful conclusions, which shed some cumulant-based classifiers. A slightly better performance
light on major techniques. is achieved with HLRT, when compared with the
Subsequently, the probability of correct classification quasi-ALRT. Also, the quasi-ALRT provides a better
Pcc is used to evaluate the classification performance, with performance than the cumulant-based classifier. However,
P(iji)
c estimated based on 1000 Monte Carlo trials. Unless a synchronisation error dramatically degrades the
otherwise mentioned, the pulse shape is rectangular, the performance of the former, whereas it has a smaller effect
received SNR per symbol is defined as gs ¼ ST/N0 , the on the latter. Though, the quasi-ALRT can be modified as
symbol period and signal power are set to T ¼ 1 and in (24) to further account for the unknown timing.
S ¼ 1, respectively and the number of symbols is K ¼ 100. Performance enhancement can be achieved by using
Under ideal conditions, the ALRT-based classifier in (10) multiple receive antennas, even with L ¼ 2. More examples
and (8), with hA ¼ 1, provides a Pcc of 0.975 at 23 dB SNR of modulation recognition with multiple antennas can be
(5-1) [Note 8]. A 3 dB SNR improvement is achieved with found in [22, 65].
an extra antenna, i.e. L ¼ 2, and maximal ratio combining at
the receiver, presented in (12) [Note 3] and (8), with
5.2 Comparative study of classifiers for QAM
hA ¼ 1(5-2). With a carrier phase u uniformly distributed
signals
over [2p, p), the performance of the ALRT-based classifier
designed for no unknown parameters, shown in (10) and (8) The performance of several algorithms is given in Table 6,
with hA ¼ 1, drops to 0.63 at 23 dB (5-3). With the when discriminating between 16-QAM and V.29 signals in
quasi-ALRT classifier in (20) and (22), with M ¼ 2, a Pcc AWGN. We consider here the ideal scenario, as well as
of 0.96 at 22 dB SNR is achieved (5-4), which is close to the case with the unknown carrier phase u. We have investi-
that under ideal condition (5-1), (5-4). As noticed from gated the ALRT, quasi-ALRT, HLRT, quasi-HLRT,
(5-5), a slightly better performance is achieved with the cumulant- and cyclic cumulant-based algorithms. Under
HLRT designed for an unknown carrier phase u, given in ideal conditions, ALRT in (10) and (8), with hA ¼ 1, recog-
(27) and (8), with the threshold set to 1. Using the cumulant- nises 16-QAM and V.29 with a Pcc of 0.99 at 7 dB SNR
based classifier in (32) with v̂ ¼ j^cr,4,0(03)/^c2r,2,1(0)j (for the (6-1). With an unknown carrier phase u, uniformly distribu-
cumulant estimator formula one can see [41], (5) and (6)), ted over [2p, p), the ALRT classifier designed for ideal
Nmod ¼ 2, mH1¼1 and mH2 ¼ 2, 4 dB SNR is required to conditions fails (6-2), as expected. For a detailed sensitivity
attain a Pcc of 0.96 (5-6). However, a timing error degrades analysis of the ALRT-based classifier to model mismatches
the performance of the quasi-ALRT classifiers dramatically, one can see [93]. A 2 dB SNR degradation occurs
that is a 13 dB SNR loss (5-4), (5-7), whereas only 2 dB loss when using the cumulant-based algorithm in (32), with
is observed for the cumulant-based classifier (5-6), (5-8). v̂ ¼ c^ r,4,0(03)/^c2r,2,1(0), Nmod ¼ 2, mH1 ¼ 20.68, and mH2 ¼
0.5185, under ideal conditions (6-3). This also fails when
Note 8: When necessary, we refer to the ith row in Table 5, Table 6, Table 7 as the unknown carrier phase u appears (6-4). Alternatively,
(5-i), (6-i), and (7-i), respectively. even the quasi-ALRT which is designed for an unknown
Table 6: Performance and details of several classifiers for 16QAM against V.29
Classifier Unknown parameter Model SNR (dB) Pcc K

mismatch

1 ALRT, (10) and (8), with hA ¼ 1 – – 7 0.99 100
2 ALRT, (10) and (8), with hA ¼ 1 – carrier phase u, 7 0.50 100
U[2p, p)
3 cumulant-based (32) with v̂ ¼ c^ r,4,0(03)/ – – 9 0.99 100
c^ 2r,2,1(0), Nmod ¼ 2, mH1 ¼ 20.68,
mH2 ¼ 0.5185

4 cumulant-based, (see third row of this table) – carrier phase u, 9 0.50 100
U[2p, p)
5 quasi-ALRT, (23) with A ¼ 0.0135 and carrier phase u – 30 0.88 100
B ¼ 20.0246, and hQA,E þ
6 HLRT, (31)† and (8), with hþ
H,E carrier phase u – 9 0.99 100
7
quasi-HLRT (30)þþ , (29) and threshold set to carrier phase u – 19 0.99 100
one
11 0.91 6000
8 CC-based, (35) and (36) carrier phase u and timing – 9 0.99 6000
error 1

robustness.

These numerical results are not given in the papers where the classifiers are proposed, and we have simulated them to perform a
comparison with other methods.
†
The maximisation procedure w.r.t. the carrier phase u was carried out by discrete search with a 1 degree resolution. Owing to the
symmetry of the investigated constellations, the maximisation procedure can be performed over [0, p/2], instead of [0, 2p] [14, 21, 22].
þ
The values of these thresholds were not specified in the papers.
þþ
The LF in (30) simplified for the AWGN channel, considered in this simulation, with the carrier phase as the only unknown parameter,
and ai replaced by S, which is assumed perfectly known.
carrier phase scenario (see (23), with A ¼ 0.0135, with enough symbols, the CC-based classifier provides
B ¼ 20.0246, and the threshold hQA,E [Note 9], provides a superior performance when compared to the quasi-HLRT,
Pcc of 0.88 at 30 dB SNR (6-5). An acceptable performance taking also advantage of the CC properties.
is attained with the HLRT classifier in (31) and (8), with hH,E
[Note 9], designed for unknown carrier phase: a Pcc of 0.99 at
5.3 Comparative study of classifiers
9 dB SNR (6-6). By using the low complexity quasi-HLRT
for FSK signals
classifier, with the LF given in (30) [Note 10], the carrier
phase as in (29), and threshold set to 1, 19 dB SNR is Here we mainly take the numerical results from [9].
requested to attain the same performance (6-7). By increas- Performance of the optimal ALRT and quasi-ALRT algor-
ing the number of symbols to 6000, 11 dB SNR is needed ithms is given in Table 7, to distinguish between 32-FSK
to obtain a Pcc of 0.91 (6-7). Alternatively, with 6000 and 64-FSK in AWGN, under the assumption of per-symbol
symbols, the CC-based classifier in (35) and (36) provides phase incoherence and identical bandwidths of the incoming
a Pcc of 0.99 at only 9 dB (6-8). Moreover, this classifier is signals. The symbol period of 64-FSK is two times larger
robust to the timing offset, is applicable to a larger pool of than that of 32-FSK, under the assumption of the equal band-
modulations, including QAM, PSK and ASK, and benefit widths. With the same observation interval, K ¼ 10 and 5
from the selectivity property of CCs. Note that with the symbols are considered for 32-FSK and 64-FSK, respect-
CC-based classifier, the pulse shape is raised cosine, with a ively, to have a fair comparison. The optimal ALRT in
roll-off factor of 0.35 and the SNR is defined at the output (14) and (8), with hA ¼ 1, provides a Pcc of 0.975 at 6 dB
of the root raised cosine receive filter. With multiple SNR (5-1). The quasi-ALRT classifier based on the first-
receive antennas one can enhance the performance, as order correlation, that is (25) with the first three terms and
shown for quasi-HLRT [22] and CC-based classifiers [65]. the threshold hHOC,E [Note 9] for decision making, provides
We can conclude that the ALRT- and cumulant-based a Pcc of 0.74 at the same SNR (5-2). The quasi-ALRT
classifiers, designed for an ideal scenario, fail when the classifier that employs the first- and second-order
unknown carrier phase offset is present. Investigated sol- correlations, that is the first five terms in (25), provides a
utions are the quasi-ALRT, HLRT, quasi-HLRT, and Pcc of 0.90 (5-3). The ALRT-based classifier essentially
CC-based classifiers. With the quasi-ALRT classifier, breaks down for a frequency drift of half of the frequency
QAM signals are not identified accurately enough. As deviation, that is Df ¼ fd/2 (5-4), whereas the quasi-ALRT
expected, the HLRT classifier provides a better classifi- classifier is insensitive to such a model mismatch [9].
cation performance compared with the quasi-HLRT. By The performance of the quasi-ALRT classifier
increasing the number of processed symbols, the perform- approaches to the optimal ALRT, by increasing the
ance of the quasi-HLRT classifier improves. However,
Note 10: The LF in (30) is simplified for the AWGN channel, considered in this
simulation, with the carrier phase as the only unknown parameter, and âi
Note 9: The values of these thresholds were not specified in the papers. replaced by S, which is assumed perfectly known.

Table 7: Performance and details of several classifiers for 32FSK against 64FSK
Algorithm Unknown parameter Model mismatch SNR (dB) Pcc K
1 ALRT, (14) with jS(i)

k j ¼ 1, (8), with phase jitter fwkgKk¼1 – 6 0.975 10
hA ¼ 1
2 quasi-ALRT (first-order correlation), phase jitter fwkgKk¼1 – 6 0.74 10
(25) with the first three terms,
calculated based on K(K/2) symbols
for 32FSK (64FSK) and hþ
HOC,E as the
threshold
3 quasi-ALRT (second-order correlation), phase jitter fwkgKk¼1 – 6 0.90 10
(25) with the first five terms,
expressed for K(K/2) symbols for
32FSK (64FSK) and hþ
HOC,E as the
threshold
4 ALRT (see the first row of this table) phase jitter fwkgKk¼1 Frequency offset (Df ¼ fd/2) 6 0.57 10

robustness. þThe values of these thresholds were not specified in the papers
correlation order. The quasi-ALRT classifier has the advan- development of classification methods which rely less on
tage of robustness to the carrier frequency offset. preprocessing is another topic for further investigation.
New classification problems have raised as a result of emer-
ging wireless technologies, such as, single carrier against
6 Conclusion multicarrier modulation recognition, classification of
signals received from single and multiple transmit antennas,
Based on a comprehensive literature survey, this paper has identification of space-time modulation format, etc. These
summarised the two main approaches to automatic modu- issues mean that AMC in real-world environments con-
lation classification (AMC), that is the likelihood-based tinues to be a dynamic research field.
(LB) and the feature-based (FB) methods, and has lightened
their advantages and drawbacks. Although the LB approach
7 References
provides optimal performance, it is difficult to obtain an
exact analytical solution for the decision function, when 1 Sills, J.A.: ‘Maximum-likelihood modulation classification for PSK/
the number of unknown parameters increases. When a QAM’. Proc. IEEE MILCOM, 1999, pp. 57– 61
closed-form solution exists, the computational complexity 2 Wei, W., and Mendel, J.M.: ‘Maximum-likelihood classification for
can make the classifier impractical. By using a low SNR digital amplitude-phase modulations’, IEEE Trans. Commun., 2000,
48, pp. 189– 193
approximation of the likelihood function (LF), the so-called 3 Kim, K., and Polydoros, A.: ‘Digital modulation classification: the
quasi-ALRT algorithms were proposed in the literature. BPSK versus QPSK case’. Proc. IEEE MILCOM, 1988,
These algorithms provide near optimal performance for pp. 431–436
identifying PSK and FSK signals. Nevertheless, they are 4 Polydoros, A., and Kim, K.: ‘On the detection and classification of
quadrature digital modulations in broad-band noise’, IEEE Trans.
not accurate for recognising QAM signals. Using ML esti- Commun., 1990, 38, pp. 1199–1211
mates of the unknown quantities, GLRT and HLRT tech- 5 Huang, C.Y., and Polydoros, A.: ‘Likelihood methods for MPSK
niques were investigated as two alternatives. Although modulation classification’, IEEE Trans. Commun., 1995, 43,
GLRT has some advantages, it fails in identifying nested pp. 1493–1504
constellations. However, HLRT does not have this 6 Sapiano, P.C., and Martin, J.D.: ‘Maximum likelihood PSK
classifier’. Proc. ICASSP, 1996, pp. 1010–1014
problem. However, with several unknown parameters, 7 Long, C., Chugg, K., and Polydoros, A.: ‘Further results in likelihood
finding the ML estimates can be very time consuming. classification of QAM signals’. Proc. IEEE MILCOM, 1994,
The complexity is reduced in quasi-HLRT classifiers, pp. 57– 61
which rely on low-complexity yet accurate estimators. 8 Hong, L., and Ho, K.C.: ‘Classification of BPSK and QPSK signals
Obviously, there is a trade-off between the complexity with unknown signal level using the Bayes technique’. Proc. IEEE
ISCAS, 2003, IV.1–IV.4
and performance, which depends on the estimation 9 Beidas, B.F., and Weber, C.L.: ‘Higher-order correlation-based
method. In the FB approach, some signal features are approach to modulation classification of digitally
employed to identify the modulation format. Although sub- frequency-modulated signals’, IEEE J. Sel. Areas Commun., 1995,
optimal, FB algorithms can be simpler to implement. Many 13, pp. 89–101
10 Beidas, B.F., and Weber, C.L.: ‘Asynchronous classification of
AMC algorithms which use the instantaneous amplitude, MFSK signals using the higher order correlation domain’, IEEE
phase, frequency, signal wavelet transform, and signal stat- Trans. Commun., 1998, 46, pp. 480– 493
istics such as moments, cumulants, and cyclic cumulants, 11 Beidas, B.F., and Weber, C.L.: ‘General framework for the
were compactly presented, in addition to other miscella- high-order correlation domain’. Proc. IEEE MILCOM, 1995,
neous techniques. In a hierarchical classification system, pp. 180–185
12 Beidas, B.F., and Weber, C.L.: ‘Higher-order correlation-based
some of these features can be used to identify the modu- classification of asynchronous MFSK signal’. Proc. IEEE
lation class and then, within each class, the modulation MILCOM, 1996, pp. 1003– 1009
order can be determined. 13 El-Mahdy, A.E., and Namazi, N.M.: ‘Classification of multiple
Accurate preprocessing is required for the effective M-ary frequency-shift keying signals over a Rayleigh fading
channel’, IEEE Trans. Commun., 2002, 50, pp. 967 –974
implementation of most of the known AMC algorithms. 14 Panagiotou, P., Anastasoupoulos, A., and Polydoros, A.: ‘Likelihood
Devising low complexity blind algorithms for joint par- ratio tests for modulation classification’. Proc. IEEE MILCOM,
ameter estimation is a topic of interest in AMC. In addition, 2000, pp. 670–674

15 Chugg, K.M., Long, C.S., and Polydoros, A.: ‘Combined likelihood 43 Hatzichristos, G., and Fargues, M.P.: ‘A hierarchical approach to the
power estimation and multiple hypothesis modulation classification’. classification of digital modulation types in multipath environments’.
Proc. ASILOMAR, 1995, pp. 1137–1141 Proc. ASILOMAR, 2001, pp. 1494– 1498
16 Hong, L., and Ho, K.C.: ‘BPSK and QPSK modulation classification 44 Yang, Y., and Soliman, S.S.: ‘Optimum classifier for M-ary PSK
with unknown signal level’. Proc. IEEE MILCOM, 2000, signals’. Proc. ICC, 1991, pp. 1693– 1697
pp. 976–980 45 Yang, Y., and Liu, C.H.: ‘An asymptotic optimal algorithm for
17 Hong, L., and Ho, K.C.: ‘Modulation classification of BPSK and modulation classification’, IEEE Commun. Lett., 1998, 2, pp. 117–119
QPSK signals using a two element antenna array receiver’. Proc. 46 Yang, Y., and Soliman, S.S.: ‘A suboptimal algorithm for
IEEE MILCOM, 2001, pp. 118– 122 modulation classification’, IEEE Trans. Aerosp. Electron. Syst.,
18 Hong, L., and Ho, K.C.: ‘An antenna array likelihood modulation 1997, 33, pp. 38–45
classifier for BPSK and QPSK signals’. Proc. IEEE MILCOM, 47 Yang, Y., and Soliman, S.S.: ‘Statistical moments based classifier for
2002, pp. 647–651 MPSK signals’. Proc. GLOBECOM, 1991, pp. 72– 76
19 Lay, N., and Polydoros, A.: ‘Per-survivor processing for channel 48 Soliman, S.S., and Hsue, S.Z.: ‘Signal classification using statistical
acquisition, data detection and modulation classification’. Proc. moments’, IEEE Trans. Commun., 1992, 40, pp. 908–916
ASILOMAR, 1995, pp. 170 –174 49 Yang, Y., and Soliman, S.S.: ‘An improved moment-based algorithm
20 Lay, N., and Polydoros, A.: ‘Modulation classification of signals in for signal classification’, Signal Process., 1995, 43, pp. 231– 244
unknown ISI environments’. Proc. IEEE MILCOM, 1995, 50 Lichun, L.: ‘Comments on signal classification using statistical
pp. 170–174 moments’, IEEE Trans. Commun., 2002, 50, p. 195
21 Dobre, O.A., Zarzoso, J., Bar-Ness, Y., and Su, W.: ‘On the 51 Sapiano, P.C., Martin, J., and Holbeche, R.: ‘Classification of PSK
classification of linearly modulated signals in fading channel’. signals using the DFT of phase histogram’. Proc. ICASSP, 1995,
Proc. CISS Conf., Princeton, March 2004 pp. 1868–1871
22 Abdi, A., Dobre, O.A., Choudhry, R., Bar-Ness, Y., and Su, W.: 52 Schreyogg, C., and Reichert, J.: ‘Modulation classification of QAM
‘Modulation classification in fading channels using antenna schemes using the DFT of phase histogram combined with modulus
arrays’. Proc. IEEE MILCOM, 2004 information’. Proc. IEEE MILCOM, 1997, pp. 1372–1376
23 Boiteau, D., and Le Martret, C.: ‘A generalized maximum 53 Swami, A., Barbarossa, S., and Sadler, B.: ‘Blind source separation
likelihood framework for modulation’. Proc. ICASSP, 1998, and signal classification’. Proc. ASILOMAR, 2000, pp. 1187– 1191
pp. 2165– 2168 54 Martret, C., and Boiteau, D.M.: ‘Modulation classification by means
24 Lesage, S., Tourneret, J., and Djuric, P.M.: ‘Classification of digital of different order statistical moments’. Proc. IEEE MILCOM, 1997,
modulations by MCMC sampling’. Proc. ICASSP, 2001, pp. 1387–1391
pp. 2553– 2555 55 DeSimio, M.P., and Prescott, G.E.: ‘Adaptive generation of decision
25 Lin, Y.-C., and Kuo, C.-C.J.: ‘Classification of quadrature amplitude functions for classification of digitally modulated signals’. Proc.
modulated (QAM) signals via sequential probability ratio test NAECON, 1988, pp. 1010–1014
(SPRT)’, Signal Process., 1997, 60, pp. 263– 280 56 Reichert, J.: ‘Automatic classification of communication signals
26 Lin, Y.-C., and Kuo, C.-C.J.: ‘Sequential modulation classification of using higher order statistics’. Proc. ICASSP, 1992, pp. 221– 224
dependent samples’. Proc. ICASSP, 1996, pp. 2690–2693 57 Schreyogg, C., Kittel, C., Kressel, U., and Reichert, J.: ‘Robust
27 Azzouz, E.E., and Nandi, A.K.‘Automatic modulation recognition of classification of modulation types using spectral features applied to
communication signals’ (Kluwer Academic, 1996) HMM’. Proc. IEEE MILCOM, 1997, pp. 1377–1381
28 Azzouz, E.E., and Nandi, A.K.: ‘Procedure for automation 58 Marchand, P., Lacoume, J.L., and Le Martret, C.: ‘Classification of
recognition of analogue and digital modulations’, IEE Proc., linear modulations by a combination of different orders cyclic
Commun., 1996, 143, pp. 259–266 cumulants’. Proc. Workshop on HOS, 1997, pp. 47– 51
29 Nandi, A.K., and Azzouz, E.E.: ‘Modulation recognition 59 Marchand, P., Lacoume, J.L., and Le Martret, C.: ‘Multiple
using artificial neural networks’, Signal Process., 1997, 56, hypothesis classification based on cyclic cumulants of different
pp. 165–175 orders’. Proc. ICASSP, 1998, pp. 2157–2160
30 Wong, M.L.D., and Nandi, A.K.: ‘Automatic digital modulation 60 Spooner, C.M.: ‘Classification of cochannel communication signals
recognition using spectral and statistical features with multi-layer using cyclic cumulants’. Proc. ASILOMAR, 1995, pp. 531–536
perceptrons’. Proc. Int. Symp. Signal Processing and its 61 Spooner, C.M., Brown, W.A., and Yeung, G.K.: ‘Automatic
Applications, Kuala Lumpur, Malaysia, 2001, pp. 390–393 radio-frequency environment analysis’. Proc. ASILOMAR, 2000,
31 Nandi, A.K., and Azzouz, E.E.: ‘Algorithms for automatic recognition pp. 1181–1186
of communication signals’, IEEE Trans. Commun., 1998, 46, 62 Spooner, C.M.: ‘On the utility of sixth-order cyclic cumulants for RF
pp. 431–436 signal classification’. Proc. ASILOMAR, 2001, pp. 890 –897
32 Hsue, S.Z., and Soliman, S.S.: ‘Automatic modulation recognition of 63 Dobre, O.A., Bar-Ness, Y., and Su, W.: ‘Higher-order cyclic
digitally modulated signals’. Proc. IEEE MILCOM, 1989, cumulants for high order modulation classification’. Proc. IEEE
pp. 645–649 MILCOM, 2003, pp. 112–117
33 Hsue, S.Z., and Soliman, S.S.: ‘Automatic modulation classification 64 Dobre, O.A., Bar-Ness, Y., and Su, W.: ‘Robust QAM modulation
using zero crossing’, IEE Proc. F., Radar Signal Process., 1990, 137, classification algorithm based on cyclic cumulants’. Proc. WCNC,
pp. 459–464 2004, pp. 745–748
34 Assaleh, K., Farrell, K.R., and Mammone, R.J.: ‘A new method of 65 Dobre, O.A., Abdi, A., Bar-Ness, Y., Su, W: ‘Selection combining for
modulation classification for digitally modulated signals’. Proc. modulation recognition in fading channels’. Proc. IEEE MILCOM,
IEEE MILCOM, 1992, pp. 712– 716 2005, pp. 2499–2505
35 Farrell, K.R., and Mammone, R.J.: ‘Modulation classification 66 Venalainen, J., Terho, L., and Koivunen, V.: ‘Modulation
using a neural tree network’. Proc. IEEE MILCOM, 1993, classification in fading multipath channel’. Proc. ASILOMAR,
pp. 1028– 1032 2002, pp. 1890–1894
36 Ho, K.C., Prokopiw, W., and Chan, Y.T.: ‘Modulation 67 Stolman, V.J., Paranjpe, S., and Orsak, G.C.: ‘A blind information
identification by the wavelet transform’. Proc. IEEE MILCOM, theoretic approach to automatic signal classification’. Proc. IEEE
1995, pp. 886–890 MILCOM, 1999, pp. 447–451
37 Ho, K.C., Prokopiw, W., and Chan, Y.T.: ‘Modulation identification 68 Paris, B.-P., Orsak, G.C., Chen, H., and Warke, N.: ‘Modulation
of digital signals by the wavelet transform’, IEE Proc., Radar, Sonar classification in unknown dispersive environments’. Proc. ICASSP,
Navig., 2000, 47, pp. 169–176 1997, pp. 3853–3856
38 Hong, L., and Ho, K.C.: ‘Identification of digital modulation types 69 Wei, W., and Mendel, J.M.: ‘A fuzzy logic method for modulation
using the wavelet transform’. Proc. IEEE MILCOM, 1999, classification in nonideal environments’, IEEE Trans. Fuzzy Syst.,
pp. 427–431 1999, 7, pp. 333–344
39 Liedtke, F.F.: ‘Computer simulation of an automatic classification 70 Lopatka, J., and Pedzisz, M.: ‘Automatic modulation classification
procedure for digitally modulated communication signals with using statistical moments and fuzzy classifier’. Proc. IEEE ICSP,
unknown parameters’, Signal Process., 1984, 6, pp. 311– 323 2000, pp. 1500–1505
40 Deng, H., Doroslovacki, M., Mustafa, H., Xu, J., and Koo, S.: 71 Hero, A.O., and Mahram, H.H.: ‘Digital modulation
‘Instantaneous feature based algorithm for HF digital modulation classification using power moment matrices’. Proc. ICASSP, 1998,
classification’. Proc. CISS Conf., Baltimore, March 2003 pp. 3285–3288
41 Swami, A., and Sadler, B.M.: ‘Hierarchical digital modulation 72 Mahram, H.H., and Hero, A.O.: ‘Robust QAM modulation
classification using cumulants’, IEEE Trans. Commun., 2000, 48, classification via moment matrices’. Proc. PIMRC, 2000,
pp. 416–429 pp. 133– 137
42 Dai, W., Wang, Y., and Wang, J.: ‘Joint power and modulation 73 Mobasseri, B.G.: ‘Constellation shape as a robust signature for
classification using second- and higher statistics’. Proc. WCNC, digital modulation recognition’. Proc. IEEE MILCOM, 1999,
2002, pp. 155–158 pp. 442– 446

74 Huo, X., and Donoho, D.L.: ‘A simple and robust modulation 102 Dandawade, V., and Giannakis, G.B.: ‘Asymptotic theory of mixed
classification method via counting’. Proc. ICASSP, 1998, time averages kth- order cyclic-moment cumulant statistics’, IEEE
pp. 3289–3292 Trans. Inf. Theory, 1995, 41, pp. 216– 232
75 Donoho, D.L., and Huo, X.: ‘Large-sample modulation classification 103 Gardner, W.A., and Spooner, C.M.: ‘The cumulant theory of
using Hellinger representation’. Proc. Signal Processing Advances in cyclostationary time-series. Part I: Foundation’, IEEE Trans.
Wireless Communications, 1997, pp. 133–137 Signal Process., 1994, 42, pp. 3387–3408
76 Swami, A., and Sadler, B.: ‘Modulation classification via
hierarchical agglomerative cluster analysis’. Proc. IEEE Signal
Processing Workshop on Signal Processing Advances in Wireless
8 Appendix
Communications, 1997, pp. 141– 144
77 Schreyogg, C.: ‘Identification of voiceband data signal constellations 8.1 Derivation of (10), (13), (14) and (15)
using a divisive cluster algorithm’. Proc. IEEE Digital Signal
Processing Workshop, 1996, pp. 474 –477 First we assume that the parameters are known, except for
78 Taira, S., and Murakami, E.: ‘Automatic classification of analogue the symbols, i.e. vi ¼ ui ¼ [fs(i) K
k gk¼1]. Then, by inserting
modulation signals by statistical parameters’. Proc. IEEE
MILCOM, 1999, pp. 202– 206 (5) into (4) and averaging w.r.t. the unknown symbols, the
79 Louis, C., and Sehier, P.: ‘Automatic modulation recognition with following LF can be written
a hierarchical neural network’. Proc. IEEE MILCOM, 1994, ð KT
pp. 713–717 ðiÞ 1
80 Yoshioka, H., Shirato, Y., Toyoda, I., and Umehira, M.: ‘A fast LA ½rðtÞ ¼EfsðiÞ gK exp 2N0 Re rðtÞs ðt; ui Þ dt
k k¼1
modulation recognition technique using nearest neighbor rules 0
with optimized threshold for modulation classification in Rayleigh ð KT
fading channels’. Proc. IEEE Wireless Personal Multimedia N01 jsðt; ui Þj2 dt
Communications Conf., 2002, pp. 1049–1052 0
81 Chung, C.-D., and Polydoros, A.: ‘Envelope-based classification
schemes for continous-phase binary frequency-shift-keying which upon using the signal model in (9) [Note 2] changes
modulations’. Proc. IEEE MILCOM, 1994, pp. 796 –800
82 Zhijin, Z., and Junna, S.: ‘A new method for modulation types to
recognition based on the time frequency representation’. Proc. ð KT
ICSP, 2002, pp. 208– 211 1
pffiffiffi
83 Dominguez, L.V., Borrallo, J.M.P., and Garcia, J.P.: ‘A general EfsðiÞ gK exp 2N0 Re rðtÞa S
k k¼1
0
approach to the automatic classification of radio communication
signals’, Signal Process., 1991, 22, pp. 239 –250
)
X
K
84 Umebayashi, K., Ishii, S., and Kohno, R.: ‘Blind adaptive estimation ejw sðiÞ
k ðtÞuT ðt ðk 1ÞTÞ dt
of modulation scheme for software defined radio’. Proc. PIMRC,
k¼1
2000, pp. 43–47
85 Wood, S.L., Ready, M.J., and Treichler, J.R.: ‘Constellation ð KT #)
identification using the Radon transform’. Proc. ICASSP, 1988,
pffiffiffi X
K
pp. 1878–1881
N01 ja S ejw sðiÞ 2
k ðtÞuT ðt ðk 1ÞT Þj dt
0 k¼1
86 Wood, S.L., and Treichler, J.R.: ‘Performance of the Radon
transform method for constellation identification’. Proc.
ASILOMAR, 1988, pp. 119–123 and after some manipulations, becomes
87 Wood, S.L., Ready, M.J., and Treichler, J.R.: ‘Modem constellation
identification: A performance comparison of two methods’. Proc. LðiÞ
A ½rðtÞ
ICASSP, 1990, pp. 1651– 1654
88 Yu, Z., Shi, Y.Q., and Su, W.: ‘M-ary frequency shift keying signal Y
K h pffiffiffi n o i
classification based on discrete Fourier transform’. Proc. IEEE ¼ EsðiÞ exp 2a S N01 Re ejw RðiÞ
k a2 STN01 jsðiÞ
k ðtÞj
2
MILCOM, 2003, pp. 1167–1172 k¼1
k
89 Cimini, L.J.: ‘Analysis and simulation of a digital mobile channel
using orthogonal frequency division multiplexing’, IEEE Trans. ð37Þ
Commun., 1985, 33, pp. 665–675
90 van Nee, R., and Prasad, R.: ‘OFDM for wireless multimedia Ð
communications’ (Artech House, 2000) with R(i)

k ¼
kT (i)
(k21)T r(t)sk (t)uT (t 2 (k 2 1)T ) dt.
91 Grimaldi, D., Rapuano, S., and Truglia, G.: ‘An automatic digital In AWGN, with vi ¼ [u fs(i) †
k g] and uniform distribution
modulation classifier for measurements on telecommunication for the carrier phase u over [2 p, p), the same for all
networks’. Proc. IEEE Instrumentation and Measurement K symbols, the LF can be obtained by averaging (37), sim-
Technology, 2002, pp. 957– 962
92 Su, W., and Kosinski, J.: ‘A survey of digital modulation
plified for the AWGN channel [Note 2] over the phase
recognition methods’. Proc. International Signal Processing Conf., (
Dallas, 2003 1
PK ðiÞ 2
ðiÞ
93 Dobre, O.A., Abdi, A., Bar-Ness, Y., and Su, W.: ‘Blind modulation LA ½rðtÞ ¼ EfsðiÞ gK eSTN0 js j
k¼1 k
k k¼1
classification: a concept whose time has come’. Proc. IEEE Sarnoff
Symp. on Advances in Wired and Wireless Communications, ( " #))
Princeton, 18–19 April 2005, pp. 223–228 pffiffiffi X
K
94 Proakis, J.G.: ‘Digital communications’ (McGraw-Hill, 2001, 4th edn.) Eu exp 2 S N01 jRðiÞ
k jcosðu þ uRðiÞ Þ
95 van Trees, H.L.: ‘Detection, estimation and modulation theory—Part k¼1
k
I’ (New York, Wiley, 2001) (

96 Kay, S.M.: ‘Fundamental and statistical signal processing – detection PK
1
jsðiÞ j2
theory’ (PTR Prentice-Hall, Englewood Cliffs, NJ, 1998) ¼EfsðiÞ gK eSTN0 k¼1 k
97 Paulraj, A., Nabar, R., and Gore, D.: ‘Introduction to space-time k k¼1
wireless communications’ (Cambridge University, 2003) ( " #))

98 Mendel, J.M.: ‘Tutorial on higher-order statistics (spectra) in signal pffiffiffi 1 X K
ðiÞ
processing and system theory and some applications’, Proc. IEEE, Eu exp 2 S N0 Rk cosðu þ u K RðiÞ Þ P
1991, 79, pp. 278–305 k¼1 k¼1 k
99 Nikias, C.L., and Petropulu, A.P.: ‘Higher-order spectra analysis. A
( ! )
nonlinear signal processing framework’ (PTR Prentice-Hall, PK ðiÞ 2 pffiffiffi 1 X K
Englewood Cliffs, NJ, 1993) STN01 js j ðiÞ
¼EfsðiÞ gK e k¼1 k I0 2 S N0 Rk
100 Jain, A.K., Duin, R.P.W., and Mao, J.: ‘Statistical pattern k k¼1 k¼1
recognition: A review’, IEEE Trans. Pattern Anal. Mach. Intell.,
2000, 22, pp. 4–37
101 Barbarossa, S., and Scaglione, A.: ‘Blind equalization using
ð38Þ
cost-functions matched to the signal constellations’. Proc. PK
ASILOMAR, 1997, pp. 550–54 where uR(ik ) and uPKk¼1 R(ik ) are the phases of R(i)
k and
(i)
k¼1Rk ,

respectively, and I0(u) ¼ Eufexp[u cos(u þ c)]g, where c is n ¼ 2, 4, 6, 8 and q ¼ 0, . . . , bn/2c, for BPSK and QPSK,
an arbitrary phase. given in the first and second columns of Table 2, (19) can
Similarly, in AWGN and under the assumption of be written, respectively as
per-symbol phase-incoherence owing to phase jitter, that
is vi ¼ [ffkgKk¼1 fs(i) K † K
k gk¼1] , with ffkgk¼1 as i.i.d. r.v.s, one
can easily obtain the expression of the LF LðBPSKÞ
A ½rðtÞ

n n h pffiffiffi pffiffiffi 1 2 pffiffiffi

Y
K
1 ðiÞ 2 ’ exp K S N0 ^ r;2;0 j þ 2ð S N01 Þ2 jm
jm ^ r;2;1 j
LðiÞ
A ½rðtÞ ¼ EsðiÞ eSTN0 jsk j Efk exp 2 S N01 jRðiÞ
k j
k
k¼1
pffiffiffi 4
pffiffiffi 4
ioo þ121 S N01 jm^ r;4;0 j þ 31 S N01 jm^ r;4;1 j
cosðfk þ uRðiÞ Þ
k
pffiffiffi 4
pffiffiffi 6
Y
K n 1 ðiÞ 2 pffiffiffi o þ 41 S N01 jm^ r;4;2 j þ 3601 S N01 jm^ r;6;0 j
¼ EsðiÞ eSTN0 jsk j I0 ð2 S N01 jRðiÞ
k jÞ ð39Þ
pffiffiffi 6
pffiffiffi 6
k
k¼1
þ 601 S N01 jm ^ r;6;1 j þ 241 S N01 jm ^ r;6;2 j
With vi ¼ [a w fs(i) K †
k gk¼1] , a a Rayleigh distributed r.v. and w
pffiffiffi 6
the uniform phase of the fading channel, the LF can be þ 361 S N01 jm ^ r;6;3 j þ ð43Þ
obtained by averaging (38), written for the block fading
channel [Note 2], with respect to a
LðiÞ and
A ½rðtÞ
( !)
PK X K
2
TN01 jskðiÞ j2 ðiÞ LðQPSKÞ
¼ EfsðiÞ gK Ea ea k¼1 I0 2aN01 Rk ð40Þ A ½rðtÞ
k k¼1 k¼1
pffiffiffi 4
pffiffiffi
’ exp K 2ð S N01 Þ2 jm
^ r;2;1 j þ 121 S N01 jm^ r;4;0 j
Using the following integral
n 2
o ð þ1 2 2

pffiffiffi 4
pffiffiffi 6
Ea eK1 a I0 ðaK2 Þ ¼ eK1 a I0 ðaK2 Þ2aV1 ea =V da þ41 S N01 jm^ r;4;2 j þ 601 S N01 jm^ r;6;1 j
0
ð þ1
pffiffiffi 6
pffiffiffi 6
1 1 1 1
¼ V1
2
2aeK3 a I0 ðaK2 Þ da þ24 S N0 ^ r;6;2 j þ 36
jm S N0 ^ r;6;3 j þ ...
jm
0
1
K31 K22 ð44Þ
¼ V1 K31 e4 ð41Þ
(40) becomes The difference between the log-LFs is therefore given by
(
1
LðiÞ pffiffiffi 1 2
A ½rðtÞ ¼ EfsðiÞ gK PK
1 þ VTN0 1 ðiÞ 2 lnLAðBPSKÞ ½rðtÞ lnLðQPSKÞ ½rðtÞ ’ K S N0 ^ r;2;0 j
jm
k¼1 jsk j
k k¼1
A
"P #)
VN02 j Kk¼1 RðiÞ j 2
pffiffiffi 4
pffiffiffi 6
k 1 1 1 1
exp P ð42Þ þ3 S N0 ^ r;4;1 j þ 360
jm S N0 ^ r;6;0 j þ
jm
1 þ VTN01 Kk¼1 jsðiÞk j
2
ð45Þ
8.2 Derivation of (20) for discriminating between
BPSK and QPSK
As one can easily notice from (45), the lowest order statistic
The nth order moments of symmetric constellations are zero which can be used to PKdiscriminate between BPSK and
for n odd. Using the values of the moments ms (i ),n,q for QPSK is Kjm^ r,2,0j ¼ j k¼1 r2k j, that is n2,PSK .
View publication stats

Modulation Classification

Enviado por

Dados do documento

Título original

Direitos autorais

Formatos disponíveis

Compartilhar este documento

Compartilhar ou incorporar documento

Opções de compartilhamento

Você considera este documento útil?

Este conteúdo é inapropriado?

Direitos autorais:

Formatos disponíveis

Modulation Classification

Enviado por

Direitos autorais:

Formatos disponíveis

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Survey of automatic modulation classiﬁcation techniques: Classical

Article in IET Communications · May 2007

Octavia A. Dobre Ali Abdi

SEE PROFILE SEE PROFILE

Spectrally-efficient transmission for long-haul optical networks View project

NOMA for 5G and Beyond View project

The user has requested enhancement of the downloaded file.

List of symbols 1 timing offset

IET Commun., 2007, 1, (2), pp. 137 –156 137

138 IET Commun., Vol. 1, No. 2, April 2007

the baseband signal energy, s2s (i ) ¼ M21

IET Commun., Vol. 1, No. 2, April 2007 139

where L[r(t)jvi , Hi] is the conditional LF of the noisy Lð1Þ ð2Þ .

140 IET Commun., Vol. 1, No. 2, April 2007

Author(s) Classifier(s) Modulations Unknown parameter(s) Channel

be simplified versions of such suboptimal structures [5]. ð kT

BPSK QPSK 8-PSK 16-PSK V.29 16-QAM 64-QAM

is given by [1] finally y n22q is 1 if q ¼ n/2 and 2 if q , n/2. Equation

Yes ν 2, PSK > η(νBPSK

Quasi-ALRT-based classifier under the assumption of per-

Author(s) Features Modulations Unknown parameters Channel(s)

IET Commun., Vol. 1, No. 2, April 2007

BPSK QPSK 8-PSK 16-PSK V.29 16-QAM 64-QAM

Classifier Unknown parameter Model mismatch SNR (dB) Pcc K

1 ALRT, L ¼ 1, (10) and (8), with – – 23 0.975 100

Classifier Unknown parameter Model SNR (dB) Pcc K

152 IET Commun., Vol. 1, No. 2, April 2007

Algorithm Unknown parameter Model mismatch SNR (dB) Pcc K

1 ALRT, (14) with jS(i)

IET Commun., Vol. 1, No. 2, April 2007 153

154 IET Commun., Vol. 1, No. 2, April 2007

communications’ (Artech House, 2000) with R(i)

I’ (New York, Wiley, 2001) (

wireless communications’ (Cambridge University, 2003) ( " #))

IET Commun., Vol. 1, No. 2, April 2007 155

n n h pffiffiffi pffiffiffi 1 2 pffiffiffi

156 IET Commun., Vol. 1, No. 2, April 2007

View publication stats

Você também pode gostar

Author(s) Classifier(s) Modulations Unknown parameter(s) Channel

Author(s) Features Modulations Unknown parameters Channel(s)

Classifier Unknown parameter Model mismatch SNR (dB) Pcc K

Algorithm Unknown parameter Model mismatch SNR (dB) Pcc K