This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
-- 2 E
Gin, m] ~ (z[n-~-m]
Imt<-~ M
z*[n-m])e
-j27rkm/M.
(6.1.31)
237
Discrete Time-Frequency Distributions
This pz[n, k] is the generalized d i s c r e t e q u a d r a t i c T F D . If the summand is extended in m with period M, Eq. (6.1.31) becomes
p~[n,k] - 2 DF_T{G[n,m] , (z[n+m]z*[n-m])} ; mc<M).
(6.1.32)
Because the time (n) support for z[n+m] z*[n-m] is the same as for z[n], the time support for pz[n, k] is the same as for G[n, m], z[n]. If the latter has a duration not exceeding N samples, then the non-zero elements of pz[n, k] may represented as an N • M matrix and the linear convolution in Eqs. (6.1.31) and (6.1.32) may be interpreted as modulo-N. If G[n, m] is real and even in m, then the argument of the D F T is Hermitian in m, so that pz[n, k] is real. Eq. (6.1.32) shows that the implementation of the discrete quadratic T F D involves construction of the discrete IAF, followed by convolution in n with the timelag kernel, followed by discrete Fourier transformation. These steps may be simplifled by taking advantage of symmetries, as explained in [1,2]. It remains to determine the effect of G(t, T) on the required sampling rate. If 5~r{G(t , T)} -- "y(t, f) and 9r {G(t, T)} - 9(u, T), we have the familiar results "r---~
t--* v
.
T---* f
{g(t, T) 9Kz(t,
~)} -
7(t, f) **
Wz(t, f)
S~{g(t, v) 9Kz(t, T)} -- g(u, T) Az(u, w). Comparing the above with Eqs. (6.1.11) and tional spreading of the spectrum in f but not Ifl < B c / 2 , then the total bandwidth of the required sampling rate is increased by B c in in t must be increased to match.
6.1.3.1
(6.1.33) (6.1.34)
(6.1.12), we see that there is an addiin u. If 7(t, f) is zero outside the band WVD is increased by Ba, so that the w, or 2Ba in 0, and the sampling rate
Special Cases
If G[n, m] = 5[n] g[m], then Eq. (6.1.32) reduces to Eq. (6.1.19), so that the discrete quadratic T F D reduces to the windowed DWVD. If, in addition, g[m] = 1 (that is, if G[n, m] = 5[n] ), then Eqs. (6.1.19) and (6.1.32)reduce to Eq. (6.1.10), so that the windowed DWVD and the discrete quadratic T F D reduce to the DWVD. Two of the three theorems above concern the sampling of a window or kernel function. Theorem 6.1.1 seems to be an exception in that the entire IAF is sampled (which is possible only in theory, as one cannot compute a continuous IAF in practice). However, because the WVD may be considered as a windowed WVD with g(T) = 1, Theorem 6.1.1 can be restated in terms of sampling the lag window, like Theorem 6.1.2.
238
Chapter 6:
6.1.3.2
Implementation and Realization of TFDs
Doppler-Frequency Form
The Doppler-frequency form of the general quadratic TFD is =
(6.1.35)
=
g(~,u) Z(f-~+~)Z*(f-ri-~)drleJ2""tdu oo
where
g(u, f)
(6.1.36)
oo
is the Doppler-frequency kernel. If we define
g(~, f) = g(u, f) _
where ~ -
u/2,
(6.1.37)
we obtain the alternative definition
p~(t,f) - 2
(;((,~) Z(f -rl+() Z*(f -rl-()e34'~tdrld(.
(6.1.38)
As the time-lag definition of the quadratic TFD leads to Theorem 6.1.3, so the Doppler-frequency definition leads to the following result. T h e o r e m 6.1.4: If p~(t,f) is modified by ideally sampling g(rl, u) in ~ at ~ = ~ , / 2 - ~ and in r/ at r / = ~ , where l and q are integers and N is a positive integer and f~ is a positive constant, and if g(rl, u) is band-limited so that g(r/, u) = 0
for
I~l = [~[ -> 4 ~ or [r/I _> 9f~aN
(6.1.39)
where L and Q are positive integers, and if the modified TFD is denoted by then t" n [k-q+l]fs ) Z*( [k-q-l]f~ eJ2rrln/N
pz(T , k2@) = 2 E
E
O(~2-~N' 2~) Z(
2N
2N
)
.
"fiz(t, f), (6.1.40)
Ill< ~ Iql< P r o o f : Parallel to the proof of Theorem 6.1.3 and the subsequent discussion.
9
With a change of notation, Eq. (6.1.40) becomes
p~[n, k] - 2 E
E
g[l, q] Z[k-q+l] Z*[k-q-l] ej2~tn/y
(6.1.41)
I/1< ~ Iql<
= 2 ~ g[l, k]. (Z[k+l] Z*[k-l])e ~2"Z~/g.
(6.1.42)
IZl< -~
If the summand is extended in l with period L, Eq. (6.1.42) becomes
,~[n, k] - 2 IDFT {g[1, k],(Z[k+l] Z*[k-l])} ; 1E (L).
(6.1.43)
The frequency support for pz[n, k] is the same as for G[1, k], Z[k]. If the latter support has a width not exceeding K frequency samples, then the non-zero elements of pz[n, k] may be represented as an L x K matrix and the linear convolution in Eqs. (6.1.42) and (6.1.43) may be interpreted as modulo-K. While the dimensions of the TFD matrix seem to differ from those in Theorem 6.1.3, the dimensions are upper bounds and may be matched, if desired, by zero-padding.
Discrete Time-Frequency Distributions
239
Table 6.1.1: Kernel requirements for selected properties ("Prop.") of quadratic TFDs, in time-lag and Doppler-frequency domains, for general continuous and discrete TFDs, and discrete TFDs with Doppler-independent (DI) and lag-independent (LI) kernels. Note that 0 = 7/2 and ~c - - v / 2 . KERNEL CONSTRAINTS
Continuous
Discrete
Discrete DI
Discrete LI
C[n,m] = ~[~]g2[~n] G[l, k] = G2[k]
GIn, m] = gl [n] ~[l,k] = Cl[t]~[k]
g2[m] = g~[-m]
gl In] is real.
Prop.
G(t, ~-) = c * ( t , -~-).
G[..m]
TM:
G(t, O) = 6(t)
G[n, 0] = 6In]
g2[0] = 1
W V D only
FM:
G(0, f ) = 6(f)
G[0, k] = 6[k]
W V D only
GI[0] = 1
9~[0] = 1 ; Ek kC2[k] = 0.
W V D only
RE:
IF: TD: TS: FS:
6.1.4
= a*[n,-m].
5In] ;
G(t, o) = ~(t) ; .f fG(~,f) df=O.
Gin. 01 --
G(O, f ) = 6(f) ;
~[0, k] = ~[k] ; EnnG[n,m] = 0 .
W V D only
f tG(t,r)dt = O.
c(t, ~-)=o if IOl< Itl,
Gin, m] = o
Always
W V D only
if Iml < Inl. W V D only
Always
E k kG[t, k] = 0.
~(-, f ) = 0
6[t. k] = 0
if I~1< Ifl"
if jl] < Ikl.
GI[0] = 1 ;
~-~n ngl [n] -- 0.
Desirable Properties; Kernel Constraints
The desirable properties of continuous T F D s (defined in Section 3.1.1, p. 60 if) are easily redefined for discrete TFDs. Some important examples are given below. R e a l n e s s ( R E ) says simply that the T F D is real. The m a r g i n a l c o n d i t i o n s , which may be considered optional for signalprocessing purposes, are the t i m e m a r g i n a l ( T M )
~ k pz[n, k] - Jz[n]l 2
(6.1.44)
and the f r e q u e n c y m a r g i n a l ( F M ) E n pz[n, k] - IZ[k]] 2.
(6.1.45)
The IF p r o p e r t y says that the periodic first moment of the T F D w.r.t, frequency is the instantaneous frequency. Its dual, which seems to be regarded as less important, is the t i m e d e l a y p r o p e r t y ( T D ) , and says that the periodic first moment of the T F D w.r.t, time is the time delay. The t i m e s u p p o r t ( T S ) property says that if z[n] = 0 everywhere except n l < n _< n2, then p z [ n , k ] - 0 everywhere except n l _< n _< n2. Similarly, the
240
Chapter 6:
Implementation and Realization o f TFDs
T a b l e 6 . 1 . 2 : Kernels of selected TFDs in time-lag and Doppler-frequency domains 9For the spectrogram and windowed Levin (w-Levin) distributions, the window w[n] is assumed to be real and even. W[k] denotes the sampled spectrum of the window w(~-). In the PROPERTY column, an exclamation (!) means that the property is always satisfied, while an asterisk (*) means that the property is satisfied subject to non-degenerate constraints on the window and/or parameter.
PROPERTY
KERNEL Distribution WVD Levin
G[n,m]
G[1, k]
RE
5[n]
5[k]
'
!
w
I
1 5[n + m]
+ 89 Born-Jordan
1 n rect ( 4-h--~m)] * * [ sinc n sinc m] cosh- 2~ n ,n cosh-2fl n
windowed WVD
5In] w[m]
TD
TS
FS
!
!
'
!
'
v
w
v
w
!
w
1
k rect (h-~al)]
I
I
I
I
*
*
** [ sinc k sinc l] !
W[k]
W[k+t] 1 +~w[k-l]
lw[m]~[n+m]
~~[~n] ~[~-~n] Zhao-Atlas-Marks
IF
+3
Modified B
w-Levin
TM FM
* !
I
I
I
*
*
!
*
*
v
[w[m] rect (~-~)] ** [ sinc n sinc m]
Rihaczek w-Rihaczek Page
Choi-Williams
5In - m]
:
6[k + l]
!
w * [ - m ] 6[n - m]
~
W*[k + l]
*
5In- Jml] x/W~ e x p ( - ~ ' 2 a n 2 ) 2[m[ 4m 2 ** [ sinc n sinc m] (cosh2n)
spectrogram
!
'
!
w
11 5 r k + l l + l1 S[k_l]
[
w
!
!
+ _ ~- ,l , ~ - l 2 ) 9 s i n c l e x p ( -Tr2ak2~ 4l 2 ] 9* [ sinc k sinc l]
I
I
I
I
! I
* sinc m
~[n+m]~[n-m]
w[k + l] WEk-l]
frequency s u p p o r t ( F S ) p r o p e r t y says that if Z [ k ] - 0 everywhere except kl _< k _< k2, then pz[n, k] - 0 everywhere except kl < k < k2. The class of TFDs satisfying realness, the time and frequency marginals, the IF property and the time and frequency support properties is called class P. The properties listed above are selected from Table 3.3.1 on p. 75. For each property, Table 3.3.1 gives necessary and sufficient conditions on the kernel of the general continuous quadratic TFD. To obtain the corresponding conditions for discrete TFDs, we first express the conditions entirely in the time-lag and Dopplerfrequency domains, using the scaled variables 0 and ~ where convenient, obtaining the "Continuous" column of Table 6.1.1. Then we sample the kernels as specified
Discrete Time-Frequency Distributions
241
in Theorems 6.1.3 and 6.1.4, obtaining the "Discrete" column of Table 6.1.1. The remaining columns are obtained by specialization. The sampling of the time-lag kernel will be free of aliasing if the kernel is first band-limited in t and 0 to +fs/2. This causes 5(t) to be discretized as 5[n]. Similarly, the sampling of the Doppler-frequency kernel will be free of aliasing if the kernel is first time-limited in f and ~ to +N/fs. This causes 5(f) to be discretized as 5[k].
6.1.5
Examples
By sampling the kernels of common continuous quadratic T F D s [see Table 3.3.2 on p. 76], we obtain the new Table 6.1.2, which lists the kernels for the discrete versions of those TFDs. The convolutions with sinc functions are performed before restricting the variable to integer values; this requires oversampling and is computationally inefficient. The convolutions in n and rn arise from the band-limiting of the timelag kernel prior to sampling. This band-limiting of the kernel does not affect the result of convolving the kernel with the IAF, because the IAF is assumed to be similarly band-limited. Similarly, the convolutions in k and l arise from the timelimiting of the Doppler-Frequency kernel prior to sampling. Where no convolution appears in the kernel, either a sinc function has been converted to a discrete delta function by the sampling, or a window function is assumed to provide sufficient filtering.
6.1.6
Summary and Conclusions
Ideal sampling of window or kernel functions leads to straightforward definitions of discrete-time forms of the WVD, the windowed WVD and other quadratic TFDs. Use of the analytic signal minimizes the required sampling rate. Further theoretical details may be found in [3-5]. Some practical computational issues will be examined in Article 6.5.
References [1] B. Boashash and P. J. Black, "An efficient real-time implementation of the Wigner-Ville distribution," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 35, pp. 1611-1618, November 1987. [2] B. Boashash and A. Reilly, "Algorithms for time-frequency signal analysis," in TimeFrequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 7, pp. 163181, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [3] J. C. O'Neill and W. J. Williams, "Shift-covariant time-frequency distributions of discrete signals," IEEE Trans. Signal Processing, vol. 47, pp. 133-146, January 1999. [4] M. S. Richman, T. W. Parks, and R. G. Shenoy, "Discrete-time, discrete-frequency time-frequency analysis," IEEE Trans. Signal Processing, vol. 46, pp. 1517-1527, June 1998. [5] A.H. Costa and G. F. Boudreax-Bartels, "An overview of aliasing errors in discrete-time formulations of time-frequency distributions," IEEE Trans. Signal Processing, vol. 47, pp. 1463-1474, May 1999.
242
6.2
Chapter 6:
Implementation and Realization of TFDs
Q U A D R A T I C A N D HIGHER ORDER T I M E - F R E Q U E N C Y ANALYSIS BASED ON THE STFT ~
The oldest, simplest, and most commonly used tool for time-frequency (TF) analysis of a signal x(t) is the spectrogram, defined as the squared magnitude of the short time Fourier transform (STFT) [1]. The STFT is
F~(t, f) -
//
x(t + T)W(T)e--J2~/'dT,
(6.2.1)
(X)
where W(T) is a real-valued even lag window [cf. Section 2.3.1]. Implementations (hardware and software) of this transform are already widely present in practice. The STFT is linear and very simple for realization. However, it has some serious drawbacks. The most important one lies in its low concentration in the TF plane, when highly nonstationary signals are analyzed. In order to improve TF representation, various quadratic distributions have been introduced. The most important member of this class is the pseudo Wigner distribution (WD)
Wx(t, f ) -
/
T
T
w(-5)w(--5)z(t~
~
T
T
9
+ -~)x*(t - -5)e-32"/~dT.
(6.2.2)
Y.d
O0
The WD itself has a drawback. Namely, in the case of multicomponent signals, 9 (t) - ~-]p=l P Xp(t), it produces emphatic cross-terms that can completely mask the auto-terms and make this distribution useless for analysis. This is why many other quadratic reduced interference distributions have been introduced (Choi-Williams, Zhao-Atlas-Marks, Born-Jordan, Butterworth, Zhang-Sato...) [1], [Article 6.4]. The cross-terms reduction in these distributions is based on a kind of the Wigner distribution smoothing, which inherently leads to the auto-terms degradation [2]. In contrast to these TF representations, which are focused on the preservation of marginal properties and the cross-terms reduction, the S-method (SM), which is the topic of this article, is derived with the primary goal to preserve the auto-terms quality as in the WD, while avoiding (reducing) the cross-terms. The software and hardware realization of this method is very efficient, since it is completely based on the STFT. The SM can, in a straightforward manner, be extended to the crossterms free (reduced) realization of the higher order TF representations, time-scale representations, and multidimensional space/spatial-frequency representations.
6.2.1 6.2.1.1
S T F T Based Realization of the Quadratic Representations Basic S-Method Form
Relation between the STFT and the WD, [3],
W=(t, f) - 2
/?
F=(t, I + O)F~(t, f - O)dO
(6.2.3)
O@
~ LJubi~,a Stankovid, Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (1.stankovic@ieee.org). Reviewers: J. F. Bhhme, S. Carstens-Behrens and B. Ristic.
243
Quadratic and Higher Order Time-Frequency Analysis Based on the STFT
has led to the definition of a T F representation, referred to as the S-method (SM), SMx(t, f) - 2
/?
P(O)Fx(t, f + O)F;(t, f - O)dO.
(6.2.4)
O0
The special cases of the SM are two most important T F distributions: (1) For P(O) = 1 the WD follows, SMx(t, f) = Wx(t, f), and (2) For P(O) = 5(0)/2, the spectrogram SM~(t, f) -IF~(t, f)l 2 = S~(t, f) is obtained. By changing the width of window P(0), denoted by 2Lp (P(O) = 0 for I 0 I> Lp), between these two extreme cases we can get a gradual transition from the spectrogram to the WD. The best choice of Lp would be the value when P(O) is wide enough to enable complete integration over the auto terms, but narrower than the distance between the auto-terms, in order to avoid the cross terms, Fig. 6.2.1. Then, the SM produces
the sum of the WDs of individual signal components, avoiding cross-terms. P r o p o s i t i o n : Consider the signal x(t) = ~'~P p = l xp(t) where xp(t) are m o n o c o m p o n e n t signals. A s s u m e t h a t the S T F T of each c o m p o n e n t lies inside the region Dp(t, f ) , p = 1, 2, ..., P. Denote the length of the p - t h region along f, for a given t, by 2Bp(t), and its central frequency by fop(t). The SM of x(t) produces the sum of the W D s Wxp (t, f ) of each signal c o m p o n e n t xp(t), SMx(t, f ) = E p _P_ 1 W x p (t, f ) ,
(6.2.5)
if the regions Dp(t, f), p = 1, 2, ..., P, do not overlap, Dp(t, f ) N Dq(t, f ) = q} for p ~: q (meaning cross-terms free s p e c t r o g r a m ) , and if the width of the r e c t a n g u l a r window P(O), for a point (t, f ) , is defined by Lp(t, f) = Bp(t) - I f fop(t)l for (t, f ) c Dp(t, f), and 0 elsewhere. P r o o f : Consider a point (t, f ) inside a region Dp(t, f). T h e integration interval in (6.2.4), for the p - t h signal c o m p o n e n t is s y m m e t r i c a l with respect to 0 -- 0. It is defined by the smallest absolute value of 0 for which f + 0 or f - 0 falls outside Dp(t, f), i.e., If + 0 - fop(t)l k Bp(t) or I f - 0 - fop(t)l >_ Bp(t). For f > fop(t) and positive 0, the integration limit is reached first in If + 0 - fop(t)l > Bp(t) for 0 = Bp(t) - (f - fop(t)). For f < fop(t) and positive 0, the limit is reached first in I f - 0 - fop(t)l >_ Bp(t) for 0 -- Bp(t) - (fop(t) - f). Thus, having in mind the interval s y m m e t r y , an integration limit which produces the same value of integral (6.2.4) as the value of (6.2.3), over the region Dp(t, f), is given by Lp(t, f) in the Proposition. Therefore, for (t, f) C Dp(t, f) we have SMx(t, f ) = Wxp(t, f). Since Lp(t, f ) = 0 for (t, f ) ~ Dp(t, f), p = 1,2,...,P. N o t e : Any window P(O) with constant width L p 2> m a x ( t , f ) { L p ( t , f ) } produces S M x ( t , f ) = :~'.pP1Wxp (t, f ) , if the regions Dp(t, f), p = 1, 2, .., P, are at least 2Lp a p a r t along the frequency axis, i.e., Ifop(t) - foq(t)l > Bp(t) + Bq(t) + 2Lp, for each p, q and t. This is the SM with constant window width (6.2.4). If two components overlap for some time instants t, then the cross-term
will appear, but only between these two components and for that time instants.
The SM belongs to the general class of quadratic T F distributions, whose inner product form reads
px(t, f) --
(~(tl, t2)[x(t + tl)e -j2~ftl] [x(t + t2)e-J2rft2] * dtl dt2. O0
O0
(6.2.6) If the inner product kernel (~(tl,t2) is factorized in the Hankel form (~(t~,t2) 2w(tl)p(tl + t2)w(t2), then by substituting its value into (6.2.6), with P ( - f ) = Ut~f{p(t)}, we get (6.2.4). Note that the Toeplitz factorization of the kernel G(tl,t2) -- 2w(tl)p(tl -tz)w(t2) results in the smoothed spectrogram. The
Chapter 6:
244
Implementationand Realization of TFDs
Fig. 6.2.1: Illustration of the SM calculation including two special cases: the WD and the spectrogram
smoothed spectrogram composes two STFTs in the same direction, resulting in the distribution spread, in contrast to the SM, where two STFTs are composed in counterdirection, resulting in the concentration improvement (Fig. 6.2.1; reference [4]; Article 9.1). The SM kernel in Doppler-lag domain is given by c(u, T) = P(u/2)*v Aww(u, T), where Aww(u, T) is the ambiguity function of W(T), and .~ denotes a convolution in u. Generally, this kernel is not a separable function.
6.2.1.2
Other Forms of the S-Method in Quadratic Representations
T i m e d i r e c t i o n f o r m of the SM is
SM=(t, f) = 2
F
P(v)F=(t + T, f)F~(t - T, f)e-J4~/rdv.
(6.2.7)
oo
It results from the same analysis as (6.2.4), based on the frequency domain windowed WD, Wz(t, f) = f _ ~ W ( O / 2 ) W ( - O / 2 ) X ( f + O/2)X*(f - 0/2)exp(j27rtO)dO. F r a c t i o n a l d o m a i n form: The frequency and time direction forms of the SM can be generalized to any direction in the time-frequency plane. Consider the fractional FT of x(t), denoted by X~(u) [Articles 4.8, 5.8]. Its STFT is
F~(u, v) -
F
X~(u + T)h(r)exp(-j27rvr)dr.
(6.2.8)
oo
where h(7) is the lag window. The SM in the fractional domain, is defined by SMx (U,
= 2
F
P(O)F2(u, +
0)F2* (-,
-
O)dO,
oo
It can be easily realized based on the signal's fractional FT and (6.2.8).
(6.2.9)
245
Quadratic and Higher Order Time-Frequency Analysis Based on the 5TFT
Using the STFT rotational F~ e x p ( - j T r t f ) with u = t c o s a + can rewrite (6.2.9) as SMz~ (t, f) -
2
j'?
oo
property, Fff (u, v)exp(-jTruv) = fsina, v = -tsina+ f c o s a , [5], we
P(O)F ~ (t - 0 sin oz, f + 0 cos a)
x F ~ (t + 0 sin a, f - 0 cos a)e j4~f~ sin
C~dO '
(6.2.10)
For a -- 0 it gives (6.2.4), while (6.2.7) follows for a - -7r/2. For the derivation of (6.2.10) the lag window h(T) is formally assumed as ( W ~ ( - T ) ) *. Optimal direction for the fractional SM calculation can be obtained based on the fractional moments analysis in [Article 4.8]. It has been used in [5]. Affine S M form: Continuous wavelet transform (WT) is defined by D=(t, f ) t ) f / f o ) d ~ - / v / f o / f [ . As in [6] we used frequency instead of scale a - f o / f . Consider h ( t ) i n the form h(t) - w(t)exp(j27rfot) which provides a strong formal connection of the W T with the STFT. The pseudo affine Wigner distribution is defined by
f~x(w)h*((r-
wa(t, f)-
_=x) oo
T T W(~of)W(-~of)X(t
T T + -~)x*(t - -~)e-J2~r d~-.
(6.2.11)
The affine SM form reads: SMa(t, f) - 2
/?
oo
P(O)D=(t, f; fo + O)Dx(t, f; fo - O)dO,
(6.2.12)
where Dx(t, f; fo + O) is the W T calculated with h(t) - w(t)exp(j27r(f0 + O)t). If P(O) - 5(0)/2, then SMa(t, f) is equal to the scalogram of x(t), while for P(O) - 1 it produces w a ( t , f) defined by (6.2.11). This form of the SM has been extended to other time-scale representations in [6].
6.2.2
Discrete Realization of the Basic S-Method Form
Discrete SM, for a rectangular window P(O), follows from (6.2.4)
SMx(~, k) -
2
2
Lp
~--[IFx(~' k)l + 2Re{~-~i= 1"---"
F x ( n , k -t- i ) F ; ( n , k
-i)}]
(6.2.13)
where" Fx(n, k) - DFTi--,k { x ( n + i ) w ( i ) A t } , At is the sampling interval, Tw - N A t is the width of w(T), and 2Lp + 1 is the width of P(O) in the discrete domain. For notation simplicity we will assume normalized 2/Tw - 1. Recursive relation for the SM calculation is
SMx(n,k;Lp) - SMx(n,k;Lp - 1)+ 2Re{F~(n,k + Lp)F~(n,k-
L p ) } (6.2.14)
where SM=(n, k; 0) - IF=(n, k)l 2, and SM=(n, k; Lp) denotes SM=(n, k) in (6.2.13) calculated with Lp. In this way we start from the spectrogram, and gradually make
245
Chapter 6:
Implementation and Realization of TFDs
the transition toward the WD. The calculation in (6.2.13) and (6.2.14) needn't be done for each point (n,k) separately. It can be performed for the whole matrix of the SM and the STFT. This can significantly save time in some matrix based calculation tools. In the SM calculation: (1) There is no need for analytic signal calculation since the cross-terms between negative and positive frequency components are removed in the same way as are the other cross-terms [7]. (2) If we take that Fz(n, k) = 0 outside the basic period, i.e., when k < - N / 2 or k > N / 2 - 1, then there is no aliasing when the STFT is alias-free (in this way we can calculate the alias-free WD by taking Lp = N/2 in (6.2.13)). For the SM realization we have to implement the STFT first, based either on the F F T routines or recursive approaches suitable for hardware realizations [3, 7]. After we get the STFT we have to "correct" the obtained values, according to (6.2.13), by adding few terms 2Re{Fx(n,k + i)F~(n,k - i)} to the SPEC values. There are two possibilities to implement the summation in (6.2.13): (1) With a signal independent LB. Theoretically, in order to get the WD for each individual component, the length Lp should be such that 2Lp is equal to the width of the widest auto-term. This will guarantee cross-terms free distribution for all components which are at least 2Lp samples apart. For components and time instants where this condition is not satisfied, the cross-terms will appear, but still in a reduced form (see also [Article 7.3]). (2) With a signal dependent Lp -- Lp(n, k) where the summation, for each point (n, k), lasts until the absolute square value of Fx(n, k + i) or Fx(n, k - i) is smaller than an assumed reference level R. If a zero value may be expected within a single auto-term, then the summation lasts until two subsequent zero values of Fx (n, k + i) or Fz(n, k - i) are detected. The reference level is defined as a few percent of the spectrogram's maximal value at a considered instant n, Rn = m a x k { S z ( n , k ) } / Q 2, where Q is a constant. Index n is added to show that the reference level R is time dependent. Note that if Q2 __~ c~, the WD will be obtained, while Q2 = 1 results in the spectrogram. A choice of an appropriate value for design parameter Q2 will be discussed in Example 2. Example 1: Consider a real-valued multicomponent signal
x(t) -- cos(1200(t + 0.1) 2) + r -36(t-1/3)2 cos(1200(t + 1/2) 2) + e -36(t-2/3)2 cos(1200(t - 1/3) 2) + cos(9607rt) within the interval [0, 1], sampled at At = 1/1024. This sampling rate is very close to the Nyquist rate for this signal, that is 1/960. The Hanning window of the width T~ = 1/4 is used. The spectrogram is shown in Fig. 6.2.2(a). Its "corrected" version (the SM), according to (6.2.13), with five terms, Lp = 5, is shown in Fig. 6.2.2(c). The auto-terms are concentrated almost as in the WD, Fig. 6.2.2(b). The ChoiWilliams distribution (CWD), whose kernel reads c(u, T) = exp(--(~7)2), is shown in Fig. 6.2.2(d). Normalized values - v / ~ N / 2 < ]2~v[ < v / ~ N / 2 , - V / T r N / 2 < IT[ < v/~N/2, and 128 samples within that interval, are used. If the analytic part of x(t) were used, similar results would be obtained [see Fig. 9.1.1 on p. 379].
Quadratic and Higher Order Time-Frequency Analysis Based on the STFT
247
Fig. 6.2.2: Time-frequency representation of a real-valued multicomponent signal" (a) Spectrogram, (b) Pseudo Wigner distribution, (c) S-method with five "correcting terms " L p = 5, (d) Choi-Williams distribution, as the representative of reduced interference distributions.
Example 2: The adaptive SM realization will be illustrated on a three-component real signal, with a nonlinear FM component, x(t) - e -t2 cos(25~t) + cos(120t 3 + 45~t) + 1.5e -25t2 cos(40~t 2 + 150~t) with the sampling interval At -- 1/256. The signal is considered within the time interval [-1, 1]. The Hanning window of the width Tw - 1 is used. The spectrogram is presented in Fig. 6.2.3(a), while the SM with the constant Lp = 3 is shown in Fig. 6.2.3(b). The concentration improvement with respect to the case Lp = O, Fig. 6.2.3(a), is evident. Further increase of Lp would improve concentration, but it would also cause that some cross-terms appear. Some small changes are already noticeable between the components with quadratic and constant IF. An improved concentration, without cross-terms, can be achieved by using the variable window width Lp. The regions D~(n, k), determining the summation limit Lg(n, k) for each point (n, k), are obtained by imposing the reference level Rn corresponding to Q2 = 50. They are defined as: Di(n,k) - 1 when Sx(n, k) >_ Rn - m a x k { S z ( n , k ) } / Q 2, and D~(n,k) - 0 elsewhere, Fig. 6.2.3(c). White regions mean that the value of spectrogram is below 2% of its maximal value at that time instant n, meaning that the concentration improvement is not performed at these points. The signal dependent SM is given in Fig. 6.2.3(d). The method sensitivity, with respect to the
Chapter 5:
248
Implementation and Realization of TFDs
Fig. 6.2.3: Time-frequency analysis of a multicomponent signal: (a) Spectrogram, (b) The S-method with a constant window, with Lp = 3, (c) Regions of support for the S-method with a variable window width calculation, corresponding to Q2 = 50, (d) The S-method with the variable window width calculated using regions in (c).
value of Q2, is low.
6.2.3
STFT Based Realization of Higher Order Representations
In order to improve distribution concentration in the case of nonlinear FM signals, the higher order time-varying spectra have been defined (Wigner higher order spectra, Multitime Wigner distributions). For practical realizations the most interesting are the versions of these spectra reduced to the two-dimensional TF plane [8]. Here, we will present the L-Wigner distribution (LWD) and the fourth order polynomial Wigner-Ville distribution (PWVD).
6.2.3.1
The L-Wigner Distribution
The L-Wigner Distribution (LWD)is defined by [7,9]
r )x L (t + - T~ ) e -j2~ f ~"dr. LDL(t, f) = /_~ WL(T)X *L (t -- ~-~ c~
For L = 1 it reduces to the WD.
(6.2.15)
249
Quadratic and Higher Order Time-Frequency Analysis Based on the 5TFT
T h e L W D is a w i n d o w e d slice of t h e m u l t i t i m e W i g n e r d i s t r i b u t i o n s : cx)
W K ( t l , ..., tk, f ) =
K
L--1
T
x* ( E i--1
K
T
ti + K + 1 ) n
x* ( - t i + K + 1 ) n
i--1
T
)ej2~rfrdT
x(ti - K +------1
'
i--L
along the line tl -- t2 . . . . . t L - 1 -- --t, tL -- tL+l . . . . . tK -- t, w h e r e t h e a u t o - t e r m s in W ~ (tl .... , tk, f ) are located, for L = ( K + 1)/2 [9]. Similarly, s t a r t i n g from the W i g n e r higher order s p e c t r a , dual to WxK(tl, t2, ...,tk, f ) , we get a d i s t r i b u t i o n dual to (6.2.15),
LW[(t, f)=
f_~
WL(O)x*L(f + -o ~
)xL(f_ -~--L)e-J2~retdO. o
(6.2.16)
(x)
s t u d i e d in details in [10]. Its realization is formally t h e s a m e as for t h e t i m e d o m a i n L W D .
For a frequency modulated signal x(t) = exp(jr
LWL(t, f ) -
WL(f - r
the LWD produces [7]
FT{e(J(r162162162
where 71,72 are the values of 7 within the lag window WL(T), and WL(f) = .P~_~I{WL(T)}. For L --+ ~ , the LWD tends to a distribution completely concentrated along the IF, i.e., LWL(t, f) --~ W L ( f - r The relationship between LW2L(t, f) and LWL(t, f) is of form (6.2.3),
LW2L(t, f) - 2
F
LWL(t, f + O)LWL(t, f - O)dO
oo
The realization of cross-terms and alias free version of the LWD may be efficiently done in the discrete domain, by using the SM form (6.2.13), as: (6.2.17) with LW1 (n, k) = W=(n, k), and Wz(n, k) calculated according to (6.2.13). Form (6.2.17) is very convenient for software and hardware realizations since the same blocks, connected in cascade, can provide a simple and efficient system for higher order T F analysis, based on the S T F T in the initial step, and the signal sampled at the Nyquist rate. Numerical examples and illustrations of the LWD can be found in [7, 9-11].
6.2.3.2
Polynomial Wigner-Ville Distribution
Modification of the presented method for the realization of the P W V D is straightforward. The fourth order P W V D can be written in a frequency scaled form [12]
7 A '2r ) x ( t _ A T_ ~ PWx(t,f) - o~--~ -., J-f ~ x2(t+-4T)x*2(t- -4)x*(t+ 2)e-J~-~dT, (6.2.18) oo
where A - 0.85//1.35 and f' - f/2.7. Note that PW=(t, f') - 2~ LW2(t, f') .f, WxA(t, f'), where wA(t, f') -- FT{x*(t + A2)x(t - A2) } is the scaled and reversed
250
Chapter 6:
Implementation and Realization of TFDs
Fig. 6.2.4: Time-frequency representation of a real-valued multicomponent signal: (a) The SM (crossterms and alias free version of the WD), (b) Polynomial Wigner-Ville distribution realized based on the STFT by using the SM and its order recursive form.
version of the WD. The cross-terms free realization of the WD and LWD is already presented. In the discrete implementation of the above relation, the only remaining problem is the evaluation of wA(t, f') on the discrete set of points on frequency axis, f' - - k A f ' . Since wA(t, f') is, by definition, a scaled and reversed version of Wx(t, f'), its values at f ' = - k A f ' are the values of Wx(t, f') at f' - k A f ' / A . However, these points do not correspond to any sample location along the frequency axis grid. Thus, the interpolation has to be done (one way of doing it is in an appropriate zero padding of the signal). A discrete form of convolution (6.2.18), including rectangular window P(O) and the above considerations, is Lp
PWz(n, k) - E i = - L p LW2(n, k + i)I?V~(n, k + i/A)
(6.2.19)
where 2Lp + 1 is the width of P(O) in the discrete domain, while I?Vx(n, k + i/A) is the WD approximation. We can simply use I?Vx(n,k + i/A) = SMx(n,k + [i/A]) where [i/A] is the nearest integer to i/A, or use the linear interpolation of the SM values at two nearest integers. The terms in (6.2.19), when k + i or k + [i/A] is outside the basic period, are considered as being zero in order to avoid aliasing. Example 3: Consider real-valued multicomponent signal
x(t)
-
cos(20 sin(Trt) + 307rt) + sin(20 cos(Trt) + 1007rt)
within - 1 <_ t < 1, with At -- 1/128. In the realization, a Hanning window of the width Tw - 2 is used. Based on the STFT (using its positive frequencies), the cross-terms free WD is obtained from (6.2.13) with Lp - 15, and denoted by SM, Fig. 6.2.4(a). Then the LWD, with L - 2, is calculated according to (6.2.17). It is combined with the linearly interpolated SM value into the P W V D (6.2.19), shown in Fig. 6.2.4(b). For the precise implementation of [i/A] the lag window has been zero-padded by a factor of 2.
6.2.4
Summary and Conclusions
The STFT based realization of quadratic TF representations, having the autoterms close or the same to the ones in the WD, but without (or with reduced)
Quadratic and Higher Order Time-Frequency Analysis Based on the STFT
251
cross-terms, is presented. For this realization the S-method is used. The method is generalized, in an order recursive form, for the realization of higher order T F representations. Applications of the presented method on, for example, time-scale representations [6], and multidimensional space/spatial-frequency analysis [13], are straightforward. Hardware realization of the S-method is also simple and direct [7].
References [1] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [2] L. Stankovi(~, "Auto-term representation by the reduced interference distributions: A procedure for kernel design," IEEE Trans. Signal Processing, vol. 44, pp. 1557-1563, June 1996. [3] L. Stankovi~, "A method for time-frequency analysis," IEEE Trans. Signal Processing, vol. 42, pp. 225-229, January 1994. [4] L. L. Scharf and B. Friedlander, "Toeplitz and Hankel kernels for estimating timevarying spectra of discrete-time random processes," IEEE Trans. Signal Processing, vol. 49, pp. 179-189, January 2001. [5] L. Stankovi(~, T. Alieva, and M. Bastiaans, "Fractional-Fourier-domain weighted Wigner distribution," in Proc. Eleventh IEEE Workshop on Statistical Signal Processing, pp. 321-324, Singapore, 6-8 August 2001. [6] P. Gonw and R. G. Baraniuk, "Pseudo affine Wigner distributions: Definition and kernel formulation," IEEE Trans. Signal Processing, vol. 46, pp. 1505-1517, June 1998. [7] L. Stankovi5, "A method for improved distribution concentration in the time-frequency analysis of the multicomponent signals using the L-Wigner distribution," IEEE Trans. Signal Processing, vol. 43, pp. 1262-1268, May 1995. [8] B. Ristic and B. Boashash, "Relationship between the polynomial and higher order Wigner-Ville distribution," IEEE Signal Processing Letters, vol. 2, pp. 227-229, December 1995. [9] L. Stankovi5, "Multitime definition of the Wigner higher order distribution: L-Wigner distribution," IEEE Signal Processing Letters, vol. 1, pp. 106-109, July 1994. [10] L. Stankovi5, "An analysis of Wigner higher order spectra of multicomponent signals," Ann. Tdldcommunications, vol. 49, pp. 132-136, March/April 1994. [11] L. Stankovi5, "An analysis of some time-frequency and time-scale distributions," Ann. Tdldcommunications, vol. 49, pp. 505-517, September/October 1994. [12] B. Boashash and B. Ristic, "Polynomial time-frequency distributions and time-varying higher order spectra: Application to the analysis of multicomponent FM signal and to the treatment of multiplicative noise," Signal Processing, vol. 67, pp. 1-23, May 1998. [13] S. Stankovi5, L. Stankovi(~, and Z. Uskokovid, "On the local frequency, group shift, and cross-terms in some multidimensional time-frequency distributions: A method for multidimensional time-frequency analysis," IEEE Trans. Signal Processing, vol. 43, pp. 17191725, July 1995. [14] L. Stankovi5 and J. F. BShme, "Time-frequency analysis of multiple resonances in combustion engine signals," Signal Processing, vol. 79, pp. 15-28, November 1999.
252
6.3 6.3.1
Chapter 6:
Implementation and Realization of TFDs
GABOR'S SIGNAL EXPANSION FOR A N O N - O R T H O G O N A L SAMPLING G E O M E T R Y ~ Historical Perspective
In 1946 [1], Gabor suggested the representation of a time signal in a combined time-frequency domain (see Section 2.3.3); in particular he proposed to represent the signal as a superposition of shifted and modulated versions of a so-called elementary signal or synthesis window g(t). Moreover, as a synthesis window g(t) he chose a Gaussian signal, because such a signal has good localization in both the time domain and the frequency domain. The other choice that Gabor made was that his signal expansion was formulated on a rectangular lattice in the time-frequency domain (mT, kgt), where the sampling distances T and Ft satisfied the relation gtT = 27r. The coefficients in Gabor's signal expansion can be determined by using an analysis window w(t). In the case of critical sampling, i.e., f~T = 27r, the analysis window w(t) follows uniquely from the given synthesis window g(t). However, such a unique analysis window appears to have some mathematically very unattractive properties. For this reason, the expansion should be formulated on a denser lattice, ~ T < 27r. This makes the analysis window no longer unique and thus allows for finding an analysis window that is optimal in some way. We can, for instance, look for the analysis window that resembles best the synthesis window; a better resemblance can then be reached for a higher degree of oversampling. A better resemblance can also be reached if we adapt the structure of the lattice to the form of the window as represented in the time-frequency domain. For the Gaussian window, for instance, the time-frequency representation has circular contour lines, and circles are better packed on a hexagonal lattice than on a rectangular lattice. Gabor's signal expansion on such a hexagonal, non-orthogonal lattice then leads to a better resemblance between the window functions g(t) and w(t) than the expansion on a rectangular, orthogonal lattice does.
6.3.2
Gabor's Signal Expansion on a Rectangular Lattice
We start with the usual Gabor expansion [1-5] on a rectangular time-frequency lattice, in which case a signal ~(t) can be expressed as a linear combination of properly shifted and modulated versions gmk(t) = g ( t - roT)exp(jkgtt) of a synthesis window g(t): oo
~(t) -
E m--
oo
E -- cx)
k-~
amk gmk(t).
(6.3.1)
-- (x3
The time step T and the frequency step ~ satisfy the relationship ~ T < 27r; note that the factor 27r/~T represents the degree of oversampling, and that in his original ~ M a r t i n J. B a s t i a a n s and A r n o J. v a n L e e s t , Technische Universiteit Eindhoven, Faculteit Elektrotechniek, Postbus 513, 5600 MB Eindhoven, Netherlands (M.J.Bastiaans@tue.nl). Reviewers: Joel M. Morris and Shie Qian.
Gabor's Signal Expansion for a Non-Orthogonal Sampling Geometry
253
paper [1] Gabor considered the case of critical sampling, i.e. ftT = 27r. The expansion coefficients amk follow from sampling the windowed Fourier transform with analysis window w(t), f_o~ ~ ( t ) w * ( t - r ) e x p ( - j w t ) d t , on the rectangular lattice (r = mT, co = kft):
amk -
~(t) w~k(t ) dr.
(6.3.2)
(x)
This relationship is known as the Gabor transform. The synthesis window g(t) and the analysis window w(t) are related to each other in such a way that their shifted and modulated versions constitute two sets of functions that are biorthogonal: oo
(N2)
~ m--
-- oo
k--
gmk(t~)w~k(t2) -- 5(t~ -- t2).
(6.3.3)
- - (:x:)
If the biorthogonality condition (6.3.3) is satisfied, the Gabor transform (6.3.2) and Gabor's signal expansion (6.3.1) form a transform pair in the following sense: if we start with an arbitrary signal ~(t) and determine its expansion coefficients a m k via the Gabor transform (6.3.2), the signal can be reconstructed via the Gabor expansion (6.3.1). The biorthogonality relation (6.3.3) leads immediately to the equivalent but simpler expression
ft
~
g ( t - mT) w* t -
roT+n--
- Sn,
(6.3.4)
m-~w(ND
where 5n is the Kronecker delta. In the case of critical sampling, i.e., f t T = 27r, the biorthogonality relation (6.3.4) reduces to oo
T ~
g ( t - r o T ) w * ( t - [m + n]T) - 5n
(6.3.5)
m=-oo
and the analysis window w(t) follows uniquely from a given synthesis window g(t), or vice versa. An elegant way to find the analysis window if the synthesis window is given, is presented in the next section.
6.3.3
Fourier Transform and Zak Transform
It is well known (see, for instance, [2-5]) that in the case of critical sampling, ftT = 2:r, Gabor's signal expansion (6.3.1) and the Gabor transform (6.3.2) can be transformed into product form. We therefore need the Fourier transform 8(t/T, a~/ft) of the two-dimensional array of Gabor coefficients amk, defined by oo
~(x, y) -
oo
~ ?Tt--- ~
E (X)
k--
~
amk exp[-j27r(my - kx)], (x)
(6.3.6)
254
Chapter 6:
Implementation and Realization of TFDs
and the Zak transforms ~(xT, 2~y/T; T), ~(xT, 2~y/T; T), and ~(xT, 2~y/T; T) of the signal ~(t) and the window functions g(t) and w(t), respectively, where the Zak transform f(t,w; T) of a function f(t) is defined as (see, for instance, [2,5]) oo
](t, W; T) = E
f (t + nT) exp(--jnTw).
(6.3.7)
n----O0
Note that the Fourier transform ~(x, y) is periodic in x and y with period 1, and that the Zak transform f(t, w; T) is periodic in w with period 2~/T and quasi-periodic in t with period T: f(t + roT, W + 2~k/~-; T) = f(t, W; T) exp(jmwT). Upon substituting from the Fourier transform (6.3.6) and the Zak transforms [cf. Eq. (6.3.7)] into Eqs. (6.3.1) and (6.3.2), it is not too difficult to show that Gabor's signal expansion (6.3.1) can be transformed into the product form
xT, y-~ ; T
- g(x, y) O xT, y-~ ; T
,
(6.3.8)
while the Gabor transform (6.3.2) can be transformed into the product form
g(x, y) - T ~ xT, y--~-; T
xT, y-~; T
.
(6.3.9)
In particular the product form (6.3.9) is useful for determining Gabor's expansion coefficients. Since a Zak transform is merely a Fourier transform [cf. Eq. (6.3.7)], the expansion coefficients can be determined by Fourier transformations and multiplications; and if things are formulated for discrete-time signals, we can use the fast Fourier transform to formulate a fast algorithm for the Gabor transform [3, 4]. The relationship between the Zak transforms of the analysis window w(t) and the synthesis window g(t) then follows from substituting from Eq. (6.3.9) into Eq. (6.3.8) and reads
T ~ xT, y--~-;T
zT, y ~ - ; T
= 1.
(6.3.10)
From the latter relationship we conclude that (the Zak transform of) the analysis window w(t) follows uniquely from (the Zak transform of) the given synthesis window g(t). In general, however, the unique analysis window w(t) has some very unattractive mathematical properties. We are therefore urged to consider Gabor's signal expansion on a denser lattice, in which case the analysis window is no longer unique. This enables us to choose an analysis window that is better suited to our purpose of determining Gabor's expansion coefficients.
6.3.4
Rational Oversampling
In the case of oversampling by a rational factor, 2r/FtT -- p/q > 1, with p and q relatively prime, positive integers, p > q _> 1, Gabor's expansion (6.3.1) and the
Gabor's Signal Expansion (or a Non-Orthogonal Sampling Geometry
255
Gabor transform (6.3.2) can be transformed into the sum-of-products forms [3, 4], el. Eqs. (6.3.8) and (6.3.9),
~as(x, y) - P
g,~(x, y)a~(x, y)
(s - O, 1, ..., q - 1)
(6.3.11)
q-1
a~(x,y) _ pT Zw~r* (x,y) qos(x,y) q s=o
( r - O , 1, . . . , p - I ) ,
(6.3.12)
respectively, where we have introduced the shorthand notations
a~(x, y) ~s(x, y) g~r(x, y) w~(x, y)
= = = =
a(x, y + r/p) ~((x + s)pT/q, 2Try~T; pT) ~7((x + s)pT/q, 27r(y + r/p)/T; T) (v((x + s)pT/q, 27r(y + rip)/T; T),
with 0 < x < 1 and s - 0,1, . . . , q 1 (and hence 0 < ( x + s ) / q < 1), and 0 < y < 1/p and r - 0, 1, . . . , p - 1 (and hence 0 < y + r/p < 1). The relationship between the Zak transforms of the analysis window w(t) and the synthesis window g(t) then follows from substituting from EQ. (6.3.12) into EQ. (6.3.11) and reads [cf. Eq. (6.3.10)] -T-
g~
(x,y)Ws2* r ( x ,
y) -
5~ 1 - - 8 2 '
( 6 . 3 13)
q r=0 with sl, s2 = 0, 1, . .., q - 1. The latter relationship represents a set of q2 equations for pq unknowns, which set is underdetermined since p > q, and we conclude that the analysis window does not follow uniquely from the synthesis window. After combining the p functions ar(x,y) into a p-dimensional column vector a(x, y), the q functions qas(x, y) into a q-dimensional column vector r y), and the q• functions gsr(x, y) and Wsr(X, y) into the q • matrices G(x, y) and W(x, y), respectively, the sum of products forms can be expressed as matrix-vector and matrix-matrix multiplications: r
y) -
1
-- G(x, y) a(x, y) P
a(x, y) = pT W* (x, y)r
(6.3.14) y)
(6.3.15)
Iq = -- G(x, y) W* (x, y), q
(6.3.16)
q T
where Iq denotes the q x q-dimensional identity matrix and where, as usual, the asterisk in connection with vectors and matrices denotes complex conjugation and transposition.
256
Chapter 6:
Implementation and Realization of TFDs
The latter relationship again represents q2 equations for pq unknowns, and the p • q matrix W* (x, y) cannot be found by a simple inversion of the q • p matrix G(x,y). An 'optimum' solution that is often used, is based on the generalized inverse and reads Wopt(X,y ) - (q/T) G*(x,y) [G(x,y) G*(x,y)] -1. This solution for W(x, y) is optimum in the sense that (i) it yields the analysis window w(t) with the lowest L 2 norm, (ii) it yields the Gabor coefficients amk with the lowest L 2 norm, and (iii) it yields the analysis window that - in an L 2 sense, a g a i n - best resembles the synthesis window. The 'optimum' solution gets better if the degree of oversampling p/q becomes higher. However, there is another way of finding a better solution, based on the structure of the lattice. If the lattice structure is adapted to the form of the window function as it is represented in the time-frequency domain, the 'optimum' solution will be better, even for a lower degree of oversampling. We will therefore consider the case of a non-orthogonal sampling geometry, but we will do that in such a way that we can relate this non-orthogonal sampling to orthogonal sampling. In that case we will still be able to use product forms of Gabor's expansion and the Gabor transform, and benefit from all the techniques that have been developed for them.
6.3.5 Non-OrthogonalSampling The rectangular (or orthogonal) lattice that we considered in the previous sections, where sampling occurred on the lattice points (7 = roT, w = k~), can be obtained by integer combinations of two orthogonal vectors iT, 0] t and [0, ~]t, see Fig. 6.3.1(a), which vectors constitute the lattice generator matrix
[0 o]
We now consider a time-frequency lattice that lattice is obtained by integer combinations of longer orthogonal vectors, which we express in with a, b, c and d integers, and which constitute
is no longer orthogonal. Such a two linearly independent, but no the forms [aT, c~] t and [bT, d~] t, the lattice generator matrix
[0 01[a Without loss of generality, we may assume that the integers a and b have no common divisors, and that the same holds for the integers c and d; possible common divisors can be absorbed in T and gt. Note that we only consider lattices that have samples on the time and frequency axes and that are therefore suitable for a discrete-time approach, as well. The area of a cell (a parallelogram) in the time-frequency plane, spanned by the two vectors [aT, c~] t and [bT, d~] t, is equal to the determinant of the lattice generator matrix, which determinant is equal to ~TD, with D = l a d - bc I. To be
Gabor's Signal Expansion for a Non-Orthogonal Sampling Geometry
9
rv
9
0
9
9~
9
o
; ft i9
o~
~
257
~.
9
9
0
O
9
o
9
9
t
o
o
T
(a)
0
o
o
o
w;
0
-,
(b)
T
v
F i g . 6.3.1" (a) A rectangular lattice with lattice vectors [T, 0] t and [0, ft] t, and thus R = 0 and D = 1" and (b) a hexagonal lattice with lattice vectors [T, ft] t and [0, 2~] t, and thus R = 1 and D = 2.
usable as a proper Gabor sampling lattice, this area should satisfy the condition
D < 27r/ftT. There are a lot of lattice generator matrices that generate the same lattice. We will use the one that is based on the Hermite normal form, unique for any lattice,
where R and D are relatively prime integers and 0 <_ IRI < D. Sampling then occurs on the lattice points (r - roT, w = [mR + nD]ft), and it is evident t h a t these points of the non-orthogonal lattice form a subset of the points (7- - rnT, w - kf~) of the orthogonal lattice. To be more specific: the non-orthogonal lattice is formed by those points of the rectangular (orthogonal) lattice for which k - m R is an integer multiple of D. Note that the original rectangular lattice arises for R - 0 and D - 1, see Fig. 6.3.1(a), and that a hexagonal lattice occurs for R - 1 and D - 2, see Fig. 6.3.1(b).
6.3.6
Gabor's Signal Expansion on a Non-Orthogonal Lattice
If we define the two-dimensional array %ink as oo
/~rnk-- E ~k-rnR-nD, n~--
(6.3.17)
(ND
Gabor's signal expansion on a non-orthogonal lattice [cf. Eq. (6.3.1)] oo
qD(t) --
Z m=--oo
oo
E k----oo
oo
%rnkarnk gmk(t) -- E m---oo
can be expressed
as
oo
E
a~k gmk(t)'
(6.3.18)
k---cx3
while - with a different analysis window w(t), t h o u g h ! - the expansion coefficients amk are still determined by the Gabor transform (6.3.2). Of course, since we only
Chapter 6:
258
Implementation and Realization of TFDs
need the limited array a mI k - - ) ~ m k amk - which is, in fact, a properly sampled version of the full array a m k - we need only calculate the coefficients a m k for those values of m and k for which k - mR is an integer multiple of D. We note that the Fourier ' is related to the Fourier transform 5(x, y) transform 5' (x , y) of the limited array amk of the full array a m k via the periodization relation D-1
d'(x,y)-~
E5
x-5,
y-
(6.3.19)
n--0
and thus
D-1
a~
,
~
n--0
a~ ~ - ~ , y - - f f
.
In the non-orthogonal case, the biorthogonality condition takes the form [cf. Eq. (6.3.3)] CO
CO
E
E
m----CO
k--CO
)~mkgmk(tl)Wmk(t2)=5(tl--t2)
(6.3.21)
and leads to the equivalent but simpler expression [cf. Eq. (6.3.4)]
27r Co ( [ 27r ) Df~ E g(t - mT) w* t - mT + n-D--~
exp
(.
nR)
327rm-~-
= 5n.
(6.3.22)
m=-CO
Note that for R = 0 and D = 1, for which we have a rectangular lattice [see Fig. 6.3.1(a)], Eq. (6.3.22) reduces to Eq. (6.3.4), and that for R = 1 and D = 2, for which we have a hexagonal lattice [see Fig. 6.3.1(b)], Eq. (6.3.22) takes the form CO
H
~ m--
g(t- ~T)~*
(t - [~ T + ~
(-1) m" = ~ .
(6.a.2a)
--CO
The biorthogonality condition expressed in terms of the Zak transforms of the window functions now takes the form, cf. Eq. (6.3.13), T
Dq
gs~,r
*
n z -
nR ,
-
5n~-
,
(6.3.24)
with sl, s2 = 0, 1, . . . , q - 1 and n = 0, 1, . . . , D-l, and allows an easy determination of the analysis window w(t) for a given synthesis window g(t). For R = 0 and D = 1, for instance, relation (6.3.24) reduces to Eq. (6.a.~a), while for R = 1, D = 2, q = 1, and p an even i n t e g e r - which corresponds to the integer (p/2-times) oversampled hexagonal c a s e - it reduces to
T -~
go,r(x,y)w*0,~-~p/2 (x, y)(-1) nr r--0
= 5n
(n = 0 , 1; p even),
(6.3.25)
Gabor's Signal Expansion for a Non-Orthogonal Sampling Geometry
259
from which the Zak transform ~v(t,w; T) and hence the window function w(t) can easily be determined. Since we have related Gabor's signal expansion on a non-orthogonal lattice to sampling on a denser but orthogonal lattice, followed by restriction to a sub-lattice that corresponds to the non-orthogonal lattice, we can still use all the techniques that are developed for rectangular lattices, in particular the technique of determining Gabor's expansion coefficients via the Zak transform, el. Eq. (6.3.12).
6.3.7
Summary and Conclusions
Gabor's signal expansion and the Gabor transform on a rectangular lattice have been introduced, along with the Fourier transform of the array of expansion coefficients and the Zak transforms of the signal and the window functions. Based on these Fourier and Zak transforms, the sum-of-products forms for the Gabor expansion and the Gabor transform, which hold in the rationally oversampled case, have been derived. We have then studied Gabor's signal expansion and the Gabor transform based on a non-orthogonal sampling geometry. We have done this by considering the non-orthogonal lattice as a sub-lattice of an orthogonal lattice. This procedure allows us to use all the formulas that hold for the orthogonal sampling geometry. In particular we can use the sum-of-products forms that hold in the case of a rationally oversampled rectangular lattice. We finally note that if everything remains to be based on a rectangular sampling geometry (as in Article 4.9), it will be easier to extend the theory of the Gabor scheme to higher-dimensional signals; see, for instance, [6], where the multidimensional case is treated for continuous-time as well as discrete-time signals.
References [1] D. Gabor, "Theory of communication," J. IEE, vol. 93(III), pp. 429-457, November 1946. [2] M. J. Bastiaans, "Gabor's signal expansion and its relation to sampling of the slidingwindow spectrum," in Advanced Topics in Shannon Sampling and Interpolation Theory (R. J. Marks II, ed.), pp. 1-35, New York: Springer, 1993. [3] M. J. Bastiaans, "Gabor's expansion and the Zak transform for continuous-time and discrete-time signals: Critical sampling and rational oversampling," Research Report 95E-295, Eindhoven University of Technology, Eindhoven, Netherlands, 1995. [4] M. J. Bastiaans and M. C. W. Geilen, "On the discrete Gabor transform and the discrete Zak transform," Signal Processing, vol. 49, pp. 151-166, March 1996. [5] H. G. Feichtinger and T. Strohmer, eds., Gabor Analysis and Algorithms: Theory and Applications. Berlin/Boston: Birkh/iuser, 1998. [6] A. J. van Leest, Non-separable Gabor schemes: Their design and implementation. PhD thesis, Eindhoven University of Technology, Eindhoven, Netherlands, 2001.
260
6.4
Chapter 6:
Implementation and Realization of TFDs
SPECTROGRAM DECOMPOSITIONS OF TIME-FREQUENCY DISTRIBUTIONS 0
Cohen's class of bilinear time-frequency distributions (or the quadratic class; see p. 68n) has attracted much attention over the past decade or so. While such TFDs have yielded a proliferation of methods and are becoming increasingly popular in applications, their progress has been slowed, at least in part, by the computational burdens which are generally much greater than for spectrograms or wavelet-based methods. This article demonstrates methods for greatly reducing the computational burden, by decomposing the TFDs into linear weighted sums of spectrograms. The computational efficiency comes from economizing the number of STFTs that must be computed to form these spectrograms. Surprisingly, one can reduce the number of STFTs to a small number by using Haar windows for the S T F T computation [1-3]. Finally, for certain kernels, special windows may be designed which can represent a T F D very well by using only three or four windows [3, 4]. It should be noted that several attempts have been made to decompose timefrequency distributions themselves, in order to isolate specific T F D components or to synthesize TFDs using several components. While this is an interesting and valuable topic, it is beyond the scope of this article, which deals specifically with decomposition of the kernel of the TFD.
6.4.1
Decomposition Based Approaches
Cunningham and Williams have shown that a T F D of Cohen's class can be expressed as a weighted sum of spectrograms, where the spectrogram windows are an orthonormal set [5, 6], and have given a rigorous base for further work. They were motivated by some ideas from White [7] and from Amin [8], who has continued his work in this area. The windows result from an eigensystem decomposition of the kernel. This approach has been used to approximate various TFDs [6]. A twist on this concept is to use windows which are shifted and scaled versions of each other [1-3]. In these studies, windows from the Haar basis set were used. It can be shown that any T F D can be represented by n + l Haar windows, where n = log2(N-1 ), and N is the number of sample points. Thus, a 257-point kernel can be represented by 9 Haar functions, one at each scale. It is important to note that the cross-spectrograms, i.e. spectrograms formed by STFTs using different windows, are used as well as the auto-spectrograms to form a basis for the TFD. The Reduced Interference Distribution [9-11] or RID is a well-defined set of Cohen's-class TFDs [12] that can be constructed using a simple set of constraints, while retaining a number of very desirable mathematical properties. Faster computation of this class of TFDs was a particular motivation for the development of the spectrogram decomposition methods discussed in this article. ~ W i l l i a m J. W i l l i a m s and Selin A v i y e n t e , Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor MI 48109, USA (wjw@eecs.umich.edu, saviyent@eecs.umich.edu). Reviewers: B. Boashash and P. Flandrin.
Spectrogram Decompositions of Time-Frequency Distributions
6.4.2
261
Decomposition of Time-Frequency Kernels
For a given time-frequency kernel, the first step in the spectrogram decomposition algorithm is to decompose the kernel in terms of spectrogram windows. The approach suggested by Cunningham and Williams used an eigensystem type approach to decompose the 2-D kernel expressed in time-lag (n, rn) as the outer product of a set of orthonormal basis functions. These basis functions become the windows of the spectrogram decomposition and the eigenvalues are the weightings of the spectrogram summations. Let C(n, k) be a T F D with a real, bounded kernel of interest; then it can be expressed as M
C(n, k) - ~ ~ ISTFT~(n, k)l 2 .
(6.4.1)
i=1
Denote the spectrogram windows as can be represented as
wi(n), and the signal as x(n); then the STFTs
N-1
STFTi(n,
k) - E x(m - n)wi(rn) e -j2'~mk/N.
(6.4.2)
rn--0
This is the Cunningham-Williams decomposition in a nutshell. The problem is that while the eigendecomposition gives an orthonormal set of windows, there is no orderly relationship between the windows as is the case for a Principal Component Analysis (PCA) or Karhunen-Lot?ve Transform (KLT) for a set of signals. This is the "best basis" for the kernel in the same sense as is the KLT for representation of a set of signals. There is no redundancy or relationship between windows that allows more efficient computation. The principal eigenvector does the best job of representing the kernel in terms of an outer product with itself, the next eigenvector does the next best job when its contribution is compared to the remaining others, and so on. Generally, the kernel can be adequately represented using a small number of the potential windows. However, a spectrogram must be computed for each window. In the present article we suggest methods which use specially designed windows, potentially increasing the speed and efficiency of representation.
6.4.3
Development of the Method
Time- and frequency-shift-covariant bilinear discrete T F R s are specified by a discrete kernel, and can be rewritten in the inner product form of TFRx(n, w; r
= E E x(n+nl)e-3w(n+nl)r nl
nl+n22 , n l - - n 2 )
[x(n-Jr-n2)e,-2w(n+n2)] *
n2
(6.4.3)
262
Chapter 6:
Implementation and Realization of TFDs
where S - n and M_~ are, respectively, the time and frequency shift operators o n ~2, the space of finite-energy discrete-time signals, and r is a bounded linear operator on ~2 [5,6]. The spectral representation of ~ may be used to express the TFR as a weighted sum of spectrograms or "projectograms". If the kernel is associated with a bounded, self-adjoint linear operator, then the kernel may be decomposed by an eigendecomposition such that one can represent the TFR as being composed of a finite series of spectrograms. The orthonormal windows forming the spectrograms are the eigenfunctions of the decomposition. The eigenvalues of the decomposition provide the weights for summing the set of spectrograms. The viewpoint may be taken that the projections of the signal on the eigenvectors of the kernel decomposition are then time and frequency shifted by the time and frequency shift operators, yielding, essentially, the STFT. The magnitude squared STFT is the "projectogram" or spectrogram associated with that particular window. A total of N windows are required to completely represent an N • N kernel. The spectrogram decomposition can more generally be expressed as TFRx(n, w; r
(6.4.4) N
nl
nl
N
I--1 k--1
Here, Wk and wl are the windows and Ak,z is the coefficient for each k, 1. Then, ~-'~,~wk(nl)z(n + nl)e -3~(n+n~) is recognized as STFTk(n,w) and the kernel is realized by the outer product ~--~/v=l~'~k=l g )~k,lWk(nl )wl(n2). By realizing that the spectrogram (cross and auto) is SPk,l (n, w) - STFTk(n, w) STFT~ (n, w)
(6.4.5)
one may write Eq. (6.4.4) as N
TFa=(n, w; r
= E
N
E
~k,lSPk,l(n, W).
(6.4.6)
/=1 k = l
6.4.4
Wigner Example
The kernel matrix for an 8 x 8 discrete Wigner matrix is 0 0 0 0 0 0 0 1
0 0 0 0 0 0 1 0
0 0 0 0 0 1 0 0
0 0 0 0 1 0 0 0
0 0 0 1 0 0 0 0
0 0 1 0 0 0 0 0
0 1 0 0 0 0 0 0
1 0 0 0 0 0 0 0
(6.4.7)
Spectrogram Decompositions of Time-Frequency Distributions
263
Using the Haar basis vectors
1
1 1
1 1
1 1
1 1
o 2 0 0 0
o -2 0 0 0
o 0 2 0 0
o 0 -2 0 0
1 -1
1 -1
1 -1
1 --1
o
o
o
o
0 0 2 0
0 0 -2 0
0 0 0 2
0 0 0 -2
(6.4.8)
one may obtain the projection of the kernel matrix on the outerproduct formed by all possible pairs of Haar basis vectors. The projection value is obtained by multiplying the kernel and the outerproduct matrices element by element and summing over all elements of the result. This provides the weighting A(k,l) for combining the spectrograms of Eq. (6.4.4). The array of weights is 1 0
0
0
-1 0 0 0 0 0 0
0 0 0 0 0 0
0
0 0 -1 0 0 0 0
0 -1 0 0 0 0 0
0
0 0 0 0 0 0 -1
0
0 0 0 0 0 -1 0
0
0 0 0 0 -1 0 0
0
-
0 0 0 -1 0 0 0 _
(6.4.9)
Generally, good T F R representation is possible using only a fraction of the windows required for full representation. The spectrogram itself, of course, requires only one term, since it has only one window in its decomposition. In general, N 2 cross-spectrograms are required, but for many kernels only a few weights are nonzero and other weights are small enough to set to zero. However, even better results may obtained by using a special set of orthogonal windows, one may represent the T F R using many fewer cross-spectrograms.
6.4.5
Optimum Orthogonal Windows
The kernel for the binomial T F D [11] is shown here for illustration for N = 9: 0 0
B
~.
0 0
0 0
0
~
0
0
0
0
0
~1
4 1~
0
1
6__ 16 3
0 0 4
0 0 0
0
0
0
0
~
3
0
0
0
0
~1
1
!16
!8
!4
!2
1
!2
8 1 4
16 1 8
1 16
0
4
16
8
3
t
t
0
0
0
0
0
0
~16
a8
t4
0
0
0
0
4__ 16
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
2
!8
1
(6.4.10)
264
Chapter 6:
Implementation and Realization of TFDs
The method used by Cunningham and Williams would require the decomposition Vs - BV
(6.4.11)
where ~ is a diagonal matrix of eigenvalues. The corresponding rows and columns of V form outer products which, weighted by the appropriate eigenvalues and summed, form the kernel B = V,~V'. (6.4.12) One would wish the eigenvalues to decline quickly, indicating that a truncated outerproduct reconstruction could suffice. Generally, it has been found that about 17 outerproducts are sufficient to well represent a 256 x 256 kernel matrix [5]. More efficiency can be gained taking advantages of symmetry. The upper right and lower left portions of B exhibit symmetry, so all of the kernel information is carried by the submatrix 0 1_. C
__
8 1 4 1 2
1
0
0
0
•16
0
0
0
a8 !
616 3
0 !
0 0
2 1 2
"
8 1 4
16 1 8
1 16
(6.4.13)
-
Much greater efficiency can be gained by rotating matrix C to form 0 D
1 2 1 4
1 8 1
1 2
1 4
2
8
3
4_
3
6--
0
0
!16
0
0
0
0
0
0
0
!
8
16
1 8
16
1 16
0
(6.4.14)
where the 1 in the center of the kernel matrix has been replaced by zero and a single companion matrix with the missing 1 (and zero elements otherwise) is created. These matrices can be augmented with additional zeros in their rows and columns to a size equal to the original kernel. Then, a third matrix can be formed from C by flipping C along the anti-diagonal. These three zero augmented matrices can be summed to form the original kernel. The eigenvectors of D can now be used as windows once the matrix is rotated back to its original position. However, the windows resulting from the eigenvectors now come in pairs such that the proper outerproducts are formed using a window and its time-reversed pair. Due to symmetry, only the upper right quadrant of B as represented by C, plus the impulse window which provides the 1 in the center, is required to compute the STFTs. Details about the construction of T F D S from the STFTs are given elsewhere [1-4].
Spectrogram Decompositions of Time-Frequency Distributions
265
Fig. 6.4.1. Comparison of results for dolphin clicks: (a) True Binomial TFD, (b) Time series, and (c) Approximated Binomial TFD using a 4-spectrogram approximation.
266
6.4.6
Chapter 6:
Implementation and Realization of TFDs
Kernel Decomposition Results
Computation of TFDs by this method is efficient and fast. An example (Fig. 6.4.1) is provided for a 129 x 129 binomial kernel matrix. Only four distinct windows plus the impulse window are required to provide a very good representation of a complex signal. The kernel reconstructed from the truncated outer product series has only about 7% residual error. The signal consists of a dolphin click followed by a time-shifted and scaled version of the click (2:1 compressed in time and normalized for energy) followed by a time-shifted and frequency-shifted version of the click. This signal has been used elsewhere to illustrate the time-shift, scale and frequency shift covariance of RIDs. Nine STFTs must be computed for this approximation. One additional trivial computation is required for the impulse window. Clearly, the true and approximated TFD results are very similar. If one looks closely it appears that the cross-term activity is decreased in the approximate result. This is consistent with previous observations that noise and cross-term activity are abated in the approximations. One could compute each of the STFTs required in parallel. The formation of the cross-spectrograms from the STFTs and the summation of those results would require additional hardware or software, but these operations involve only multiplication of S T F T points, multiplication of the resulting crossspectrograms by the required eigenvalue coefficients and finally, summation of the results. In addition, a number of signal processing "tricks" may be employed to speed up the computations due to symmetry, the realness of the T F D result, etc. Some TFDs that fall into the RID class may be approximated with a smaller number of windows. The Born-Jordan TFD requires only three distinct windows for a very nice representation. In fact, one may compute a legitimate RID using an impulse and one other distinct window [4]. However, this minimum window TFD may not have desirable representation properties in some cases. The windows derived by these methods have an interesting self-scaling property which accounts for the scale covariance being retained [13].
6.4.7
Summary and Conclusions
The liability of bilinear TFDs from Cohen's class in terms of computational burden can be overcome to a considerable degree by employing various means of spectrogram decomposition. In addition to fast computation, noise and cross-terms may be decreased in the approximated form. It is clear that such TFDs can now be computed very rapidly using parallel computation or dedicated hardware, bringing their use into a wider range of practical applications. The next article considers the computation of quadratic TFDs discretized by the method of Article 6.1.
References [1] W. J. Williams and J. C. O'Neill, "Decomposition of time-frequency distributions using scaled window spectrograms," in Proc. SPIE: Advanced Signal Processing Algorithms,
Spectrogram Decompositions of Time-Frequency Distributions
26?
vol. 2563, pp. 44-58, Soc. of Photo-optical Instrumentation Engineers, San Diego, CA, 10-12 July 1995. [2] W. J. Williams, T.-H. Sang, J. C. O'Neill, and E. J. Zalubas, "Wavelet windowed time-frequency distribution decompositions," in Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations VII, vol. 3162, pp. 149-160, Soc. of Photo-optical Instrumentation Engineers, San Diego, July-August 1997. [3] W. J. Williams, "Reduced interference time-frequency distributions: Scaled decompositions and interpretations," in Wavelet Transforms and Time-Frequency Signal Analysis (L. Debnath, ed.), ch. 12, Boston: Birkh~user, 2001. [4] W. J. Williams and S. Aviyente, "Optimum window time-frequency distribution decompositions," in Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, pp. 817821, Pacific Grove, CA, 1-4 November 1998. [5] G. S. Cunningham and W. J. Williams, "Kernel decomposition of time-frequency distributions," IEEE Trans. Signal Processing, vol. 42, pp. 1425-1442, June 1994. [6] G. S. Cunningham and W. J. Williams, "Fast implementations of generalized discrete time-frequency distributions," IEEE Trans. Signal Processing, vol. 42, pp. 1496-1508, June 1994. [7] L. B. White, "Transition kernels for bilinear time-frequency signal representations," IEEE Trans. Signal Processing, vol. 39, pp. 542-544, February 1991. [8] M. G. Amin, "Spectral decomposition of time-frequency distribution kernels," IEEE Trans. Signal Processing, vol. 42, pp. 1156-1165, May 1994. [9] H.-I. Choi and W. J. Williams, "Improved time-frequency representation of multicomponent signals using exponential kernels," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 37, pp. 862-871, June 1989. [10] J. Jeong and W. J. Williams, "Kernel design for reduced interference distributions," IEEE Trans. Signal Processing, vol. 40, pp. 402-412, February 1992. [11] W. J. Williams and J. Jeong, "Reduced interference time-frequency distributions," in Time-Frequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 3, pp. 74-97, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [12] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [13] W. J. Williams and S. Aviyente, "Minimal-window time-frequency distributions," in Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations IX, vol. 3807, pp. 446-457, Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 19-21 July 1999.
268
Chapter 6:
6.5
Implementation and Realization of TFDs
C O M P U T A T I O N OF DISCRETE Q U A D R A T I C TFDs ~
6.5.1
General Computational Procedure
Article 6.1 deals with definitions and properties of the discrete WVD (DWVD) and other discrete quadratic TFDs. It shows that the general discrete quadratic TFD of an analytic signal z[n] is
pz[n, k] - 2 E
E
Iml< M IPI<
G[p, m] z[n-p+m] z*[n-p-m] e -j2~km/M
(6.5.1)
P
= 2 D F_T {Gin, m] 9(z[n+m] z*[n-m]) } ; m e (M)
(6.5.2)
where the support dimensions of the kernel do not exceed M samples in the lag (m) direction and P samples in the time (n) direction, and (M) means any set of M consecutive integers; cf. [1, p. 444]. So the general procedure for evaluating such a TFD is: 1. Formation of the instantaneous autocorrelation function (IAF) 2. Discrete convolution in n (time) with the smoothing function G[n, m]; 3. Discrete Fourier transformation mapping m (lag) to k (frequency). For the DWVD, which has G[n, m ] - 5In], step 2 reduces to an identity transformation and may be omitted. The windowed DWVD has G[n, m ] - 5[n] g[m], so that step 2 reduces to multiplication of the IAF by g[m]. Some other quadratic TFDs, however, have special forms leading to computational procedures which are not degenerate cases of the above, and which may be simpler or faster. This article addresses some of the practical issues in computing quadratic TFDs of a real signal, examines various cases of the above procedure, and considers the spectrogram as one example of a special form leading to a simpler, faster algorithm.
6.5.2
Computation of the Analytic Signal
The usual definitions of quadratic TFDs, especially the WVD and the windowed WVD, assume an analytic signal in order to avoid interference terms between positive and negative frequencies. For computational purposes, an analytic signal also avoids the need for 2• oversampling prior to computation of the IAF (see Section 6.1.1 and ref. [2]). So, given a real signal s(t), we must first compute the analytic signal z(t) associated with s(t). The simplest method is the direct approach of filtering out the negative frequencies in the frequency domain. If a real signal s[n] is given for n - 0, 1, 2 , . . . , N - 1 and periodically extended with period N, where N is even (or is made even by zero-padding), the algorithm is: ~ B o u a l e m B o a s h a s h and G a v i n R. P u t l a n d , Signal Processing Research Centre, Queensland University of Technology, GPO Box 2434, Brisbane, Q 4001, Australia (b.boashash@qut.edu.au, g.putland@qut.edu.au). Reviewers: S. L. Marple, A. Reilly and V. Sucic.
269
Computation of Discrete Quadratic TFDs
1. Compute
S[k] =
DFT{s[n]} for k = 0 , 1 , . . . , N - I ;
2. Compute
z[k]3. Compute
z[n] =
S[k] 2s[k]
N for k - 0, T
0
otherwise;
k-1,2,
"~
---1 2
(6.5.3)
IDFT{Z[k]}, where I D F T { . . . } denotes the inverse DFT.
The treatment of the Nyquist term ( k = N/2) and the precise meaning of "analytic" for a discrete-time periodic signal are explained in [3]; these issues become significant if the signal has non-zero amplitude at the Nyquist frequency. Further details on implementation of TFDs, including computation of the analytic signal, are given in [4]. A time-domain algorithm for computing the analytic signal using FIR filters is described in [5].
6.5.3
Real-Time Computation of TFDs
The formula for the discrete quadratic T F D [Eq. (6.5.2)] involves the expression z[n+m] where m is allowed to be positive, together with z*[n-m] where m is allowed to be negative. The same applies to the DWVD
k] - 2 DEW
;
(N)
(6.5.4)
and the windowed DWVD
Wgz[n,k] =
2 DVT{g[m]
z[n+m] z*[n-m]} ; mc(M>
(6.5.5)
(both of these expressions are derived in Article 6.1). Both cases involve t i m e a d v a n c e d signals; for any value of n, the computation of the T F D involves signal values up to z[n+A], where A is some positive integer. In real-time computation, we cannot compute the T F D for time n until we know the signal values up to z[n+A]; thus A is the minimum l a t e n c y of the computation. In the case of the DWVD [EQ. (6.5.4)], the range of m for which the IAF can be non-zero is maximized when n is at the center of the time-support of the signal; so the latency reaches a peak of half the signal duration. For the windowed DWVD, the latency is limited to half the window duration. For the general discrete quadratic TFD, the latency is limited to half the sum of the dimensions of the G matrix. The latency of the analytic signal computation must be added to that of the T F D computation. In all cases a smaller value of M not only reduces latency but also produces shorter FFTs, hence shorter computational delays; but the cost is reduced frequency resolution. Latency is one of two measures of merit for real-time computation of TFDs. The other measure is throughput, which depends on the efficiency of the numerical algorithms. Eq. (6.5.2) can be written
pz[n, k] =
2 DFT m----* 9 k
{Rz[n, m]}
(6.5.6)
270
where
Chapter 6:
Implementation and Realization of TFDs
Rz[n, ml = a[n, m], (z[n+m] z*[n-m]). Similarly,
pz[n+ 1, k] =
2 DFT rr~--* k
{Rz[n+ 1, m]}.
(6.5.7)
The above two equations represent successive time-slices of the TFD. Multiplying the second equation by j, adding the result to the first equation and using the linearity of the DFT, we obtain
pz[n, k] + jpz[n+ 1, k]
-
2
DF_T{Rz[n, m] + jRz[n+ 1, m]}.
(6.5.8)
If the TFD is known a priori to be real, as it usually is, then Eq. (6.5.8) means that the successive time slices of the TFD are respectively the real and imaginary parts of the right-hand side, which involves only one F F T [6]. Thus the realness property can enhance efficiency by halving the required number of FFTs. It can also halve the storage requirement as it implies Hermitian symmetry in the smoothed IAF.
6.5.4
Computational Approximations for Discrete-Time Kernels
Table 6.5.1 reproduces the "G[n, m]" column of Table 6.1.2 (p. 240) and adds two special cases often found in the literature: B J1/2 denotes the Born-Jordan distribution with a = 1/2, while ZAM2 denotes the Zhao-Atlas-Marks distribution with a = 2. Some entries in the "Gin, m]" column of Table 6.5.1 call for continuous convolution prior to sampling. At best, the evaluation of such a convolution in the time-lag domain requires oversampling. At worst, it requires the numerical evaluation of an improper integral arising from a singularity in G(t, T). In either case, computational inefficiencies will arise unless the smoothing effect of the convolution can be approximated in some other way. The chosen approximations, shown in the right-hand column of Table 6.5.1, are explained below. In the case of the B-distribution, the sole purpose of the convolution is to avoid aliasing. Without the convolution, and for typical values of the parameter fl (e.g. = 0.01), the time-lag kernel would be a continuous function with a narrow slot at m = 0 caused by the factor {2m[~. This factor is approximately unity for small nonzero values of m. The convolution fills in the slot, so that the factor is approximately unity at m = 0 also. This effect can be approximated by replacing 12ml with [4m2+ 1] 1/2 , as is done in Table 6.5.1. In the case of ZAM distribution, for a suitable (unbounded) wire], the convolution also ensures that G[n, 0] = 5[n], which in turn verifies the TM property. Without the convolution, we would have GZAM[n, m] --
an w[m] rect(~-~)
--
{ w[m] 0
if lanl <_12ml otherwise.
(6.5.9)
This gives G[n, 0] = w[0]5[n], which verifies the TM property provided that w[0] = 1. Accordingly, Eq. (6.5.9) is used in Table 6.5.1, although other approximations are possible. For example, we could sacrifice the TM property in favor of
271
Computation of Discrete Quadratic TFDs
T a b l e 6.5.1: Computational approximations for time-lag kernels of selected discrete quadratic TFDs. In the "Distribution" column, subscripts indicate parameter values while the prefix "w-" means "windowed" by the function w[m]. For the spectrogram and w-Levin distributions, the window to is assumed to be real and even. The "O[n, m]" column shows the exact kernels required for the avoidance of aliasing in the Doppler-frequency domain. If G[n, m] cannot be computed as written, the "Approx." column shows the suggested computational approximation.
Approx.
Distribution WVD Levin BJ
BJ1/~
1 (~[n-~- m ] _~_1 (~[Tt-- m ]
[ 1 rect( ~ )] ** [ sinc n sinc m] 1
1 [4~m[+l
0
n
rect(~mm)] ** [ sinc n sinc m]
,~
Modified B
cosh- 2~ n n cosh- 2~ n
w-WVD
~[~]~[.~]
w-Levin
-~w[m] (5[n+m] + ~[n - m] )
ZAM
[w[m] r e c t ( ~ n )] 9 9 [ sinc n sinc m]
ZAM2
[w[m] rect (2--~) ] 9 9 [ sinc n sinc m]
Rihaczek
~[~ -
1 12ml+l 0
if 12n[ < 14aml + 1 otherwise. if Inl < Iml otherwise.
~ w[m] if lanl <_[2m[ ~ 0
otherwise.
w[m]
if Inl _~ Iml
0
otherwise.
m]
w-Rihaczek Page
a[~- I'~1]
CW
12ml exp[, am2 ) 9, [ sinc n sinc m] cosh2n
spectrogram
51n] 7ra (-Tr2an2) [,V4m2_bzr a e x p 4m2q_Tra
9 sinem
if m = 0 otherwise"
cosh 2 n
~[~+~] ~[~-~]
some s m o o t h i n g by using the a p p r o x i m a t i o n
1
[1
tanh(14rn I [2an[) ]
(6.5.10)
a n d we could salvage the T M p r o p e r t y by using a s e p a r a t e definition for rn - 0.
272
Chapter 6:
Implementation and Realization of TFDs
For the Born-Jordan (B J) and Choi-Williams (CW) distributions, the convolutions are needed to remove singularities at m - 0 and ensure that G[n, 0 ] - 5[n]. For the BJ distribution, we can remove the singularity and approximate the spreading in the [n, m] plane by replacing 14~rnl with 14~rnl + 1. The result is 1 rect (14arnl+l) GBJ [n, m] ..~ ]4c~rn[+l n ,
(6.5.11)
which is equivalent to the rule given in Table 6.5.1. For the CW distribution, a similar effect is obtained by replacing 12ml with [4m2+Tra]1/2 . This step, by itself, gives the kernel 4rrt2+Tra)
(6.5.12)
For n - r n - 0 , this reduces to G [ 0 , 0 ] - 1, which is consistent with the requirement that Gin, 0] - 5In]. However, for m - 0, Eq. (6.5.12) reduces to G[n, 0 ] - e -~n2 , which is only an approximation to 5[n]. Accordingly, a two-part definition of the kernel is used in the "Approx." column of the table. With n = 0, the kernel as defined in the table reduces to Gcw[0, rn] --
,u/ ~- r n--~ ,
(6.5.13)
which takes the value 1 at m = 0 and 1/v/2 at m = :t: V/Tra/4. For realistic values of a (e.g. a > 1), this gives a reasonable degree of smoothing in the m direction. An alternative approach to the problem of singularities, which is not pursued here, is to evaluate the kernels in the Doppler-lag [l, m] domain. This is efficient if we intend to evaluate the time-convolution by the F F T method, which also uses the [l, m] domain. But it is still an approximation (if the time-lag form of the kernel is taken as the definition) because the analytical formulae for continuous FTs of standard signals are only approximations when applied to the DFT.
6.5.5
Special Case: Direct Form of the Discrete Spectrogram
The short-time Fourier transform (STFT) of the continuous-time signal x(t) with real window w(t) is defined (in Section 2.3.1) as
F~ (t, f ) ~- .-~f~{X(T) W(T -- t)} =
--
e-J2~ft ~f {x(7 + t) W(T) }
x(~" + t)w(~')e -j2~:'d7.
e -j2~:t
(6.5.14) (6.5.15)
oo
It is shown in Chapter 2 that the spectrogram Szw (t, f), which is simply the squared magnitude of the STFT, can also be considered as a quadratic T F D with kernel w(t+ 2) w(t- ~). The discrete form of this kernel is w[n+rn] w[n- m]. Hence the discrete spectrogram can be conveniently evaluated using the general procedure described in Section 6.5.1 above. But it is simpler and more efficient to discretize the continuous spectrogram directly.
Computation of Discrete Quadratic TFDs
273
T h e o r e m 6.5.1" If the spectrogram S~ is modified by ideally sampling W(T) at
T-m/fs
(6.5.16)
where m is an integer and fs is the sampling rate, and if w ( T ) - 0 for iv ] -~f~> M
(6.5.17) A
where M is a positive integer, and if the modified TFD is denoted by S~, then Aw n
Sx (~' ~M-) -Proof/explanation:
[ m+n
m e-j27rkm/M
E x~ fs ) w(y~) Iml<M/2
2
.
(6.5.18)
W(T) is sampled, the integrand in Eq. (6.5.15) becomes
When
(x)
X(T + t)W(T)e -j2~l" E
5(T- ~)
(6.5.19)
TYt'----00
so that the S T F T becomes
Fw(t,f)-- e-J2"It
E
X(~ss +t) w(~)e-J2~rfm/f~
(6.5.20)
m'----00
By Eqs. (6.5.16) and (6.5.17), the s u m m a t i o n is restricted to [m[ < M/2, giving a m a x i m u m of M terms. 1 The sampling in ~- makes F w ( t , f) periodic in f with period f s , while the time-limiting in ~" gives a frequency resolution of M bins per period. So it is convenient to let
f-
k fs/M
(6.5.21)
where k is an integer. W i t h these restrictions, Eq. (6.5.20) becomes
~" kfs = e-J27rkfst/M E F~'(t,--M-)
m e--j27rkm/M. x ( Tm + t) W(Ts)
(6.5.22)
Iml<M/2
P u t t i n g t -- n/fs to match the quantization of T, then taking the squared magnitude of the discrete S T F T , we obtain Eq. (6.5.18). 9
With a change of notation, Eq. (6.5.18) becomes
S:[n, k]
(6.5.23)
-
Iml<M/2
I
This S~[n, k] is the d i s c r e t e s p e c t r o g r a m of the discrete-time signal x[n] with window w[m]. If the summand is extended periodically in m with period M (i.e. extended periodically in T with period M/fs), we obtain 2
S: [n, k] 1 M t e r m s for odd M" M - 1
~
x[m + n] w[m] e -j2~k'~/M
terms for even M.
(6.5.24)
Chapter 6:
274
Implementation and Realization of TFDs
where (M / denotes any set of M consecutive integers. 2 This may be written
S~[n,k] - I DF_T {x[m+n] w[m]}
2
; me(M}.
(6.5.25)
The time support of S~ In, k] is that of x[n] 9win], corresponding to x(t) 9w(t). If this has a duration not exceeding N samples, then the non-zero elements of the discrete spectrogram may be represented by an N • M real matrix. Only half of the M columns are needed for the non-negative frequencies, which are sufficient if x(t) is real. Eq. (6.5.23) involves x[n + m] where Iml < M/2 and M is the window length in samples. So, in real-time computations, the latency of the discrete spectrogram computed by this formula is half the window length.
6.5.6
Sample Code Fragments
In view of the current popularity of M A T L A B TM, w e illustrate this Article with some code fragments from the experimental MATLAB function t lkern.m, which computed all of the TFDs plotted in Article 5.7. The input parameters of the function specify the kernel in terms of a time-dependent factor gl In], a lag-dependent factor g2[m], and an "auxiliary factor" g3[n, m]. The overall time-lag kernel G[n, m] is then computed as
Gin, m] - g2(m](gl[n] * g3(n, m]) = (g2[m] gl [n]) * g3[n, m].
(6.5.26)
This scheme allows the computation of a wide variety of quadratic TFDs in under 320 lines of code, including exceptions for direct computation of the discrete spectrogram. 6.5.6.1
Example 1: MBD (General Algorithm)
For a separable kernel, the auxiliary factor would normally be omitted (i.e. taken as 5In, m]), while the time-dependent and lag-dependent factors would have input parameters specifying their types (e.g. Hamming or Hanning) and durations (in samples). Although the kernel of the modified B-distribution (MBD) is separable (see Article 5.7 and Table 6.5.1), its parameter ~ is not a duration. The MBD kernel is therefore specified using the factors -2~ n gl[n]- 5[n]; g 2 [ m ] - 1; g3[n,m] = E .cosh cosh_2~n.
(6.5.27)
Notice that the auxiliary factor is the complete kernel. To compute the MBD in Fig. 5.7.2(e) on p. 220, the function t l k e r n is called with the following significant parameters: 2For even M, the periodic extension is padded with a zero term.
275
Computation of Discrete Quadratic TFDs
s N tr tf if af ap
= = = = = = =
signal vector 1 2 8 = assumed period I = time resolution 'delta' = string specifying 91 [n] '1' = string specifying g2[rn] 'mb' = string specifying form of g3[n, m] 0.2 = auxiliary parameter (~).
All internal computations, including IAF generation, are designed to be valid for periodic signals. Therefore, to compute the IAF of a non-periodic signal such as the one in Fig. 5.7.2(e), the assumed period N must be at least twice the signal length to avoid wrap-around effects. Because the time support of the IAF is identical to that of the signal, the same value of N is also sufficient to avoid wrap-around in the subsequent convolution with gl [n]. The output is the real matrix t f d (1 : Mpad+ 1,1 : N s e l ) , whose dimensions Mpad+ 1 and Nsel are assigned early by the statements Mpad Ncut Nsel
= 2^ceil(log(2*M)/log(2)); = min(N,length(s)); = ceil(Ncut/tr);
Y. l a g - t o - f r e q u e n c y FFT length '/. d u r a t i o n of TF plot Y. no. traces in TF plot
where M is the support length of the kernel in the lag direction; in this case M has been set to l e n g t h ( s ) because of the constant "lag-dependent" factor. Preliminaries: The analytic signal is computed by the frequency-domain method. If N is even, the Nyquist term has MATLAB index N/2+l and the amplitude at that frequency is left unchanged [3]. If N is odd, there is no Nyquist term. The following code handles both cases: Noff = fix(N/2); z = fft (real (s) ,N) ; z(2:N-Noff) = 2*z(2:N-Noff); z(Noff+2:N) = O; z = ifft (z) ;
s t r u n c a t e d or p a d d e d p o s i t i v e frequencies negative f r e q u e n c i e s
For this kernel, the time-dependent factor g l and the lag-dependent factor g2 are computed by the statements gl(l:N) 9
.
g1(1) .
.
= O;
.
=
1;
.
g2(l:Mpad)
=
1
where " . . . " denotes one or more line(s) of control code, or code t h a t is skipped in this case. The auxiliary factor g3 (the whole kernel in this case) is computed by Moff= 9
.
g3(1 9
.
fix(M/2);
.
.
N,1 Mpad)
= O"
276
Chapter 6:
temp(l:N)
Implementation and Realization of TFDs
= O;
for n = - N o f f : N o f f
temp(l+rem(N+n,N))
=
end
temp = temp/sum(temp); for m = - M o l l : M o l l
(cosh(n))'(-2*ap)" ~ normalize
g3(:,l+rem(Mpad+m,Mpad))
end
= temp.';
where ap denotes the auxiliary parameter (~), and the remainder (rein) function causes high array indices to represent negative values of time and lag. Step 1--Formation of the IAF" The IAF matrix K(I" N, 1 "Mpad) is formed by for
n = I:N
for m = -Moff:Moff K(n,m)
= z(n+m)z'*(n-m),
K (n, l + r e m ( M p a d + m , M p a d ) )
end
with
corrected
indices:
= z(l+rem(2*N+n+m-I
,N) ) .conj ( z ( l + r e m ( 2 * N + n - m - i
,N) ) ) ;
end
where the "corrected" indices allow handling of periodic signals. Step 2mConvolution in time" The assembly of the time-lag kernel and the convolution in time with the IAF are performed together. The smoothed IAF is
Rz[n, m] - Kz[n, m] 9Gin, m] -- gz[n, m] 9(g2[m I gl[n] * g3[n, m]).
(6.5.28)
The above convolutions may be taken as circular if the assumed period is sufficiently long, in which case
Rz[n, m ] - IDFT { DF_T{Kz[n, m]} DFT {g3[n, m]} DFT {gl[n] g2[m]}}. (6.5.29) This is implemented by the following code, in which K(" ,mcorr) is initially the mth column of the IAF, but is overwritten by the mth column of the smoothed IAF: for m = - M o l l : M o l l mcorr = l+rem(Mpad+m,Mpad)
;
. . .
K(:,mcorr)
= ifft(fft(K(:,mcorr)).*fft(g3(:,mcorr)).,fft(gl.'.g2(mcorr)));
. . .
end
(The factor g2[m] could be taken outside the IDFT, but this would not improve the efficiency of the code because g2(mcorr) is a scalar.) The FFT method of convolution is useful in experimental code because of its generality, but is not necessarily the most efficient method, especially if one of the convolved sequences is short. Now we apply the time-resolution (tr): for n s e l Y. n s e l t h
= l:Nsel column
of r is s e l e c t e d
n = l+tr*(nsel-l) r(: ,nsel)
end
;
= K(n,:).';
r o w of K:
Computation of Discrete Quadratic TFDs
Step 3 - - D F T :
277
The final D F T (lag to frequency) is computed by
r = fit (r) ;
which, for the sake of generality, does not take advantage of realness.
Final adjustments:
The following code scales the T F D and repeats its first row (the zero-frequency row) so that the T F D spans a full cycle in the frequency domain: tfd
= [real(r) ;real(r(1,
:))] .,(Ncut/Nsel/Mpad) ;
The scaling ensures that the sum of the matrix elements is close to the signal energy regardless of the time resolution.
6.5.6.2
Example 2: Spectrogram (Special Case)
The spectrogram in Fig. 5.7.3(f) on p. 221 was computed by the same function tlkern. For the spectrogram, the parameters s, N and tr are the same as for the MBD, while t f is ignored. Other significant parameters are
i f - - ' r e e L ' - - string specifying type of window M -- 1 7 - window length (in samples) af - 'sg' - string calling for spectrogram. The output is t f d ( 1 :Mpad/2+l, 1 :Nsel), where the dimensions are assigned as for the MBD, except that the window duration M is read as an input parameter and n o t overwritten. The analytic signal is computed as for the MBD, although this is not strictly necessary for the spectrogram 9 The rectangular window is computed by Moff = f i x ( M / 2 ) " g 2 ( l : M p a d ) = 0; 9
.
.
for m = - M o l l - M o l l g2(l+rem(Mpad+m,Mpad)) end
= I;
The matrix K (i" N, 1 "Mpad) normally it is assigned differently:
represents the IAF, but for the spectrogram
for n = I:N
for m = - M o l l : M o l l Y. K(n,m) = z ( n + m ) g 2 ( m ) , w i t h c o r r e c t e d indices: K (n, l + r e m (Mpad+m, Mpad) ) = z (l+rem ( 2 * N + n + m - I, N) ) *g2 ( l + r e m (Mpad+m, Mpad) ) ; end end
The code that applies the time resolution and performs the final D F T (lag to frequency) is the same as for the MBD. But the final adjustment is different: tfd = (abs(r(l:Mpad/2+l,:))).'2.,(Ncut/Nsel/Mpad/sum(g2.'2));
Chapter 6:
278
Implementation and Realization of TFDs
The magnitude-squared operation alters the relationship between the window and the scaling of the TFD. Also note that the above step uses only half the columns of the Fourier-transformed r matrix, namely those corresponding to the non-negative frequencies. Efficiency could be further improved by exploiting the analytic signal to halve the sampling rate.
6.5.7
The TFSA package
The Time-Frequency Signal Analysis (TFSA) package is a set of functions developed over more than a decade at the Signal Processing Research Centre, Queensland University of Technology, for computing modulated signals, quadratic and polynomial TFDs, ambiguity functions, wavelet transforms and scalograms, and various estimates of instantaneous frequency. As this is a production package rather than an experimental package, computationally intensive functions are precompiled and optimized for efficiency, and an interactive user interface is added. The current version is distributed as a M:ATLAB toolbox, so that TFSA functions can be used with other computational and graphical functions of M:ATLAB. Further information is available at http://www.sprc.qut.edu.au/or http://www.eese.bee.qut.edu.au/research/spr/.
6.5.8
Summary and Conclusions
High-level programming languages with built-in FFT functions and matrix operations have made it possible to construct compact yet highly versatile functions for computing quadratic TFDs. Use of a common algorithm for all TFDs is convenient for the programmer. But, as illustrated by the direct form of the spectrogram, efficiency can sometimes be improved by using different algorithms in special cases.
References [1] B. Boashash, "Time-frequency signal analysis," in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: Prentice-Hall, 1991. [2] B. Boashash, "Note on the use of the Wigner distribution for time-frequency signal analysis," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 36, pp. 1518-1521, September 1988. [3] S. L. Marple Jr., "Computing the discrete-time "analytic" signal via FFT," IEEE Trans. Signal Processing, vol. 47, pp. 2600-2603, September 1999. [4] B. Boashash and A. Reilly, "Algorithms for time-frequency signal analysis," in TimeFrequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 7, pp. 163181, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [5] A. Reilly, G. Frazer, and B. Boashash, "Analytic signal generation--Tips and traps," IEEE Trans. Signal Processing, vol. 42, pp. 3241-3245, November 1994. [6] B. Boashash and P. J. Black, "An efficient real-time implementation of the Wigner-Ville distribution," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 35, pp. 1611-1618, November 1987.
Chapter 7
Measures, Performance Assessment and Enhancement This chapter describes a number of time-frequency methods specifically developed for performance enhancement for a given application. The adopted performance measures are defined using objective criteria. The topic is covered in five articles. Hyperbolic FM signals are well described by the Affine Wigner-Ville distribution, a method related to time-scale analysis and the wavelet transform (Article 7.1). A general procedure for enhancing the time-frequency resolution and readability of TFDs is the reassignment principle (7.2). Techniques for measuring the concentration of TFDs and for automatic optimization of their parameters may be based on entropy measures (7.3). Another approach defines a resolution performance measure using local measurements in the (t, f) domain, such as relative amplitudes of auto-terms and cross-terms (7.4). Finally, attempts to unify time-frequency, timescale, filter banks, wavelets and the discrete-time Gabor transform using product functions and cascaded frames may assist in the selection of the best-performing method for a given application (7.5).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
279
280
7.1 7.1.1
Chapter 7:
TIME-FREQUENCY GROUP. ~
Measures, Performance Assessment and Enhancement
ANALYSIS BASED ON T H E A F F I N E
Scale Transformations and their Constructive Role in the Time-Frequency Analysis of Real Signals
The notion of time-frequency analysis is very familiar from the experience of hearing perception and its practical interest in signal theory seems a common sense topic. However, at this level of generality, there is no indication for giving an analytic content to the subject and, in fact, numerous techniques have been proposed (see Chapters 2 and 5). The introduction of the affine group allows to overcome this lack of universality by adding a consistency condition on which a common consent is possible. At first it is necessary to recognize that signals describe the time evolution of a physical quantity (an acoustic pressure for example) and that the measurement of this quantity can be done only after a system of reference and a system of physical units have been chosen. In fact, depending on these choices, different descriptions (i.e. different signals) can be associated with the same physical phenomenon. Obviously the communication theory has to deal with the situation by interpreting equivalently the various options. We will show that the affne group is the right mathematical tool for managing this equivalence. In signal theory there is only one reference variable which is the time. Hence the changes of reference system are naturally interpreted as clock changes involving changes of time origin and changes of time unit. The analytical description of any change is thus given by a transformation of the form: t
~ t' = at + b,
(7.1.1)
where t and t' are the time variables and where a and b are real numbers with a > 0. The set of all such transformations constitutes the affine group. These transformations will also affect physical units (those which are derived from the time unit) by multiplying each of them by some power of the dilation a. As a result a change of clock of the form (7.1.1) will induce on a given signal s(t) a change of the form: s(t) ~ s'(t) = a r s ( a - l ( t - b)), (7.1.2) where r is a real exponent depending on the physical nature of the signal under study. The exponent r will be called index of (dimensional) conformity. The above remarks are only useful preliminaries before tackling the time-frequency problem. Actually time-frequency analysis is an alternative description in which signals are not characterized by their instantaneous values but by their form which is described in terms of modulations of amplitude and frequency. In practice, this description is summarized by a real function P ( t , f ) which gives an image of ~ J. Bertrand, LPTMC, University Paris VII, Case 7020, 75251 Paris Cedex 05, France (bertrand@ccr.jussieu.fr), and P. Bertrand (pibert@libertysurf.fr).
Time-Frequency Analysis Based on the Affine Group
281
the spreading of the signal in the time-frequency half-plane (f > 0). The basic problem of time-frequency analysis is to make this approach effective by setting up the correspondence between the physical signal s(t) and its time-frequency representation P(t, f). It is well-known that the operation cannot be linear and the usual method consists in defining P(t, f ) as an Hermitian functional on a Hilbert space whose elements are in direct relation with the physical signals. In affine theory this classical approach is followed, the specific point being the introduction of a Hilbert space where an irreducible unitary representation of the affine group does exist. Any real signal s(t) is entirely characterized by the positive frequency part of its Fourier transform, i.e. by the function:
Z(f) - Y(f)
F
e -je~ft s(t) dt,
(7.1.3)
(x)
where Y ( f ) is the Heaviside step function. In agreement with (7.1.2), a general affine transformation on signal s(t) acts on Z ( f ) according to:
Z(f)
, Z ' ( f ) - a r+l e -j2'~b:f Z ( a f ) ,
(7.1.4)
where r is the index of conformity of the physical signal. In contrast with (7.1.2), the transformation (7.1.4) corresponds to an irreducible representation of the affine group. Moreover, as can be verified, it conserves the norm defined by: IlZll 2 -
IZ(f)12 f 2~+~df.
~0(:X)
(7.1.5)
In the following, we will adopt this norm for which the representation (7.1.4) of the affine group is unitary. In a change of reference system, the time-frequency representation of a signal will also be changed. This change will depend on the change in the variables (cf. (7.1.1)) and on the meaning of the function P(t, f ) as a quantitative representation of the spreading of the signal in the time-frequency half-plane. This leads to introduce the general transformation:
P(t, f )
, P'(t, f ) - a q P ( a - l ( t - b),af),
(7.1.6)
where q is a real number which can be considered as an index of significance. The integral of P(t, f ) on the half-plane that is invariant by transformation (7.1.6) has the form:
' fq P(t, f ) d t df .
(7.1.7)
For the common choice q = 0, a probabilistic comprehension of the spreading is possible. However, in special applications, the choice of other values for q can be pertinent.
282
Chapter 7:
Measures, Performance Assessment and Enhancement
The occurrence of the dimensional indices r and q is a new feature that has been introduced by the consideration of scaling transformations. It did not appear in the case of the Wigner-Ville representation which is based, in an essential way, on translations in both time and frequency through the so-called Weyl-Heisenberg group. In fact, it can be seen directly that the invariance of the scalar product associated with the norm (7.1.5) and of the integral (7.1.7) under frequency translations is only possible if r = - 1 / 2 and q = 0 respectively.
7.1.2
Tomographic Derivation of the Affine Wigner Function
Tomographic methods are now of routine use in signal analysis. Their emergence in time-frequency analysis is related to the recognition that the general marginals of the well-known Wigner-Ville function along straight lines of arbitrary slope, are positive and easily interpretable in terms of the signal. An immediate result has been the construction of the Wigner-Ville function by relying only on the Heisenberg group which performs time and frequency translations [1,2]. Practicalapplications have arisen in the domain of quantum optics, where the method of tomography [3] allows to determine precisely the state of a system from experimental data [4]. More recently, there has been a renewed interest for the so-called fractional Fourier transform arising naturally in the expression for the Radon transform of the WignerVille function (see Articles 4.8 and 5.8). In fact, the relation between tomography and Wigner function is not accidental but is connected in an essential way to the underlying group. In signal analysis, the straight lines of a given direction in the time-frequency plane arise as families of curves invariant by a subgroup of the translations group. In the affine case, a tomography can be set up in an analogous way by considering all the subgroups which are acting as dilations around a fixed time. This will be done now. The linear chirps play a major role in usual tomography since they are associated with straight lines in the time-frequency plane. They form families of signals which are characterized by their invariance, up to phase, under the subgroup of translations in a given direction. In the affine case, the same role is played by signals that are invariant (up to a phase) in dilations centered at the instant t = ~. These are the hyperbolic chirps defined by: r
f-J2~'/3-r-le-j2~~f, /~ real.
(7.1.8)
It can be verified t h a t such signals conserve the same form in the transformation of type (7.1.4) submitted to the constraint: b = ~(1 - a).
(7.1.9)
These transformations are just those of the subgroup of dilations of origin t = ~. A signal with index of conformity r, can be decomposed on the r fixed, and the coefficients of the development are equal to:
Z(f),
(Z,g,~) - fo~176 fJ2~Z+~eJ2=r df,
(7.1.10)
283
Time-Frequency Analysis Based on the Affine Group
where the scalar product is defined from (7.1.5). The expression (7.1.10) is a general Mellin transform whose inverse is easily obtained. The group delays of the hyperbolic chirps (7.1.8) at frequency f are found to be
t = ~ + fl/f,
fl real.
(7.1.11)
They correspond to curves in the time-frequency half plane (t, f), f > 0, that are invariant by dilation (7.1.9). Let P q ( t , f ; Z ) denote the time-frequency representation of signal Z(f) that is under construction. The tomographic condition relates the integral of Pq(t, f) along hyperbolas (7.1.11) to the coefficients (7.1.10) of signal Z ( f ) on the basis of hyperbolic chirps. It takes the form:
Pq(t, f; Z)5((t - ~)f - Z) fq dt df -
\Z, r
2,
(7.1.12)
O0
where each member is separately invariant by affine transformations. Considered for all real values of fl and ~, this relation has the form of a generalized Radon transform for Pq(t, f). Its inversion gives the expression of the A]fine Wigner Function [1,5]:
Pq(t, f; Z) -- f 2 r - q + 2
cj2ruft O0
fueU/2 • Z
Z*
2 si--~]2)
fue-U/2
u
2 sinh(u/2)
2 sinh(u/2)
du. (7.1.13)
The integrand can be seen to depend only on the functions A(u) and A(-u) where:
ue~/2
A ( u ) - 2sinh(u/2)"
(7.1.14)
In formula (7.1.13), the index of significance q can be chosen according to the type of time-frequency representation needed, but r is necessarily equal to the index of conformity attached to Z(f). When an affine change is performed on signal Z(f) according to (7.1.4), the function (7.1.13) is clearly transformed as in (7.1.6). Hence, the Affine Wigner Function verifies the condition:
Pq(t, f; Z') = a q Pq(a-l (t - b), a f ; Z),
(7.1.15)
where Z'(f) is defined in (7.1.4). This property of invariance of the correspondence between a signal Z(f) and its time-frequency representation Pq(t,f) is essential for an analysis founded on the affine group. However, it does not characterize univocally the Affine Wigner Function. In fact, an infinite family of functions satisfying condition (7.1.15) can be determined: They form the affine analog of Cohen's class. 1 But the Affine Wigner Function (7.1.13) stands out as the only one 1 T h a t is, the q u a d r a t i c class; see p. 68n.
284
Chapter 7:
Measures, Performance Assessment and Enhancement
with a tomographic construction based on the affine group alone. In this sense, it occupies the same position in the affine class as the usual Wigner-Ville function does in Cohen's class.
7.1.3
Discussion of the Result in terms of Broad-Band Corrections for the Usual Wigner Function
It is instructive to start using the Affine Wigner Function (7.1.13) for the analysis of narrow-band signals. In that case, the factor f2r-q+l is approximately constant, with f replaced by the central frequency f0 of the band. Moreover, the integrand in (7.1.13) is different from zero only when the arguments of Z and Z* are both in the band. This requires for u to stay in the vicinity of the value u = 0. In fact, it can be observed that the extension of the domain of u will decrease along with the bandwidth of the signal. This allows to replace the function A(u) by its first order development about u = 0: )~(u) "~ 1 + u/2, (7.1.16) when analyzing narrow-band signals. A change of variables from u to v = u f then leads to the approximate form:
l Pq(t, f) ..~ r2r-q+ Jo
/]
v ej2~vt Z f + -~ Z* ( f - -~) dv.
(7.1.17)
oo
Thus for narrow-band signals, the expression of the Wigner-Ville function appears as a simplified expression of the Affine Wigner Function. In the general case, it remains interesting to study the properties of the function (7.1.13) in relation with those of the usual Wigner-Ville function. The total integral of Pq(t, f) over the time-frequency space is now written in the invariant form:
//o
~ Pq(t, f; Z) fq dt df - IIzll 2,
(7.1.18)
oo
where IlZll is defined in (7.1.5). Integrating the distribution on the time only, we get:
f/
Pq(t, f; Z)dt - f2r+l-q IZ(f)I2.
(7.1.19)
oo
It can be observed that it is only for the choice q = 2r + 1 that the familiar expression is exactly recovered. Since Pq(t, f) represents a spreading of the signal in the time-frequency plane, it can be used to compute the mean value of the epoch t for a fixed value of the frequency. The resulting expression, whatever the value of q, is the usual group delay of the signal:
)(C
)1
Time-Frequency Analysis Based on the Affine Group
285
where the phase O(f) of the analytic signal Z(f) is defined by:
Z(f) - [Z(f)[ eye(f).
(7.1.21)
A domain where it is particularly important to be careful with the indices r and q is in the study of the effect of transducing filters on the time-frequency representations. Suppose a signal Z(f), having an index of conformity equal to r, passes through a stationary linear device whose action changes its physical nature. The emerging signal Z(f) can be written as:
Z(f) -- T(f) Z(f),
(7.1.22)
where the function T(f) characterizes the device. In general, Z(f) will transform under dilations with an index of conformity ~ different from r. Hence, for consistency, the function T(f) must be assigned an index a determined by relation (7.1.22) as: a - ~ - r - 1. (7.1.23) Notice that the identity filter corresponds here to a - - 1 . These operations have a counterpart in the domain of time-frequency functions. Let P(q)(t, f; Z), P~(~)(t, ,f" T) and P~)(t, f; Z) be the representations corresponding to functions Z(f), T(f) and Z(f) respectively. The dependence of the representations on the indices of conformity of the signals has been shown explicitly, to avoid ambiguity. The time-frequency analog of relation (7.1.22) has the form of a convolution in time provided the indices are related according to: 0 - q + ~ + 1.
(7.1.24)
In practical situations, where ~ - q and 7 - - 1 , the relation is: P(q~)(t, f; 2) -
/?
P ( ~ (t - t', f; T) P(q)(t', f; Z) dt',
(7.1.25)
OO
where the index a is given by (7.1.23). Consider now the case of a transducer defined by:
T(I) - f-h, h real,
(7.1.26)
with conformity index a - h - 1. The only action of such a device is to replace the signal Z(f) with index of conformity r by the signal Z'(I) - f - h Z(I) with index - r + h. Relation (7.1.25) now becomes:
P~+h) (t, f; I-UZ) - P~) (t, f; Z),
(7.1.27)
an identity that can also be verified directly on the expression (7.1.13). Thus, once q has been chosen, computing Pq(t, f) for a signal Z(f) or for any of its transforms (as defined by (7.1.22) and (7.1.26)) yields the same result, provided care is taken to use the correct conformity indices. In the present context, the property leads to consider the device defined by (7.1.26) as a perfect transducer.
286
7.1.4
Chapter 7:
Measures, Performance Assessment and Enhancement
Hyperbolic Chirps and Affine Group Extension
The properties of marginalization, localization and extended invariance that are well-known for the Wigner-Ville function have direct analogs in the present case. The marginal condition leading to a density in f is derived from relation (7.1.19). In addition, marginal densities in fl are obtained for each value of ~ when integrating Pq(t, f) along hyperbolas ( t - ~)f = 13. From the tomographic condition (7.1.12) it results that these densities have the form: p~:(,3) - I(Z, r
(7.1.28)
and verify:
/;
p((#) d#
IIZII
(7.1.29)
OO
Localization in the time-frequency space arises for general hyperbolic chirps. For these signals, a direct computation gives:
Pq(t, f; r
- f - q 6((t - ~)f - 13).
(7.1.30)
The special case /7 -- 0 corresponds to a localized signal r - f-r-le-J2~(I attached to the instant t = ~. The latter form can be obtained directly by requiring that after a clock change, labeled by (a, b) and acting as in (7.1.4), the signal keeps a localized form at the transformed instant t t = a~ + b. Another case of localization, that can be considered as a limit of the hyperbolic chirp behavior, concerns the pure frequency signal with index of conformity r:
Zfo(f) = f - r 6(f - fo)
(7.1.31)
Pq(t, f; Zyo) = f l - q 6(f - fo).
(7.1.32)
which is represented by:
The property of invariance, under afIine transformations, of the correspondence between a signal Z(f) and its representation Pq(t, f; Z) can be extended. To this end, we introduce the transformations performing a translation on the/~-parameter of the hyperbolic chirps according to:
d2~(f)
>r
= f-j2~(#+c)-r-1 e-J2~I
(7.1.33)
where c is a real number. These transformations act on an arbitrary signal as:
Z(f)
>Zc(f) = f-j2~c Z(f).
(7.1.34)
Moreover, they combine with the affine transformations to form a three-parameter group Go which is is the largest group conserving the family of hyperbolas (7.1.11)
287
Time-Frequency Analysis Based on the Affine Group
as a whole and the family of hyperbolic chirps. labeled by (a, b, c), act on a signal according to:
Z(f)
These general transformations,
' Za,b,c(f) -- at-hie-j2rr(bf-t-clnf) Z(f),
(7.1.35)
and on its Affine Wigner Function as:
Pq(t,f;Z)
, Pq(t,f;Za,b,c) -- cap q ( a - l ( t - b - c f - 1 ) , a f ; Z ) .
Thus the correspondence between the signal Z(f) and the function invariant not only by the affine group but by its extension Go.
7.1.5
(7.1.36)
Pq(t, f; Z)
is
Unitarity Property and Some of its Consequences
The Affine Wigner Function (7.1.13) verifies the so-called unitarity (or Moyal) property:
// foP(t,f; Z)Pq(t, f; Z') f2q dtdf
[(z,z')l
(7.1.37)
(3O
where the scalar product (Z,Z'), issued from definition (7.1.5), depends on the index r of the signal. A special case of relation (7.1.37) is obtained when Z'(f) is the hyperbolic chirp r (cf. (7.1.8)) so that Pq(t, f; Z') has the form (7.1.30). The result is just the tomographic condition which was introduced in (7.1.12). A more general form of the unitarity property (see [6], formulas (III.15-17)) finds a direct application in the reconstruction of the signal from its affine Wigner function. In fact, it allows to write directly the formula:
(flf2)-2r-l f f 2r+2+qPq(t, f; Z) ej2ruft
Z(fl) Z*(f2)
(7.1.38)
J
5 ( f l -rue-U~2 2sinh(u/2)) 5 (f2-2sinh(u/)2) fueU/2 u (2sinh(u/2))
2r+2
du dt df ,
where, as usual, the symbols 5 hold for Dirac distributions. The knowledge of p(fl,f2) = Z(fl)Z*(f2) allows to reconstruct the analytic signal up to a constant phase. Explicitly, we have:
IZ(f)[
-
v/p(f, f),
ei(0,_02) =
p(fl, ]'2) p(f2, f2)
p(fl, fl)
'
(7.1.39)
where the decomposition (7.1.21) of Z(f) has been used. This result shows that the affine Wigner function (7.1.13) is a representation of the signal which does not discard any information but a constant phase. The unitarity relation can also be used to define a regularized version Pq(t, f) of Pq(t, f). To this end, a basic function (I)(f) is chosen and its representation
288
Chapter 7:
Measures, Performance Assessment and Enhancement
Pq(t, f; ~) written down. The smoothed version Pq(t, f; Z) of the representation Pq(t, f; Z) is then defined by the convolution on the affine group" Pq(t, f; Z) - f-q
F/o
Pq(t', f'; Z) Pq(f(t' - t), f'/f; ~) f,2q dt' dr', (7.1.40)
O0
where the kernel is the time-frequency representation of function ~. A more practical form of the smoothed function Pq is obtained when taking into account the transformation law (7.1.15) of Pq(t, f; ~) and the unitarity property (7.1.37)" /~(t, f; Z) - I(Z, ~(t,/))12 ,
(7.1.41)
where O(t,/)(if) is defined according to:
r
- f -r-le-j2~/'t @(if~f).
(7.1.42)
The set of functions O(t,/)(if) is recognized as a family of wavelets obtained from the mother wavelet O(ff) by an affne transformation such that a = f - l , b = t. Thus the right-hand side of relation (7.1.41) is the square modulus of the wavelet coefficient of Z(f). Conversely, the above developments allow to interpret the square modulus of the wavelet coefficient as the result of a smoothing in the time-frequency half-plane. In particular, this provides a guide to assess the properties of a mother wavelet. For example, the function (I)(f) may be chosen so that its representation Pq(t,f; r has the best possible concentration in the time-frequency plane. An optimal choice for (I)(f) is found to be the Zlauder wavelet [5]. It can be written using as parameters the mean frequency f0 = < f >, the mean square deviation / _ < f2 > _f3, and the relative bandwidth p = a / / f 0 . The explicit expression of that wavelet is:
(~(f)--K(P)for-1 (f )
-r-l+
1
2p2 e x p (
2p21 .t0f)
(7.1.43)
where K(p) is a normalization constant. The function (7.1.43) has a time-frequency representation approximately localized in the vicinity of point t = 0, f = f0. By varying a/, it is possible to make it spread along one or the other of the time and frequency directions, without changing the point it is attached to.
7.1.6
Summary and Conclusions
The group of affine transformations on the time, or clock changes, has been introduced in an attempt to perform an analysis of real signals that is independent of the system of reference and of the system of units employed. The approach has proved operational, leading to an adapted time-frequency representation, the
Time-Frequency Analysis Based on the Affine Group
289
Aj~fine Wigner Function, that has many properties similar to those of the usual Wigner-Ville function. The actions of time dilations on signals and time-frequency distributions are respectively characterized by two indexes r and q which are real numbers. The index of conformityr depends on the physical origin of the signal and controls its behavior in a change of time units. The index of significance q, attached to the time-frequency representation, can be chosen according to the kind of description we require. Both indexes r and q are special features coming from the introduction of dilations and allowing to express properly their effects. Those indexes are particularly important in the time-frequency analysis of problems involving transductions. They can be overlooked only in the limit of narrow relative bandwidth, in which case the usual Wigner-Ville representation appears as a universal approximation. There are several ways to build a time-frequency analysis satisfying the above constraints of independence relatively to clock changes. They result in an affine analog of Cohen's class. However, what is called the Affine Wigner Function in the present work stands out as the unique time-frequency distribution obtained by a tomographic method fitted to the affine group. This distribution is unitary, gives a realistic description of the hyperbolic chirps and does not degrade the information contained in the original signal, except for a constant phase factor.
References [1] J. Bertrand and P. Bertrand, "Repr@sentations temps-fr@quence des signaux," Comptes rendus de l'Acaddmie des Sciences, Paris, vol. 299, Ser. 1, pp. 635-638, 1984. [2] J. Bertrand and P. Bertrand, "A tomographic approach to Wigner's function," Foundations of Physics, vol. 17, pp. 397-405, 1987. [3] K. Vogel and H. Risken, "Determination of quasiprobability distributions in terms of probability distributions for the rotated quadrature phase," Physical Review A, vol. 40, pp. 2847-2849, 1989. [4] D. T. Smithey, M. Beck, M. G. Raymer, and A. Faridani, "Measurement of the Wigner distribution and the density matrix of a light mode using optical homodyne tomography: Application to squeezed states and the vacuum," Physical Review Letters, vol. 70, pp. 1244-1247, 1993. [5] J. Bertrand and P. Bertrand, "Affine time-frequency distributions," in Time-Frequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 5, pp. 118-140, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [6] J. Bertrand and P. Bertrand, "Symbolic calculus on the time-frequency half-plane," J. of Mathematical Physics, vol. 39, pp. 4071-4090, August 1998.
290
Chapter 7:
Measures, Performance Assessment and Enhancement
7.2
TIME-FREQUENCY REASSIGNMENT~
Time-frequency and time-scale representations [1] aim to extract relevant information from a signal by representing it over a two-dimensional plane. These tools have been extensively studied in the past twenty years, resulting today in many useful analysis methods. Among them, the spectrogram and the smoothed vetsions of the Wigner-Ville distribution [1] are probably the most widely used, but their applicability is limited by localization trade-offs, which may be troublesome in some applications. For the spectrogram, a shorter analysis window yields a better time resolution and henceforth a poorer frequency resolution, as a consequence of the Gabor-Heisenberg inequality [1]. For the smoothed versions of the Wigner-Ville distribution, a larger smoothing kernel yields reduced cross-terms, but also a poorer localization of the signal components (see "Article 4.2"). These shortcomings must be overcome in order to obtain time-frequency "pictures" that can be both easily read by non-experts and easily included in a signal processing application. This is exactly what the reassignment principle has been devised for. Initially introduced in 1976 by Kodera, Gendrin and de Villedary [2], this idea first remained little known and rarely used. But recently, advances obtained during the eighties in time-frequency analysis have made its rebirth possible [3], which considerably extended its applicability, both conceptually and computationally.
7.2.1
Basic Principle
For a sake of simplicity, we will first present the basics of reassignment in the case of the spectrogram, which was the only case considered by Kodera et al [2]. Its application to other representations will be discussed afterwards. The spectrogram, which is the squared modulus of the short-time Fourier transform -
IF (t, f)l 2 ,
Fh (t, f) - / x(u) h* (t - u)e -i2~/~ du
(7.2.,) (7.2.2)
can also be expressed as a two-dimensional smoothing of the Wigner-Ville distribution [1]
sh(t,f)
-
././ Wx(u,u) W h ( t - u , f - u)dudu.
(7.2.3)
In these expressions, t and f are respectively the time and frequency running variables, x(t) is the analyzed signal, and h(t) is the analyzing window. All integrals ~ Franqois A u g e r , GE44-IUT de Saint Nazaire, Bd de l'Universit@, BP 406, 44602 Saint Nazaire Cedex, France (auger@iutsn.univ-nantes.fr, auger@ge44.univ-nantes.fr); P. F l a n drin, l~cole Normale Sup@rieure de Lyon, Labo. de Physique, UMR 5672 CNRS, 46 all@e d'Italie, 69364 Lyon Cedex 07, France (flandrin@ens-lyon.fr); E. C h a s s a n d e - M o t t i n , Max Planck Institut ffir Gravitationphysik, Albert Einstein Institut, Am Miihlenberg, 1, D-14424 Golm, Germany (new email: ecm@obs-nice.fr). Reviewers: S. Barbarossa, R. Baraniuk, F. Hlawatsch and A. Papandreou-Suppappola.
Time-Frequency Reassignment
291
have integration bounds running from - o o to +cx~. The latter expression shows explicitly that the value of the spectrogram at a given point (t, f) is a weighted sum of all the Wigner-Ville distribution values at the neighboring points ( t - u, f - u). The number sh(t, f) is therefore the sum of a whole energy distribution located around its geometrical center (t, f). Reasoning with a mechanical analogy, the situation is as if the total mass of an object was assigned to its geometrical center, an arbitrary point which except in the very specific case of an homogeneous distribution, has no reason to suit the actual distribution. A much more meaningful choice is to assign the total mass of an object as well as the spectrogram value sh(t, f) to the center of gravity of their respective distribution. This is exactly what the reassignment performs : at each time-frequency point (t, f) where a spectrogram value is computed, we also compute the coordinates (t', f) of the local centroid of the Wigner-Ville distribution Wx, as seen through the time-frequency window Wh centered at (t, f):
tx(t,f) = S)(t,f)
uW~(u,u) W h ( t - u , f - u ) d u d u
(7.2.4)
f~(t,f) -
uW~(u,u) W h ( t - u , f - u ) d u d , .
(7.2.5)
S)(t,f)
Then, the spectrogram value S)(t, f)is moved from to define the reassigned spectrogram as
(t, f)
to (t, f). This leads us
(7.2.6) Originally, the reassignment operators t and f have been equivalently related to the phase of the STFT, an information which is generally discarded when computing the spectrogram: A
tx(t, f) --
1 07) (t, f)
27r Of
07) (t, f) . fAx (t, f) - f + -~1 -~
(7.2.7) (7.2.8)
with 7)(t, f) = arg Fh(t, f). These expressions may be interpreted respectively as the local group delay and the local instantaneous frequency of the signal observed inside the time-frequency domain imposed by the analysis window h. But it has been shown in [3] that a much more efficient implementation is possible thanks to a third expression involving two additional S T F T s with particular analysis windows :
t~(t, f ) =
t - ~ { Fth(t' f) } Fzh(t, f) '
fz(t, f) = f + ~
{ Fazh/at(t'f) } 27rFzh(t' f)
(7.2.9)
(7.2.10)
292
Chapter 7:
Measures, Performance Assessment and Enhancement
As presented here, the reassignment principle can be used with a large number of distributions, beyond the spectrogram case. For example, if the WVD of the short-time window h(t) in eq. (7.2.3)is replaced by an arbitrary (low-pass) kernel H ( u , , ) , one recognizes the general form of the quadratic time-frequency energy distributions that are covariant under time and frequency shifts, referred to as the Cohen's class 1 [1]: p~ (t, f) - / / W x
(u, .) II(t - u, f - .) du dr.
(7.2.11)
The local centroids are then given by
A
1//
W~(u,~) n(t - u, f - , ) d u dr
tx(t, f) -
Px(nt , f )
u
fx(t, f ) -
Px(nt , f )
,Wx(u,,)II(t-
u, f -
,)dud,.
(7.2.12) (7.2.13)
and the corresponding reassigned distribution becomes (7.2.14) From a theoretical point of view, this reassigned representation is no longer bilinear, but it still remains an energy distribution covariant under time and frequency shifts. One of the most important properties of the reassignment principle is that the application of the reassignment process defined by eqs. (7.2.12), (7.2.13) and (7.2.14) to any distribution of the Cohen's class yields perfectly localized distributions for chirp signals, frequency tones and impulses, since the WVD does so, and since the centroid of a linear distribution necessary lies on the line. When applied to multicomponent signals, reassignment improves readability by overcoming m to a certain extent the usual trade-off between cross-term level and localization: the underlying smoothing of the standard distribution guarantees some cross-term reduction, whereas reassignment acts as a squeezing that re-focuses the signal terms that had been spread out by smoothing (see Figure 7.2.1). Among the examples of Cohen's class members studied in [3], the case of the smoothed pseudo Wigner-Ville distribution yields a very versatile signal analysis tool, with independently adjustable time and frequency smoothings:
SPWV~'h(t, f) -//g(t-
u ) H ( f - ,) W x ( u , , ) d u d ,
1That is, the quadratic class as defined on p. 68.
(7.2.15)
Time-Frequency Reassignment
293
time
time
F i g . 7.2.1: Cross-term level and localization trade-off in the Cohen's class. We consider here the time-frequency representation of a signal composed of two different chirps whose instantaneous frequencies are detailed in (a). The kernel of the time-frequency distribution in the Cohen's class defines the degree of smoothing which is applied [1]: a weak smoothing favors a sharp localization at the expense of the presence of cross-terms (an example is the Wigner-Ville distribution displayed in a contour plot in (b)). Conversely, a stronger one leads to a lower cross-term level but also to a poorer localization (such as the spectrogram in (c)). Thanks to its smoothing-squeezing scheme, the reassignment method overcomes this trade-off as shown in (d) with the reassigned spectrogram.
Its reassigned version can be computed easily with two additional SPWDs:
A tx(t,f) - t-
SPWVtg,h(t,f)
(7.2.16)
S P W V ~ 'h (t, f )
fx(t, f ) - f + i
S P W V ~ 'dh/dt (t, f )
(7.2.17)
27r S P W V ~ 'h (t, f )
A different kind of generalization can be obtained when switching to time-scale energy distributions of the affine class [1], i.e., the quadratic distributions covariant under time shifts and dilations: -
~,a~ a
dud~
(7.2.18)
294
Chapter 7:
Measures, Performance Assessment and Enhancement
Within this framework, the reassignment operator in time is given directly by
A
1//
~ ,aa v
t~(t, a) - ~ ( t , a)
)
dudu,
(7.2.19)
whereas the reassignment operator in scale requires an intermediate step in the frequency domain ~x(t, a) -
and
.. f0
A(t,a)'
fx(t,a) = ~ ( t , a )
with
f0 - - / / f
vWx(u,u)l-I
II(t, f) dt df t a u'au
(7.2.20)
dudu
The most important case among this class is the scalogram (the squared modulus of the wavelet transform) [I], obtained by choosing for II the WVD of the chosen wavelet. Simple and efficient expressions of the reassignment operators also exist in this case [3, 4].
7.2.2 7.2.2.1
Variations and Related Approaches Two Variations
The original purpose of the reassignment principle was the design of time-frequency distributions with increased readability. But some useful information on the signal structure can also be directly extracted from the reassignment operators, as shown by the following two extensions: Signal/noise discrimination and supervised reassignment. When the analyzed signal includes broadband noise, the reassignment process yields peaked areas in noise-only regions, whereas rather smooth energy distributions are expected there. For such situations, an improved reassignment algorithm referred to as supervised reassignment [5] has been designed. This approach first attempts to discriminate between "signal+noise" and "noise only" regions in the time-frequency plane by means of a detector applied to the reassignment operators. Reassignment is then only performed for the points considered to belong to "signal+noise" regions. Signal component extraction and differential reassignment. Many signal processing problems such as denoising and signal classification can be solved by a relevant tiling of the time-frequency plane, so as to isolate each signal "component" (although this concept is not clearly defined). For such applications, a new reassignment process called differential reassignment [6] has been considered. Whereas the original reassignment principle moves each value by one finite jump, differential reassignment considers each time-frequency point as the starting point of an elementary particle whose velocity field is deduced from the reassignment operators. The final points called asymptotic reassignment points are gathered and lead to a time-frequency map in which each cell indicates a signal component.
Time-Frequency Reassignment
7.2.2.2
295
Related Approaches
Although original in many respects, the concept of reassignment is obviously connected with several approaches that have been proposed independently. We lack space to discuss these interactions precisely, but we cite: 9 The instantaneous frequency density [7], which yields at each time sample a histogram of the frequency reassignment operator of the spectrogram. 9 The extraction of ridges and skeletons out of the phase structure of the wavelet transform [8, 9]. These ridges are made of the fixed points of the reassignment operators, either horizontally (~x(t, a) - a) or vertically (tx(t, a) - t). 9 The synchrosqueezed plane [10], which also moves the scalogram values, but by a scale displacement only.
7.2.3
Summary and Conclusions
Reassignment can be viewed as the second step of a process whose goal is to build a readable time-frequency representation. It consists of:
1. a smoothing, whose main purpose is to rub out oscillatory interferences, but whose drawback is to smear localized components; 2. a squeezing, whose effect is to refocus the contributions which survived the smoothing. As a result, this approach yields without a drastic increase in computational complexity enhanced contrast (when compared to smoothed distributions such as spectrograms) with a much reduced level of interferences (when compared to the Wigner-Ville distribution). This is especially true when the signal-noise ratio is not too low, and when the signal components are not "too close" to each other. Finally, MATLABT M implementations of the algorithms discussed here are included in a freeware available at h t t p : / / c r t t s n . u n i v - n a n t e s . f r / ~ a u g e r / t f t b . h t m l .
References [1] P. Flandrin, Time-Frequency/Time-Scale Analysis. San Diego: Academic Press, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [2] K. Kodera, C. de Villedary, and R. Gendrin, "A new method for the numerical analysis of nonstationary signals," Physics of the Earth ~ Planetary Interiors, vol. 12, pp. 142-150, 1976. [3] F. Auger and P. Flandrin, "Improving the readability of time-frequency and time-scale representations by the reassignment method," IEEE Trans. Signal Processing, vol. 43, pp. 1068-1089, May 1995. [4] P. Flandrin, E. Chassande-Mottin, and P. Abry, "Reassigned scalograms and their fast algorithms," in Proc. SPIE: Wavelet Applications in Signal and Image Processing III, vol. 2569, pp. 152-158, Soc. of Photo-optical Instrumentation Engineers, San Diego, CA, 12-14 July 1995.
296
Chapter 7:
Measures, Performance Assessment and Enhancement
[5] E. Chassande-Mottin, F. Auger, and P. Flandrin, "Supervised time-frequency reassignment," in Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pp. 517-520, Paris, 18-21 June 1996. [6] E. Chassande-Mottin, I. Daubechies, F. Auger, and P. Flandrin, "Differential reassignment," IEEE Signal Processing Letters, vol. 4, pp. 293-294, October 1997. [7] D. Friedman, "Instantaneous frequency distribution vs. time: An interpretation of the phase structure of speech," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'85), pp. 1121-1124, Tampa, FL, 26-29 March 1985. [8] R. Carmona, W. L. Hwang, and B. Torr@sani, Practical Time-Frequency Analysis: Gabor and Wavelet Transforms with an Implementation in S. San Diego: Academic Press, 1998. [9] P. Guillemain and R. Kronland-Martinet, "Horizontal and vertical ridges associated to continuous wavelet transforms," in Proc. IEEE-SP Inter'nat. Syrup. on Time-Frequency Time-Scale Analysis, pp. 63-66, Victoria, BC, 4-6 October 1992. [10] S. Maes, "The synchrosqueezed representation yields a new reading of the wavelet transform," in Proc. SPIE: Wavelet Applications II, vol. 2491, pp. 532-559, Soc. of Photooptical Instrumentation Engineers, Orlando, FL, 17-21 April 1995.
Measuring Time-Frequency Distributions Concentration
7.3
297
MEASURING T I M E - F R E Q U E N C Y D I S T R I B U T I O N S CONCENTRATION 0
Efficient m e a s u r e m e n t of t i m e - f r e q u e n c y d i s t r i b u t i o n s ( T F D s ) c o n c e n t r a t i o n can provide a q u a n t i t a t i v e criterion for evaluation of various d i s t r i b u t i o n s p e r f o r m a n c e . It can be used for a d a p t i v e a n d a u t o m a t i c p a r a m e t e r selection in t i m e - f r e q u e n c y analysis, w i t h o u t supervision of a user. Measures for d i s t r i b u t i o n c o n c e n t r a t i o n of m o n o c o m p o n e n t signals d a t e back to [1, 2]. For m o r e c o m p l e x signals, some quantities from statistics a n d i n f o r m a t i o n t h e o r y were the inspiration for defining m e a s u r e s of the T F D s c o n c e n t r a t i o n [3,4]. T h e y p r o v i d e d g o o d q u a n t i t a t i v e m e a s u r e of the a u t o - t e r m s concentration. Various and efficient modifications are used in order to take into account the a p p e a r a n c e of oscillatory cross-terms. T h e application of c o n c e n t r a t i o n m e a s u r e s will be d e m o n s t r a t e d on a u t o m a t i c d e t e r m i n a t i o n of the "best window length" for the s p e c t r o g r a m or "the best n u m b e r of terms" in the m e t h o d t h a t provides t r a n s i t i o n form the s p e c t r o g r a m t o w a r d the p s e u d o W i g n e r d i s t r i b u t i o n (pseudo W D ) [Article 6.2].
7.3.1
Concentration Measurement
T h e basic idea for m e a s u r i n g T F D s c o n c e n t r a t i o n can be e x p l a i n e d on a simplified e x a m p l e m o t i v a t e d by the p r o b a b i l i t y theory. Consider a set of N n o n n e g a t i v e numbers pl, p2, ..., P N >_ 0, such t h a t PI + P2 q--... 9 P N -- 1. F o r m a simple test function M ( p l , P2,..., P N ) -- p2 + p2 + . . . + p2g. It is easy to conclude t h a t M ( p l , P2, ..., P N ) , u n d e r the c o n s t r a i n t Pl + P2 + ... q-PN -- 1, has the m i n i m a l value for Pl - P2 .... P N -- 1 / N , i.e., for m a x i m a l l y s p r e a d values of P l , P 2 , . . . , P N . T h e highest value of M ( p l , p 2 , ...,PN), u n d e r the same constraint, is achieved w h e n only one Pi is different from zero, Pi - ~(i - io), where io is an a r b i t r a r y integer 1 < io <_ N . This case c o r r e s p o n d s to the m a x i m a l l y c o n c e n t r a t e d values of p l , p 2 , - - . , P N , at a single Pio -- 1. Therefore, the function M ( p l , p 2 , . . . , P N ) can be used as a measure of c o n c e n t r a t i o n of the set of n u m b e r s pl, p2, ..., P N , u n d e r the unity s u m constraint. 1 In general, the c o n s t r a i n t can be included in the function itself by using the form M ( p l , P2, ..., PN) -- (p2 ~L_p2 -1L ... + p2N) / (Pl -[- P2 -[- ... -~- pN)2. For nonnegative Pl,P2, ...,PN this function has the m i n i m u m for Pl - P2 - ... - P N , a n d reaches its m a x i m a l value w h e n only one Pi is different from zero. In t i m e - f r e q u e n c y analysis this idea has been used in order to m e a s u r e the concentration. Several forms of the c o n c e n t r a t i o n measure, based on this f u n d a m e n t a l idea, are introduced. 1. M e a s u r e b a s e d o n t h e r a t i o o f n o r m s - For the W D of energy normalized signals, the r e l a t i o n ~--~-nE k p2x(?l, k) - 1 holds. Therefore, s u b s t i t u t i o n ~ LJubi~a Stankovid, Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (1.stankovic@ieee.org). Reviewers: W. J. Williams and P. Flandrin. 1in probability theory, the famous Shannon entropy -~-']i Pi log(p/) is commonly used for the same purpose. It produces the maximal value for the lowest concentration of probabilities Pi, Pl --- p2 . . . . . PN -- 1/N, and the minimal value for the highest concentration p~ ----~(i - i0).
Chapter 7:
298
Measures, Performance Assessment and Enhancement
Pi ~ p2(n, k) in the basic example, gives a function t h a t can be used for measuring the concentration of the time-frequency representation px(n, k): (L4) 4
Mjp -
~2
~n~kP~(n'k)
(7.31)
---- (En Ek P2(n, k)) 2"
This form is just the fourth power of the ratio of L4 and L2 norms of px(n, k). 2 It has been introduced by Jones and Parks in [3]. They have used the magnitude of the signal's short-time Fourier transform as the time-frequency representation px(n, k) in (7.3.1). High values of Mgp indicate that the representation px(n,k) is highly concentrated, and vice versa. In general, any other ratio of norms Lp and Lq, p > q > 1, can also be used for measuring the concentration of px(n, k) [3]. W h e n there are two or more components (or regions in time-frequency plane of a single component) of approximately equal energies (importance), whose concentrations are very different, the norm based measures will favor the distribution with a "peaky" component, due to raising of distribution values to a high power. It means t h a t if one component (region) is "extremely highly" concentrated, and all the others are "very poorly" concentrated, then the measure will not look for a trade-off, when all components are "well" concentrated. In order to deal with this kind of problems, common in time-frequency analysis, a concentration measure could be applied to smaller, local time-frequency regions [3]:
E n E k Q2( m - It, l - k)p4 (m, l) MjpL(n, k) - (E,~ E k Q(m - n, l - k)p~(m, l)) 2
(7.3.2)
The localization weighting function Q(n, k) determines the region where the concentration is measured. In [3] the Gaussian form of this function is used. 2. R@nyi e n t r o p y b a s e d m e a s u r e s : The second class of T F D measures is defined in analogy with the Rdnyi entropy. It has been introduced in time-frequency analysis by Williams et al. [4,5], with a significant contribution of [6,7] in establishing the properties of this measure. The R@nyi entropy, applied on the T F D px(n,k), has the form R~=
1 - 1a
log2(En
E k Pax(n'k))
(7.3.3)
with a > 2 being recommended for the T F D measures [7]. For a = 2 and the W D of energy normalized signals (~-]n ~-~kp2(n, k) = 1), we have R2 - 0 for all signals. Note t h a t the logarithm is a monotone function. Thus, the behavior of R~ is determined by the argument ~-~n~]k p~(n,k) behavior, as explained at the beginning of this section. In contrast to the measure (7.3.1), the entropy (7.3.3) has larger values for less concentrated distributions due to a negative coefficient 2In statistics, similar form (known as kurtosis) is used as a measure of the flatness or peakedness of a distribution. Kurtosis is zero for a Gaussian distribution. Values greater than zero mean that the distribution has more of a peak than a Gaussian distribution, while values less than zero mean flatter distributions.
Measuring Time-Frequency Distributions Concentration
299
1 / ( 1 - c~) for c~ > 2. This will be the case for all other measures which will be presented in the sequel. It is interesting to note that the S h a n n o n e n t r o p y
H - - E n E k [pz(n, k)log 2 px(n, k)] could be recovered from the R~nyi entropy, from the limit case a --~ 1, [7]. The Shannon entropy could not be used for general T F D s px(n,k), which can assume negative values [7]. 3. N o r m a l i z e d R ~ n y i e n t r o p y m e a s u r e s . In order to avoid the problem which could be caused by the fact that the R~nyi entropy based measure with (~ = 3 ignore the presence of oscillatory cross-terms (when the auto-terms are well separated), some kind of normalization should be done. It can be done in various ways, leading to a variety of possible measure definitions [4]. Normalization with the distribution volume is performed as:
1 3 RV3 - --~ log 2 E n E k [px(n,k)/ E n E k Ipx(n,k)l] 9
(7.3.4)
If the distribution contains oscillatory values, then summing their absolute values means that large cross-terms will decrease the measure RVa. This is the expected behavior of a measure, since it will seek for a balance between the cross-terms suppression and auto-terms enhancement. The volume normalized form of measure has been used for adaptive kernel design in [4]. 4. The basic idea for the measure that will be presented next comes from an obvious c l a s s i c a l d e f i n i t i o n of t h e t i m e - l i m i t e d s i g n a l d u r a t i o n . If a signal x(n) is time-limited to the interval n E [nl, n2 - 1], i.e., x(n) ~ 0 only for n E [ n : , n 2 - 1], then the duration o f x ( n ) is d = n 2 - n : . It can be written as d - limp-.oo ~-~-nIx(n)l 1/p" T h e s a m e definition applied to a two-dimensional function pz(n, k) ~: 0 only for (n, k) E Dz, gives
No -
lim
p----*oo
E n E k IP~(n'k)l:/p
(7.3.5)
where ND is the number of points within Dx. In reality, there is no a sharp edge between p~(n, k) 7/=0 and p~(n, k) = 0, so the value of (7.3.5) could, for very large p, be sensitive to small values of p~(n, k). The robustness m a y be achieved by using lower order forms, for example with p = 2. Therefore, the concentration can be measured with the function of the form
M~ -(~-~n ~-~-kIp~(n, k)l:/P) p, with
~-~n~-~-kpx(n, k)
(7.3.6)
= 1, and p > 1.
After we have presented several possible forms for measuring the concentration of TFDs, we can summarize a p r o c e d u r e for c o n s t r u c t i n g a T F D m e a s u r e
300
Chapter 7:
Measures, Performance Assessment and Enhancement
based on one-dimensional classical signal analysis definitions, or definitions from either probability, quantum mechanics, or information theory: (i) In the classical signal analysis definitions, consider the signal power Ix(t)i 2 (spectral energy density Ix(f)l 2) as the probability density function in time (frequency). This idea comes from quantum mechanics, where the absolute square of the wave function is the position's probability density function. (ii) Assume that the TFD pz(t, f ) can be treated as a joint two-dimensional probability density function. (iii) According to these assumptions, reintroduce one-dimensional definition into joint two-dimensional time-frequency domain. (iv) Additional modifications, interpretations, constraints, and normalizations are needed in order to get forms that can be used in time-frequency analysis. For example, several possible forms of the R~nyi entropy measure in time-frequency domain have been proposed and used in various problems. Example: Consider the classic L e i p n i k e n t r o p y m e a s u r e [2], and Z a k a i ' s e n t r o p y p a r a m e t e r 5t = - f_c~oo Ix(t)l 2 In Ix(t)l 2 dt of signal x(t) [2]. According to the p r o c e d u r e for c o n s t r u c t i n g a time-frequency form, based on a classical signal processing relation, we get
~t = -
/
c~
Ix(t) 12 In Ix(t)l 2 dt
-* -
oo
px(t, f ) l n p x ( t , f)dtdf (x)
(7.3.7)
oo
This is exactly the well known S h a n n o n e n t r o p y . It has already been discussed in [7] with respect to its (non)applicability in time-frequency problems. In a similar way, a logarithm of the general Zakai's signal d u r a t i o n (uncertainty)
Z a = log 2 T2a = ~1 - a log 2 (f_o~ ix(t)12 dt)a according to the proposed procedure, t r a n s f o r m s into the R~nyi entropy measure, Z~ ~
1-c~
log 2
c~
oo
p~ (t, f)dtdf = R~
where Iz(t)l 2 has been replaced by pz(t, f), and the unit signal energy is assumed.
R e m a r k : In the probability theory all results are derived for the probability values Pi, assuming that ~--~4pi = 1 and pi _> 0. The same assumptions are made in classical signal analysis for the signal power. Since a general TFD commonly does not satisfy both f-~c~ f - ~ pz(t, f) - 1 and p~(t, f ) >_ O, the obtained measures of TFD concentration may just formally look like the original entropies or classical signal analysis forms, while they can have different behavior and properties. 3 3 Q u a n t u m mechanics forms can also be used for the definition of highly c o n c e n t r a t e d signal representations. One of t h e m is the "pseudo q u a n t u m " signal representation [8] in the form of SDx(t,g~) = f-~oo x[L]( t + T/(2L))x*[L]( t - - T / ( 2 L ) ) e - j ~ r d T ' with x[L](t) = A ( t ) e x p ( j L r for z(t)
= A(t) exp(jr For example, for x(t) = A e x p ( - a t 2 / 2 + jbt2/2 + jct) we get SDx(t, fJ) = A 2 e x p ( - a t 2 ) v / 4 7 r / ( a / L 2 ) e x p ( - ( g ~ - b t - c)2/(a/L2)). For a / L 2 --, 0 it results in SD(t, fO) -- 27rA2 exp(-at2)5(g ~ - b t - c ) , w h a t is just an ideally c o n c e n t r a t e d distribution along the i n s t a n t a n e o u s frequency. For a large a, if L 2 is large enough so t h a t a / L 2 --. O, we get the d i s t r i b u t i o n highly concentrated in a very small region a r o u n d the point (t, ~o) -- (0, c).
301
Measuring Time-Frequency Distributions Concentration
7.3.2
Numerical Examples
Consider the spectrogram
k)
k)l
where F~' (n, k) - DFTm~k {w(m)x(n + m)} is the short-time Fourier transform (STFT); E is the energy of the lag window w(m). Among several spectrograms, calculated with different window lengths or forms, the best one according to the one of proposed concentration measures, denoted by .M[pz(n,k)], will be that which minimizes (or maximizes, depending on the used measure form)" w + - arg min {.M [S:(n,k)]}.
(7.3.8)
w
Let us illustrate this by an example. Consider the signal
x(t) - cos(50 cos(~t) + 10~t ~ + 7 0 ~ t ) + cos(25~t ~ + 180~t)
(7.3.9)
sampled at At - 1//256, within - 1 _< t < 1. The Hanning window w(m) with different lengths is used in the spectrogram calculation. Here, we have used the measure (7.3.6) with p - 2, although for this signal all presented measures would produce similar results [9]. Note that the presented measures would significantly differ if, for example, the second component were pure sinusoid cos(180~-t) instead of cos(25~-t 2 + 180~t). For wide lag windows, signal nonstationarity makes the spectrogram very spread in the time-frequency plane, having relatively large measure A/t [S~(n, k)] - M 2, Figs. 7.3.1(a), 7.3.1(b). For narrow lag windows its Fourier transform is very wide, causing spread distributions and large M 2, Figs. 7.3.1(d), 7.3.1(e). Obviously, between these two extreme situations there is a window that produces an acceptable trade-off between the signal nonstationarity and small window length effects. The measure M 2 is calculated for a set of spectrograms with N - 32 up to N - 256 window length, Fig. 7.3.1(f). The minimal measure value, meaning the best concentrated spectrogram according to this measure, is achieved for N - 88, Fig. 7.3.1(f). The spectrogram with N - 88 is shown in Fig. 7.3.1(c). The same procedure will be used for determination of the optimal number of terms L, in a transition from the spectrogram to the pseudo WD, according to the recursive form of the S-method (SM) [Article 6.2]" SMx(n, k; L) - SM=(n, k; L - 1) + 2~{F=(n, k + L)F:~(n, k - L)}
(7.3.10)
where SMz(n, k; 0) - I F y ( n , k)] 2, and both k + L and k - L are within the basic frequency period. Note that SM(n, k; N/2) is equal to the pseudo WD. The optimal distribution SM+(n, k; L), on our way from L - 0 (the spectrogram) toward L N/2 (the pseudo WD), is the one calculated with L producing the minimal value of
M[SMx(n,k;L)], L + - arg rn~n {A/~ [SMx(n,k;L)]}.
(7.3.11)
302
Chapter 7:
Measures, Performance Assessment and Enhancement
F i g . 7.3.1: (a)-(e) Spectrogram for various window lengths, and (f) its measure j ~ [SW(n,k)] - M 2. The lowest M 2 is achieved for N = 88, being the best window length choice according to this measure. (g)-(k) The S-method for various values of parameter L, and (I) its measure 2v[ [SMx(n, k; L)] - M 2. The lowest M 2 is obtained for L - 9.
Here, instead of ISM~(n, k; L)I, a nonnegative part of SMx(n,k; L) will be used. Distributions SMz(n,k; L) should be properly scaled in order to satisfy unbiased energy condition. The same signal is used for the illustration of the SM. Since this method is based on the WD, the best results will be achieved with a wide lag window in the STFT calculation, N - 256. The spectrogram (L - 0) is shown in Fig. 7.3.1(g). By increasing L the SM improves concentration of the spectrogram toward the pseudo WD quality, meaning lower measure {M [SMx(n, k; L)]} = M 2, Fig. 7.3.1(h), 7.3.1(i). After L has reached the value equal to the distance between the auto-terms, cross-terms start to appear, increasing M 2, Figs. 7.3.1(j), 7.3.1(k). Minimal M 2 means a trade-off between the auto-terms concentration and the crossterms appearances, Fig. 7.3.1(k). The SM with L corresponding to minimal M22 is shown in Fig. 7.3.1(1). The concentration measure is illustrated on time-frequency analysis of a pressure signal in the BMW engine with speed 2000 [rev/min], Fig. 7.3.2, [Article 15.2].
7.3.3
Parameter Optimization
Parameter optimization may be done by a straightforward computation of a distribution measure .M[px(n, k)], for various parameter values. The best choice according to this criterion (optimal distribution with respect to this measure) is the
Measuring Time-Frequency Distributions Concentration
303
Fig. 7.3.2: Concentration measure illustration on time-frequency analysis of a car engine pressure signal. Signal, and its S-method based time-frequency representations are given. Time is rescaled into corresponding crank-angle. The best choice according to this measure was L -- 3.
distribution which produces the minimal value of Ad[px(n, k)]. In the cases when one has to consider a wide region of possible parameter values for the distribution calculation (like for example window lengths in spectrogram), this approach can be numerically inefficient. Then, some more sophisticated optimization procedures, like the one using the steepest descent approach described in [4], can be used. Its simplified version will be presented here [9]. The gradient of a measure A,4[p~:(n,k)], with respect to a distribution's generalized optimization parameter denoted by ~, is
k)]
k)] op
opt(n, k)
(n,k)
"
Iterations, starting from a very low concentrated distribution toward the maximally concentrated one, i.e., toward the measure minimum, can be done according to ~rn+l -- ~rn
-
-
#C~./~[px(n, k)]/O~
(7.3.12)
where # is the step, which should be chosen in the same way as the step in the other adaptive algorithms. The step should not be too small (since the convergence would be too slow), and not too large (to miss the minimum, or cause the divergence). In discrete implementations, the gradient OAd[pz(n, k)]/O~ can be approximated based on .M[px(n, k; ~m)] calculated with ~m and its previous value ~m-1
.A/~[px(n , k; ~rn)] - .A/[[Px(ft, k; ~m-1)]
(7.3.13)
304
Chapter 7:
Measures, Performance Assessment and Enhancement
Example: The optimization procedure will be illustrated on the signal x(t), its spectrogram, and the measure form Example 2. The optimal window length is obtained in few iterations by using (7.3.13), starting from the very narrow window. Values of ~0 = N = 16 and ~1 = N = 20 in the initial and first iteration, are assumed. The next value of ~m+l -= N is calculated according to (7.3.13). During the iterations we get ~ m - - 16, 20, 76, and 90. The algorithm is stopped at ~,n = 90, when I ~ m + l - ~ml < 2, since even number of samples are used in the realization. Note that the obtained optimal value is within +2 of the value obtained by direct calculation. The value of parameter # = 1//3 has been used in all examples.
7.3.4
Summary and Conclusions
Measurement of time-frequency distributions concentration, with application to an automatic optimization of distribution parameters, is presented. It is based on the forms borrowed from the classical signal analysis, probability, or information theory, with appropriate interpretations and adjustments.
References [1] D. Gabor, "Theory of communication," J. IEE, vol. 93(III), pp. 429-457, November 1946. [2] D. E. Vakman, Sophisticated Signals and the Uncertainty Principle in Radar. New York: Springer, 1968. Translated by K. N. Trirogoff; edited by E. Jacobs. [3] D. L. Jones and T. W. Parks, "A high-resolution data-adaptive time-frequency representation," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 38, pp. 2127-2135, December 1990. [4] T.-H. Sang and W. J. Williams, "R~nyi information and signal-dependent optimal kernel design," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), vol. 2, pp. 997-1000, Detroit, 9-12 May 1995. [5] W. J. Williams, M. L. Brown, and A. O. Hero III, "Uncertainty, information and time-frequency distributions," in Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations II, vol. 1566, pp. 144-156, Soc. of Photo-optical Instrumentation Engineers, San Diego, 24-26 July 1991. [6] R. G. Baraniuk, P. Flandrin, A. J. E. M. Janssen, and O. J. J. Michel, "Measuring time-frequency information content using the R~nyi entropies," IEEE Trans. Information Theory, vol. 47, pp. 1391-1409, May 2001. [7] P. Flandrin, R. G. Baraniuk, and O. Michel, "Time-frequency complexity and information," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'94), vol. 3, pp. 329-332, Adelaide, Australia, 19-22 April 1994. [8] L. Stankovi6, "Highly concentrated time-frequency distributions: Pseudo-quantum signal representation," IEEE Trans. Signal Processing, vol. 45, pp. 543-551, March 1997. [9] L. Stankovi6, "A measure of some time-frequency distributions concentration," Signal Processing, vol. 81, pp. 621-631, March 2001.
Resolution Performance Assessment for Quadratic TFDs
7.4 7.4.1
305
R E S O L U T I O N P E R F O R M A N C E A S S E S S M E N T FOR Q U A D R A T I C TFDs o Selecting and Comparing TFDs
Quadratic time-frequency distributions (TFDs) are effective tools for extracting information from a non-stationary signal, such as the number of components, their durations and bandwidths, components' relative amplitudes and instantaneous frequency (IF) laws (see Chapters 1 and 2). The performance of TFDs depends on the type of signal (see Chapter 3) [1,2]. For example, in the case of a monocomponent linear FM signal, the Wigner-Ville distribution is known to be optimal in the sense that it achieves the best energy concentration around the signal IF law (see Article 2.1 for more details) [1]. In applications involving multicomponent signals, choosing the right TFD to analyze the signals is an immediate critical task for the signal analyst. How best to make this assessment, using current knowledge, is the subject of this article. Let us, for example, consider a multicomponent whale signal, represented in the time-frequency domain using the Wigner-Ville distribution, the spectrogram, the Choi-Williams distribution, the Born-Jordan distribution, the Zhao-Atlas-Marks (ZAM) distribution, and the recently introduced B-distribution [3] (see Fig. 7.4.1). To determine which of the TFDs in Fig. 7.4.1 "best" represents this whale signal (i.e. which one gives the best components' energy concentration and best interference terms suppression, and allows the best estimation of the components' IF laws) one could visually compare the six plots and choose the most appealing. The spectrogram and the B-distribution, being almost free from the cross-terms, seem to perform best. The performance comparison based on the visual inspection of the plots becomes more difficult and unreliable, however, when the signal components are closelyspaced in the time-frequency plane. To objectively compare the plots in Fig. 7.4.1 requires to use a quantitative performance measure for TFDs. There have been several attempts to define objective measures of "complexity" for TFDs (see Section 7.3.1). One of these measures, the R(Snyi entropy given in [4], has been used by several authors in preference to e.g. the bandwidth-duration product given in [1]. The performance measure described in this article, unlike the Re~nyi entropy, is a local measure of the TFD resolution performance, and is thus more suited to the selection problem illustrated by Fig. 7.4.1. This measure takes into account the characteristics of TFDs that influence their resolution, such as energy concentration, components separation, and interference terms minimization. Methodologies for choosing a TFD which best suits a given signal can then be developed by optimizing the resolution performance of considered TFDs and modifying their parameters to better match application-specific requirements. ~ Boualem Boashash and Victor Sucic, Signal Processing Research Centre, Queensland University of Technology, GPO Box 2434, Brisbane, Q 4001, Australia (b.boashash@qut.edu.au, v.sucic@qut.edu.au). Reviewers: W. J. Williams and LJ. Stankovi6.
Chapter 7:
306
Measures, Performance Assessment and Enhancement
Fig. '7.4.1: TFDs of a multicomponent whale signal.
7.4.2
Performance Criteria for TFDs
In the case of monocomponent FM signals, the best TFD is that which maximizes energy concentration about the signal instantaneous frequency. This is achieved by minimizing component sidelobe amplitude As relative to mainlobe amplitude Am, and mainlobe bandwidth B relative to central frequency f (see Fig. 7.4.2). The instantaneous concentration performance of a TFD may thus be quantified by the measure p expressed as: p(t) =
As(t) B(t) Am(t) f(t)
(7.4.1)
A good performance is characterized by a small value of the measure p. For example, for the Wigner-Ville distribution of a linear FM signal with infinite duration, the bandwidth B and the sidelobe amplitude As are zero [1], and we obtain p = 0. For multicomponent FM signals, the performance of a TFD can be quantitatively assessed in terms of: 9 the energy concentration of the TFD about the respective instantaneous frequency of each component, as expressed by Eq. (7.4.1), and 9 the components resolution, as measured by the frequency separation of the components' mainlobes, including the effect of cross-terms.
Resolution Performance Assessment for Quadratic TFDs
307
Fig. 7.4.2: Slice of a TFD of a monocomponent signal at time t = to. The dominant peak is the component, while the other peaks are the sidelobes. For clarity of presentation, we limit ourselves to measuring the mainlobe bandwidth at 0.71 of the component normalized amplitude Am.
F i g . 7.4.3: Diagram illustrating the resolution of a two-component signal in the absence of cross-terms. The lobes are clearly distinguished from each other; the components are said to be resolved.
For stationary signals, the frequency resolution in a power spectral density estimate of a signal composed of two single tones, fl and f2 (see Fig. 7.4.3), may be defined as the minimum difference f2 - fl for which the following inequality holds:
fl + B1/2
< f 2 - B2/2,
fl < f2
(7.4.2)
where B1 and B2 are the respective bandwidths of the first and the second sinusoid. In the case of non-stationary signals, for a TFD pz(t, f) of a two-component signal, the above definition of resolution is valid for every time slice of a cross-termfree TFD, such as the spectrogram. However, for TFDs exhibiting cross-terms, we need to take into account the effect of cross-terms on resolution. A slice of a typical quadratic TFD, with components clearly resolved, is shown in Fig. 7.4.4(a), where Bl(to), fl(to), Asl (to) and Am1 (to) represent respectively the instantaneous bandwidth, the IF, the sidelobe amplitude and the mainlobe amplitude of the first component at time t = to. Similarly, B2(t0), f2(t0), As2(to) and Am2 (to) represent the instantaneous bandwidth, the IF, the sidelobe amplitude and the mainlobe amplitude of the second component at the same time to. The amplitude Ax(to) is that of the cross-term. An example of a quadratic TFD with non-resolved components is shown in Fig. 7.4.4(b).
308
Chapter 7:
(a) The two dominant peaks are the (resolved) signal components, the middle peak is the cross-term, and the other peaks are the sidelobes
Measures, Performance Assessment and Enhancement
(b) The two components and the cross-term have merged into a single peak; we say that the components resolved are
not
F i g . 7 . 4 . 4 : Slice of a T F D of a two-component signal taken at time t -- to.
7.4.3
Resolution Performance Measure for TFDs
Eq. (7.4.2) and Fig. 7.4.4(a) suggest that the resolution performance of a TFD for a neighboring pair of components in a multicomponent signal may be defined by the minimum difference D(t) = f 2 ( t ) - fl (t) for which we still have a positive separation measure S(t) between the components' mainlobes centered about their respective IFs, fl (t) and f2(t). For best resolution performance of TFDs, S(t) should be as close as possible to the true difference between the actual IFs. The components' separation measure S(t) is expressed as [5]:
~(t) : (f2(t)B2(t))- (re(t)
Bl (2t) )
(7.4.3)
The resolution also depends on the following set of variables, all of which should be as small as possible: (a) the normalized instantaneous bandwidth of the signal component Bk(t)/fk(t), k = 1, 2, which is accounted for in S(t) (Eq. (7.4.3)), (b) the ratio of the sidelobe amplitude IAsk (t)l to the mainlobe amplitude IAmk (t)l , k = 1, 2, of the components, and (c) the ratio of the cross-term amplitude IAx(t)l to the mainlobe amplitudes of the signal components IAmk (t)l, k = 1, 2. It follows that the best TFD for multicomponent signals analysis is the one that concurrently minimizes the positive quantities (a), (b), (c), and maximizes S(t). Hence, by combining the above variables, expressions for a measure P(t) of the resolution performance of a given TFD can be defined. Two have been proposed
309
Resolution Performance Assessment for Quadratic TFDs
in [5], among these a normalized performance measure expressed as:
1{ Am(t) As(t)
P(t)- 1 - -~
1 A=(t)
Am(t)
s(t) +(1 D(t)) }
(7.4.4)
where, for a pair of signal components, A,~(t) and As(t) are respectively the average amplitudes of the components' mainlobes and sidelobes, Az(t) is the cross-term amplitude, S(t), defined by Eq. (7.4.3), is a measure of the components' separation in frequency, and D(t) = f2(t)- fl (t) is the difference between the components' actual IFs. The algorithm presented in [6] describes how the parameters in Eq. (7.4.4) are measured in practice. The measure P(t) is close to 1 for well-performing TFDs and 0 for poorlyperforming ones. Therefore, when designing a TFD we want to maximize P(t) in order to reduce the cross-terms, while preserving the components' resolution. In some applications involving real-life signals, we may need to better discriminate between different TFDs resolution performances in a given set of K TFDs. In this case, a suitable alternative to P(t) that was proposed in [7] could be used. It is expressed as: Amj(t)
Mj(t)-l--~
Amj, (t) l
max (I ]Amk(t) A*~(t) / l
9
Dj(t)
max (IA=~(t) I l
A.~ k (t) /
max
1
(Bk(t)) Dk(t)
(7.4.5) where Mj(t) (1 _< j <_ K) is the resolution performance measure of the j - t h TFD, and B is the average instantaneous bandwidth of the components mainlobes. The measure M(t) is used in Section 7.4.5 to compare the performances of quadratic TFDs of a real-life signal, as it discriminates better than the measure P(t) for real-life signals [8].
7.4.4
Application to the Selection of the Optimal T F D for a Given Multicomponent Signal
A methodology for selecting the optimal TFD for resolving closely-spaced components in a multicomponent signal involves then the following steps: 1. Define a set of comparison criteria describing the information sought from TFDs (Section 7.4.2). 2. Objectively measure the resolution performance of TFDs based on these criteria (use the measure P defined by Eq. (7.4.4)). 3. Optimize each TFD to match the criteria as close as possible [5, 6]: Select as the optimal TFD kernel parameter value the one which maximizes the overall performance measure Poveran, taken to be the mean of the instantaneous measures P in a time interval of interest.
Chapter 7:
310
Measures, Performance Assessment and Enhancement
T a b l e 7.4.1: Optimization results for the TFDs of signal s(t) defined by Eq. (7.4.6).
TFD
Optimal value of the kernel parameter
Spectrogram Wigner-Ville Choi-Williams Born-Jordan Zhao-Atlas-Marks (ZAM) Modified B
Bartlett window, length 31 N/A in this case a = 1 N/A in this case a = 2 ~ = 0.04
Po~er~]l 0.86 0.62 0.82 0.81 0.67 0.88
4. Quantitatively compare TFDs and select the best one: An optimized TFD which has the largest value of Poverall is selected as best for representing the given signal in the joint time-frequency domain. E x a m p l e : We define the following two-component signal in noise: 8(t) -- 81(t) q-
82(t)-~- n(t)
= cos (21r (0.1 t + 2 t2)) + cos (27r (0.2 t + ~ t2)) + n(t)
(7.4.6)
where a = 0.0016 is the component bandwidth-duration ratio (duration T = 128), and n(t) is additive white Gaussian noise with signal-to-noise ratio SNR = 10 dB. The sampling frequency is f~ = 1 Hz. The signal s(t) is analyzed in the time-frequency domain using the following TFDs: the spectrogram, the Wigner-Ville distribution, the Choi-Williams distribution, the Born-Jordan distribution, the Zhao-Atlas-Marks (ZAM) distribution, and the Modified B-distribution [9]. To find the optimal TFD for resolving the two components of s(t), we first find the optimal values of the TFDs kernel parameters, as described in the above methodology. The Wigner-Ville distribution and the Born-Jordan distribution have no "smoothing" parameters, hence do not need optimizing. The optimized TFD with the largest Pover~ll among the considered TFDs is then selected as optimal for representing s(t). Table 7.4.1 lists the results of the optimization process, and it shows that the signal optimal TFD is the Modified B-distribution with the parameter/~ = 0.04. All optimized TFDs are plotted in Fig. 7.4.5. From the signal optimal TFD important signal parameters can be measured (see Table 7.4.2). In addition, by optimizing components' concentration and resolution, more accurate components IF laws' estimates are obtained from the peaks of the optimal TFD's dominant ridges in the time-frequency plane [1] (see Fig. 7.4.6).
7.4.5
Use of the Performance Measure in Real-Life Situations
The methodology defined in this section enables to select a real-life signal bestperforming TFD in an objective, automatic way. Its use should make time-frequency techniques more applicable in practice (e.g. machine condition monitoring described in Articles 15.2 and 15.6, or other applications presented in Part V).
Resolution Performance Assessment for Quadratic TFDs
311
Fig. 7.4.5: Optimized TFDs of signal s(t) defined by Eq. (7.4.6). Table 7.4.2: Parameters of Sl(t) and s2(t) (Eq. (7.4.6)) measured from the signal s(t) optimal TFD (Modified B-distribution, /~ -- 0.04). The values shown are the averages over t E [32, 96].
Parameter
Component sl(t)
Component s2(t)
Instantaneous bandwidth B Mainlobe amplitude IA~nl Sidelobe amplitude IAs] Cross-term amplitude ]Axl
0.0194 1.0002 0.0900
0.0195 0.9574 0.0858 0.1503
The methodology consists of the following steps: 1. Represent the signal in the time-frequency domain with a quadratic TFD, i.e. a smoothed Wigner-Ville distribution (see Article 3.2). Following the approach described in Article 5.7, we smooth the WVD in both time t and lag T with the Hanning window of length equal to a quarter of the signal duration. This time-frequency smoothing is intended to suppress the WVD inner and outer artifacts, while preserving components time-frequency features (see Article 4.2 for more details). 2. For the different time instants of the smoothed WVD, select the two closest dominant peaks in the frequency direction. To achieve the best resolution of
312
Chapter 7:
(a) Modified B (~ = 0.04)
Measures, Performance Assessment and Enhancement
(b) Spectrogram (Bartlett, L = 31)
Fig. 7.4.6: Comparison of the measured (dashed) and true (solid) IF laws of the component sl(t) (left) and s2(t) (right) of the signal s(t) defined by Eq. (7.4.6). The mean-square-errors (MSEs) of the IF estimates obtained from the peaks of the signal optimized Modified B-distribution (best-performing TFD) are given in (a), and those obtained from the peaks of the signal optimized spectrogram (second best TFD) in (b).
the signal components, the best resolution of the two closest components at an observed time instant is sufficient [10]. Note that if a signal is monocomponent or no components exist at a particular time, this time instant is not considered. 3. For the selected pairs of components, optimize different T F D s using the resolution performance measure M defined by Eq. (7.4.5). The measure M is used over P since it is a better discriminator of real-life signals T F D s resolution performances [11]. The kernel parameter value, which from a set of different values considered, maximizes the overall performance measure (the mean of M over the observed times) is selected as the kernel parameter optimal value. 4. Calculate the measure M of the optimized T F D s for each of the selected pairs of signal components. The T F D which maximizes the average (over time) M is selected as the signal best-performing T F D among the considered TFDs. E x a m p l e : To illustrate how to use this methodology in practice, let us find the bestperforming T F D for the Noisy Miner (Manorina melanocephala) song signal. The same T F D s we considered in the synthetic signal optimal T F D selection example will be considered in this real-life signal example. We start by representing the signal in the time-frequency domain with the Wigner-Ville distribution smoothed in time and lag with the Hanning windows of length L = 3501 (Fig. 7.4.7(a)). For each time instant of the smoothed WVD we then identify the pair of closest components. From Fig. 7.4.7(b) we can see that different components form such pairs at different times. Next, as described in steps 3 and 4 of the above-defined methodology, the six considered T F D s are first optimized, after which their resolution performances are
313
Resolution Performance Assessment for Quadratic TFDs
(a) Smoothed W V D (Hanning, L = 3501)
(b) Pairs of closest components
(c) Modified B (fl = 0.91)
Fig. 7.4.7: Optimization of the bird song signal TFDs. The signal Smoothed WVD is shown in (a), and the pairs of its closest components in (b), with the overall closest pair (at time t -- 4295) marked by the squares. The signal optimized Modified B-distribution is shown in (c).
Table 7.4.3: Optimization and comparison results for the TFDs of the Noisy Miner song signal. The values of Moverall indicate that the spectrogram performs better than most traditional TFDs in this case. Only the Modified B-distribution performs better than the spectrogram and all others.
TFD
Optimal value of the kernel parameter
Moveran
Spectrogram Wigner-Ville Choi-Williams Born-Jordan Zhao-Atlas-Marks (ZAM) Modified B
Bartlett window, length 3501 N/A in this case a = 0.004 N/A in this case a = 2 fl = 0.91
0.90 0.50 0.74 0.65 0.63 0.93
evaluated using the measure M. Table 7.4.3 shows the signal TFDs kernel parameters optimization and the TFDs resolution performance results. The Modified B-distribution ibr ~ = 0.91, plotted in Fig. 7.4.7(c), is found to have the largest value of Moverall (the mean of M over the time instants). Therefore, we select this TFD as best to represent the Noisy Miner song signal in the time-frequency plane.
7.4.6
Summary and Conclusions
This article defines a measure for assessing the resolution performance of quadratic TFDs in separating closely-spaced components in the time-frequency domain. The measure takes into account key attributes of TFDs, such as components' mainlobes and sidelobes, and cross-terms. The introduction of this measure allows to quantify the quality of TFDs instead of relying solely on visual inspection of plots. The resolution performance measure also allows for selecting the optimal T F D in a given practical application, and improving methodologies for designing high resolution quadratic TFDs, such as the Modified B-distribution.
314
Chapter 7:
Measures, Performance Assessment and Enhancement
References [1] B. Boashash, "Time-frequency signal analysis," in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: Prentice-Hall, 1991. [2] L. Stankovi5, "An analysis of some time-frequency and time-scale distributions," Ann. Tdldcommunications, vol. 49, pp. 505-517, September/October 1994. [3] B. Barkat and B. Boashash, "A high-resolution quadratic time-frequency distribution for multicomponent signals analysis," IEEE Trans. Signal Processing, vol. 49, pp. 22322239, October 2001. [4] T.-H. Sang and W. J. Williams, "R~nyi information and signal-dependent optimal kernel design," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), vol. 2, pp. 997-1000, Detroit, 9-12 May 1995. [5] B. Boashash and V. Sucic, "Resolution measure criteria for the objective assessment of the performance of quadratic time-frequency distributions," IEEE Trans. Signal Processing, vol. 51, pp. 1253-1263, May 2003. [6] V. Sucic and B. Boashash, "Parameter selection for optimising time-frequency distributions and measurements of time-frequency characteristics of non-stationary signals," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), vol. 6, pp. 3557-3560, Salt Lake City, UT, 7-11 May 2001. [7] V. Sucic, B. Boashash, and K. Abed-Meraim, "A normalised performance measure for quadratic time-frequency distributions," in Proc. Second IEEE Internat. Syrup. on Signal Processing and Information Technology (ISSPIT'02), pp. 463-466, Marrakech, Morocco, 18-21 December 2002. [8] V. Sucic and B. Boashash, "Selecting the optimal time-frequency distribution for reallife multicomponent signals under given constraints," in Proc. Eleventh European Signal Processing Conf. (EUSIPCO-02), vol. 1, pp. 141-144, Toulouse, 3-6 September 2002. [9] Z. M. Hussain and B. Boashash, "Adaptive instantaneous frequency estimation of multicomponent FM signals using quadratic time-frequency distributions," IEEE Trans. Signal Processing, vol. 50, pp. 1866-1876, August 2002. [10] V. Sucic and B. Boashash, "Optimisation algorithm for selecting quadratic timefrequency distributions: Performance results and calibration," in Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), vol. 1, pp. 331-334, Kuala Lumpur, 13-16 August 2001. [11] V. Sucic and B. Boashash, "An approach for selecting a real-life signal best-performing time-frequency distribution," in Proc. Seventh Internat. Syrup. on Signal Processing and its Applications (ISSPA '03), vol. 1, pp. 100-104, Paris, 1-4 July 2003.
315
Joint-Domain Representations via Discrete-Domain Frames
7.5
JOINT-DOMAIN REPRESENTATIONS VIA DISCRETE-DOMAIN FRAMES 0
Representation of a signal in joint domains has been a very active area in signal processing. Prime examples of such representation are the short-time Fourier transform (STFT) and the discrete-time Gabor transform (DTGT) in the joint time-frequency domain, and filter banks and wavelets in the joint time-scale domain. The varieties of joint-domain transforms are unified in this article in the form of product function and cascaded frames.
7.5.1
Frames and Reconstruction Collections
A f r a m e is a collection of sequences, which we refer to as frame elements, that generates a finite representation of a signal via the inner product of the signal with the frame elements. An overview of frame theory can be found in [1, 2]. Mathematically, a collection of sequences {Vm} in a Hilbert space ]HI, m c Z, the set of integers, forms a frame for ]HI, if there exist two numbers A and B, called the lower and upper frame bound, respectively, such that 0 < A < B < c~ and
Allfll 2 ~ ~-~.l (f, vm)12 <_ Bllffl 2,
v f e H.
(7.5.1)
m
Every f c H can be reconstructed using a corresponding dual frame {Um} as f = E m (f, vm) um = E,~ (f, Urn) Vm. The corresponding dual frame {Um} in H is defined by Um = S-lyre, where S is the frame operator defined via S f = E m (f, Vm)vm. A collection {urn} in IH[ is defined to be a r e c o n s t r u c t i o n c o l l e c t i o n [3] (RC) for {vm} if every x e ]HI can be written as x = ~-~m (z, Vm) urn. It is assumed that all the collections or sequences described in this article belong to the same Hilbert space of interest. An RC {urn} is related to a frame {Vm} via the following theorem [3]. T h e o r e m 7.5.1: A collection {urn} is a reconstruction collection for a frame {Vm} for ]HI if and only if =
-
1).
(7.5.2)
m
The RC concept is similar to the pseudo-frame concept [4], where two Bessel sequences {urn} and {Vm} are a pseudo frame and pseudodual frame pair for IE if and only if (f, g) = }-~m (f, v,~)(urn, g), Vf, g C ]HI. The pseudo-frame theory, however, does not restrict the analysis collection to be a frame. By restricting the analysis collection to be a frame, the boundedness and continuity of the transform is ensured. ~ J o e l M. M o r r i s , Computer Science and Electrical Engineering Department, University of Maryland Baltimore County, 1000 Hilltop Circle, Catonsville, MD 21250, USA (morris@umbc.edu), and S a n j a y M. Joshi, Lucent Technologies, 1701 Harbor Bay Parkway, Alameda, CA 94502, USA (joshi@ieee.org). Reviewers: X.-G. Xia, R. Baraniuk, and M. J. Bastiaans.
316
Chapter 7:
Measures, Performance Assessment and Enhancement
The spaces of interest are the spaces of periodic and non-periodic square-summable sequences. The space 12(Z/L) is the space of all sequences f(k) periodic in k with integer period L. The inner product for 12(Z/L)is defined as (x,y) = L-1 ~]k=o x(k)y*(k). The designation L represents the set { 0 , 1 , 2 , . . . , L - 1}. The space/2(Z) is the space of all square-summable sequences, with the inner product defined as (x, y ) - ~-]k~___cr x(k)y*(k).
7.5.2
Product-Function Frames
Product-function frames (PFFs) generalize the windowed transforms, such as the discrete-time Gabor transforms, as seen in the following subsections.
7.5.2.1
PFF for Periodic Spaces
A P F F with two defining factor functions gm and r in 12(Z/L) is defined via {gmr The elements of this frame are (gmr - gm(k)r where m e M and n e N. Since the span of {gmr is 12(Z/L), it is necessary that M N should be greater than or equal to its dimension L, i.e., M N >_L. We choose the collection {r as an orthonormal basis for 12(z/g) and take L = M N ' = M ' N , where M ' and g ' are integers. This implies 12(z/g) C 12(z/n). The following theorem [5] characterizes the frame condition for {gmr in terms of matrices G(i), i c N, whose elements are given by M-1
Gqp(i) - E
gm(i + qN)gm(i + pN),
p,q e M'.
m--O
Theorem
7.5.2" A collection {gmr
is a frame for 12(Z/i) iff G(i) are non-
singular Vi c N. Let an RC of a frame {gmr be denoted by {Vmr defined similar to {gmr The collections {gmr and {Vmr are related via the following theorem [5]. T h e o r e m 7.5.3: A frame {Vmr gm (k) and %n (k) satisfy
is a reconstruction collection for {gmr
iff
M-1
E
g,,~(k + qN)Vm(k + pN) - 5 ( q - p),
k e N and q,p C M'.
(7.5.3)
m--O
Let gin(k) - g ( k - mN') be a time-shifted version of a sequence g(k), where N ' is the time-shift parameter. Equation (7.5.3) then becomes M-1
E
g* (k + q g - m N ' ) v ( k + p N - mN') - 5(q - p),
k e N, q, p e M'.
(7.5.4)
m--O
This condition is the same as the one for the G a b o r t r a n s f o r m in [6] and in several other papers in different forms. Since r can be chosen as any orthonormal
317
Joint-Domain Representations via Discrete-Domain Frames
transform, the same pair of g and ~/ can be used for any windowed orthonormal transform. W i n d o w e d t r a n s f o r m s are equivalent to oversampled (or criticallysampled) m o d u l a t e d f i l t e r b a n k s [7]. The relationship between the analysis and synthesis filters is in the time domain as given by Eq. (7.5.4). Instead of using a single window for the Gabor transform, we can use a number of windows, resulting in a m u l t i - w i n d o w G a b o r t r a n s f o r m [8]. The equations are modified such that grn is replaced by gp,m, where p denotes the window number and m denotes the shift. We now define a new family of transforms for periodic discrete-time sequences called p r o d u c t t r a n s f o r m s [5]. Let g i n ( k ) - g(k)~bm(k) and ~ / m ( k ) - "),(k)~bm(k), where {r is an orthonormal basis for 12(Z/M). From Theorem 7.5.3, g and ~' must satisfy M-1
g* (k + qN)7(k + pN) ~
~;~(k + qN)~m(k + pN) - ~(q - p),
(7.5.5)
m--0
where k 9 N and p, q 9 M', to guarantee {gmr and {TmCn} are reconstruction collections for each other. The dimension of the space spanned by {gCmCn } is equal to the least common multiple LCM (M, N) [5]. This scheme has been utilized for fast computation of the DFT, i.e., F F T , by using N and M-point DFTs as {r and {~m}, where M and N are relativelyprime integers. The same algorithm can be used for any orthonormal transform of periodic sequences. The frame-theoretic structure derived above allows selecting different transforms for {r and {r to obtain a variety of transforms.
7.5.2.2
PFF for Non-Periodic Spaces
For 12(Z), the elements of a P F F {gmOn} are (gmOn)(k) - gm(k)r like the periodic case. The indices m and n are integers and m c Z and n 9 N. Following Daubechies' method [1], a sufficient condition for a product-function collection in 12(Z) to be a frame can be stated as follows [3].
7.5.4: L e t {g,cCn} be a collection in 12(Z), where {r } is an orthonormal basis for 12(Z/N). If
Theorem
r e ( g ) --
inf ~
lEZ
Ig~(/)l ~ > 0,
M(g)
m
-
m
9 Z, n 9 N,
sup E ]gm(l)[2 < c~, IEZ
and
(7.5.6)
m
and sup
+
-
<
(7.5.7)
sEZ
for some ~ > O, where fl(s) - sup E IEZ
then {g,cCn } is a frame.
m
Igm(l)gm(1 - s)l,
(7.5.8)
318
Chapter 7:
Measures, Performance Assessment and Enhancement
Examples of such collections are windowed transforms and a new scale-modulation transform discussed later in this article. A necessary condition for a frame is as follows [3]. C o r o l l a r y 7.5.5: If a collection {green} defined in Theorem 7.5.~ is a frame for /2(Z), then Eq. (7.5.6) is true. For a P F F {green}, let an RC be defined as {~/mCn}. Note that this is not the only solution possible. Substituting in the generalized reconstruction condition (Eq. 7.5.2), we get gt,(k
-
-
(7.5.9)
m
Equation 7.5.9, thus, is a necessary and sufficient condition for a collection {~'mCn} to be an RC for a frame {green }. For a windowed transform, such as the discrete-time Gabor transform, gm(k) is defined as a shifted version of a mother window g(k), i.e., gin(k) - g(k - m M ) , where M > 1 is known as the shift-parameter. Assuming that ")/m(k) also has the same structure (it is not necessary), the reconstruction condition (Eq. 7.5.9) becomes ~-~.g*(k - m M - ng)~/(k - raM) = 5(n), (7.5.10) m
which is analogous to the biorthogonal-like condition [9] for defining pairs of biorthogonal-like sequences (g(k)} and {~'(k)} used for the discrete-time Gabor expansion in /2(Z). Collections (gmn} and (~/mn}, where gmn(k) = g ( k - nN)e j2~rmk/M, g(k) C/2(Z), and ~mn defined similarly, are a pair of collections of biorthogonal-like sequences if N (g, 7 (uv) ) - -~5(u)5(v), (7.5.11) where 7(uV)(k) - 7(k - uM)e j27rvk/N [9]. Equation 7.5.11 is called the b i o r t h o g o nal-like c o n d i t i o n . The product-function frame structure separates the computation of the window functions from the orthonormal basis used, thus reducing the complexity. The same windows g and 7, for instance, could be used with the discrete cosine transform (DCT) to obtained a windowed DCT. Thus, we have a generalized structure for windowed transforms. If the frame element component gm is replaced by a component gpq, where gpq(k) - g p ( k - qT), the necessary and sufficient conditions for the m u l t i - G a b o r e x p a n s i o n s in /2(Z) are obtained. In this frame, gp are different windows, T is the time-shift parameter, and qT is the shift of each window. This structure, hence, extends the multi-window Gabor schemes described in [8] for the finite-dimensional space of periodic signals, 12(z/n), to the infinite-dimensional space/2(Z). A new type of transform using a s c a l e - m o d u l a t i o n structure [3] is now derived from the PFFs. Consider C L, the space of length-L complex sequences, where for two sequences x(k) and y(k), k c L, in C L, the inner product is defined as
Joint-Domain Representations via Discrete-Domain Frames
319
Fig. 7.5.1" Reconstruction collection windows '~'m for the scaled windows discussed in the text.
( x ~ y ) - ~-~k=O L - 1 x ( k ) y * (k) and the norm is defined via I l x l l - y/(x, x). It is a finitedimensional subspace of/2(Z) with dimension L. Let L - M N , where M and N are integers. Define gin(k), m - 0, 1, 2 , . . . , M - 1, to be scaled versions of a mother window, each of length (m + 1)N and norm 1. For example, an exponentiallydecaying mother window generates scaled windows given by gin(k) = e - d k / ( ( m + l ) g ) , where 0 _< m _< M - 1, 0 < k < (m + 1)N. Let the r elements used with these windows be r = e j 2 ~ n k / g , where 0 < n, k _ N - 1. Note that the r elements are periodically extended to a length of M N . For such a frame with d = 4 and N -- 16, the corresponding RC elements are shown in Fig. 7.5.1. This new transform was used in [3] to analyze an exponentially decaying modulated signal, as found in magnetic resonance techniques. It was shown that the scale-modulation transform is likely to yield better signal estimation in the presence of noise. The shift-modulation structure of the Gabor transforms has been in use for several years. The shift-scale structure of the wavelet transforms is a more recent development. The obvious third choice is the scale-modulation structure, which is provided by the scale-modulation transform.
7.5.3
Cascaded Frames
Although the product-function structure unifies the windowed transforms, it is not a good structure to represent the other popular joint time-frequency analysis scheme: wavelets. The discrete wavelet transform (DWT) and all of its variations can be unified under a second structure, cascaded frames. In simpler t e r m s , a cascaded
320
Chapter 7:
Measures, Performance Assessment and Enhancement
&D--Ho Fig. 7.5.2" Perfect reconstruction filterbank.
frame is a 'transform of transforms', although the discussion is presented in terms of the more general frames concept. Suppose a frame {U0,m} maps a signal f in 12(Z/Lo) to a vector of coefficients C1 - - [C.I,0,Cl,I,...,Cl,L1], L1 ~_ Lo. The coefficients themselves can be treated as a signal in 12(Z/L1). We can then use a frame {ul,n} for 12(Z/L1) to map the v e c t o r E l t o another vector c2 in some 12(Z/L2), L2 >_ L1. A signal in 12(Z/Lo), thus, is effectively mapped to a coefficient vector c2 in 12(Z/L2). This can be viewed as a frame {Up}, p E L._.~2,operating on a signal f in 12(Z/Lo). We call this structure a cascaded frame. The following theorem proves that the cascaded structure corresponds to a frame [3]. T h e o r e m 7.5.6: Suppose {no,m}, m e L_A, is a frame for 12(Z/Lo), and {Ul,n}, n c L2, is a frame for 12(Z/L1), where Lo <_ LI < L2. Then there exists a frame {Up}, p e L___22,for 12(Z/Lo) obtained by cascading {U0,m} and {Ul,n}. When the number of elements in a frame for a finite-dimensional space is finite, frame theory is characterized in terms of matrix theory. Computation of an RC becomes an inversion problem of square matrices in the case of critically-sampled frames, and that of rectangular matrices in the case of oversampled frames. The cascaded structure can be extended to/2(Z) via the following theorem [3]. 7.5.7: If {urn}, m e Z, and {Vn}, n e Z, are frames for/2(Z), then cascading {urn} and {Vn} results in a frame for/2(Z).
Theorem
We now discuss some examples of cascaded frames in/2(Z). Consider a perfect reconstruction (PR) filterbank (FB) shown in Fig. 7.5.2. One of the analysis filters, say G0, is typically a low-pass filter and the other is a high-pass filter. The output of each filter, thus, gives the information about the signal in a particular time period in the corresponding frequency band. In d i s c r e t e w a v e l e t t r a n s f o r m s (DWT), the output of GO is further processed using the same filterbank, thus increasing the frequency resolution at the cost of reducing the time resolution for the transform coefficients. This cascading process is continued repeatedly to obtain the complete DWT. P R is guaranteed as long as the corresponding synthesis FB is present on the synthesis side. In w a v e l e t p a c k e t t r a n s f o r m s (WPT), the output of any filter may be subjected to another layer of cascading filters. This gives much more flexibility in
Joint-Domain Representations via Discrete-Domain Frames
x~
L-DFT
321
L-IDFT
Fig. 7.5.3: Cascaded WPT and DFT.
x(O) o X(1) r . .
:
> yo!O!
~
:1
r
~
~
X(7) e
~
_--1
o
r
GO
:l
Fig. 7.5.4: An Extension of a WPT. the output signals.
G1
/
~
.d
"-]
> Yot,) > Yl(0)
G0
>
Yl(1)
G1
>
Y2!0!
> Y2 ti)
~- Y3(0) > Y3(1)
z(0) through x(7) are input signal samples. Y0 through Y3 are
choosing the time and frequency resolution for the transform coefficients. In hyb r i d W P T , the filters GO and G1 may be different for each cascading layer. All these results can be readily extended to 12(Z/L) [10]. We need not restrict ourselves to cascaded filterbanks only. We can replace some of the filterbanks by any frame. Even a frame for a periodic space 12(Z/L), when used as a filterbank, provides a frame for/2(Z), since it is effectively a windowed transform, where the window is a rectangular window of length L moved by L samples for each shift [3]. Thus, we can combine the W P T with the D F T to obtain different T F plane partitions. Fig. 7.5.3 shows such a scheme. Instead of cascaded filterbanks, we" can use the D F T structure repeatedly to obtain various T F plane partitions, resulting in b l o c k t r a n s f o r m p a c k e t s . New arrangements of the PRFBs, such as shown in Fig. 7.5.4, can be used to obtain more extensions of the W P T . The box labeled 'GO G I' is an analysis FB like the one shown in Fig. 7.5.2. Similar to the wavelet transforms, all these frames can be easily extended to 12(Z/L) [10]. 7.5.4
Summary
and Conclusions
In this article, we have seen how two frame structures, product-function frames and cascaded frames, generalize the popular transform techniques for joint-domain representation: discrete-time Gabor transforms for periodic and non-periodic spaces, and the discrete-time wavelet transforms. The necessary and sufficient conditions for the elements to form a frame were stated. It was demonstrated that the popular discrete joint-domain representation techniques, namely, the discrete-time Gabor and wavelet transforms, are special cases of these structures. Several new transforms, namely, product-transform, scale-modulation transform, and various extensions of
322
Chapter 7:
Measures, Performance Assessment and Enhancement
the discrete wavelet transforms, were derived as special cases of these structures. These two structures result in a variety of discrete transforms, giving a user more freedom in choosing a transform better suited for a particular type of signal. To conclude, we believe that this work paves the way for m a n y new interesting discretedomain representation schemes in signal processing.
References [1] I. Daubechies, "The wavelet transform, time-frequency localization and signal analysis," IEEE Trans. Information Theory, vol. 36, pp. 961-1005, September 1990. [2] C. E. Hell and D. F. Walnut, "Continuous and discrete wavelet transforms," SIAM Review, vol. 31, pp. 628-666, December 1989. [3] S. M. Joshi, Joint-Domain Representations Using Discrete Domain Frames. PhD thesis, Computer Science and Electrical Engineering Department, University of Maryland, Baltimore County, Catonsville, MD 21250, July 2000. [4] S. Li and D. M. Healy, Jr., "A parametric class of discrete Gabor expansions," IEEE Trans. Signal Processing, vol. 44, pp. 201-211, February 1996. [5] S. M. Joshi and J. M. Morris, "Some results on product-function frames," Signal Processing, vol. 80, pp. 737-740, April 2000. [6] M. R. Portnoff, "Time-frequency representation of digital signals and systems based on short-time Fourier analysis," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 28, pp. 55-69, February 1980. [7] H. BSlcskei and F. Hlawatsch, "Oversampled modulated filter banks," in Gabor Analysis and Algorithms: Theory and Applications (H. G. Feichtinger and T. Strohmer, eds.), ch. 9, pp. 295-322, Berlin/Boston: Birkh~user, 1998. [8] M. Zibulski and Y. Y. Zeevi, "Discrete multiwindow Gabor-type transforms," IEEE Trans. Signal Processing, vol. 45, pp. 1428-1442, June 1997. [9] J. M. Morris and Y. Lu, "Generalized Gabor expansions of discrete-time signals in 12(Z) via biorthogonal-like sequences," IEEE Trans. Signal Processing, vol. 44, pp. 1378-1391, June 1996. [10] G. Strang and T. Q. Nguyen, Wavelets and Filter Banks. Wellesley, MA: WellesleyCambridge Press, 1996.
Chapter 8
M ulti-Sensor and Time-Space Processing This chapter presents time-frequency methods suitable for multi-sensor and timespace processing. The topic is covered in five articles with appropriate crossreferencing to other relevant chapters. In underwater acoustics and telecommunications, separation of signal mixtures is traditionally based on methods such as Independent Component Analysis (ICA) or Blind Source Separation (BSS). These can be formulated using TFDs for dealing with the case when the signals are non-stationary (Article 8.1). Multi-sensor data can be processed with TFDs for channel estimation and equalization. In Blind Source Separation (BSS) and Direction of Arrival (DOA) estimation problems, the time-frequency approach to array signal processing leads to improved spatial resolution and source separation performances. Methods include time-frequency MUSIC, AD-MUSIC and TFD-based BSS (8.2). In sensor array processing, for source localization, TFDs provide a good framework for hypothesis testing as they possess additional degrees of freedom provided by the t and f parameters (8.3). TFD-based array detection is formulated using the Weyl correspondence. The TFD-based structure allows the optimal detector to be implemented naturally and efficiently (8.4). In the underdetermined case, the time-frequency formulations, methodologies and algorithms for BSS are implemented using two different approaches, namely vector clustering and component extraction. Algorithms and MATLAB TM code for timefrequency BSS using the above-mentioned methods are provided (8.5).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
323
324
8.1
Chapter 8:
Multi-Sensorand Time-Space Processing
BLIND SOURCE SEPARATION USING TIME-FREQUENCY DISTRIBUTIONS 0
Blind source separation (BSS) is a fundamental problem in signal processing that is sometimes known under different names: blind array processing, signal copy, independent component analysis, waveform preserving estimation, etc. In all these instances, the underlying model is that of n 'statistically' independent signals whose m (possibly noisy) mixtures are observed. Neither the structure of the mixtures nor the source signals are known to the receivers. In this environment, we want to identify the mixtures (blind identification problem) and decouple the mixtures (blind source decoupling). BSS can have many applications in areas involving processing of multi-sensor signals. Examples of these applications include: Source localization and tracking by radar and sonar devices; speaker separation (cocktail party problem); multiuser detection in communication systems; medical signal processing (e.g., separation of EEG or ECG signals); industrial problems such as fault detection; extraction of meaningful features from data, etc. Research in this area has been very active over the last two decades. Surprisingly, this seemingly impossible problem has elegant solutions that depend on the nature of the mixtures and the nature of the source statistical information. Most approaches to blind source separation assume (explicitly or not) that each source signal is a sequence of independently and identically distributed (i.i.d.) variables [1]. In this context, the blind source separation is possible only if at most one of the sources has a Gaussian distribution. In contrast, if the source sequences are n o t i.i.d., we can blindly separate the sources even for Gaussian processes. Several authors have considered the case where each source sequence is a temporally correlated stationary process [2], in which case blind source separation is possible if the source signals have different spectra. Other contributors [3, 4], have addressed the case where the second 'i' of 'i.i.d.' is failing, that is, the non stationary case. Our focus is the exploitation of signal non stationarity for blind source separation. In this case, one can use time frequency analysis to separate and recover the incoming signals. The underlying problem can be posed as a signal synthesis from the time frequency (t-f) plane with the incorporation of the spatial diversity provided by the multisensor. Moreover, the effects of spreading the noise power while localizing the source energy in the t-f domain amounts to increasing the signal to noise ratio (SNR) and hence improved performance. This article reviews the BSS techniques that exploit the joint eigenstructure of a combined set of time frequency distribution matrices. The first part is devoted to the BSS problem of instantaneous mixtures while the second part considers the general case of blind separation of convolutive mixtures. ~ Karim Abed-Meraim (Sig. & Image Proc. Dept., Telecom Paris, France, abed@tsi.enst.fr), Adel Belouchrani (EE Dept., Ecole Nationale Polytechnique, Algiers, Algeria, belouchrani@hotmail.com), and Rahim Leyman (Center for Wireless Communication, National University of Singapore, larahim@cwc.nus.edu.sg). Reviewers: B. Barkat and A. Cickochi.
Blind Source Separation Using Time-Frequency Distributions
8.1.1
325
Separation of Instantaneous Mixtures
8.1.1.1
Data Model
Consider m sensors receiving an instantaneous linear mixture of signals emitted from n _< m sources. The m x 1 vector x(t) denotes the output of the sensors at time instant t which may be corrupted by additive noise n(t). Hence, the linear data model may be given by" x(t) - A s ( t ) + n(t),
(8.1.1)
where the m x n matrix A is called the 'mixing matrix'. The n source signals are collected in a n x 1 vector denoted s(t) which is referred to as the source signal vector. The sources are assumed to have different structures and localization properties in the time-frequency domain. The mixing matrix A is full column rank but is otherwise unknown. In contrast to traditional parametric methods, no specific structure of the mixture matrix is assumed.
8.1.1.2
Spatial Time-Frequency Distributions
The discrete-time form of a quadratic-class time-frequency distribution (TFD), for a signal x(t), may be expressed (cf. [5] and Article 6.1) as oo
D=x(t, f) -
oo
E
E
l~--CX)
m ~ - - o o
r
1)x(t + m + 1)x*(t + m - 1)e-j4~yt
(8.1.2)
where t and f represent the time index and the frequency index, respectively. The kernel r l) characterizes the distribution and is a function of both the time and lag variables. The cross-TFD of two signals xl (t) and x2(t) is defined by oo
Dzlx2(t,f)-
oo
~ l =
-
~ cx~ m
=
r
1 ) x l ( t + m + l ) x ~ ( t + m - 1 ) e -j4~ryt
(8.1.3)
-- cx)
Expressions (8.1.2) and (8.1.3) are now used to define the following data spatial
time-frequency distribution (STFD) matrix, (x)
Dxx(t, f) -
~ l~--OO
(x)
E
r
+ m + 1)x H(t + rn - 1)e-ya~yt
(8.1.4)
m----- oo
where [Dxx(t, f)]ij = Dz,xj (t, f), for i , j = 1 , . . . , m. x H denotes the conjugate transpose of x. Under the linear data model of equation (8.1.1) and assuming noise-free environment, the STFD matrix takes the following structure: Dxx(t, f) - ADss(t, f ) A g
(8.1.5)
where Ds~(t, f) is the source T F D matrix whose entries are the auto- and crossTFDs of the sources, respectively, defined as Ds~s~(t, f) and Ds~sj (t, f) for i ~: j.
Chapter 8:
326
8.1.1.3
Multi-Sensorand Time-Space Processing
BSS based on STFD
Let us now briefly review the principle of blind source separation based on the STFD [3, 6].
Whitening: Let W denotes a n x m matrix, such that ( W A ) ( W A ) H - I, i.e. U = W A is a n x n unitary matrix (this matrix is referred to as a whitening matrix, since it whitens the signal part of the observations). Pre- and Post-multiplying the STFD matrices Dxx(t, f) by W leads to the whitened STFD-matrices, defined as: Dxx(t, f) = W D x x ( t , I ) W g
(8.1.6)
From the definition of W and Eq. (8.1.5), we can express Dxx(t , f) as Dxx(t , f) = V D ~ ( t , f ) U g
(8.1.7)
Note that the whitening matrix can be computed in different ways. It can be obtained, for example, as an inverse square root of the observation autocorrelation matrix [3] or else computed from the STFD matrices as shown in [7].
Source separation using joint diagonalization (JD)" By selecting auto-term t-f points, the whitened STFD matrices will have the following structure, Dxx(t, f) = V D ~ ( t , f ) U H
(8.1.8)
where D ~ ( t , f) is diagonal. 1 This expression shows that any whitened STFD-matrix is diagonal in the basis of the columns of the matrix U (the eigenvalues of Dxx (t, f) being the diagonal entries of D ~ ( t , f)). If, for a point (t, f) of the time frequency domain, the diagonal elements of D ~ ( t , f) are all distinct, the missing unitary matrix U may be uniquely (up to permutation and scaling ambiguity 2) retrieved by computing the eigendecomposition of Dxx(t, f). Indeterminacy occurs in the case of degenerate eigenvalues, i.e., when D~,~, (t, f) -- D~sj (t, f), i ~ j. It does not seem possible to a priori choose the t-f point such that the diagonal entries of D ~ ( t , f) are all distinct. Moreover, when some eigenvalues of Dxx(t , f) come to degeneracy, the robustness of determining U from eigendecomposition of a single whitened STFD matrix is seriously impaired. The situation is more favorable when considering joint diagonalization of a combined set {Dxx(ti , fi)li = 1 , . . . ,p} of p (source auto-term) STFD matrices. This amounts to incorporating several t-f points in the source separation problem which reduces the likelihood of having degenerate eigenvalues. It is noteworthy that two source signals with identical t-f signatures cannot be separated even with the inclusion of all information in the t-f plane. The joint diagonalization (JD) [2] of a set {Mklk = 1..p} of p matrices is defined as the maximization of the JD criterion: 1Since the off-diagonal elements of Dss(t, f ) are cross-terms, the source T F D matrix is quasidiagonal for each t-f point that corresponds to a true power concentration, i.e. a source auto-term. 2The BSS problem has inherent ambiguity concerning the order and amplitudes of the sources.
Blind Source Separation Using Time-Frequency Distributions p
C ( V ) de__f~
327
n
IvHMkv~I2
~
(8.1.9)
k=l i--1
over the set of unitary matrices V - I v 1 , . . . , Vn]. An efficient joint approximate diagonalization algorithm exists in [2] and it is a generalization of the Jacobi technique [8] for the exact diagonalization of a single normal matrix.
Source separation using joint anti-diagonalization (JAb)" By selecting crossterm t-f points, the whitened STFD matrices will have the following structure: Dxx(t, f) = VDss(t, f)U H
(8.1.10)
where Ds~(t, f) is anti-diagonal. 3 The missing unitary matrix U is retrieved by Joint Anti-Diagonalization (JAD) of a combined set {Dxx(ti , f/)li = 1 , . . - , q} of q source cross-term STFD matrices [6]. The joint anti-diagonalization is explained by first noting that the problem of anti-diagonalization of a single n x n matrix N is equivalent 4 to maximizing n C ( N , V ) def _ ~
Iv~Nv~I 2
(8.1.11)
i=1
over the set of unitary matrices V = [ V l , . . . , Vn]. Hence, JAD of a set {Nk[k = 1..q} of q n x n matrices is defined as the maximization of the J A b criterion: q q n C ( V ) de__f~ C ( N k ' V ) - - ~ ~ k=l k=l i=1
Iv~Nkv~l 2
(8.1.12)
under the same unitary constraint. More generally, we can combine joint diagonalization and joint antidiagonalization of two sets {Mklk = 1..p} and {Nklk = 1..q} of n x n matrices by maximizing the J D / J A D criterion: C ( V ) de__f~ i=1
Iv~mkv~l 2
Iv/Hgkvi[ 2k=l
(8.1.13)
k=l
over the set of unitary matrices V = [ v l , . . . , Vn]. The combined J D / J A D criterion can be applied to a combined set of p (source auto-term) STFD matrices and q (source cross-term) STFD matrices to estimate the unitary matrix U. R e m a r k s " (1) The success of the JD or JAD of STFD matrices in determining the unitary matrix U depends strongly on the correct selection of the auto-term and cross-term points [6]. Therefore, it is crucial to have a selection procedure that 3Since the diagonal elements of Dss(t, f) are auto-terms, the source TFD matrix is quasi antidiagonal (i.e. its diagonal entries are close to zero) for each t-f point corresponding to a cross-term. 4This is due to the fact that the Frobenius norm of a matrix is constant under unitary transform, i.e. norm(N)--norm(VHNV).
Chapter 8:
328
Multi-Sensor and Time-Space Processing
is able to distinguish between auto-term and cross-term points based only on the STFD matrices of the observations. In [6], a selection approach that exploits the anti-diagonal structure of the source cross-term STFD matrices has been proposed. More precisely, for a source cross-term STFD matrix, we have Trace(Dxx(t , f)) -
Trace(UD.~(t, f ) U H)
= Trace(Dss (t, f)) ~ 0. Based on this observation, the following testing procedure applies: if if
Trace(Dxx (t,f))
norm(Dxx(t,f))
< ~
~
decide that (t, f) is a cross-term
Trace (D~(t,f)) norm(D___xx(t,f))
> s
)
decide that (t, f) is an auto-term
where e is a 'small' positive real scalar. (2) In practice, the source cross-term STFD matrices will not be purely antidiagonal. This is because some auto-terms, through their side lobes or main lobes, will intrude over the cross-term regions. The cross-terms will be however the dominant components. This situation is similar to the earlier work on joint diagonalization of STFD matrices selecting auto-term points [3], where the source auto-term STFD matrices are not purely diagonal because of cross-term intrusion. This impairment is mitigated by the joint approximation property of the JD//JAD algorithm and by its robustness. (3) Other classes of TFDs and techniques can also be used in BSS. In [9] a cumulantbased 4th-order Wigner distribution or Wigner trispectrum was used for source separation. In [10] blind separation of more sources than sensors (underdetermined BSS problem) has been solved using a T F domain orthogonality concept. Implementation details and the corresponding MATLAB TM code of the above algorithm are presented in Article 8.4. Sample code for computation of T F D s is given in Article 6.5.
8.1.2 8.1.2.1
Separation of Convolutive Mixtures Data Model
Consider now a convolutive multiple input multiple output (MIMO) linear time invariant model given by: n
xi(t) - E
L
E
aij(1)sj(t - l) for i = 1 , . . . , m
(8.1.14)
j=~ /=o
where sj(t), j - 1 , . . . ,n, are the n source signals (model inputs), xi(t), i = 1 , . . . , m, are the m > n sensor signals (model outputs), aij is the transfer function between the j - t h source and the i-th sensor with an overall extent of ( L + 1) taps. As
Blind Source Separation Using Time-Frequency Distributions
329
before, the sources are assumed to have different structures and localization properties in the time-frequency domain and the channel matrix A defined in (8.1.16) is full column rank. In matrix form, we have x(t) - As(t) (8.1.15) where
s(t)
--
x(t) -
[81 (t),""", [xl(t),""
81(t
--
A -
with
(L + L') + 1),..., Sn(t - (L + L') + 1)] T
,xl(t- L'+
I
0
,xm(t-L'+
All
"'"
Aln
"
"..
:
Aml
"""
Amn
I aij(O) ... Aij -
1),...
aij(L)
...
"..
"..
"..
...
aij(O)
...
1)] T
1
(8.1.16)
0 .
(8.1.17)
aij(L)
Note that A is a [mL' • n(L + L')] matrix and Aij are [L' • (L + L')] matrices. L' is chosen such that mL' >_n(L + L'). We retrieve here the same formalism as in the instantaneous mixture case. The data S T F D matrices still have the same expression as in (8.1.5). But the source auto-term (respectively, cross-term) matrices Dss(t, f) are no longer diagonal (respectively, anti-diagonal), but block-diagonal 5 (resp., block a n t i - d i a g o n a l ) w h e r e each diagonal block is of size (L + L') • (L + L'). It is this block-diagonal or block anti-diagonal structure that we propose to exploit, in the next subsection, to achieve BSS.
8.1.2.2
BSS using STFD Matrices
In this subsection we generalize the BSS method developed earlier to the case of convolutive mixtures.
Whitening: The first step of our procedure consists of whitening the data vector x(t). This is achieved by applying to x(t) a whitening matrix W, i.e. a [n(L' + L) x mL'] matrix verifying: W l iT-~oo m ( ~1 ET x(t)x(t)H ) w H - W R = W H - ( W A R s ~ ) ( W A R ~ )
H - I
t--1
5The block diagonal structure comes from the fact that the cross-terms between s i ( t - d) a r e n o t z e r o a n d d e p e n d o n t h e l o c a l c o r r e l a t i o n s t r u c t u r e of t h e s i g n a l .
(8.1.18)
si(t)
and
Chapter 8:
330
Multi-Sensorand Time-Space Processing
where Rx and Rs denote the autocorrelation matrices 6 of x(t) and s(t), respectively. Equation (8.1.18) shows that if W is a whitening matrix, then !
U - WAR2
(8.1.19)
is a [n(L ~ + L) • n(L'+ L)] unitary matrix where Rs { (Hermitian square root matrix of Rs) is block diagonal. The whitening matrix W can be determined from the eigendecomposition of the data autocorrelation Rz as in [3].
Separation using joint block diagonalization: Consider now the whitened STFD matrices D__xx(t, f) defined in (8.1.6). By (8.1.15) and (8.1.19), we obtain the key relation: Dxx(t , f) - U R / 8 9
f ) R / 8 9 g = VD(t, I ) U g 1
(8.1.20)
1
where we have set D(t, f) - R~-~Ds~(t, I ) R ; ~. Since the matrix U is unitary and D(t, f) is block diagonal, the latter just means that any whitened STFD matrix is block diagonal in the basis of the column vectors of matrix U. The unitary matrix can be retrieved by computing the block diagonalization of some matrix D__xx(t, f). But to reduce the likelihood of indeterminacy and increase the robustness of determining U, we consider the joint block diagonalization of a set {D• fi); i - 1 , . . - , p } of p whitened STFD matrices. 7 This joint block-diagonalization (JBD) is achieved by the maximization under unitary transform of the following criterion, p
C ( V ) de_f ~
n
(L'+L)l
~
~
lu*D___xx(tk, fk)Ujl 2
(8.1.21)
k=l l=l i , j = ( L ' + L ) ( l - 1 ) + l
over the set of unitary matrices U - [ u l , . . . , Un(U+L)]. Note that an efficient Jacobi-like algorithm for joint block diagonalization algorithm exists in [11,12]. Once the unitary matrix U is determined (up to a block diagonal unitary matrix D coming from the inherent indeterminacy of the JBD problem [13]), the recovered signals are obtained up to a filter by ~(n) - U H W x ( n )
(8.1.22)
According to (8.1.15) and (8.1.19), the recovered signals verify, 1
~,(n) - D R s ~ s ( n )
(8.1.23)
1
where, we recall that, the matrix R~ ~ is block diagonal and D is a block diagonal unitary matrix. 6In practice, Rx and Rs are replaced by their time-averaged estimates, e.g., Rx ()-] T=1 x ( t ) x ( t ) H ) / T .
--
7A similar procedure can be used with joint block anti-diagonalization of source cross-term STFD matrices.
Blind Source Separation Using Time-Frequency Distributions
331
Fig. 8.1.1: WVD of two mixed signals at 0 dB SNR.
R e m a r k s : (1) In practice, only n signals among the n(L' + L) recovered ones are selected. This is done by choosing the signals which lead to the smallest cross-terms coefficients. Note t h a t this information is a byproduct of the joint block diagonalization procedure and hence this selection needs no additional computations. (2) Here we have considered source separation up to a filter, instead of the full MIMO deconvolution procedure. Note that if needed a SIMO (Single Input Multi Output) deconvolution/equalization [14] can be applied to the separated sources.
8.1.3
Illustrative Examples
The two following examples illustrate the effectiveness of the time frequency approach in achieving blind separation of non-stationary sources.
8.1.3.1
Separation of Instantaneous Mixtures
In this experiment, we consider two chirp signals (n - 2), depicted by sl(t) = exp(-j0.0047rt 2) ;
s2(t) = exp(-j0.0047rt 2 - jTr0.4t) ,
impinging on an array of rn = 5 sensors at 30 and 60 degrees. W h i t e Gaussian noise was added, leading to an SNR of 0 dB. The Wigner-Ville distribution (WVD) of the mixture at the middle sensor is depicted in Figure 8.1.1. From Figure 8.1.1, we selected eight arbitrary t-f points, among which one was a cross-term. By applying the J D / J A D algorithm, we obtain the results displayed in Figure 8.1.2 with a signal rejection level estimate of-26 dB.
8.1.3.2
Separation of Convolutive Mixtures
The p a r a m e t e r settings of this example are as follows: 9 n = 2 (speech signals sampled at 8kHz), m = 3, L = 1 and L ~ = 2.
Chapter 8:
332
Multi-Sensorand Time-Space Processing
Fig. 8.1.2: WVDs of the two chirps using JD/JAD with seven auto-terms and one cross-term.
Fig. 8.1.3" Speech signal separation.
9 The transfer function matrix of the simulated multi-channel is given by:
1 A(z)
0.85 + 0.1z -1 ]
0 . 7 -~- 0 . 4 z - 1
0.25 + z -1
1 -~- 0 . 5 z - 1
0 . 7 -~- 0 . 8 5 z - 1
.
Figure 8.1.3 shows the original speech signal, their convolutive mixture and the recovered speech signals by the time frequency separation algorithm.
8.1.4
Summary and Conclusions
The problem of blind separation of linear mixtures of non-stationary source signals based on time frequency distributions has been investigated. Both instantaneous and convolutive mixture cases have been considered. For both cases, solutions based on the use of the joint matrix structure (i.e., diagonal, anti-diagonal, block diagonal,
Blind Source Separation Using Time-Frequency Distributions
333
or block anti-diagonal) of a combined set of spatial time frequency distribution matrices selected in both the auto-term and cross-term regions, have been presented.
References [1] J.-F. Cardoso, "Blind signal separation: Statistical principles," Proc. IEEE, vol. 86, pp. 2009-2025, October 1998. Special issue on blind identification and estimation. [2] A. Belouchrani, K. Abed-Meraim, J.-F. Cardoso, and E. Moulines, "Blind source separation using second order statistics," IEEE Trans. Signal Processing, vol. 45, pp. 434-444, February 1997. [3] A. Belouchrani and M. G. Amin, "Blind source separation based on time-frequency signal representation," IEEE Trans. Signal Processing, vol. 46, pp. 2888-2898, November 1998. [4] L. Parra and C. Spence, "Convolutive blind separation of non-stationary sources," IEEE Trans. on Speech ~ Audio Processing, vol. 8, pp. 320-327, May 2000. [5] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [6] A. Belouchrani, K. Abed-Meraim, M. G. Amin, and A. Zoubir, "Joint antidiagonalization for blind source separation," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), Salt Lake City, UT, 7-11 May 2001. [7] Y. Zhang and M. G. Amin, "Blind separation of sources based on their time-frequency signatures," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), vol. 5, pp. 3132-3135, Istanbul, 5-9 June 2000. [8] G. H. Golub and C. F. Van Loan, Matrix computations. Baltimore, MD: Johns Hopkins University Press, 1989. [9] A. R. Leyman, Z. M. Kamran, and K. Abed-Meraim, "Higher-order time frequencybased blind source separation technique," IEEE Signal Processing Letters, vol. 7, pp. 193196, July 2000. [10] N. Linh-Trung, A. Belouchrani, K. Abed-Meraim, and B. Boashash, "Separating more sources than sensors using time-frequency distributions," in Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), vol. 2, pp. 583-586, Kuala Lumpur, 13-16 August 2001. [11] A. Belouchrani, M. G. Amin, and K. Abed-Meraim, "Direction finding in correlated noise fields based on joint block-diagonalization of spatio-temporal correlation matrices," IEEE Signal Processing Letters, vol. 4, pp. 266-268, September 1997. [12] A. Belouchrani, K. Abed-Meraim, and Y. Hua, "Jacobi-like algorithms for joint block diagonalization: Application to source localization," in Proc. Sixth IEEE Internat. Workshop on Intelligent Signal Processing and Communication Systems (ISPACS'98), vol. 1, pp. 133-137, Melbourne, Australia, 5-6 November 1998. [13] H. Bousbia-Saleh, A. Belouchrani, and K. Abed-Meraim, "Jacobi-like algorithm for blind signal separation of convolutive mixtures," Electronics Letters, vol. 37, no. 16, pp. 1049-1050, 2001. [14] K. Abed-Meraim, W. Qiu, and Y. Hua, "Blind system identification," Proc. IEEE, vol. 85, pp. 1310-1322, August 1997.
Chapter 8:
334
Multi-Sensorand Time-Space Processing
SPATIAL TIME-FREQUENCY DISTRIBUTIONS AND THEIR APPLICATIONS 0
8.2
8.2.1
Spatial Time-Frequency Distributions
The evaluation of quadratic time-frequency distributions (TFDs) of nonstationary signals impinging on a multi-sensor receiver yields spatial time-frequency distributions (STFDs), which permit the application of eigenstructure subspace techniques to solving a large class of channel estimation and equalization, blind source separation (BSS), and high resolution direction-of-arrival (DOA) estimation problems [1-3]. STFD based techniques are appropriate to handle sources of nonstationcry waveforms that are highly localized in the time-frequency (t-f) domain. In the area of BSS, the use of the STFDs allows the separation of sources with identical spectral shape, but with different t-f localization properties, i.e., different t-f signatures. For both source separation and DOA estimation problems, spreading the noise power while localizing the source energy in the t-f domain amounts to increasing the robustness of eigenstructure signal and noise subspace estimation methods with respect to channel and receiver noise. This in turn leads to an improvement of spatial resolution and source separation performance. The quadratic class of STFD matrix of a signal vector x(t) is defined as Dxx(t, f) -
i?i?l? oo
Ix)
g(~, T)X(U + ~ ) x g ( u -- ~)eJ2"(~'u-~"-f~')dT dudl.,,
oo
(s.2.1)
where g(L,, T) is the kernel function (see Chapter 3). In narrowband array processing, when n signals arrive at an m-element array (see Fig. 8.2.1), the linear data model x(t) -- y(t) + n(t) = Ad(t)-t- n(t)
(8.2.2)
is commonly assumed, where x(t) is the m • 1 data vector received at the array, d(t) is the n x 1 source data vector, the m • n spatial matrix A = [al---a,~] represents the mixing matrix, ai is the steering vector of ith signal, and n(t) is an additive noise vector whose elements are modeled as stationary, spatially and temporally white, zero-mean complex random processes, independent of the source signals. Under the uncorrelated signal and noise assumption and the zero-mean noise property, the expectation of the crossterm TFD matrices between the signal and noise vectors is zero, i.e., Z [Dyn(t,/)] -- E [Dny(t, f ) ] - - 0, and it follows E [D~x(t,/)] = Dyy(t, f) -4- E [ D , , ( t , f)] - ADdd(t, f ) A g --t- a2I,
(8.2.3)
where cr2 is the noise power, and I is the identity matrix. Eq. (8.2.3) is similar to that which has been commonly used in array processing based on second-order statistics, ~ M o e n e s s (3. A m i n and Y i m i n Zhang, Department of Electrical and Computer Engineering, Villanova University, Villanova, PA 19085, USA (moeness~ece.villanova.edu, yimin@ieee.org). Reviewers: A. Gershman and K. Abed-Meraim.
Spatial Time-Frequency Distributions and Their Applications #1
335 #1
#2
#n
"E> #m
F i g . 8.2.1" m-element array with n signal arrivals.
relating the signal correlation matrix to the data spatial correlation matrix [1]. This implies that key problems in various applications of array processing, specifically those dealing with nonstationary signal environments, can be approached using quadratic transformations. If Ddd(t, f) is a full-rank matrix, the two subspaces spanned by the principal eigenvectors of Dxx(t, f) and the columns of A become identical. In this case, direction finding techniques based on eigenstructures can be applied. If Ddd(t, f) is diagonal, i.e., the signal cross-TFDs at the t-f point (t, f) are zeros, then both the mixing matrix and the signal waveforms can be recovered using BSS methods.
8.2.2
Fundamental Properties
There are five key advantages of array signal processing using STFD. In order to properly explain these advantages, we use the diagram in Fig. 8.2.2. We consider two sources A and B incident on a multi-sensor array. Source A occupies the t-f region Ra, whereas source B occupies the t-f region Rb. The t-f signatures of the two sources overlap, but each source still has a t-f region that is not intruded over by the other source. (1) Eq. (8.2.3) can be easily derived for any arbitrary joint-variables. Time and frequency are indeed the two most commonly used and physically understood parameters. However, by replacing the STFDs by spatial arbitrary joint-variable distributions, one can relate the sensor joint-variable distributions to the sources joint-variable distributions through the same mixing matrix A. As shown in the Examples section, there are situations where it is preferable to consider other domains such as the ambiguity lag-Doppler domain, where the locations of the signals and their cross-terms are guided by properties and mechanisms different than those associated with the t-f domain (see Article 3.2). (2) Eq. (8.2.3) is valid for all t-f points. It is well known that direction finding techniques require Ddd(t, f) to be full rank, preferably diagonal. On the other
336
Chapter 8:
Multi-Sensor and Time-Space Processing
BC
Fig. 8.2.2: Signals with different time-frequency signature.
hand, BSS techniques demand the diagonal structure of the same matrix without degenerate eigenvalues. These properties along with high signal-to-noise ratio (SNR) requirements may be difficult to achieve using a single t-f point. Two different methods can be used for integrating several t-f points into Eq. (8.2.3). One method is based on a simple averaging performed over the signatures of the sources of interest, whereas the second method is based on incorporating several desired t-f points into joint diagonalization or joint block-diagonalization schemes. (3) The TFD of the white noise is distributed all over the t-f domain, whereas the TFDs of the source waveforms are likely to be confined to much smaller regions. Referring to Fig. 8.2.2, the noise is spread over both Ra and Rb as well as the complement region Re. If the t-f points (t, f) used in either the averaging or joint diagonalization approaches belong to the noise only region Rc, then no information of the incident waveforms is used and, as such, no reasonable source localization and signal separation outcomes can be obtained. On the other hand, if all points (t, f) in Fig. 8.2.2 are used, and the employed TFD satisfies the marginal constraints, then it can be easily shown that only the signal average power is considered. As a result, the problem simplifies to the second-order covariance based matrix approach, traditionally used in high resolution DOA estimation. This is an important feature, as it casts the conventional techniques as special cases of the array signal processing framework based on t-f analysis. Finally, if we confine the (t, f) points to Ra and Rb, then only the noise part in these regions is included. The result of leaving out the points (t, f) that are not part of the t-f signatures of the signal arrivals is enhancing the input SNR, which is utilized by the source localization and signal separation techniques. (4) By only selecting t-f points that belong to the t-f signature of one source, then this source will be the only one considered by Eq. (8.2.3). This selection, in essence, is equivalent to implicitly performing spatial filtering and removing other sources from consideration. It is important to note, however, that such removal does
Spatial Time-Frequency Distributions and Their Applications
337
not come at the expense of reduction of the number of degrees-of-freedom (DOFs), as it is the case in beamspace processing, but the problem remains a sensor space processing with the original number of DOFs kept intact. This property represents a key contribution of TFDs to the direction finding and DOA estimation areas. An antenna array can be used to localize a number of sources equal or even greater than its number of sensors. The fundamental condition is that there must be tf regions over which the respective t-f signatures of the sources do not overlap. Referring to Fig. 8.2.2 and considering the case of two sensors, if all t-f points incorporated in direction finding belong to region Ra and not Rb, then the signal subspace defined by Eq. (8.2.3) is one-dimensional. Thus, by excluding source B, a one-dimensional noise subspace is established. This allows us to proceed with high resolution techniques for localization of source A. In a general scenario, one can localize one source at a time or a set of selected sources, depending on the array size, overlapping and distinct t-f regions, and the dimension of the noise subspace necessary to achieve the required resolution performance. The same concepts and advantages of t-f point selection discussed above for direction finding can be applied to BSS problems. (5) The a priori knowledge of some temporal characteristics or the nature of time-varying frequency contents of the sources of interest may permit us to directly select the t-f regions used in Eq. (8.2.3). For instance, it is known that, in the ambiguity domain, all fixed frequency sinusoidal signals map to the time-lag axis. By only incorporating the points on this axis, we have, in fact, opted to separate and localize all narrowband signals in broadband communications platforms.
8.2.3
Examples
In this Section, we present simulation examples to demonstrate the fundamental offerings discussed in the previous Section. Time-frequency MUSIC (t-f MUSIC), ambiguity-domain MUSIC (AD-MUSIC), and the BSS based on STFDs are three different techniques chosen for the demonstration. The algorithms involved in the implementation of the techniques are given in Tables 8.2.1, 8.2.2 and 8.2.3 [1,2, 4]. Example I [4]. Consider the scenario of a four-element equi-spaced linear array spaced by half a wavelength, where one chirp signal and two sinusoidal signals are received. The data record has 128 samples. All three signals have the same SNR of 20 dB. The DOAs of the chirp signal and the two sinusoidal signals are 15 ~ 10 ~ and 0 ~ respectively. While the ambiguity function of the chirp signal sweeps the ambiguity domain with contribution at the origin, the exact autoterm ambiguity function of the narrowband arrivals sl (t) and s2(t) is zero for non-zero frequencylags and may have non-zero values only along the vertical axis L, = 0. In this simulation example, we selected 24 points on the time-lag axis, excluding the origin, and as such emphasizing the narrowband components. Fig. 8.2.3 shows the ambiguity function where the two vertical lines away from the origin represent the crossterms between the sinusoidal components. Fig. 8.2.4 shows the two estimated spatial spectra for three independent trials, one corresponds to the
338
Chapter 8:
Table 8.2.1:
STEP I
Multi-Sensor and Time-Space Processing
Time-Frequency MUSIC.
Form K matrices Dxx(ti, fi) for the selected (ti, fi) points,
i=l,...,K. S T E P II
The eigenvectors of E [Dxx(t, f)] corresponding to the m - n smallest eigenvalues,el, 999, era-n, are obtained by joint blockdiagonalization, or the eigen-decomposition of averaged matrix 1
K
E
Dxx(ti, fi).
i--1
STEP III
Estimate the number of signals from the eigenvalues, and estimate the DOAs from the peaks of the t-f MUSIC spectra
f(O) = II~Ha(0> [-2 , where l~n - [ e l , - - ' ,
era-hi, and a(0) is
the steering vector corresponding to D 0 A 0.
Table 8.2.2:
Ambiguity-Domain MUSIC
Ambiguity-Domain MUSIC follows the same procedure as time-frequency MUSIC by using Dxx(Pi,Ti) instead of Dxx(ti, fi), i = 1,-.. , K .
conventional method and the other corresponds to the AD-MUSIC. There are two dominant eigenvalues for the case of the AD-MUSIC, since we have not deliberately considered the chirp signal through our careful selection of the ambiguity-domain points. It is clear that the AD-MUSIC resolves the two sinusoidal signals, while the conventional MUSIC could not separate the three signals. Example II [5]. Consider a uniform linear array of eight sensors separated by half a wavelength. Two chirp signals emitted from two sources positioned at (01,02) = ( - 1 0 ~, 10~ respectively. The data record has 1024 samples. The start and end frequencies of the chirp signal of the source at 01 a r e f s l - - 0 and f~l = 0.5, while the corresponding two frequencies for the signal of the other source at 02 are fs2 = 0.5 and f~2 = 0, respectively. Fig. 8.2.5 displays the standard deviations of the DOA estimation ~1 versus SNR. The curves in this figure show the theoretical and experimental results of the
Spatial Time-Frequency Distributions and Their Applications
339
T a b l e 8.2.3" Blind Source Separation Based on STFDs
STEP I
Estimate the auto-correlation matrix l~xx from T data samples. Denote by A a , . . . , An the n largest eigenvalues and the corresponding eigenvectors of Rxx.
S T E P II
An estimate b2 of the noise variance is the average of the m - n smallest eigenvalues of Rxx. The whitening matrix is formed as
qr -- [(/~1 -- ~2) -lhl,''" L
, ('~n -- O'2) -lhn]
H.
.J
STEP III
Form K matrices by computing the STFD of whitened vector z(t) - W x ( t ) for a fixed set of (ti, fi) points, i = 1 , . . . , K , corresponding to signal autoterms.
STEP IV
A unitary matrix U is then obtained as joint diagonalizer of the set D,.,.(ti, fi), i - 1 , . . . , K.
STEP V
The source signals are estimated as ~(t) = U H W x ( t ) , and the mixing matrix A is estimated as A = W # 1 J .
Fig. 8.2.3: The ambiguity functions of the chirp signal and two sinusoidal signals.
conventional MUSIC and t-f MUSIC. Pseudo Wigner-Ville distribution with window length L - 33 and 129 are considered. The Cramer-Rao Bound (CRB) is also shown in Fig. 8.2.5. Both signals are selected when performing t-f MUSIC. Simulation
340
Chapter 8:
Multi-Sensor and Time-Space Processing
Fig. 8.2.4: The estimated spatial spectra of AD-MUSIC and conventional MUSIC.
Fig. 8.2.5: The standard deviations of DOA estimation ~I vs. SNR.
results are averaged over 100 independent trials of Monte Carlo experiments. The advantages of t-f MUSIC in low SNR cases are evident from this figure. Fig. 8.2.6 shows estimated spatial spectra at S N R = - 2 0 dB based on t-f MUSIC (L = 129) and the conventional MUSIC. The t-f MUSIC spectral peaks are clearly resolved. Example trII [1]. In Fig. 8.2.7, we show an example of the application of STFDs to the BSS problem. A three-element equi-spaced linear array is considered where the interelement spacing is half a wavelength. Two chirp signals arrive at - 1 0 ~ and 10~ respectively. The number of data samples used to compute the STFD is 128. The number of t-f points employed in the joint diagonalization is p=128, with equal number of points on each signature. Fig. 8.2.7(b) shows the Choi-Williams distributions of two linear mixtures of the original chirp signals depicted in Fig. 8.2.7(a), corresponding to the data at the first and the second sensors. Using the STFDs, we are able to recover the original signals from their observed mixture, as shown in Fig. 8.2.7(c).
Spatial Time-Frequency Distributions and Their Applications
341
Fig. 8.2.6: The estimated spatial spectra of t-f MUSIC and conventional MUSIC.
8.2.4
Crossterm Issues in STFD
There are two sources of crossterms. The first type are the crossterms that are the results of the interactions between the components of the same source signal. The other type of crossterms are those generated from the interactions between two signal components belonging to two different sources. These crossterms are associated with cross-TFDs of the source signals and, at any given t-f point, they constitute the off-diagonal entries of the source T F D matrices Ddd(t, f ) defined in (8.2.3). Although the off-diagonal elements do not necessarily destroy the full-rank matrix property necessary for direction finding application [6], they violate the basic assumption in the problem of source separation regarding the diagonal structure of the source T F D matrix. We must therefore select the t-f points that belong to autoterm regions where crossterm contributions are at minimum, e.g., by using a priori information of the source signals. The method of spatial averaging of the STFD introduced in [7] does not reduce the crossterms as in the case with reduced interference distribution kernels, but rather move them from their locations on the off-diagonal matrix entries to be part of the matrix diagonal elements. The other parts of the matrix diagonal elements represent the contribution of the autoterms at the same point. Therefore, not only we are able to set the off-diagonal elements of the source T F D matrix to zeros, but also we can improve performance by selecting the t-f points of peak values, irrespective of whether these points belong to autoterm or crossterm regions.
342
Chapter 8:
Multi-Sensorand Time-Space Processing
F i g . 8 . 2 . 7 : Blind source separation based on STFDs.
8.2.5
Summary and Conclusions
The spatial time-frequency distribution (STFD) is an important tool for temporal and spatial separations of sources emitting nonstationary signals. It is a discriminatory tool that allows a consideration of only a subset of source signals impinging on a multi-sensor receiver. This property enhances signal parameter estimation and permits direction finding and signal separation to be applied to a number of sources that is equal or even exceeds the number of sensors.
.Spatial Time-Frequency Distributions and Their Applications
343
All material presented in this article is based on the model (8.2.2). One important change in the direction of the research in the time-frequency array signal processing area was given in [8], where the strict model of (8.2.2) was relaxed and a direction finding technique employing a STFD-based wideband root-MUSIC was proposed. Another research direction is the utilization and integration of crossterms into STFDs. It has recently been shown [9] t h a t source separation can be performed based on both autoterms and crossterms through joint diagonalization and joint anti-diagonalization schemes of STFD matrices. Article 8.4 presents algorithms and sample code for these methods.
References [1] A. Belouchrani and M. G. Amin, "Blind source separation based on time-frequency signal representation," IEEE Trans. Signal Processing, vol. 46, pp. 2888-2898, November 1998. [2] A. Belouchrani and M. G. Amin, "Time-frequency MUSIC: A new array signal processing method based on time-frequency signal representation," IEEE Signal Processing Letters, vol. 6, pp. 109-110, May 1999. [3] Y. Zhang, W. Mu, and M. G. Amin, "Time-frequency maximum likelihood methods for direction finding," J. Franklin Institute, vol. 337, pp. 483-497, July 2000. [4] M. G. Amin, A. Belouchrani, and Y. Zhang, "The spatial ambiguity function and its applications," IEEE Signal Processing Letters, vol. 7, pp. 138-140, June 2000. [5] Y. Zhang, W. Mu, and M. G. Amin, "Subspace analysis of spatial time-frequency distribution matrices," IEEE Trans. Signal Processing, vol. 49, pp. 747-759, April 2001. [6] M. G. Amin and Y. Zhang, "Direction finding based on spatial time-frequency distribution matrices," Digital Signal Processing: A Review Journal, vol. 10, pp. 325-339, October 2000. [7] Y. Zhang and M. G. Amin, "Spatial averaging of time-frequency distributions for signal recovery in uniform linear arrays," IEEE Trans. Signal Processing, vol. 48, pp. 2892-2902, October 2000. [8] A. B. Gershman and M. G. Amin, "Wideband direction-of-arrival estimation of multiple chirp signals using spatial time-frequency distributions," IEEE Signal Processing Letters, vol. 7, pp. 152-155, June 2000. [9] A. Belouchrani, K. Abed-Meraim, M. G. Amin, and A. Zoubir, "Joint antidiagonalization for blind source separation," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), Salt Lake City, UT, 7-11 May 2001.
344
Chapter 8:
Multi-Sensorand Time-Space Processing
8.3
Q U A D R A T I C D E T E C T I O N IN ARRAYS USING TFDs ~
The detection of signals in noise is a classical hypothesis-testing problem. The use of a sensor array can considerably enhance signal detection by providing a large gain in the SNR and allowing for signal source localization. The long-lasting interest devoted to the field of sensor array processing can be traced to the large number of applications where data is collected in both space and time. Some important applications include radar/sonar signal processing; ultrasonic, optical, and tomographic imaging; earth geophysical exploration such as crust mapping and oil exploration; and more recently space-time processing for wireless communications. We assume the very popular uniform linear array (ULA) geometry and a single signal source arriving from the far-field at some angle 0 with respect to the array. The goal is to detect the presence of this signal source from observations contaminated by additive noise at the sensors. In many situations, the channel may induce unknown modifications to the time and frequency parameters of the transmitted signal. Such situations include the well-known delay-Doppler situation in radar/sonar detection problems. Timefrequency distributions (TFDs), which jointly describe the time and frequency characteristics of the signal, are powerful tools for designing the optimal detector in such situations. It has been shown that the optimal quadratic detector for the detection of nonstationary Gaussian signals (i.e. signals whose statistics vary rapidly but are Gaussian at each fixed time instance) with unknown time and frequency offsets in the presence of noise can be implemented naturally within the quadratic class of TFDs [1]. Here we show how this idea can be extended to detection using an array of sensors, even those which exhibit imperfect spatial coherence. Imperfect coherence (i.e. a loss in the spatial correlation for a fixed time instance) is often attributed to signal scattering in multipath channels. Specifically, we consider the problem of detecting arbitrary nonstationary second-order signals with unknown time and frequency offsets arriving in a linear array with an unknown angle of arrival. We explicitly show how the optimal detector for such a problem can be implemented naturally and efficiently in the time-frequency domain.
8.:].1
The Detection Problem
Consider the following composite hypothesis-testing problem in continuous time: Ho
x(t)
-
H1 . x ( t )
-
s(t-
j2€
+ n(t)
where t C T, the time interval of observation, x is the observed signal, n is zeromean complex white Gaussian noise with variance (72, and s is a zero-mean complex arbitrary second-order signal with correlation function Rs(tl, t2). The parameters ~ Anil M. R a o and D o u g l a s L. Jones, Department of Electrical and Computer Engineering, Coordinated Science Laboratory, University of Illinois at Urbana-Champaign (anilrao@dsp.csl.uiuc.edu, jones@dsp.csl.uiuc.edu). Reviewers: Akbar M. Sayeed and Graeme S. Jones.
Quadratic Detection in Arrays using TFDs
345
(7, L,) represent time and frequency offset parameters that are assumed to be unknown. These parameters arise in the classic radar delay/Doppler scenario where the delay is due to the range of the target and the Doppler is due to the velocity of the target. In statistical hypothesis testing, for each observation, x, a real-valued test statistic, L ( x ) , is compared to a threshold to decide in favor of H0 or/-/1; that is, to decide whether the signal is present or not. TFDs provide a natural detection framework for such hypothesis-testing problems for two main reasons: first, detecting a second-order signal (such as a Gaussian signal) in the presence of Gaussian noise involves a quadratic function of the observations [2], and bilinear T F D s are quadratic in the observations; second, TFDs possess additional degrees of freedom provided by the time and frequency parameters.
8.3.2
Quadratic Detection in a Linear Array
In the linear array configuration, the signal comes in to the array of M sensors with spacing d at angle 0, where 0 is assumed to be unknown. We will denote the signal at the ith sensor by xi(t), i = 1 , . . . , M. Due to the linear array configuration, the signal at the ith sensor is a delayed version of the signal at the first sensor, and the value of the delay depends on the unknown angle of arrival, 0. That is, when the signal is present xi(t) - s ( t - ( i - 1)D) + ni(t), where D - dc sin(0) and c is the velocity of propagation in the medium. A concern arises when considering the use of a very large array in order to achieve high array gain; the signal received at widely separated sensors may have reduced coherence due to the complexity in the propagation of the signal from the source to spatially separated receivers (see [3-6] and references therein). Loss of spatial coherence can often be attributed to complex signal scattering arising in multipath propagation [7]. In particular, the larger the angular spread of the multipath the more rapid the loss in spatial coherence (the smaller the coherence length). Since we are only considering the second-order statistics of the signal, the model for partial coherence used will be given in terms of the correlation function. An exponential power-law model has been suggested [3,8] whereby the cross-correlation function between the ith and j t h li-jl
sensors will be scaled by the coefficient cij = e L , where L is a dimensionless characteristic correlation length. We may arrange the decorrelation coefficients in matrix form as C = {cij}. It will be convenient to deal with the aligned sensor outputs; that is, let yi(t) = x i ( t + ( i - 1)D). It will also be convenient to arrange the aligned sensor outputs in vector form as y 0 = [Yl (t) y2(t) ... yM(t)] T where the superscript 0 denotes the dependence of aligning the sensor signals on the unknown angle of arrival. The optimal test statistic based on the deflection 1 criterion and using a gen1Deflection-optimal detectors can be interpreted as " m a x i m u m signal-to-noise ratio (SNR)" detectors because deflection is a measure of SNR. T h e deflection is defined as
q -- E[L(x)IH1] - E[L(x)IHo]
Var[L(x)lHo]
Chapter 8:
346
Multi-Sensorand Time-Space Processing
eralized likelihood ratio test (GLRT) technique to deal with the unknown time, frequency, and angle parameters is given by Lopt -
max ( ( C @ R~ ~'~)) y 0 y 0 ) (r,u,O) \ / ' '
(8.3.2)
where | represents the Kronecker product, Rs denotes the linear operator defined by the corresponding correlation function Rs as ( R s x ) ( t ) = f Rs(t ,T)X(T)dT ,
(8.3.3)
and R~r'~) (tl, t2) - Rs(tl - T, t2 - T)e j27rvtx e -j2~rut2 denotes the correlation function corresponding the signal source with the unknown time and frequency offsets. Upon expanding the inner product in (8.3.2) we obtain M
Lopt -- max E
M
E cij
(8.3.4)
(T,C~,0) i--1 j--1
8.3.3
TFD
Based
Array
Detection
The connection to TFDs is made through the use of the Weyl correspondence which relates inner products, positive definite linear operators, and the Wigner distribution. Using the fact that the Weyl correspondence involves a covariance to time, frequency, and scale offsets, using the methods in [1] it can be verified that the test statistic in (8.3.4) may conveniently be expressed in terms of TFDs, allowing for a natural and efficient implementation of the optimal detector. The optimal test statistic may be expressed as M
Lopt
-
M
max E E e (t, f; g = WSR~), (t,f,O) i - 1 j - 1 cijpy~y~
(8.3.5)
where g is the Doppler-lag kernel filter and WSR~ is the Weyl-symbol of Rs defined by an,
W S R s (u, v) - / Rs(u + T/2, u -- T/2)e-J2~VrdT .
(8.3.6)
We use the superscript 0 here to denote the fact that the TFD must be formed for each hypothesized angle of arrival. Observe that in (8.3.5) we must form the sum of all weighted cross-TFDs; we will refer to this quantity as a matrix TFD. Figure 8.3.1 illustrates the detector structure for the partially coherent case. Because the detector involves forming TFDs of signals that are aligned to examine different spatial directions, we may think of this detection structure in terms of time-frequency-space or time-scale-space. Since we have not assumed any spatial statistical characteristics of the signal, the kernel is the same regardless of the angle of arrival being analyzed.
Quadratic Detection in Arrays using TFDs
x (t)
I-
x2(t)
I
1
I
9 I
347
. . . .
AlignQ
Matrix TFR
E
max
L opt
(t,f,O)
I
Fig. 8.3.1: TFD-Based Optimal Quadratic Array Processor.
If the array environment is perfectly coherent, then cij = 1 Vi, j. Using fundamental properties of bilinear TFDs, it can be verified that the optimal test statistic will involve first summing the sensor observations and then applying the T F D (with the kernel as before) for each hypothesized angle of arrival and choosing the maximum value. Hence the matrix processing of Figure 8.3.1 is replaced with simple summation. If the array environment is noncoherent, then C = I and it can be verified that the optimal test statistic will include first taking the T F D of each sensor observation (again, with kernel as before), and then summing the resulting TFDs for each hypothesized angle of arrival and choosing the maximum value.
8.3.4
Summary and Conclusions
We demonstrated that time-frequency based detectors are naturally suited to quadratic detection in an array environment. By using a GLRT approach, the deflection-optimal test statistic was cast in the form of TFDs. The TFD-based structure allows the optimal detector to be implemented naturally and efficiently by exploiting the many degrees of freedom available. In the general case of a partially coherent environment, the test statistic included a weighted sum of all crossTFDs of the aligned sensor outputs for each value of hypothesized angle of arrival. Completely coherent and noncoherent cases were shown to be special cases of the partially coherent model. In the coherent case, the optimal test statistic simplified to include a single auto-TFD of the sum of the aligned sensor outputs for each hypothesized angle of arrival. In the noncoherent case, the optimal test statistic simplified to include the sum of auto-TFDs of the aligned sensor outputs for each hypothesized angle of arrival.
References [1] A. M. Sayeed and D. J. Jones, "Optimal detection using bilinear time-frequency and time-scale representations," IEEE Trans. Signal Processing, vol. 43, pp. 2872-2883, December 1995.
348
Chapter 8:
Multi-Sensorand Time-Space Processing
[2] H. V. Poor, An Introduction to Signal Detection and Estimation. New York: Springer, 1988. [3] D. R. Morgan and T. M. Smith, "Coherence effects on the detection performance of quadratic array processors with application to large-array matched-field beamforming," J. Acoustical Soc. of America, vol. 87, pp. 737-747, February 1990. [4] A. M. Rao and D. L. Jones, "A denoising approach to multisensor signal estimation," IEEE Trans. Signal Processing, vol. 48, pp. 1225-1234, May 2000. [5] A. M. Rao and D. L. Jones, "Nonstationary array signal detection using time-frequency and time-scale representations," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'98), vol. 4, pp. 1989-1992, Seattle, 12-15 May 1998. [6] A. M. Rao and D. L. Jones, "Efficient structures for quadratic time-frequency and time-scale array processors," in Proc. IEEE-SP Internat. Symp. on Time-Frequency Time-Scale Analysis, pp. 397-400, Pittsburgh, PA, 6-9 October 1998. [7] R. B. Ertel, P. Cardieri, K. W. Sowerby, T. S. Rappaport, and J. H. Reed, "Overview of spatial channel models for antenna array communication systems," IEEE Personal Communications, vol. 5, pp. 10-22, February 1998. [8] H. Cox, "Line array performance when the signal coherence is spatially dependent," J. Acoustical Soc. of America, vol. 54, pp. 1743-1746, July 1973.
Implementation of STFDs-Based Source Separation Algorithms
8.4
349
I M P L E M E N T A T I O N OF STFDs-BASED SOURCE SEPARATION ALGORITHMS o
Spatial time-frequency distributions (STFDs) are a generalization of the timefrequency distributions to a vector of multi-sensor signals. Under a linear model, the STFDs and the commonly known covariance matrix exhibit the same eigenstructure. In applications involving multi-sensor signals, the aforementioned structure is often exploited to estimate some signal parameters through subspace-based techniques. Algorithms based on the covariance matrix assume stationary signals. When the frequency content of the measured signals is time-varying, approaches of this class can still be applied; however, they do not use the signal time-frequency information. Algorithms based on STFDs properly use this time-frequency information to significantly improve performance. This improvement comes essentially from the fact that the effects of spreading the noise power while localizing the source energy in the (t, f) domain amounts to increasing the signal-to-noise ratio (SNR). STFDs-based algorithms exploit the time-frequency representation of the signals together with the spatial diversity provided by the multiple sensors. The concept of the STFD was introduced for the first time in 1996. It was used successfully in solving the problem of the blind separation of non-stationary signals [1-3]. This concept was then applied to solve the problem of direction of arrival (DOA) estimation [4]. Since then, other works have been conducted in this area using the new concept of STFDs [5-9]. This article examines the implementation of STFDs-based source separation algorithms. Some theoretical aspects of these algorithms are presented in Articles 8.1 and 8.2. The MATLAB TM codes of the aforementioned algorithms are provided in the next sections.
8.4.1
The Spatial TFD (STFD)
Given an analytic signal vector z(t), the spatial instantaneous autocorrelation function (SIAF), which is the generalization of the instantaneous autocorrelation function (IAF) defined in Chapter 2, is given by Kz,~.(t, T) -- z(t + ~ ) z * ( t - ~)
(8.4.1)
Define also the smoothed SIAF as R~.,~.(t, 7) - G(t, 7) 9 K~.,~.(t, 7),
(8.4.2)
where G(t, T) is the time-lag kernel. The time convolution operator *t is applied to each entry of the matrix K~.,z(t, T). The class of quadratic Spatial TFDs (STFDs) ~ A d e l B e l o u c h r a n i , EE Dept., Ecole Nationale Polytechnique, Algiers, Algeria (belouchrani@hotmail.com). Reviewers: K. Abed-Meraim and H. Bousbia-Salah.
Chapter 8:
350
Multi-Sensor and -I-line-Space Processing
are then defined as Dz ~(t, f) -
~ {R~,~(t, T)},
(8.4.3)
where the Fourier transform 9~ is applied to each entry of the matrix Rz,z(t, r). The discrete-time definition equivalent to Eqs. (8.4.3) and (8.4.2), which leads to the simple implementation of a STFD, is
D~.,z (n, k) - m---, :D.T" {G(n, m) n9K~.,.(n, m)}, k
(8.4.4)
which can be written M
D,.,,. (n, k) -
M
E
E
m=-M
p=-M
G ( n - p , m ) z ( p + m ) z * ( p - m)e -ja=-~-,
(8.4.5)
where the discrete Fourier transform DJ c and the discrete time convolution operator *n are applied to each entry of the matrix G(n, m)*n Kz,z(n, m) and matrix Kz,.(n, m), respectively. N = 2M + 1 is the signal length. Note that the STFD of a signal vector is a matrix whose diagonal entries are the classical auto-TFDs of the vector components and the off-diagonal entries are the cross-TFDs. A more general definition of the STFD can be given as D~.,~.(n, k) -
M
M
E
E
m---M
p---M
G(n
-p,m)|
m ) z * ( p - m)e -j4~-~,
(8.4.6)
where | denotes the Hadamard product, and [G(n, m)]ij = Gij(n, m) is the timelag kernel associated with the pair of the sensor signals zi(n) and zj(n). The following MATLAB code calculates half of the STFD of a vector signal z(n) (the other half is obtained by Hermitian symmetry)" Y, M A T L A B code 1 Y, I n i t i a l i z a t i o n D = zeros (fft_length, sample_size, s e n s o r _ n u m b e r , s e n s o r _ n u m b e r ) ;
Y, STFD computation for k = I : K; Y, K: sensor n u m b e r for 1 =k : K; D(:,:,k,l) = C r o s s - T F D ( z ( k , : ) , z ( l , : ) , w i n d o w _ l e n g t h , f f t _ l e n g t h ) ; end end
where the function Cross-TFD (z (k, : ) , z (1, : ) , w i n d o w _ l e n g t h , f f t _ l e n g t h )
calculates the cross-TFD of the signals such functions, see Chapter 6.]
zk(n) and zz(n). [For the implementation of
Implementation of STFDs-Based Source Separation Algorithms
8.4.1.1
351
Structure under Linear Model
Consider the following linear model of the multi-sensor signal z(n), z(n) = As(n),
(8.4.7)
where A is a K x L matrix (K >_ L) and s(n) is a L x 1 vector which is referred to as the source signal vector. Under this linear model the STFDs take the following structure:
D~.~.(n, k) = A D ~ ( n , k)A H
(8.4.8)
where Ds~(n, k) is the source STFD of vector s(n) whose entries are the auto- and cross-TFDs of the source signals. The auto STFD denoted by D ~ ( n , k) is the STFD D~.~(n, k) evaluated at autoterm (t, f) points only. Correspondingly, the cross STFD DC~.(n,k) is the STFD Dz~.(n, k) evaluated at cross-term (t, f) points. Note that the diagonal (off-diagonal) elements of D~s(n, k) are auto-terms (crossterms). Thus, the auto (cross)STFD Da~(n, k) (D~(n, k))is diagonal (off-diagonal) for each (t, f) point that corresponds to a source auto-term (cross-term), provided the window effect is neglected.
8.4.2
STFDs-Based Source Separation
The multi-sensor signal z(n) is assumed to be non-stationary and to satisfy the linear model (8.4.7). The problem of source separation consists of identifying the matrix A and/or recovering the source signals s(n) up to a fixed permutation and some complex factors (see Article 8.1 for more details). Denote by W the L x K whitening matrix, such that (WA)(WA) n = U U n = I.
(8.4.9)
Pre- and post-multiplying the STFD Dzz(n, k) by W leads to the whitened STFD, defined as D~.~.(n, k) = WD,~.(n, k ) W H = UDss(n, k)U H (8.4.10) where the second equality stems from the definition of W and Eq. (8.4.8). By selecting auto-term (t, f) points, the whitened auto STFD will have the following structure: D~.(n, k) - VD~s(n , k ) C H (8.4.11) where Das(n,k) is diagonal. The missing unitary matrix U is retrieved (up to permutation and phase shifts) by Joint Diagonalization (JD) of a combined set {D~.(ni, ki)li - 1,... , P } of P auto STFDs. The incorporation of several autoterm (t, f) points in the JD reduces the likelihood of having degenerate eigenvalues and increases robustness to a possible additive noise.
Chapter 8:
352
Multi-Sensorand Time-Space Processing
The selection of cross-term (t, f) points leads the following whitened cross STFD: DCz(n, k) = UDsCs(n, k ) U H
(8.4.12)
where D~s(n,k ) is off-diagonal. The unitary matrix U is found up to permutation and phase shifts by Joint Off-Diagonalization (JOD) of a combined set {D~.(ni, k ~ ) l i - 1 , . . . , Q} of Q auto STFDs. The unitary matrix U can also be find up to permutation and phase shifts by a combined J D / J O D of the two sets {D~.(ni, ki)li - 1 , . . . , P} and {DC~.(ni, ki)li 1 , . . . , Q}. Once the unitary matrix U is obtained from either the JD or the JOD or the combined J D / J O D , an estimate of the mixing matrix A can be computed by the product W # U , where ~ denotes the pseudo-inverse operator. An estimate of the source signals s(n) is then obtained by the product A # z ( n ) .
8.4.3
Implementation of the Whitening
The implementation of the whitening goes as follows: 9 Estimate the sample covariance R from T data samples. 9 Denote by A1,... ,AL the L largest eigenvalues, and by h i , . . . ,hL the corresponding eigenvectors of R. 9 Under the white noise assumption, an estimate of the noise variance 6 2 is the average of the K - L smallest eigenvalues of R. 9 The whitening matrix is computed by W = [(A1 - a2) - 89h i , . . . ,
(AL - a2) - 89hL] T
(8.4.13)
9 The whitened signals are then computed by z(n) = W z ( n ) . Note that in the last step, the dimension of the problem is reduced from K to L. The above procedure is valid in the case of more sensors than sources, i.e. K > L. When K = L, no noise is assumed and the whitening matrix is computed as the matrix square root of the sample covariance matrix R. The following MATLAB code gives the implementation of the whitening 9 ~.
MATLAB code 2 C o m p u t a t i o n of the covariance matrix R = z*z'/sample_size" if L < K, Y, Assumes white noise Y, K: sensor number, L: source Y, Compute the eigen decomposition of R [U,d] = eig(R) ; [power,k] = sort (diag(real (d)) ) ; sigma = mean(power (i :K-L) ) ; wl = ones(L,l)./sqrt(power(K-L+l:K)-sigma) ; Y, C o m p u t a t i o n of the whitening matrix W = diag(wl) *U(1 :K,k(K-L+I :K) ) ' ;
number
Implementation of 5TFDs-Based 5ource 5eparat]on Algor]thms else W = inv(sqrtm(R)) ; end; % Whitening the signals z = W*z ;
8.4.4
353
Assumes no noise
Selection of Auto-Terms and Cross-Terms
The selection procedure of the auto-terms and cross-terms exploits the off-diagonal structure of the source cross STFD matrices. More precisely, for a source cross STFD, we have Trace(D~.(n, k)) - Trace(UD~s(n, k ) U H) - T r a c e ( D ~ ( n , k)) ..~ 0.
(8.4.14)
Based on this observation, the following testing procedure applies:
k))/norm(D~.~.(n,k)) < Trace(D~.z(n,k))/norm(D~.~.(n,k)) >
if Trace(D~.~.(n,
e Then decide t h a t (n, k) is a cross-term
if
e Then decide t h a t
(n,k)is
an auto-term
where e is a 'small' positive real scalar. The following MATLAB code implements the selection procedure: MATLAB code 3 Initialization M = []; ~ will contain the auto STFDs N = []; % will contain the cross STFDs mp = 2*L; ~ number of selected frequencies at a given time sample where L is the source number esp = 0.i; ~ threshold To select only (t-f) points with significant energy [Dsort,Ii] = sort(D); ~ D is the STFD computed at MATLAB code 1 for tp=l:skip:sample_size ~ skip is some chosen step for k = l:mp fp = Ii(fft_length-k,tp); Z Selection of the STFDs associated to the (t-f) point tp,fp Ds = squeeze(D(fp,tp,:,:)); Form the other half of Ds by Hermitian symmetry Ds = Ds+ (Ds-diag(diag(Ds)))';
if (trace(Ds)/norm(Ds))> esp, ~ Selection criterion M = [M Ds]; % Selection of auto STFDs else N = [N Ds]; % Selection of cross STFDs end; end; end; Note that the matrices M and N are the concatenation matrices the auto-STFD matrices and cross-STFD matrices, respectively.
of
Chapter 8:
354
8.4.5
Multi-Sensorand Time-Space Processing
Implementation of JD and JOD
The joint diagonalization (JD) [10] of a set { M k l k = 1 , . . . , P } of P matrices of dimensions L • L is defined as the maximization of the JD criterion P
L
CjD(V) de_j~ ~ iv~Mkv~[2
(8.4.15)
k=l i=1
over the set of unitary matrices V = I v 1 , . . . , VL]. The joint Off-diagonalization (JOD) [8] of a set { N k l k = 1 , . . . , Q} of Q matrices of dimensions L • L is defined as the maximization of the J O D criterion P
L
C J O D ( V ) de f _ E E k=l i--1
Iv~Nkv~12
(8.4.16)
over the set of unitary matrices V -- I v 1 , . . . , VL]. The combined joint diagonalization and joint off-diagonalization [8] of two sets { M k l k = 1 , . . . , P } and { N k l k = 1 , . - . , Q } of L x L matrices is defined as the maximization, over the set of unitary matrices V - I v 1 , . . . , VL], of the J D / J O D criterion
CjD/JOD(V) def E i=l
Iv~Mkv~]2 - ~ k=l
Iv~mkv~l 2
.
(8.4.17)
k=l
Maximization of Eq. (8.4.17) is performed by successive Givens rotations as follows: A unitary matrix V = [v~j] is sought such that the criterion in Eq. (8.4.17) is maximized w.r.t, the matrices M k = Ira/k], k = 1 , - . . , P , and Nk = [nkij], k = 1 , . . . , Q. At each Givens rotation step, a pivot pair r, s is chosen. W i t h 5ij denoting the Kronecker delta operator, vii is set to 5ij, except for the elements V r r : V s s = cos(0), vrs = e jr sin(0) and Vsr = - e - j r sin(0). One can show [10] that optimization of Eq. (8.4.17) is equivalent to the maximization of Q = vT~Re(G)v
(8.4.18)
where G
P T T -- ~-~'~k=l g l , k g l , k -- ~ Q = I g2,kg2,k
v T - [cos 20, - sin 20 cos r - sin 20 sin r g T, k = [mrr k -- ross, k m rks + m skr , j(mksr _ mks)] 82T . k _ [nrr k -- nss, k nrs k + nsr,j(nksr k k -- nrs)]"
(8.4.19) (8.4.20) (8.4.21) (8.4.22)
The next step is to recognize that the particular parameterization of v is equivalent to the condition v T v = 1. Maximization of a quadratic form under the unit norm constraint of its argument is obtained by taking v to be the eigenvector of ~ e ( G ) associated with the largest eigenvalue.
Implementation of 5TFDs-Based Source Separation Algorithms
355
The reader may check that if we set P - 1 and Q - 0,1 the above boils down to the standard Jacobi diagonalization procedure [11]. Also note that the main computational cost in this kind of technique is the update under Givens rotations of the various matrices involved in the diagonalization. This makes the computational cost of the J D / J O D procedure similar to P+Q times the diagonalization of a single matrix. The following MATLAB code gives the implementation of the J D / J O D : Y, MATLAB code 4 Y, Initialization L = rain(size(M))" Y, M" auto STFDs given at MATLAB code 3 nml = max(size(M)); nm2 = max(size(N))" '/, N: cross STFDs given at MATLAB code S V = eye(L); '/, L is the source number threshold = i/sqrt (sample_size)/I00; more = I" while more, more=O; for p=I:L-I, for q=p+l :L, Y, Givens rotations gl=[M(p,p:L:nml)-M(q,q:L:nml) ;M(p,q:L:nml)+M(q,p:L:nml) ; i* (M(q,p :L :nml)-M(p,q: L :nml) ) ] ; g2= [M(p,p:L:nm2)-M(q,q:L:nm2) ;M(p,q:L:nm2)+M(q,p:L:nm2) ; i*(M(q,p:L:nm2)-M(p,q:L:nm2)) ]; [vcp,d] = eig(real(gl.gl'-g2*g2')); [la,Ki]=sort(diag(d)); angles=vcp (:, Ki (i)) angles=sign 9 (angles (I)),angles; c=sqrt (0.5+angles (I)/2) ; sr=O .S* (angles (2) -j *angles (3) ) /c ; sc=conj(sr) ; yes = abs(sr) > threshold" more=more I yes ; if yes, Y. Update of the M, N and V matrices colpl=M(: ,p:L-nml); colql=M(: ,q:L:nml) ; M(" ,p:L:nml)=c*colpl+sr*colql-M(: ,q:L:nml)=c*colql-sc*colpl; rowpl=M(p, :) ;rowql=M(q, :) ; M (p, 9) =c*rowpl+sc*rowql ;M (q, :) =c*rowql-sr*rowpl ; colp2=N( : ,p:L:nm2); colq2=N(: ,q:L:nm2) 9 N(" ,p" L'nm2) =c*colp2+sr.colq2;N( : ,q:L:nm2)=c*colq2-sc*colp2 ; rowp2=N (p, :) ;rowq2=N (q, :) ; N(p, 9) =c*rowp2+sc*rowq2 "N(q, :) =c*rowq2-sr.rowp2 ; temp=V( : ,p) ; V(- ,p)=c*V(: ,p)+sr.V(: ,q) ;V(: ,q)=c*V(: ,q)-sc.temp; end" Y. if end; Y, q loop end; Y, p loop end; Y, while
8.4.6
Summary and Conclusions
The detailed implementation of non-stationary source separation algorithms based on Spatial TFDs has been presented together with the associated MATLAB codes. 1That is, only matrix M1 is incorporated in the criterion of Eq. (8.4.17).
356
Chapter 8:
Multi-Sensorand Time-Space Processing
The implementation details included whitening, selection of auto-term and crosst e r m regions and the combined Joint Diagonalization and Joint Off-Diagonalization.
References [1] A. Belouchrani and M. G. Amin, "New approach for blind source separation using time frequency distributions," in Proc. SPIE: Advanced Signal Processing Algorithms, Architectures and Implementations VI, vol. 2846, pp. 193-203, Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 4-9 August 1996. [2] A. Belouchrani and M. G. Amin, "Blind source separation using time-frequency distributions: Algorithm and asymptotic performance," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), vol. 5, pp. 3469-3472, Munich, 21-24 April 1997. [3] A. Belouchrani and M. G. Amin, "Blind source separation based on time-frequency signal representation," IEEE Trans. Signal Processing, vol. 46, pp. 2888-2898, November 1998. [4] A. Belouchrani and M. G. Amin, "Time-frequency MUSIC: A new array signal processing method based on time-frequency signal representation," IEEE Signal Processing Letters, vol. 6, pp. 109-110, May 1999. [5] A. S. Kayhan and M. G. Amin, "Spatial evolutionary spectrum for DOA estimation and blind signal separation," IEEE Trans. Signal Processing, vol. 48, pp. 791-798, March 2000. [6] A. R. Leyman, Z. M. Kamran, and K. Abed-Meraim, "Higher-order time frequencybased blind source separation technique," IEEE Signal Processing Letters, vol. 7, pp. 193196, July 2000. [7] Y. Zhang, W. Mu, and M. G. Amin, "Subspace analysis of spatial time-frequency distribution matrices," IEEE Trans. Signal Processing, vol. 49, pp. 747-759, April 2001. [8] A. Belouchrani, K. Abed-Meraim, M. G. Amin, and A. Zoubir, "Joint antidiagonalization for blind source separation," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), Salt Lake City, UT, 7-11 May 2001. [9] N. Linh-Trung, A. Belouchrani, K. Abed-Meraim, and B. Boashash, "Separating more sources than sensors using time-frequency distributions," in Proc. Sixth Internat. Symp. on Signal Processing and its Applications (ISSPA '01), vol. 2, pp. 583-586, Kuala Lumpur, 13-16 August 2001. [10] A. Belouchrani, K. Abed-Meraim, J.-F. Cardoso, and E. Moulines, "Blind source separation using second order statistics," IEEE Trans. Signal Processing, vol. 45, pp. 434444, February 1997. [11] G. H. Golub and C. F. Van Loan, Matrix computations. Hopkins University Press, 1989.
Baltimore, MD: Johns
357
Underdetermined Blind Source Separation for FM-like Signals
8.5
U N D E R D E T E R M I N E D BLIND FOR FM-LIKE SIGNALS~
SOURCE
SEPARATION
In many real-life engineering applications, including sonar and telecommunications, the signal under consideration may consist of a mixture of a number of signals. The user may not be interested in the whole mixture signal (also called a multicomponent signal), but rather in a particular component of it. For instance, in telecommunications the received signal may be a mixture of several source signals (multiple access interference) but the user may wish to recover only one or some individual source signals. The various component signals may be non-stationary (i.e., their spectral contents may vary with time); and since time-frequency (TF) analysis has proved to be a powerful tool in the analysis of such signals, we use this technique to extract the desired source signal. In particular, we use the timefrequency distribution (TFD) of the mixture signal in order to separate and recover any signal component (see Article 8.1 for a detailed justification). In this article, we focus on the i m p o r t a n t problem of underdetermined blind source separation (UBSS) where the number of sources is larger t h a n the number of sensors. The sources under consideration are non-stationary frequency-modulated (FM) signals encountered in radar and certain wireless communication systems. For instance, linear FM chirp signaling is considered for frequency-hopped CDMA (Code Division Multiple Access) systems [1-3], in which context UBSS coincides with the blind multiuser detection problem. We present here two classes of methods where UBSS is achieved using source spatial signatures in conjunction with their T F signatures. The two methods are based on a vector clustering technique and a mono-component extraction technique, respectively.
8.5.1
Data Model and Assumptions
Assume t h a t an n-dimensional vector s(t) - [81(~:),... ,Sn(t)] T E C (nxl) corresponds to n non-stationary complex source signals s i ( t ) , i - 1 , . . . , n. The source signals are t r a n s m i t t e d through a medium so that an array of m sensors picks up a set of mixed signals represented by an m-dimensional vector x(t) - [ X l ( t ) , . . . , X m ( t ) ] T C C (re• For an instantaneous linear mixture medium, the observed signals can, then, be modeled as: x(t) - A s ( t ) + r/(t), where A c C (mxn) is the mixing matrix and r/(t) - [r/l(t),r/2(t),...,~lm(t)] T e C (rex1) is the observation noise vector. For the UBSS problem, i.e. for n > m, the mixing m a t r i x A is not (left) invertible. However, the column vectors of matrix ~ K a r i m A b e d - M e r a i m and Linh-Wrung N g u y e n , Sig. & Image Proc. Dept., Telecom Paris, France (abed@tsi.enst.fr, trung@tsi.enst.fr), and A d e l B e l o u c h r a n i , EE Dept., Ecole Nationale Polytechnique, Algiers, Algeria (belouchrani@hotmail.com). Reviewers" B. Boashash, V. Chandran and A. Beghdadi.
Chapter 8:
358
Multi-Sensorand Time-Space Processing
Fig. 8.5.1: A time-frequency distribution of a multicomponent signal.
A = [al, a 2 , . . . , an] are assumed to be pairwise linearly independent, i.e., for any a i and aj are linearly independent. Here, the sources are assumed to be multicomponent FM signals. By a multicomponent signal, we mean a signal whose TF representation presents multiple ridges in the TF plane. Analytically, the kth source may be defined as,
i , j E 1, 2 , . . . , n and i ~: j,
Mk
- Z
l=1
(s.5.2)
where each component sk,L(t), of the form
sk,t(t) = ak,z(t) ejCk'~(t),
(8.5.3)
is assumed to have only one ridge in the TF plane. An example of a multicomponent signal, consisting of three components, is displayed in Fig. 8.5.1.
8.5.2
Separation using Vector Clustering
In this approach, the sources are assumed to have different structures and localization properties in the TF domain. More precisely, we assume the sources to be orthogonal in the TF domain (Fig. 8.5.2) in the sense that their TF supports 1 are disjoint. The above assumption can be applied to any TFD. It is clear that the TF orthogonality is too restrictive and will almost never be satisfied exactly in practice. Fortunately, only approximate orthogonality, referred to as quasi-orthogonality, is needed in practice to achieve source separation [4]. Note, that two FM signals with different FM laws satisfy the quasi-orthogonality assumption. Under the above assumption, one can notice that two auto-term spatial time frequency distribution (STFD) matrices (see Article 8.1 for a thorough definition) 1A T F support of a given signal sents the T F D of s(t).
s(t) is defined by {(t, f)[Dss(t, f) :fl 0} where Dss(t, f) repre-
359
Underdetermined Blind Source Separation for FM-like Signals
1, fl)
f21
f
frequency
Fig. 8.5.2: TF orthogonal sources; the TF supports of two sources are disjoint.
__~ Aut~
Selection
I {(ta' fa i } I
x(t) = As(t) I STFD I D'~( t,I
"-I WVD I
~
classifier
I
I
'=lget "rul
/
cL~ . . . . ,b .... (t,/)
~iget "rUl
:1 "r-~Yn I
=
,~.~ = ~,,(t) ~1/I--:~yn I
Fig. 8.5.3: Diagram of the UBSS algorithm. of the observation D x x ( t l , f l ) and si(t) are such that:
Dxx(t2,f2)
corresponding to the same source
D x x ( t l , f l ) = Ds~s~ ( t l , f l ) a / a / H ,
Dxx(t2, f2) -
Ds~s~(t2, f2)aia H.
(8.5.4)
Eq. (8.5.4) indicates that Dxx(tl, fl) and Dxx(t2, f2) have the same principal eigenvector ai. The idea of the proposed algorithm is to group together the auto-term points associated with the same principal eigenvector representing a particular source signal. The T F D of this source (Ds~s~(t, f)) is obtained as the principal eigenvalues of the STFD matrices at the auto-term points. The proposed algorithm is shown diagrammatically in Fig. 8.5.3. It consists of the following four steps:
(1) STFD computation and noise thresholding:
Given a finite set of observation
vectors, the STFD matrices of the observation Dxx(t, f) can be estimated using time-lag domain discrete implementation as shown in Articles 6.1 and 8.4. These STFD matrices are next processed to extract the source signals. In order to reduce the computational complexity by processing only "significant" STFD matrices, a noise thresholding is applied to the signal TFD. More precisely, a threshold cl (typically, el = 0.05 of the point with maximum energy) is used to keep only the
360
Chapter 8:
points
Multi-Sensor and Time-Space Processing
{(ts, fs)} with sufficient energy: Keep
(ts, fs) if ]lDxx(ts,fs)[[ > cl.
(s.5.5)
(2) Auto-term selection: The second algorithm step consists of separating the auto-term points from cross-term points using an appropriate testing criterion. For that, we exploit the sources' T F orthogonality. Under this assumption, each autoterm STFD matrix is of rank one, or at least has one "large" eigenvalue compared to its other eigenvalues. Therefore, one can use rank selection criteria, such as MDL (minimum description length) or AIC (Akaike information criterion), to select autoterm points as those corresponding to STFD matrices of selected rank equal to one. For simplicity, we use the following criterion: if
/~max{Dxx(t, f)} _ 1 > e2 norm {Dxx(t, f)}
> decide that (t, f) is a cross-term point
where c2 is a small positive scalar (typically, c2 = 10-4), and ,~max {Dxx(t, f)} represents the largest eigenvalue of Dxx(t, f).
(3) Clustering and source TFD estimation: Once the auto-term points have been selected, a clustering step based on the spatial signatures of the sources is performed. This clustering is based on the observation that two STFD matrices corresponding to the same source signal have the same principal eigenvectors. Moreover, the corresponding principal eigenvalues are given by the desired source TFD. This leads to the core step of our source separation method that uses vector clustering. This is implemented by the following four operations: (a) For each auto-term point, (ta, fa), compute the main eigenvector, and its corresponding eigenvalue, A(ta, fa), of Dxx(ta, f~).
a(ta, fa),
(b) As the vectors {a(ta, fa)} are estimated up to a random phase e jr r E [0, 27r), we force them to have, without loss of generality, their first entries real and positive. (c) These vectors are then clustered into different classes {C~}. Mathematically, a(ti, f~) and a(tj, fj) belong to the same class if:
d(a(ti,fi),a(tj,fj))
< s
(8.5.6)
where s is a properly chosen positive scalar and d is a distance measure (different strategies for choosing the threshold c3 and the distance d or even the clustering method can be found in [5]). As an example, in the simulated experiment, we used the angle between the two vectors as a distance measure: d(ai, aj) - arccos(5.Tfij) where fi = [Re(a) T, Im(a)T]T/llall.
(8.5.7)
Underdetermined Blind Source Separation for FM-like Signals
361
(d) Set the number of sources equal to the number of "significant" classes and, for each source si (i.e. each class Ci), estimate its TFD as:
D s ~ ( t , f) --
{)~(ta, fa), 0,
if (t, f) -- (ta, fa) e Ci otherwise.
(8.5.8)
(4) Source signal synthesis: Use an adequate source synthesis procedure to estimate the source signals, si(t) (i = 1 , . . . , n), from their respective TFD estimates D s~s~. TF synthesis algorithms can be found in Chapter 11 and ref. [6]. An example in Fig. 8.5.4 illustrates the performance of the proposed algorithm.
8.5.3
Separation using Monocomponent Extraction
To achieve UBSS, we introduce here a four-step procedure consisting of: 1. Computation and spatial averaging of the observed signal TFDs; 2. Component extraction to separate all signal (mono) components; 3. Component clustering to group together components belonging to the same multicomponent source signal; 4. TF signal synthesis to recover the original source waveforms. To have a "clean" TFD (i.e. one that can reveal the features of the signal clearly, without any "ghost" components), we use a newly developed high resolution quadratic TFD called the B-distribution (see Articles 2.7, 3.3 and 5.7). In addition, we use a spatial averaging that mitigates further the sources cross-terms by a factor depending on their spatial signatures angle (see Article 8.2). More precisely, we compute the averaged TFD (on which line detection is applied) as: m
D(t, f) - Trace(Dxx(t, f)) - E D=~x, (t, f).
(s.5.9)
/=1
For component clustering, we use the observation that the STFD matrices at two auto-term points corresponding to the same source signal have the same principal eigenvector. Therefore, the proposed component clustering procedure consists of grouping together components associated with the same spatial direction representing a particular source signal. This spatial direction is estimated as the averaged value over all component points of the principal eigenvectors of the corresponding STFD matrices. More precisely, for each extracted component C, one estimates the corresponding spatial direction as: I ~~ca(ti, fi ) ac = Nzc
(8.5.10)
362
Chapter 8:
Multi-Sensorand Time-Space Processing
where I v denotes the set of points of component C, Nzc denotes the number of points in :Iv and a(ti, fi) is the estimated principal eigenvector of the ith component point STFD matrix Dxx(ti, fi). These vectors are then clustered into different classes using the clustering algorithm in Section 8.5.2. For the component extraction, two different techniques, presented next, can be used. The first one is based on a "road extraction" procedure initially proposed for road detection in satellite image processing [7] while the second uses a "peak detection and tracking" procedure [8].
8.5.3.1
A "Road Network Tracking" Approach
The component-extraction method is divided into three main steps [9]: (i) preprocessing: because of the particularity of the TFD image, a preprocessing is needed before applying the component extraction procedure; (ii) line detection giving local binary detection of the potential linear structures (segments); and (iii) global optimization giving a set of labeled components. Due to space limitation, we only give a brief review of the principle of the method. Additional information and mathematical details can be found in [7] and references therein. (1) Preprocessing: First the TF image is transformed to a real positive-valued image by forcing to zero all negative values 2 of the TFD and by using a gray scale in the range from 1 to 256. Also, line detectors are usually limited to a line width of 5 pixels. If the researched components do not respect this limit (which is usually the case for a TF image), an image subsampling by block-averaging is applied to reduce the pixel size. Despite the blurring effect, this filter presents the advantage of reducing the noise in the TF image. Moreover, as the TF image is unisotropic (i.e., it contains horizontal lines as can be observed in Fig. 8.5.1), this image downsampling [see Fig. 8.5.4(f)] removes this particular feature of the WE image. (2) Line detection (Local optimization): A line detector is applied at each pixel of the image. We used a detector proposed in [7] for radar image processing. For a given direction, its response is based on the ratio of the means computed on both sides of the suspected line and the mean of the line itself. Height directions are studied and the best response is kept. The resulting image is then binarized using a simple thresholding. If statistics on the image are available (noise distribution, additive or multiplicative noise, etc.), a statistical study of the line detector performance can be made to choose the more adapted threshold (for instance the threshold corresponding to a fixed false alarm rate in homogeneous areas).
(3) Road detection (global optimization): This step is a global step introducing constraints on the shape of the linear features to extract connected components and to suppress the false alarms [7]. It works on segments extracted on the thresholded line response image by thinning and linearization. The previously detected segments are connected depending on proximity and alignment constraints (specially 2Negative values correspond mainly to undesired cross-terms or noise.
Underdetermined Blind Source Separation for FM-like Signals
363
F i g . 8 . 5 . 4 : Testing the "vector clustering" UBSS algorithm for m -- 2 mixtures of two monocomponent and one multicomponent LFM signals denoted by st(t), s2(t) and s3(t) (a-c). The recovered source signals (m-o) indicated the success of the UBSS algorithm. Source s3(t) was not falsely separated into two monocomponent sources.
Chapter 8:
364
Multi-Sensorand Time-Space Processing
on the line curvature) to form coherent components. Small isolated segments are suppressed. The algorithm depends on the following thresholds: the maximum gap between two segments to connect them, the allowed angular difference between the two segments, and the minimum size of a component. The result is a labeled image of components. Fig. 8.5.5 shows an example illustrating the separation quality that can be obrained by the above technique.
8.5.3.2
A "Peak Detection and Tracking" Approach
We present here a component separation algorithm [8] relying on the additional assumption that all components of the signal exist at almost all time instants. Fig. 8.5.6, Table 8.5.1 and Table 8.5.2 illustrate this algorithm. Fig. 8.5.6 provides the algorithm flowchart, Table 8.5.1 describes the estimation technique of the number of components and Table 8.5.2 details the component separation technique.
(1) Noise thresholding: The first step of the algorithm consists in noise thresholding to remove the undesired "low" energy peaks in the T F domain. 3 We set to zero the T F D values smaller than a properly chosen threshold e. Using M A T L A B TM notation, this operation can be written as: Dth (t, f) =
D(t, f). 9(D(t, f) > c).
(8.5.11)
(2) Estimation of the number of components: In general, for a noiseless and cross-terms free TFD, the number of components at a given time instant t can be estimated as the number of peaks of the T F D slice D(t, f). Here, we propose a simple technique to estimate the number of components in the case where all components exist simultaneously at almost all time instants. In this situation, we can efficiently evaluate the number of components (see Table 8.5.1) as the maximum argument of the histogram of the number of peaks computed for each time instant t in the range [1, 2 , . . . , tmax] (where tmax • fmax is the dimension of the T F D matrix).
(3) Component separation procedure: The proposed algorithm assumes that
(i) all components exist at all time instants in the T F plane and (ii) any component intersection is a crossing point. Under these two assumptions, we note that if, at a time instant t, two or more components are crossing, then the number of peaks (at this particular slice D(t, f)) is smaller than the total number of components d. The details of the proposed separation technique is outlined in Table 8.5.2. To validate the proposed algorithm, we reconsider the same multicomponent signal analyzed in Fig. 8.5.1. This signal consists of a mixture of two quadratic frequency modulated (FM) components and a linear FM component. The mixture signal is added to a zero-mean white Gaussian noise, with SNR equal to 0dB. 3This noise thresholding is justified by the fact t h a t the noise energy is spread over the whole T F domain while the components' energies are well localized around their respective IFs leading to high energy peaks for the latter (assuming no cross-terms).
Underdetermined Blind Source Separation for FM-like Signals
365
Fig. 8.5.5: (a--c) WVD of st(t), s2(t), s3(t); (d,e) spatial-averaged TFD of the mixture outputs using WVD and MWVD; (f) STFD mixture converted to image; (g-h) extraction of source components using "road network tracking"; (i) auto-term points of known components; (j-l) TFD estimates of the sources; (m-o) TFD of estimated sources after TF synthesis: m = 2 sensors and SNR = 10dB.
Chapter 8:
366
Multi-Sensor and Time-Space Processing
Input 1D signal
SignaI TFD B-dist,ibution
Masking J C, (t, j') ..
Cl
D(t,f)
Nr ise
Thresl ,olding
Dth (t: f)
Components Separation
Num ~erof Compments Estin ation
d
t
--~--I Masking] C~(t''f) "
~1"
Masking
II Cd(t'l)
F i g . 8.5.6: Flowchart of the "component extraction" based algorithm.
Table
8 . 5 . 1 " Estimation of the number of components.
1. For t = 1,...,tmax number_components(t) = number of peaks of end
D(t, :)
2. Compute the histogram H of number_components. 3. Evaluate the number of components as d = arg max H
The B-distribution of the noisy signal as well as the components resulting from the separation algorithm are displayed in Fig. 8.5.7. Another relevant component separation algorithm can be found in [10].
8.5.4
Summary and Conclusions
This article presents TF-based methods for the underdetermined blind separation of FM-like signals. The first class of methods is based on a vector clustering approach while the second is based on a component extraction approach. Two component extraction techniques are introduced using a "road network tracking" algorithm (initially developed for road detection in satellite image processing) and a dedicated "peak detection and tracking" algorithm, respectively. Simulation examples are provided to illustrate the performances of considered blind separation methods. Note that the UBSS method based on vector clustering is more general in the sense that it can be applied to separate a larger class of (not necessarily FM) signals as long as they satisfy the TF orthogonality property. In fact, it can be seen as a member of the "sparse decomposition" based UBSS methods [11] which essentially use the sparse representation of the source signals in a
Underdetermined Blind Source Separation for FM-like Signals
367
Table 8.5.2: Component separation procedure for the "peak detection and tracking" algorithm.
1. Assign an index to each of the d components in an orderly manner. 2. For each time instant t (starting from t = 1), find the component frequencies as the peak positions of the TFD slice D(t, f). 3. Assign a peak to a particular component based on the smallest distance to the peaks of the previous slice D ( t - 1, f) (IFs continuous functions of time). 4. If at a time instant t a crossing point exists (i.e., number of peaks smaller than the number of components), identify the crossing components using the smallest distance criterion by comparing the distances of the actual peaks to those of the previous slice. 5. Permute the indices of the corresponding crossing components.
Fig. 8.5.7: The B-distribution of the original signal (top left) as well as the extracted components using the "peak detection and tracking" algorithm.
given transformed domain (in our case, it is the TF domain). The "vector clustering" based method is however less performant than the second class of methods using component extraction when considering only FM-type signals. Several component extraction techniques exist in the literature [8-10, 12] among which two have been presented in this article. The best promising technique (i.e. the one providing the best separation quality) is the "image processing" based one using a "road detection"
368
Chapter 8:
Multi-Sensorand Time-Space Processing
algorithm. In terms of computational cost, it is however much more expensive than the "peak detection and tracking" technique.
References [1] S. E1-Khamy, S. E. Shaaban, and E. A. Thabet, "Frequency-hopped multi-user chirp modulation (FH/M-CM) for multipath fading channels," in Proc. Sixteenth National Radio Science Conference (NRSC'99), pp. C6/1-8, Ain Shams Univ., Cairo, 23-25 February 1999. [2] C. Gupta and A. Papandreou-Suppappola, "Wireless CDMA communications using time-varying signals," in Proc. Sixth Internat. Syrnp. on Signal Processing and its Applications (ISSPA '01), vol. 1, pp. 242-245, Kuala Lumpur, 13-16 August 2001. [3] A. Persson, T. Ottosson, and E. Strom, "Time-frequency localized CDMA for downlink multi-carrier systems," in Proc. Seventh Inter'nat. Syrnp. on Spread Spectrum Techniques and Applications (ISSSTA-02), vol. 1, pp. 118-122, Prague, 2-5 September 2002. [4] N. Linh-Trung, A. Belouchrani, K. Abed-Meraim, and B. Boashash, "Separating more sources than sensors using time-frequency distributions," in Proc. Sixth Internat. Syrnp. on Signal Processing and its Applications (ISSPA '01), vol. 2, pp. 583-586, Kuala Lumpur, 13-16 August 2001. [5] A. Gersho and R. M. Gray, Vector Quantization and Signal Compression. Kluwer, 1991. [6] G. F. Boudreaux-Bartels and T. W. Parks, "Time-varying filtering and signal estimation using Wigner distribution synthesis techniques," IEEE Trans. Acoustics, Speech, eJ Signal Processing, vol. 34, pp. 442-451, June 1986. [7] F. Tupin, H. Maitre, J.-F. Mangin, J.-M. Nicolas, and E. Pechersky, "Detection of linear features in SAR images: Application to road network extraction," IEEE Trans. Geoscience ~ Remote Sensing, vol. 36, pp. 434-453, March 1998. [8] K. Abed-Meraim and B. Barkat, "Blind source separation using the time-frequency distribution of the mixture signal," in Proc. Second IEEE Internat. Syrnp. on Signal Processing and Information Technology (ISSPIT'02), vol. 2, pp. 663-666, Marrakech, Morocco, 18-21 December 2002. [9] K. Abed-Meraim, N. Linh-Trung, V. Sucic, F. Tupin, and B. Boashash, "An image processing approach for underdetermined blind separation of nonstationary sources," in Proc. Third Internat. Syrnp. on Image eJ Signal Processing and Analysis (ISPA-03), Rome, 1820 September 2003. [10] B. Barkat and K. Abed-Meraim, "A blind components separation procedure for FM signal analysis," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), vol. 2, pp. 1425-1428, Orlando, FL, 13-17 May 2002. [11] P. Bofill and M. Zibulevsky, "Underdetermined blind source separation using sparse representations," Signal Processing, vol. 81, pp. 2353-2362, November 2001. [12] F. S. Cohen, S. Kadambe, and G. F. Boudreaux-Bartels, "Tracking of unknown nonstationary chirp signals using unsupervised clustering in the Wigner distribution space," IEEE Trans. Signal Processing, vol. 41, pp. 3085-3101, November 1993.
P a r t IV
Statistical Techniques
This Page Intentionally Left Blank
Chapter 9
Random Processes and Noise Analysis Time-Frequency Distributions (TFDs) have been studied initially for deterministic signals. Noise is an ever permanent feature in signals and so there is a need to extend the formulation of TFDs to the case of random signals. This chapter describes timefrequency methodologies developed for dealing with randomness in non-stationary signals and systems. The topic is covered in five articles with appropriate crossreferencing. Noise analysis for TFDs is presented with derivations of mean and variance of TFDs (Article 9.1). Both cases of additive and multiplicative noise, including white and colored noise, are treated. Time-varying random processes and systems can be represented with dispersive time-frequency characteristics using the Weyl symbol (9.2). This approach allows the adaptation of analysis tools to systems or processes that have specific time-frequency (TF) characteristics. TFDs such as the spectrogram and WVD can be designed that are robust to impulse noise (9.3). Time-varying power spectra can be defined based on generalizations of the Wigner-Ville spectrum and evolutionary spectrum. These are particularly suitable for underspread non-stationary processes (i.e. processes with small time-frequency correlations) (9.4). Time-varying random channels are also described using a timefrequency approach (9.5).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
371
Chapter 9:
372
Random Processes and Noise Analysis
ANALYSIS OF NOISE IN T I M E - F R E Q U E N C Y DISTRIBUTIONS ~
9.1
This article presents noise analysis for commonly used time-frequency distributions such as those presented in Chapters 2 and 3. The Wigner distribution, as a basic time-frequency representation, is studied first. The bias and variance in the case of complex white noise are derived. The analysis of noise is extended to other quadratic distributions, and to different types of additive and multiplicative noise, including: stationary white noise, nonstationary white noise, and colored stationary noise. Exact expressions for the mean value and the variance of quadratic distributions for each point in the time-frequency plane are given.
9.1.1
Wigner Distribution
The pseudo Wigner distribution (WD) of a discrete-time noisy signal x(n) --- s(n)+ e(n) is defined by1:
W=(n, f)
- Z m
w(m)w(-m)x(n + re)x* (n
-
m)e -j4'~/m.
(9.1.1)
where w(m) is a real-valued lag window, such that w(O) = 1. Consider first the case when s(n) is deterministic and the noise e(n) is a white, Gaussian, complex, stationary, zero-mean process, with independent real and imaginary parts having equal variances. Its autocorrelation function is Ree(m) - ff~5(m). The WD mean for the noisy signal x(n) is
E{W=(n, f)} = ~-~mw(m)w(-m)s(n + m)s*(n- ?'n)e -j4rrfm
+ ~m w(~n)w(-~)R"(2m)~-J~m l/4
= 2
fJ-~/4 W~(n, f - a)Fw (2a)da + a~, 2
(9.1.2)
where Fw(f) = ~'m~:[w(m)w(-m)] is the Fourier transform (FT) of the product w(m)w(-m), and Ws(n, f) is the original WD of s(n), without a lag window. The lag window w(m) causes the W D bias. The second term on the right-hand side in (9.1.2) is constant, so one can assume that it does not distort the WD. Expanding Ws(n, f - a) into a Taylor series, around f, we get 1/4
2
J-1/4
(')2
w~ (~, f - ~)F~ (2~)d~ -~ W~(~ f) + ~ '
W~(~,/) ~n~ + .... Of 2
(9.1.3)
~ L J u b i ~ a S t a n k o v i d , Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (1.stankovic@ieee.org). Reviewers: M. G. Amin, and M. J. Bastiaans. 1Notation ~ m, without limits, will be used for Y~-m=-oo" oo The constant factor of 2 is omitted in the WD definition, and in other T F D definitions.
373
Analysis of Noise in Time-Frequency Distributions
Thus, the bias can be approximated by
1 02W,(n, f) Of u
?Tt2 ~ ~1 b(n,
bias (n, f) ~- -~
f)m2
where m2 __ /'1/2 J - 1/9 f2F~(f)df. For the regions where the W D variations in the frequency direction are small, the bias is small, and vice versa. The WD estimator v a r i a n c e , at a given point (n, f), is defined by:
cr~VD(n, f) -- E{Wx(n, f)Wx(n, f)} - E{W,(n, f)}E{W:~(n, f)}.
(9.1.4)
For signals x(n) -- s(n)+ e(n) it results in Cr~D(n' f) -- ~-~ml ~--~m: W ( ~ l ) W ( - - ~ T t l ) W ( m 2 ) w ( - - g r t 2 ) e
-j4rrf(ml-m2)
• + .~)~* (~ + . ~ : ) R ~ ( ~ - m:, ~ - -~1) +~*(~ - . ~ ) ~ ( ~ - . ~ : ) R ~ ( ~ + . ~ , ~ + . ~ ) +~(~ + - ~ ) ~ ( ~ - -~:)R;~. (~ - -~1, ~ + m:) +~*(~ - . ~ ) ~ * ( ~ + . ~ ) R ~ . (~ + . ~ 1 , ~ - - ~ ) + R ~ ( ~ + . ~ , ~ + m ~ ) R ~ ( ~ - . ~ , ~ - m~) + R ~ . (~ + m ~ , ~ - . ~ ) R ~ . (~ - . ~ , ~ + .~)1.
(9.~.s)
The fourth-order moment of noise is reduced to the correlation functions by using the relation E{qz2z3z4} - E{zlz2}E{z3z4} + E{zlz3}E{z2z4} + E{zlz4}E{z2z3}, which holds for Gaussian zero-mean random variables zi, i = 1,2,3,4. For the considered complex noise R~(n, m) - ~ 25(n - m) and R ~ . (n m) - 0. The variance of the W D estimator reduces to
- 4 Em
+
+ 4]
It is frequency independent. For constant modulus signals, s(n) - a exp[jr the variance is constant @VD(n, f ) -- a~Ew 2 (2a2 + cr~), 2 where Ew - Em[w(rn)w(-m)] 2 is the energy of w ( m ) w ( - m ) window. A finite energy lag window is sufficient to make the variance of Wx(n, f) finite. The optimal lag window width can be obtained by minimizing the error e 2 = bias 2 (n, f) + cr~/D(n , f). For example, for constant modulus signals, and the Hanning window w ( m ) w ( - m ) of the width N, when Ew = 3N/8 and rn2 - 1/(2N2), we get: e2_~__1 b2(n f ) + ~ c r 256N 4 '
2(2a 2 + o r 2)
From Oe2/i)N - 0 the approximation of optimal window width follows:
Nopt(n, f) ~-
b2(n,f )
24a~(2a 2 + a~)"
Chapter 9: RandomProcessesand NoiseAnalysis
374
An approach to the calculation of the estimate of Nopt(n, f), without using the value of b2(n, f), is presented in [1], [Article 10.2]. Other statistical properties of the WAgner distribution are studied in [2].
9.1.2
Noise in Quadratic Time-Frequency Distributions
A discrete-time form of the Cohen class 2 of distributions of noise e(n) is defined by:
p~(n, f; G) = ~-~, ~-~m G(m, l)e(n + m + l)e* (n + m - l)e -j4€
(9.1.6)
where G(m, l) is the kernel in the time-lag domain. Its m e a n value, for a general nonstationary noise, is
E{p~(n, f; G)} - ~-~L ~-~m G(m, 1)R~(n Jr- m 4. l, n 4- m - l)e -jd~yl, where Roe (m, n) is the noise autocorrelation function. For special cases of noise the values of E{p~(n, f; G)} follow. (1) Stationary white noise, R~ (m, n) - a ~ ( m - n),
2 ~g(0,0).
E(p,(n, f; G)} -
(2) Nonstationary white noise, Rcc (m, n) -- I ( n ) 5 ( m - n), I(n) > O,
E{pc(n, f; G)} - ~-~m G(m, O)I(n + m). (3) Stationary colored noise, Rr162(m, n) = Roe ( m - n), l/2 -G(0, 2(f - a))S~(a)da,
f
where S ~ ( f ) - .Tm~/[R~ (m)] is the noise power spectrum density, and the kernel forms in time-lag, Doppler-lag, and Doppler-frequency domains are denoted by: l/2
~-~m G(m, 1)e -j2€
= g(v, l) -
fJ - l / 2 G(v, f)eJ2~fZdf .
(9.1.7)
The v a r i a n c e of p~(n, f; G), is defined by a~2 ( n , f ) -
E{p~(n,f;G)p~(n,f;G)}* - E{p~(n,f;G)}E{p~(n, f; G)}.
For Gaussian noise, as in (9.1.4)-(9.1.5), we get: 2
O'ee(n,
f) -- ~ , , ~ ,~ ~-~m, ~-~m2 G(ml , ll )G* (m2 , /2)
x [Ree(n + m l + / 1 , n + m2 + 12)R'~(n +
+R~.
(9 1.8)
m, - 1 , , n 4. m2 - / 2 )
(n + m~ + ll, n + m2 -12)R~.~(n + m~ - l l , n + m2 + 12)]e-Jd.f(L,-z~).
2 (n, f) for the specific noises will be presented next. Form of ace 2That is, the quadratic class; see p. 68n.
375
Analysis o f Noise in Time-Frequency Distributions
C o m p l e x s t a t i o n a r y a n d n o n s t a t i o n a r y w h i t e noise For nonstationary complex white noise, with independent real and imaginary part of equal variance, Ree (m, n) = I(n)5(m - n), Ree* (n, m) = 0, we get
2 (n, f) - E L E m IG(m'/)12 I(n + m + 1)I* (n + m - l) Gee
pi(n,
0; lal ).
(9.1.9)
2 the variance is proportional to the kernel For stationary white noise, I(n) - he, energy, 2 ( n , f ) - a e4E z E m [ G ( m , 1) 12 . (9.1.10) Gee
Colored stationary noise For complex colored stationary noise, the variance (9.1.8) can be written as
2 ~r(e(rt'f)Z l l Z m l G(ml,/l) { ~-'~"rn2El2 G*(m2,12)
x [/~ee(?Ttl -- //12 -[-11 --12)R2e(ml - m2
(11 -12))]e -jnTrf(l~-12) },
-
or 2 (n,f) Gee
- E l E m G(m'l) {G(m,l)*z *m[R:e(m + 1)Ree(m
/)eJa'fz]} * ,
where " *z *m" denotes a two-dimensional convolution in l, m. Consider the product of G(m,l) and Y*(m,l) - {G(m, 1)*l *m[R~e(m + 1)Ree(m-/)eJ4rfl]} * in the last expression. Two-dimensional FTs of these terms are G(L,,~) and y(v,~) = G(v, ~)See(f - (~ - ~')/2)S~e(f - (~ + v ) / 2 ) / 2 . According to the Parseval's theorem we get: 2( n, f) __ l f l / 2 f l / 2 2 J-1/2 J-1/2
(Tee
-
Ps~ (0, f;
Igl ),
~ sh(f
-
V +
2
v )d~'d~ 2 (9.1.11)
for If - (~ - v)/21 < 1/2 and I f - (~ + v)/21 < 1/2. The transforms in (9.1.11) are periodic in v and ~ with period 1. It means that we should take into account all v and when If - [ ( ~ + kl) - (v + k2)]/2[ < 1/2 and I f - [(~ + kl) -+-(/] -{- k2)]/21 < 1/2, where kl and k2 are integers. Note that the F T of a colored stationary noise is a white nonstationary noise, with autocorrelation in the frequency domain
R z z ( f l , f2) - ~-~m E n E{e(m)e*(n)}e(-J2r'flm+j27r'f2n) - See(f2)~p(fl - f2), where 5p(f) is a periodic delta function with period 1. Thus, (9.1.11) is just a form dual to (9.1.9).
376
Chapter 9:
Random Processes and Noise Analysis
A n a l y t i c noise In the numerical implementation of quadratic distributions, an analytic part of the signal is commonly used, rather than the signal itself. The analytic part of noise can be written as ca(n) = e ( n ) + jeh(n), where eh(n) = 7-/[e(n)] is the Hilbert transform of e(n). Spectral power density of ca(n), within the basic period Ifl < 1/2, for a white noise e(n), is Seoea(f) - 2(T2U(f), where U(f)is the unit step function. The variance follows from (9.1.11) in the form
2 (n f ) -
(Tee
'
2o.4
fl/2 fd(f ,e) [~(/],~)[2 d~du
for
J-l~2 J-d(f,~)
1 12f[ _< ~,
(9.1.12)
where the integration limits are defined by d(f, ~) = larcsin(sin(Tr(2f - ~)))l/Tr (for details see [3]). The kernel {7(u, ~c) is mainly concentrated at and around the (u, ~) origin and - 0 axis. Having this in mind, as well as the fact that l{7(u, ~)12 is always positive, 2 (n, f) is for f -- 0. The we may easily conclude that the minimal value of ace maximal value will be obtained for If] = 1/4. It is very close to [3]: m a x { ~ ( n , f)} ~-2o -4
fl/2 fl/2 -
2a~ ~-~t ~--~.m IG(m,/)12
J-1/2J-1/2
R e a l noise Now consider a real stationary white Gaussian noise e(n) with variance a e2 . In this case, variance (9.1.8) contains all terms. It can be written as:
2 (n, (Tee
f) - a e4 E l Z r n jIG(re' l) 12 + G(m, l) G* (m,-1)e -j87r/l]
(9.1.13)
For distributions whose kernel is symmetric with respect to l, a ( m , t) = a ( r n , - / ) holds. The FT is therefore applied to the positive and even function IG(rn,/)l 2. The transform's maximal value is reached at f = 0, and Ifl = 1/4. Accordingly: max {ae2e(n, f)}
- 2a~ E l E m ]G(m, l)] 2 .
(9.1.14)
The crucial parameter in all previous cases is the kernel energy }-]-I~ m la(m, t)l Its minimization is thoroughly studied in [4]. It has been concluded that, out of all the quadratic distributions satisfying the marginal and time-support conditions, the Born-Jordan distribution is optimal with respect to this parameter.
9.1.3
Noisy Signals
Analysis of deterministic signals s(n) corrupted by noise, x(n) = s(n)+ e(n), is highly signal dependent. It can be easily shown [3,4], that the distribution variance (Tp2(n, f) consists of two components:
f)
+
(n, f).
(9.1.15)
377
Analysis of Noise in Time-Frequency Distributions
The first variance component, and the distribution mean value, have already been studied in detail. 3 For the analysis of the second, signal dependent, component 2 (n, f) we will use the inner product form of the Cohen class of distributions: O's~
px(n, f; G) - E l E m G(m, l) [x(n + m)e -j2"fm] [x(n + 1)e-J2"ll] * , (9.1.16) where C_,(m,l) - G((m + 1)/2, (m l)/2). Calculation of (~(m, l) is described in the next section. For a real and symmetric G(m, 1), and complex noise, we get -
a~
G(mx, ll)(~* (m2,12)s(n + ml)s (n + m2)
,
xR~(n + 12, n + ll)e -j27rf(ml-ll-m2+12), what can be written as ~ z (n, f) - 2 E m
1 Em
2 ~)(~1 , T/~2) [8(TL -~- TY~l)e-j27rfml][s( T~ 2i- ~2)e--J27rfm2] * ,
(9.1.17)
where the new kernel (I)(ml, m 2 ) reads ~(TTtl, m 2 )
-- ~-~11 ~-~12 d(ml'll)d*(m2'12)e-J2=f(z2-Zl)Ree(n+12'
n+ll). (9.1.18)
The signal dependent part of the variance a 2 (n, f) is a quadratic distribution of the signal, with the new kernel (~(ml,m2), i.e., asc 2(n , f ) = 2ps(n f; #P).
Special case 1: W h i t e s t a t i o n a r y c o m p l e x noise, when R~(n + ll, n +/2) O-c26(/1 m /2), produces
~)(ml, m 2 ) - cr~ ~-~lG(mx,1)G*(m2,1).
(9.1.19)
For time-frequency kernels we assumed realness and symmetry throughout the article, i.e., (~* (m2, l) -_ (~l, m2). Thus,_ for finite limits (9.1.19) is a matrix multiplication form, 9 - a~2G 9G* - a~2G2 . Boldface letters, without arguments, will be used to denote a matrix. For example G is a matrix with elements G(m, l). Thus, as~2(n, f) - 2ps (n, f; a~2~2).
Note: Any two distributions with k e r n e l s GI(?Tt, l) -- G 2 ( m , - 1 ) variance, since E1
(9.1.20) have the same
Gl(?Ttl,/)G~(Trt2' l)- E/G1(17~1 ' -/)G~(ITt 2' -l)- E/G2(?Ttl ' /)G~(?Yt2' l) "
Corollary: A distribution with real and symmetric product kernel g(pT) and the distribution with its dual kernel gd(~T) -- 9~--~.,#--.~ [g(afl)] have the same variance. 3An analysis of the bias, i.e., kernel influence on the form of
ps(n, f; G) may be found in [1].
Chapter 9:
378
Random Processes and Noise Analysis
Proof: Consider all coordinates in the analog domain. The time-lag domain forms of g(vT), G(t,T) -- Jc,--,t[g(vT)], and gd(VT), Gd(VT) -- ~,--,t[gd(VT)] are related_ by _G(t,~-) - Gd(~',t). In the rotated domain this relation produces G(tl, t2) - Gd(tl,--t2), what ends the proof, according to the previous note. Example" The WD has the kernel g(vT) = 1, G(m, l) - 5(m + l). According to the Corollary, the WD has the same variance as its dual kernel counterpart, with g(v7) - 5(V,T), G(m,l) = 5 ( m - 1). This dual kernel correspondsto the signal energy ~ m [x(n + m)[ 2 (see (9.1.16)). Thus, the WD and the signal energy have the same variance. The same holds for the smoothed spectrogram, and the S-method [1], [Article 6.2], whose kernels are G(m,l) = w(m)p(m + l)w(l), and G(m, l) - w(m)p(m -1)w(1), respectively. Their variance is the same.
Eigenvalue decomposition: Assume that both the summation limits and values of (~(m, l) are finite. It is true when the kernel G(m, l) is calculated from the well defined kernel in a finite Doppler-lag domain, G(m,l) - .T,~m[g(v,l)], using a 2 (n, f) can finite number of samples. The signal dependent part of the variance as~ be calculated, like other distributions from the Cohen class, by using eigenvalue decomposition of matrix G, [5, 6]. The distribution of non-noisy signal (9.1.16) is N/2-1
ps(n, f) =
E )hSs(n, f; qi) = ps(n, f;)~, q), i=-g/2
(9.1.21)
where Ai and qi(m) are eigenvalues and eigenvectors of the matrix (~, respectively, and Ss(n, f; qi) -
z-.~i--N/2
§
is the spectrogram of signal
s(n) calculated by using qi(m) as a lag window. Since (~ -- a~2~2 , its eigenvalues and eigenvectors are a~2 IAi ]2 and qi(m), respectively. Thus, according to (9.1.20) N/2-1
2 (n, f)
O'Se
Ss(n,f;qi)
---
__
2a2ps(n,f; I,kl2 q).
(9.1.22)
i----N~2
2 (n , f) kernel: According Relation between the original kernel and variance a~r to (9.1.21), we can conclude that the original kernel in the Doppler-lag domain ~N/~.-~ can be decomposed into g(u, l) - ,--.,i=-N/2 .Xiai(u, 1), where ai(u, l) are ambiguity functions of the eigenvectors qi(m). The kernel of ps(n, f; I~12 , q), in (9.1.22), is ga(l], l) -- ~--,N/2-1 z-.~=-Y/2 [A~[2 hi(V, l). A detailed analysis of distributions, with respect
to their eigenvalue properties, is presented in [6], [Article 6.4]. In the sense of that analysis, the signal dependent variance is just "an energetic map of the timefrequency distribution" of the original signal. The mean value of variance (9.1.17) is:
ae~r
f) _
J-i/2
2 (n, f ) d f - 2a 2 E m ~(m, m)Is(n + m)] 2 .
a~r
(9 .1.23)
379
Analysis of Noise in Time-Frequency Distributions
Fig. 9.1.1: Time-frequency representations of a non-noisy signal (First column); One realization of time-frequency representations of the signal corrupted by a white stationary complex noise (Second column); Variances of the distributions, obtained numerically by averaging over i000 realizations (Third column): (a) Spectrogram, (b) Smoothed spectrogram, (c) S-method, (d) Choi-Williams distribution, (e) Pseudo Wigner distribution. For frequency modulated signals s(n) - a exp[jr to the kernel energy [4].
it is a constant proportional
Special case 2: For n o n s t a t i o n a r y w h i t e c o m p l e x noise, (9.1.18) results in:
N/2-1 ~(ml, m2) :
E
i---n~2
I(n
+/)G(ml, 1)G*(m2, l),
(9.1.24)
or 9 -- GI~G, where In is a diagonal matrix, with the elements I ( n + l). For the
Chapter 9:
380
Random Processes and Noise Analysis
quasi-stationary case, I(n + ll)5(ll -12) -~ I(n)5(ll -12), we have Pi = I(n) I,~il2, with all other parameters as in (9.1.22).
Special case 3: In the case of c o l o r e d s t a t i o n a r y c o m p l e x noise, relations dual to those in Special case 2, hold (like (9.1.9) and (9.1.11)). Special case 4: Let x(n) = s(n)(1 +p(n)), where p(n) is a m u l t i p l i c a t i v e noise. We can write x(n) = s(n) + s(n)p(n) = s(n) + e(n), where e(n) = s(n)p(n) is an additive noise. Thus, the case of this kind of multiplicative noise can be analyzed in the same way as the additive noise. For example, if the noise #(n) is a nonstationary white complex one with R , , ( m , n) = It,(n)5(n- m), then R~(m, n) = I~(n)5(nm), where I ~ ( n ) - Is(n)] 2 It,(n ).
9.1.4
Numerical Example
Consider the signal
x(t) - exp(jl100(t + 0.1) 2) + e -25(t-~ +e -25(t-~
exp(jl000(t + 0.75) 2)
exp(jl000(t - 0.4) 2) + exp(j2850t) + e(t),
within the interval [0, 1], sampled at At = 1/1024. A Hanning lag window of the width Tw = 1/4 is used. Stationary white complex noise with variance a~2 = 2 is assumed. The spectrogram, smoothed spectrogram, S-method, Choi-Williams distribution (CWD), and the WD, of signal without noise are presented in the first column of Fig. 9.1.1, respectively. For the CWD, the kernel g(u, T) = exp(--(UT) 2) is used, with normalized coordinates -v/TrN/2 <_ [27ru] < V/TrN/2, -v/TrN/2 <_ IT[ < V/TrN/2, and 128 samples within the intervals. Elements of the matrix G were calculated as, [6]
p=-N/2 g(pA~, (m -/)A~-) exp(-j27r(m + 1)p/(2N))A~.
(9.1.25)
The normalized eigenvalues of the matrix ,I, were Ai - {1,-0.87, 0.69,-0.58, 0.41, -0.30, ...} and #i - jail 2 - {1, 0.76, 0.47, 0.33, 0.17, 0.09, ...}. In the spectrogram and smoothed spectrogram the whole signal dependent part of variance is "located" just on the signal components, while in the WD it is "spread" over the entire timefrequency plane. Variance behavior in other two distributions is between these two extreme cases. As it has been shown, the variances in the smoothed spectrogram and the S-method are the same [Fig. 9.1.1(b) and (c)].
9.1.5
Summary and Conclusions
The variance values for a white nonstationary complex noise, with R ~ ( m , n ) I(n)5(m - n), I(n) >_ O, for some distributions, are summarized next. -Pseudo Wigner distribution Ws(n, f; w), with G(m, l) - w(m)5(m + l)w(1):
2 (n, f) - Wi(n, 0; w 2) + 2Wi,isl2(n, 0; w2), a~vD(n, f) -- a~2 (n, f) + a,~
-
(9.1.26)
Analysis of Noise in Time-Frequency Distributions
381
where Wt,lsl 2 denotes the cross Wigner distribution for I(n) and Is(n)[ 2. -Spectrogram Ss(n, f; w), with G(m, l) - w(m)w(1)"
2
CrSPEC
(n, f) - Si(n, 0; w 2) + 2Fi(n, O": w2)Ss(n, f": w).
(9.1.27)
The S T F T of I(n), calculated using the window w2(m), is denoted by Fi(n, f; w2). -A general quadratic distribution, with kernel G(rn, l) - G((rn + 1)/2, (m - / ) / 2 ) , in (9.1.6) or (9.1.16), and G being a matrix with elements G(rn, l):
2(n, y) - pi(~, o; IGI 2) + 2p~(n, f. , GI~G).
O'p
(9.1.28)
The first two formulae are special cases of (9.1.28). Expressions for stationary white noise follow with I(n) - cry. 2 Dual expressions hold for a colored stationary noise. Further details can be found in [7].
References [1] L. Stankovid and V. Katkovnik, "The Wigner distribution of noisy signals with adaptive time-frequency varying window," IEEE Trans. Signal Processing, vol. 47, pp. 1099-1108, April 1999. [2] P. Duvaut and D. Declerq, "Statistical properties of the pseudo-Wigner-Ville representation of normal random processes," Signal Processing, vol. 75, pp. 93-98, 5 January 1999. [3] L. Stankovid and V. Ivanovid, "Further results on the minimum variance time-frequency distributions kernels," IEEE Trans. Signal Processing, vol. 45, pp. 1650-1655, June 1997. [4] M. G. Amin, "Minimum-variance time-frequency distribution kernels for signals in additive noise," IEEE Trans. Signal Processing, vol. 44, pp. 2352-2356, September 1996. [5] M. G. Amin, "Spectral decomposition of time-frequency distribution kernels," IEEE Trans. Signal Processing, vol. 42, pp. 1156-1165, May 1994. [6] G. S. Cunningham and W. J. Williams, "Kernel decomposition of time-frequency distributions," IEEE Trans. Signal Processing, vol. 42, pp. 1425-1442, June 1994. [7] L. Stankovid, "Analysis of noise in time-frequency distributions," IEEE Signal Processing Letters, vol. 9, pp. 286-289, September 2002. [8] P. Flandrin and W. Martin, "The Wigner-Ville spectrum of nonstationary random signals," in The Wigner Distribution--Theory and Applications in Signal Processing (W. Mecklenbr/~uker and F. Hlawatsch, eds.), pp. 211-267, Amsterdam: Elsevier, 1997. [9] S. B. Hearon and M. G. Amin, "Minimum-variance time-frequency distribution kernels," IEEE Trans. Signal Processing, vol. 43, pp. 1258-1262, May 1995. [10] L. Stankovid and S. Stankovid, "On the Wigner distribution of the discrete-time noisy signals with application to the study of quantization effects," IEEE Trans. Signal Processing, vol. 42, pp. 1863-1867, July 1994.
382 9.2 9.2.1
Chapter 9:
Random Processes and Noise
Analysis
STATISTICAL PROCESSING OF DISPERSIVE SYSTEMS A N D SIGNALS o Processing Tools For Time-Varying Systems and Signals
Time-frequency representations (TFRs) such as the Wigner distribution (WD) are powerful tools for analyzing deterministic signals whose spectral characteristics vary with time. In many applications, one can observe linear time-varying (LTV) systems (e.g. wireless communication or sonar channels) and nonstationary random processes (e.g. noisy signals with random parameters). Due to their time fluctuations, both of these can exhibit time-frequency (TF) characteristics that could be embedded in the model design of a system or in the autocorrelation function of a random process. Thus, it would be ideal to design transforms or T F R s that would extract useful information from these characteristics [1-5]. The transforms could preserve some important system or random process changes. Specifically, signals propagating over LTV systems may be shifted by a constant amount in frequency or shifted in a non-linear time-dependent manner due to a dispersive change in the signal's instantaneous frequency (IF). Note that dispersive implies that the IF change or shift varies non-linearly with time. Information of a constant frequency change, for example, is very useful in improving the Doppler diversity of a communications channel (see [6] and Article 13.2). Furthermore, information of an IF change on the input signal to a dispersive system can be used to improve system performance. An appropriate analysis tool for LTV systems and random processes is a class of T F R s called TF symbols [1-5] which are 2-D functions that depend on an integral operator s defined as (s = f_oo Tz.(t,T)x(T)dT. The kernel Tx:(t,T) of the operator can be considered as the time-varying impulse response of an LTV system or the autocorrelation function of a nonstationary random process.
Narrowband Weyl Symbol The narrowband Weyl symbol (WS) of a linear operator L: on L2(R), defined as WSL(t,f)-
TL
(t + ~ , t -
cx:)
is an important tool for analyzing LTV systems and nonstationary random processes characterized by constant T F shifts and scale changes [1, 2]. It can be interpreted as a time-varying transfer function of an LTV system (see [1] and Articles 4.7 & 12.4) or as a time-varying spectrum of a nonstationary random process (Article 9.4). When L: is the autocorrelation operator 7~z of a random process x(t) whose kernel ~ A. Papandreou-Suppappola, Telecommunications Research Center, Department of Electrical Engineering, Arizona State University, Tempe, AZ 85287-7206 USA (papandreou@asu.edu), B.-G. Iem, Electronics Department, Kangnung National University, Kangwondo, 210-702 South Korea (iembg@yahoo.com), and G. Faye Boudreaux-Bartels, Department of Electrical and Computer Engineering, University of Rhode Island, Kingston, RI 02881 USA (boud@ele.uri.edu). Reviewers: A. M. Sayeed and G. Matz.
StatisticM Processingof DispersiveSystems and Signals
383
Table 9.2.1: Summary of some commonly used operators. Depending on the sign of t, sgn(t)=-t-1.
Operators
(S~x)(t)
(g~) x)(t)
Operator Definitions
~) (w/-~ s,~r we x)(t) 9 (t
(MvX)(t) x(t) ej2"vt - (w( (~(J) x)(t) x(t) ~ ~ ( ~ ) x(t) eJ2rt31n( ~ ) (~x)(t) x(t) ej2~zsgn(t) I~1" (7~ x)(t) (E~x)(t) x(t) ej 2 ~ / ~ (Cax)(t) ,/~ x(t~) (~V(I Ca W~ X)(t) (Sa~)x)(t) t
~M z / .
w~ ~)(t)
--1
1
(~v~~)(t)
X (tr~ -1 (~r))I try ( tr~-I
(t-~)) I 1/2
is the autocorrelation function Tn~ (t, 7-) = E { x ( t ) x * ( T ) } , the WS of 7~x is the expected value of the WD, i.e. WSnz(t, f) = E{WDx(t, f)} (called the WignerVille spectrum in [7] and Section 2.2.1). Here, E{.} denotes statistical expectation and the WD is defined as WDx(t, f) = f-~c~ x(t + ~ ) x * ( t - ~)e -j27rT/dT. When /: is an innovations system, the squared magnitude of (9.2.1) is known as the Weyl spectrum in [8]. Note that the WS has been used to provide a definition of a TF concentration measure [2], and is useful in TF detection applications [7, 8]. The WS satisfies many desirable properties. Specifically, it preserves constant TF shifts, and scale changes (see Table 9.2.1) on LTV systems and nonstationary random processes. For example, the WS of the autocorrelation operator 7~x of a nonstationary random process x(t)satisfies [1] y(t) = (ST x)(t) -- x(t -- 7) =~ WSny (t, f) - WSnx (t - T, f)
y(t) - (My x)(t) = x(t) ~j~v~ ~ w s ~ (t. f ) - w s ~ (t. f - ~) 1 (t) ( t ) y(t) - (Ca x)(t) = V ~ x =~ WSn,~ (t, f) = WSnx -'a a f
(9.2.2) (9.2.3) (9.2.4)
where ST, Ad. and Ca are the constant time-shift, constant frequency-shift and scale operators, respectively. The WS also satisfies the unitarity property given by WSc(t,f)WSk(t,f)dtdf OO
OO
-
?/? CK)
Tc(t,T) T ) ( t , T ) d t d T
(9.2.5)
OO
where TL(t, T) and T~r(t, T) are the kernels of the operators s and Af, respectively. This is an important property for preserving energy or norms. Since it is unitary,
384
Chapter 9:
Random Processes and Noise Analysis
the WS is associated with the unitary WD and preserves the quadratic form (s
WSc(t, f) WDx (t, f ) dt df .
x* (t) dt -
oo
oo
(9.2.6)
(x)
Here, the correlation of the WS of a system s with the WD of the system's input with x ( t ) [1,2].
x(t) results in the correlation of the system's output ( s
Narrowband Spreading Function The spreading function (SF) of a linear operator s is defined as the 2-D Fourier transform of the WS, SFL(T, 1,') =
i_ 1? oo
WSL(t, f ) e - j 2 " ( t ~ - f r ) dt df
oo
TL t +
-
CK)
2't-
7
(9.2.7)
dt.
Using the SF, the output of an LTV system s can be interpreted as a superposition of TF shifted versions of the input signal each weighted by the SF [2] (s
SFc(7, u) e - j ' ' " x ( t - 7) e j2'~t d7 du .
oo
(9.2.8)
Oo
The system output interpretation in (9.2.8) is comparable to that of a linear timeinvariant (LTI) system, and it can be used to formulate TF receiver structures. For example, it can provide critical information on the output signal of a time and frequency selective wireless communications channel. The SF is also used in the classification of operators as underspread or overspread [1].
Wideband Weyl Symbol The TF version of the affine WS , introduced in [2], is called P0-Weyl symbol (PoWS) in [3,4], and for A(c~) = PoWS~(t, f) - f
(
FB fA(a)e~, f A ( a ) e - ~ oo
s i n ha/2 (c~/2)'
it is defined as
) A(a)e j2'~st~d a ,
f > O,
where ( B X ) ( f ) - f o FB(/, ~ ) X ( , ) d u , and rB(f, ~) is the kernel of the frequency domain operator 13 on L2(I~+). The PoWS is a unitary symbol that is associated with the unitary Bertrand P0-distribution [9]. The wideband SF (WSF) is the 2-D modified Fourier transform of PoWSB(t, f) [2], and they are both important for analyzing random processes and LTV systems characterized by constant or hyperbolic time shifts and scale changes [2-4].
9.2.2 DispersiveTime-Frequency Symbols In nature, there are systems and random processes characterized by dispersive time or frequency shifts (see [9] and Article 5.6). For example, the ocean is a system that can cause echoes with power dispersive characteristics from acoustic waves reflected from immersed spherical shells [10]. The WS is not well-suited to analyze
Statistical Processing
of
Dispersive Systems and Signals
385
such systems and processes as it is better matched to constant TF shifts [1, 3, 4]. Instead, as will be shown next, modified versions of the narrowband WS and SF were developed for analyzing random processes and LTV systems characterized by dispersive (such as power) IF shifts [3]. Note that extensions of the wideband PoWS for arbitrary dispersive analysis is provided in [5].
Dispersive Weyl Symbol Let the linear operator Z, defined in the time domain on L2([p, q]), be ( Z x ) ( t ) - f ; T z ( t , T)x(~-)dr, with kernel T z ( t , 7). The dispersive Weyl symbol, DWS~ ) (t, f), of Z is defined in [3] as DWS~) (t, f) - fcc d T z ( l ( t , ~ ) ,
=wsw, zw; ~
l(t,-~)) t~ ~
U
I~ ]v(l(t,~))v(l(t,-~))] d~ e-J2~'--~ 89
(9.2.9)
' t~ v(t)
where/(t, ~) - tr ~-l(~c(Kt ) + 2~), ~- 1 (~(b)) - b, v(t) = d ~ ( Kt ) , and the integration range [c, d] in (9.2.9) and [iv, q] above depend on the range and domain, respectively, of ~(.). Note that tr > 0 is a reference time point that is needed for unit precision (unless otherwise stated, it could be taken as t~ = 1). Thus, the DWS is obtained by warping the operator Z and the WS in (9.2.1) using a unitary transformation ( [3, 9, 11] and Article 5.6) based on the one-to-one warping function ~C(b). With ( W ( 1 W~ x)(t) - x(t), the unitary warping operator W~ in (9.2.10) is given by 1/2
The DWS preserves dispersive IF shifts on a random process x(t), i.e.
y(t) - x(t)eJ2"Z~(~ )
=~
DWS 7~., (~) (t , f) - DWS 7~x (~) (t , f - ~ v ( t ) ) , (9.2.11)
where 1 y(t) - (M(~) x)(t) - x ( t ) e j2~z~(~), .~(~) is the IF shift operator, and fi v(t) is the change in IF. This follows as the WS preserves constant frequency shifts (A/l,x)(t) in (9.2.3), and the warping in (9.2.10)yields W~-1A/tZ/t~ I/Y~ = M(~). Because of this important property, the DWS is potentially useful in analyzing random processes or LTV systems with characteristics that may be constant (when ~C(b) = b) or dispersive (when ~C(b) is non-linear). The DWS also preserves warped time shifts ( , ~ ) x)(t) - (l/Y( 1 $t~r W~ x)(t) (cf. (9.2.2)) and warped scale changes
(C(~) x)(t) - (W~-1 Ca W~ x)(t) (cf. (9.2.4)) as defined in [3]. The importance of the DWS is further emphasized when used to analyze random non-linear frequency modulated (FM) signals x(t) - aLo(t)eJ2~Zr ) with phase function r and random amplitude a (see [9] and Article 5.6). When the phase 1The tilde above operators indicates the warped versions of the operators in (9.2.2)-(9.2.4).
Chapter
38(5
9:
Random Processes and Noise Analysis
function of the non-linear FM signal is matched to the warping function ~(b) used t in the DWS, i.e. when r = ~(b) and co(t) - ] d r ]1/2 = Iv(t)] 1 / 2 , then the DWS of the autocorrelation operator of the process produces an ideally localized representation,
DWS n~ (~) (t, f ) -
E{[a] 2 } 5 (f - / 3 v(t)) ,
provided that the range of ~(b) is R. The DWS is localized along the curve f = /3 v(t), corresponding to the IF of the non-linear FM signal.
Dispersive Spreading Function The dispersive spreading function (DSF) is obtained as the SF in (9.2.7) of the warped operator [3] ~V~ Z )4)~-1, (9.2.12)
- SFw, z w[' (tr r fl/tr) .
DSF~ ) (r
For an LTV system Z, the DSF provides an interpretation of the operator output as a weighted superposition of warped TF shifted versions of the input signal x(t),
(Zx)(t) =
?F oo
DSF~ ) (r
e -j'~r
(M(~) 8~) x)(t) de dfl.
(9.2.13)
oo
Depending on the warping function ~(b), the formulation in (9.2.13) simplifies to a specific interpretation on the operator output. For example, when ~(b) - ~ln(b) = lnb, then ( ~ ( ~ l n ) ~ l n ) x ) ( t ) - e-~ x(te-r j2~z ln(~), and (9.2.13) describes the operator Z as a weighted superposition of hyperbolic IF shifts and scale changes (by e r on the input x(t) [3].
Unitarity and Quadratic Form Properties of the DWS
The DWS satisfies the
unitarity property in (9.2.5) since [3]
qDWS~)(t,f)DWS~)*(t,f)dtdf
-
T z ( t , T ) T*x(t,T)dtdT.(9.2.14 )
oo
)
[ which is the The DWS is associated with DWD(~)(t, f) = WDw~x tr~(Ut ), try(t) ] IF shift covariant version of the WD (see [9] and Article 5.6). Using the DWS, a general expression of the quadratic form in (9.2.6) is (Zx) (t) x* (t)dt =
(x)
DWS~ ) (t, f) DWD (~) (t, f) dt df
(9.2.15)
with potential detection applications for non-linear TF processes [3].
9.2.3
Special Cases of Dispersive Time-Frequency Symbols
Depending on the warping function ~(b), the DWS may simplify to TF symbols that are matched to linear or non-linear (dispersive) TF structures. Note that the trivial case of ~(b) = b simplifies the DWS to the WS. Some other examples are described below.
387
Statistical Processing of Dispersive Systems and Signals
Hyperbolic W S and SF The hyperbolic Weyl symbol (HWS) is obtained as the warped version of the WS when ~(b) = ~ln(b) = ln b in the DWS formulation in (9.2.10) and (9.2.11). That is, for a linear time domain operator ,7 on L2(R +), t~ ' t > 0 " Thisyields H W S s ( t , f) - DWS '~) (t , f) - WS~/~(ln ,j w-1 (ln ( t r(l nt ) g ' t-/-)
HWSs(t, f)-
T s (te r
t J~
te -r
e -j2€162 d~
(9.2.16)
oo
where T j(t, T) is the kernel of ,7 [3]. The HWS is significant for processing systems and nonstationary signals with hyperbolic TF characteristics. Specifically, it preserves hyperbolic IF changes on a random process x(t) y ( t ) -- x ( t ) e j27r/3 In ( ~ )
=~
HWSn,~ (t, f) - HWSn~
t, f - 7
where y(t) - (Ad~,ln)~-4,x)(t) - (7-l~ x)(t) is the hyperbolic IF shift operator (obtained when ~(b) = In b in (9.2.11)). The HWS also satisfies the scale covariance property in (9.2.4) since ~- -l n ) _ Cer [3]. The HWS satisfies the unitarity property in (9.2.14) with ~(b) = In b and [p, q] = [0, oc). It is associated with the dual form of the Altes Q-distribution ([9] and Article 5.6). As an example of the HWS, if the output of a system ,7 is the scale convolution of an input signal x(t) and some function g(t), then the HWS in (9.2.16) is the Mellin transform of g(t), i.e. H W S j ( t , f) - focx~ g(T)-~1 e-j27rtf In ( ~ ) dy. This is intuitive as the Mellin is a natural transform for scale operations. For comparison, the WS in (9.2.1) , W S j ( t , f ) - f o t-T~2 v~ g ( ttrt_~/2 + r / 2 ) e -j2~Tf dT, of the same operator ,7 is difficult to interpret. The hyperbolic spreading function (HSF) is obtained from (9.2.12) when ~(b) = -1 (try, ~/tr) yielding [3] ~ln(b) - - l n b as H S F j ( r , ~ ) - DSF~'n)(~ , /3) - - SF~42~ln,J~ln HSFj(~,/3) - fro~
Tj(ter
te-r
e-J2"Zln(~)dt
It is related to the HWS using a modified Fourier transform and a Mellin transform as HSFs((,/3) - f_o~ f ~ HWSs(t,f)eJ2~r e_J2~Zln(~)dtdf [3]. The HSF provides an alternative interpretation of the operator output as a weighted superposition of hyperbolic IF shifted and scale changed versions of the input signal where the weight is the HSF, i.e.,
(yz)(t) - / ?
H S F j (~,/3) e -j€162 (TlZ Cer x)(t) d~ d/3, t > O . oo
oo
It is useful for analyzing systems characterized by hyperbolic IF shifts and scale changes. The hyperbolic version of the quadratic form in (9.2.15) with ~(b) = In b is useful in detector formulations when signals in noise have hyperbolic TF characteristics [3].
388
Chapter 9:
Random Processes and Noise Analysis
~C~(b) - sgn(b)ibl ~ and v~(t) d ~ ( K )t , the DWS in (9.2.9) and the DSF in (9.2.12) simplify, respectively, to the nth power WS (PWS) and the t~th power SF (PSF) for 12 E L2(R)
Power WS and SF When ~(b)
-
-
PSF (~) (~,/3) - DSF~ ") (~, 13) - SFw,. Lw~,: ( t ~ , fi/t~) where
-
x
t )
It,v,<(t,
2
1 t --1/2 9 (m))l
The PWS preserves power IF shifts on a random process x(t), i.e.
where y(t) - ( M ( f ~ ) x ) ( t ) - (P~ x)(t) is the power IF shift operator (the operator in (9.2.11) when ~(b) = ~(b)). The PWS also preserves scale changes since warped scale covariance simplifies to the scale covariance in (9.2.4), i.e. C(f~) - Cf~/~(a). The corresponding operator output can be interpreted as a weighted superposition of power IF shifted and power warped time-shifted versions of the input signal. Also, the PWS is unitary as it satisfies (9.2.14) and it is associated with the power WD ( [9] and Articles 5.6 & 15.3) in the quadratic form in (9.2.15) when ~(b) = ~(b) [3].
Exponential WS and SF For a linear operator s defined on L;(R), the exponential WS, EWSL(t, f) - WSw~L~v/1 (tre t/tr, f e - t / t r ) , and the exponential SF, ESFL(~, fi) = SF~v~L~v/1 (trY, ~/tr), are given as exponentially warped versions of the narrowband WS and SF, respectively. Here, ()4;~ x) (t) - X(tr l n ( Kt ) ) ~ r / t , t > 0. The EWS and the ESF are obtained from the DWS in (9.2.9) and the DSF in (9.2.12), respectively, when ~(b) = e b. The EWS preserves exponential IF shifts on a random process x(t),
y(t) - x(t) d 2~B ~ ' / ~
=~
E W S % (t, f) - E W S ~ , (t, f - ~ e t/t~ Its)
where y(t) - (7:)(~) x)(t) - ($~ x)(t). Also, the EWS preserves constant time shifts in (9.2.2) since C(~) - S t r in a, is unitary, and satisfies the quadratic form in (9.2.15) with association with the exponential WD (see [9] and Article 5.6).
9.2.4
Analysis Application Examples
When a process or system has distinct TF characteristics in a particular application, it is important to choose an adequate analysis tool. As shown next, a dispersive WS produces an ideally localized spectrum of a process when they both have similar dispersive TF characteristics. Fig. 9.2.1 demonstrates the advantage of the HWS over the WS when used to analyze signals with hyperbolic TF characteristics. The
389
Statistical Processing o f Dispersive S y s t e m s and Signals
Fig. 9.2.1: (a) Narrowband Weyl symbol, and (b) hyperbolic Weyl symbol of a random process consisting of signal components with random amplitude and hyperbolic instantaneous frequency.
deterministic signal components are weighted by random, zero mean amplitudes, i.e x(t) [ 3}-~,~=1amsm(t) ] w(t) where w(t) is a shifted Hanning window, and am 9
-
are uncorrelated random variables with zero means and constant variances a 2 am
9
Also, sin(t) - -~t ej47rrnIn (t-tj), t > 0, is a deterministic hyperbolic FM signal in time, with hyperbolic IF (see [9] and Article 5.6). For the ideal case without windowing, the theoretical HWS of the process 2(t) 3 Em--1 amSm(t) is -
HWST~ (t, f ) -
E m= l
E{lam
} HWST~m (t, f ) -
-
~- E
a2am (~ f -
, t>O.
rn= l
It consists of three Dirac delta functions centered along the hyperbolae f = 2m/t, corresponding to the hyperbolic IF of sin(t), m = 1, 2, 3. Both the WS and the HWS produce no outer cross terms between neighboring hyperbolae since the random amplitudes am are uncorrelated. However, the WS in Fig. 9.2.1(a) has inner interference terms inside each hyperbola since it does not match the hyperbolic TF characteristics of the process. The HWS in Fig. 9.2.1(b) is an ideally concentrated T F R without any interference terms as it is matched to the process. Fig. 9.2.2 demonstrates the advantage of the n = 3 power WS over the narrowband WS when the analysis random process is a power FM signal x(t) = a v e ( t ) eJ2~Z~(~ ) (also with ~ - 3) [see [9] and Articles 5.6 & 15.3]. The power WS in Fig. 9.2.2(b) results in an ideal T F R for this process since, unlike the WS in Fig. 9.2.2(a), it matches the signal's power TF characteristics without any inner interference.
Chapter 9:
390
Random Processes and Noise Analysis
Fig. 9.2.2: (a) Narrowband Weyl symbol, and (b) power Weyl symbol of a random processconsisting of a signal component with random amplitude and power instantaneous frequency. 9.2.5
Summary
and Conclusions
This article discussed the use of TF symbols for the statistical signal processing of linear time-varying systems and nonstationary random processes. These symbols form the time-varying counterparts of transfer functions for systems and of power spectral densities for random processes. The Weyl symbol has been successfully used for narrowband systems with constant TF structures whereas the P0-Weyl symbol has been used for wideband systems with either constant or hyperbolic TF characteristics. For systems with non-linear dispersive structures, new symbols were presented for appropriate matched processing. For example, such non-linear TF symbols would be suitable for analyzing tetherless underwater communication systems in dispersive mediums. References
[1] W. Kozek, "On the transfer function calculus for underspread LTV channels," IEEE Trans. Signal Processing, vol. 45, pp. 219-223, January 1997. [2] R. G. Shenoy and T. W. Parks, "The Weyl correspondence and time-frequency analysis," IEEE Trans. Signal Processing, vol. 42, pp. 318-331, February 1994. [3] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels, "New concepts in narrowband and wideband Weyl correspondence time-frequency techniques," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'98), vol. 3, pp. 1573-1576, Seattle, 12-15 May 1998. [4] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels, "Classes of smoothed Weyl symbols," IEEE Signal Processing Letters, vol. 7, pp. 186-188, July 2000.
Statistical Processing of Dispersive Systems and Signals
391
[5] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels, "A wideband time-frequency Weyl symbol and its generalization," in Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pp. 29-32, Pittsburgh, PA, 6-9 October 1998.
[6] A. M. Sayeed and B. Aazhang, "Joint multipath-Doppler diversity in mobile wireless communications," IEEE Trans. Communications, vol. 47, pp. 123-132, January 1999. [7] P. Flandrin, "A time-frequency formulation of optimum detection," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 36, pp. 1377-1384, September 1988.
[8] G. Matz and F. Hlawatsch, "Time-frequency formulation and design of optimal detectors," in Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ Time-Scale Analysis, pp. 213-216, Paris, 18-21 June 1996. [9] A. Papandreou-Suppappola, R. L. Murray, B. G. Iem, and G. F. Boudreaux-Bartels, "Group delay shift covariant quadratic time-frequency representations," IEEE Trans. Signal Processing, vol. 49, pp. 2549-2564, November 2001. [10] J. P. Sessarego, J. Sageloli, P. Flandrin, and M. Zakharia, "Time-frequency WignerVille analysis of echoes scattered by a spherical shell," in Wavelets: Time-frequency Methods and Phase Space (J.-M. Combes, A. Grossmann, and P. Tchamitchian, eds.), pp. 147-153, Springer, 1989. Proc. of the Internat. Conf., Marseille, 14-18 December, 1987. 315pp. [11] R. G. Baraniuk and D. L. Jones, "Unitary equivalence: A new twist on signal processing," IEEE Trans. Signal Processing, vol. 43, pp. 2269-2282, October 1995.
Chapter 9:
392
9.3
Random Processes and Noise Analysis
ROBUST TIME-FREQUENCY DISTRIBUTIONS~
Study of the additive Gaussian noise influence on time-frequency (TF) distributions is an important issue (see Article 9.1). However, in many practical applications, especially in communications, signals are disturbed by a kind of impulse noise. These noises are commonly modeled by heavy-tailed (long-tailed) probability density functions (pdfs) [1]. It is well known that the conventional TF distributions are quite sensitive to this kind of noise, which is able to destroy sensitive signal information. The minimax Huber M-estimates [2] can be applied in order to design the periodogram and TF distributions, robust with respect to the impulse noise. For nonstationary signals, the robust TF distributions are developed as an extension of the robust M-estimation approach.
9.3.1
Robust Spectrogram
The standard short-time Fourier transform (STFT) at a given point (t, f), introduced in Section 2.3.1, can be alternatively defined as a solution of the following optimization problem:
Fz(t, f) - arg {mimnI(t , f ,m) } ,
(9.3.1)
N/2-1
I(t, f, m) -
E
w(nAt)F(e(t, f , n)).
(9.3.2)
n=-N/2
Here, the loss function is given as F(e) - ] e l 2, w(nAt) is a window function and At is a sampling interval. The error function has the form:
e(t, f, n) - z(t + nAt)e -j2~fnAt - m,
(9.3.3)
where m is a complex-valued optimization parameter in (9.3.1). The error function can be considered as a residuum expressing the "similarity" between the signal and a given harmonic exp(j27rfnAt). The solution of (9.3.1) easily follows from
OI(t,f,m) Om* =0
(9.3.4)
in the form of the well-known standard STFT (cf. Section 6.5.5)"
Fz(t, f) -
1
N/2-1
E
aw n=-N/2
w(nAt)z(t + nAt)e -j2~fnAt,
(9.3.5)
~ V l a d i m i r K a t k o v n i k , Department of Mechatronics, Kwangju Institute of Science and Technology (katkov@kjist.ac.kr); Igor Djurovid, Department of Mechanical and System Engineering, Kyoto institute of Technology, Kyoto 606-8585, Japan (igordj@serverl.cis.cg.ac.yu), on leave from the University of Montenegro; LJubi~a Stankovid, Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (1.stankovic@ieee.org). Reviewer: S. Stankovid.
Robust Time-Frequency Distributions
393
where
N/2-1
aw-
E
w(nAt).
(9.3.6)
n=-N/2
The corresponding spectrogram is defined by
Sz(t, f) = ]Fz(t, f)l 2.
(9.3.7)
The maximum likelihood (ML) approach can be used for selection of the appropriate loss function F(e) if the pdf p(e) of the noise is known. The ML approach suggests the loss function F(e) ~ - l o g p ( e ) . For example, the loss function F(e) = le] 2 gives the standard STFT, as the ML estimate of spectra for signals corrupted with the Gaussian noise, p(e) ~ exp(-]e]2). The standard S T F T produces poor results for signals corrupted by impulse noise. Additionally, in many cases the ML estimates are quite sensitive to deviations from the parametric model and the hypothetical distribution. Even a slight deviation from the hypothesis can result in a strong degradation of the ML estimate. The minimax robust approach has been developed in statistics as an alternative to the conventional ML in order to decrease the ML estimates sensitivity, and to improve the efficiency in an environment with the heavy-tailed pdfs. The loss function F(e) = le[ = V/~ 2{e} + ~2{e}
(9.3.8)
is recommended by the robust estimation theory for a wide class of heavy-tailed pdfs. It is worth noting that the loss function
f(~) -I~{~}1 + I~{e}l
(9.3.9)
is the ML selection for the Laplacian distribution of independent real and imaginary parts of the complex valued noise. Nonquadratic loss functions in (9.3.1) can improve filtering properties for impulse noises. Namely, in [3,4] it is proved that there is a natural link between the problem of spectra resistance to the impulse noise and the minimax Huber's estimation theory. It has been shown that the loss function derived in this theory could be applied to the design of a new class of robust spectra, inheriting properties of strong resistance to impulse noises. In particular, the robust M - S T F T has been derived by using the absolute error loss function F(e) = ]e] in (9.3.1)-(9.3.4) [3]. It is a solution of the nonlinear equation: N/2-1
1 Fz(t, f) - aw(t, f)
E
d(t, f,n)z(t + nAt)e -j2~/nAt,
(9.3.10)
n=-N/2
where:
d(t,f,n) -
w(nAt) Iz(t + n A t ) e -j2 f t - Fz(t, f)]'
(9.3.11)
Chapter 9:
394
and
Random Processes and Noise Analysis
N/2-1
aw(t, f) -
E
d(t, f , n).
(9.3.12)
n---N~2
If real and imaginary parts of the additive noise are independent, the statistically optimal robust estimation theory requires replacement of (9.3.1) with [4]: (9.3.13) N/2-1
II(t, f, m) -
E
w(nAt)[F(~{el }) + F(~{el})]
(9.3.14)
n---N~2
where el is an error function of the form
ez (t, f, n) = z(t + n a t ) -
me j2~/n~t.
(9.3.15)
For F(e) = ]el, the robust STFT (9.3.13) can be presented as a solution of (9.3.10), where d(t, f, n) is given by:
I~{el(t,f,n)}l + I~{el(t,f,n)}J
d(t, f, n) - w(nAt) i~--~e-z(~i-f, n-~2 +
]~{el (t, f, n)}l 2"
(9.3.16)
The robust spectrogram defined in the form
Sz (t, f) = I1 (t, f, O) - I1 (t, f, Fz (t, f))
(9.3.17)
is called the residual spectrogram, in order to distinguish it from the amplitude spectrogram (9.3.7). For the quadratic loss function F(e) the residual spectrogram (9.3.17) coincides with the standard amplitude spectrogram (9.3.7). In [4] it has been shown that, in a heavy-tailed noise environment, the residual robust spectrogram performs better than its amplitude counterpart. The accuracy analysis of the robust spectrograms, as well as a discussion on further details on the minimax approach, can be found in [3, 4].
9.3.2 9.3.2.1
Realization of the Robust STFT Iterative Procedure
The expression (9.3.10) includes Fz(t, f) on the right hand side. Therefore, to get the robust STFT we have to solve a nonlinear equation of the form x = f(x). Here, we will use the fixed point iterative algorithm xi = f(xi-1), with the stopping rule I x i - X~-ll/Ixil < 7, where 7/defines the solution precision. This procedure, applied to (9.3.10), can be summarized as follows. S t e p (0)" Calculate the standard STFT (9.3.5)" F (~ (t, f) - Fz(t, f), and i -- 0.
395
Robust Time-Frequency Distributions
S t e p (i)" Set i - i
+ 1. Calculate
d(i)(t,f,n)
for
F(i-1)(t,f)
determined from
(9.3.11) or (9.3.16). Calculate F (i) (t, f ) as:
F(zi) (t, f ) -
E
N/2-1
1 N/2-1 d(i)(t f,n) n=-N/2
~N/2 n=-
'
d (i) (t, f,
n)z(t + nAt)e -j2crfnAt.
(9.3.18)
S t e p (ii): If the relative absolute difference between two iterations is smaller than q: [Fz(~) (t, f ) - F (i-1) (t, f)[ _< v,
(9.3.19)
[F(z~)(t, f)] then the robust S T F T is obtained as
9.3.2.2
Fz(t, f ) - F (0 (t, f).
Vector Filter Approach
Note that the standard S T F T (9.3.5) can be treated as an estimate of the mean, calculated over a set of complex-valued observations: E (t'f) -
{z(t + nAt)e-J2~fnAt'n e [-N/2, N/2)}.
(9.3.20)
If we restrict possible values of m in (9.3.1) to the set E (t'f), the vector filter concept [5-7] can be applied to get a simple approximation of the robust estimate of the STFT. Here, the coordinates of vector-valued variable are real and imaginary parts of z(t + nAt)e -j2~fnAt. The vector estimate of the S T F T is defined as Fz (t, f) = m, where m E E (t'f), and for all k c [ - N / 2 , N/2) the following inequality holds: N/2-1
F(lm- z(t
+
I) ___
n=-N/2 N/2-1
F(]z(t + kAt)
- z(t +
(9.3.21)
n=-N/2
For F(e) = [el this estimate is called the vector median. The marginal median can be used for independent estimation of real and imaginary parts of Fz(t, f). It results in
~{Fz(t, f)} ~{Fz(t, f)}
= median {~{z(t + = median {~{z(t +
nAt)e-J27r:fnAt}:n C [-N/2, N/2)}, nAt)e-J2~fnAt}:n C [-N/2, N/2)}(9.3.22)
The separate estimation of the real and imaginary parts of Fz(t, f) assumes independence of the real and imaginary parts of z(t + nAt)e -j2~fnAt, what in general does not hold here. However, in numerous experiments the accuracy of the median estimates (9.3.21) and (9.3.22) is of the same order. A simplicity of calculation is the advantage of these median estimates over the iterative procedures.
396
Chapter 9:
9.3.3
Random Processes and Noise Analysis
Robust Wigner Distribution
The standard (windowed) Wigner distribution (WD) of a discrete-time signal is defined as 1
Wz(t, f) -
N/2
E
aw n=-N/2
w(nAt)z(t + n A t ) z * ( t - nAt)e -j47rSnAt,
(9.3.23)
with the normalization factor N/2
aw-
E
w(nAt)
(9.3.24)
n=-N/2
(cf. Article 6.1). It can be interpreted as a solution of the problem
Wz(t,f) - a r g { m i m n J ( t , f , m ) } ,
(9.3.25)
N/2
Y(t, f , m) -
E
w(nAt)F(Iz(t + nAt)z*(t - nAt)e -j4~ynAt - ml),
(9.3.26)
n=-N/2
where f ( e ) - l e l 2. For the loss function f ( e ) - ] e ] , solution of (9.3.25)-(9.3.26)is a WD robust to the impulse noise. It can be obtained as a solution of the nonlinear equation [8] 1
Wz(t, f) -
awe
(t, f)
N/2
E
d(t, f, n)z(t + nAt)z*(t - nAt)e -ja~ynAt,
(9.3.27)
n---N/2
with
d(t, f, n) -
( zxt)
Iz(t + n A t ) z * ( t - nAt)e-Ja~S nat - Wz(t, f)l'
(9.3.2s)
N/2
awe(t, f) -
~
d(t, f , n).
n=-N/2
An iterative procedure similar to the one described for the robust STFT can be used to find Wz(t, f) from (9.3.27)-(9.3.28). 9.3.3.1
Properties of the Robust WD
(1) The robust WD is real-valued for real and symmetric window function:
W~(t, f)
-
1 N/2 w*(nAt)z*(t + n A t ) z ( t - nAt)e j4~fnAt a~e(t, f) n=~N/2 Iz*(t + n A t ) z ( t - nAt)eJ4~Y nat - Wz(t , f)l
397
Robust Time-Frequency Distributions
Fig. 9.3.1: TF representations of signal corrupted by impulse noise: (a) Standard spectrogram; (b) Ro-
bust spectrogram; (c) Standard WD; (d) Robust WD.
1 = awe(t,f)
N/2
w* ( - n A t ) z ( t + nAt)z* ( t - nAt)e -j4rfnAt Z Iz(t + n A t ) z * ( t - nAt)e-J4~Y n a t - W z ( t , f ) l = Wz(t,f). n=-N/2
(2) The robust WD is T F invariant.
(9.3.29) For signal y(t) = z ( t - to)d 2~y~ we get
Wy(t, f) = Wz(t - to, f - fo).
(3) For linear FM signals z(t) = exp(jat2/2 + jbt), when w ( n A t ) i s very wide window, the WD is an almost ideally concentrated T F distribution. 9.3.3.2
Median W D
For rectangular window, the standard WD can be treated as an estimate of the mean, calculated over a set of complex-valued observations G ={z(t + n A t ) z * ( t - n A t ) e - J 4 r f n A t : n C [ - N / 2 , N/2]},
(9.3.30)
398
Chapter 9:
Random Processes and Noise Analysis
i.e., 1
Wz(t, f) = N + 1
N/2
E
z(t + nAt)z*(t - nAt)e -j4€
(9.3.31)
n=-N/2
From (9.3.29) follows that the robust WD is real-valued, thus the minimization of J(t, f, m) can be done with respect to the real part of z ( t + n A t ) z * ( t - n A t ) e -j4~fnAt only. A form of the robust WD, the median WD, can be introduced as:
Wz(t, f) = median {N{z(t + nAt)z*(t - nAt)e-J4~ynAt}:n e [-N/2, N/2]}. (9.3.32) Generally, it can be shown that any robust TF distribution, obtained by using the Hermitian local auto-correlation function (LAF), Rz(t, nAt) = R : ( t , - n A t ) in the minimization, is real-valued. In the WD case this condition is satisfied, since Rz(t, nAt) = z(t + nAt)z*(t - nat). For a general quadratic distribution from the Cohen class with a Hermitian LAF, the proposed robust version reads
pc(t, f) - median {~{Rz(t, nAt)e-J4~lnat}'n E [-N/2, N/2]},
(9.3.33)
where Rz(t, nat) includes the kernel in time-lag domain. Note, that for an input Gaussian noise the resulting noise in the WD has both Gaussian and impulse component, due to the WD's quadratic nature. Thus, as it is shown in [9], robust WD forms can improve performance of the standard WD, even in a high Gaussian input noise environment.
9.3.4
Example
Consider the nonstationary FM signal:
z(t) = exp(j204.87rt]tl) ,
(9.3.34)
corrupted with a high amount of the heavy-tailed noise: r
- 0.5(~3(t) + jr
(9.3.35)
where ~(t), i = 1,2 are mutually independent Gaussian white noises A/'(0, 1). We consider the interval t E [-7/8, 7/8] with a sampling rate At = 1/512 for spectrograms, and At -- 1/1024 for WDs. The rectangular window width is N = 256 in all cases. The standard spectrogram and the WD (Figs. 9.3.1(a),(c)) arecalculated according to (9.3.5) and (9.3.23). The robust spectrogram (Fig. 9.3.1(b)) is calculated by using iterative procedure (9.3.18)-(9.3.19). In this case, similar results would be produced by residual spectrogram (9.3.13)-(9.3.17), vector median (9.3.21), and marginal median (9.3.22). The robust WD (Fig. 9.3.1(d)) is calculated by using expression (9.3.32) for the considered TF point. It can be concluded from Fig. 9.3.1 that the robust spectrogram and the robust WD filter the heavy-tailed noise significantly better than the standard spectrogram and the standard WD. Note that the standard and the robust WD exhibit higher TF resolution in comparison with the corresponding spectrograms.
Robust Time-Frequency Distributions
9.3.5
399
Summary and Conclusions
The T F distributions are defined within the Huber robust statistics framework. The loss function F(e) = lel gives distributions robust to the impulse noise influence. They can be realized by using: the iterative procedures, the vector median, or the marginal median approach. All calculation procedures produce accuracy of the same order of magnitude. Articles 6.1 and 6.5 further discuss the definition and computation of discretetime TFDs.
References [1] S. A. Kassam, Signal detection in non-Gaussian noise. Springer, 1988. [2] P. J. Huber, Robust statistics. Wiley, 1981. [3] V. Katkovnik, "Robust M-periodogram," IEEE Trans. Signal Processing, vol. 46, pp. 3104-3109, November 1998. [4] V. Katkovnik, "Robust M-estimates of the frequency and amplitude of a complexvalued harmonic," Signal Processing, vol. 77, pp. 71-84, August 1999. [5] J. Astola, P. Haavisto, and Y. Neuvo, "Vector median filters," Proc. IEEE, vol. 78, pp. 678-689, April 1990. [6] I. Djurovi5, V. Katkovnik, and L. Stankovid, "Median filter based realizations of the robust time-frequency distributions," Signal Processing, vol. 81, pp. 1771-1776, August 2001. [7] I. Pitas and A. N. Venetsanopoulos, Nonlinear digital filters: Principles and applications. Kluwer, 1990. [8] I. Djurovid and L. Stankovid, "Robust Wigner distribution with application to the instantaneous frequency estimation," IEEE Trans. Signal Processing, vol. 49, pp. 29852993, December 2001. [9] I. Djurovi5, L. Stankovid, and J. F. BShme, "Estimates of the Wigner distribution in Gaussian noise environment," Archiv fiir Elektronik und Ubertragungstechnik (Internat. Y. of Electronics ~ Communications), vol. 56, no. 5, pp. 337-340, 2002.
400
9.4
9.4.1
Chapter g:
Random Processes and Noise Analysis
TIME-VARYING POWER SPECTRA OF NONSTATIONARY R A N D O M PROCESSES o Nonstationary Random Processes
The second-order statistics of a (generally nonstationary) random process I x(t) are characterized by the correlation function rx(t,t') = E { x ( t ) x * ( t ' ) } (with E{-} denoting expectation). In the special case of a (wide-sense) stationary random process, the correlation function is of the form rx(t, t') = ~x(t - t') and the Fourier transform of ~x (7),
Px(f) -
/?
~x(7) e -j2~/" d7 >_ O,
(9.4.1)
oo
is known as the power spectral density (PSD) [1]. The PSD describes the distribution of the process' mean power over frequency f and is extremely useful in statistical signal processing. The time-frequency dual of stationary processes is given by white processes with correlation functions of the form rz(t, t') = q~(t)5(t - t'). Here, the mean instantaneous intensity qx(t) >_ 0 is the time-frequency dual of the PSD. In many applications, the random signals under analysis are nonstationary and thus do not possess a PSD. Various extensions of the PSD to the nonstationary case have been proposed, such as the generalized Wigner-Ville spectrum [2-7] and the generalized evolutionary spectrum [7, 8]. In this article, we will briefly discuss these "time-varying power spectra" and show that they yield satisfactory descriptions for the important class of underspread nonstationary processes.
9.4.2
The Generalized Wigner-Ville Spectrum
The generalized Wigner-Ville spectrum (GWVS) of a nonstationary process x(t) is defined as [2-7] W: ~) (t, f) ~-
//
r (~) (t, T) e-J2=/'d7
oo
with t+
,
(9.4.2)
where c~ is a real-valued parameter. The GWVS equals the generalized Weyl symbol (see Article 4.7) of the correlation operator Rx (the linear operator whose kernel is the correlation function r x ( t , t ' ) = E{x(t)x*(t')}) and, under mild assumptions, it equals the expectation of the generalized Wigner distribution [5] of x(t). For c~ = 0, ~ G. M a t z and F. H l a w a t s c h , Institute of Communications and RadioFrequency Engineering, Vienna University of Technology, Gusshausstrasse 25/389, A-1040 Vienna, Austria (email: g.matz@ieee.org, fhlawats@pop.tuwien.ac.at, web: http://www.nt.tuwien.ac.at/dspgroup/time.html). Reviewers: M. Amin and A. PapandreouSuppappola. 1in what follows, all random processes are assumed to be real or circular complex as well as zero-mean.
401
Time- Varying Power Spectra of Nonstationary Random Processes
the GWVS becomes the ordinary Wigner-Ville spectrum, and for c~ = 1/2 it reduces to the Rihaczek spectrum [2-7,9]"
W~ (t,f) -
rz t + 7 , t -
/?
dT,
(X)
(t, f)
-
t -
.
O0
The GWVS W~(~) (t, f ) is a complete characterization of the second-order statistics of x(t) since the correlation function rx(t, t') can be recovered from it. Integration of the GWVS gives the marginal properties W (~) (t, f) dt -
E{[X(f)12},
O<3
I? W~~)(t,
f)df
-
E{lx(t)12},
O(3
provided that the expectations on the right-hand sides exist. In this sense, the GWVS can be considered as a time-frequency (TF) distribution of the mean energy m(0) of x(t). However, in general the GWVS is not real-valued; for c~ - 0, W x (t, f ) is real-valued though possibly not everywhere nonnegative. For further interesting properties of the GWVS, see [2-7, 9]. We next discuss the GWVS of three fundamental types of processes. 9 The GWVS of a stationary process with correlation function rx(t,t')
=
?=x(t- t') reduces to the PSD P~(f) for all t, i.e., W~(~) (t, f) - Px(f). 9 The GWVS of a (generally nonstationary) white process with correlation function rx(t, t') - qx(t)a(t- t') reduces to the mean instantaneous intensity qx(t) for all f, i.e., W (~) (t, f ) -
qx(t).
9 The GWVS of a stationary white process with correlation function r~(t, t') -
5 ( t - t') is given by W : c~)(t, f ) - r/ (i.e., constant mean energy distribution over the entire T F plane). These results show t h a t the GWVS is consistent with the PSD of stationary processes and the mean instantaneous intensity of white processes. The GWVS will be further considered in Section 9.4.6. Before that, we consider an alternative definition of a "time-varying power spectrum" in the next section.
9.4.3
The Generalized Evolutionary Spectrum
The PSD of a stationary random process x(t) can alternatively be defined using an innovations system representation. Here, x(t) is viewed as the output of a linear, time-invariant system H with impulse response h(T) (the innovations system) that is driven by stationary white noise n(t) with PSD Pn(f) = 1, i.e., x(t) = ( H n ) ( t ) = f _ ~ h(7-)n(t - T)dT. The PSD of x(t) can then be written as
Pz(f)-
IH(f)l 2,
(9.4.3)
402
Chapter 9:
RandomProcesses and Noise AnalTsis
where H ( f ) = f-~oo h(T)e -j2rf" dT is the transfer function (frequency response) of H.
A similar innovations system representation is also possible in the nonstationary case. The innovations system H of a nonstationary random process x(t) is a linear, time-varying system defined by H H + = Rx (here, the superscript + denotes the adjoint [101). Note that H is not uniquely defined; indeed, all innovations systems can be written as H = H p V where Hp is the positive (semi-) definite [10] innovations system (which is unique) and U is a linear operator satisfying U U + = I [8]. In analogy to the PSD expression in (9.4.3), the generalized evolutionary spectrum (GES) of a nonstationary process x(t) is now defined as [7,8] G (~) (t, f) -~ ]L(H ~) (t, f)]2.
(9.4.4)
Here, L(Ha) (t, f) is the generalized Weyl symbol (see Article 4.7) of an innovations system H of x(t), i.e.,
L(Ha) (t, f) A
h t + -~ - ~ v, t -
-~ + a T
e
dT
(9.4.5)
(x)
where h(t,t') is the kernel of H. Note that the nonuniqueness of H implies a corresponding nonuniqueness of the GES. For c~ : 1/2, c~ : - 1 / 2 , and a : 0, the GES reduces to the ordinary evolutionary spectrum 2 [11, 12], the transitory evolutionary spectrum [8,13], and the Weyl spectrum [8], respectively. In contrast to the GWVS, the GES is a nonnegative real-valued function. However, it is not a complete second-order description of x(t) since in general the correlation function rx(t, t') cannot be recovered from it. For c~ : 4-1/2 and normal innovations system (i.e., H satisfies H H + : H + H [10]; note, in particular, that Hp is always normal), the GES satisfies the marginal properties, i.e.,
/?
G (+1/2) (t, f) dt - E{]X(f)12},
(x)
/5
G (:i=1/2) (t, f) df - E{]x(t)]2}.
oo
Other properties of the GES are discussed in [7, 8]. Next, we consider the GES of our three fundamental types of processes, assuming that the positive (semi-) definite innovations system Hp is used in the GES definition (9.4.4). 9 For a stationary process with PSD Pz(f), Hp is time-invariant with frequency response Hp(f) = v/P~(f). Here, the GES reduces to the PSD P~(f) for all t, i.e., V(")(t, f ) -
Px(f).
9 For a (generally nonstationary) white process with mean instantaneous intensity qx(t), Hp is "frequency-invariant" with kernel hp(t, t') = v/qx(t) 6 ( t - t'). The GES here reduces to q~(t) for all f, i.e., G(~)(t, f) - qx(t). 2We note that Priestley's original definition of the evolutionary spectrum was based on a conceptually different approach using "oscillatory processes" [11, 12].
403
Time-Varying Power Spectra of Nonstationary Random Processes
9 For a stationary and white process with correlation function rx(t, t I) - rl 5 ( t t/), we have Hp - v ~ I with I the identity operator. Thus, the GES is given by G(~ ) (t, f) = rl. In Section 9.4.5, we shall consider conditions allowing the interpretation of the GWVS and GES as a "time-varying power spectrum." The formulation of these conditions will be based on a further TF representation of nonstationary processes, to be discussed next.
9.4.4
The Generalized Expected Ambiguity Function
The generalized ezpected ambiguity function (GEAF) is defined as [6-S]
r (~) (t, r)e -j2€
dt,
(x)
with r(~)(t, T) as in (9.4.2). The interpretation of the GEAF is quite different from that of a "time-varying power spectrum:" For a given frequency lag p and a given time lag T, the GEAF A(~)(t~, T) quantifies the statistical correlations of all process components separated in frequency by u and in time by T [6]. Hence, the extension of _A(~)(u, ~-) about the origin of the (~, T) plane indicates the amount of "TF correlations" of x(t). In particular, if fi~(a)(u, 7) extends far in the u direction, this indicates that x(t) has a large spectral correlation width (i.e., x(t) is highly nonstationary), and if fi~(a)(u, T) extends far in the 7 direction, this indicates that x(t) has a large temporal correlation width. The GEAF equals the generalized spreading function (see Article 4.7) of the correlation operator Rx. Like the GWVS, the GEAF is a complete second-order statistic. GEAFs with different a values differ merely by a phase factor, i.e.,
Therefore, the GEAF magnitude is independent of a, Iris(a1)(-, T ) [ - Iris(a2)(-, r)l, and we may thus simply write lAb(r, u)l. GWVS and GEAF are related by a 2-D Fourier transform,
W(~ ) (t, f ) - /: /: A(~) (~,,T)e -j2€ oo
oo
d, d'r ;
(9.4.6)
this extends the Wiener-Khintchine relation (9.4.1) to the nonstationary case. Again, it is instructive to consider our three process types (see Fig. 9.4.1; this figure should be compared to that in Section 4.7.3)" 9 The GEAF of a stationary process x(t) with correlation function r~(t,t') ~ x ( t - t')is given by .A(a)(u, ~ - ) - 5(u)~x(T) (i.e., only temporal correlations that are characterized by ~x(T)).
Chapter 9:
404
Random Processes and Noise Analysis
....._ w
Y
(a)
/2
l]
(b)
(c)
Fig. 9.4.1" Schematic representation of the GEAF magnitude of some (classes of) random processes: (a) stationary process, (b) white process, (c) stationary white process.
9 The GEAF of a (generally nonstationary) white process x(t) with correlation function rx(t,t') - q x ( t ) 5 ( t - t') is given by ft.(a)(,, 7-) = Qx(u)5(7), where Q~(u) is the Fourier transform of the mean instantaneous intensity qx(t) (i.e., only spectral correlations that are characterized by Q~(u)).
9 The GEAF of a stationary and white process x(t) with correlation function rx(t, t') - ~ 5(t - t') is given by A(~) (~, 7) - 775(~,)5(7") (i.e., neither temporal nor spectral correlations).
9.4.5
Underspread Processes
A nonstationary random process is said to be underspread if its GEAF is well concentrated about the origin of the (u, T) plane, thus implying a small "TF correlation width." In contrast, a process with large TF correlation width is termed overspread. We will see in Section 9.4.6 that the GWVS and GES of an underspread process are approximately equivalent and can be interpreted as "time-varying power spectra." There are two alternative mathematical definitions of underspread processes [6, 7]. The first one [6] assumes that the GEAF fi.(~)(~, T) is supported in a compact region Gx about the origin of the (~, 7) plane, i.e., I.~x(~, T)I -- 0 for (~, T) r Gx. Let ~x A max(~,r)e~. ItJl and Tx -~ max(v,r)e~x 171 denote the maximum frequency lag and time lag, respectively, for which the process x(t) features TF correlations. The TF correlation spread of x(t) is defined as az _n 4~xTz, which is the area of the rectangle [ - ~ , ~x] • [--T~, Tx] enclosing Gx. The process x(t) is considered underspread if az << 1 [6].
Time-Varying Power Spectra of Nonstationary Random Processes
405
F i g . 9.4.2" Schematic representation of the GEAF magnitude of various types of nonstationary processes: (a) underspread process with small /Tt(zl'l); (b) underspread process with small m(l'~176 (c) ,'chirpy" underspread process [7]" (d) quasi-stationary process (small m s(0' 1) ); (e) quasi-white process _(1,0) (small ,,~x ).
An alternative description of the GEAF's extension that avoids the assumption of compact GEAF support uses the normalized weighted GEAF integrals 3 [7]
(3o
oo
oo
oo
Here, r ~-) is a nonnegative weighting function satisfying r T) _> r 0) = 0 and penalizing G E A F contributions located away from the origin. Important obtained with the weighting special cases are the GEAF moments m (k'l) -~ m (r functions Ck,t(u, ~-) = I~lzlT-Ik with k, 1 C N0. A random process x(t) can now be considered underspread if suitable weighted G E A F integrals or moments are "small." Processes that are underspread in the compact-support sense considered previously are easily shown to be a special case of this extended, more flexible definition of underspread processes. Examples of underspread processes are illustrated in Fig. 9.4.2 (this figure should be compared to that in Section 4.7.4). We caution that the concept of underspread processes is not equivalent to that of quasi-stationary processes: indeed, a quasistationary process may be overspread if its temporal correlation width is very large. Finally, note that according to the Fourier transform relation (9.4.6), the GWVS of an underspread process is a smooth function.
9.4.6
Time-Varying Spectral Analysis of Underspread Processes
For underspread nonstationary processes, the GWVS and GES can be interpreted as "time-varying power spectra" that generalize the PSD of stationary processes and the mean instantaneous intensity of white processes. Indeed, small weighted G E A F _(k,l) integrals m(xr (or small moments ,,~x or a small T F correlation spread az) ensure the validity of the approximations described in what follows [6-8]. 3Further definitions of weighted G E A F integrals and moments can be found in [7].
406
Chapter 9:
Random Processes and Noise Analysis
Fig. 9.4.3: GWVS and GES of an underspread process x(t): (a) Wigner-Ville spectrum W (~ f), (b) real part of Rihaczek spectrum ~(1/2)(t, f), (c) Weyl spectrum G (0) (t, f), (d) evolutionary spectrum G (1/2)(t, f). In this and subsequent simulations, signal duration is 256 samples and normalized frequency ranges from -- 1/4 to 1/4.
Approximate equivalence. In general, the GWVS and GES of a given process x(t) may yield very different results which, moreover, may strongly depend on the parameter a used. However, for an underspread process x(t), all these results are approximately equal, i.e.,
w~(:')(t, f) ~ w~(::)(t, f), G(~')(t, f) ~ G(~)(t, f) , W (a~) (t, f) ~ G (a2) (t, f ) . Indeed, it can be shown [7] that the approximation error W.(a') (t, f ) is upper bounded as
IW~(a') (t, f ) - W..~(<~2)(t, S)l -< 27r lal-a2111A~II~
~()'~),
W..(a2) (t, f)
(9.4.7)
with [[Ax[[1 - f _ ~ f-~oo ]fi.z(u, 7)ldu d7. Thus, for an underspread system where m 0'1) is small, W(~l)(t, f) and W..(~2)(t, f) will be approximately equal as long as In1 - a 2 ] is not too large. Similar bounds can be developed for the approximation errors G ? ~) (t, f) - G ('~2) (t, f) and W.(el) (t, f) - G (a2) (t, f) [7]. We can conclude from these results that for an underspread process, the choice of a specific spectrum is not critical. An example is shown in Fig. 9.4.3. For this example, the maximum normalized differences between the spectra shown are f)l = 0.029). all around 0.03 (e.g. maxt,f [W..(~ (t, f) - a (~ (t, f)[/maxt,s ]W(~ A counterexample involving an overspread process is shown in Fig. 9.4.4. Here, the results obtained with the various spectra are seen to be dramatically different, and indeed the maximum normalized differences range from 1 to 8.5 (e.g.
IW..J
s)l
maxt,s IW..(1/2)(t, f) - G (1/2)(t, S)[/maxt,s = 2.13). it can be seen that all spectra contain oscillating components (so-called statistical cross-terms) which are indicative of TF correlations [7]. Such statistical cross-terms are reduced
Time-Varying Power Spectra of Nonstationary Random Processes
407
Fig. 9.4.4: GWVS and GES of an overspread process x(t): (a) Wigner-Ville spectrum ~(0)(t, f), (b) real part of Rihaczek spectrum W (1/2) (t, f), (c) Weyl spectrum G (0) (t, f), (d) evolutionary spectrum G 0/2) (t, f).
in extensions of the GWVS and GES that contain a TF smoothing [2-5, 7]. For underspread processes, on the other hand, a TF smoothing does not cause a big difference.
Approximate real-valuedness and positivity of the GWVS. The PSD of stationary processes and the mean instantaneous intensity of white processes are realvalued and nonnegative. This is also true for the GES of arbitrary processes. In contrast, the GWVS is real-valued only for a = 0 and generally not everywhere nonnegative. In the case of underspread processes, however, it can be shown [7] that the imaginary part of the GWVS is approximately zero and the real part of the GWVS is approximately nonnegative, i.e., {Wx(~) (t, f)} ~ 0,
Re {Wx(~) (t, f)} >~ 0.
Upper bounds on the associated approximation errors (similar to (9.4.7)) can again be provided [7]. As an example, we reconsider the underspread process from Fig. 9.4.3. The normalized maximum of the imaginary part of the Rihaczek spectrum (the real part is shown in Fig. 9.4.3(b))is maxt,l I.~{W(1/2)(t,f)}l/maxt,i IW(1/2)(t,f)l = 0.024 and the normalized maximum of the negative real part is maxt,l { -
R e { W (1/2) (t, f)}
} / m a x t , f IW(1/2> (t,
f)l
- 0.006.
Approximate input-output relations. If a stationary process x(t) with PSD Px(f) is passed through a time-invariant linear system with impulse response k(T) and transfer function K(f), the output y(t) = (x. k)(t) is also stationary and its PSD equals P~(f) = IK(/)I 2 P~(f). Similarly, the response y(t) = w(t)x(t) of a linear frequency-invariant system (see Article 4.7) to a white process x(t) with mean instantaneous intensity qx(t) is again white with qy(t) = Iw(t)] 2 q~(t). A similar input-output relation does not exist for a general nonstationary process x(t) that is passed through a general time-varying linear system K. However, for an underspread process that is passed through an underspread system (i.e., a time-varying linear system introducing only small TF shifts, see Article 4.7), one can show the
408
Chapter 9:
Random Processes and Noise Analysis
Fig. 9.4.5: Approximate input-output relation for the GWVS: (a) Wigner-Ville spectrum W (~ (t, f) of
input process x(t), (b) Weyl symbol L~ ) (t, f) of LTV system K, (c) Wigner-Ville spectrum W (0) (t, f) of filtered process y(t) = (Kx)(t), (d) approximation IL(K~ (t, f)12 W(o) (t, f).
following approximate input-output relations of the GWVS and GES:
G(y~)(t, f) ~ IL(K~)(t,f)l 2 G(~)(t, f), with y(t) - (Kx)(t). Note that the generalized Weyl symbol L(a)(t, f) of K (see (9.4.5)) takes the place of the transfer function K ( f ) or w(t). An example for the Wigner-Ville spectrum (GWVS with a = 0) is shown in Fig. 9.4.5. In this example, the normalized maximum approximation error is m a x t , / { lW " y (~ (t, f) IL~ ) (t, f)[ 2 --(o) --(o) (t, f)l - 0.017. Wx (t, f ) l } / m a x t , i IWy Discussion. The above approximations (more can be found in [6-8]) corroborate the interpretation of the GWVS and GES of underspread processes as time-varying power spectra. A mathematical underpinning of these approximations is provided by explicit upper bounds on the associated approximation errors [7]; these bounds involve the GEAF parameters rn (r , ~x _(k,z) or a= defined in Section 9.4.5. In the underspread case, these GEAF parameters are small and thus the approximations are guaranteed to be good. On the other hand, we caution that the approximations are not valid for overspread processes (cf. Fig. 9.4.4).
9.4.7
Summary and Conclusions
In this article, we have shown that for the practically important class of underspread processes (i.e., processes with small time-frequency correlations), the generalized Wigner-Ville spectrum and generalized evolutionary spectrum can be interpreted in a meaningful way as time-varying power spectra. Indeed, for underspread processes the generalized Wigner-Ville spectrum and the generalized evolutionary spectrum (approximately) satisfy desirable properties that any reasonable definition of a timevarying power spectrum would be expected to satisfy. We note that applications of the generalized Wigner-Ville spectrum in statistical signal processing are considered in Articles 12.1 and 12.4.
Time-Varying Power Spectra of Nonstationary Random Processes
409
References [1] A. Papoulis, Probability, Random Variables, and Stochastic Processes. McGraw-Hill, 3rd ed., 1991.
New York:
[2] M. G. Amin, "Time-frequency spectrum analysis and estimation for non-stationary random processes," in Time-Frequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 9, pp. 208-232, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [3] P. Flandrin, "Time-dependent spectra for nonstationary stochastic processes," in Time and Frequency Representation of Signals and Systems (G. Longo and B. Picinbono, eds.), pp. 69-124, Vienna: Springer, 1989. [4] P. Flandrin and W. Martin, "The Wigner-Ville spectrum of nonstationary random signals," in The Wigner Distribution--Theory and Applications in Signal Processing (W. Mecklenbr/~uker and F. Hlawatsch, eds.), pp. 211-267, Amsterdam: Elsevier, 1997. [5] P. Flandrin, Time-Frequency/Time-Scale Analysis. San Diego: Academic Press, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [6] W. Kozek, F. Hlawatsch, H. Kirchauer, and U. Trautwein, "Correlative time-frequency analysis and classification of nonstationary random processes," in Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~J Time-Scale Analysis, pp. 417-420, Philadelphia, PA, 25-28 October 1994. [7] G. Matz and F. Hlawatsch, "Time-varying spectra for underspread and overspread nonstationary processes," in Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, pp. 282-286, Pacific Grove, CA, 1-4 November 1998. [8] G. Matz, F. Hlawatsch, and W. Kozek, "Generalized evolutionary spectral analysis and the Weyl spectrum of nonstationary random processes," IEEE Trans. Signal Processing, vol. 45, pp. 1520-1534, June 1997. [9] W. Martin and P. Flandrin, "Wigner-Ville spectral analysis of nonstationary processes," IEEE Trans. Acoustics, Speech, 8J Signal Processing, vol. 33, pp. 1461-1470, December 1985. [10] A. W. Naylor and G. R. Sell, Linear Operator Theory in Engineering and Science. New York: Springer, 2nd ed., 1982. [11] M. B. Priestly, "Evolutionary spectra and non-stationary processes," J. Royal Statistical Soc. (Series B), vol. 27, no. 2, pp. 204-237, 1965. [12] M. B. Priestley, Spectral Analysis and Time SeriesmPart II. London: Academic Press, 1981. [13] C. S. Detka and A. El-Jaroudi, "The transitory evolutionary spectrum," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), vol. 4, pp. 289-292, Adelaide, Australia, 19-22 April 1994.
410
9.5
9.5.1
Chapter 9:
Random Processes and Noise Analysis
T I M E - F R E Q U E N C Y C H A R A C T E R I Z A T I O N OF R A N D O M T I M E - V A R Y I N G CHANNELS o
Time-Varying Channels
In many practical communication systems, the channel is modeled as linear but time-varying and random. Examples are the mobile radio, ionospheric, tropospheric, and underwater acoustic channels [1-7]. In this article, we will discuss time-frequency (TF) descriptions of both the channel (see also Article 4.7 for more details) and its second-order statistics. The input-output relation of a linear, time-varying, random channel H is
r(t)
=
(Hs)(t)
h(t, t')
-
dr' -
CXD
/?
s
r) s(t-r) dr,
(9.5.1)
(X)
where s(t) is the transmit signal, r(t) is the received signal, h(t, t') is the (random) kernel of H, and/~(t, T) = h(t, t - r) is the (random) impulse response of H. Two major physical phenomena underlying practical channels are multipath propagation and Doppler spreading. Multipath propagation (i.e., several different propagation paths from the transmitter to the receiver via various scattering objects) causes the received signal to consist of several delayed versions of the transmit signal. Doppler spreading is due to the movement of transmitter and/or receiver and/or scatterers; for a narrowband transmit signal s(t), it causes the multipath signals to be frequency-shifted. The received signal r(t) thus consists of several TF shifted (i.e., delayed and modulated) versions of the transmit signal s(t) [1-4], N -
k=l
Here, N is the number of scatterers and 7k, ~'k, and ak are respectively the (random) delay, Doppler frequency, and reflectivity of the kth scatterer. The above relation can be extended to a continuum of scatterers (corresponding to a continuum of delays T and Doppler frequency shifts u) as
r(t) =
O(D
O0
S(H~) (u, T) S(~)(t)dT du V~T
(9.5.2)
-- s ( t - 7)e j27rut ej27rvr(c~-1/2) where c~ is a real-valued parameter 1 with ~,,r(t) ~(~) that is arbitrary but assumed fixed and S(H~) (u, T) denotes the generalized (delay~ G. M a t z and F. H l a w a t s c h , Institute of Communications and RadioFrequency Engineering, Vienna University of Technology, Gusshausstrasse 25/389, A-1040 Vienna, Austria (email: g.matz@ieee.org, fhlawats@pop.tuwien.ac.at, web: h t t p : / / w w w . n t . t u w i e n . a c . a t / d s p g r o u p / t i m e . h t m l ) . Reviewers: P. Flandrin and A. Sayeed. 1The parameter c~ in ~u,r ^(~) (t) corresponds to the infinitely many ways of defining a joint T F shift by combining time shifts and frequency shifts. In particular, the case c~ = 1/:2 corresponds to first shifting in time and then shifting in frequency, whereas c~ = - 1 / 2 corresponds to first shifting in frequency and then in time.
Time-Frequency Characterization of Random Time-Varying Channels
411
Doppler) spreading function (GSF) of the channel [1] (see also Article 4.7). The
GSF is defined as
~ / _ ~ h ( t +( 1~ - )a ( 7,1t -
S(a)(
- -
o o
-~ + a ) T ) e _j2~tdt.
(9.5.3)
It can be shown that the input-output relation (9.5.2) is mathematically equivalent to (9.5.1). In what follows, we will also use the generalized Weyl symbol (GWS)
L (~)(t,f) a_
h t + - ~ - ~ T,t-- -~+a T e-J2~frdT
(9.5.4)
OO
ITJ?
S(H'~)(u, T) ej2€
=
(DO
dv du.
(9.5.5)
O0
The GWS can be interpreted (with certain precautions, see Article 4.7) as a "TF transfer function" of H. In the mobile communications literature, the parameter a is usually chosen as 1//2. In this case, (9.5.3)and (9.5.4) become
S~/2) (u, T) --
/?
[z(t, T)e -j2~t dt ,
L~/2) (t, f) -
O0
9.5.2
f?
[z(t, T) e -j2€
dT .
OO
WSSUS Channels
Since the channel H is random, its GSF S(a)(u, r), GWS L H (~) (t, f), and impulse response h(t, T) are 2-D random functions (random processes). Hereafter, these random processes will be assumed zero-mean. The second-order statistics of H are characterized by the 4-D correlation functions E{ S (a) (u, T) S(~)*H(pt, 7.r } , E { L ~ ) (t, f) L(H~1. (t', f')}, and E{/~(t, 7)/~* (t', T')}, which are all mathematically equivalent. Definition and description of WSSUS channels. An important simplification results from the assumption of wide-sense stationary uncorrelated scattering (WSSUS) [1,2, 4, 6, 7]. For WSSUS channels, by definition, the reflectivities of scatterers corresponding to paths with different delay or Doppler are uncorrelated. In terms of the GSF S(H~) (u, 7), this means E{
s . (..
+* ( . ' . ) }
-
(9.5.6)
i.e., S (a) (u, T) is a wide-sense white random process. The mean intensity function of this white random process, CH(U, ~-) _~ 0, is known as the scattering function
[1-8].
Together with the Fourier transform relation (9.5.5), the WSSUS relation (9.5.6) implies that E{L(a)(t,f) L(~)*(t',f')} - R H ( t - - t ' , f - - f ' ) , (9.5.7) with the TF correlation function [1-7]
412
Chapter 9:
_RH(/Nt, A f) -
FF oo
R a n d o m Processes and Noise Analysis
CH(/Y, T) ej2~(~At-r
dT d~.
(9.5.8)
oo
The TF correlation function satisfies IRH(At, Af)] _< RH(0, 0) and R ~ ( - A t , - A
f)
= RH(At, Af). Eq. (9.5.7) shows that the GWS L(H~)(t,f) of a WSSUS channel is a 2-D wide-sense stationary process. According to (9.5.8), the scattering function CH(U, 7) is the Fourier transform of the correlation function RH(At, A f) of L(i_i~)(t,f). Thus, CH(~', T) can be interpreted as the power spectral density of L (c~)(t, f). The path loss [31 is defined as p~ A R H ( 0 , 0 ) -
E{lL(Ha)(t,f)l 2} -
f?/? oo
CH(.,T) dTd..
oo
Finally, in terms of the impulse response h(t,T), the WSSUS property is expressed as E{h(t,T) h*(t',T')} = rH(t--t',T) 5(~'--~"), where rH(At, T) is related by Fourier transforms to CH(V,T) and RH(At, Af). Thus, h(t, 7-) is wide-sense stationary with respect to time t and uncorrelated for different delays 7. Note that this stationarity with respect to t refers to the secondorder statistics of the channel and does not imply that the channel's realizations are time-invariant systems (cf. the first of the examples given below). The 2-D functions CH(/],T), RH(At, Af), and rH(At, T) are mathematically equivalent descriptions of the second-order statistics of a WSSUS channel. They are related by Fourier transforms and do not dependent on a. The composition (series connection) H2H1 of two statistically independent WSSUS channels H1 and H2 can be shown to be again a WSSUS channel. Its scattering function and TF correlation function are given by -
(c..
RH2H1 (At, A f) -- RH2 (At, A f) RH1 (At, A f ) , where ** denotes 2-D convolution. Note that H2H1 and HIH2 have the same second-order statistics. From the 2-D functions CH(~, T) and RH(At, A f), several 1-D channel descriptions can be derived. In particular, the delay power profile and Doppler power profile are respectively defined as [3] PH(T) -~
F
CH (L', T)dL',
QH(~) A
F
CH(~, 7")dT.
co
oo
Their Fourier transforms, pH(Af) A
F F
PH(T) e -j2€
dT -- RH(0, A f ) ,
oo
qH(At) a__
oo
QH(/]) ej2~vAt d~'
--
RH(At, 0),
Time-Frequency Characterization of Random Time- Varying Channels
413
are known as time correlation function and frequency correlation function, respectively. Often, for the sake of simplicity, a separable model is assumed for the scattering function and the TF correlation function, i.e., CH(//, 7.) -- pl--kQg(//) PH(7-) and RH(At, A f)
-
-
9--~nq H ( A t ) p H ( A f ) .
Examples of WSSUS channels. In the following, we briefly present some special cases and important examples of WSSUS channels. 9 Time-invariant W S S U S channel. The impulse response of a time-invariant channel has the form h(t, 7.) - g(7.). The WSSUS property here implies E{g(7.) g*(7')} - PH(7)5(7. -- 7.'). It follows that CH(//, 7.) -- (~(//) PH(T)
~nd Rn(/xt, A f) - p n ( A f ) . 9 Frequency-invariant W S S U S channel. Next, we consider a "frequency-invariant" channel with impulse response h(t, 7.) - w(t)5(7.), i.e., the input signal is simply multiplied by w(t). With W(//) denoting the Fourier transform of w(t), the WSSUS property here implies E{W(//)W*(//')} - Q H ( / / ) 5 ( / / - / / ' ) . It tbllows that CH(//, 7-) QH(V)5(7.) and RH(At, A f) -- qH(At). -
-
9 R a n d o m T F shift. The GSF of a channel effecting a random frequency shift by//0 and a random time shift by 7.0 is given by S(~)(//, 7.) - 5(//-//0)5(7.-7.o). It can be shown that this channel is WSSUS with scattering function Cg(v, 7.) -- f~o,To(v, 7.), where f~o,To(//, 7.) is the joint probability density function of' (v0, 7.0) [9]. Furthermore, R H ( A t , A f ) - @~o,~o(At,-Af), with ~-o,~o (At, A f) - f-~o~ f - ~ f,o,~o (u, 7.) e j 2 7 r ( v A t + r A f ) du dr being the characteristic function of (u0, 7.0). 9 Typical mobile radio channel. A channel model popular in the mobile radio literature [3] uses a separable scattering function CH(//, 7) -- -~HHQH(//)PH(7) with an exponential delay power profile ( ..2 _ > 0, ( O,
r
(0.5.0)
and a Jakes Doppler power profile QH(V) --
P~ zrv///2max_ / / 2 ' 0,
I~1
//max, I~l > ~m~x<
(9.5.10)
The associated TF correlation function is RH(At, A f) -- ~-~Hq g ( A t ) p H ( A f ) , with p n ( A f ) --
p~ 1 + j27rToAf'
qg(At) -- p~ J0(27r//maxAt),
where Jo(') denotes the zero-order Bessel function of the first kind.
Chapter 9:
414
Random Processes and Noise Analysis
Statistical input-output relations for WSSUS channels. The scattering function
and TF correlation function are useful for formulating input-output relations that show how the second-order statistics of the channel output signal r(t) depend on the second-order statistics of the input signal s(t). Let s(t) be a nonstationary random process that is statistically independent of the random channel H. The second-order statistics of a nonstationary random process x(t) with correlation operato~ Rx can be described by the generalized Wigner-Ville spectrum W(a)(t ' f) ~- L(~)(t, f) or, l:tx alternatively, by the generalized expected ambiguity function A(~) (v, T) ~- S R(~) x x (v, T) which is the 2-D Fourier transform of W(a) (t, f) (see Article 9.4). It can then be shown that
/?/?
W(") (t, f) -
co
CH(V, T) ~ ( " ) ( t - - T , f--v) dT dr,
(9.5.11)
co
(9.5.12)
-
For a (wide-sense) stationary transmit signal s(t) with power spectral density Ps(f) and autocorrelation rs(T), the received signal r(t) is stationary as well and (9.5.11) and (9.5.12)reduce to
P~(f) -
F
QH(V) P~(f - v ) d r ,
=
oo
Furthermore, E{Ir(t)l 2} - p~ E{Is(t)12}. Dual results are obtained for a nonstationary white transmit signal s(t). Finally, if s(t) is cyclostationary with period T, cyclic correlation function r~k)(7-), and cyclic spectral density p(k) (f) [10], the received signal r(t)is cyclostationary with the same period T and we have
p(k) (f) _
F
Q(Hk) (~) p(k)(f _ ~) d~,
=
oo
where q(Hk) (At) -- RH(At, k/T) and Q(I_I k) (v) - f _ ~ q(Hk) (At) e -j2~At dAt.
9.5.3
Underspread WSSUS Channels
A fundamental classification of WSSUS channels is into underspread and overspread channels [2,4, 7]. As we will show in this section, underspread WSSUS channels have some interesting properties. We note that the underspread property for WSSUS random channels is analogous to the underspread property for deterministic timevarying systems that was considered in Article 4.7.
Definition of underspread channels. A WSSUS channel is underspread [4, 7] if its scattering function is highly concentrated about the origin. 2 The underspread property is practically relevant as most mobile radio channels are underspread. 2For simplicity, we a s s u m e t h a t the s c a t t e r i n g functi on is c e n t e r e d a b o u t T -- 0, which m e a n s t h a t an overall delay ~'0 > 0 has been split off from t h e channel.
415
Time-Frequency Characterization of Random Time- Varying Channels
A simple method for quantifying the concentration of the scattering function CH(v,r) is based on the assumption that the support of CH(V,T) is contained within a rectangle [-Vmax,/-'max] X [--Tm~x, Tm~x] about the origin of the (u, "1) plane. (This implies that, with probability one, the GSF S (~) (u, 7-) is supported within this rectangle as well.) The channel's delay-Doppler spread is then defined as the area of this rectangle, aH -~ 4/JmaxTmax, and the channel is said to be underspread if aH _< 1 and overspread if aH > 1 [4, 7]. An alternative characterization of scattering function concentration that avoids the assumption of compact support uses normalized weighted integrals of the form
1FF
P~I
oo
;/; oo
T) CH (u, T)du dT =
cr r
T) CH (u, T) du dT
r oo
; / ? CH(/],T) du dT oo
oo
where r T) _> 0 is a weighting function that satisfies r T) >_ r 0) = 0 and penalizes scattering function components lying away from the origin. Special cases are the moments ~(Hk'z) -~ ~(Hr obtained with the weighting functions Ck,Z(u, T) = ]~l l Irl k with k, l C No. Of particular importance are the delay spread ~'H and Doppler spread VH that are obtained with k - 2, 1 = 0 and k - 0, l - 2, respectively: 7~ a _(2,0) =
1
u~ ~- ~(~,2) =
1
-
m.
/5 //5 ~ ~
00
Te
cH(-,
u2
T) du dT T) du dr
,
(9.5 13) (9.5.14)
Within this framework, a WSSUS channel is called underspread if specific weighted integrals and moments of the scattering function are small.
Approximate eigenfunctions and eigenvalues of underspread channels. It is known [2, 4, 7] that signals with good time and/or frequency concentration can pass an underspread WSSUS channel almost undistorted, i.e., merely multiplied by a random complex factor. We will analyze this effect using the approach in [11]. We note that similar results in a deterministic context are reported in Articles 4.7 and 13.3. A normalized transmit signal s(t) that remains undistorted, i.e., (Hs)(t) = As(t), is an eigenfunction of the system H; the associated eigenvalue is given by = (Hs, s). Since H is random, the relation (Hs)(t) = (Hs, s} s(t)is more appropriately formulated in the mean-square sense, i.e.,
E{IrH -
/ 112} = 0.
The eigenfunctions of a WSSUS channel H are random and generally do not possess a specific structure. However, in the underspread case, TF translates of a function
416
Chapter g:
Random Processes and Noise Analysis
g(t) with good TF concentration are approximate eigenfunctions. Specifically, consider the TF translates
gto,fo (t) - g(t-to) j2,:ot , where g(t) is a normalized function that is well concentrated about the origin of the TF plane. One can show
E{ IlHgo,fo
-
(Hgto,:o, gto,:o} gto,yo II=}
(Iir ,
(9.5.15)
T) - 1 - IA(~) (u, T)I 2 where A (~) (u, T) -- f _ ~ g(t + ( 89--a)T) g* (t -- (1 + a)T) e -j2€ dt denotes the generalized ambiguity function of g(t). Therefore, if the channel is underspread, i.e., the channel's scattering function is concentrated about the origin (where IA(~) (u, T)I 2 ~ IA(~) (0, 0)12 - 1 and thus r T) ~ 0), the weighted integral ~ ) will be small and one has the approximation (valid in the mean-square sense) with r
(9.5.16)
(Hgto,fo)(t) ,,~ (Hgto,yo, gto,:o) gto,:o(t) .
This shows that gto,fo (t) is an approximate eigenfunction of H. Furthermore, it can be shown that
E{l(Hg o,fo,g o,fo)- L(H )(t0, f0)l with r have
<
r ,
(9.5.17)
w) - I1 - A(a)(u, T)I 2. Thus, under the same conditions as before, we (Hgto,:o, gto,:o) ,~ L(")(to, fo)
(9.5.18)
(again valid in the mean-square sense), which shows that the approximate eigenvalue (Hgto,fo, gto,:o) is approximately equal to the GWS at the TF point (to, fo). In contrast to the exact eigenfunctions of H, the approximate eigenfunctions gto,:o (t) are TF translates of a single prototype function g(t) and thus highly structured; they do not depend on the specific channel realization and their parameters to, f0 have an immediate physical interpretation. To illustrate the above eigenfunction/eigenvalue approximations, we simulated the transmission of a signal gto,:o (t), with g(t) a Hanning window of duration Tg = 128 #s, over a WSSUS channel. The channel's scattering function was CH(U, T) -1 QH(U) PH(T) with exponential PH(T) (nq. (9.5.9) with TO -- 1 #S) and Jakes-type QH(U) (Eq. (9.5.10) with Um~x - 305 Hz). Fig. 9.5.1 illustrates the approximations (9.5.16) and (9.5.18) for a single channel realization. It is seen that the received signal (Hgto,:o)(t) and the approximation L~/2) (to, f0) gto,:o (t) are practically identical. Furthermore, we used 500 realizations of H to estimate the normalized meansquare error E{llHgto,f o -(Hgto,fo,gto,fo} gto,:oll2}/P~-i (see (9.5.15)). The result, 9 . 1 0 -4, confirms the validity of the eigenfunction/eigenvalue approximation. The associated upper bound ~(r (see (9.5.15)) was calculated as 5 . 10 -3 Finally, ""H
Time-Frequency Characterization of Random Time-Varying Channels
417
Fig. 9.5.1" Eigenfunction/eigenvalue approximation for an underspread WSSUS channel: (a) Wigner distribution [12] (top) and real and imaginary parts (bottom) of received signal r(t) = (Hgto,fo)(t), (b) Wigner distribution (top) and real and imaginary parts (bottom) of approximation
LH(1/2) (to, fo) gto,fo (t). the normalized mean-square error
E{l(Hg~o,fo,g~o,fo)
- L~/2)(to, fo)]2}/p~I
(9.5.17)) was estimated as 5 . 1 0 -6 and the associated upper bound ~(r culated as 2 . 1 0 -3.
(see
was cal-
Sampling approximation for underspread channels. Next, we consider 2-D sampling of the channel's transfer function (GWS) L(H~)(t,f). This is important for simplified channel representations that are used e.g. in the context of orthogonal frequency division multiplexing (OFDM)modulation [13,14]. Consider the representation of a WSSUS channel H by the samples L~ )(kT, 1F) of its GWS taken on the uniform sampling grid (kT, 1F). The reconstructed (interpolated) GWS is given by CX:)
Z(s~)(t, f ) --
OO
E
L(H")(kT'1F)sinc (~~ (t- kT))sinc( -7rf ( f - 1F)) ,
k=-e~l=-~x~
with sinc(x) - sin(x)/x. For WSSUS channels with scattering function CH(U,T) compactly supported within a rectangular area [-Umax, Umax] • [--Tmax, Tmax] and for sampling grid constants satisfying T < 1/(2Um~x) and F < 1/(27m~x), the above reconstruction can be shown to be exact in the sense that E{IL(H~)(t, f ) L(H~) (t, f)]2} _ 0. Note that a smaller channel spread aH -- 4Umax~-m~x allows for a coarser sampling grid and thus for a more parsimonious channel representation. If the above conditions are not satisfied, the reconstructed GWS L(H~) (t, f) will
Chapter 9:
418
Random Processes and Noise Analysis
Fig. 9.5.2: Sampling approximation for an underspread WSSUS channel: (a)IL~/2)(t, f)l 2 and (b) IL~/2) (t, f)l 2 (in dB; time in ms, frequency in MHz).
contain errors due to aliasing. bounded as
However, it can be shown that these errors are
E{ ]L(H")(t, f ) L-( " )H (t,f)]
2 } <_ 2p~ (T~F 2 + u2T2),
(9.5.19)
where TH and /]H are the delay spread and Doppler spread as defined in (9.5.13), (9.5.14). Thus, for WSSUS channels with small TH and/or small UH, i.e., for underspread channels, a sampling of the transfer function will result in negligible errors provided that the sampling periods T and F are chosen appropriately. Specifically, the upper error bound in (9.5.19) is minimized when T / F = TH/VH. For the WSSUS channel with exponential/Jakes scattering function that was considered further above, and for sampling periods T - 138 #s, F = 136.72 kHz, the normalized mean-square error E{]L~/2) (t, f)-L~_i/2) (t, f)l 2 }/P~-I was estimated from 500 channel realizations as 6.4.10 -3, and the upper bound 2 (T~F 2 + u~T 2) was calculated as 3.2.10 -2 . Fig. 9.5.2 shows the squared magnitude of the true t , H f) and of the reconstruction L~/2)(t, f) for a channel transfer function L(1/2)( specific channel realization.
9.5.4
Summary and Conclusions
In this article, we have considered time-frequency characterizations of (the secondorder statistics of) random linear time-varying channels satisfying the assumption of wide-sense stationary uncorrelated scattering (WSSUS). We have shown that the practically important class of underspread WSSUS channels allows some interesting approximations. In particular, underspread WSSUS channels possess approximate eigenfunctions with time-frequency shift structure (which suggests the use of OFDM), and they can be discretized by means of a time-frequency sampling. Related considerations and results can be found in Articles 13.2 and 13.3.
Time-Frequency Characterization of Random Time- Varying Channels
419
References [1] P. A. Bello, "Characterization of randomly time-variant linear channels," IEEE Trans. Communication Systems, vol. 11, pp. 360-393, December 1963. [2] R. S. Kennedy, Fading dispersive communication channels. New York: Wiley, 1969. [3] J. D. Parsons, The Mobile Radio Propagation Channel. London: Pentech Press, 1992. [4] J. G. Proakis, Digital Communications. New York: McGraw-Hill, 3rd ed., 1995. [5] T. S. Rappaport, Wireless Communications: Principles ~ Practice. Upper Saddle River, N J: Prentice-Hall, 1996. [6] K. A. Sostrand, "Mathematics of the time-varying channel," in Proc. NATO Advanced Study Inst. on Signal Processing with Emphasis on Underwater Acoustics, vol. 2, pp. 25.125.20, 1968. [7] H. L. L. Van Trees, Detection, Estimation, and Modulation Theory, vol. III: "RadarSonar Signal Processing and Gaussian Signals in Noise". New York: Wiley, 1971. Reprinted Malabar, FL: Krieger, 1992. Reprinted New York: Wiley, 2001. [8] H. Art,s, G. Matz, and F. Hlawatsch, "Unbiased scattering function estimation during data transmission," in Proc. IEEE Vehicular Technology Conf. (VTC'99-Fall), pp. 15351539, Amsterdam, 19-22 September 1999. [9] P. Flandrin, Time-Frequency/Time-Scale Analysis. San Diego: Academic Press, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [10] W. A. Gardner, ed., Cyclostationarity in Communications and Signal Processing. Piscataway, N J: IEEE Press, 1995. [11] W. Kozek and A. F. Molisch, "On the eigenstructure of underspread WSSUS channels," in Proc. IEEE-SP Workshop on Signal Processing Advances in Wireless Communications (SPA WC'97), pp. 325-328, Paris, 16-18 April 1997. [12] W. Mecklenbr~uker and F. Hlawatsch, eds., The Wigner Distribution--Theory and Applications in Signal Processing. Amsterdam: Elsevier, 1997. [13] J. A. C. Bingham, "Multicarrier modulation for data transmission: An idea whose time has come," IEEE Communications Magazine, vol. 28, pp. 5-14, May 1990. [14] W. Kozek and A. F. Molisch, "Nonorthogonal pulseshapes for multicarrier communications in doubly dispersive channels," IEEE J. on Selected Areas in Communications, vol. 16, pp. 1579-1589, October 1998.
This Page Intentionally Left Blank
Chapter 10
Instantaneous Frequency Estimation and Localization A critical feature of a non-stationary signal is provided by its instantaneous frequency (IF), which accounts for the signal spectral variations as a function of time. This chapter presents methods and algorithms for the localization and estimation of the signal IF. The topic is covered in five articles with appropriate internal crossreferencing to this and other chapters. The first conventional approach for IF estimation used the spectrogram. To account for its window-dependent resolution, improvements were made by introducing iterative methodologies on the estimate provided by the first moment of the spectrogram (Article 10.1). Another approach uses an adaptive algorithm for IF estimation using the peak of suitable TFDs with adaptive window length (10.2). This method was extended to the case of multicomponent signals using the modified B-distribution (10.3). When the signals considered have polynomial FM characteristics, both the peak of the polynomial WVD and higher-order ambiguity functions can be used (10.4). In the special case when the signals are subject to random amplitude modulation (or multiplicative noise), IF estimation procedures are described using the peak of the WVD for linear FM signals, and the peak of the PWVD for non-linear FM signals (10.5).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
421
422
10.1 10.1.1
Chapter 10:
Instantaneous Frequency Estimation and Localization
ITERATIVE INSTANTANEOUS FREQUENCY ESTIMATION FOR R A N D O M SIGNALS 0 IF Estimation: Introduction and Background
The instantaneous frequency (IF) is a basic parameter which may be used to describe the nonstationarity in a process (see Article 1.3). It is used in many areas such as seismic, radar, sonar, communications and biomedical applications [1-3]. The concept of IF is often introduced in the context of frequency modulation in communication theory. The IF of a signal at time to could be viewed as the frequency of the sinusoid which locally fits the signal at that time. Using the definition of Gabor and later Ville [4], the IF, f(t), of a signal x(t) can be defined in terms of the derivative of the phase of its analytic signal z(t); i.e., 1 d arg(z(t)).
f (t) = 2~ dt
(10.1.1)
Another interpretation of the IF comes from the time-frequency distribution (TFD) point of view, where the IF of a signal at time t is defined as the weighted average of the frequencies which exist in the signal at time t [5]; i.e.,
](t) = f - ~ f P(t, f ) d f f P (t , )-)
(lo.1.2)
where/5(t, f) is the TFD estimate. The IF has many applications in time-frequency signal analysis. For example a matched spectrogram, which uses chirp windows, is shown to perform better than a regular spectrogram for chirp-like signals when the chirp rate of the window is matched to the IF of the signal [6]. The IF can also be utilized to obtain distributions mainly concentrated along the IF for monocomponent signals. For example, [7] shows how to obtain distributions infinitely concentrated along the IF for pure tones and for amplitude- and frequency-modulated signals, and considers kernel functions for bilinear distributions with these properties. There are many ways to estimate the IF of a signal. The current IF estimation algorithms can be grouped as phase differencing methods, signal modeling methods (e.g., short time AR modeling, time-varying AR modeling), phase modeling methods, and time-frequency-representation methods. A good review of these methods can be found in [8]. One important time-frequency-representation method is the iterative cross Wigner-Ville IF estimation technique [8], in which, at each iteration, we construct a constant-amplitude FM signal whose IF matches the previous IF estimate, then take the cross Wigner-Ville distribution between the constructed signal and the ~ A m r o E I - J a r o u d i and M u s t a f a K. E m r e s o y , 348 Benedum Engineering Hall, Dept. of Electrical Engineering, University of Pittsburgh, Pittsburgh, PA, 15261, USA (amro@ee.pitt.edu, emresoy@siglab.ee.pitt.edu). Reviewers: M. Mesbah and G. R. Putland.
Iterative Instantaneous Frequency Estimation for Random Signals
423
signal under analysis, and then use the cross-WVD to produce a new IF estimate. This method is shown to perform very well at low signal-to-noise ratios (cf. Subsection 3.1.2.3, especially at item 2). In the next two Sections, we present an iterative algorithm to estimate the IF and matched spectrogram of a nonstationary sinusoidal signal, and then analyze the convergence of the algorithm. Our algorithm is similar to the cross Wigner-Ville algorithm in that we use a time-frequency representation iteratively to obtain an estimate of the IF. The iterative IF estimation method can be used to improve the performance of many time-frequency methods [9,10].
10.1.2
Iterative Algorithm for IF Estimation
The basic idea behind the iterative IF estimation is to use a time-frequency estimator (in this case, the spectrogram) to obtain an IF estimate using (10.1.2) above, then use this IF estimate to recalculate the spectrogram. This process is repeated until convergence is reached. The principle at work here is that improving the IF estimate makes the matched spectrogram estimate better and vice versa. Consequently each iteration will lead to improvements in both quantities. We will assume that the signal under analysis is a complex signal of the form
x(t) - A(t) ejr
(10.1.3)
where A(t) is the time-varying amplitude and r is the phase of the signal. In the iterative algorithm given below, the instantaneous frequency and the spectrogram as a TFD are estimated by using the whole frequency axis. Then the signal is "demodulated" by subtracting the phase estimate (obtained by integrating the IF estimate) from the phase of the signal. This demodulation shifts the spectrum of the nonstationary signal around zero frequency. The demodulated signal is then reanalyzed. Consequently we do not restrict the definition of the IF to use only positive frequencies, since after the first iteration the signal under analysis has non-redundant information over positive and negative frequencies. But when the algorithm is applied to real signals, we use the analytic signal to start the iterations, ensuring that the first IF estimate is not trivially zero. Note that although we use the analytic signal at first, the demodulated signal used in the subsequent iteration is not analytic. The outputs of the algorithm are the estimated IF and the matched spectrogram of the signal.
Algorithm: 1. Calculate the spectrogram/5(t, f) of the signal (or of the analytic component if the signal is real). 2. Estimate fi(t) by using the first moment of the spectrogram at each time and find the phase estimate
f-oof
i(t,f) df
(10.1.4)
Chapter 10:
424
InstantaneousFrequency Estimation and Localization
(10.1.5)
(X)
3. Demodulate the signal along the estimated IF
2i(t) - A(t) e -j(r
= A(t) e -j$~(t).
(10.1.6)
4. Take the spectrogram Pi(t, f) of 2(t) and compensate for the demodulation to obtain a new matched spectrogram estimate /)i+1 (t, f) --/5i (t, f -- J?i(t)).
(10.1.7)
5. Go to step 2. The algorithm is stopped once the IF estimate has converged. The convergence can be checked by comparing the difference between consecutive iterations with stopping threshold. While the algorithm seems very simple, we show below that its convergence can be guaranteed given certain conditions on the analysis signal.
10.1.3
Convergence of the Estimation Algorithm
In order to examine the convergence of the iterative algorithm, we derive an expression for the IF estimate after each iteration. The IF estimate of the complex signal in (10.1.3) obtained by using the first moment of a spectrogram with a real analysis window, h(t), is given by [5]
1 ~ ](t) - ~
Mk(t) r k!
(t)
(10.1.8)
Mn(t) - f A2(t+7) h2(T)TndT f A 2(t + T) h 2(T) dT
(10.1.9)
k=0
where
and r
is the ( n + l ) TM derivative of the phase of the signal, with
Mo(t) O!
=
1.
(I0.I.i0)
Our hypothesis is that the general expression for the IF estimate at the end of the n TM iteration is
in(t) -- f(t) Jr ( - 1 ) (n+l) 1 ~-~. ~ . ~ 1 7 6oo Mi~(t) {. { Millt ) r in=l
i1"-1
"
)
il.
(10.1.11)
425
Iterative Instantaneous Frequency Estimation for Random Signals
where f (t) - ~1r (t). We prove the hypothesis using the method of induction. To show that the equation (10.1.11) is true for the first iteration, we remove the summations i2 to in and substitute n - 1 in equation (10.1.11), obtaining
1 k Mi~ (t) i1! i1:1
L (t) -- f(t)+ ~
r
+l) (t).
(10.1.12)
This is the same as equation (10.1.8); i.e., equation (10.1.11) is true for n - 1. For the inductive step, we assume that equation (10.1.11) is true for iteration ( n - 1), so that j~n-l(t) --
f(t)+
(_1) n ~1 ~ ~
"" Zc~
i,,_1 = 1
Mi,,_l(t)in__Tii {'''{ Mi~It)il.r
i 1= 1
(10.1.13)
and show that it is also true for the n th iteration. Following the algorithm given in the previous section, the phase error r at the n TM iteration is given by
~n (t)
--
r
--
2fT
i oo
(10.1.14)
L--1 (T) dT.
By combining the previous two equations, we obtain
&(t)-
i,,,,-1 :
1
...
9 :
oo
in-Tii
{{
il.
(10.1.15)
The IF estimate at the end of n TM iteration can be found by adding the IF estimate from iteration n - 1 and the IF estimate of the signal ~(t) whose phase is given by (10.1.15); that is,
]n(t)-
f n - l ( t ) + ~--~
in=0
in!
(10.1.16)
Substituting for ] n - l ( t ) and Cn(t) in the above equation, we find that the IF estimate at the end of the n TM iteration is
fn(t)
- f(t) +
(_1) n ~l Ec~ ""Ec~ Min_l(t)~n_l! {'''{ Millt)il. in_l=1 i1=1
-J-(-- 1) (n+l) in=0E"'"i1=1~Min(t)in! it___(x) {...{ Milil.T)r
r
..}(in-~-l)dT
which simplifies to equation (10.1.11), as claimed. This completes the proof.
426
Chapter 10:
Instantaneous Frequency Estimation and Localization
In equation (10.1.11), the first term represents the actual IF whereas the second term represents the error in the IF estimate at the end of the nth iteration. In general the error term is very difficult to analyze, being affected by both the amplitude and phase variations in the signal. Assuming, however, that the spectrogram window is Gaussian, and that the phase function of the signal has bounded derivatives of all orders, and that the signal is dominantly FM modulated (i.e. has constant or slowly varying amplitude), we may show that the iteration in (10.1.11) converges to the true IF of the signal. Let the spectrogram window be
h(t)
-
(~)1/4 e - a t 2 ~ 2 ,
(10.1.17)
and let the amplitude A(t) be approximated by the constant A within the window, so that x(t) = A ejr (10.1.18) Making these two substitutions in (10.1.9), we find
Mn =
-
(I+(-I) n) 7{, n+l 2 )
{o
l--in~2 (n+(2k- 1))V~
-
1lk-0
c~,~/2
(10.1.19) if n is odd if n is even.
( 0.1.20)
Then, from (10.1.11), the general expression for the IF estimate is
fn(t) -- f ( t ) + ( - 1 ) n 2--~1
E~176
il ,i2,... ,in =2
(t).
i1! i2!..in!
(10.1.21)
If we define Cm ~
E
il!i2!...in!
il q-i2-}-...+in = m
'
(10.1.22)
then (10.1.21) can be written as oo
A ( t ) - f ( t ) + ( - 1 ) n E Cmr 2~" m=2
(10.1.23)
It is important to note that {Cm} are independent of the analysis signal and are mainly a function of the window parameter a. It is easy to show that after each iteration, two coefficients of the coefficient series {Cm} become zero and overall the coefficients decay rapidly to zero. It is clear that, as n --. co, the coefficients go to zero. If we assume that the phase of the signal is continuous and has bounded derivatives, then the sum in equation (10.1.23) also goes to zero which implies
]n (t) --~ f (t).
Iterative Instantaneous Frequency Estimation for Random Signals
427
Convergence properties: 9 Equation (10.1.20) implies that the convergence rate of the algorithm is proportional to an/2. That is, for small a (long window) the convergence is slow whereas for large a (short window) the convergence is very fast. 9 The convergence rate is also affected by the rate of change of the IF. For signals whose IF's have nonzero higher order derivatives, the convergence is slower. But for signals with polynomial phases, the algorithm theoretically converges with an iteration number equal to half of the degree of the polynomial. This can be seen by examining equation (10.1.21). At iteration n the first 2n coefficients in equation (10.1.23) become zero. 9 If the phase of the signal has discontinuities at some points ti, the derivatives of r go to +c~. Then we cannot expect the algorithm to converge exactly to the f(t). If r has bounded derivatives for all orders and for all t, then the algorithm converges to f(t).
10.1.4
Summary and Conclusions
In this article, we presented an iterative algorithm to estimate the instantaneous frequency (IF) and matched spectrogram of nonstationary sinusoidal signals. The matched spectrogram obtained by this method is concentrated along the IF for monocomponent signals. The convergence analysis and the properties of the IF estimation algorithm are presented.
References [1] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal Part 1: Fundamentals," Proc. IEEE, vol. 80, pp. 520-538, April 1992. [2] B. Ferguson, "A ground based narrow-band passive acoustic technique for estimating the altitude and speed of a propeller driven aircraft," J. Acoustical Soc. of America, vol. 92, pp. 1403-1407, September 1992. [3] M. T. Taner, F. Koehler, and R. E. Sheriff, "Complex seismic trace analysis," Geophysics, vol. 44, pp. 1041-1063, June 1979. [4] J. Ville, "Th@orie et applications de la notion de signal analytique," Cables et Transmissions, vol. 2A, no. 1, pp. 61-74, 1948. In French. English translation: I. Selin, Theory and applications of the notion of complex signal, Rand Corporation Report T-92 (Santa Monica, CA, August 1958). [5] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [6] D. L. Jones and T. W. Parks, "A high-resolution data-adaptive time-frequency representation," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 38, pp. 2127-2135, December 1990. [7] L. Cohen, "Distributions concentrated along the instantaneous frequency," in Proc. SPIE: Advanced Signal-Processing Algorithms, Architectures, and Implementations,
428
Chapter 10:
InstantaneousFrequency Estimation and Localization
vol. 1348, pp. 149-157, Soc. of Photo-optical Instrumentation Engineers, San Diego, 10-12 July 1990. [8] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal m Part 2: Algorithms and applications," Proc. IEEE, vol. 80, pp. 540-568, April 1992. [9] M. K. Emresoy and A. E1-Jaroudi, "Iterative instantaneous frequency estimation and adaptive matched spectrogram," Signal Processing, vol. 64, pp. 157-65, January 1998. [10] D. L. Jones and R. G. Baraniuk, "A simple scheme for adapting time-frequency representations," IEEE Trans. Signal Processing, vol. 42, pp. 3530-3535, December 1994.
429
Adaptive Instantaneous Frequency Estimation Using TFDs
10.2
ADAPTIVE INSTANTANEOUS FREQUENCY ESTIMATION USING TFDs~
Instantaneous frequency (IF) estimators based on maxima of time-frequency representations have variance and bias which are highly dependent on the lag window width. The optimal window width may be determined by minimizing the estimation mean squared error (MSE), provided that some signal and noise parameters are explicitly known. However, these parameters are not available in advance. This is especially true for the IF derivatives which determine the estimation bias. In this article, an adaptive algorithm for the lag window width determination, based on the confidence intervals intersection, will be presented [1-5]. This algorithm does not require knowledge of the estimation bias value. The theory and algorithm presented here are not limited to the IF estimation and time-frequency analysis. They may be applied to a parameter value selection in various problems.
10.2.1
Optimal Window Width
Consider a noisy signal"
x ( n A t ) - s ( n A t ) + e(nAt), s(t) - a e x p ( j r
(10.2.1)
with s ( n A t ) being a signal and e(nAt) being a white complex-valued Gaussian noise with mutually independent real and imaginary parts of equal variances a~2/2. Sampling interval is denoted by At. Consider the problem of the IF, fi(t) = r estimation from the discrete-time observations x ( n A t ) , based on maxima of a timefrequency distribution px(t, f), f(t) - arg{m~xpx(t, f)}.
(10.2.2)
Let A f ( t ) - f i ( t ) - f(t) be the estimation error. The MSE, E { ( A f ( t ) ) 2 } , is used for the accuracy characterization at a given time instant t. Asymptotically, the MSE for commonly used time-frequency representations (e.g. the spectrogram, the pseudo Wigner distribution (WD), and its higher order versions) can be expressed in the following form [2-6] [Articles 10.3, 10.4] E ~-( A f (~t )-) 2
V
~J - ~-~ +
B(t)h ~,
(10.2.3)
where h is a lag window Wh (t) width, such that wh(t) -- 0 for Itl > h/2. It is related to the number of samples N by h - N A t . The variance and the bias of estimate, for a given h, are
a2(h) - V / h m,
bias (t, h) = v / B ( t ) h n.
(10.2.4)
~ LJubi~a Stankovid, Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (1.stankovic@ieee.org). Reviewers: B. Barkat, I. Djurovi6 and V. Ivanovi6.
430
Chapter 10:
InstantaneousFrequency Estimation and Localization
The expression for B(t) is a function of the IF derivatives. For example, for the WD with a rectangular lag window we have [3] 6a 2At 1 ~_ ( r E j""l,],(Aj(t))2 80~ ,~ S _ (2~a) 2 h 3
2 ha'
(10.2.5)
corresponding to m = 3 and n = 4 in (10.2.3). Values of m and n for some other distributions are indicated in Table 10.2.1, according to the results from [2-5]. The MSE in (10.2.3) has a minimum with respect to h. This minimum occurs for the optimal value of h given by
hopt(t)- [mV/(nB(t))] 1/(m+~).
(10.2.6)
Note that this relation is not useful in practice, because its right hand-side contains
B(t) which depends on derivatives of the unknown IF.
10.2.2 Adaptive Algorithm Here, we present an adaptive method which can produce an estimate of hopt(t) without having to know the value of B(t). For the optimal window width, according to (10.2.3), holds
Oh
= -m
h m+l
+ nB(t)h n-1 = Oih=hopt.
(10.2.7)
Multiplying (10.2.7) by h, we get the relationship between the bias and standard deviation, (10.2.4), for h - - hopt, bias (t, hopt )
=
~/ :-~a(hopt).
(10.2.8)
It will be assumed, without loss of generality, that the bias is positive. The IF estimate s (obtained from (10.2.2) by using the lag window of width h) is a random variable distributed around the true IF f~(t) with the bias bias (t, h) and the standard deviation a(h). Thus, we may write the relation: (10.2.9) where the inequality holds with probability P ( n ) depending on parameter n.1 We will assume that n is such that P ( a ) --, 1. 1If we assume, for example, that the random variable ]h(t) is Gaussian, with the mean value M - fi(t)+ bias(t, h)and the standard deviation a(h), then the probability that ]h(t) takes a value within the interval [ i - na(h), U + na(h)] is P(a) - 0.95 for a -- 2, and P(a) - 0.997 for
431
Adaptive Instantaneous Frequency Estimation Using TFDs
Let us introduce a set of discrete dyadic window-width values, h E H,
H=
{hs l hs = 2 h s - l , s = l, 2, ..., J } .
(10.2.10)
Define the confidence intervals Ds = [Ls, Us] of the IF estimates, with the following upper and lower bounds
Ls = fh~(t) - (t~ + A ~ ) a ( h s ) ,
Us - fh.~ (t) + (~ + A ~ ) a ( h s ) ,
(10.2.11)
where fh~(t) is an estimate of the IF, for the window width h - hs, and a(hs) is its standard deviation. Assume that a window width denoted by hs+ E H is of hopt order, hs+ ~ hopt. Since hopt does not correspond to any hs from the set H, for the analysis that follows we can write hs+ = 2Phopt, where p is a constant close to 0. According to (10.2.10) all other windows can be written as a function of hs+ as
hs - hs+2 (s-s+) = hopt2 s-s++p,
( s - s +) - . . . , - 2 , - 1 , 0 ,
1, 2, ....
(10.2.12)
With this notation, having in mind (10.2.8), the standard deviation and the bias from (10.2.4) can be expressed by
a(hs) -
v / V / h s m - a(hopt)2 -(s-s++p)m/2,
(10.2.13)
bias (t, hs) = v / B ( t ) h y - v / m / n a(hopt)2 (s-s++p)n/2. For small window widths hs, when s << s +, the bias of fh~ (t) is negligible, thus f~(t) e Ds (with probability P(~ + A~) ~ 1). Then, obviously, Ds-1 A Ds ~= O, since at least the true IF, fi(t), belongs to both confidence intervals. For s >> s + the variance is small, but the bias is large. It is clear that for bias (t, hs) ~ 0 there exists such a large s that D s N D s+l = 0 for a finite ~ + A~. The i d e a b e h i n d t h e a l g o r i t h m is that A~ in Ds can be found in such a way that the largest s, for which the sequence of the pairs of the confidence intervals D s-1 and D s has at least a point in common, is s = s +. Such a value of A~ exists because the bias and the variance are monotonically increasing and decreasing functions of h, respectively, (10.2.13). As soon as this value of At~ is found, an intersection of the confidence intervals Ds-1 and D s ,
[/h~_l(t) - fhs(t)] <_ (~ + A ~ ) [ a ( h s _ l ) + a(hs)],
(10.2.14)
works as an indicator of the event s = s +, i.e., the event hs = hs+ ~ hopt. The value of hs+ is the last hs when (10.2.14) is still satisfied.
10.2.2.1
Parameters in the Adaptive Algorithm
There are three possible approaches to choosing algorithm parameters t~, A~, and p. Their performance do not differ significantly. (1) When our knowledge about the variance and bias behavior, given by (10.2.3), is not quite reliable, an approximative approach for t~, A ~ , and p determination
Chapter 10:
432
InstantaneousFrequency Estimation and Localization
T a b l e 1 0 . 2 . 1 : Parameters in the adaptive algorithm for various m , n , m : m = 3, n = 4 for the spectrogram, Wigner and L-Wigner distribution based IF estimators; rn = 3, n = 8 for the fourth order polynomial Wigner-Ville distribution, and local polynomial distribution based IF estimators; m = 1, n -4 for the Wigner distribution as a spectrum estimator.
m
1
1
3
3
3
3
3
rt
4
4
4
4
4
8
8
3 8
t~
2
3
2
3
5
2
3
5 0.23
An
0.86
1.29
0.39
0.58
0.97
0.09
0.14
p
0.99
1.22
0.34
0.51
0.72
-0.13
-0.03
0.11
0.30
0.43
. ,
Pl
1.18
1.41
0.59
0.76
0.97
0.19
can be used. Then, we can assume a value of n --- 2.5, such that P(n) -~ 0.99 for Gaussian distribution of estimation error. The value of A~ should take into account the bias for the expected optimal window width (10.2.8). It is common to assume that, for the optimal value of h, the bias and variance are of the same order, resulting in Ate -~ 1. Then we can expect that the obtained value hs+ is close to hopt, thus p -~ 0, and all parameters for t h e key a l g o r i t h m e q u a t i o n (10.2.14) are defined. This simple heuristic form has been successfully used in [2] and [3], and it is highly recommended for most of practical applications. Estimation of the standard deviation cr(h~) will be discussed within the Numerical example. (2) When the knowledge about the variance and bias behavior is reliable, i.e., when (10.2.3) accurately describes estimation error, then we can calculate all algorithm parameters. According to the algorithm basic idea, only three confidence intervals, Ds+_I,D~+, and D~++I, should be considered. The confidence intervals Ds+_ 1 and D~+ s h o u l d have, while Ds+ and D~++I s h o u l d not have, at least one point in common. Assuming that relation (10.2.9) holds, and that the bias is positive, this condition means that the minimal possible value of upper D s + - i bound, (10.2.11), denoted by min{U~+_l}, is always greater than or equal to the maximal possible value of the lower D~+ bound, denoted by max{Ls+ }, i.e., min{U~+_l} > max{Ls+}. The condition that Ds+ and Ds++l do not intersect is given by max{Us+} < min{L~++l}. According to (10.2.9) and (10.2.11) the above analysis results in bias (hs+ -1) + A~a(hs+
- 1 ) --~
bias (hs+) - Ana(hs+ ),
bias ( h s + ) + (2n + An)a(hs+) < bias ( h s + + l ) - (2n + An)o(hs++l).(10.2.15) Since the inequalities are written for the worst case, we can calculate the algorithm parameters by using the corresponding equalities. With (10.2.13) we get A ~ - 2n/[2 (re+n)~2 - 1], 2p -
[ A t ~ v / n / m (2m/2 + 1 ) / / ( 1 -
(10.2.16) 2-n/2)] 2~(re+n) "
Adaptive Instantaneous Frequency Estimation Using TFDs
433
F i g . 10.2.1: Optimal window width (straight thick line), and adaptive window widths (end of the vertical lines, starting from the optimal window width line) for m = 3, n = 4, V - I. The variance to bias ratio V/B(t) is logarithmically varied. The adaptive width ha = hs+/2m is obtained from hs+, according to (10.2.14), after correction for the corresponding values of p l given in Table 10.2.1.
Values of the parameters
A s a n d p for v a r i o u s d i s t r i b u t i o n s ,
i.e., for v a r i o u s v a l u e s
o f m a n d n, a r e g i v e n in T a b l e 10.2.1. For further, and very fine t u n i n g o f t h e a l g o r i t h m p a r a m e t e r s , one may want that the adaptive window is unbiased in logarithmic, instead of in linear scale (due to definition (10.2.10)). The estimation bias and variance are exponential functions with respect to m and n, (10.2.13). Thus the confidence interval limits vary a s 2 (s-s+)(m+n)/2. The mean value for this exponential function, for two successive confidence intervals, for example ( s - s +) = 0 and ( s - s +) = 1, is (1 + 2(m+")/2)/2. It is shifted with respect to the geometrical mean x/2(m+n)/2 of these two intervals, by approximately Ap -~ [log 2 ((1 + 2(rn+n)/2)/2)] 2 1 resulting in the total logarithmic shift pl = P + Ap, presented in Table 10.2.1. Therefore the adaptive window width (as an estimate of the optimal window width) should be hopt = hs+/2 pl. Note that the set H of window widths h is a priori assumed. Therefore, as long as we can calculate P l , we can use it in the following ways: a) To calculate distribution with the new window width ha = hs+/2 pl a s the best estimate of hopt, b ) To remain within the assumed set of hs C H, and to decide only whether to correct the obtained hs+ or not. For example, if [Pl[ <_ 1/2 the correction is smaller than the window discretization step. Thus, we can use ha = hs+. For 1/2 < Pl <_ 3/2 it is better to use ha = hs+/2 = h~+_l, as the adaptive window width value. Fortunately, the loss of accuracy for the adaptive widths ha, as far as they are of hopt order, is not significant since the MSE varies slowly around its stationary point. Thus, in numerical implementations we can use only the lag windows from the given set H.
10.2.2.2
Illustration
We have simulated the IF estimates
f h ( t ) -- a v / V / h having the MSE unity-variance)
as a r a n d o m
m + v/B(t)h n + fi(t),
given by (10.2.3), where arandom
variable
A f ( 0 , 1) is a G a u s s i a n
(10.2.17) (zero-mean,
v a r i a b l e , m = 3, n = 4, a n d V = 1. F o r t h e t r u e I F v a l u e
Chapter 10:
434
Instantaneous Frequency Estimation and Localization
fi(t), at a given t, any constant can be assumed. The bias parameter B(t) in fh(t) logarithmically varies within ~l log2(mV/nB(t)) C [-4,3] with step 0.05. -For each value of parameter B(t) we have calculated optimal window width according to (10.2.6), and plotted log2(hopt ) as a thick line in Fig. 10.2.1. -The value of ]h (t) was simulated for each B(t) and hs c H. The assumed set of possible window widths was H = {1/16, 1/8, 1/4, 1/2, 1,2,4,8, 16,32}, and n = 2. The key algorithm relation (10.2.14) was tested each time, with the known standard deviation a(hs) = v/V/h m. The largest value of hs when the key equation (10.2.14) was still satisfied was denoted by hs+. Value An = 0.39, corresponding to m = 3, n = 4, n = 2, was used (Table 10.2.1). The adaptive values ha = hs+/2 pl , Pl = 0.59 (Table 10.2.1), produced in this way, are connected with the optimal window line, by thin vertical lines in Fig. 10.2.1. -The same simulation is repeated with n = 3 and n = 5. -We can conclude that the presented algorithm almost always chooses the width h~ from H which is the nearest to the optimal one. However, for relatively small n = 2 there are few complete misses of the optimal window width, since (10.2.9) is satisfied only with probability P(2) = 0.95. For n = 2, two successive confideuce intervals do not intersect when the bias is small, producing false result, with probability of 2(0.05) 9. ~ 10 -2 order. (3) The third approach for the parameter (n + An) estimation is based on the statistical nature of confidence intervals, and a posteriori check of the fitting quality [2]. This approach is beyond the scope of this article.
10.2.3
Numerical Example
In the example we assumed a signal of (10.2.1) form, with the given IF,
fi(nAt) = 128 arctan(250(nAt - 0.5))/~ + 128, and the phase r - 2~At ~-~m=Ofi(mAt). The signal amplitude was a = 1, and 201og(a/a~) = 10[dB] (a/a~ = 3.16). Considered time interval was 0 < n a t < 1, with At -- 1/1024. The IF is estimated by using the discrete WD with a rectangular lag-window, wh(t, f) = DFTn__.f [wh(nAt)x(t + nAt)x*(t - n a t ) ] , calculated with the standard F F T routines. T h e a l g o r i t h m is implemented as follows" (1) A set H of window widths hs, corresponding to the following number of signal samples N - {4, 8, 16, 32, 64,128,256,512}, is assumed. In order to have the same number of frequency samples, as well as to reduce the quantization error, all windows are zero-padded up to the maximal window width. (2) For a given time instant t = nat, the WDs are calculated starting from the smallest toward the wider window widths hs. (3) The IF is estimated using equation (10.2.2) and w h s ( t , f ) . (4) The confidence intervals intersection, (10.2.14), is checked for the estimated IF, (t), and a(h~) = v/3a2At/(2~ea2h3) with, for example, ~ + A~ - 6, when p~ -~ 1, and P(n) ~ 1 (see Table 10.2.1, and the Comment that follows).
/hs
Adaptive Instantaneous Frequency Estimation Using TFDs
435
Fig. 10.2.2: Time-frequency analysis of a noisy signal: (a) Wigner distribution with N = 16 ; (b) Wigner distribution with N = 256; (c) Estimated instantaneous frequency using the Wigner distribution with N = 8; (d) Estimated instantaneous frequency using the Wigner distribution with N = 256; (e) Wigner distribution with adaptive window width; (f) Adaptive window width as a function of time; (g) Estimated instantaneous frequency using the Wigner distribution with the adaptive window width; (h) Absolute mean error as a function of the window width; the line represents the mean absolute error value for the adaptive window width.
(5) T h e a d a p t i v e w i n d o w w i d t h ha - h ~ + / 2 is o b t a i n e d f r o m t h e last hs - hs+ w h e n ( 1 0 . 2 . 1 4 ) i s still satisfied. B a c k to (2). C o m m e n t : E s t i m a t i o n of the signal and noise p a r a m e t e r s a and a 2 can be done by using I&l2 + S e2 = -~1 Y]n=N1 [x(nAt)l 2. T h e variance is e s t i m a t e d by 52 = &2r + ae i^2 , where 5r
= { median ( x ~ , i ( n A t )
- Xr,i((n - 1)At)l 9n -- 2, .., N)}/(0.6745v/-2),
(10.2.18)
with xr(nAt) and xi(nAt) being the real and imaginary p a r t of x(nAt). It is assumed t h a t N is large, and A t is small [2-5]. For this estimation we oversampled the signal by factor of four.
T h e W D s w i t h c o n s t a n t w i n d o w w i d t h s Ns -
16 a n d Ns -
256 are p r e s e n t e d
Chapter 10:
436
Instantaneous Frequency Estimation and Localization
in Fig. 10.2.2(a), and Fig. 10.2.2(b), respectively. The IF estimates using the WDs with constant window widths Ns = 8, and Ns = 256 are given in Fig. 10.2.2(c) and Fig. 10.2.2(d). Fig. 10.2.2(e) shows the WD with adaptive window width. Values of the adaptive window width, determined by the algorithm, are presented in Fig. 10.2.2(f). We can see that when the IF variations are small the algorithm uses the widest window in order to reduce the variance. Around the point n a t = 0.5, where the IF variations are fast, the windows with smaller widths are used. The IF estimate with adaptive window width is presented in Fig. 10.2.2(g). Mean absolute error, normalized to the discretization step, is shown in Fig. 10.2.2(h) for each considered window width. The line represents value of the mean absolute error for the adaptive window width.
10.2.4
Summary and Conclusions
An algorithm that can produce an accurate estimate of the optimal window width, without using the bias value, is presented. The IF estimates obtained by using this algorithm and the WD have lower error than by using the best constant-window width, which also is not known in advance. Additional examples, including distributions with adaptive order, the WD as a spectrum estimator, algorithm application to the sensor array signal tracking, as well as other realization details can be found in [2-5, 7].
References [1] A. Goldenshluger and A. Nemirovski, "Adaptive denoising of signals satisfying differential inequalities," IEEE Trans. Information Theory, vol. 43, pp. 873-889, May 1997. [2] V. Katkovnik and L. StankoviS, "Periodogram with varying and data-driven window length," Signal Processing, vol. 67, pp. 345-358, 30 June 1998. [3] V. Katkovnik and L. StankoviS, "Instantaneous frequency estimation using the Wigner distribution with varying and data-driven window length," IEEE Trans. Signal Processing, vol. 46, pp. 2315-2325, September 1998. [4] L. Stankovi5 and V. Katkovnik, "Instantaneous frequency estimation using the higher order L-Wigner distributions with the data driven order and window length," IEEE Trans. Information Theory, vol. 46, pp. 302-311, January 2000. [5] L. Stankovi5 and V. Katkovnik, "The Wigner distribution of noisy signals with adaptive time-frequency varying window," IEEE Trans. Signal Processing, vol. 47, pp. 1099-1108, April 1999. [6] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signalu Part 1: Fundamentals," Proc. IEEE, vol. 80, pp. 520-538, April 1992. [7] A. B. Gershman, L. StankoviS, and V. Katkovnik, "Sensor array signal tracking using a data-driven window approach," Signal Processing, vol. 80, pp. 2507-2515, December 20O0.
437
IF Estimation for Multicomponent Signals
10.3 10.3.1
IF E S T I M A T I O N
FOR M U L T I C O M P O N E N T
SIGNALS~
Time-Frequency Peak IF Estimation
There is a wide range of applications where we encounter signals comprised of M components with different IF laws f,~(t) and different envelopes am(t), in additive noise. It is often desired from such an observed signal, to determine the number of components M, the IF law of each component and the corresponding envelope am (t). This can be achieved by representing the observed signal z(t) in a time-frequency (t-f) domain and use time-frequency filtering methods to recover the individual components [1]. Another approach involves extending algorithms for IF estimation of monocomponent FM signals to the case of multicomponent signals and design an algorithm that simultaneously tracks the various IF components of the observed signal [2,3]. Both approaches require the use of time-frequency distributions (TFDs) with very specific properties such as high time-frequency localization of the IF components and high reduction of cross-terms interferences. The basic concept of instantaneous frequency is described in Pt. 1 of reference [4] and in Chapter 1 of this book. Methods of IF estimation are reported in Pt. 2 of [4] and in Chapter 10. Essential results are reproduced below, for greater clarity.
10.3.1.1
Spectrogram Peak IF estimation
Various approaches for IF estimation of monocomponent signals exist [4]. Most of these algorithms are suited to a particular class of signals, and both fixed and adaptive algorithms have been proposed. Our aim here is to approach the problem from a general viewpoint in order to define a general IF methodology that would be suitable for the largest class of signals found in practical applications. To illustrate this approach, we thus consider from the outset multicomponent signals in additive noise, which can be expressed as follows:
M
z(t)- Z
M
+
m--1
am(t)
+
(10.3.1)
m--1
where the amplitudes am(t) are the component amplitudes, era(t) are the component phases, and ~(t) is a complex-valued white Gaussian noise process of independent and identically distributed (i.i.d.) real and imaginary parts with total variance O'(:. 2 The individual IF laws for each component are given by [1]"
1 dCm(t) fm(t)-~-~ dt
; m = 1,...,M.
(10.3.2)
~ Zahir M. Hussain and Boualem Boashash, Signal Processing Research Centre, Queensland University of Technology, 2 George Street, Brisbane, Queensland 4000, Australia (zahir.hussain@rmit.edu.au, b.boashash@qut.edu.au; Z. M. Hussain is now at the School of Electrical and Computer Systems Engineering, RMIT, GPO Box 2476V, Melbourne, Victoria 3001, Australia). Reviewers" LJ. Stankovi5 and V. Katkovnik.
438
Chapter 10:
Instantaneous Frequency Estimation and Localization
F i g . 10.3.1: Left: The spectrogram of a bat signal using a small analysis window. Right: The modified B-distribution of the same signal with parameter/~ = 0.05. Total signal length is N = 400 and sampling interval /kt = 1. The spectrogram cannot show the weakest component.
A conventional approach to represent and analyze such signals for IF estimation is to take the spectrogram of z(t) and search for the peaks in the t-f domain (see Article 10.1). Curves formed by a continuum of these peaks describe the IF laws of the individual components of the observed signal z(t), as illustrated in Fig. 10.3.1 using a bat signal. Analytically, this can be expressed as follows:
fro(t) -- arg[m~x pro(t, f)]; 0 <_ f <_ fs/2
(10.3.3)
where pro(t, f) is the mth peak of the spectrogram. This spectrogram based approach has several advantages: it is easy to understand, easy to use, and there are no cross-terms producing unwanted interferences. A major disadvantage though is that the time-frequency resolution of the spectrogram for closely spaced components is often poor, especially if one of the components is much weaker, as illustrated in Fig. 10.3.2 for a two-component linear FM signal with one component weaker than the other. Fig. 10.3.2 also compares the performance of the spectrogram with the modified B-distribution (MBD) that was designed specifically for multicomponent IF estimation, as discussed later. 10.3.1.2
Peaks of WVD, PWVD, and RIDs
To improve upon the resolution of the spectrogram, various TFDs were proposed for IF estimation, one of the most important being the Wigner-Ville distribution (WVD). IF estimation using the peak of the Wigner-Ville distribution (WVD) is optimal for linear FM signals with high to moderate signal-to-noise ratios (SNRs) [4], but its performance degrades significantly at low SNRs, and in this case the cross WVD (XWVD) peak can be used as an IF estimator [5]. For polynomial FM signals it was shown that the polynomial WVD (PWVD) gives the best performance, especially at high SNRs (see [6] and Article 10.4). However, both WVD and PWVD suffer from cross-terms when used to analyze multicomponent signals. These crossterms generate artifacts that obscure the (t-f) representation of the signal, leading to
439
IF Estimation for Multicomponent Signals
F i g . 10.3.2: Performance comparison between the spectrogram Spect(t, f ) and the modified Bdistribution (MBD) for/3 = 0.06 using a two-component noise-free linear FM signal at the sampling instant n = t / A t = 64. Total signal length is N = 128 and the sampling interval is A t = 1. The right component is five times larger in amplitude than the left component. Left: Spectrogram with small analysis window length (A = 23). Right: Spectrogram with large analysis window length (A = 83). In both cases the spectrogram fails to resolve the two components. In addition, time resolution is bad for a large window length.
the development of reduced interference distributions (RIDs) to remedy the problem [7]. Straightforward IF estimation using the peak of RIDs give an IF estimate that is biased from the true IF law, and this bias is different for different RIDs. Although reduced, cross-terms still exist and can obscure weak components, hence the need to define special purpose RIDs with efficient cross-terms reduction, high time-frequency resolution and minimum bias from the true IF laws, such as the MBD [3, 8].
10.3.2 10.3.2.1
Properties of IF Estimates Based on Quadratic TFDs IF Estimates and Window Length
We consider an analytic signal z(t) of the form z(t) = ae jr -t-e(t) where the amplitude a is constant, and e(t) is a complex-valued white Gaussian noise with independent identically distributed (i.i.d.) real and imaginary parts with total varia n c e (7e2 . The IF of z(t) is given by eq. (10.3.2), and it is assumed to be an arbitrary, smooth and differentiable function of time with bounded derivatives of all orders. The general equation for quadratic time-frequency representation of the signal z(t) is given by [1]
pz(t, f) -
~ [a(t, r) . K;(t, r)]
r~f
(t)
where G(t,T) is the time-lag kernel, Kz(t,~') - z(t + ~ ) z * ( t - ~) is the signal kernel or the instantaneous autocorrelation function (IAF), and 9 denotes time (t) convolution. For smoothing and localization on the IAF we apply a window function Wh(7) -- -h--zXtw(~) on the instantaneous autocorrelation Kz(t, T), where w(t) is a 1 hence real-valued symmetric window with unity length, i.e., w(t) - 0 for [ t l> 7; the window length is h.
Chapter
440
Instantaneous Frequency Estimation and Localization
10:
The TFD is now dependent on the window length h as follows:
Pz h(t, f) '
-
~
~-~f
[Wh(T)C(t, T) 9Kz(t T)]. (t)
(10.3.4)
'
If pz,h(t, f) is discretized over time, lag, and frequency then we have Ns - 1
Ns - 1
pz,h(n,k) -- E
E
l=-Ns
wh(mAt)Kz(1At, 2mAt)G(nAt - lAt, 2mAt)e-J2~2~
m=-N~
where 2Ns is the number of samples and At is the sampling interval. The IF estimate is a solution of the following optimization j~(t) -- arg[rn~x pz,h(t, f)]
(10.3.5)
(10.3.6)
; 0 < f < fs/2
where fs = 1~At is the sampling frequency. 10.3.2.2
Bias and Variance of the IF Estimate
By extending the results in [9], the estimation bias and variance are found to be [3]
E[Afh(t)] - L h ( t )
var(A]h(t))--
2Fh '
2
cr~
21a I=
[1+
2
a~
21a I=
]
Eh
F~
(10.3.7)
where
zxf (t)-
1r
Lh(t) --
Fh =
-
F
oo
E
wh(mAt)(27rmAt)2G(u, 2mAt)du (X:) m - - - - c x : )
wh(mAt)Ar
mAt)(27rmAt)G(t - u, 2mAt)du
O0 m - - - - o o
Eh--
E
wh(mAt)2(27rmAt)2G(u'2mAt)du
(X) m - - - - ( x )
where Ar T) -- r + r/2) -- r -- r/2) -- Tr (t). Equations (10.3.7) and (10.3.8)indicate that the bias and the variance of the estimate depend on the lag window length h for any kernel G(t, T). To see how the bias and the variance vary with h, asymptotic analysis as At --~ 0 is necessary for the chosen TFD. 10.3.2.3
TFD Properties Needed for Multicomponent IF Estimation
The results above indicate that a general method for IF estimation of multicomponent FM signals in additive Gaussian noise that is based on quadratic time-frequency distributions requires the quadratic TFDs to satisfy the following conditions [3]:
IF Estimation for Multicomponent Signals
441
(1) p(t, f) should have a high time-frequency resolution while suppressing crossterms efficiently so as to give a robust IF estimate for mono- and multicomponent FM signals. (2) p(t, f) should enable amplitude estimation for the individual components of the signal, as the amplitude is necessary for evaluating the variance of the IF estimate for each component [2, 3, 9], and to allow for the reconstruction of the individual components of the signal. (3) the choice of the lag window length should lead to a bias-variance tradeoff (see eqs. (10.3.13)and (10.3.14)). Although some TFDs, like Choi-Williams distribution (CWD) and the spectrogram, can satisfy some of these conditions, they do not meet the second requirement, i.e. allowing direct amplitude estimation. The design of TFDs which satisfies all of these required properties is considered next.
10.3.3 10.3.3.1
Design of Quadratic TFDs for Multicomponent IF Estimation Desirable Time-Lag Kernel
A TFD referred to as the B-distribution (BD) was proposed and shown to be superior to other fixed-kernel TFDs in terms of cross-terms reduction and resolution enhancement [10]. As it does not allow direct component amplitudes estimation [2], as per the second condition on TFDs required for multicomponent IF estimation listed above, the BD kernel was modified as [3]
V(t, T) - Ge(t ) - ke/cosh2e(t)
(10.3.9)
where ~ is a real positive number and k e = F(2fl)/(2 2e-1 F2(fl)), F stands for the gamma function. This modified B-distribution MBD(t, f) is also referred to as the hyperbolic T-distribution ( H T D ) i n [8].
10.3.3.2
Relevant Properties of the Modified B-Distribution (MBD)
Most of the desirable properties of time-frequency distributions relevant to IF estimation (as explained in [1] and [4]) are satisfied by the MBD kernel. In particular, realness, time-shift and frequency shift invariance, frequency marginal and group delay, and the frequency support properties are satisfied. The time support property is not strictly satisfied, but it is approximately true [3]. The three required conditions listed in Subsection 10.3.2.3 are discussed in detail below. (1) R e d u c e d i n t e r f e r e n c e and resolution: This property is satisfied by MBD. For example, consider the sum of two complex sinusoidal signals z(t) = zl ( t ) + z2(t) = a l e j(27rflt+01) -+- a 2 e j(27rf2t+02) where al, a2, 01 and 02 are constants. The TFD of the signal z(t) is obtained as [3] MBD(t, f) - a~ 5(f
- fl)
-t- a 2 5(f - f2) + 2ala27z(t) 5[f - (fl + f2)/2] (10.3.10)
Chapter 10:
442
Instantaneous Frequency Estimation and Localization
\
/
where 7z(t) - IF(/3 + j ~ ( f l - f2))] 2 cos [2~(fl - f2)t + 01 - 02)/F2(~). The crossterms are oscillatory in time and depend on the frequency separation between signal components. If fl and f2 are well separated then the term ]F(~ + jT~(fl - f2))l 2 can be substantially reduced, while F2(~) can be made high if ~ is small. When fl and f2 are not well separated, the MBD still performs better than most quadratic TFDs (see Article 7.4). (2) D i r e c t a m p l i t u d e a n d I F e s t i m a t i o n : The MBD allows direct IF estimation by peak localization, i.e., at any time instant t, it has an absolute maximum at f - 2 ~ l dr for linear FM signals . For non-linear FM signals this estimate is biased, but this bias can be accounted for in the adaptive IF estimation, as presented next. For an FM signal of the form z ( t ) = a ejr the MBD is approximated by [3] c~
MBD(t, f) ~l a ]2 f
J_
G z ( t - u) 5 [ 1 r
(u)- fldu = [a[eG~(t- r162
(f)
oc
, r (f) (10.3.11) where r is the inverse of ~-~r i.e (r = f. Assuming that is not a highly peaked function of f and knowing that G z ( t - r is peaked at t - r the absolute maximum of MBD(t, f) for any time t would be at r = t, or 1 ' (t), which is the IF of the FM signal z(t). For non-linear FM signals, the f = ~--~r energy peak of the MBD is actually biased from the IF because of the extra term Ek=3(kC<) odd) k!2 k-l~'k-1 r (u). The major contribution in this term is due t o r Therefore at the instants of rapid change in the IF law the bias is not negligible a n d eq. (10.3.11) would not be an accurate approximation to the MBD unless suitable windowing in the lag direction is used. For linear FM signals we have r -- 0 for k >_ 3. Assuming that z(t) = aeJ27r(fot+-~ -t2) , where fo and 3o are constants, we have
1 lal2Gf ~ ( t _ -~o(f 1 MBD(t, f) - -~o - fo))
(10.3.12)
which has an absolute maximum at f - fo + ~ot, the IF of the linear FM signal z(t). As ~o -~ 0, i.e., z(t) approaches a sinusoid, we have MBD(t, f ) -~ la125(f - fo), in accordance with eq. (10.3.10). As for amplitude estimation, eqs. (10.3.11) and (10.3.12)indicate that the MBD can support amplitude estimation [3]. (3) A s y m p t o t i c F o r m u l a s U s i n g M B D : The asymptotic formulas for the variance and the bias as At -~ 0 using a rectangular lag window are given by [3]" O"c2 At ( A ~ ( ~ = 27r2 i a 12 [ 1 + 2[al 2 ] - ~ var,_~.o,t,,
(10.3.13)
443
IF Estimation for Multicomponent Signals
and
h2 / j E(AA(t))-
~
A(u)du o0 cosh2Z(t- u) ;
M2 h 2 E(A fh(t)) <_--40
(10.3.14)
where A(t) = f(2)(t + ~'1) + f(2)(t - 71), f(2)(t) is the second derivative of the IF, and sup]f(2)(t)] _< M2. For small h, the optimal window length that minimizes the t
mean squared error is obtained by extending the result in [6] as: 2
hopt(t) =
[7c21a12(f(2)(t)(t)91/cosh2Z(t)) 2
"
Thus, the optimal window length depends on the second derivative of the instantaneous frequency f(2)(t), which is time and signal dependent. Eqs. (10.3.13) and (10.3.14) indicate that the variance and bias of the IF estimate using MBD have the same rates of change with respect to the window length h as those using WVD [9].
10.3.3.3
Examples of Quadratic TFDs Suitable for Multicomponent IF Estimation
TFDs with time-only (or lag-independent) kernels constitute a subclass of the quadratic class of TFDs. These TFDs share the important properties of crossterms suppression, high-resolution, and supporting amplitude estimation, making them well suited for multicomponent IF estimation. The modified B-distribution (a. k. a. hyperbolic T-distribution)was defined earlier in this section. Another example is the exponential T-distribution, which is defined in terms of its time-lag kernel as [8] G(t, T) = G~(t) - ~ exp(-r/t 2) where r/is a real parameter and v/-~-/Tr is a normalization factor. The resulting TFD used for multicomponent IF estimation is then given by eq. (10.3.4).
10.3.4
An Adaptive Algorithm for Multicomponent IF Estimation
Eq. (10.3.15) shows that the optimal window length using the MBD is a function of time and depends on the second derivative of the IF law f(2)(t); it decreases when the IF law f(t) has a high variation. Hence a time-varying window length is needed to optimize the estimation. The Stankovi(~-Katkovnik adaptive algorithm developed in [9] for monocomponent FM signals can be used since the IF estimation variance is a continuously decreasing function of h while its bias is continuously increasing, as shown in eqs. (10.3.13) and (10.3.14); see also Article 10.2. These conditions are necessary for bias-variance tradeoff such that the algorithm converges at the optimum window length that resolves this tradeoff. It is shown in [9] that, if h is
Chapter I0:
444
InstantaneousFrequency Estimation and Localization
small enough then the IF estimate fh(t) is inside the confidence interval D defined as follows
D - [ f h ( t ) - 2~v/var(Afh(t)), fh(t)+ 2~;v/var(Afh(t))]
(10.3.16)
with Gaussian probability P(~;), ~ being a parameter (usually 2); while for large h, fh(t) is outside D. Hence, if we consider an increasing sequence of window lengths {hr[r = 1 : N} (N being the number of samples) and calculate the MBD (and hence fhr(t)) for each hr, then all {Dr} have at least one point in common (which is j?hr (t)) if hr is sufficiently small. The first hr for which Dr-1 and Dr have no point in common is considered optimal as it decides the bias-variance tradeoff. The estimates for the amplitude of the signal a and the variance of noise a~2 [used in eq. (10.3.13) and implicitly in eq. (10.3.16)] were given in [9] as:
~2 + ~~ 2
i ~
-
-N
2..,
n=l
I z(nAt)
2
. ~-2 _
,
a~
-
I N y~, [ z ( n A t ) - z ( ( n - 1 ) A t ) 2N
2
[ (10.3.17)
n=2
where N is the number of samples. For further details of this adaptive algorithm see [9]. For a multicomponent analytic signal of the form stated in eqs. (10.3.1) and (10.3.2), with {am} constant, we can use the extension of the monocomponent IF estimation algorithm in [9] for multicomponent signals as described in [2, 3]. This algorithm tracks component maxima in the time-frequency plane and requires a threshold Tp(t) so as to ignore the local maxima caused by the cross-terms and windowing. In fact, Tp(t) is application and distribution dependent. The algorithm requires the knowledge of the confidence intervals Dr, m for each component, where r refers to the window length (hr) and m refers to the signal component. The calculation of Dr,m depends on the estimation of the individual amplitudes am of the components. Using the MBD, the actual amplitudes [am[ can be estimated as shown in [3]. Using I~m]2 and ~2 to calculate var(Aj~(t)) [given by eq. (10.3.13) for MBD(t, f)], we can define the confidence intervals {Dr,m} for all components as in [2, 3]. The IF fm (t) is contained in at least one of the confidence intervals {Dr, m} if hr is sufficiently small, and the optimal window length is the first hr (from the increasing sequence {h~[r = 1: N}) for which D~-l,m and D~,m have no point in common. E x a m p l e : We consider a three-component FM signal z(nAt) with amplitudes al = 0.5, a2 - 1, and a3 - 1.5 and non-linear IF laws: fl 47+2.5 s i n h - i ( 2 0 ( n A t - 0 . 2 ) ) , f2 - 30+ 2.hsgn(40(nAt-0.6)), and f3 - 10+ 2 sin(10(nAt-0.7)), with S N R - 15 dB, ~ - 0.1, ~ - 2, 0 _< nat <_ 1, and At = 1/128. Fig. 10.3.3 shows the result of the tracking adaptive algorithm for IF estimation of z(nAt) using the peaks of the MBD and the spectrogram. Fig. 10.3.4 shows the conventional peak IF estimation for the same signal using MBD and the spectrogram. Both TFDs fail to give a robust IF estimation at the -
-
IF Estimation for Multicomponent Signals
445
Fig. 10.3.3: Left: Adaptive IF estimation using the peak of the MBD for a three-component FM signal with total length N = 128, SNR = 15 dB, and A t = 1/128. Dashed lines represent the true IF laws. Right: Adaptive IF estimation using the peak of the spectrogram for the same signal, assuming that component amplitudes are known.
Fig. 10.3.4: Left: IF estimation for the three-component FM signal as in Fig. 10.3.3 using the conventional (constant window) peak estimation. Left: MBD peak IF estimation. Right: Spectrogram peak IF estimation. (in addition to failure of IF estimation at instants of rapid frequency change, the spectrogram has poor tracking performance. Note also that both methods cannot track the continuously varying frequency of the third component.)
instants of rapid frequency change. In addition, the spectrogram has poor timefrequency resolution, both in adaptive and constant-window IF estimation.
10.3.5
Summary and Conclusions
Concurrent IF estimation of the separate components of a multicomponent FM signal using TFD peaks location requires conditions on the selection of a suitable quadratic TFD. Required properties are: (1) high time-frequency resolution while suppressing cross-terms eMciently, (2) the TFD to enable direct amplitude estimation for the individual components, (3) the variance of the IF estimate using the TFD should be a continuously decreasing function of the lag window length while the bias is continuously increasing. Quadratic time-frequency distributions that satisfy these conditions were presented and discussed. A constant-window tracking
446
Chapter 10:
Instantaneous Frequency Estimation and Localization
algorithm may not give a robust IF estimate if the IF changes rapidly with time due to the effect of the higher-order derivatives of the IF law. Hence an adaptive algorithm is used for robust multicomponent IF estimation.
References [1] B. Boashash, "Time-frequency signal analysis," in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: Prentice-Hall, 1991. [2] Z. M. Hussain and B. Boashash, "Adaptive instantaneous frequency estimation of multicomponent FM signals," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), vol. II, pp. 657-660, Istanbul, 5-9 June 2000. [3] Z. M. Hussain and B. Boashash, "Multi-component IF estimation," in Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pp. 559-563, Pocono Manor, PA, 14-16 August 2000. [4] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals; Part 2: Algorithms and applications," Proc. IEEE, vol. 80, pp. 519-568, April 1992. With introductions by Michael J. Riezenman. [5] B. Boashash and P. O'Shea, "Use of the cross Wigner-Ville distribution for estimation of instantaneous frequency," IEEE Trans. Signal Processing, vol. 41, pp. 1439-1445, March 1993. [6] B. Barkat and B. Boashash, "Instantaneous frequency estimation of polynomial FM signals using the peak of the PWVD: Statistical performance in the presence of additive Gaussian noise," IEEE Trans. Signal Processing, vol. 47, pp. 2480-2490, September 1999. [7] J. Jeong and W. J. Williams, "Kernel design for reduced interference distributions," IEEE Trans. Signal Processing, vol. 40, pp. 402-412, February 1992. [8] Z. M. Hussain and B. Boashash, "Design of time-frequency distributions for amplitude and IF estimation of multicomponent signals," in Proc. Sixth Internat. Symp. on Signal Processing and its Applications (ISSPA '01), vol. 1, pp. 339-342, Kuala Lumpur, 13-16 August 2001. [9] V. Katkovnik and L. StankoviS, "Instantaneous frequency estimation using the Wigner distribution with varying and data-driven window length," IEEE Trans. Signal Processing, vol. 46, pp. 2315-2325, September 1998. [10] B. Barkat and B. Boashash, "A high-resolution quadratic time-frequency distribution for multicomponent signals analysis," IEEE Trans. Signal Processing, vol. 49, pp. 22322239, October 2001.
Analysis of Polynomial FM Signals in Additive Noise
A N A L Y S I S OF P O L Y N O M I A L NOISE ~
10.4
447
F M S I G N A L S IN A D D I T I V E
In this article, we focus our study on the analysis of polynomial frequency modulated signals (a.k.a. polynomial phase signals), corrupted by additive Gaussian noise. A noiseless complex polynomial frequency modulated (FM) signal, z(t), can be expressed as:
z(t) - A . exp(jr
j E aktk k-O
= A.exp
t : [0, T].
(10.4.1)
Here A is the signal amplitude, r is the signal phase, P is the polynomial phase order, the {ak} are arbitrary parameters and T is the signal duration. Polynomial FM signals are found in many important applications such as radar, sonar and telecommunications. The FM law of the signal corresponds to the instantaneous frequency (IF) trajectory, defined by: 1 de(t)
fi(t) - 27r dt
P
1
= 27r E
k--1
kaktk-1
t - - [ 0 , T].
(10.4.2)
In most applications, polynomial FM signals are immersed in noise. In additive noise, the signal model can be re-written as:
y(t) - z(t) + w(t) - A . e x p
j E
aktk
+ w(t)
t = [0, T],
(10.4.3)
k=0 where w(t) is complex, Gaussian noise. It is often desired to estimate the IF of the class of signals defined by Equation (10.4.3). In the following text, we describe a method which can perform this task in an accurate way.
10.4.1 10.4.1.1
The Polynomial Wigner-Ville Distributions IF Estimation
If the order of the polynomial phase in Equation (10.4.1) is equal to 1, then the signal, z(t), is a complex sinusoidal (i.e. linear phase) signal. For this special case, the IF is a constant independent of the time variable. It can easily be estimated by using classical spectrum analysis methods. For instance, the frequency at which the Fourier transform of the data has its peak can be used as an estimate of the IF. The Fourier transform proves to be a useful tool in this case because it concentrates the signal energy in frequency (i.e. about the IF), while dispersing the noise energy ~ P. O'Shea, School of Electrical and Electronic Systems Engineering, QUT, Brisbane, Australia, 4001 (pj.oshea@qut.edu.au), and B. Barkat, Nanyang Technological University, School of Electrical & Electronic Engineering, Block $2, Nanyang Avenue, Singapore 63978 (ebarkat@ntu.edu.sg).
Chapter 10:
448
InstantaneousFrequency Estimation and Localization
over a wide band. In the Fourier (frequency) domain, the signal energy tends to be quite peaked, while the noise contribution tends to be broad and relatively low. The Fourier transform peak actually provides an optimal estimator for the frequency of a complex sinusoidal data sequence, assuming the noise is white and Gaussian [1]. If the order of the polynomial phase in Equation (10.4.1) is larger than 1, then the signal, z(t), is a non-linear phase signal. For this general case the IF is not constant, but varies with time. The Fourier transform in this case is far less effective for IF estimation; the signal energy is spread over a continuum of frequencies and the signal-to-noise ratio (SNR) in the spectrum is reduced compared with the complex sinusoidal case. It would be convenient if an operator could be applied which could transform the non-linear phase signal into a linear phase one with frequency equal to the IF. Then the Fourier transform could be applied to estimate the IF. One technique that can transform a polynomial FM signal into a sinusoid for a given time instant was proposed in [2] and is described in Articles 5.4 and 5.5. The general form for this transform is I
KP(t,T) -- r l z ( t +ciy) k` . z , ( t _ c i y ) k~
t - [ 0 , T],
(10.4.4)
i--1
where the ci are the coefficients of the transformation, K P, and I is the total I number of distinct ci values. The order of the transformation is q = Y'~i=l 2ki. A general procedure to obtain the ci, ki and q for a fixed polynomial phase order, P, is outlined in [3]. For example, for P - 2 , we find q = 2 and cl - 0.5. For P - 3 or P = 4 we find q = 6, cl = 0.62, c2 = 0.75 and c3 = - 0 . 8 7 [3]. By applying the Fourier transform to the "kernel", K P, of the signal we can determine the IF estimate. T h a t is, the peak of the function,
wP(t, f) =
F
KP(t, w) . e-J2~f~dT,
(10.4.5)
co
yields the IF of the signal, z(t). The function wP(t, f) is known as the polynomial Wigner-Ville distribution (PWVD) [2]. This terminology has arisen because W P (t, f) may be considered to be a generalization of the Wigner-Ville distribution (WVD), which has long been known to be effective for IF estimation of quadratic phase signals. Figure 10.4.1 displays the P W V D of a quadratic ( P = 3) FM signal. The plot on the left is the P W V D of a noiseless signal, while the plot on the right is the P W V D of the same signal immersed in Gaussian noise. In the latter case, the IF law of the original signal is still apparent, despite the degradation in the time-frequency distribution due to the noise. T h a t is, the P W V D is useful for estimating the IF of noisy signals. The P W V D is very good for analyzing polynomial phase signals at high signalto-noise ratio (SNR) [4], but is very poor for low SNR signals. In order to achieve accurate IF estimation at low SNR values, one can use an iterative procedure [5].
Analysis of Polynomial FM Signals in Additive Noise
449
Fig. 10.4.1" PWVDs of a noiseless (left) and noisy (right) quadratic FM signal.
The first step involves forming a preliminary estimate of the IF law, ]i(t), by using a method which has a low SNR threshold but possibly limited accuracy (e.g. the peak of the short-time Fourier transform). ]i(t) is then used to reconstruct an estimate, ~)(t), of the noisy observation, y(t). ~)(t) is used together with y(t) to form a Cross Polynomial Wigner-Ville Distribution (XPWVD)"
X W y P (t, f) -
F
X K ff (t, T) . e-J2~/~dw,
(10.4.6)
O0
where I
X K f f (t, T) - y(t-~ClT)a'~)*(t--clT) al l-I ?)(t+ciT)a~'y*(t-ciT)a~ ; t -- [0,T]. (10.4.7) i=2 ,P
X P W V D peak extraction is then used to form a new estimate of the IF law, which in turn is used to form an updated estimate of y(t). The process of (i) X P W V D formation, (ii) IF law estimation and (iii) reestimation of y(t) is repeated iteratively until there is an acceptably small difference between successive signal estimates. The SNR threshold of this method is typically very similar to that of the preliminary IF estimation method, but the accuracy tends to be much greater. More details about the technique can be found in [5].
10.4.1.2
Discrete-Time Implementation
In practice it is often necessary to operate on discrete-time signals. In this article z(n), y(n) and K ff (n, m) signify the discrete-time signals obtained by sampling z(t), y(t) and K~(t, T) respectively with a sampling rate of 1. The number of samples available from both z(n) and y(n) is denoted N(..~ T). The discrete-time version of the P W V D is: W P ( n , f) - D T F T K g(n, m) (10.4.8)
m--, f
450
Chapter 10:
Instantaneous Frequency Estimation and Localization
where the D T F T operation specifies discrete-time Fourier transformation from the m to f variables. It should be noted that to form the kernel term, KP(n, m), in the above expression, one must obtain samples of y(t) at non-integer values of time. This typically means that in practice one must use interpolation of the discretetime observation, y(n). To limit the number of samples which must be obtained by interpolation it is convenient to introduce some 'lag scaling' and 'frequency scaling' operations into the definition for the discrete-time PWVD. The resulting definition then becomes: (10.4.9) WP (n, f ) DTFT KyP(n,m) s
m--~
$
C/max
where K P (t, 7) -
H i= 1
Ci
y t+
~-
9y* t
C/max
Ci T Cimax
,
(10.4.10)
and K P (n, m ) i s obtained from KyP (t, 7) by sampling at a rate of 1. Cimax is the ci value with the largest magnitude. The discrete-time PWVD defined above can be used for obtaining low variance IF estimates from discrete-time measurements. A derivation of the variance of such estimates is given in the Appendix.
10.4.1,3
Polynomial Phase Parameter Estimation
The PWVD can also be used to estimate the ak parameters themselves, rather than simply the IF. It will be assumed for this estimation task that discrete-time measurements are available and that the noise on these measurements is white and Gaussian. Basically, the estimation procedure involves (a) forming P PWVD slices, (b) determining P IF estimates from these slices, (c) fitting a polynomial IF law to these P IF estimates, (d) deducing the ak parameters from this polynomial IF law, and (e) performing a spectral zoom type technique to refine the parameter estimates. The procedure is specified mathematically below: 1. For the observation, y(n), form the set of PWVD slices, W yP (f) - [WP(nl, f) WP(n2, f) ... WP(nP, I)]T,
(10.4.11)
where nl, n 2 , . . . , np are different, well-spaced discrete-time positions. There is some flexibility in the selection of {nl, n 2 , . . . , np}. Having them equally spaced between 0.25N and 0.75N is typically a good choice, i.e. ni=0.25N+0.5N(i-1)/(P-1), i=l,...,P. 2. Estimate a vector of IF estimates according to: fi-
arg
m a x {WyP(f)},
/
(10.4.12)
where a r g m a x here denotes the vector argument of a vector (row by row) maximization.
Analysis of Polynomial FM Signals in Additive Noise
451
3. Determine the vector of initial coarse estimates for the polynomial phase parameters estimates. This vector is denoted e~i = [dil di2 di3 ... dip] T and is obtained by evaluating the matrix equation: iii -- 2 7 r X - l f i
where
X--
(10.4.13)
1
2nl
3n 2
...
p n P-1
1
2n2
3n~
...
P n g-1
1
2up
3n~
...
Pn~ -~
(10.4.14)
4. Dechirp the observation by t h e e s t i m a t e d polynomial phase law:
yd(n) -- y ( n ) e x p ( - j ( d i l n + di2 n2 + . . . + dipnP)).
(10.4.15)
(The dechirped observation will now have spectral content which is highly localized around DC). 5. Filter yd(n) with an ideal low-pass filter whose bandwidth is B. Decimate the result by a factor, 1/B, to obtain yo(n). 6. Unwrap the phase of yo(n). The vector of unwrapped phase values, V - [ V ( 0 ) V(1) ... V ( N B - 1)] T, is also modeled as a (noisy) polynomial phase signal with phase parameters, ad=[ad0 adl ... adp] T. One can then estimate ad with the following linear regression: a:d = ( G T G ) - I G T v , where G -
1 1 1 1
0 1 2 NB-
... ... ... 1
(10.4.16)
0 1P 2P
...
,
(10.4.17)
1)
7. Use the results from Step 6 to refine the parameter estimates from Step 3: a f - ai + a~d.B
(10.4.18)
where B - [ 1 B B 2 . . . BP] T, and a-d.B denotes element by element multiplication of ad and B. Note that at least P + 1 samples of yo(n) are required for the regression in Step 6 to be properly defined. In practice, it is recommended that a large number of samples be obtained. Note also that if y(n) is white then yo(n) will also be white,
452
Chapter 10:
Instantaneous Frequency Estimation and Localization
and being a filtered version of y(n), will have a higher SNR (by a factor of 1/B). The unwrapped phase of yo(n) will also be very close to white, provided that the SNR of yo(n) is above about 9dB [6]. The overall algorithm yields estimates which are asymptotically optimal above threshold. The threshold can be predicted with the formulae in [7].
10.4.2
Higher Order Ambiguity Functions
An alternative approach for analyzing polynomial phase signals has been provided by Peleg, Porat and Friedlander [8], [9]. The first step in the method is to apply a transform to convert an arbitrary pth order polynomial phase signal into a linear phase one whose frequency is proportional to the ag parameter. This can be done with "Higher order ambiguity functions (HAFs)". After the transformation in the first step, it is possible to estimate the ag parameter with the Fourier transform. The second step in the method is to reduce the original signal to a ( P - 1) th order polynomial phase signal by "de-chirping" the highest order phase term out of the signal. A similar process can then be used to estimate the ap-1 parameter, and subsequently all lower order parameters. The formal algorithmic specification for the phase parameter estimators in [8] is: 1. Let p -
P and let y(P) (n) - y(n), 1 <_ n <_ N.
N Calculate the ap estimate using the formula: 2. Set ~-p - ~-.
s
---
w P !Tp - 1 argmax
I-Iq=~
E
p-1
ySq
(n -
e -j~n
n--(M--1)Tp+l
1
= P! 7~_1 argrrax {IHAFy(w, Tp)l},
(10.4.19)
where ySq(n) -- { y*(n)Y(n)ififqqisiseVenodd ' * signifies complex conjugation, and HAFy(w, Tp) denotes the pth order "higher order ambiguity function". 3. Set y(p-1)(n) - yP(n)exp(-j~pnP).
Then set p - p - 1.
4. If p > 1, go back to Step2. Else proceed. 5. a0 - a r g { y ~
A-
ly~
9
The parameter estimates obtained with the above algorithm are good but not optimal [9]. The estimates can be refined (to the point of optimality) with either a Newton algorithm or with a procedure similar to the one outlined in Steps 4-7 of Section 10.4.1.3. Once the co, a1,... ,ag parameters have been estimated, (10.4.1) can be used to obtain a reconstruction of the entire signal phase (and using (10.4.2), the IF).
Analysis of Polynomial FM Signals in Additive Noise
453
Like the PWVD approach this method performs very well at high SNR, but suffers from threshold effects at low SNR. As with the PWVDs, an iterative method can be used to improve performance at low SNR, the details being provided in [10].
10.4.3
A Comparison of the Polynomial Wigner-Ville Distributions and the Higher Order Ambiguity Functions
10.4.3.1 Single Component Analysis For analyzing a single polynomial phase signal the computational burdens of the PWVD and HAF methods are similar. The asymptotic variances of the ak parameter estimates using the PWVD method are also similar. In fact, if the estimate refinement scheme outlined in Steps 4-7 of Section 10.4.1.3 is used for both methods, then the two techniques are both optimal above threshold. The SNR thresholds for the HAF and PWVD techniques are similar for lower order phase laws (i.e. third order and lower), while for higher order laws the PWVDs tend to have lower thresholds. Formulae for the threshold values are given in [8] and [7].
10.4.3.2
Multiple Component Analysis
Both the PWVDs and the HAFs are non-linear transforms and hence produce "crossterms" when operating on a sum of two or more polynomial phase signals. These cross-terms complicate the task of analyzing such "multi-component" signals. The PWVDs produce cross-terms which are often spectrally concentrated, while the HAFs give rise to cross-terms which are spectrally dispersed. The spectrally dispersed cross-terms prove to be more conducive to analysis, because they appear more like white noise. In analyzing multiple components with the HAFs one can proceed by first estimating the parameters of the highest amplitude component, and then removing the estimate of this component from the signal [11]. Then the next highest amplitude component can be analyzed. The process can be repeated until all components have been accounted for. Pre-requisites for the effective analysis of multi-component signals include (1) higher required SNR thresholds than for single component analysis, (2) good separation of the various components in the HAFs, and (3) phase polynomial orders of the constituent signals which are not too high. Requirement (2) is necessary because the level of dispersion of the cross-terms is related to the extent of the separation of the components in the HAFs. Requirement (3) is necessary because the energy of the cross-terms relative to the auto-terms increases with increasing HAF order.
10.4.4
Appendix: Asymptotic Mean-Square Error of a PWVD-Based IF Estimate
It is assumed that the IF of a polynomial phase signal is to be estimated, using the peak of the discrete-time PWVD. It is further assumed that the signal is embedded in complex, white, Gaussian noise with power, cr2, and that the estimate of the IF
454
Chapter 10:
InstantaneousFrequency Estimation and Localization
is required at time, n - 0. This appendix derives the asymptotic mean-square error (a.m.s.e.) of the IF estimate. The P W V D based IF estimate, evaluated at n = 0, is defined by: ]0 - arg m~x{Wg(n, f)ln=0} - arg m ~ x { W P ( f ) } , where WP ( f ) =
P(n-O,m) DTFT Ky, m---+ C/max [
(10.4.20)
(10.4.21)
The P W V D of a noisy polynomial phase signal, evaluated at n = 0, is: (N-1)/2
w P (f) -
I
E
H [y(mci/Cim~x) " Y* (--mci/Cima,,)] k' e -j2=fm/c'm~'' 9
m=-(N-1)/2i=l
(10.4.22) In the absence of noise W f (f) has a global maxima at f = f0, where fo is the value of the IF at n - 0. The perturbation of w g ( f ) which occurs when noise is added to the observation is given by: (N-l)/2
5wP (f) =
E
zw(m)e-J2~Sm/~'m~"'
(10.4.23)
m=-(N-1)/2
where I
z~(,~) - I I [Y(mo,/O,m..)Y* (-mo,/O,m..)] k' -[z(m~,/O,m..)z* ( - m ~ , / o , . . . ) ] k' . i=l
(10.4.24) With this perturbation, the peak of W f (f) shifts from f = f0 to f = fo + 5f. This appendix derives the a.m.s.e, of 5f (i.e. the a.m.s.e, of the IF estimate). To achieve this, a formula for the asymptotic mean-square fluctuations of the maximum of a real valued random function is used [12]. This formula, re-expressed for the scenario in this section, is: -2 E
-
E
L
Of 2
(10.4.25)
where E{.} denotes the expected value. Under the simplifying assumption that all samples of the kernel are uncorrelated, relevant terms on the right hand side of (10.4.25) are found to be:
02wP(fo) Of 2
-4~?AqN 3 12C2m~~
'
(10.4.26)
Analysis of Polynomial FM Signals in Additive Noise
455
N/2
oswy(fo) of
-j27r
m=-N/2
m . z w ( m ) e -j2rfm/cimax ,
4 ~2 A 2qN 3 Vkern E
where
gkern,
[
Of
"~
12
(10.4.27)
(10.4.28)
the "noise to signal ratio" in the kernel, is [4], [13]:
Vkern-- [ ( ~ I kl 2 \i=0
o'2i / 2 x
...
x
i )
/s \i=o
ki i
/
ii "A--~
- 1. (10.4.29)
(10.4.26) and (10.4.28) can be substituted into (10.4.25) to obtain: E {6/2} ~
10.4.5
12-c 2 " Ykern lmax 4~-2. N 3 9
(10.4.30)
Summary and Conclusions
In this article the problem of analyzing polynomial FM signals in additive noise has been addressed. Two different approaches have been described, the polynomial Wigner-Ville distributions and the higher order ambiguity functions. Both methods perform well at high SNR, and with some modifications, can be made to work well even at low SNRs. A brief comparison of both methods has been made.
References [1] D. C. Rife and R. R. Boorstyn, "Single-tone parameter estimation from discrete-time observations," IEEE Trans. Information Theory, vol. 20, pp. 591-598, September 1974. [2] B. Boashash and P. J. O'Shea, "Polynomial Wigner-Ville distributions and their relationship to time-varying higher order spectra," IEEE Trans. Signal Processing, vol. 42, pp. 216-220, January 1994. [3] B. Barkat and B. Boashash, "Design of higher order polynomial Wigner-Ville distributions," IEEE Trans. Signal Processing, vol. 47, pp. 2608-2611, September 1999. [4] B. Barkat and B. Boashash, "Instantaneous frequency estimation of polynomial FM signals using the peak of the PWVD: Statistical performance in the presence of additive Gaussian noise," IEEE Trans. Signal Processing, vol. 47, pp. 2480-2490, September 1999. [5] B. Ristic and B. Boashash, "Instantaneous frequency estimation of quadratic and cubic polynomial FM signals using the cross polynomial Wigner-Ville distribution," IEEE Trans. Signal Processing, vol. 44, pp. 1549-1553, 1996. [6] S. A. Tretter, "Estimating the frequency of a noisy sinusoid by linear regression," IEEE Trans. Information Theory, vol. 31, pp. 832-835, November 1985.
456
Chapter 10:
Instantaneous Frequency Estimation and Localization
[7] D. C. Reid, A. M. Zoubir, and B. Boashash, "Aircraft flight parameter estimation based on passive acoustic techniques using the polynomial Wigner-Ville distribution," J. Acoustical Soc. of America, vol. 102, pp. 207-23, July 1997. [8] S. Peleg and B. Porat, "Estimation and classification of polynomial-phase signals," IEEE Trans. Information Theory, vol. 37, pp. 422-430, March 1991. [9] B. Porat and B. Friedlander, "Asymptotic statistical analysis of the high-order ambiguity function for parameter estimation of polynomial phase signal," IEEE Trans. Information Theory, vol. 42, pp. 995-1001, May 1996. [10] P. O'Shea, "An iterative algorithm for estimating the parameters of polynomial phase signals," in Proc. Fourth Internat. Symp. on Signal Processing and its Applications (ISSPA '96), vol. 2, pp. 730-731, Gold Coast, Australia, 25-30 August 1996. [11] S. Peleg and B. Friedlander, "Multicomponent signal analysis using the polynomialphase transform," IEEE Trans. Aerospace ~ Electronic Systems, vol. 32, pp. 378-386, January 1996. [12] S. Peleg and B. Porat, "Linear FM signal parameter estimation from discrete-time observations," IEEE Trans. Aerospace ~ Electronic Systems, vol. 27, pp. 607-616, July 1991. [13] G. Reina and B. Porat, "Comparative performance analysis of two algorithms for instantaneous frequency estimation," in Proc. Eighth IEEE Workshop on Statistical Signal and Array Processing (SSAP-96), pp. 448-451, Corfu, Greece, 24-26 June 1996.
IF Estimation o f F M Signals in Multiplicative Noise
10.5 10.5.1
457
IF E S T I M A T I O N OF FM SIGNALS IN M U L T I P L I C A T I V E NOISE ~ Random Amplitude Modulation
Most IF estimation techniques, such as those presented in the previous articles of this chapter, assume that the signal of interest has a constant amplitude. While this is a valid assumption in a wide range of scenarios, there are several important applications in which this assumption does not hold. Indeed, in many situations the signal may be subjected to a random amplitude modulation which behaves as multiplicative noise. Examples include fading in wireless communications [1], fluctuating targets in radar [2], and structural vibration of a spacecraft during launch and atmospheric turbulence [3]. In this article, we focus on non-parametric methods. In particular, we show that the Wigner-Ville distribution (defined in Section 2.1.4) is able to display the IF of a signal affected by multiplicative noise, and that this representation is optimal in the sense of maximum energy concentration for a linear FM signal. For higher-order polynomial FM signals, the use of the polynomial Wigner-Ville distribution (PWVD), presented in Article 5.4, is shown to give optireal representations. Statistical performance of each case will be presented here.
10.5.2
Linear FM Signal
In this section, we study the case of a linear FM signal and assume that the multiplicative noise is a real-valued process.
10.5.2.1
Optimality of the Wigner-Ville Spectrum
First we show that the Wigner-Ville spectrum (WVS) is optimal, in the sense of IF localization, for the time-frequency analysis of linear FM signals affected by multiplicative noise. Consider the signal y(t) given by
y(t)
-
a(t) . z(t)
(10.5.1)
where a(t) is a non-zero-mean real-valued stationary noise and z(t) is a deterministic FM signal given by z(t) = exp{jC(t)}. For a linear FM signal, r is a secondorder polynomial. Using the expectation operator notation, the autocorrelation of the signal above can be expressed as
-
E [y(t - i) y* (t + i)]
= E [ a ( t - ~)a(t + ~)]. { z ( t - ~) z*(t + ~)} -- T~a(T)I(z(t, T).
(10.5.2)
~ B. Barkat, Nanyang Technological University, School of Electrical & Electronic Engineering, Block $2, Nanyang Avenue, Singapore 639798 (ebarkat@ntu.edu.sg), and B. Boashash, Queensland University of Technology, Signal Processing Research Centre, GPO Box 2434, Brisbane, Q 4001, Australia (b.boashash@qut.edu.au). Reviewers: Dr. F. Sattar and S. Gulam Razul.
458
Chapter 10:
Instantaneous Frequency Estimation and Localization
The WVS of y(t), which is defined as the Fourier transform of Ey(t, 7) [4], can be expressed as
)/Yy(t, f) - J%_.f {ICy(t, ~-)} (10.5.3) = ~ _ ~ f { z ( t - ~i)z*(t + ~)} . f ~__.f {E [ a ( t - ~)a(t + ~)]} (10.5.4) = Wz(t, f) . f Sa(f) (10.5.5) where , f is the convolution operation in the frequency space. If' we express the non-zero-mean random process a(t) as a(t) = ~a +ao(t), where p~ is a constant mean of a(t) and ao(t) is a zero-mean noise with autocorrelation Rao (T), we can re-write the WVS of y(t) as
W y ( t , f ) - # 2 W z ( t , f ) + Sao(f) *f W z ( t , f ) .
(10.5.6)
For the case of a linear FM signal, the Wigner-Ville distribution (WVD) is given in Article 2.1 by [4] Wz(t, f) = 5(f - fi(t)), (10.5.7) where fi(t) is the signal IF and 5 is the Dirac delta function. In this case, we obtain
Wy(t, f) - #25(f
-
fi(t)) + Sao(f - fi(t)).
(10.5.8)
Note that Eq. (10.5.8) exhibits the presence of a spectral line at the frequency f~(t) for all time instants. This means that, theoretically, the WVS always localizes the IF of a linear FM signal. This makes it a powerful tool in the analysis of linear FM signals affected by multiplicative noise. Also note that when #a = 0, the WVS will not exhibit a peak at the signal IF, indicating a breakdown of the WVS to analyze the noisy signal. As an illustration, consider a unit-modulus linear FM signal sampled at 1 Hz, whose frequency range lies between 0.1 Hz and 0.4 Hz. The signal length (in samples) is chosen as N = 511. This signal is multiplied by a real-valued i.i.d. Gaussian noise with a standard deviation aa = 1 and a mean equal to 0 and 1, respectively. Fig. 10.5.1 displays the WVS (one realization) of the noisy signal for both values of the mean. As expected, the WVS for the zero-mean case cannot reveal the signal IF; however, it can do so for the other case. In this last case, the peak of the WVS can be used to estimate the IF of the signal. In what follows, we will evaluate the statistical performance of such an estimator.
10.5.2.2
Statistical Performance Evaluation
Here, for a more complete study, we consider the presence of additive noise as well as the multiplicative noise. The objective is to derive the asymptotic variance of the IF estimator, based on the peak of the WVS, for this case. Let the discrete-time version of the noisy signal be
y(n) - a(n)e jr
+ w(n) , n = O , . . . , N - 1 .
459
IF Estimation of FM Signals in Multiplicative Noise
The Wigner-Ville spectrum of a linear FM signal affected by a real-valued Gaussian multiplicative noise. The noise variance is I (both plots) and its mean is 0 (left plot) and 1 (right plot).
F i g . 10.5.1:
The process a(n) is considered to be a real-valued stationary Gaussian noise with 2 respectively. The complex zero-mean admean and variance given by pa and Ca, ditive process w ( n ) is assumed to be stationary, white, circular and Gaussian with 2 In addition, both noises are assumed to be independent. variance equal to a w. The WVS used to estimate the signal IF is defined, in the discrete-time domain, using the expression given in Article 6.1, as [4] Wz(n, f) - E
2
~
y ( n + m ) . y*(n - m ) e - j 4 ~ f m
.
(10.5.10)
m---M
Using straightforward derivations, we can show that, for increasing window length (2M + 1), the WVS converges in probability to # 2 5 ( f - f i ( t ) ) [5]. We can also show that the IF estimator asymptotic variance is approximately equal to [5] V a r ( f i ( n ) ) - (27r)2,5. (2M + 1) a
2+ ~ + ~
(10.5.11)
2 2 and $ ~ - p a2 / a 2. Note that" where Sa - p~/a~
(i) W h e n #a = 0, the variance goes to infinity indicating that the WVS based estimator breaks down. This result confirms the analysis presented earlier. (ii) When #a = A where A is a constant and aa = 0, i.e., the signal under consideration is just a constant amplitude linear FM signal embedded in noise, the variance expression can be-rewritten as 2 3a212A 2 + a~] V a r ( ] i ( n ) ) - (27r)2A4(2 M + 1) 3 .
Chapter 10:
460
Instantaneous Frequency Estimation and Localization
corrupted right plot is for Sw = 5 dB.
F i g . 1 0 . 5 . 2 : Theoretical (dashed curve) and estimated ( ' + ' ) variances of a linear FM signal
by multiplicative and additive noise. Left plot is for 8w = 0 dB and
2 If, in addition, we assume high signal-to-noise ratio (SNR) (i.e., A 2 :>> aw) the asymptotic variance expression reduces to
V a r (fi (n)) -
(27r)2(A2)(2M + 1) 3
which is similar to the result obtained in [6]. The above theoretical results were confirmed by Monte-Carlo simulations. Specifically, we estimate the IF estimator variance using 5000 realizations of the signal given by (10.5.9). In Fig. 10.5.2, we display the theoretical (dashed curve) and the estimated ('+') variances plotted against ,S'a for Sw = 0 and 5 dB, respectively. If the signal under consideration is not a linear FM but a higher-order polynomial FM signal, the WVS becomes inappropriate because it introduces some artifacts which might hide the real features of the signal and its peak based IF estimator is biased for such signals [6]. In this situation, a different tool is needed for the analysis. This is the topic of the next section.
10.5.3
Polynomial FM Signals
Compared to the previous section, the extension here is two-fold: (i) the signal considered is assumed to be a polynomial FM signal of arbitrary order and (ii) the multiplicative noise is no longer limited to a real-valued process but is assumed to be a non-zero complex circular Gaussian process. Based on this, the noisy signal y(t) is now written as y(t) = a(t) . z(t) + w(t) (10.5.12) where the stationary processes a(t) and w(t) are both assumed circular, complex, 2 Gaussian and independent with means and variances given by (Pa, a 2) and (0, aw),
461
IF Estimation of FM Signals in Multiplicative Noise
respectively. The noiseless polynomial FM signal z(t) is given by
z(t) -- e jr
-- exp
j E
aiti
(10.5.13)
i--0
where the ai are real coefficients and P is the order of the polynomial phase. Note that in the derivation below we do not require the knowledge of the coefficients ai; we only assume a polynomial FM signal. The IF of the signal z(t) is given in Article 1.3 by P
fi(t)=
1 de(t) = ~ i a i t i _ 2rr dt
1
i--1
and our primary objective here is to estimate fi(t) from the noisy signal y(t). For that purpose, we use the polynomial Wigner-Ville distribution (PWVD) defined in Article 5.4 as w}q)(t, f ) -
z(t + ciT)z*(t + c _ i 7 ) e - J 2 " f r d 7
(10.5.14)
cx~ i--1
-
oo
K(q)(t, 7). e - J 2 ~ f ' d T
(10.5.15)
where q is an even integer which indicates the order of non-linearity of the PWVD. The coefficients ci and c-i (i = 1, 2 , . . . , q/2) are calculated so that the P W V D is real and equal to w ( q ) ( n , f ) - 5 ( f - fi(t)), for signals given by Eq. (10.5.13). Note that the realness of the P W V D implies that ci = - c - i . Also note that the WVD is a member of the P W V D s class with parameters q = 2 and cl = - c - 1 = 0.5. Full details of the design procedure may be found in [7] and Articles 5.4 and 5.5. The choice of the P W V D stems from the fact that it yields a continuum of delta functions around the IF for a given polynomial FM signal. This property implies that the peak of the P W V D can be used as an IF estimator for polynomial FM in a noisy environment. In [8], the statistical performance of this estimator was evaluated for noisy signals described by Eq. (10.5.12). It shows that this estimator is unbiased and its asymptotic variance is approximately equal to [8] 2 nl 2 6 (0 .2 -{--0.w) E i - - 1 ki
V a r ( f i ( n ) ) - (2rr)21#a12(2M q- 1) 3 .
(10.5.16)
In the above expression, nl represents the number of the different coefficients ci in the P W V D kernel, while ki (for i = 1 , . . . , nl) represents the multiplicity of each of these coefficients ci, and (2M + 1) is the window length considered in the P W V D discrete-time implementation. Note that:
462
Chapter 10:
Instantaneous Frequency Estimation and Localization
(i) When #a = 0, the variance goes to infinity indicating that the P W V D based estimator breaks down. (ii) When #a = A where A is a constant and aa = 0, i.e., the signal under consideration is just a constant amplitude polynomial FM signal embedded in complex Gaussian noise, the variance expression can be-rewritten as nl 6 a~2 ~i=1 k~2
Var(j~) = (27r)2A2(2 M + 1) 3 . The above expression is exactly similar to the result obtained in [6], which treats constant amplitude polynomial FM signals only.
10.5.3.1 Monte-CarloSimulations To confirm the validity of the above theoretical results, we consider the IF estimation of a quadratic FM signal at the middle of the signal interval. The peak of the sixthorder P WVD, whose signal kernel is given by [7] = [y(t + 0.62 ) y*(t •
[y(t + 0.75 ) y*(t - 0.75 )]
[y(t - 0.sTy) y*(t +
is used here as the IF estimator. The noisy signal y(t) is generated as suggested by Eq. (10.5.12). For this example, we choose the sampling period equal to T = 1, the signal length N equal to 129, the window length equal to (2M + 1 = N = 129) and the noise variances to be equal (i.e., a a2 _ a2). In the simulations, the overall signal-to-noise ratio, defined as 2 SNRwl = lO loglo(l#al2/(a 2 + aw)),
is varied in a 1 dB step from 0 to 15dB. Monte-Carlo simulations for 1000 realizations are run for each value of SNRwl. The results of two different experiments, one performed for l ~ a l - - 0 . 0 1 and the other performed for I#al - 1, are displayed in Fig. 10.5.3 (left plot). We observe that, above a certain threshold, the estimated variances represented by ' + ' (for IPa[ = 0.01) and 'o' (for I#al = 1) are in total agreement with the theoretical ones given by Eq. (10.5.16) and represented by the continuous lines (superimposed). Simulations run under the same noise conditions for other polynomial FM signals using the appropriate P W V D order also confirm the theoretical results presented above. One such case is when y(t) is a linear FM signal and the P W V D considered is the second-order PWVD. The results of this experiment are displayed in the right plot of Fig. 10.5.3.
10.5.4
Time-Varying Higher-Order Spectra
Time-Varying Higher-Order Spectra (TV-HOS) based on the polynomial WignerVille distribution are defined as the expected value of the P W V D [9], namely
w(q)(t, f) = E rW(q)(t, f)] L
J
(10.5.17)
IF Estimation of FM Signals in Multiplicative Noise
463
Fig. 10.5.3: Results of the experiments for a quadratic (left plot) and a linear (right plot) FM signal corrupted by complex (;aussian multiplicative and additive noise processes. The continuous lines (superimposed) represent the theoretical variances while '+' and 'o' correspond to the estimated variances for I ~ 1 = 0.01 and I~ol = 1, respectively.
where w(q)(t, f) is the qth-order PWVD defined by (10.5.14). Interchanging the expectation operator with the integration in the PWVD, one obtains ]/Y~q)(t, f) -
z(t + ci~')z* (t + c_iT) e-J2=f'dT
E oo
=
(10.5.18)
Li--1
(t,
(10.5.19)
oo
where/fi (q) (t, T) represents a slice of a time-varying qth-order moment function [9]. If the quantity/C (q) (t, ~-) is absolutely integrable, then, l/Y(q) (t, f) can be interpreted as a form of the time-varying higher-order moment spectrum. In [9], the authors showed that TV-HOS combine the advantages of classical time-frequency analysis with the benefits of higher-order spectra. To avoid the problem of non-superposition of the higher-order moments, the authors in [9] used higher-order cumulants instead. It is important to note that since non-stationary random signals are non-ergodic, the ensemble averaging above cannot be replaced by time averaging. In this situation, local ergodicity has to be assumed. Readers interested in TV-HOS are referred to [9] for more details, including examples of the efficacy of TV-HOS in the analysis of random FM signals affected by multiplicative noise. (See also Section 14.5.4 and the references in [9].)
10.5.5
Summary and Conclusions
The Wigner-Ville spectrum (WVS) and polynomial Wigner-Ville distributions (PWVDs) are considered tbr the analysis of polynomial FM signals corrupted by multiplicative and additive noise. In the noisy linear FM case, the WVS is shown
464
Chapter 10:
InstantaneousFrequency Estimation and Localization
to give optimal IF localization. Accordingly, the peak of the WVS is proposed as an IF estimator. A statistical performance test shows that this estimator is very accurate even at low signal-to-noise ratio values. For the case of the noisy higher-order polynomial F M signal, the peak of the P W V D is shown to be a very consistent and accurate IF estimator.
References [1] R. S. Kennedy, Fading dispersive communication channels. New York: Wiley, 1969. [2] H. L. L. Van Trees, Detection, Estimation, and Modulation Theory, vol. III: "RadarSonar Signal Processing and Gaussian Signals in Noise". New York: Wiley, 1971. Reprinted Malabar, FL: Krieger, 1992. Reprinted New York: Wiley, 2001. [3] A. G. Piersol, "Power spectra measurements for space vibration data," J. Spacecraft and Rockets, vol. 4, p. 1613, December 1967. [4] B. Boashash, "Time-frequency signal analysis," in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: Prentice-Hall, 1991. [5] M. R. Morelande, B. Barkat, and A. M. Zoubir, "Statistical performance comparison of a parametric and a non-parametric method for IF estimation of random amplitude linear FM signals in additive noise," in Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pp. 262-266, Pocono Manor, PA, 14-16 August 2000. [6] B. Barkat and B. Boashash, "Instantaneous frequency estimation of polynomial FM signals using the peak of the PWVD: Statistical performance in the presence of additive Gaussian noise," IEEE Trans. Signal Processing, vol. 47, pp. 2480-2490, September 1999. [7] B. Barkat and B. Boashash, "Design of higher order polynomial Wigner-Ville distributions," IEEE Trans. Signal Processing, vol. 47, pp. 2608-2611, September 1999. [8] B. Barkat, "Instantaneous frequency estimation of nonlinear frequency-modulated signals in the presence of multiplicative and additive noise," IEEE Trans. Signal Processing, vol. 49, pp. 2214-2222, October 2001. [9] B. Boashash and B. Ristic, "Polynomial time-frequency distributions and time-varying higher order spectra: Application to the analysis of multicomponent FM signal and to the treatment of multiplicative noise," Signal Processing, vol. 67, pp. 1-23, May 1998.
Chapter 11
Time-Frequency Synthesis and Filtering To model and predict accurately the effects of linear systems on non-stationary signals in applications such as signal cleansing and enhancement, we need the capability to design time-varying linear systems with precise time-frequency specifications. This topic is covered in four articles with appropriate cross-referencing to other chapters. The design of time-varying filters is useful in applications where it is desired to separate, suppress or reduce undesirable non-stationary signal components. This can be achieved with a number of methods such as the STFT and Gabor transtbrm (Article 11.1). In particular, the use of the Gabor expansion for time-varying filtering is illustrated on an application to monitoring machine vibrations (Article 11.2). Another illustration of the procedure for designing a time-varying filter is provided in the context of an application involving hands-free telephone speech signals (11.3). Another important application of time-varying filtering, namely signal enhancement, is described using an iterative algorithm based on time-frequency peak filtering (11.4).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
465
Chapter 11:
466
11.1 11.1.1
Time-Frequency Synthesis and Filtering
LINEAR TIME-FREQUENCY FILTERS~ Time-Frequency Design of Linear, Time-Varying Filters
Linear, time-varying (LTV) filters are useful in many applications, especially for weighting, suppressing, or separating nonstationary signal components. The inputoutput relation of an LTV filter H with kernel (impulse response) h(t, t') reads
y(t) - (Hx)(t) =
F
h(t, t') x(t') dt' .
(11.1.1)
oo
The nonstationary nature of input signal x(t), output signal y(t), and LTV filter H suggests the use of time-frequency (TF) representations for analyzing, designing, and/or implementing LTV filters. There are two fundamentally different approaches to a TF design of LTV filters, namely, the "explicit" and "implicit" design philosophies [1,2]. Both are based on a prescribed TF weight function M(t, f) that provides a TF specification of the desired filtering characteristic. 9 E x p l i c i t design: The impulse response h(t, t ~) of the LTV filter H is calculated (designed) such that a TF representation of H is equal to or best approximates the TF weight function M(t, f). In this article, the TF representation of H will be chosen as the generalized Weyl symbol (see Article 4.7). An alternative explicit design of LTV filters using the Wigner distribution of an LTV system has been considered in [3]. The filtering itself is performed in the time domain according to (11.1.1). 9 I m p l i c i t design: The LTV filter H is designed implicitly during the filtering, which is a three-step analysis-weighting-synthesis procedure. First (analysis step), a linear TF representationmsuch as the short-time Fourier transform-of the input signal x(t) is calculated. Second (weighting step), this TF representation is multiplied by the TF weight function M(t, f). Third (synthesis step), the output signal y(t) is calculated in a linear manner from the TF function obtained in Step 2. Since all processing steps are linear, the overall procedure amounts to an LTV filter. In this article, we will consider explicit TF filter designs based on the generalized Weyl symbol [1,2,4,5] and implicit TF filter designs based on the short-time Fourier transform [1,2,6-10] and the Gabor transform [2,11,12]. In particular, we will show that the resulting filters tend to perform similarly if the TF weight function M(t, f) is sufficiently smooth. ~ F. H l a w a t s c h and G. M a t z , Institute of Communications and RadioFrequency Engineering, Vienna University of Technology, Gusshausstrasse 25/389, A-1040 Vienna, Austria (email: fhlawats@pop.tuwien.ac.at, g.matz@ieee.org, web: http://www.nt.tuwien.ac.at/dspgroup/time.html). Reviewers: M. Amin and D. L. Jones.
467
Linear Time-Frequency Filters
11.1.2
Explicit Design~The Generalized Weyl Filter
The generalized Weyl symbol (GWS) of an LTV system H is defined as
L(~)(t,f) ~
w , t - -~+a 7
h t+ -~ - a
dr,
(11.1.2)
oo
where a is a real-valued parameter. The special cases a = 0 and a = 1/2 give the Weyl symbol and Zadeh's time-varying transfer function, respectively. For underspread LTV systems (i.e., LTV systems that produce only moderate T F displacements), the GWS can be interpreted as a T F transfer function describing the T F weighting produced by the system (see Article 4.7). Hence, a conceptually simple T F design of an LTV filter H from a prescribed T F weight function M(t, f) is based on setting the filter's GWS equal to M(t, f) [1,2, 5], L H(~) G W F (t, f) - M(t, f)
The impulse response of the L T V inverse of (i I.1.2), i.e.,
hGwr(t,t')--
oo
M
filter H G W F
(-~+a t+(-~-a
(11.1.3)
thus defined is obtained via the
f e'2~/
df.
(11.1.4)
The filter H G W F in (11.1.4) is termed generalized Weyl filter [1]; it depends on the choice of the GWS parameter a used in (11.1.3). In particular, the choices a = 0 and a = 1/2 lead to the Weyl filter and Zadeh filter, respectively. For a = 0, a real-valued weight function M(t, f) will result in a self-adjoint [13] Weyl filter. Dependence on a. The dependence of HCWF on a effectively disappears in the case of a smooth T F weight function M(t, f) (yielding an underspread LTV system HGWF as discussed in Article 4.7; also note that smoothness of M(t, f) is incompatible with a sharp T F cutoff) 9 Let ~(1) - ' G W F and ~(2) - - G W F be two generalized Weyl filters designed according to (11.1.4) with GWS parameter a l and a2, respectively. Then, one can show that the difference (HawFX)(t) (1) - [~(2) k ~ G W F x)(t) of the output signals of these filters satisfies
I-IGwFX- "GWFXII2 Ilxll
< s
z~
[c~1-a2[ OrOf
2
-
where I1" ]]2 denotes the L2 norm. Thus, it is seen that the generalized Weyl filter design is almost independent of a if cl is small, i.e., if M(t, f) is a smooth function. TF projection filter.
Formally, (11.1.3) can be viewed as the solution to the (un-
constrained) minimization problem HGWF -- argminH I[M - L H (a) [[2" Solving this minimization problem under the side constraint that H be an orthogonal projection operator 1 yields the TF projection filter Hp introduced in [4]. More specifically, let uk(t) and Ak denote the eigenfunctions and eigenvalues, respectively, of 1An orthogonal projection operator is characterized by being self-adjoint (H + = H) and idempotent ( H H : H)[13].
Chapter 11:
458
Time-FrequencySynthesis and Filtering
+ + (HowE + HGWF)//2, where HGW F denotes the adjoint of HGWF [13]. Then, the impulse response of Hp can be shown [4] to equal
h (t, t') =
u (t) kE~
where I is the set of indices k for which/~k > 1/2. The TF projection filter is only able to pass or suppress signal components, with no other weights possible. It is advantageous in some situations since it is capable of realizing very sharp TF cutoff characteristics. However, compared to HGWF, the calculation of Hp requires the additional solution of an eigenproblem. An efficient online implementation of the TF projection filter is proposed in [14].
11.1.3
Implicit Design I ~ T h e STFT Filter
An S T F T filter consists of the following three steps [1,2, 6, 7, 9,10]:
9 Analysis: Calculation of the short-time Fourier transform (STFT) [9, 10] of the input signal x(t),
F2 (t, f) =
/?
x(t') "/t*/(t') dt' ,
oo
where O/t,/(t') - " / ( t ' - t ) e j2~ft' with ~,(t) being an analysis window (see Section 2.3.1).
9 Weighting: Multiplication of the STFT by the TF weight function M(t, f), i.e., calculation of M(t, f) F2(t, f). 9 Synthesis: The output signal y(t) is obtained via an inverse STFT [9,10], y(t) =
[M(t', f') F:(t', f')] gt,,/,(t) dt'df' . (x)
oo
Here, gt,/(t') = g ( t ' - t ) e j2~/t' where g(t) is a synthesis window that is usually assumed to satisfy f _ ~ g(t)V* (t)dt = 1 (this guarantees perfect reconstruction for M(t, f ) - 1). These steps implement an LTV filter~hereafter denoted H ~ , g ~ t h a t depends on the TF weight function M(t, f) and the windows v(t) and g(t). Multiwindow STFT filter. dow S T F T filter [2, 8] N
HN A
An extension of the STFT filter H~,g is the multiwin-
Er/iH~(i),g(i), i--1
N
with E T ] i -
1, ?~iC]l~.
i--1
This is a linear combination of N STFT filters H~(~),g(,) with the same TF weight function M(t, f) but different analysis windows 7(i)(t) and different synthesis windows g(i)(t). Note that the STFT filter H.y,g is a special case with N = 1. Using a larger number N of STFT filters yields increased flexibility of design at the expense
Linear Time-Frequency Filters
469
Fig. 11.1.1" lllustration of the case where the effective support of S(C~)(v, ~-) covers the effective support of rn(~, "r).
of increased computational complexity. The impulse response of the multiwindow STFT filter H N can be calculated as
M(t', f') p ( t - t ' , [ - t ' ) ej2~I' (t-~) dt' dr',
hg(t, D oo
with p(t,t') - ~ - 1
(x)
r]~g(i)(t)7(i)*(t'). Furthermore, the GWS of HN is obtained as L (~) (t, f) - M(t, f) ** L(~)(t f) HN
~
~
(11 " 1.5)
where ** denotes two-dimensional convolution and P is the LTV system with impulse response p(t, t').
Comparison with generalized Weyl filter. Comparing(11.1.5) with (11.1.3), we see that the multiwindow STFT filter HN using TF weight function M(t,f) is equivalent to a generalized Weyl filter using the modified TF weight function .~(t, f) - M(t, f) 9 9L(p~) (t, f). For nonnegative coefficients r/i, M(t, f) will be a smoothed version of M(t, f). However, for qi = ( - 1 ) i and N --~ oc, it is possible to have L HN (~) (t, f) --~ M(t ' f) and thus HN -+ HGWF ~ i.e. the multiwindow STFT filter approaches the generalized Weyl filter using the TF weight function M(t, f). It can be shown that the difference (Hyx)(t) - (HGwFx)(t) of the output signals of HN and HGWF (both based on the same TF weight function M(t, f)) satisfies I]HNx -- HGWFXl]2 < c2 A xll
]rn(u, ,)]211 -- S (a) (u, 7)]2 du d7
.
(11.1.6)
Here, m(~, 7) - f _ ~ f _ ~ M(t, f) e -j2~('t-~f) dt df and S (a) (u, 7) is the generalized spreading function of P (see Article 4.7) The constant ~2 is related to the operator P that characterizes the effect of the windows 7 (~)(t), g(i)(t), in particular, c2 will be small if the effective support of S (~) (1,,,7) covers the effective support of m(u, 7), so
that [1 S(pa)(.,"r)l 2 ~ 0 on the support of re(u, r) (see Fig. Ii.i.I). This is favored by a smooth TF weight function M(t, f). Here, re(u, T) is well concentrated about the origin and thus its effective support can easily be covered by S(pa)(u, T), even using a small N. Hence, for a smooth TF weight function M(t, f), the generalized Weyl filter can easily be approximated by the (multiwindow) STFT filter. This will be verified experimentally in Section 11.1.6. -
Chapter 11:
470
11.1.4
Time-FrequencySynthesis and Filtering
Implicit Design II---The Gabor Filter
The Gabor transform is the STFT evaluated on a TF lattice (nT, kF) with n, k E Z [12]. A Gabor filter (see [2,11] and Article 11.2) consists of the following steps:
9 Analysis: Calculation of the Gabor coefficients of the input signal x(t) [12],
/?
Cn,k =
x(t) "Yn,k* (t) dt ,
O0
where 7n,k(t) = 7(t -- nT) ej27rkgt with 7(t) being a suitable analysis window.
9 Weighting: Multiplication of the Gabor coefficients by the weights Mn,k = M(nT, kF), i.e., calculation of Mn,k Cn,k. 9 Synthesis: The output signal y(t) is obtained via Gabor synthesis [12], OO
y(t) -
(X)
E
E
n -- -- cx~ k =
-
Mn,k Cn,k gn,k(t) , cx:)
where gn,k(t) = g ( t - nT) ej2~kFt with g(t) being a suitable synthesis window. This scheme implements an LTV filter that will be denoted HT,g. The windows 7(t) and g(t) are usually assumed to satisfy the perfect-reconstruction (biorthogonality) condition f _ ~ g(t) 7" ( t - ~) e -j2rkt/T dt = 5n 5k, which presupposes critical sampling (TF = 1) or oversampling (TF < 1). Multiwindow Gabor filter. dow Gabor filter [2]
An extension of the Gabor filter HT,g is the multiwin-
N
HN ~
N
?']i H~(~) ,g(,),
with
i--1
r/i = 1, r/i E R, i=1
i.e., a linear combination of Gabor filters H7(~),9(~) with the same TF weights Mn,k but different analysis windows 7 (0 (t) and different synthesis windows g(O(t). The Gabor filter HT,g is reobtained with N - 1. Using a larger number N of Gabor filters allows to reduce the TF sampling density TF (cf. [15]). The impulse response A~ of the multiwindow Gabor filter HN is given by OO
CO
hN(t,t') = E n -
-- C~
E k--
-
Mn,k p(t-nT, t ' - n T ) e j2~kF(t-t'), c x :)
with p(t, t') - ~]i=l N ~i g (i) (t)7(0*(t'), and the GWS of HN is OO
L(-H~) (t, f) = E N n--
- - cx:) k - -
OO
E -
Mn,k L(p")(t-nT, f - k F ) (:x)
where P is the LTV system with impulse response p(t, t~).
Linear Time-Frequency Filters
471
Fig. 11.1.2" Illustration of windowing and aliasing effects involved in the Gabor filter design. The dark gray ellipse indicates the effective support of re(u, T). The light gray ellipses indicate the effective
support of m ( ~ - ~, T - ~-) for (n, k) # (0, 0).
Comparison with generalized Weyl filter. We next analyze how close the multiwindow Gabor filter HN (using TF weights Mn,k -- M(nT, kF)) is to the generalized Weyl filter HCWF (using TF weight function M(t, f)). One can show that
Ilngx- I-I~
_< ~2 + ~a,
Ilxll2 where ~2 was given in (11.1.6) and c3 is defined as
[FF oo
oo
(~
1/2
2
d~' dT"]
nr k#O
As in the case of the STFT filter, the term c2 is related to the operator P that describes the effect of the windows 7(i)(t), g(i)(t). If M(t, f) is smooth so that m(u, 7) is well concentrated about the origin, a suitable choice of P allows to cover the effective support of re(u, 7) by the effective support of S (~) (u, 7), which results in a small value of c2 (cf. our discussion in Section 11.1.3). The additional term e3 is mainly due to potential aliasing errors which are caused by the sampling Mn,k = M(nT, kF) that distinguishes the Gabor filter from the STFT filter. For ~3 to be small, it is necessary that the effective support of S(~)(u, T) does not overlap with the periodic repetitions m ( u - ~, T - ~) of re(u, 7-) (see Fig. 11.1.2). This can be achieved if (i) re(u, T) is well concentrated about the origin and thus m ( u - ~, T - k) is well localized about (~, ~) and (ii) T and F are small enough to ensure that the periodic repetitions m ( u - ~, 7 - k) are sufficiently separated. For m(u, T) well concentrated, the latter condition can be met even for TF > 1. Hence, we conclude that for a smooth M(t, f), the generalized Weyl filter can be accurately approximated by the (multiwindow) Gabor filter.
11.1.5
The Discrete-Time Case
While our discussion of TF filters has so far been placed in a continuous-time framework, practical implementation of these filters calls for a discrete-time formulation.
Chapter 11"
472
Time-FrequencySynthesis and Filtering
The input-output relation of a discrete-time LTV system H reads (x)
y[n] - (Hx)[n] -
h[n, n'] x[n'] ,
E n
t --
--
(:x)
with h[n, n ~] being the impulse response of H. The GWS with arbitrary c~ is not easily reformulated in a discrete-time setting. However, for c~ - 0 (Weyl symbol) and c~ = 1//2 (Zadeh's time-varying transfer function), which are the main cases of practical interest, discrete-time formulations are given by cx:)
m-----
oo
(2<3
(n, 0) = / } ' t - - - - (:X3
Here, 0 denotes normalized frequency. We note that in order for L ~ ) (n, 9) to be in one-to-one correspondence to h[n, n~], H has to be a hal]band system, i.e., an LTV system that accepts input signal components only within a specified halfband (e.g., 0 c [-1/4, 1/4)) and maps them to a halfband output signal [2]. The TF system representations L(~ ) (n, 0) and L(~/2) (n, O) can be used to design discrete-time LTV filters from a TF weight function M(n, O) via an explicit filter design (cf. Section 11.1.2)[2]. The discrete-time Zadeh filter (c~ = 1 / 2 ) i s defined by setting L H(1/2) (n 0) - M(n, 0); its impulse response is obtained as
h[n, n']
-
l/2 M(n, O) e j2~rO(n-n') dO. y - 1/2
f
In a similar manner, the discrete-time Weyl filter (a = 0) is obtained as
bin+m, n - m ] -
l/4
fiJ-l~4 M(n, O)e j27r~ dO.
(11.1.7)
Since L ~ ) (n, 0) is meaningful only for halfband systems, M(n, 9) here is specified on the halfband [-1/4, 1/4). According to (11.1.7), the impulse response h[nl, n:] of the discrete-time Weyl filter is obtained only for nl + n: even (since nl + n2 = n+m+n-m = 2n). If we assume H to be a halfband system, h[nl,n2] is completely specified by these samples. The missing samples (for nl H-n2 odd) could be obtained by interpolation; however, this is not necessary since the entire filtering can be performed using only the even-indexed samples [2]. We note that in some cases (especially for "chirpy" TF weight functions), the Weyl filter design results in better filtering performance than the Zadeh filter design [2]. Discrete-time versions of the implicit filter design methods from Sections 11.1.3 and 11.1.4 can be obtained in a straightforward manner; see [2,6,9,10] for discretetime STFT filters and [2,11] for discrete-time Gabor filters.
Linear Time-Frequency Filters
473
Fig. i i . I . 3 : Comparison of explicit and implicit TF filter designs: (a) Specified TF weight function M(t, f), (b)-(d) spectrogram (top) and real part (bottom) of (b)input signal x(t), (c) output signal y(t) obtained with Zadeh filter, and (d) output signal y(t) obtained with STFT filter. The dashed lines in the spectrograms indicate the TF pass region. The time duration is 2048 samples; the (normalized) frequency interval shown is [ - 1/2, 1/2).
11.1.6
Simulation Results
Our first simulation example, shown in Fig. 11.1.3, compares the performance of the Zadeh filter HCWF (generalized Weyl filter with a = 1/2) and the STFT filter H~,g. The TF weight function M(t, f) (see Fig. 11.1.3(a)) models a bandpass filter with sinusoidally time-varying center frequency and time-varying gain. The gain is 1 in the first (earlier) half and 1/2 in the second (later) half, with a roll-off in between. The two filters were applied to an input signal x(t) consisting of two chirps (see Fig. 11.1.3(b)). The resulting output signals (shown in Figs. 11.1.3(c),(d)) are seen to conform to the specified TF weighting. Furthermore, they are effectively identical (we obtained IIH~/,gX- HGWFX]]2/IIHGwFXll 2 -- 0.047), which is due to the smoothness of M(t, f) and confirms our approximation in (11.1.6). The application of a multiwindow Gabor filter HN to speech enhancement (denoising) is considered in Fig. 11.1.4. The speech signal s(t) and its noisy version x(t) = s(t)+ n(t) (where n(t) is white noise with an SNR of 0dB) are shown in Fig. ll.l.4(a),(b). The multiwindow Gabor filter has N = 5 branches and lattice parameters T = 5.8ms, F = 172.25Hz. The analysis/synthesis windows 7(i)(t) - g(i)(t) and the branch weights ~i were chosen as discussed in [2, Section 4.6.5]. The weights Mn,k were computed from the multiwindow Gabor coefficients c(~) ~,k - f -~~ x(t) 7(i,), (t) dt in a signal-adaptive online manner that does not require knowledge about the clean speech signal or its statistics [2, Section 4.7.3]. The filter output is shown in Fig. 11.1.4(c); the SNR improvement is 4.92dB.
4"/4
Chapter 11:
Time-FrequencySynthesis and Filtering
Fig. 11.1.4: Speech enhancement using a multiwindow Gabor filter HN. The figure shows the smoothed pseudo-Wigner distribution (top) and the time-domain signal (bottom) of (a) the clean speech, (b) the noisy speech (input of HN), and (c) the enhanced speech (output of HN). The time duration is 4096 samples; the (normalized) frequency interval shown is [0, 1/2).
11.1.7
Summaryand
Conclusions
We have discussed "explicit" and "implicit" time-frequency (TF) designs of linear, time-varying filters. These design methods are useful for filtering nonstationary signals if the filter characteristic can be specified in the TF domain via a TF weight function. All filters discussed (except the TF projection filter) tend to perform similarly if the TF weight function is sufficiently smooth. In the opposite case, however, different designs may result in filters that perform very differently [2]. We finally note that the application of TF filtering to nonstationary signal estimation and detection is considered in Article 12.4. Other TF approaches to timevarying filtering are described in Articles 11.2-11.4.
References [1] W. Kozek and F. Hlawatsch, "A comparative study of linear and nonlinear timefrequency filters," in Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ Time-Scale Analysis, pp. 163-166, Victoria, BC, 4-6 October 1992. [2] G. Matz and F. Hlawatsch, "Linear time-frequency filters: On-line algorithms and applications," in Applications in Time-Frequency Signal Processing (A. PapandreouSuppappola, ed.), ch. 6, pp. 205-271, Boca Raton, FL" CRC Press, 2002.
Linear Time-Frequency Filters
475
[3] F. Hlawatsch and G. Matz, "Quadratic time-frequency analysis of linear time-varying systems," in Wavelet Transforms and Time-Frequency Signal Analysis (L. Debnath, ed.), ch. 9, pp. 235-287, Boston: Birkh/~user, 2001. [4] F. Hlawatsch, Time-Frequency Analysis and Synthesis of Linear Signal Spaces: TimeFrequency Filters, Signal Detection and Estimation, and Range-Doppler Estimation. Boston: Kluwer, 1998. [5] W. Kozek, "Time-frequency signal processing based on the Wigner-Weyl framework," Signal Processing, vol. 29, pp. 77-92, October 1992. [6] R. Bourdier, J. F. Allard, and K. Trumpf, "Effective frequency response and signal replica generation for filtering algorithms using multiplicative modifications of the STFT," Signal Processing, vol. 15, pp. 193-201, September 1988. [7] I. Daubechies, "Time-frequency localization operators: A geometric phase space approach," IEEE Trans. Information Theory, vol. 34, pp. 605-612, July 1988. [8] W. Kozek, H. G. Feichtinger, and J. Scharinger, "Matched multiwindow methods for the estimation and filtering of nonstationary processes," in Proc. IEEE Internat. Symp. on Circuits and Systems (ISCAS 96), vol. 2, pp. 509-512, Atlanta, GA, 12-15 May 1996. [9] S. H. Nawab and T. F. Quatieri, "Short-time Fourier transform," in Advanced Topics in Signal Processing (J. S. Lim and A. V. Oppenheim, eds.), ch. 6, pp. 289-337, Englewood Cliffs, N J: Prentice-Hall, 1988. [10] M. R. Portnoff, "Time-frequency representation of digital signals and systems based on short-time Fourier analysis," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 28, pp. 55-69, February 1980. [11] S. Farkash and S. Raz, "Linear systems in Gabor time-frequency space," IEEE Trans. Signal Processing, vol. 42, pp. 611-617, March 1994. [12] H. G. Feichtinger and T. Strohmer, eds., Gabor Analysis and Algorithms: Theory and Applications. Berlin/Boston: Birkh/~user, 1998. [13] A. W. Naylor and C. R. Sell, Linear Operator Theory in Engineering and Science. New York: Springer, 2nd ed., 1982. [14] G. Matz and F. Hlawatsch, "Time-frequency projection filters: Online implementation, subspace tracking, and application to interference excision," in Proc. IEEE Interact. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), pp. 1213-1216, Orlando, FL, 13-17 May 2002. [15] Y. Y. Zeevi, M. Zibulski, and M. Porat, "Multi-window Gabor schemes in signal and image representations," in Gabor Analysis and Algorithms: Theory and Applications (H. G. Feichtinger and T. Strohmer, eds.), ch. 12, pp. 381-407, Berlin/Boston: Birkh~user, 1998.
Chapter 11:
476
11.2 11.2.1
TIME-VARYING
Time-FrequencySynthesis and Filtering
FILTER V I A G A B O R E X P A N S I O N o
Filtering a Six-Cylinder Engine Sound
One of the most important applications of the Gabor expansion is in time-varying filtering. It is well understood that the Fourier transform can be employed to effectively study a signal's frequency behavior as long as the signal's frequency content does not evolve during the period of observation. However, this is not the case when the signal's frequency changes rapidly. Fig. 11.2.1 illustrates a sound waveform recorded during the run-up of a sixcylinder engine. Intuitively, the sound will be created not only by the engine rotation, but also by the other parts that vibrate due to the engine rotation. The sound waveform plotted in the bottom is indeed a combination of all kinds of vibrations caused by engine rotation. Moreover, the vibration frequencies are multiples of the fundamental frequencyNthe engine rotation speed. When the engine speed is constant, the classical Fourier transform can well isolate the vibrations. When the engine runs up/down, the fundamental frequency and its multiples increase/decrease with time. The corresponding frequency bandwidths will become wide and overlap each other, as shown in the left plot. Consequently, the Fourier transform based power spectrum will no longer be able to distinguish different individual vibrations. In the automobile industry and some other industries, such time-varying harmonics are named as orders to distinguish them from the time-invariant harmonics. To evaluate a vibration, engineers use adaptive filters, such as Kalman filter [1], to extract the time waveform corresponding to a particular order. Based on the time waveform, engineers can then further compute other information, such as phase and amplitude. Recently it has been discovered that the order tracking process can also be effectively achieved by the Gabor expansion (see Section 2.3.3 on the connection between the Gabor transform and the STFT). The middle plot of Fig. 11.2.1 shows the magnitudes of the Gabor coefficients for the sound signal. While neither the time waveform nor the Fourier transform based power spectrum gives us an idea about the structure of the engine sound, orders have distinct signatures in the joint time-frequency plot. Taking this advantage, we can select the desired Gabor coefficients and perform the Gabor expansion to obtain the corresponding time waveform. Such processing can be thought of as time-varying filtering.
11.2.2
Discrete Gabor Expansion
Before investigating the nature of the Gabor expansion based time-varying filter, first we shall briefly review the discrete Gabor expansion. For a discrete time signal s[k], its Gabor expansion is defined as [2] N-1
s[k] - E m
E
Cm,n h[k - roT] ej2~nk/g
(11.2.1)
n=O
~ S h i e Qian, National Instruments Corp., Austin, TX 78759, USA (shie.qian@ni.com). Reviewers: X.-G. Xia and F. Munk.
Time-Varying Filter via Gabor Expansion
477
Fig. 11.2.1: Gabor expansion based Time-varying Filter for Order Tracking. The bottom plot illustrates the original engine run-up sound (light color) vs. the extracted sixth-order (dark color) time waveforms. The dark line in the Gabor coefficients plot marks the Gabor coefficients corresponding to the sixth order.
where the Gabor coefficients, am,n, Fourier transform), i.e.,
are
computed by the regular S T F T (short-time
c.~,n - ~ s[k] 7*[k - roT] e -j2~nk/y.
(11.2.2)
m
The parameters T and N denote the time sampling interval (or decimation) and the number of frequency bins (or bands). The ratio of N and T determines the sampling rate. When this ratio is equal to one, we have critical sampling; in this case there is no redundancy for the resulting Gabor coefficients. When the sampling rate is greater than one, we have oversampling. For stable reconstruction, the ratio of N and T must be greater than or equal to one. Note that the window functions h[k] and q,[k] are exchangeable. In other words, either of them can be used as the analysis or the synthesis function. They satisfy the so-called dual relation. The central issue of the Gabor expansion is how to compute the dual function for a given function (either h[k] or q,[k]). Over the years, many schemes of computing the dual function have been proposed. Each method has its pros and cons. A special feature of the approach presented here is that the resulting dual window function always has the same length as the given function. Without loss of generality, let us assume that the given window function is h[k] with L points. Moreover, h[k] has unit energy. Then, the corresponding dual
Chapter 11"
478
Time-FrequencySynthesis and Filtering
function v[k] can be solved by T independent linear equations, i.e., At~l = g l ,
/ = 0,1,...,T- 1
(11.2.3)
where the elements of the matrix Ak, and vectors VL and gl are defined as
az [q, p] - 5[l + pT + qN] 71[P] - V[/+ pT] ul[q] = (N-1 O, ,O,...)T where 0 <_ p < LIT. The periodic auxiliary function 5[k] is defined as
5[k + i ( 2 L - N ) ] -
h[k]
0 _< k < L,
0
L<_k<2L-N,
2L 0_
(11.2.4)
If the window length is equal to the signal length, the periodic auxiliary function 5[k] is simply L
5[k + iL] = h[k] ,
O <_ q < -~.
Note that the solution of (11.2.3) is not unique for oversampling. To ensure that the analysis and synthesis window functions are both concentrated in the joint time-frequency domain, we require that the dual function is optimally similar, in the sense of LMSE (least mean square error), to the given window function, i.e.,
F= A'r--u minll When the error is small, i.e. when ~ ~ h, eq. (11.2.2) becomes
Cm,n - ~
s[k] h*[k - roT] e -j2~nk/g.
(11.2.5)
m
Then (11.2.5) and (11.2.1) form an orthogonal-like Gabor transform pair. In this case, the Gabor coefficients Cm,n are exactly the signal's projection on the synthesis window function h[k]. Since at critical sampling the pair of dual functions cannot be simultaneously concentrated in both the time and frequency domains, usually we always employ the oversampling scheme. Consequently, the resulting Gabor transformation is redundant. In this case, the Gabor coefficients will be the sub-space of two-dimensional functions. In other words, for an arbitrary two-dimensional function, there may be no corresponding time waveform. For example, assume that we have a modified two-dimensional function Crn,n - - C m , n C m , n
where Cm,n denotes a binary mask function, equal to either zero or one. Apply the Gabor expansion (11.2.1) to obtain N-1
~[k] - ~ m
~ n--O
Cm,n h[k - mT] ej2~'~k/g.
(11.2.6)
479
-lime-Varying Filter via Gabor Expansion
Then we find that S[k] 7* [k - ?T~T] e -j27rnk/N ~ Cm,n.
m
The Gabor coefficients of the reconstructed time waveform g[k] will not be inside the masked area determined by the mask function Cm,n 9
11.2.3
Time-Varying Filtering
To overcome this problem, we will introduce an iteration approach as follows. That is, for a set of two-dimensional Gabor coefficients, first determine a binary mask matrix. Apply the mask to the two-dimensional Gabor coefficients to reserve desirable and remove unwanted coefficients. Then, compute the Gabor expansion. Once the time waveform is obtained, compute the new Gabor coefficients. Repeat this process until the time waveforms converge. For the sake of an efficient representation, let us rewrite the pair of the Gabor expansion (11.2.1) and (11.2.2) in matrix form, i.e., C-G~ F - H TG~
where H and G denote the analysis and synthesis matrices, respectively. Note that for oversampling H T G = I,
G H T 7~ I.
Then, the iterative process can be described as
~ ~1
-
r
_
HT~I
C2 -
G~ 1 - G H T ~ C
~2
OC 2
_
-- H T 5 2 C3 .
.
Gs-e = G H T O G H T O C -
(GHTO)2 C
.
Ck -
(GHTO)k-1C
It can be shown [3] that if and only if ~N- - 1
E
L N
?*Jig + k]h[ig + k + roT]-
i=0
E
1
h*[iN + k]v[iN + k + mT]
(11.2.7)
i=0
for 0 _< k < N and 0 _< m < M, where M is equal to the number time sampling points and L denotes the window length, then 1. C k and ~ converge.
Chapter 11:
480
Time-FrequencySynthesis and Filtering
2. C k = O C k , k ~ oc. That is, the support of C k in the time-frequency domain is inside the masked area.
Two trivial cases tbr (11.2.7) are 1. the critical sampling, N = T. Note that in this case the analysis and synthesis windows cannot both be localized in the joint time-frequency domain. 2. 7[k] = h[k]. In this case the Gabor coefficients C 2, after the first iteration, are closest in the LMSE sense to the masked Gabor coefficients O C (that is, desirable Gabor coefficients). The second case usually implies heavy oversampling (or huge redundancy) and is therefore impractical due to the computation speed and memory consumption. Usually, we pursue the orthogonal-like representation introduced early. It has been found that for those commonly used window functions, such as Gaussian and Hanning windows, the difference between the analysis and synthesis windows would be negligible when the oversampling rate is four.
11.2.4
Numerical Simulation
Fig. 11.2.1 illustrates the application of the Gabor expansion based time-varying filter to order tracking. The dark waveform in the bottom plot depicts the sixth order extracted by the Gabor expansion based time-varying filter. In this example, h[k] is the Hanning window. The oversampling rate was selected high enough (four) so that the dual function 7[k] has a form that is almost identical to h[k]. It has been found that after a few iterations, the difference (LMSE) between ~ and ~ - 1 reduces to 10 4. The resulting time waveform is found to be almost identical to that computed by other methods.
11.2.5
Summary and Conclusions
In this article, the basic concept and a real-world application of the Gabor expansion based time-varying filter are introduced. Compared with other time-varying filter schemes, the method presented here is much simpler and thereby has great potential in wide range of applications.
References [1] H. Void and J. Leuridan, "High resolution order tracking at extreme slew rates using Kalman tracking filters," in Proc. Noise ~A Vibration Conf. ~A Exposition, Soc. of Automotive Engineers, Traverse City, MI, 17-20 May 1993. SAE paper no. 931288. [2] S. Qian and D. Chen, Joint Time-Frequency Analysis: Methods ~4 Applications. Upper Saddle River, N J: Prentice-Hall, 1996. [3] X.-G. Xia and S. Qian, "Convergence of an iterative time-variant filtering based on discrete Gabor transform," IEEE Trans. Signal Processing, vol. 47, pp. 2894-2899, October 1999.
Time-Frequency Filtering of Speech Signals in Hands-Free Telephone Systems
11.3
481
TIME-FREQUENCY FILTERING OF SPEECH SIGNALS IN HANDS-FREE TELEPHONE SYSTEMS 0
Time-varying filtering of noisy speech signals is a very attractive challenge, with the main question: what does the most appropriate time-varying filter scheme look like? Speech signals are of highly nonstationary and multicomponent nature. If we deal with filtering of noisy speech signals, as they occur in hands-free telephone systerns, then the desired scheme would provide a signal-to-noise ratio (SNR) greater than approximately 12dB. At the same time it should be suited for real-time implementation, with time delay less than 39ms for mobile telephony, and 2ms for circuit-switched telephony. The second requirement can cause additional difficulties and restrictions on finding an appropriate time-varying filter procedure. The most commonly used approach in the filtering of speech signals is the so called quasi-stationary approach, where it is assumed that the signal is stationary in the time interval T, with T between 20ms and 40ms being often used [1, 2]. In this interval of time, classical speech enhancement schemes such as those given in Table 11.3.1 are used [3,4]. The noise is reduced by applying frequency-dependent suppression factors according to the various filtering rules given in Table 11.3.1. From the aspect of time-varying filtering we can say that this technique is quasi time-varying filtering. Thus, we have a sliding window of duration T along the signal where the filtering is performed after every T or after every T/2 (the second case is used in an overlap-add scheme in order to avoid block effects). From the point of view of time-frequency analysis, having in mind the high nonstationarity of speech signals, we can conclude that the quasi-stationary approach of filtering is approximate in nature, and that it will more or less satisfy subjective perception requirements. If we want to achieve more accurate and more objective filtering of speech signals, time-varying filtering n e e d s t o be applied. Since a unique definition of time-frequency spectra does not exist (as is shown by the variety of T F D s derived in Chapter 2), several approaches to time-varying filtering have been proposed. We will use the one based on the Wigner distribution. It uses the Wigner spectrum, where the statistically independent cross-terms in Wigner distribution are averaged out. However, in order to calculate the Wigner spectrum it is necessary to have many different realizations of the same random process at a given instant. Obviously, in the case of real-time applications, the processing has to be based on a single noisy speech realization. It is the reason for using an approximation, in the sense that the Wigner spectrum is replaced by a cross-terms free (reduced) time-frequency distribution. According to the additional criterion of realization simplicity, special attention will be devoted to the filtering based on the spectrogram and distributions whose realization is directly related to the spectrogram. The use of other reduced interference time-frequency distributions in the ~ Srdjan Stankovid, Elektrotehnicki fakultet, University of Montenegro, 81000 Podgorica, Montenegro (srdjan@cg.ac.yu). Reviewers: J. Tilp and LJ. Stankovi6.
Chapter 11"
482
Time-Frequency Synthesis and Filtering
Table 11.S.I:
Filter transfer function for different algorithms, where S2x(w) and S2e(w) are the power spectra of the noisy signal and noise respectively, and A is an overestimation factor.
Algorithm Filter tran. fun.
Spectral subtraction i// ~sL (~) 1 s~(w)
Wiener s~(~) s~(~)
Maximum likelihood 111 + V/1
s~(,~) s~(~)]
Magnitude subtraction 1
,/S~r (~) y s~(~)
filtering, in place of the spectrogram, is straightforward.
11.3.1
Time-Variant Filtering of Speech Signals
By analogy with the filtering of stationary signals, nonstationary time-varying filtering of a noisy signal can be defined by [5-7]"
(Hx)(t) -- ;
7 t - -~ T)x(t + T)dT. h(t + -~,
(11.3.1)
(X)
The signal z(t) =
+ ,(t)
is a noisy one with the desired signal s(t) and the noise e(t). Impulse response of the time-varying filter is h(t + 7, t - 7)" The optimal transfer function 7"
LH(t, f) --
F
7.
7"
T
h(t + -~,t- -~)x(t + T)e--J2~YT.dT O0
is defined by the relation [8-11]" u
W~(t, /) = LH(t, / ) W ~ ( t , f),
(11.3.2)
where
Wxx(t, f) - E{Wxx(t, f)} -
E{x(t + -~)x*(t -
)}e-J2~f~dT
(ii.3.3)
O0
is the mean value of the Wigner distribution Wxz(t, f) of signal x(t) (i.e. the Wigner spectrum of signal z(t) [12]). We can conclude that (11.3.2) is of the same form as the Wiener filter for the stationary case. If the signal and noise are not correlated, we have" m
Wss(t,f) LH(t, f) = Wss(t, f) + W,,(t, f)"
(11.3.4)
Time-Frequency Filtering of Speech Signals in Hands-Free Telephone Systems
483
Consider now relation (11.3.4). Obviously, the mean value E { W s s ( t , f ) } Ws~(t, f) will eliminate uncorrelated cross-terms in the Wigner distribution, since 7- * ( t - - 7" E{si(t + ~)sj ~)}-0
for i T~ j,
as long as components si(t) and sj(t) are not correlated [12]. However, if we have to perform filtering on the base of a single realization, the Wigner distribution should be used instead of the Wigner spectrum in (11.3.4). For filtering of multicomponent signals equation (11.3.4) is useless because emphatic cross-terms will appear. The problem of cross-terms will be partially overcome if we modify the definition (11.3.4) so that we apply some of the cross-terms reduced distributions p(t, f) instead of the Wigner distribution. In this case we have:
pss(t,f) LH(t, f) -- pss(t, f) + p~(t, f)"
(11.3.5)
It is clear that definition (11.3.5)is an approximation of (11.3.4) with p(t,f) approximating the Wigner spectrum. In order to obtain a more efficient filter, for numerical implementation, the previous definitions can be slightly modified by using their pseudo form:
(Hx)(t) -
f_
~ h(t + -~, r t - -~ r )W(T)x(t + r)dr. O0
Here, a lag window w(r) is introduced. It can be shown that, for frequency modulated signals, w(r) does not influence the output signal (Hx)(t) if w(0) - 1 [6]. By using Parseval's theorem, (11.3.6) can be written in the form:
F /?
(Hx)(t) -
LH(t, f)Fx(t, f)df
(11.3.7)
O0
where Fx (t, I ) -
+
CO
is the short-time Fourier transform of the signal x(t). The choice of p(t,f), in (11.3.5) will play a crucial role in the time-varying filter scheme. Obviously, for an efficient time-varying filtering, it is desired that the chosen p(t, f) satisfies three main conditions: (1) satisfactory noise reduction, (2) appropriateness for real-time realization, (3) its auto-terms are close to those in the Wigner spectrum. The simplest and most commonly used p(t, f), for which the real-time application is very well studied, is the spectrogram (see Section 2.3.1), which is the squared modulus of the short-time Fourier transform:
S=(t, f) - IF~(t, f)l 2 -
z(t + CO
484
Chapter 11:
Time-Frequency Synthesis and Filtering
Fig. 11.3.1: Spectrogram of: (a) the clean speech signal, (b) the noisy signal filtered by a high-pass filter.
The main problem of using the short-time Fourier transform (and the spectrogram) is in determination of the window width w(t). A narrow window produces better time resolution, while a wider window gives better frequency resolution. The window should be chosen by a compromise of these two opposite requirements. Having in mind that a speech signal is approximately stationary within the interval T between 20ms and 40ms, for a sampling rate of fs = 8kHz, we conclude that we can use a lag window width of N = 256 samples, corresponding to T = 32ms. In order to achieve a more accurate calculation of integral (11.3.7), zero padding up to 1024 samples will be used. Consider now the spectrogram-based filtering of noisy speech signals, recorded in a car cruising along the highway. Estimations of the spectrogram of noise are performed in only one time instant during a speech pause. This assumption is made in order to have the worst filtering situation as in a real case. Since the noisy signal contains significant noise components in the low frequency range (below 98Hz) where, in our application, no speech components exist, the signal is prefiltered by using a high-pass filter with cut-off frequency 98 Hz. In this realization we will apply the time-varying Wiener filter definition (11.3.5), with the spectrogram instead of p(t, f), and the time-varying version of the spectral subtraction definition: LHw(t, f) -- 1 -
LHSS(t, f) = i l
S~(t,f)f)" Sx(t,
(11.3.8)
S~(t,f) Sx(t,f)
(11.3.9)
-- A ~
In (11.3.9) A is an overestimation factor applied in order to give some correction of the errors caused by the assumption that the noise is stationary in the interval between two pause estimations. The value A = 4 is used. Modifications of the equations (11.3.8) & (11.3.9) are used after introducing a spectral floor [2]: Luw(t, f) = max {LHw(t, f),
#}
(11.3.10)
Time-Frequency Filtering of Speech Signals in Hands-Free Telephone Systems
485
and LHSS(t, f) = max {LHss(t, f),/3}.
(11.3.11)
In our examples the spectral floors are set to 3 = 0.12 and/3 = 0.08 (in (11.3.10), and (11.3.11) respectively). Note that by increasing A, better noise reducing is obtained, but the distortion of signal becomes significant. By increasing fl more noise remains in signal, but speech distortion is audible. Thus, these two factors are chosen by compromise. The time-frequency representations of a clean signal and a noisy signal are shown in Fig. 11.3.1(a) and (b), respectively. Time-frequency representation of denoised signal, filtered by using the time-varying Wiener filtering, and the time-varying spectral subtraction filtering, based on the spectrogram, are shown in Fig. 11.3.2(a) and (b). It is obvious that the noise suppression is better when the time-varying spectral subtraction filter definition is used, because overestimation factor A provides better estimation of the spectrogram of noise. Now, there is the question whether it is possible to use some other time-frequency distributions, in order to further improve the filtering results. The answer is yes. Namely, we can use reduced interference distributions which belong to the general Cohen class 1 of distributions [13]:
pxx(t, f) -
g(7-,u)x(u + -~)x*(u - -~ O0
O0
-J2~f'eJ2'~UdudOdT
O0
(11.3.12) where the kernel g(T, ~) specifies the distribution. The most commonly used distributions include the Choi-Williams distribution, Zao-Atlas-Marks distribution, BornJordan distribution, Zhang-Sato distribution, S-method, etc. When we use the reduced interference distributions, it is important to know that, in the case of a noisy signal, the distance between two auto-terms during voiced segments of speech is approximately equal to the value of the fundamental frequency in the case of a signal without noise. In the noisy case, we also have harmonically shaped components of noise, which can occur between the auto-terms of speech, causing additional cross-terms and errors in filtering [14]. A very simple and flexible implementation can be obtained by using the Smethod (SM) [15], [Article 6.2], whose realization is straightforwardly based on the short-time Fourier transform. The result of this fact is that time-varying filtering based on the SM is a simple extension of the spectrogram-based filtering. Additionally, the SM of rnulticomponent signals: N
re(t) -- E
xi(t),
(11.3.13)
i=1
can assume the form SMxx(t, f) ~ ~i=1 N Wx~x~ (t, f), being a desired approximation of the Wigner distribution auto-terms. -
-
1 T h a t is, the q u a d r a t i c class; see p. 68n.
Chapter 11"
486
Time-Frequency Synthesis and Filtering
The SM is defined in the form" SM~(t, f) = 2
F
P(O)Fx (t, f + O)F~ (t, f - O)dO.
(11.3.14)
oo
where P(0) is a rectangular window in the frequency domain. Discretization of the SM (11.3.14), taking a rectangular window for P(1), produces: L
SMzx(n, k) =
E
Fx(n, k + 1)F~(n, k - l)
l=-L L
= IFx(n,k)l 2 + 2 a e { E F z ( n , k
+ l ) F x ( n , k - 1 ) } , (11.3.15)
/=1
From the previous equation we see that the SM realization is based on the spectrogram. Thus, filtering based on the SM will be a straightforward extension of the previously considered filter schemes: LHw(t, f) - max
SM,(t, f) ~7} 1 - S-~x~: f ) '
(11.3.16)
and
LHss(t,f) = max { ~ 1 - ASMx(t, SM~(t' f) f)'/~}"
(11.3.17)
In our experiments we have used the SM with L = 3, and spectral floors ~ = 0.12 and ~ = 0.08, respectively [16]. The denoised signals by using the time-varying Wiener filtering and the timevarying spectral subtraction filtering, based on the SM, are shown in Fig. 11.3.2(c) and (d). By comparing the results with the ones produced by using the spectrogram based filtering, the improvements are obvious. It is important to note that the SM has a form very suitable for simple hardware realization. This property is attractive for on-line applications.
11.3.2
Summary and Conclusions
Time-varying filtering of speech signals disturbed by car noise is presented. On the base of the time-varying Wiener filter form, the time-varying spectral subtraction form of filtering is introduced. The filtering is performed on the base of the spectrogram and the S-method. The proposed filter schemes are efficient and suitable fbr hardware realization.
References [1] C. Breining, P. Dreiseitel, E. Haensler, A. Mader, B. Nitsch, H. Puder, T. Schertler, G. Schmidt, and J. Tilp, "Acoustic echo control: An application of very-high-order adaptive filters," IEEE Signal Processing Magazine, vol. 16, pp. 42-69, July 1999.
Time-Frequency Filtering of Speech Signals in Hands-Free Telephone Systems
487
Fig. 11.3.2: Denoised signal obtained by filtering based on: (a) time-varying Wiener filter using the spectrogram, (b) time-varying spectral subtraction using the spectrogram, (c) time-varying Wiener filter using the SM, (d) time-varying spectral subtraction using the SM.
[2] S. V. Vaseghi, Advanced Signal Processing and Digital Noise Reduction. Wiley and Teubner, 1996. [3] J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," Proc. IEEE, vol. 67, pp. 1586-1604, December 1979. [4] J. Yang, "Frequency domain noise suppression approaches in mobile telephone systems," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'93), vol. 2, pp. 363-366, Minneapolis, 27-30 April 1993. [5] G. Matz, F. Hlawatsch, and W. Kozek, "Generalized evolutionary spectral analysis and the Weyl spectrum of nonstationary random processes," IEEE Trans. Signal Processing, vol. 45, pp. 1520-1534, June 1997. [6] L. StankoviS, S. StankoviS, and I. Djurovi5, "Space/spatial frequency based filtering," IEEE Trans. Signal Processing, vol. 48, pp. 2343-2352, August 2000. [7] H. L. L. Van Trees, Detection, Estimation, and Modulation Theory, vol. I: "Detection, Estimation, and Linear Modulation Theory". New York: Wiley, 1968. Reprinted 2001.
488
Chapter 11:
-lime-FrequencySynthesis and Filtering
[8] H. Kirchauer, F. Hlawatsch, and W. Kozek, "Time-frequency formulation and design of nonstationary Wiener filters," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), pp. 1549-1552, Detroit, 9-12 May 1995. [9] W, Kozek, "Time-frequency signal processing based on the Wigner-Weyl framework," Signal Processing, vol. 29, pp. 77-92, October 1992. [10] A. Papoulis, Signal analysis. New York: McGraw-Hill, 1977. [11] L. Stankovi(~, "On the time-frequency analysis based filtering," Ann. Tdldcommunications, vol. 55, pp. 216-225, May/June 2000. [12] P. Flandrin and W. Martin, "The Wigner-Ville spectrum of nonstationary random signals," in The Wigner Distribution--Theory and Applications in Signal Processing (W. Mecklenbr~uker and F. Hlawatsch, eds.), pp. 211-267, Amsterdam: Elsevier, 1997. [13] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [14] S. Stankovi(~, "About time-variant filtering of speech signals with time-frequency distributions for hands-free telephone systems," Signal Processing, vol. 80, pp. 1777-1785, September 2000. [15] L. Stankovi5, "A method for time-frequency analysis," IEEE Trans. Signal Processing, vol. 42, pp. 225-229, January 1994. [16] S. Stankovi5 and J. Tilp, "Time-varying filtering of speech signals using linear prediction," Electronics Letters, vol. 36, pp. 763-764, April 2000.
Signal Enhancement by Time-Frequency Peal( Filtering
11.4 11.4.1
SIGNAL ENHANCEMENT FILTERING 0
489
BY T I M E - F R E Q U E N C Y
PEAK
Signal Enhancement and Filtering
Time-frequency peak filtering (TFPF) may be regarded as an unconventional alternative to the filtering methods described earlier in this chapter [1]. The signals considered are assumed to be sums of arbitrary numbers of band-limited non-stationary components in additive noise. For high SNR situations many signal processing algorithms work well but most perform poorly when SNR decreases below a given threshold [2]. In this case, signal enhancement algorithms are required to improve the SNR by reducing the distorting effects of noise. To this effect, both adaptive and fixed methods have been developed in the case of non-stationary signals in noise. Adaptive techniques are generally superior in performance to fixed methods, but they perform poorly in certain conditions, such as filtering of a non-stationary signal whose spectral content changes rapidly with time. For example, filters designed using a least-mean-square (LMS) approach may not adapt quickly enough to track the rapidly changing signal due to the delayed convergence of the algorithm. Further, adaptive methods require that the structure of the filter (such as the number of the taps) and an estimate of SNR be imposed for optimal performance. This is often not possible as assuming a model may lead to suboptimal results and even to erroneous conclusions about the signal. This suggests the need for a more general filtering method when the SNR is low and the underlying signal statistics vary rapidly with time. The T F P F method is based on encoding the noisy signal as the IF of a unit amplitude frequency modulated (FM) analytic signal. The instantaneous frequency (IF) of the analytic signal is then estimated using standard time-frequency peak detection methods [2] to obtain an estimate of the underlying deterministic signal. For some signals, T F P F using a windowed WVD results in a significant enhancement of signals for SNR as low as - 9 dB.
11.4.2 11.4.2.1
Time-Frequency Peak Filtering Background and Definitions
Let us consider signals expressed as follows: p
s(t) - x(t) + n(t) - E xk(t) + n(t)
(11.4.1)
k=l
where n(t) is an additive white Gaussian noise (WGN) and xk(t) are band-limited non-stationary deterministic components that may have overlapping frequency spectra. It is desired to recover the signal x(t) given the observation of s(t). ~ B o u a l e m B o a s h a s h and M o s t e f a M e s b a h , Signal Processing Research Centre, Queensland University of Technology, Brisbane, Australia (b.boashash@qut.edu.au, m.mesbah@qut.edu.au). Reviewers: M. G. Amin and D. L. Jones.
Chapter 11:
490
Time-Frequency Synthesis and Filtering
The IF of an analytic signal, z(t) = a(t)e j2~r reference [2]:
fz(t)-
is defined in Chapter 1 and
I de(t) 27r dt
(11.4.2)
where r is the instantaneous phase and a(t) is the instantaneous amplitude of the analytic signal z(t) which can be expressed as [3]:
z(t) - a(t)e j2~ f~-~ Yz(X)dX
(11.4.3)
Among the existing techniques for IF estimation [2], we adopt the method that estimates the IF by taking the peak of the signal's TFD for its simplicity of implementation. The WVD is a natural first choice as a TFD for peak filtering given that the other quadratic TFDs are simply smoothed versions of the WVD [3]. The WVD of the analytic signal z(t) is defined (in Section 2.1.4) as:
Wz(t, f ) -
F
z(t + T/2)z*(t -- T/2)e--J2"YrdT.
(11.4.4)
CO
For monocomponent FM signals, the WVD will produce a time-frequency representation of the signal exhibiting significant energy concentration around the signal's IF. When the signal's IF is linear, delta functions will appear at the positions of the IF providing a perfect signal IF estimate. The IF estimate is found by maximizing the WVD over frequency [2]; that is
]z(t)
- argmax[Wz (t, /)] /
(11.4.5)
The IF estimate based on the peak of the WVD is unbiased and has variance approaching the Cramer-Rao lower bound for signals with linear IF laws in additive white zero-mean noise with moderate to high SNR [2]. However, as the order of the polynomial IF increases, the delta functions will be replaced by less peaky functions. The peak of these functions will lie away from the true IF, resulting in IF estimates which are biased. To remedy this, a windowed WVD is used, such that the signal IF is as close to linear as possible across the entire window length.
11.4.2.2
Basic Principle
Time-frequency peak filtering consists of a two step procedure whereby the signal to be filtered is first encoded as the IF of a unit amplitude FM modulated analytic signal. Then, the IF is estimated by taking the peak of a time-frequency distribution (TFD) to recover the filtered signal. This may be summarized as follows:
Step 1. Encode the noisy signal s(t) via FM modulation as" Zs(t) = e j2~u fo s(~)d~ where # is a scaling parameter analogous to the FM modulation index.
Signal Enhancement by Time-Frequency Peak Filtering
491
Step 2. Estimate the peak of the WVD of the analytic signal z~(t): ~(t) - L(t)
11.4.2.3
- argmax[Wzs
I
(t, f)]/#
Properties
The properties of the encoding and IF estimation steps of T F P F are derived for the case of WVD only. The use of other TFDs will lead to slightly different properties.
Property 1: The encoding step converts the additive noise n(t) to multiplicative noise zn(t) that modulates the signal component zx(t); that is z(t) - d 2~t'.fo s(~)d~ = zx(t)zn(t)
(11.4.6)
where the encoded noise and deterministic signal components are given by z~(t) - eJ2".lo
x(~)d~
and
z~(t) -
eJ2~".f3~(~)~
(11.4.7)
Proof: Equation 11.4.6 is obtained by a direct substitution of 11.4.1 into 11.4.3. Property 2: The Wigner-Ville Spectrum of the signal zs(t) is given by : WVSzs (t, f) - E[W~ (t, f)] - WVSzn (t, f) ~ Wz~ (t, f)
(11.4.8)
where El.] is the expectation operator, WVS~. (t, f) = E[W~, (t, f)], and 9 repref sents the convolution operation in the frequency domain.
Proof: This property follows from property 1 and the direct application of the expectation operator to the WVD of the encoded signal zs(t). This latter is given by:
E[Wz~ (t, f)] - f-~c~ Rz.,, (t, 7-)Kz~ (t, w)e-J2=frd7 "
(11.4.9)
where the time-dependent autocorrelation function of Zn(t) is ft+T/2
Rz,~ (t, T) -- E[zn(t + T / 2 ) z ~ ( t - 7 / 2 ) ] - E[eJ2"uJ~-'/2 n(~)d~]
(11.4.10)
and the time-dependent bilinear product function of zx(t) is
Kz~ (t, ~-) - z~(t + ~ / 2 ) z ; ( t - ~/2) -
9
I't+r/2x(,,~)d~
~"--~/~
(11.4.11)
Equation (11.4.8) is then obtained by using the fact that the Fourier transform of a product in time is equivalent to the convolution in frequency.
Chapter 11:
492
Time-FrequencySynthesis and Filtering
Equation (11.4.8) shows that additive noise smears the encoded signal WVD, Wz~(t,f), through convolution. Therefore, the bias of T F P F is dependent on the encoded signal zx(t) as well as the shape of the encoded noise spectrum WVSz,, (t, f). By restricting the shape of the encoded noise spectrum, a class of noise can be defined which does not introduce bias to the IF estimation. An example from this class is the WGN as will be seen next.
Property 3: The time dependent autocorrelation function of the encoded noise, rt+Tl2n(A)dA; Rz. (t, 7; p), is equal to the characteristic function of q(t, T) - 27~Jt-~/2 that is
Rz~ (t, T, p) -- E[e j2€
- Oq(t, T, p)
where (I)q(t, T, p) is the characteristic function of
~q(t,w,#)-E[eJ'q(t'~)]-exp(~ i=I
and
(11.4.12)
q(t, T) defined by [4]:
kqi(t'~-)(j#)i ) ii
(II.4.13)
kqi(t, ~-) is the ith cumulant of q(t, w).
Proof: Equation 11.4.12 is obtained by forming the autocorrelation function of Zn(t) in equation 11.4.7 and using the above definition of q(t, T). 11.4.3
Accurate T F P F
Equation (11.4.8) suggests that in general, a bias in IF estimation is introduced by the time-frequency distribution of zx(t) (deterministic bias) and/or the noise (stochastic bias). In the case where the encoded signal s(t) is composed of a deterministic signal x(t) that is linear in time and embedded in stationary WGN n(t), T F P F gives an unbiased estimate of the signal x(t).
Proof: Consider the signal s(t), given in (11.4.1), to be filtered using TFPF. For the case where n(t) is stationary WGN, the ith cumulant of n(t) is such that kni - 0 for i > 3 and q(t, T) is Gaussian with kqi - 0 for i > 3. Furthermore, if the noise is a zero-mean independent process, i.e. Rn(T) -- kn26(T), then [5, page 369] kql (t, T) - 0
and
kq2(t, T) - 4~2[~-]kn2
(11.4.14)
The characteristic function given in (11.4.13) becomes
(~q(t, 7, p) = e -2~2t'2'TIk"2
(11.4.15)
Taking the Fourier transform of this expression gives 4~2kn2P 2
E[Wn(f, $)] -- (27r2kn2#2)2 + (27rf) 2
(11.4.16)
Signal Enhancement by Time-Frequency Peak Filtering
493
This shows that the encoded noise spectrum is low-pass with a maximum at the frequency OHz. Hence, WGN will not introduce any bias to the estimate of the IF. By replacing this last expression in (11.4.8) we obtain:
W~. (t, f) - Wzx (t, f)
}
47r2kn2# 2 (2rr2kn2.2)2 + (2rrf) 2
(11.4.17)
This expression shows that the bias in the IF using the peak of Wzs (f, t) could only come from Wz:,,(t, f). For the case where the signal x(t) is linear in time; that is x(t) - at + C, where a and C are constants, equation 11.4.17 becomes
47r2kn2# 2 Wzs (t, f) - d(f - x(t)) ~ (27r2kn2#2)2 + (27r/) 2 47r2kn2p 2 m
+ (2
f- 2 x(t))
(11.4.18) (11.4.19)
The delta function ensures that the peak of this function occurs at x(t). Therefore if the signal x(t) is linear in time and embedded in stationary WGN, T F P F gives an unbiased estimate. Equation 11.4.17 shows that in the general case where the signal x(t) is a nonlinear function of time, the WVD-based T F P F is biased, requiring an appropriate windowing of the data. The window is chosen such that the signal within this window behaves almost linearly [6] (see Section 11.4.4). In the special case where the signal x(t) is a finite-order polynomial in time, the deterministic bias can be completely eliminated if the WVD is replaced by the polynomial WVD (PWVD) of an appropriate order since the PWVD exhibits delta functions along the IF law for polynomial FM signals [7].
11.4.4
Discrete-Time Algorithm for TFPF
The implementation of T F P F using the windowed WVD requires both signal scaling before encoding to prevent aliasing, and the selection of the window length for reduced bias. These two aspects are discussed next.
11.4.4.1
Signal Scaling
FM modulation of un-scaled discrete time signals can lead to aliasing which produces discontinuities in the estimated IF at the frequency boundaries of the time-frequency plane. This is avoided by amplitude scaling of the noisy signal before frequency encoding. Without loss of generality and unless otherwise specified we assume that the signal s(t) is sampled at a normalized sampling frequency of 1Hz. The scaled signal, so(m), is obtained by using the following transformation.
( a - b)
s(m) - min[s(m)] +b m a x [ s ( m ) ] - min[s(m)]
(11.4.20)
Chapter 11:
494
Time-Frequency Synthesis and Filtering
where S[.] is the scaling operator and the parameters a and b, which satisfy the constraint .5 >_ a - max[sc(rn)] > b = min[sc(m)] >_ O, are chosen to provide suitable frequency limits on the encoded signal. The operators max[.] and mini.] are the maximizing and minimizing functions respectively. The estimate of the desired signal, 2(m), is recovered by an inverse scaling operation; that is ~?(m)
I
~" .r
t~ rm~] , ,J _ (:~(m) - b)(max[s(m)] - min[s(m)]) + minis(m)] (11.4.21) a-b
where 3Cc(m) is the scaled signal obtained using T F P F on sc(m).
11.4.4.2
Reduced-Bias Window Length Selection
The bias-variance tradeoff is a key in the practical implementation of T F P F with the windowed WVD. Bias reduction requires a small window length to minimize the non-optimal nature of the WVD for higher than quadratic phase signals. On the other hand, variance reduction is achieved by increasing the window length to provide the local estimate with more information. To reduce the variance of the estimate while maintaining bias performance it becomes necessary to increase the sampling rate. Thus there is a tradeoff between estimator bias and sampling frequency which results in a relationship between window length and bias, for a given sampling frequency. Results relating to T F P F window lengths are derived in [6]. The basic results for window length Tw, sampling frequency fs and maximum value of IF, fp are given below. For the case of signal estimation: Tw <_
0.634]'8
(11.4.22)
This equation gives the maximum window length as a function of maximum signal IF and sampling frequency. In a typical application a specified window length is required fbr a given SNR. The signal is sampled at a high enough rate to satisfy (11.4.22).
11.4.4.3
The Iterative TFPF Algorithm
In the first application, T F P F may not remove as much additive noise as desired. If this situation occurs, reapplication of the procedure to the filtered signal is recommended. This leads to a 3 step iterative process: 1. Scale and encode noisy signal. 2. Apply T F P F to yield a signal estimate 2(t). 3. If ~?(t) contains substantial noise, go to step 1; else terminate the process. Simulations demonstrate the convergence of the repeated scheme to a stable estimate ~:(t) [1].
Signal Enhancement by Time-Frequency Peak Filtering
495
Fig. 11.4.1: B-distributions of a synthetic signal (left column) and an EEG signal (right column), showing the original signal (top row), and the noisy signal before enhancement (middle row) and after enhancement (bottom row).
Chapter 11"
495
11.4.5
Time-Frequency Synthesis and Filtering
Examples and Results
E x a m p l e 1 (A M u l t i c o m p o n e n t ponent signal, x(m), expressed by:
s i g n a l in W G N ) Let us consider a multicom-
0.85 sin(0.055m + 3.75 • 10 .4 sin(0.000625m)m)+ sin(0.035m) + sin(0.020m), 0 < m < L/2 0.85 sin(0.055m + 3.75 x 10 -4 sin(0.000625m)m) +2(1 - re~L)sin(0.035m + 2.75 x 10-11(m - L/2) 3) +2(1 - re~L)sin(0.020m- 2.75 x 1 0 - 1 1 ( m - L/2)3), L/2 < m < L
-
where the data length L is taken as 32768 data points. For a time-frequency illustration of this signal, the B-distribution (BD) with smoothing parameter ~ = 0.01 is shown in Fig. 11.4.5(a). 1 White Gaussian noise was added to the above signal giving an SNR of - 9 dB; the BD of the noisy signal is shown in Fig. 11.4.5(b). The windowed WVD peak filter was then implemented to recover x(m) from the noisy signal. A window length of 15 data points was chosen to satisfy the window length constraints given in (11.4.22). Fig. 11.4.5(c) shows the clean recovery of the signal after three T F P F iterations. Note that the WVD is used as the vehicle for signal recovery while the B-distribution is used only for presentation of the results. E x a m p l e 2 ( N e w b o r n E E G d a t a in W G N ) : Fig. l l . 4 . 5 ( d ) s h o w s a timefrequency representation of a real newborn EEG signal using the B-distribution with ~ = 0.01. WGN is then added to the signal at S N R = - 9 dB. The noisy signal in Fig. 11.4.5(e) shows that the time-frequency patterns of the EEG signal are not clearly visible. Using a window length of 20 data points, four iterations of the T F P F were used to recover a cleaner signal. The filtered signal in Fig. 11.4.5(f) demonstrates the efficiency of T F P F .
11.4.6
Summary and Conclusions
T F P F is a tool for signal enhancement, applicable to a large class of signals if the windowed WVD T F P F is used for reduced bias. This class includes those signals which may be represented as a sum of band-limited non-stationary processes in additive WGN. Testing on simulated and real data indicates that the method significantly enhances signals of this class by filtering out most of the additive noise. Further details of the time-frequency peak filtering method are provided in [8].
References [1] M. J. Arnold, M. Roessgen, and B. Boashash, "Filtering real signals through frequency modulation and peak detection in the time-frequency plane," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), vol. 3, pp. 345-348, Adelaide, Australia, 19-22 April 1994. 1See p. 53 for a definition of the BD and its parameter.
Signal Enhancement by Time-Frequency Peak Filtering
497
[2] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals; Part 2: Algorithms and applications," Proc. IEEE, vol. 80, pp. 519-568, April 1992. With introductions by Michael J. Riezenman. [3] B. Boashash, "Time-frequency signal analysis," in Advances in Spectrum Analysis and Array Processing (S. Haykin, ed.), vol. 1, ch. 9, pp. 418-517, Englewood Cliffs, NJ: Prentice-Hall, 1991. [4] A. Papoulis, "Random modulation: A review," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 31, pp. 96-105, February 1983. [5] A. Papoulis, Probability, Random Variables, and Stochastic Processes. McGraw-Hill, 3rd ed., 1991.
New York:
[6] M. Mesbah and B. Boashash, "Reduced bias time-frequency peak filtering," in Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), vol. 1, pp. 327-330, Kuala Lumpur, 13-16 August 2001. [7] B. Boashash and P. J. O'Shea, "Polynomial Wigner-Ville distributions and their relationship to time-varying higher order spectra," IEEE Trans. Signal Processing, vol. 42, pp. 216-220, January 1994. [8] B. Boashash and M. Mesbah, "Signal enhancement by time-frequency peak filtering," IEEE Trans. Signal Processing, vol. 51, November 2003.
This Page Intentionally Left Blank
Chapter 12
Detection, Classification and Estimation Methods for the detection, estimation and classification of non-stationary signals can be enhanced by utilizing the time-frequency characteristics of such signals. Such time-frequency formulations are described in this chapter. The topic is covered in four articles with appropriate internal cross-referencing to this and other chapters. The structure of TF methods is suitable for designing and implementing optireal detectors. Several approaches exist, such as decomposition of TFDs in sets of spectrograms (Article 12.1). For both analysis and classification, a successful TF methodology requires matching of TFDs with the structure of a signal. This can be achieved by a matching pursuit algorithm using time-frequency atoms adapted to the analyzed signals (12.2). We can perform system identification by exciting linear systems with a linear FM signal and relating TFDs of the input and output using time-frequency filtering techniques (12.3). Methods for time-frequency signal estimation and detection can be carried out using time-varying Wiener filters (12.4).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
499
500
12.1 12.1.1
Chapter 12:
Detection, Classification and Estimation
OPTIMAL TIME-FREQUENCY DETECTORS~ Time-Frequency Detection
The most predominant application of time-frequency representations (TFRs) is exploratory signal analysis: using TFRs to expose time-varying spectral characteristics of signals. TFRs are widely used in this mode in applications ranging from speech analysis to biomedical signal analysis to geophysical exploration. However, the structure of TFRs can also be fruitfully exploited for designing and implementing optimal processors for detecting a variety of nonstationary signals in noisy environments. Perhaps the simplest and best-known examples are the short-time Fourier transform (STFT) and the cross-ambiguity function in radar/sonar processing [1]. The goal of this article is to highlight the key ideas behind the role of TFRs in optimal detection. Essentially, TFRs are useful for detecting nonstationary stochastic signals that exhibit certain unknown parameters, called nuisance parameters, such as unknown time or frequency shifts. The statistical characteristics of the signal are matched via proper choice of the TFR kernel and the nuisance parameters are dealt with by exploiting the covariance of TFRs to time and frequency shifts. The structure of optimal TFR detectors also suggests a design approach for suboptimal, low-complexity TFR detectors that can potentially deliver competitive performance in scenarios in which optimal detectors are prohibitively complex. The notion of time-frequency detectors also naturally generalizes to a certain class of quadratic signal representations that extend the concept of time and frequency. The next section reviews relevant TFR concepts. Section 12.1.3 presents a class of detection scenarios for which TFRs are optimal and provides a characterization of the T F R detectors. Section 12.1.4 briefly discusses some extensions of the detection framework to generalized TFRs as well as suboptimal detectors. Finally, Section 12.1.5 provides some concluding and bibliographic remarks.
12.1.2
Time-Frequency Representations The most widely used linear TFR is the STFT that correlates the signal of interest, r(t), with TF shifted copies of a prototype (window) signal g(t) (t, f) - ]" r(T)g* (T -- t)e-J2~f~dT.
(12.1.1)
One of the most important quadratic TFRs is the real-valued Wigner distribution (WD) that essentially correlates the signal with time-frequency shifted copies of a time-reversed version of itself:
w (t, f)
f r(t + T/2)r* (t -- T/2)e-J2~frdT. J
(12.1.2)
~ A k b a r M. Sayeed, Department of Electrical and Computer Engineering, University of Wisconsin, Madison, WI 53706, USA (akbar@engr.wisc.edu). Reviewers: P. Flandrin and A. Papandreou-Suppappola.
Optimal Time-Frequency Detectors
501
A whole class of quadratic TFRs, sometimes called Cohen's class [2], can be generated from the WD via time-frequency filtering with a two-dimensional kernel (12.1.3) where the entire class is characterized by different choices of the kernel 7(t, f). A characterizing property of TFRs in Cohen's class is that they are covariant to time-frequency shifts in the signal:
r(t)
~ r(t
-
to)e j2~f~
:,
pr(t, f)
~ p r ( t - to, f
-
fo).
(12.1.4)
The magnitude squared STFT, often called the spectrogram, is a member of Cohen's class with the kernel given by the WD of the window function: IF~(t, f)l 2 =
p~(t, f ; 7 - Wg).
An important relation in the context of detection is Weyl correspondence that relates a quadratic form to the WD
(12.1.5)
where L a linear operator defined by the kernel L(tl, t2) and WSL(t, f) denotes the Weyl symbol of the operator defined as [3] WSL(t, f) - / L(t + 7/2, t - ~-/2)e-J2~f~dT.
(12.1.6)
If L(tl, t2) - Rs(tl, t2) - E[s(tl)s*(t2)], the correlation function of a random signal s(t), then the Weyl symbol of Rs(tl,t2) is also the Wigner-Ville spectrum of s(t) defined as [2] f WVS~(t, f) - C[W~(t, f)] - / R~(t + 7/2, t - T/2)e-J2~J'td7 -- WSR.
(t, f).
J
(12.1.7) The Wigner-Ville spectrum is a generalization of the notion of power spectral density to nonstationary processes and is an equivalent characterization of the second-order statistics of s(t). The kernel 7(t, f) can be interpreted as the WS of an operator L~ by inverting
L~(tl, t2) -
/
7
(tl+t2 ) (tl-t2 2 ' f eJ2"f )df .
(12.1.8)
If 7(t, f) generates a real-valued T F R then L~ is a Hermitian operator and under certain conditions (e.g, if it is a Hilbert-Schmidt operator) it admits an eigendecomposition L~(tl,t2) = ~-~kAkuk(tl)u~(t2) where {Ak} are the real-valued eigenvalues and {uk(t)} are the corresponding orthonormal eigenfunctions. Using this eigendecomposition, the kernel can be expressed as a weighted sum of
Chapter 12:
502
Detection, Classification and Estimation
WDs, 7(t, f) = E k Ak Wuk (t, f), and the corresponding T F R can be expressed as a weighted sum of spectrograms
pr(t,f;v) = E Ak/ /'Wr(u,v)Wuk(U- t,v- f)dudv k = ~-~AkIF~(t,f)l
(12.1.9)
2.
k
This decomposition in terms of spectrograms (discussed in Article 6.4) provides an intimate connection between linear and quadratic T F R s and is very useful for efficient implementation of TFR-based detectors.
12.1.3
Time-Frequency Detection Framework
We motivate TFR-based detection by considering the classical binary hypothesis testing problem encountered in radar. Let v(t) denote the complex baseband transmitred signal of duration To and bandwidth Bo. The received signal r(t) is processed over an observation interval [Ti, T/] to detect the presence of a target. The two hypothesis on r(t) are
Ho : r ( t ) = ~(t), t e [Ti, :/1/] Hi : r(t) = as(t; to, fo) + c(t), t e [Ti, T/] .
(12.1.10)
Under the null hypothesis, H0, the signal contains only the zero-mean complex additive white Gaussian noise (AWGN), c(t), of variance a c2 . Under the active hypothesis, H1, a T F shifted version of the transmitted signal, s(t; to, fo) = v(t - to)e j2~/ot, is received in the presence of noise. In (12.1.10), a denotes an unknown complex gain parameter. The unknown time and frequency parameters, (to, fo), represent the delay and Doppler shift of the received signal relative to the transmitted signal and correspond to the unknown distance and velocity of the target. Let $ = [Tmin,Tmax] • [-Fmax, Fmax] denote the possible range of values for the nuisance parameters (to, f o). The required observation interval is [Ti, Tf ] = [Train, To + Tmaz] in this c a s e . For any given value of (to, fo) E $, the optimal decision statistic, derived from the likelihood ratio [1,4], is the noncoherent matched filter statistic 2
D(t~176 = I(r's(t~ f~
-
.IT" r(t)v*(t -- to)e-J2~/~
-IF/(to,
fo)J 2, (12.1.11)
which correlates the received signal with the reference signal s(t; to, fo) and can be computed via a spectrogram with v(t) as the window. The unknown nature of the nuisance parameters is usually accounted for via a generalized likelihood ratio test (GLRT) corresponding to the decision statistic
D(r)-
max D(t'f)(r)= max IF~(t,f)[ 2 (t,f)ES (t,f)ES
(12.1.12)
503
Optimal Time-Frequency Detectors
which represents the peak of the spectrogram over $. The statistic D(r) is compared to a threshold to declare the presence (H1) or absence (H0) of a target. Thus, the optimal detector correlates the received signal with all possible time-frequency shifted versions of the transmitted signal and picks the largest correlation as the decision statistic. The location of the peak is in fact a maximum likelihood (ML) estimate of the unknown parameters: (to,/o)ML argmax(t,f)es D (t'f). The GLRT detector -
-
(12.1.12) is the statistic (12.1.11) computed at (to,/o)ML. The detector performance is governed by the shape of the magnitude squared of the auto-ambiguity function of the transmitted signal
Av(~', u) - f v(t + T/2)v* ( t - T/2)e-J2"tdt near the origin [1]. Note that IA,(T,u)[ 2 = IFW(T,u)I 2 =
(12.1.13)
ffW.(u,v)W.(u-
T, v - u)dudv. Ideally, Av(T, u) should approximate a two-dimensional Dirac delta function for perfect resolution in delay and Doppler. Waveforms with large timebandwidth products, such as those derived from pseudo-random codes used in spread-spectrum communications [5], can be used to approximate this "thumbtack" function. The simple detector (12.1.12) is based on a rank-1 quadratic 1 TFR. Higherrank quadratic T F R s can realize detectors for a much richer class of nonstationary stochastic signals. The next subsection describes this quadratic T F R detection framework. In turn, in many cases low-rank approximations of T F R detectors, implemented via a bank of a few spectrograms, suffice.
12.1.3.1
Quadratic Time-Frequency Detectors
Quadratic T F R s can be exploited for detection scenarios of the form (12.1.10) where the signal s(t; to, fo) is a time-frequency shifted version of some underlying stochastic signal whose Wigner-Ville spectrum is relatively localized in time and frequency. This represents a fairly rich class of detection scenarios for modeling the inherent randomness in the signal as well as measurement uncertainties that manifest themselves as unknown time-frequency shifts. For example, radar applications in which the transmitted signal v(t) encounters a randomly time-varying scatterer or channel may be modeled this way [1]. Similarly, signals involved in machine fault diagnostics may exhibit random characteristics along with timing jitters and frequency offsets due to physical mechanisms. The essential idea behind quadratic T F R detectors is to capture the signal statistics via the kernel 7(t, f) and the nuisance parameters via the time-frequency covariance property (12.1.4). Suppose that the signal s(t; to, fo) in (12.1.10) is a time-frequency shifted version of a zero-mean Gaussian signal w(t); that is, s(t;to, fo) = w ( t - to)e j27r$~ and Rs(to,fo)(tl, t2) - Rw(tl - to, t2 - to)e j2~f~ In this case, for any given (to, fo), aThe rank of a TFR is the number of significant nonzero eigenvalues of 7(t, f) in (12.1.9).
Chapter 12: Detection, Classificationand Estimation
504
the optimum decision statistic is [4, 6, 7]
D(to,fo) (r) - (Rs(to,lo)r, r) = <Wr, WS~(,o.,o) ),
(12.1.14)
A
2I)-lRs(to,io) is an operator defined by the signal where RS(to,fo) - (RS(to,fo) + (7~ correlation function and the second equality follows from (12.1.5). The simpler choice RS(to,fo) - Rs(to,fo) in (12.1.14) yields the locally optimum 2 detector [4]. The unknown nuisance parameters are again accounted for by exploiting the covariance property (12.1.4) which implies that WSRs(,o,~o)(t, f) = W S R ~ ( t - t o , f-h
fo) and WSfis(,o,lo ) ( t , f ) - W S h ~ (t-to , f - f o ) , where Rw - ( R ~ + a 2 I ) - I R ~
[7].3
Consequently, the quadratic decision statistic in (12.1.14) can be expressed as
D (t~176
ffw (u,,)ws (U-to, V-fo)dud,
- p~(to, f o ; 7 - W S ~ , , , ) .
(12.1.15) A
Similarly, the locally optimal detector corresponds to Rw = Rw in (12.1.15). Thus, the decision statistic corresponding to different values of (to, fo) can be computed via a T F R with kernel WS~w. The overall GLRT detector for unknown (to, fo) is given by
D(r) = max p~(t f ; WS~ ) (t,f)ES
'
max f / Wr(u, v)WS~
(t,f)E8
(u - t, v - f)dudv
(12.1.16)
where S = [Train, Tmax] x [-Fmax, Fmax]. If WVSw(t, f ) = 8[Ww(t, f)] is supported on (t, f ) e [0, Tw] x [-Bw,B~], the required observation interval is [Ti, Tf] = [Train, Tm~x + T~] and Wr(t, f) in (12.1.16) is computed over the range [Tm{~, T m ~ + T~] x [ - B ~ - Fm~x, Bw + F,~x]. The detector (12.1.16) has the intuitively appealing interpretation of nonstationary spectral correlation: the WD of the observed waveform is correlated with all possible time-frequency shifted versions of WS&~ which is the WVS of w(t) in the case of the locally optimum detector and the WVS of a modified version of w(t) in the case of the optimal detector. This interpretation of optimum T F R detection is illustrated in Fig. 12.1.1. The location of the maximum in (12.1.16) is the ML estimate of (to, fo). Estimator performance is related to the shape of F(T, v) = f f WVSw(u, v)WVSw(u - T, V -- v)dudv near the origin. 2The first term in the expansion of the likelihood ratio as a function of signal-to-noise ratio (SNR) [4]. This decision statistic is useful in low-SNR scenarios. 3We note that both ~ and Rw have the same eigenfunctions; the eigenvalues of P~w are obtained from those of Rw by adding ae2 to each.
505
Optimal Time-Frequency Detectors
Fig. 12.1.1: Illustration of nonstationary spectral correlation underlying TFR detection. (a) The Wigner-Ville spectrum, WVSw(t, f), of the underlying stochastic signal w(t). (b) The idealized TFR, pr(t, f;~f - WVSw), in the absence of noise; that is, r(t) = s(t;to,fo). (c) The WD of the noisy observed signal r(t). (d) The TFR, pr(t, f;3' = WVSw), of the noisy signal. The TFR in (d) is computed by smoothing the WD in (c) with the WVS in (a) and consists of the idealized TFR in (b) and additive noisy components. The peak of the TFR in (d) represents the decision statistic that is compared to a threshold to make a decision. The location of the peak represents ML estimates of the unknown delay and Doppler shift of the target. A
The correlation function Rw(tl, t2) is related to Rw(tl, t2) by A
Ak
Rw(tl, t2) -- E Ak -F o-2 uk(tl)U~(t2) k
where {Ak} and {uk} denote eigenvalues and eigenfunctions, respectively, of Rw(tl,t2). Analogous to (12.1.9), the decision statistic (12.1.15) can be expressed as a weighted sum of spectrograms
D(t~176
- E Ak IF~k(to, fo)l 2 k
(12.1.17)
Chapter 12:
506
Detection, Classification and Estimation
where Ak -- ~k+o~ in the case of optimum detection and Ak -- Ak in the case of locally optimum detection. The importance of the above decomposition stems from the fact that many signals encountered in practice exhibit few dominant eigenvalues. This implies that the quadratic TFR detector (12.1.16) can be efficiently implemented via a bank of a few spectrograms.
12.1.3.2
An Illustrative Example
We now illustrate the TFR detection framework in the context of an important application in which the transmitted signal v(t) travels over a randomly time-varying channel. Such a channel model is appropriate in many applications, including radar [1], wireless communications [5], and underwater acoustical communication [1]. In this case, the stochastic signal s(t; to, fo) is related to v(t) as
s(t; to, fo) -
j~0Tmh(t, T)v(t -- T -- to)eJ2~/~ (12,1.18) Bn
where h(t, T) is the time-varying channel impulse response and the equivalent representation, H(u, T) = f h(t, T)e-J2~tdt, is called the delay-Doppler spreading function [5]. In (12.1.18), the received signal is modeled as a linear combination (with stochastic weights) of a continuum of time-frequency shifted copies of v(t). The parameters Tm and Bd are the multipath and Doppler spreads of the channel, respectively. The parameters (to, fo) represent the global delay and Doppler shift encountered during propagation, and Tm and Bd represent the local time-frequency spreading around (to, fo) produced by scattering. Each (u, T) e [--Bd, Bali • [0, Tm] represents a particular infinitesimal scatterer with stochastic gain H(~, T). Under the wide-sense stationary uncorrelated scattering (WSSUS) model, {H(v, T)} are modeled as a collection of uncorrelated Gaussian random variables [5]. The underlying Gaussian signal w(t) in this case is characterized by
Rw(tl, t2) -
jf0Tm/Ba M(~, T)v(tl -- w)v*(t2 -- T)eJ2~v(t~-t2)dudT.
(12.1.19)
Bd
where M(v, T) = $[IH(v , T)I 2] is the channel scattering function which quantifies the relative power contributed by each scatterer. Note that WVSw(t, f) is supported on [0, To + Tm] • [-Bo - Bd, Bo + Bd]. We focus on the locally optimum detector for simplicity which uses Rw - Rw in (12.1.16). Using the definition of the Weyl symbol in (12.1.6), it can be shown that the detection statistic (12.1.15) takes the
Optimal Time-Frequency Detectors
507
form
D(t~176
= pr(to, fo; 7 = WVS~) = / / Wr(u, v)WVSw(u -to, v - fo)dudv
(12.1.20)
=
(12.1.21)
M(,, r)IF;(to + r, fo + ~)12 dudr. 0
Bd
Thus, the test for any given (to, fo) can be computed by smoothing Wr(t,f) in the neighborhood of (to, f o) with WVSw(t,f) or by smoothing IFrv(t,f)l 2 in the neighborhood of (to, fo) with M(u, T). The overall GLRT detector (12.1.16) can be realized by: (1) computing the Wr(t, f) for (t, f) E [Tmin, Tmaz+To+Tm]x[-FmaxBo - Bd, Fma= + Bo + Bd], or by computing IF~(t, f)l 2 for (t, f) E [Train,Tmax Jc Tin] x [-Fmax - Bd, Fmax q- Bd], (2) computing D (t'I) over S by smoothing Wr(t, f) with WVSw(t, f) as in (12.1.20) or by smoothing the spectrogram with M(u, T) as in (12.1.21), and (3) comparing the maximum of {D(t'/) : (t, f) e S} to a threshold. While the detector (12.1.21) is in a fairly simple form, the low-complexity implementation in terms of the eigenfunctions of Rw(tl, t2) takes an equally simple form. Due to the finite duration and bandwidth of v(t), the decision statistic (12.1.21) admits the following approximate sampled representation [T.,.Bo] rS.To]
D(t~176
~ ~ ~ M(m/To, l/Bo) IF~(to + I/Bo, fo + m/To)l 2 (12.1.22) l=o m=- rBdTo]
i
where M(t,, r) represents a smooth version of M(u, T) and the number of terms in the summation represents the number of delays and Doppler shifts that are resolvable at the receiver, which is also the number of dominant eigenvalues of Rw(tl, t2). Note that the number of dominant eigenvalues is approximately 2TmBdToBo which is typically relatively small since most practical channels are underspread (2TmBd << 1) [5].
12.1.4
Extensions
We now briefly discuss two extensions of the quadratic T F R detection framework described above. First, if the nuisance parameters are modeled as random with known probability density function, p(t,f), the locally 4 optimum Bayesian test 4The optimum Bayesian detector essentially corresponds to replacing pr(t,f;WSR,~) with
e-Pr(t'f;wsR,~ ) in (12.1.23).
508
Chapter 12:
Detection, Classification and Estimation
statistic can be realized in the time-frequency domain as # DBayes(r) -- ./~ fir(t, f; WVSw)p(t, f ) d t d f
=
w~(t, f)WVS~ (t, f ) d t d f .
(12.1.23)
A
where WVSw(t, f ) - f f p(t, f)WVSw(u - t, v - f ) d u d v . The Bayesian detector computes a test statistic based on our prior expectation of the likelihood of different values of the nuisance parameters. The time-frequency formulation of a quadratic detector in (12.1.14) has an interesting interpretation in view of the form of the Bayesian detector (12.1.23) and the quasi-energy density interpretation of the WD: the quadratic detector (12.1.14) weights the WD of the noisy received signal with our prior expectation of distribution of the signal energy in the time-frequency plane. This fact can be also be exploited in the design of rank-1 detectors when the underlying reference signal can only be inferred from noisy measurements [8]. In the rank-1 case, I(r, s)l 2 = (Wr, Ws), and if Ws(t, f ) is mostly concentrate along a curve, for example Ws(t, f ) = 5 ( f - u(t)), it can be estimated more accurately (as opposed to s(t) directly) due to higher SNR along the curve in the time-frequency plane. A matched filter for the underlying signal can then be inferred via the instantaneous frequency or the group delay defined by the curve. Such weighted averaging of the T F R can also be exploited (in a suboptimal manner) to design detectors that suppress unwanted signals whose TF support does not completely overlap the support of the desired signal. Another direction of generalization is going beyond TFRs to joint signal representations in terms of variables other than time and frequency - - time-scale representations (TSRs), for example [9]. The GLRT detection framework described above is best suited to representations that possess some covariance property analogous to (12.1.4). For example, TSRs are covariant to time shifts and scale changes. In general, such covariance properties are imposed via a parameterized family of unity operators [10]. On the one hand, each family of unity operators defines a class of sighal representations and on the other hand the parameters of the unitary operators provide a model for nuisance parameters in GLRT detection scenarios. In essence, each family of unity operators defines a one-to-one correspondence between a particular class of detection scenarios and a particular class of joint signal representations that serve as optimal detectors for such scenarios.
12.1.5
Summary and Conclusions
Time-frequency formulation of optimum quadratic detection, as in (12.1.14), along with a discussion of linear detectors was first developed by Flandrin in [6]; the basic ideas emphasizing the usefulness of a time-frequency approach to detection had been introduced earlier by K u m a r and Carroll (1984) and Kay and Boudreaux-Bartels (1985) (see the references in [6]). Some elements of Bayesian detection were also introduced in [6]. Altes also provided a time-frequency formulation of optimum
Optimal Time-Frequency Detectors
509
detection and classification in terms of spectrograms in [11]. The GLRT formulation emphasized in this article, exploiting the degrees of freedom in a T F R (kernel and covariance properties), was developed in [7, 10]. The same general framework has been recently extended to design data-driven detectors and classifiers that infer the optimal processor directly from data [12]. While these formulations emphasize optimum detection, a potentially fruitful research direction, which is relatively unexplored, is the (suboptimal) combination of exploratory data analysis with decision theory: exploiting the degrees of freedom in the T F R to isolate essential signal features that can serve as inputs to pattern classifiers [13].
References [1] H. L. L. Van Trees, Detection, Estimation, and Modulation Theory, vol. III: "RadarSonar Signal Processing and Gaussian Signals in Noise". New York: Wiley, 1971. Reprinted Malabar, FL: Krieger, 1992. Reprinted New York: Wiley, 2001. [2] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-HM1, 1995. [3] R. G. Shenoy and T. W. Parks, "The Weyl correspondence and time-frequency analysis," IEEE Trans. Signal Processing, vol. 42, pp. 318-331, February 1994. [4] H. V. Poor, An Introduction to Signal Detection and Estimation. New York: Springer, 1988. [5] J. G. Proakis, Digital Communications. New York: McGraw-Hill, 3rd ed., 1995. [6] P. Flandrin, "A time-frequency formulation of optimum detection," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 36, pp. 1377-1384, September 1988. [7] A. M. Sayeed and D. J. Jones, "Optimal detection using bilinear time-frequency and time-scale representations," IEEE Trans. Signal Processing, vol. 43, pp. 2872-2883, December 1995. [8] E. Chassande-Mottin and P. Flandrin, "On the time-frequency detection of chirps," Applied ~ Computational Harmonic Analysis, vol. 6, pp. 252-281, March 1999. [9] O. Rioul and P. Flandrin, "Time-scale energy distributions: A general class extending wavelet transforms," IEEE Trans. Signal Processing, vol. 40, pp. 1746-1757, July 1992. [10] A. M. Sayeed and D. L. Jones, "Optimum quadratic detection and estimation using generalized joint signal representations," IEEE Trans. Signal Processing, vol. 44, pp. 3031-3043, December 1996. [11] R.A. Altes, "Detection, estimation and classification with spectrograms," J. Acoustical Soc. of America, vol. 67, pp. 1232-1246, April 1980. [12] C. Richard and R. Lengell~, "Data-driven design and complexity control of timefrequency detectors," Signal Processing, vol. 77, pp. 37-48, August 1999. [13] M. Davy, C. Doncarli, and G. F. Boudreaux-Bartels, "Improved optimization of timefrequency-based signal classifiers," IEEE Signal Processing Letters, vol. 8, pp. 52-57, February 2001.
510
12.2 12.2.1
Chapter 12:
Detection, Classification and Estimation
T I M E - F R E Q U E N C Y SIGNAL ANALYSIS A N D CLASSIFICATION USING M A T C H I N G P U R S U I T S o Signal Time-Frequency Structures
Although time-frequency representations (TFRs) are the appropriate tools for nonstationary signal processing [1-3], it is important to match a T F R with the timefrequency (TF) structure of a signal for successful TF analysis and classification (see Section 1.1.5). In general, TFRs are ideally matched to one or two TF structures based on the properties they satisfy. However, some signals have multiple components with distinctively different instantaneous frequency (IF). For example, in a sonar acoustics application, a received signal (e.g. a linear frequency-modulated (FM) signal) may be interfered by whistles from a group of dolphins (e.g. with hyperbolic IF). The WAgner distribution (WD) can accurately represent signals with constant or linear IF whereas the Altes Q-distribution can accurately represent signals with hyperbolic (dispersive) IF. However, neither one of the two quadratic TFRs will ideally analyze all the components of a signal consisting of both linear and hyperbolic IF as in the aforementioned example. The TF analysis and classification of multicomponent signals is further complicated by the presence of cross terms when quadratic TFRs are used (see [3] and Article 4.2). Some TFRs that can analyze signals with multiple TF structures include the spectrogram [3], reassigned TFRs (see [4] and Article 7.2), and various adaptive TFRs [5]. The IF of various signals was also estimated using TFRs [6]. Although they work well in many applications, such TFRs do not automatically yield the IF of a signal in closed form, and do not always provide a well-concentrated representation without cross terms for analyzing signals with nonlinear TF structures. Thus, it is advantageous for an adaptive T F R to exactly match and classify signal components as many natural or synthetic signals have different linear and nonlinear IF.
12.2.2
Matching Pursuits for Analysis and Classification
The matching pursuit iterative algorithm of Mallat and Zhang decomposes a signal into a linear expansion of waveforms selected from a redundant and complete dictionary [7]. It uses successive approximations of the signal with orthogonal projections on dictionary elements. The dictionary consists of a basic Gaussian atom that is TF shifted and scaled, i T F R (called the modified WAgner distribution in [7])is obtained as a weighted sum of the WD of each selected element. This T F R is free of cross terms, and preserves signal energy, TF shifts, and scale changes on the analysis signal. In order to analyze linear FM signals more efficiently with fewer waveforms, rotated Gaussian atoms were included in the dictionary in [8]. On the other hand, a wave-based dictionary consisting of wavefronts, resonances, and linear ~ Antonia Papandreou-Suppappola, Telecommunications Research Center, Department of Electrical Engineering, Arizona State University, Tempe, AZ 85287-7206 USA (papandreou@asu.edu), and Seth B. Suppappola, Acoustic Technologies, Inc., 1620 South Stapley Dr., Mesa, AZ 85204 USA (seth.suppappola@acoustictech.com). Reviewers: F. Auger and X. Xia.
Time-Frequency Signal Analysis and Classification Using Matching Pursuits
511
FM signals was used to process scattering data in [9]. Although the matching pursuit algorithm in [7] works well for many signals, it uses many Gaussian atoms to represent a signal component with nonlinear TF characteristics. In addition, the modified WD is not well-concentrated along the nonlinear IF of some signal components. In order to be able to exactly match linear or nonlinear FM signals, a matching pursuit is used with a dictionary of waveforms that includes complex sinusoids with linear or nonlinear phase functions such as logarithmic and power functions. The aim of the adaptive (modified) matching pursuit (AMMP) is to be able to analyze and correctly classify multicomponent signals where each component has a specific (and often different) monotonic phase function that can be written in closed form [10]. The advantage of using a dictionary that is matched to the analysis data is that only a small number of elements are used to decompose the signal, and the algorithm gives fast and parsimonious results. At each iteration, the algorithm a d a p t i v e l y chooses the best dictionary element according to some condition, identifies its TF structure, and computes a specific T F R of the element. This T F R is adaptively chosen so that it provides a wellconcentrated representation (without interference terms) of the selected element. The resulting T F R is formed as a weighted linear superposition of these matched TFRs. In this respect, it is similar to the T F R obtained in [11]. The computation of the T F R is only necessary for signal analysis. The algorithm can also be used to identify and classify each signal component. This will be useful, for example, in the classification of multiple received signals in a detection application. Thus, as will be shown, the AMMP decreases the number of algorithm iterations, improves the TF concentration of different multiple nonlinear FMs, and correctly classifies the IF of each signal component.
Adaptive matching pursuit and TFR The AMMP is based exclusively on the original algorithm in [7], but it has some major differences [10]. First, the AMMP uses more than one type of basic atom in the dictionary. Particularly, the dictionary consists of a large class of different basic atoms each of which has the form of a nonlinear FM signal [Article 5.6]
g(t;
v/I.(t)l
)
(12.2.1)
which is uniquely specified by its FM rate A and its monotonic phase function ~(b). Note that v(t) - d ~ ( ~ ) is the IF of the signal in (12.2.1), and tr > 0 is a reference time. The dictionary may consist of only one type of FM signal with fixed ~(b) in (12.2.1) or a linear combination of them including sinusoids with ~(b) - b, linear FM signals with ~(b) - sgn(b)lbl 2 (where sgn(b) is - 1 for b < 0 and 1 for b > 0), hyperbolic FM signals with ((b) - In b, power FM signals with ~(b) - sgn(b)lbl ~, and exponential FM signals with ( ( b ) - e b. The dictionary is formed by transforming the FM signal in (12.2.1) as e j 2 r c~(a( -~--)) t-~ =
v/lav(a(t
- T))I eju= (~+~)~(~(~)) ,
(12.2.2)
Chapter 12:
512
Detection, Classification and Estimation
with the parameter vector 0_0_- [c, a, T] E (O - R3). The dictionary is normally formed with a linear (i.e., grid) spacing of the parameters in 0, although this is not a requirement. The unitary operators G~~), Ca, and Sr result in an IF shift c, scale change a, and constant time shift r, respectively, of the FM signal. Specifically, the operators transform a signal x(t) as (G~)x)(t) = x(t)eJ2~c~(,-~ ) (where ~(b) is the same function as in (12.2.1)), (CaX)(t) = ~ x ( a t ) , and ($rX)(t) - x ( t r). In (12.2.2), the transformation G(() results in a constant shift (from A to c + A) of the FM rate of the nonlinear FM signal instead of the constant frequency shift used in [7]. Hence, A is not a parameter of the optimization, and is instead used to specify a base FM rate in (12.2.1). Shifts in the FM rate are affected by varying the c parameter. This is performed in this way since the signals considered may be wideband as well as dispersive, thus a shift of the IF is a better matched transformation (that covers the entire TF plane) than a constant frequency shift. With appropriate normalization, the energy of g(t; ~, ;~, 0_) is restricted to be unity for every 0_ in order to ensure energy preservation when ~(b) is fixed [7,10]. The iterative procedure of the AMMP first projects the analysis signal x(t) = (Rox)(t) onto each element of the dictionary, and selects 1 g(t; ~o, )~, 0_o) based on the condition 2 [( x, g(~o, A, 0_o)}] > [( x, g(~, A,_O))1, V_O E 0 and for all possible phase functions ~(b) of the dictionary elements. This ensures that the signal component with the highest energy is separated first by being matched to the element g(t; ~Co,A,_Oo) that yields the largest correlation with the signal. This results in
x(t) --/~0 g(t; ~0,)~,O0)+ (nlx)(t)
(12.2.3)
with the expansion coefficient ~0 - ( x, g(~0, ~, 0_0_0)). The function ~o(b) is the phase function of the first selected element. For example, if this element is a hyperbolic FM signal, then ~0(b) - ln b. The second major difference of the AMMP from the algorithm in [7] is that the algorithm does not compute the WD of selected elements to form the modified WD. Instead, it adaptively uses the information that the first selected waveform has phase function ~0(b) to compute its IF shift covariant dispersively warped Wigner distribution (DWD) [Article 5.6]. The DWD is a warped version of the WD, Wy(t, f) - f _ ~ y(t + r/2)y*(t - 7/2)e-J27rfrdT, with the warping based on a monotonic and (possibly) nonlinear parameter function ~(b). In particular,
W(r
- Wu (tr~( t~t-) ' t~#(t) f )
(12.2.4)
t where tz(t) - ~ r and y(t) - (WCx)(t) - t~ltz(t~r162 [Article 5.6] is the warped signal. Note that a specific DWD is obtained simply by fixing its parameter function ((b). By matching ((b) in (12.2.4) to be equal to the 1 Note t h a t a subscript n in the p a r a m e t e r s Rn, ~n(b), 0__~, r~, and Cn, and a superscript n in a T F R T'~(t, f ) indicate the algorithm parameters at the (n + 1)th iteration. 9The inner product is defined as (x, g} = f--~c~ x(t)g* (t)dt.
Time-Frequency Signal Analysis and Classification Using Matching Pursuits
513
phase function ~0(b) in (12.2.3) (i.e. if ~(b) = ~0(b)), an adaptive representation for multiple structures (ARMUS) TFR, at this first iteration, is simply given as
T~
f)-
[/~o[2 w(~~ -g(~o,_Oo) (t , f) 9
At the second iteration, the residual function (RlX)(t) is obtained by solving (12.2.3), and it is decomposed in a similar manner as the signal z(t). At the ( n + l ) t h iteration, the condition
[(RnX, g(~n,On)}1 ~ ]{RnX, g ( ~ , O ) ) [ ,
VOeO
is used to decompose the nth residual function (Rnx)(t)
x)(t)
-
(12.2.5)
~ng(t; ~n,On)+
Zn --
(12.2.6)
is the expansion coefficient. The DWD of (Rnx)(t) is also obtained adaptively to match the TF structure of the nth residual function by letting ((b) = ~n(b) in
(12.2.4).
After a total of N iterations, the matching pursuit algorithm results in the signal decomposition N-1 -
g(t;
+
(12.2.7)
n--0
As the dictionary is complete [10], any signal can be represented as in (12.2.7) with N - oo (yielding (RN x)(t) -- 0) [7]. In actuality, when the signal components match the TF structure of the dictionary elements, the algorithm converges quickly. A maximum number of iterations and an acceptable small residue energy compared to the data energy are used as stopping criteria [7]. "The resulting ARMUS of the signal at the N t h iteration is the weighted sum of the corresponding DWD of each selected dictionary element N-1 ARMUSx(t, f) - TN-I(t ' f) -- E n--0
[~n[2 W(~n)
9(~,0.)
(t, f)
,
(12.2.8)
with the weights ]/~n] 2 defined in (12.2.6). Note that the same DWD (with fixed ~(b)) may be used in (12.2.8) for components with the same TF structure but different FM rate. It is important to note that the success of the algorithm depends highly on the choice of dictionary elements. As a result, pre-processing the data might be necessary in order to avoid a poor algorithm performance due to a mismatch between the analysis data and the dictionary elements used. Although this analysis technique follows directly from the matching pursuit in [7] with some simple modifications, it is very powerful once the dictionary is matched to the data not only for analysis but also for classification.
514
Chapter 12:
Detection, Classification and Estimation
Properties of the algorithm and the TFR An important property of the matching pursuit in (12.2.7) is its covariance to certain signal changes. Consider the decomposed signal x(t) - }--~n~__0~ng(t;~,A, 0__n) in (12.2.7) with g = cx~, and with identical WE structure dictionary elements (i.e. let ~n(b) - ~(b), V n). If the FM rate of a nonlinear FM signal x(t) is shifted by a constant amount to form B ~ t y(t) -- (G(~)x)(t) x(t) e j2~u (~), then its matching pursuit is simply given as y(t) - ~-]~n~176 g(t; ~,A,~n). Note that the expansion coefficients fin are not affected by this signal change. The parameter vector changes to ~n - [Cn + U, an, Tn] indicating that the time shifts Tn and the scale changes an remain the same, whereas the dictionary elements undergo a constant shift in their FM rate from (A + cn) to (A + Cn + U). Note that if ~(b) is a power or a logarithmic function, then we can show that the corresponding AMMP is also covariant to scale changes [10]. The ARMUS T F R in (12.2.8) also satisfies various properties that are desirable in many applications. By simply combining the DWDs of each selected dictionary element, no cross terms are introduced in the TFR. Also, the T F R preserves the underlying TF structure of each analysis signal component, and it provides a highly concentrated representation of each component as it does not apply any smoothing. Specifically, the DWD with parameter ~(b) of a nonlinear FM signal with phase function ~(b) results in the highly concentrated representation DWDg(~,~)(t, f; ~) = Iv(t)l 5 ( f - A v(t)) [Article 5.6] with v(t) - d ~ ( K t) . If a particular application uses signal components with only one type of TF structure, then we should form our dictionary using the corresponding nonlinear FM signal with matched IF. In such cases, the ARMUS satisfies other desirable signal properties such as the preservation of signal energy, and changes in the analysis signal's FM rate [10]. If the dictionary elements are either hyperbolic or power FM signals, then the T F R can also be shown to preserve scale changes. Implementation issues Since the algorithm parameters vary in order to form the dictionary elements for the matching pursuit, the computation is intensive. However, if the data is pre-processed, one can form a dictionary with elements which approximately span the data in TF structure. Thus, the algorithm iterates more rapidly. Additional speedup is possible if the matched DWD of each dictionary element is computed ahead of time. Since the last operation on the basic atom in (12.2.2) is time shifting, the inner products in the AMMP condition in (12.2.5) can be computed as a cross-correlation instead of introducing another layer of dictionary elements over all possible time shifts. This increases the computational speed since correlations can be implemented using the fast Fourier transform (FFT). Also, the memory consumption by the dictionary is significantly reduced since additional dictionary elements are not needed for every time shift. Moreover, since the elements do not change, and the residual data is constant during a given AMMP algorithm iteration, additional speedup could be achieved by pre-computing and storing the FFTs of these sequences. If the signal components are well-separated in time, the algorithm can simply find the time support and phase function of each selected element, and then use the
Time-Frequency Signal Analysis and Classification Using Matching Pursuits
515
information to analyze the actual data (instead of the selected waveforms) with its matched DWD. This will greatly reduce computation as only a few DWDs need to be obtained. If classification is needed without analysis, then there is no need to compute the TFR. The AMMP will provide the IF and the FM rate of each component. It can also provide other relevant parameters depending on how adapted the dictionary is to a particular application.
12.2.3
Simulation Example
A synthetic multicomponent signal is used to demonstrate the significance of the AMMP method. The signal is 512 points long (although it need not be a power of two) and consists of thirteen components: four windowed hyperbolic FM signals, five windowed linear FM signals, and four Gaussian waveforms, all with varying parameters. Their "ideal" TF representation shown in Fig. 12.2.1(a) is obtained by plotting the IF of each of the hyperbolic and linear FM signals (hyperbolae and lines with positive slope) and the characteristic TF curve of each Gaussian component (ellipses). The WD in Fig. 12.2.1(b) suffers from cross terms and makes it difficult to identify the true TF structure of each component. On the other hand, the spectrogram in Fig. 12.2.1(c) suffers from loss of resolution due to smoothing that complicates the identification of the exact number of signal terms. Furthermore, the spectrogram cannot provide parameter values for any signal component. The dictionary was formed using hyperbolic FM signals, linear FM signals, and Gaussian waveforms. Note that the Gaussian elements are not in the form of (12.2.1). However, they were included in the dictionary for a fair comparison with the matching pursuit in [7], and due to the a priori knowledge of their presence in the data. The decomposition approximates the data very well after only twenty iterations as demonstrated by overlaying the signal with its expansion. The ARMUS T F R in Fig. 12.2.1(d) provides a highly concentrated representation for all thirteen components without outer cross terms or loss of resolution. This is because it adaptively computes the Altes Q-distribution for selected elements with hyperbolic TF characteristics, and the WD for selected elements with linear or constant TF characteristics. Note that the mild spreading of the signal components as well as some inner interference terms are due to the fact that the data was windowed for processing. As the algorithm iterated twenty times, the ARMUS in Fig. 12.2.1(d) is a T F R of all twenty elements that the algorithm selected. However, only the thirteen elements matching the actual signal components are visible. This is because the energy term of the signal residues was very small for, seven of the twenty iterations. If T F R analysis is not required, the ARMUS T F R computation step is removed from the algorithm, and the results are used only for classification. In this example, the components were classified in three signal types (hyperbolic FM, linear FM, and Gaussian). Furthermore, the values of the FM rate of each hyperbolic and linear wavefbrm as well as the variance of each Gaussian component were directly obtained and found to be very similar to the parameters of the actual data. For further comparison, the matching pursuit with only Gaussian dictionary
516
Chapter 12:
Detection, Classification and Estimation
Fig. 12.2.1: (a) A linear combination of the ideal TF structure of each component of a signal consisting of four windowed hyperbolic FM signals, five windowed linear FM signals, and four windowed Gaussian signals. Note that the TF structure of a Gaussian waveform is represented as an ellipse delineating the approximate time and frequency support of the signal component. The signal is analyzed using (b) the Wigner distribution, (c) the spectrogram, (d) the ARMUS TFR, and (e) the modified WD in [7].
Time-Frequency Signal Analysis and Classification Using Matching Pursuits
517
elements was used to decompose the signal, and then analyze it using the modified WD [7] as shown in Fig. 12.2.1(e). After 275 iterations, the T F R does not provide a concentrated representation that can easily identify the T F structure of the linear and hyperbolic FM components even though no cross terms are present. Also, the algorithm does not provide a closed form estimate of the IF and FM rate of the signal components for classification. On the other hand, as it is well-matched to Gaussians, the four Gaussian signal components are highly concentrated. For relative comparison purposes, the speeds of the various methods can be contrasted as performed on a Pentium III based PC running MATLAB TM under Linux. Since the Wigner distribution and the spectrogram use no form of matching pursuit iteration, they are considered to provide essentially instant results requiring only a few seconds to generate a plot. The ARMUS representation required approximately five minutes to compute, whereas the modified Wigner distribution consumed approximately forty-five minutes. The slowness of the modified Wigner distribution is primarily due to the high number of terms (i.e., iterations) required to approximate the analysis signal since Gaussian dictionary elements are not well matched to all of the analysis signal components. In fact, it takes several Gaussians to represent a single hyperbolic FM signal. The ARMUS representation, however, also contains hyperbolic and linear FM dictionary elements and is consequently better matched to the analysis signal. A real data example using the AMMP method for classification can be found in [12].
12.2.4
Summary and Conclusions
In this article, we have presented a method of analyzing complex multicomponent time-frequency signal structures without the usual trade-off of T F resolution versus cross terms. The iterative approach is based on the matching pursuit of [7] but extended to include non-Gaussian signal types. Additionally, we generate a T F R which is free of cross terms by summing T F R s for the decomposed signal components. The T F R for each individual component is chosen to be ideally suited for the particular signal type extracted by the algorithm. Finally, since the T F R not only represents the individual components, but also parameterizes them, this algorithm lends itself nicely as a feature extraction tool for signal classification purposes.
References Ill L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [2] P. Flandrin, Time-Frequency/Time-Scale Analysis. San Diego: Academic Press, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [3] F. Hlawatsch and G. F. Boudreaux-Bartels, "Linear and quadratic time-frequency signal representations," IEEE Signal Processing Magazine, vol. 9, pp. 21-67, April 1992. [4] F. Auger and P. Flandrin, "Improving the readability of time-frequency and time-scale representations by the reassignment method," IEEE Trans. Signal Processing, vol. 43, pp. 1068-1089, May 1995.
518
Chapter 12:
Detection, Classification and Estimation
[5] R. G. Baraniuk and D. L. Jones, "A signal-dependent time-frequency representation: Optimal kernel design," IEEE Trans. Signal Processing, vol. 41, pp. 1589-1602, April 1993. [6] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals; Part 2: Algorithms and applications," Proc. IEEE, vol. 80, pp. 519-568, April 1992. With introductions by Michael J. Riezenman. [7] S. C_,.Mallat and Z. Zhang, "Matching pursuits with time-frequency dictionaries," IEEE Trans. Signal Processing, vol. 41, pp. 3397-3415, December 1993. [8] A. Bultan, "A four-parameter atomic decomposition of chirplets," IEEE Trans. Signal Processing, vol. 47, pp. 731-745, March 1999. [9] M. R. McClure and L. Carin, "Matching pursuits with a wave-based dictionary," IEEE Trans. Signal Processing, vol. 45, pp. 2912-2927, December 1997. [10] A. Papandreou-Suppappola and S. B. Suppappola, "Adaptive time-frequency representations for multiple structures," in Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pp. 579-583, Pocono Manor, PA, 14-16 August 2000. [11] S. Qian and D. Chen, "Decomposition of the Wigner distribution and time-frequency distribution series," IEEE Trans. Signal Processing, vol. 42, pp. 2836-2842, October 1994. [12] S. Pon Varma, A. Papandreou-Suppappola, and S. B. Suppappola, "Matching pursuit classification for time-varying acoustic emissions," in Proc. 35th Asilomar Conf. on Signals, Systems, and Computers, Pacific Grove, CA, 4-7 November 2001. Paper TA2-3.
System Identification using Time-Frequency Filtering
12.3
12.3.1
519
SYSTEM IDENTIFICATION USING TIME-FREQUENCY FILTERING o Problem Description
A discrete time system identification problem can be stated as follows:
y[n] - E q[n - k]x[k] + e[n],
(12.3.1)
k
where x[n] is a transmitted signal, q[n] is the impulse response of a linear time invariant (LTI) system, e[n] is an additive noise, and y[n] is the received signal. The problem is to identify the LTI system transfer function Q(w), i.e., the Fourier transform, of q[n] given the input and the output signals x[n] and y[n]. The conventional method for solving the above problem is the least-squares solution method that is equal to the cross-spectral method in stationary cases, i.e., the system transfer function Q(w)can be estimated by (see e.g. [1])
Szy(W)
Q(~) = S ~ ( ~ ) '
(12.3.2)
where Szy(W) is the cross-spectrum of x[n] and y[n], and Sxz(w) is the auto-spectrum of x[n]. When the additive noise e[n] in (12.3.1) is a zero-mean Gaussian process and statistically independent of the input signal x[n], the estimate in (12.3.2) is asymptotically unbiased but the performance is limited by the noise variance, or the signal-to-noise ratio (SNR). When this SNR is low, the performance of the estimate in (12.3.2) is poor as we will also see later. Since the auto-spectrum of the input signal x[n] is in the denominator in the estimate (12.3.2), the input signal is, in general, chosen as a pseudo-random signal with flat spectrum. With this kind of input signals, noise reduction techniques before system identification do not apply. In the following, we introduce a different technique [2] for the system identification problem. The main idea is as follows. Instead of pseudo-random signal x[n], chirp type signals are transmitted as training signals, which have wideband characteristics in the frequency domain but are concentrated in the joint time-frequency (TF) domain. The TF concentration property usually holds after passing through an LTI system (this will be seen later). Since a joint TF distribution usually spreads noise and localizes signals, in particular chirps, the receiver may use a TF analysis technique to map the received signal y[n] from the time domain into the joint TF domain. In this way, the SNR can be significantly increased in the joint TF domain [3]. Furthermore, TF filtering can be used in the TF plane to reduce the noise and the SNR in the time domain can be increased and therefore the system identification after denoising can improved. ~ X i a n g - G e n Xia, Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA (xxia@ee.udel.edu). Reviewers: Paulo M. Oliveira and Shie Qian.
520
Chapter 12:
Detection, Classification and Estimation
Fig. 12.3.1: TF filtering.
12.3.2
Time-Frequency Filtering
TF filtering (described in Chapter 11) includes three major components, namely a TF analysis or mapping, with a TF tool such as a TF distribution, from the time domain to the joint TF domain, a masking or filtering in the joint TF plane, and a TF synthesis from the joint TF plane back to the time domain, as illustrated in Fig. 12.3.1 for a chirp signal. Differently from what happens with the Fourier transform in the Fourier transform based filtering, the TF mapping in a TF filtering may not be onto. In other words, not every signal S[k, 1] defined in the joint TF plane corresponds to a time domain signal s[n] such that the TF mapping of the time domain signal s[n] is exactly equal to S[k, 1]. This causes problems in filtering in the TF domain, since the filtered signal in the TF domain may not correspond to any time domain signal as shown in Fig. 12.3.1 (a) and (e). An intuitive solution for this problem is to take the least-squares error (LSE) solution in the time domain, (see, for example, [4]). The LSE, however, usually does not have a desired TF characteristics in the TF domain. When a signal is very long, the computational load for the LSE solution
System Identification using Time-Frequency Filtering
521
is significantly high because of the inverse matrix computation. Based on these observations, an iterative algorithm based on the discrete Gabor transform (DGT) was proposed in [5]; see Article 11.2. Conditions on the convergence, properties of the limit signals, and the relationship between the LSE solutions and solutions from the iterative algorithms were obtained in [6], where a significant improvement over the LSE solution was also shown. In the remainder of this article, we adopt the DGT as the TF analysis. 12.3.2.1
Discrete Gabor Transform
Let a signal s[n], a synthesis window function h[n] and an analysis window function 7In] be all periodic with same period L. Then,
M-1N-1 s[k]- E E Cm,nhm,n[k], m=O n=O L-1
(12.3.3)
Cm,n - E s[k]7~,,~[k], k=0
(12.3.4)
hm,,~[k] - h[k - m A M ] W ~ Agk,
(12.3.5)
7m,n[k] - 7[k - r n A M ] W ~ ANk,
(12.3.6)
and WL -- exp(j27r/L), j -- x/%--1, where A M and A N are the time and the frequency sampling interval lengths, and M and N are the numbers of sampling points in the time and the frequency domains, respectively, M - A M -- N . A N -- L, M N > L (or A M A N <_ L). The coefficients Cm,n are called the d i s c r e t e G a b o r t r a n s f o r m (DGT) of the signal s[k] and the representation (12.3.3) is called the i n v e r s e d i s c r e t e G a b o r t r a n s f o r m (IDGT) of the coefficients Cm,n. Let H and G be the following A M A N by L and M N by L matrices, respectively: the element at the (raN + n)th row and the kth column in H is
h [ k - mAM]W~, ~xNk
O < m < M-10
< n < N-
l,O < k < L - 1
and the element at the kth row and the ( m N + n)th column in G is 7,~,~[k]-7*[k-mAM]W[
~/'Nk, O < m <_ M -
l, O < n < N -
l, O <_ k <_ L - 1 .
Then, the DGT and IDGT can be rewritten as C - Gs
and
s-
HC,
(12.3.7)
where the following condition is needed
HG-
I,
(12.3.8)
where I is the L x L identity matrix. (For more details, see [5] and Articles 4.9, 6.3 and 11.2.)
522
12.3.2.2
Chapter 12:
Detection, Classification and Estimation
Iterative T F Filtering A l g o r i t h m
Let D denote the mask in the T F plane, i.e., an M N by M N diagonal matrix with diagonal elements either 0 or 1. Let s be a signal with length L in the time domain. The first step in the time-variant filtering is to mask the D G T of s: C1 = DGs, where D masks a desired domain in the T F plane. Since the D G T G is a redundant transformation, the I D G T of C1, HC1, may not fall in the mask. In other words, in general, GHC1 =/=DGHCI, (12.3.9) which is illustrated in Fig. 12.3.1(e). An intuitive method to reduce the difference between the right and the left hand sides of (12.3.9) is to mask the right hand side of (12.3.9) again and repeat the procedure, which leads to the following iterative algorithm: So = s,
(12.3.10)
= DGsz,
Cl+l
(12.3.11)
sz+l = H C t + I ,
(12.3.12)
1 - 0, 1, 2, .... For more details, see Article 11.2 and reference [6].
12.3.3
Denoising for Received Signals through a Noisy Channel
We first describe some parameters. The training signal -
cos(
,
x[n] is
n = 0, 1,
''',
499.
(12.3.13)
The synthesis and analysis window functions h[n] and v[n] are the orthogonal-like functions from [5, 6] and Article 11.2. Fig. 12.3.2(a) shows the D G T of x[n]. We use 20 tap LTI systems in our numerical examples. The system model is N-1
y[n] - ~ q[k]x[n- k] + e[n],
(12.3.14)
k=O
where N = 20, e[n] is additive white Gaussian noise, independent of
x[n], and
N-1
sin]- E q[k]x[n- k],
(12.3.15)
k=0
is considered as the signal. The original SNR for the received signal is calculated by 1010g10
V,499
Inl =
9
System Identification using Time-Frequency Filtering
523
In the following, we randomly generate the channel q[n]. As an example, a random channel, the noiseless signal sin], the received time waveform y[n] with SNR= -4.5dB, and their Fourier spectrum are shown in Fig. 12.3.2(b). The D G T of the received signal y[n] with - 4 . 5 d B SNR is shown in Fig. 12.3.2(c). In Fig. 12.3.2(c), one is still able to see the chirp pattern in the joint time and frequency plane although that is impossible in the time or the frequency domain alone in Fig. 12.3.2(b).
12.3.3.1
Mask D Design
The pattern in the D G T domain of the above signal s[n] in (12.3.15) is similar to the one for the signal x[n] in Fig. 12.3.2(a). This is not only true for this particular example, but has also been the case for our numerous examples. The reason can be found in the following analytic argument. Assume the chirp signal x[n] - exp(jcn r) for some constants r _> 2 and c :/: 0. Then, r--1
s[n] - E q[kJx[n - k] - x[n] E q[k] exp(jc E c , n ' k r - ' ) , k
k
l-O
which is dominated by the original chirp x[n] for finite tap LTI systems q[k]. It is because that the highest chirp order of sin], r, and the corresponding chirp rate are the same as those of x[n], while the chirp order for the above multiplier of x[n] in r--1
h[k] exp(jc E c,n' k r - ' ) k
/=0
is only r - 1. As a special case, when r - 2, s[n] - x[n]t~(2cn), where (~(w) is the Fourier transform of the signal q[n]x[n]" (~(w) - ~-]k q[k]x[k] exp(-j2cnk). When the channel q[n] has only a finite tap, the function Q(a;) is usually a smooth signal. Since the transmitted signal x[n] is known to both transmitter and the receiver, by the above property its pattern in the D G T domain may help in designing a mask in the D G T domain for filtering noise, which motivates the following design algorithm. A M a s k D e s i g n Procedure" S t e p 1. Implement the DGT, obtaining the coefficients Cm,n of the transmitted signal x[k]. S t e p 2. Threshold the DGT coefficients Cm,n and have a mask Dx from Cm,n:
D~(m,n) -
I, if ICm,nl > to, O, otherwise,
where to is a predesigned positive number that is called a thresholding constant.
524
Chapter 12:
Detection, Classification and Estimation
S t e p 3. Implement Steps 1-2 for the received signal y[k] and design a mask Dy with thresholding constant tl from the D G T coefficients of y[n] with another predesigned constant tl > 0. S t e p 4. The final mask is the product of D~ and
Dy" D -- DzDy.
Since the D G T of the signal x[n] usually dominates the D G T of the signal s[n], the pattern in the DGT domain of the signal sin] is usually in a close neighborhood of the pattern in the D G T domain of x[n]. Therefore, the mask Dx is usually designed so that it covers a relatively large area, i.e., the thresholding constant to in Step 2 is usually chosen not too large. Since the received signal y[n] is from a noisy channel, the resolution of its DGT pattern may be reduced and therefore the thresholding constant tl in Step 3 is usually chosen not too small. Otherwise, the mask Dy will cover too much area you do not want. Let us see an example. The mask Dx from x[n], the mask Dy from y[n], their product D - D~Dy, and the mask Ds from the true signal s[n] are shown in Fig. 12.3.2(d), respectively. The SNR in this case is S N R = - I . 4 d B . The thresholding constants in Steps 1-3 are to = O.12.max(DGT(y)) and tl = O.15.max(DGT(y)). It should be pointed out that the above mask design procedure may be improved by using more sophisticated designs. Possible improvements are" (i) to find the optimal thresholding constants to and tl by training a large number of signals and systems; (ii) to use more sophisticated statistical detection method in the D G T domain for the received signal y[n] instead of a simple thresholding in Step 3; (iii) to smooth the mask D = DzDy since the true mask D s is usually smooth due to the nature of a chirp signal but Dy from the noisy signal y[n] may not be smooth. Some morphological operations, such as dilation, may be used for smoothing the mask D.
12.3.3.2 DenoisingExperiments We now implement the denoising using the iterative T F filtering algorithm with three masking techniques: using the mask D = Dx from the transmitted signal; using the mask D = DyDz as designed by Steps 1-4; using the true mask D = Ds. We run 100 tests in terms of different LTI systems q[n] (randomly generated) and different additive white Gaussian noises e[n] for each masking method and take their mean SNR. Nine iterative steps are used in the iterative algorithm (12.3.10)(12.3.12). Fig. 12.3.3(a) shows the curves of the mean SNR versus iterative steps for the three masking methods.
12.3.4
System Identification
In this section, we first use the iterative time-variant filter (12.3.10)-(12.3.12) developed in the previous sections to reduce the additive white Gaussian noise c[n] from the received signal y[n]. In the iterative time-variant filter, for calculation simplicity we choose the first masking method, i.e., the mask D = Dx, throughout this section. With this mask, two iterations are used in the T F filtering. We then implement the conventional system identification method, as shown in Fig. 12.3.3(b).
System Identification using Time-Frequency Filtering
525
Fig. 12.3.2: (a) The discrete Gabor transform of signal x[n]. (b) An example of LT! channel q[n], signal s[n] and received signal y[n], and their Fourier spectrum, where the SNR=-4.SdB for the additive white Gaussian noise. (c) The discrete Gabor transform of a received signal y[n] with SNR=-4.5dB. (d) Example of masks Dx from x[n], Dy from y[n], the final mask D = DxDy and the true mask Ds from sin].
The conventional system identification method used here is the cross-spectral method: S$x(w) (12.3.16) where x[n] is the chirp signal defined in (12.3.13) and ~ is the signal after the denoising. It is compared with the conventional method without denoising, i.e.,
Qold~ (w) - Sx~(W) '
(12.3.17)
where x[n] is also the chirp signal. Since the system identification performance usually depends on the signal x[n] you transmit, one might say that it is not fair to
526
Chapter 12:
Detection, Classification and Estimation
Fig. 12.3.3: (a) The mean SNR curves of the iterative time-variant filtering with the following masks: D = Dx, D = D x D y , and D = Ds. (b) New system identification method. (c) Comparison of system identification methods: the conventional method using chirp signals; the conventional method using pseudo-random signals; new method using chirp signals and time-variant filtering. (d) System identification examples: original spectrum IQ(~)I; identified spectrum without additive noise using the chirp signal; the conventional method with additive noise of SNR=-0.4dB; the new method with additive noise of SNR=-0.4dB.
compare them using the chirp signal that is preferred here for denoising purposes, but might not be preferred for other methods. For this reason, we also compare our new method with the conventional method using pseudo-random sequences: Qotd:(
) =
(12.3.18)
where 2[n] is a pseudo-random sequence. Fig. 12.3.3(c) shows their performances, where 200 tests are used for the mean SNR curves for the system spectrum versus the original SNR. Our new method performs much better than others. Surprisingly, even for the conventional cross
System Identification using Time-Frequency Filtering
527
spectral method, the chirp signal in (12.3.13) outperforms pseudo-random signals by approximately 6dB. In Fig. 12.3.3(d), some identification examples are shown, where the original SNR is -0.4dB. As a remark, all system identification calculations used in this article are based on the MATLAB T M Signal Processing Toolbox.
12.3.5
Summary and Conclusions
System identification can be performed by transmitting chirp signals and applying iterative time-frequency filters based on a discrete Gabor transform/expansion. The filters are implemented by projecting the signal back and forth between the time domain and the joint time-frequency domain. This system identification method has better performance than the conventional methods at low signal-to-noise ratios.
References [1] T. SSdertrSm and P. Stoica, System Identification. Englewood Cliffs. N J: Prentice-Hall, 1989. [2] X.-G. Xia, "System identification using chirp signals and time-variant filters in the joint time-frequency domain," IEEE Trans. Signal Processing, vol. 45, pp. 2072-2084, August 1997. [3] X.-G. Xia and V. C. Chen, "A quantitative SNR analysis for the pseudo Wigner-Ville distribution," IEEE Trans. Signal Processing, vol. 47, pp. 2891-2894, October 1999. [4] G. F. Boudreaux-Bartels and T. Wo Parks, "Time-varying filtering and signal estimation using Wigner distribution synthesis techniques," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 34, pp. 442-451, June 1986. [5] S. Qian and D. Chen, Joint Time-Frequency Analysis: Methods ~ Applications. Upper Saddle River, N J: Prentice-Hall, 1996. [6] X.-G. Xia and S. Qian, "Convergence of an iterative time-variant filtering based on discrete Gabor transform," IEEE Trans. Signal Processing, vol. 47, pp. 2894-2899, October 1999.
528
Chapter 12:
Detection, Classification and Estimation
T I M E - F R E Q U E N C Y M E T H O D S FOR SIGNAL E S T I M A T I O N AND D E T E C T I O N o
12.4
This article uses the generalized Wigner-Ville spectrum (GWVS) and the generalized Weyl symbol (GWS) to develop time-frequency (TF) techniques for the estiruction and detection of underspread nonstationary processes. These TF techniques extend optimal signal estimators (Wiener filters) and optimal signal detectors for the stationary case to underspread nonstationary processes. They are conceptually simple and intuitively appealing as well as computationally efficient and stable. We will first review some fundamentals (for more details see Articles 4.7 and 9.4). The GWVS of a nonstationary random process x(t) with correlation function rx(t, t') = E{x(t) x*(t')} is defined as
Wx (~) (t, f) ~=
rx t +
-~ - c~ 7, t -
-~ + ~ T e-J2=/'dT,
(12.4.1)
oo
where a is a real-valued parameter. The GWS of a linear, time-varying (LTV) system H with kernel (impulse response) h(t, t') is defined as
L(H~)(t,f) A
h t+
-~ - a
7, t -
-~ + a
T e-J2~/'dT.
oo
In the case of a stationary process, the GWVS reduces to the conventional power spectral density. Similarly, for a linear, time-invariant (LTI) system, the GWS reduces to the conventional transfer function (frequency response). n nonstationary process x(t) is called underspread if components of x(t) that are sumciently separated in the TF plane are effectively uncorrelated (see Article 9.4). Two random processes x(t), y(t) are called jointly underspread if they satisfy similar underspread conditions [1]. An LTV system H is called underspread if it produces only small TF displacements of the input signal (see Article 4.7).
12.4.1
Nonstationary Signal Estimation
The enhancement or estimation of signals corrupted by noise or other interference is important in many applications. Here, we consider estimation of a nonstationary, zero-mean random signal s(t) from a noise-corrupted observed signal x(t) = s(t) + n(t) by means of an LTV system H. The signal estimate is thus given by ~(t) = (Hx)(t) =
F
h(t, t') x(t') dt'.
(12.4.2)
(x)
The additive noise n(t) is nonstationary and assumed uncorrelated with s(t). ~ F. H l a w a t s c h and G. M a t z , Institute of Communications and I~adio-Frequency Engineering, Vienna University of Technology, Gusshausstrasse 25/389, A-1040 Vienna, Austria (fhlawats@pop.tuwien.ac.at, g.matz@ieee.org). Reviewers: P. Flandrin and S. Carstens-Behrens.
Time-Frequency Methods for Signal Estimation and Detection
529
The time-varying Wiener filter. The LTV system H that minimizes the meansquare error (MSE) E{ 116 - s 12} is the time-varying Wiener filter [2, 3]
H w = R~(R~ + Rn) -1.
(12.4.3)
Here, Rs and Rn denote the correlation operators 1 of signal and noise, respectively. For stationary random processes, H w is an LTI system whose frequency response is given by [2, 3] Hw(f) -
Ps(f) Ps (f) + Pn(f) '
(12.4.4)
where P~ (f) and Pn (f) denote the power spectral density of signal and noise, respectively. This frequency-domain expression involves merely a product and a reciprocal of functions (instead of a product and an inverse of operators as in (12.4.3)) and thus allows a simple design and interpretation of time-invariant Wiener filters. TF formulation of time-varying Wiener filters. We may ask whether a simple ibrmulation similar to (12.4.4) can be obtained for the time-varying Wiener filter H w by replacing H w ( f ) with the GWS L Hw (") (t, f) and P~(f) Pn(f) with the GWVS
Ws(~) (t, f), I47..~(~) (t, f). Indeed, for jointly underspread processes s(t) and n(t), it can be shown [1] that the time-varying Wiener filter H w can be written as the sum of two components: (i) an overspread (i.e., not underspread)system component that has negligible effect on the system's performance (MSE) and thus can be disregarded, and (ii) an underspread system component, hereafter denoted as H~r that allows the approximate TF formulation r(") (t f ) ~ ~u~ ,
W:~) (t, f )
__(~) W: ") (t, f ) + W n (t, f )
(12.4.5)
This TF formulation extends (12.4.4) to the underspread nonstationary case and allows a simple and intuitively appealing TF interpretation of the time-varying Wiener filter (see Fig. 12.4.1). Let 7~s and 7 ~ denote the effective support regions of W(~) (t, f) and W..(~) (t, f), respectively. In the "signal only" TF region T~s\7~n, (12.4.5) gives L H~ (~) (t, f) ~ 1 Thus, H~v passes all "noise-free" components of x(t) without attenuation or distortion. In the "noise only" TF region ~ n \ ~ s , (12.4.5) gives L H~ (~) (t, f) ~ 0, i.e., H~v suppresses all components of x(t) located in TF regions where there is no signal. Finally, in the "signal plus noise" TF region 7~s NT~n, I~g~ r(~) (t, f)l assumes values approximately between 0 and 1. Here, H~v performs an attenuation that depends on the signal-to-noise ratio at the respective TF point. 1The correlation operator 1~= of a nonstationary random process x(t) is the positive (semi-) definite linear operator whose kernel equals the correlation function ra(t, t') = E{x(t)x*(t')}.
530
Chapter 12:
Detection, Classification and Estimation
Fig. 12.4.1" TF interpretation of the time-varying Wiener filter Hw for jointly underspread signal and noise processes" (a) Effective TF support regions of signal and noise, (b) TF pass, stop, and transition regions of the time-varying Wiener filter.
TF design of time-varying Wiener filters. The TF formulation (12.4.5) suggests a simple TF design of nonstationary signal estimators. Let us define the "TF pseudoWiener filter" H w by setting its GWS equal to the right-hand side of (12.4.5) [1]"
L(-~) (t, f) ~-- W(~) Hw
W(~) (t, f ) ~)
(t, f) + W(
(t, f)
.
(12.4.6)
For jointly underspread processes s(t), n(t) where (12.4.5) is a good approximation, the TF pseudo-Wiener filter H w will closely approximate (the underspread part of) the optimal Wiener filter Hw; furthermore, it can be shown that H w will then be nearly independent of the value of a used in (12.4.6). For processes s(t), n(t) that are not jointly underspread, however, H w must be expected to perform poorly. Whereas the TF pseudo-Wiener filter H w is designed in the TF domain, the signal estimate g(t) can be calculated in the time domain according to (12.4.2). The impulse response of Hw is obtained from L (a) (t, f) as (cf. Article 4.7) Hw
S;D(-')( .
I § -)' §
7')
dr.
(12.4.7)
An efficientimplementation of the TF pseudo-Wiener filterH w that is based on the multiwindow short-time Fourier transform is discussedin [I,4]. Compared to the Wiener filterHw, the TF pseudo-Wiener filterH w possesses two practicaladvantages. First,the priorknowledge required for calculatingH w is ~(~) given by the OWVS W: .) (t, f) and W n (t, f) that are more intuitive and easier to handle than the correlation operators R~ and Rn. Second, the TF design (12.4.6) is less computationally intensive and more stable than (12.4.3) since it requires pointwise (scalar) divisions of functions instead of operator inversions. Robust TF Wiener filters. The performance of the filters H w and H w is sensitive to deviations of the second-order statistics (correlations or GWVS) from the nominal
531
Time-Frequency Methods for Signal Estimation and Detection
statistics for which these filters were designed 9 This motivates the use of minimax robust Wiener filters that optimize the worst-case performance (maximum MSE) within specified uncertainty classes of second-order statistics [5, 6]. Consider a partition of the TF plane into K mutually disjoint TF regions Hi, i = 1, ..., K. Extending the stationary case definition in [7], we define so-called ppoint uncertainty classes $ and Af as the sets of all nonnegative TF functions (not necessarily valid GWVS) Ws (t, f) and Wn (t, f) that have prescribed energies si and ni, respectively, within Hi, i.e., f f n Ws(t, f ) d t d f = si and f f n Wn(t, f ) d t d f - ni for i = 1, ..., K. For these uncertainty classes, the GWS of the minimax robust TF Wiener filter H a is given by [5, 6] K
si +s~ ni In, (t, f)
a (t, f) L H(~) E
(12.4.8)
i--1
where In, (t, f) is the indicator function of TQ (i.e., In, (t, f) is 1 for (t, f) inside 7~i and 0 outside Hi) 9 Note that L HFt (a) (t, f) is piecewise constant, expressing constant TF weighting within Hi. The performance of HR is approximately independent of the actual second-order statistics as long as they are within S, Af [5, 6]. Signaladaptive, online implementations of robust time-varying Wiener filters using local cosine bases have been proposed in [6, 8]. Simulation results. Fig. 12.4.2(a), (b) shows the Wigner-Ville spectra (GWVS with c~ = 0) of jointly underspread signal and noise processes. The Weyl symbols (GWS with c~ = 0) of the corresponding Wiener filter H w , of its underspread part H ~ , and of the TF pseudo-Wiener filter H w are shown in Fig. 12.4.2(c)-(e). It is verified that the Weyl symbol of H w approximates that of H ~ . The mean SNR improvement achieved by the TF pseudo-Wiener filter H w was obtained as 6.11dB; this is nearly as good as that of the Wiener filter H w (6.14dB). To illustrate the performance of the robust TF Wiener filter H a , we used K = 4 rectangular TF regions Hi to define p-point uncertainty classes S and Af as described above. The regional energies si and ni were obtained by integrating the nominal Wigner-Ville spectra in Fig. 12.4.2(a), (b) over the TF regions Hi. The Weyl symbol of the robust TF Wiener filter HR in (12.4.8) is shown in Fig. 12.4.2(f). Fig. 12.4.3 compares the nominal and worst-case performance of the Wiener filter H w (designed for the nominal Wigner-Ville spectra in Fig. 12.4.2(a), (b)) with the performance of the robust TF Wiener filter HR. It is seen that HR achieves a substantial performance improvement over H w at worst-case operating conditions with only a slight performance loss at nominal operating conditions.
12.4.2
Nonstationary Signal Detection
Next, we consider the discrimination of two nonstationary, zero-mean, Gaussian random signals xo(t) and xl(t). The hypotheses are H0:
x(t) = xo(t)
vs.
=
Chapter 12:
532
Detection, Classification and Estimation
Fig. 12.4.2" TF representations of signal and noise statistics and of various Wiener-type filters: (a) Wigner-Ville spectrum of s(t), (b) Wigner-Ville spectrum of n(t), (c) Weyl symbol of Wiener filter Hw, (d) Weyl symbol of underspread part H~v of Hw, (e) Weyl symbol of TF pseudo-Wiener filter Hw, (f) Weyl symbol of robust TF Wiener filter HR. The time duration is 128 samples; normalized frequency ranges from - I/4 to 1/4. The optimal detector. The optimal likelihood ratio detector quadratic form of the observed signal x(t),
A ( x ) - - ( H L x , x>---
/?/? OO
hL(t,t')x(t')x*(t)dtdt',
[2, 3] calculates a
(12.4.9)
OO
with the operator (LTV system) Ha given by HL = Rxlo - R ~
= R-~:(Rxl-Rzo)Rx:.
(12.4.10)
The test statistic A(x) is then compared to a threshold to decide whether H0 or H1 is in force. For stationary processes, A(x) can be expressed in terms of the Fourier transform X ( f ) of x(t) and the power spectral densities of xo(t) and zl (t) as A(x) -
/?
[X(f)l 2 HL(f)df,
with Ha(f) = Pxl (f) - Pxo(f) (12.4.11) Pxo(f)P=~(f) 9
This frequency-domain expression involves simple products and reciprocals of functions (instead of operator products and inverses as in (12.4.10)) and thus allows a
533
Time-Frequency Methods for Signal Estimation and Detection
Fig. 12.4.3: Comparison of the performance (output SNR vs. input SNR) of the ordinary Wiener filter H w and the robust TF Wiener filter HR for various operating conditions.
simple interpretation and design of likelihood ratio detectors in the stationary case. TF formulation of nonstationary detectors.
It is known [9] that the quadratic
test statistic in (12.4.9) can be rewritten as A(x) where
/?/? c<)
c~
W(a)(t, f) L(~)*(t, f) dtdf
x t + (~ -
Wx(~) (t, f ) -
(12.4.12)
HL
t - (~ +
oo
is the generalized Wigner distribution [10] of the observed signal x(t). Thus, A(x) can be interpreted as a weighted integral of W (a) (t, f), with the TF weight function being the conjugate of the GWS of the operator HE. In analogy to Section 12.4.1, a simplified approximate TF formulation of A(x) exists for jointly underspread processes x0(t), xl(t). Here, the operator HE can be written as the sum of an overspread component whose effect is negligible and an underspread component, denoted H~, whose GWS can be approximated as [11]
L(a) (t f ) . ~ H~ '
w~, ~) (t, f) - W~o (t f) ' .
(12.4.13)
~ (Xo a ) (t ~ f) W~Xl(~) (t, f)
Substituting this approximation of L H(~) (t, f) for L H(~) (t, f) in (12.4.12) we obtain ~ L the following approximate TF formulation of our test statistic, A(x) ~
~
~
W(~)(t' f)
~1 (t, f) - W~o (t, f)
-~.(~t---~-;--~: =o ( , f ) =, (t,J)
dtdf .
(12.4.14)
This TF formulation extends (12.4.11) to the underspread nonstationary case and allows an intuitively appealing TF interpretation that is analogous to the one given in Section 12.4.1 in the context of the approximation (12.4.5).
Chapter 12:
534
Detection, Classification and Estimation
TF design of nonstationary detectors. The TF formulation (12.4.14) suggests a simple TF design of nonstationary detectors. In analogy to (12.4.12), we define the test statistic
A(x) ~-
/?/:
W (a) (t, f) L )* (t, f ) d t d f ,
O0
O0
L
where the operator (LTV system) HL is defined by setting its GWS equal to the right-hand side of (12.4.13) [11]"
W ('~)(t,f)
L(-~)(t,f) A ..x, HL
--(~)
%0 (t, f)
W (c~)(t,f)
---xo
) (t, f)
"
For jointly underspread processes xo(t), xl(t) where (12.4.13) is a good approximation, HL will closely approximate (the underspread part of) HL, and thus the performance of the TF designed detector A(x) will be similar to that of the optireal likelihood ratio detector A(x). For processes xo(t), x~(t) that are not jointly underspread, however, A(x) must be expected to perform poorly. Whereas the detector A(x) is designed in the TF domain, it can be implemented in the time domain in complete analogy to (12.4.9). The impulse response hL(t, t') of HL can be obtained from L (~) (t, f) by an inverse Weyl transformation (cf. (12.4.7)). HL
An efficient implementation of the TF detector A(x) that uses the multiwindow short-time Fourier transform is discussed in [12]. Compared to the likelihood ratio detector A(x), the TF designed detector/~(x) is practically advantageous because the statistical a priori knowledge required for its design is formulated in the intuitively accessible TF domain, and because its design is less computationally intensive and more stable since operator inversions are replaced by pointwise divisions of functions. These advantages are analogous to the advantages of the TF pseudo-Wiener filter discussed in Section 12.4.1. Minimax robust detectors that are analogous to the minimax robust Wiener filter in Section 12.4.1 are reported in [13].
Simulation results. We first consider xo(t) - n(t) and Xl (t) : 8(t) -~- n(t), where signal s(t) and noise n(t) are jointly underspread, uncorrelated, zero-mean, Gaussian processes with Wigner-Ville spectra as shown in Fig. 12.4.4(a), (b). From Fig. 12.4.4(c), (d), we verify that the Weyl symbols of the optimal operator HL and the TF designed operator HL are effectively identical. Also, Fig. 12.4.4(e), (f) shows that the performance of the TF designed detector/~(x) closely approximates that of the likelihood ratio detector A(x). Our next example concerns the detection of knocking combustions in car engines (see Article 15.2 and refs. [12,14,15] for background and details). Here, xo(t) is the nonknocking signal and Xl (t) is the knocking signal. Estimates of the correlations of xo(t) and x~ (t) were computed from a set of labeled training data, 2 and estimates 2We are grateful to S. Carstens-Behrens, M. Wagner, and J. F. Bhhme for illuminating discussions and for providing us with the labeled car engine data.
Time-Frequency Methods for Signal Estimation and Detection
535
Fig. 12.4.4" Comparison of likelihood ratio detector A(x) and TF designed detector A(x): (a) WignerVilla spectrum of s(t), (b) Wigner-Ville spectrum of n(t), (c) Weyl symbol of HL, (d) Weyl symbol of ~IL, (e) receiver operator characteristics (ROC) [2] of A(x), (f) ROC of/~(x). The ROCs were obtained by Monte Carlo simulation. The time duration is 128 samples; normalized frequency ranges from -- 1/4 to 1/4.
of the Wigner-Ville spectra (shown in Fig. 12.4.5(a), (b)) were derived according to (12.4.1). The likelihood ratio detector A(x) and the TF designed detector/~(x) were constructed using these estimated statistics, and the performance of these detectors was analyzed by applying them to a different set of labeled data. It can be seen from Fig. 12.4.5(c) that the TF designed detector performs significantly better than the theoretically optimal likelihood ratio detector. This is due to numerical problems that occurred in the design of the likelihood ratio detector. Specifically, the estimated correlation matrices 3 were poorly conditioned. Despite the use of pseudoinverses, the inversion of these matrices (which is required for the design of the likelihood ratio detector) could not be stabilized sufficiently. In contrast, the design of the TF detector merely involves a pointwise division of the estimated WignerVille spectra. This is much less affected by numerical problems since divisions by near-to-zero values can easily be stabilized by means of a thresholding.
12.4.3
Summary and Conclusions
The generalized Wigner-Ville spectrum (GWVS) provides a natural extension of the power spectral density to underspread, nonstationary random processes. Similarly, 3In the discrete-time case, correlation operators are replaced by correlation matrices.
536
Chapter 12:
Detection, Classification and Estimation
Fig. 12.4.5: Detection of knocking combustions: (a) Estimated Wigner-Ville spectrum of nonknocking combustion process xo(t), (b) estimated Wigner-Ville spectrum of knocking combustion process Xl (t) (crank angle is proportional to time; signal leng~ch is 186 samples), (c) ROCs of likelihood ratio detector A(x) (dashed line) and TF designed detector A(x) (solid line). the generalized Weyl symbol (GWS) provides a natural extension of the transfer function (frequency response) to underspread, time-varying linear systems. In this article, we have considered the application of the GWVS and GWS to the estimation and detection of underspread, nonstationary random processes. Using the GWVS and GWS, it was possible to extend classical stationary estimators and detectors to the nonstationary case in an intuitive manner. We note that the general approach replacing the power spectral density with the GWVS and the transfer function with the G W S - - i s applicable to other problems of statistical signal processing as well, as long as the nonstationary processes involved are (jointly) underspread. Further time-frequency methods for nonstationary signal estimation and detection are discussed in Articles 8.3, 9.2, 12.1, and 15.2 as well as in refs. [15-18].
References [1] F. Hlawatsch, G. Matz, H. Kirchauer, and W. Kozek, "Time-frequency formulation, design, and implementation of time-varying optimal filters for signal estimation," IEEE Trans. Signal Processing, vol. 48, pp. 1417-1432, May 2000. [2] H. V. Poor, An Introduction to Signal Detection and Estimation. New York: Springer, 1988. [3] H. L. L. Van Trees, Detection, Estimation, and Modulation Theory, vol. I: "Detection, Estimation, and Linear Modulation Theory". New York: Wiley, 1968. Reprinted 2001. [4] W. Kozek, H. G. Feichtinger, and J. Scharinger, "Matched multiwindow methods for the estimation and filtering of nonstationary processes," in Proc. IEEE Internat. Syrup. on Circuits and Systems (ISCAS 96), vol. 2, pp. 509-512, Atlanta, GA, 12-15 May 1996. [5] G. Matz and F. Hlawatsch, "Minimax robust time-frequency filters for nonstationary signal estimation," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'99), pp. 1333-1336, Phoenix, AZ, 15-19 March 1999.
Time-Frequency Methods for Signal Estimation and Detection
537
[6] G. Matz and F. Hlawatsch, "Minimax robust nonstationary signal estimation based on a p-point uncertainty model," J. Franklin Institute, vol. 337, pp. 403-419, July 2000. [7] S. A. Kassam and H. V. Poor, "Robust techniques for signal processing: A survey," Proc. IEEE, vol. 73, pp. 433-481, March 1985. [8] G. Matz, F. Hlawatsch, and A. Raidl, "Signal-adaptive robust time-varying Wiener filters: Best subspace selection and statistical analysis," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), pp. 3945-3948, Salt Lake City, UT, 7-11 May 2001. [9] P. Flandrin, "A time-frequency formulation of optimum detection," IEEE Trans. Acoustics, Speech, ~4 Signal Processing, vol. 36, pp. 1377-1384, September 1988. [10] F. Hlawatsch and P. Flandrin, "The interference structure of the Wigner distribution and related time-frequency signal representations," in The Wigner Distribution-Theory and Applications in Signal Processing (W. Mecklenbr/iuker and F. Hlawatsch, eds.), pp. 59-133, Amsterdam: Elsevier, 1997. [11] G. Matz and F. Hlawatsch, "Time-frequency formulation and design of optimal detectors," in Proc. IEEE-SP Internat. Syrnp. on Time-Frequency ~ Time-Scale Analysis, pp. 213-216, Paris, 18-21 June 1996. [12] G. Matz and F. Hlawatsch, "Time-frequency methods for signal detection with application to the detection of knock in car engines," in Proc. Ninth IEEE Workshop on Statistical Signal and Array Processing (SSAP-98), pp. 196-199, Portland, OR, 14-16 September 1998. [13] G. Matz and A. Raidl, "Robust detection of nonstationary random signals belonging to p-point uncertainty classes," in Proc. IEEE Inter'nat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'03), pp. 641-644, Hong Kong, scheduled 6-10 April 2003. [14] S. Carstens-Behrens, M. Wagner, and J. F. BShme, "Detection of multiple resonances in noise," Archly fiir Elektronik und Ubertragungstechnik (Internat. J. of Electronics Communications), vol. 52, no. 5, pp. 285-292, 1998. [15] G. Matz and F. Hlawatsch, "Time-frequency subspace detectors and application to knock detection," Archly fiir Elektronik und Ubertragungstechnik (Internat. J. of Electronics ~4 Communications), vol. 53, pp. 379-385, December 1999. [16] H. A. Khan and L. F. Chaparro, "Nonstationary Wiener filtering based on evolutionary spectral theory," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), vol. 5, pp. 3677-3680, Munich, 21-24 April 1997. [17] A. M. Sayeed and D. J. Jones, "Optimal detection using bilinear time-frequency and time-scale representations," IEEE Trans. Signal Processing, vol. 43, pp. 2872-2883, December 1995. [18] J. A. Sills and E. W. Kamen, "Time-varying matched filters," Circuits, Systems, Signal Processing, vol. 15, no. 5, pp. 609-630, 1996.
This Page Intentionally Left Blank
Part V
Engineering Applications
This Page Intentionally Left Blank
Chapter 13
Time-Frequency Methods in Communications The wide range of potential applications of time-frequency methods made them an important tool in most fields of science and engineering. A large number of approaches exist, depending on the application considered. Key time-frequency methodologies are presented on specific applications in this Part V of the book and illustrated using selected examples. Telecommunications is one of the key industries where time-frequency methods are already playing an important role. The topic is represented by four articles selected for this chapter, complemented by articles in other chapters such as Articles 8.5 and 11.3. Due to possible hostile jamming, broadband communication platforms use spread spectrum technology where interference protection is achieved by interference excision. By distributing the signature of received data over a time-frequency region, it is possible to attenuate strong interferences (Article 13.1). Linear dispersion in wireless communication channels distorts the transmitted signal in both time and frequency. This is accounted for by a time-frequency scattering function. In CDMA systems, fading and multi-access interference can be dealt with using time-frequency processing. A time-frequency RAKE receiver is described which accounts for both spectral and temporal channel variations (13.2). Eigenfunctions of linear systems can be modeled by signals with a time-frequency distribution well localized in the time-frequency plane. The knowledge of the eigenfunctions of time-varying transfer functions enables us to optimize the transmission strategy and take advantage of the channel dispersive properties (13.3). Detection and parameter estimation of chirps in communication systems may be implemented using the fractional FT (13.4).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
541
542
13.1
13.1,1
Chapter 13:
Time-Frequency Methods in Communications
T I M E - F R E Q U E N C Y INTERFERENCE M I T I G A T I O N IN SPREAD SPECTRUM C O M M U N I C A T I O N SYSTEMS o
Spread-Spectrum Systems and Interference
In modern broadband communication systems, mitigation of correlated interference is an important aspect of system performance. Benign jamming sources exist ubiquitously in the transmission channel of multiple access systems like cellular telephony and wireless networks. Hostile jamming is certainly a significant issue in military communication systems. For these and many other reasons, broadband communication platforms employ spread spectrum (SS) technology, in which [1] (1) The signal occupies a bandwidth much in excess of the minimum bandwidth necessary to send the information. (2) Spreading is accomplished by means of spreading signal, which is often called a code signal that is independent of the data. (3) At the receiver, despreading for the recovering the original data is accomplished by the correlation of the received spread signal with a synchronized replica of the spreading signal. A commonly employed SS technique called direct sequence (DS) is to superimpose a pseudorandom noise (PN) sequence on the data bits. This effectively widens the signal bandwidth by a factor proportional to the ratio of the rate of the PN sequence divided by the data rate. The advantage of this spreading is that the wider bandwidth essentially marginalizes narrowband interference sources so they have a smaller effect on overall system performance. At the receiver, the cross correlation with the replica of the PN sequence transfers the information signal back to its original bandwidth while reducing the level of the narrowband interference by spreading it across the bandwidth occupied by the PN sequence. Since the availability of the code at the receiver enables despreading and recovery of data while spreading and suppression of interference, any level of interference rejection can be achieved by using sufficient processing gain. This, however, may entail increasing the bandwidth of the transmitted signal beyond the limits of the available frequency spectrum. Therefore, signal processing techniques have been used in conjunction with the DS spread spectrum receiver to augment the processing gain, permitting greater interference protection without an increase in bandwidth [2]. Typically, interference excision in DSSS systems is performed in the following domains [3]. Frequency Domain: the FFT of the data over the information bit is weighted by appropriate values and then transformed back to the time domain. This is an effective method for stationary narrowband interference. Sidelobes may present a problem in removing the interference without losing some of the signal energy. Time Domain: this includes clipping or gating the high energy regions. It also ~ M o e n e s s G. A m i n , Villanova University, Villanova, PA, and A l a n R. Lindsey, Air Force Research Laboratory, Rome, NY. Reviewers: S. Batalama and D. L. Jones.
Time-Frequency Interference Mitigation in Spread Spectrum Communication Systems
543
Fig. 13.1.1: Excision methods for nonstationary signals.
includes Wiener filtering, adaptive linear predictors, and smoothers. Tracking is highly dependent on the signal-to-noise ratio (SNR) and often fails under rapidly time-varying interference. Time and Frequency Domains: a transversal filter is designed from the spectral information of the data. Spectral estimation methods combined with open-loop adaptive filtering have been shown to suffer from the same drawbacks as frequencydomain techniques. Wavelet/Gabor domain: the discrete wavelet transform (DWT) or the Gabor transform is applied to the data, and the coefficients of high energy are removed prior to the inverse transform. The DWT is appropriate for cases of pulse jamming or interference with burst characteristics. The Gabor transform is an effective excision tool only when the interference is consistent with the corresponding tiling of the time-frequency ((t, f)) plane. The same is true for the wavelet transform. None of the above methods is capable of effectively incorporating the timevarying nature of the interference frequency characteristics. These methods all suffer from their lack of intelligence about the other domain(s) and therefore are limited in their results and their applicability. To illustrate, Fig. 13.1.1 shows that most frequency-and time-domain excisions, in essence, respectively, remove all desired signal information over the frequency band A F and time duration A T . As such, in the case of time-varying interferences, frequency-domain methods ignore the fact that only few frequency bins are contaminated by the jammer at a given time. Dually, time domain excision techniques, do not account for the cases where only few time samples are contaminated by the jammer for a given frequency. Applying either method will eliminate the interference but at the cost of unnecessarily reducing the desired signal energy. The above example clearly demonstrates that nonstationary interferers, which
Chapter 13:
544
Time-Frequency Methods in Communications
have model parameters that change with time, are particularly troublesome due to the inability of single-domain mitigation algorithms to adequately ameliorate their effects. In this challenging situation, and others like it, joint time-frequency techniques can provide significant performance gains, since the instantaneous frequency (IF), the instantaneous bandwidth, and the energy measurement, in addition to myriad other parameters, are available. The objective is then to estimate the timefrequency signature of the received data using time-frequency distributions (TFDs), attenuating the received signal in those time-frequency regions that contain strong interference. This is depicted by the region in between the fine lines in Fig. 13.1.1.
13.1.2
Typical Signal Model
The signal model r(t) arriving at a receiver is in the form
r(t) = s(t) + n(t) + j(t)
(13.1.1)
which is composed of the desired spectrally spread signal s(t), the composite additive jamming signal j(t) which may be made up of several different jammers from various sources, and the obligatory uncorrelated thermal noise, n(t), commonly assumed to be white with a Gaussian distribution.
13.1.3
A Time-Frequency Distribution Perspective
A straightforward application of time-frequency methods in the underlying problem is the design of adaptive notch filters based on the instantaneous frequency of the jamming signal. Amin pioneered this approach in [3], using the assumption that n(t) and s(t) are spread across the (t, f) plane and without features. In this model, the correlated features in j(t) rise conspicuously in the (t, f) distribution performed at the input of the receiver, and subsequent instantaneous frequency information allows for the design of a strong notch filter that effectively excises only the portion of the (t, f) spectrum contaminated by the jammer. The process is illustrated in Fig. 13.1.2. The notch filter developed from the TFD can be very short having been shown to be very effective with as few as three or five coefficients. Wang and Amin [4] considered the performance analysis of this system using a general class of multiple-zero FIR excision filters showing the dependence of the bit error rate (BER) on the filter order and its group delay. The effect of inaccuracies in the jammer IF on receiver performance was also considered, as a function of the filter notch bandwidth. Closed form approximations for signal to interference plus noise ratio (SINR) at the receiver are given for the various cases. The general form of the receiver SINR is given by E2[D~] S I N R - Var(Ds) + V a r ( D j ) + Var(Dn)
(13.1.2)
where the three quantities in the denominator represent the variances of the decision variable, due to the SS signal, the jammer, and the noise respectively. E[.] is the
Time-Frequency Interference Mitigation in Spread Spectrum Communication Systems
Adjustable
Input
545
Correlator
Filter
I
TFD IF-Estimate
Filter Replica
PN
Fig. 13.1.2: TFD Excision System of Amin et al.
expectation operator, and with typical independent and uncorrelated characteristics of jammers and noise, it only leads to non-zero values when applied to the desired signal. If the jammer is fully excised then Var(Dj) = 0. It is clear from the above equation that one of the main goals of the excision process should be reducing the self-noise represented by the term Var(Ds). One of the drawbacks to the notch filter approach in [3] is the infinite notch depth due to the placement of the filter zeros. The effect is a "self-noise" inflicted on the received signal by the action of the filter on the P N sequence underlying the spread information signal. This problem led to the design of an open-loop filter with adjustable notch depth based on the jammer energy. The notch depth is determined by a variable embedded in the filter coefficients chosen as the solution to an optimization problem which maximizes receiver SINR cost function given in (13.1.2). The TFD is necessary for this work, even for single component signals, because simple IF estimators do not provide energy information. Amin, Wang, and Lindsey accomplished this work in [5], incorporating a "depth factor" into the analysis and redeveloping all the SINR calculations. The result was significant improvement in SINR, especially at mid-range jammer-to-signal ratios (JSRs), typically around 0 to 20 dB. Barbarossa and Scaglione [6] proposed a two-step procedure based on dechirping techniques commonly applied in radar algorithms (see Article 13.3). In the first step the time varying jammer is converted to a fixed frequency sinusoid eliminated by time invariant filters. The process is reversed In the second step and the jammer-free signal is multiplied by the jammer TF signature to restore the DSSS signal and noise characteristics which have been strongly impacted in the first phase. Comparison of this technique with time-varying excision filters is yet to be conducted in terms of computational complexity and robustness to IF estimation errors. Synthesis of the interfering signal from information available at the receiver followed by direct subtraction [7] is an approach put forth by Lach, Amin, and Lindsey.
Chapter 13:
546
Time-FrequencyMethods in Communications
In many situations, it is possible to make assumptions about certain key parameters which describe fully the nature and composition of a jammer, and then utilize signal processing to extract these parameters from the received signal. Once the parameters are determined, a replica of the jammer can be derived and subtracted from the incoming signal to produce an essentially jammer free channel. However, for this process to work, a jammer of constant modulus and polynomial phase is required. These parameters are extracted at the receiver via time-frequency distribution where the optimally matched signal in a least squares sense is constructed. The jamming signal is projected to the constant modulus and then phase-matched. The result, especially in high SNR environments, is a signal matching the jammer in amplitude, frequency profile and phase. The last step of generating the difference signal, which is theoretically also the desired signal after the subtraction of the jammer, is straightforward. To overcome the drawbacks of the least squares synthesis methods Amin, Ramineni and Lindsey [8] proposed a projection filter approach in which the nonstationary interference subspace is constructed from its TF signature. Since the signal space at the receiver is not specifically mandated, it can be rotated such that a single jammer becomes one of the basis functions. In this way, the jammer subspace is one dimensional and its orthogonal subspace is jammer-free. A projection of the received signal onto the orthogonal subspace accomplishes interference excision with a minimal message degradation. The projection filtering methods compare favorably over the previous notch filtering systems.
13.1.4
Example
At this point, in order to further illustrate these excision methods, the work in [8] will be detailed since it includes comparisons between the two most prominent techniques based on time-frequency distributions currently being studied- notch filtering and projection filtering. The signal model is, as expected, given by (13.1.1), and the major theme of the work is to annihilate interference via projection of the received signal onto a "jammer-free" subspace generated from the estimated jammer characteristics. Fig. 13.1.3, reprinted from [8], clearly illustrates the trade-offs between projection and notch filtering based on the JSR. In the legend, the variable a represents the adaptation parameter for the notch filtering scheme and N represents the block size, in samples, for a 128 sample bit duration in the projection method. Thus, N=128 means no block processing and N=2 corresponds to 64 blocks per bit being processed for projection. Since the projection and non-adaptive notch filter techniques are assumed to completely annihilate the jammer, their performance is decoupled from the jammer power, and therefore correctly indicate constant SINR across the graph. The dashed line representing the notch filter with a=0 is really indicating no filtering at all, since the adaptation parameter controls the depth of the notch. It is evident from Fig. 13.1.3 that without adaptation a crossover point occurs around 2 dB where filtering with an infinitely deep notch is advantageous. Thus
Time-Frequency Interference Mitigation in Spread Spectrum Communication Systems
547
Fig. 13.1.3: Comparison between projection and notch filtering excision methods.
when jammer power exceeds this point, presumably a user would flip a switch to turn on the excision subsystem. However, with adaptation this process happens automatically, while giving superior performance in the midrange. For the projection technique, the block size determines receiver performance conspicuously (ceteris paribus). Most important to note, however, is the superior performance of projection over all methods when the block size is equal to the bit duration, i.e. no block processing. It is feasible that computational complexity may warrant a trade-off between SINR and block size, in which case a hybrid implementation may be of benefit - one that automatically switches between adaptive notch filtering and projection depending on the desired SINR. In any case, this example illustrates the parameters involved in the design of modern excision filters for nonstationary jammers.
13.1.5
Summary and Conclusions
The prime objective of interference suppression in broadband signal platforms is to cancel the interference with minimum distortions of the desired signal. Timefrequency signal representation provides the mechanism to achieve that objective for a large class of nonstationary interference signals. The signal localization in the time-frequency domain allows signal processing, acting on information of the instantaneous frequency and bandwidth, to play an effective role in enhancing the receiver performance and improving the bit error rates over existing techniques that deal with only the time-domain or the frequency-domain. We have presented the problem of interference excision from a quadratic timefrequency perspective. Joint time-frequency and time-scale linear transforms have also been successfully applied to suppress nonstationary jammers. Further citations in this area can be found in the references of [2] and [9].
548
Chapter 13:
Time-Frequency Methods in Communications
References [1] M. K. Simon, J. K. Omura, R. A. Scholtz, and B. K. Levitt, Spread Spectrum Communications (3 vols.). Rockville, MD: Computer Science Press, 1985. [2] M. G. Amin and A. N. Akansu, "Time-frequency for interference excision in spreadspectrum communications," in "Highlights of Signal Processing for Communications", IEEE Signal Processing Magazine (G. B. Giannakis, ed.), vol. 16(2), IEEE, March 1999. [3] M. G. Amin, "Interference mitigation in spread-spectrum communication systems using time-frequency distributions," IEEE Trans. Signal Processing, vol. 45, pp. 90-102, January 1997. [4] C. Wang and M. G. Amin, "Performance analysis of instantaneous frequency based interference excision techniques in spread spectrum communications," IEEE Trans. Signal Processing, vol. 46, pp. 70-83, January 1998. [5] M. G. Amin, C. Wang, and A. R. Lindsey, "Optimum interference excision in spreadspectrum communications using open-loop adaptive filters," IEEE Trans. Signal Processing, vol. 47, pp. 1966-1976, July 1999. [6] S. Barbarossa and A. Scaglione, "Adaptive time-varying cancellations of wideband interferences in spread-spectrum communications based on time-frequency distributions," IEEE Trans. Signal Processing, vol. 47, pp. 957-965, April 1999. [7] S. Lach, M. G. Amin, and A. R. Lindsey, "Broadband nonstationary interference excision in spread-spectrum communications using time-frequency synthesis techniques," IEEE J. on Selected Areas in Communications, vol. 17, pp. 704-714, April 1999. [8] R. S. Ramineni, M. G. Amin, and A. R. Lindsey, "Performance analysis of subspace projection techniques for interference excision in DSSS communications," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), vol. 5, pp. 2825-2828, Istanbul, 5-9 June 2000. [9] G. J. Saulnier, M. J. Medley, and P. K. Das, "Wavelets and filter banks in spread spectrum communication systems," in Subband and Wavelet Transforms: Design and Applications (A. N. Akansu and M. J. T. Smith, eds.), ch. 10, pp. 309-346, Norwell, MA: Kluwer, 1996.
549
Communication Over Linear Dispersive Channels
13.2 13.2.1
C O M M U N I C A T I O N OVER LINEAR DISPERSIVE CHANNELS: A TIME-FREQUENCY PERSPECTIVE ~ Linear Dispersive Channels
Linear dispersive channels are encountered in many applications, including mobile wireless communications and underwater acoustical communications [1]. In general, such channels disperse the transmitted signal in both time and frequency. Without proper system design, the effects of dispersion, such as signal fading, can significantly limit system performance. However, appropriate signaling and reception can significantly mitigate the effects of dispersion. Accurate modeling of channel characteristics is critical in this regard. The effect of the channel on the transmitted signal intimately depends on the bandwidth-duration product (TBP) of the signaling waveforms. Narrowband signaling schemes with TBP .~ 1 typically suffer from significant intersymbol interference (ISI) requiring sequence (Viterbi) decoding at the receiver. Linear equalizers constitute a sub-optimal low-complexity solution for narrowband systems. In contrast, wideband signaling schemes with TBP >> 1 can significantly mitigate ISI and side-step the requirement for equalization. In fact, wideband systems can actually exploit the dispersion effects for improved performance. Time-frequency representations and methods provide useful insights in the analysis and design of such wideband communication systems. The goal of this article is to highlight the key aspects of this time-frequency perspective on wideband communication over linear dispersive channels. We develop the concepts in the context of spread-spectrum code division multiple access (CDMA) systems. However, the general ideas hold in other applications as well, including underwater acoustical communications. The next section describes a canonical time-frequency model for wideband dispersive channels that lays the foundation of the time-frequency perspective. The model is used in Section 13.2.3 to discuss various key aspects of wireless CDMA communication system design.
13.2.2
Time-Frequency Model for Dispersive Channels
Let x(t) denote the transmitted complex baseband signal. The received signal after passing through a linear dispersive channel can be generally expressed as [1,2]
-
+
= fo
h(t, 7")x(t - 7)d7 + e(t),
(13.2.1)
where h(t, 7-) denotes the time-varying channel impulse response, s(t) is the complex baseband signal at the output of the channel, and c(t) is additive white Gaussian noise (AWGN) with power spectral density a c2. The maximum delay produced ~ Akbar M. Sayeed, Department of Electrical and Computer Engineering, University of Wisconsin, Madison, WI 53706, USA (akbarQengr.wisc.edu). Reviewers: S. Barbarossa and M. G. Amin.
550
Chapter 13:
Time-FrequencyMethods in Communications
by the channel, Tin, is called the multipath spread of the channel. The following equivalent channel representation is particularly relevant from a time-frequency perspective
s(t) -
H(~, T)x(t -- T)eJ2"~tdvd7, JO
(13.2.2)
J --Bd
where H(~, 7-) = f h(t, T)e-J2~tdt is the time-frequency spreading function of the channel [1,2]. The variable 7 corresponds to temporal (multipath) spreading produced by the channel and the variable ~ corresponds to the spectral (Doppler) spreading produced by the channel. The maximum Doppler frequency produced by the channel, Bd, is called the Doppler spread of the channel. The representation (13.2.2) states that the output signal s(t) is a linear combination of time- and frequency-shifted versions of the transmitted signal x(t). Canonical Time-Frequency Channel Model The channel model (13.2.2) in terms of a continuum of scatterers is difficult to incorporate in receiver design. The finite duration T and (essentially) finite onesided bandwidth B of the signaling waveform x(t) can be exploited to derive a canonical time-frequency channel model that greatly facilitates system design. The canonical model asserts the following channel representation [2-4] L
M
L
s(t) ,~
M
hmzxmz(t) = l---0 m = -
I-I -~,-~ /--0 m = -
M
x t - -~
M
where L = [TmB1, M = [TBd], and {hmL} are samples of a smoothed version, H(~, T), of the spreading function. The model (13.2.3) is a canonical uniform timefrequency (multipath-Doppler) sampling of the channel induced by the finite duration and bandwidth of x(t). The sampling resolution in is AT = 1/B in time (multipath) and At, = 1/T in frequency (Doppler). The canonical representation (13.2.3) states that the output signal s(t) belongs to a subspace spanned by the basis functions
Xml(t)--X
t--~
, l=0,'",L,
m=-M,...,M
(13.2.4)
that are discretely time-frequency shifted versions of the transmitted signal. The dimension of this (active) subspace is Na = (L + 1)(2M + 1) ,,~ 2TBTmBd + TmB + 2TBd + 1 which is proportional to the TBP (TB) of the signaling waveform as well as the channel spread factor TmBd. Typical channels encountered in practice are underspread: TmBd << 1. While (13.2.4) provides a fixed basis for representing the received signal, an alternative signal representation in terms of approximate eigenfunctions of underspread channels is developed in Article 13.3 in this volume. However, the eigenfunctions are not fixed and need to be computed for each channel realization. Underspread channels imply that the output signal subspace is one-dimensional (Na = 1) for narrowband (TB = 1) systems, whereas it is Na > 1
Communication Over Linear Dispersive Channels
551
dimensional for wideband (TB >> 1) systems. As we will see, this increase in dimensionality due to wideband signaling plays an important role in improved communication over dispersive channels. Note that once the signaling waveform x(t) and the channel spreads Tm and Bd are known at the receiver, all information about the channel is captured linearly via {hml} in (13.2.3). Statistical Channel Characterization Communication system performance depends on statistical channel characteristics. A widely used model is the wide-sense stationary uncorrelated scattering (WSSUS) model in which {H(u, T)} are modeled as uncorrelated Gaussian random variables [1,2]: $[H(u, T)H*(u', ~-')] = ~(u, T)5(7 -- ~")5(U -- U') where ~I,(u, T) is the time-frequency scattering function which quantifies the relative channel power at different delays and Doppler shifts. The support of ~I,(u, T) is limited to the channel spreads: [0, Tm] • [-Bd, Bd]. Under the WSSUS assumption, it can be shown that the channel coefficients {hml} in the canonical time-frequency model (13.2.3) are approximately uncorrelated if ~(u, T) is sufficiently smooth. For simplicity, we focus on Rayleigh fading WSSUS channels in which {hmz} are zero mean, uncorrelated Gaussian random variables. A more detailed characterization of randomly time-varying channels is developed in Article 9.5 in this volume.
13.2.3
Communication over Dispersive Channels
Signals with duration T and bandwidth B form a vector space of dimension No ..~ TB. Spread-spectrum waveforms used in CDMA system take the form N-1 c[n]v (t nTc) where c[n] is the length-N spreading code associated q ( t ) - }-~n:0 with the waveform q(t), Tc is the chip duration, and N = T/Tc is the processing gain. For CDMA signals, B ~ 1/Tc and N .~ T B ..~ No. Given a particular spreadspectrum waveform q(t), a complete basis for the signal space can be generated via distinct waveforms of the form {qmL(t)} as defined in (13.2.4). The two most significant factors affecting CDMA system performance are signal fading and multiaccess interference (MAI). Fading is due to the destructive combination of various timefrequency shifted signal copies and manifests itself large fluctuations in received signal power. MAI is caused by the multiple users simultaneously communicating over the channel and can drown the signal of the desired user. We first discuss the role of time-frequency processing in single-user CDMA systerns, highlighting the concept of diversity to combat fading. We then discuss the issue of interference suppression in multiuser systems via the notion of certain timefrequency subspaces. For simplicity, we focus on coherent receivers with binary phase shift keying (BPSK). We also make the assumption that Tm << T; that is, the ISI is negligible. Thus, symbol by symbol detection suffices. -
13.2.3.1
Time-Frequency RAKE Receiver
Consider a single symbol transmission in a single-user system; that is, x(t) = bq(t), where q(t) is the wideband signaling waveform and b c { - 1 , 1} is the transmitted
Chapter 13:
552
Time-Frequency Methods in Communications
bit. The channel model (13.2.3) dictates the following front-end time-frequency correlation at the receiver T
Zml
-
(r, qml) - [
JU
L
M
r(t)q~l(t)dt = bE
E
l~=Orn~=-M
(qm't',qml)hm't' + (c, qmz).
(13.2.5)
The correlator outputs {zml } form the Na-dimensional representation of the noisy received signal with respect to the basis {qmz(t)}. They are also the samples of the narrowband cross-ambiguity function, Ar,q(u, 7), between the received waveform r(t) and the signaling waveform q(t); that is, Zml - Ar,q (~, l ) where Ar,q(V, T) = f r(t)q* (t -- T)e-J~tdt [5]. The correlator outputs can be expressed in vector form as
z = bQh + w
(13.2.6)
where h is the vector of channel coefficients, w ~ Af(0, a~2Q), and Q is an Na x Na matrix of correlation between the different basis waveforms consisting of the entries (qm'l' , qml)
__Aqq
(m-m'
TB eJ2'~(m-m')l' '
T
l-l') '
B
(13.2.7) "
For typical spread-spectrum CDMA waveforms, and with chip-rate multipath sampling (B = 1/Tc in (13.2.3)), the basis {qmz(t)} can be approximately orthogonal resulting in Q ~ I; that is, (qm'l', qmL) ~ 5t-l'Sm-m'. This implies that the sampled auto-ambiguity function of q(t) in (13.2.7) is approximately a "thumbtack" function with peak at the origin. Given estimates of the channel coefficients {hint}, the time-frequency correlator outputs {Zml} are coherently combined at the receiver to yield the final bit estimate /~ -- sign
real
E
hmLzml
/=0 m = - M
= sign {real [hHz~] ) - - s i g n {real
[bhHQh +
hHw]}
where the superscript H denotes complex conjugate transpose. The receiver (13.2.8) is a time-frequency generalization of the RAKE receiver used in conventional CDMA systems [4]. The time-frequency RAKE receiver performs joint multipath-Doppler combining to account for spectral and temporal channel variations within a symbol in contrast to the multipath combining in conventional RAKE receiver. Conditioned on the channel coefficients h, and with perfect estimates (h - h), the bit-error-probability (BEP) of the receiver is given by Re,fading(h) - Q(SNR(h)) where SNR(h) - 2hHQh/a 2 and Q(x) = ~1 fox e-t2/2dt" The unconditioned BEP is given by averaging over the statistics of the channel coefficients: Re,fading -E[Pe,fading(h)] -- E[Q(SNR(h))]. A
Communication Over Linear Dispersive Channels
13.2.3.2
553
Combating Fading with Time-Frequency Processing
The performance of the receiver (13.2.8) operating over an AWGN channel with the same average SNR as the fading channel is given by P~,AWCN = Q(E[SNR(h)]) = Q(2 trace(QRh)/Cr~) where trace(.) denotes the trace of a matrix and Rh = E[hh H] is the channel correlation matrix. We note that Re,fading ~ Pe,AWGN and the loss in performance under fading due to the fluctuations in SNR(h) is quite significant. For example, achieving a BEP of 10 -3 over a Rayleigh fading channel requires about 18 dB additional SNR compared to an AWGN channel with the same average SNR [1]. Diversity signaling and reception is a general technique for combating the effect of fading. The basic idea is to transmit the signal over multiple independent fading channels while keeping the total transmitted power constant. In the case of coherent reception, as the number of diversity channels increases, the performance of the diversity reception over a fading channel approaches that of an AWGN channel at the same average SNR [1]. The remarkable advantage of wideband signaling over WSSUS fading channels comes from the fact that the different delayed and Doppler shifted copies of the signaling waveform in (13.2.3) serve as Na virtual diversity channels [3,4]. In essence, the receiver is able to resolve Na weakly correlated time-frequency shifted copies ({qml(t)}) of the transmitted waveform that are linearly independent and serve as virtual diversity channels carrying the same information bit. Using the KarhunenLo~ve expansion, the SNR can be written as SNR(h) -- }-]n~=l N ~nXn2 where {X2n} are independent chi-squared random variables each with 2 degrees of freedom and {A,,} are the eigenvalues of 2QRh/Cr~. Keeping the transmitted power constant, the law of large numbers dictates that S N R ( h ~ r ~ E [ S N R ( h ) ] - trace(2RhQ/a2~) and thus P~,fadinN"-*'~ Pe,AWCN. Thus, as the number of time-frequency signal components increases in (13.2.3), the coherent receiver (13.2.8) completely combats the loss in performance due to fading. For given channel spread factor, TmBd, the larger the TBP of the signaling waveform, the higher the level of diversity. This advantage of wideband signaling is exploited in CDMA wireless communication systems via the use of direct-sequence spread-spectrum waveforms as well as in underwater acoustical communications via the use of linearly frequency-modulated waveforms. In particular, Doppler diversity, first introduced in [4], can be exploited in practice via time-selective signaling which increases the effective symbol duration [6].
13.2.3.3
Interference Suppression via Time-Frequency Subspaces
In CDMA systems, interference stems from multiple users simultaneously using the channel. In underwater acoustics, reverberation is the major source of interference. In either case, the time-frequency channel model (13.2.3) can be fruitfully exploited for effective interference suppression. The key idea is the notion of time-frequency subspaces associated with the desired signal and interference [7, 8]. We illustrate the concept in the context of CDMA systems. Consider a CDMA communication system with K users. The k th user is assigned a signaling waveform qk(t). According to the model (13.2.3), each user is associated
Chapter 13:
554
Time-Frequency Methods in Communications
with a canonical basis {qk,mL(t)}. For simplicity, we assume that all users are transmitting synchronously. The signal at any receiver can be represented as K
K
Lk
r(t) .~ E sk(t) + n(t) = E bk E k=l
k=l
Mk
E
hk,mlqk,ml(t) + n(t).
(13.2.9)
/=0 rn=--Mk
Suppose that we are interested in decoding the signal of the first user. We consider decentralized detection in which only the code of the desired user is known at the receiver [8]. In this case, bit detection is accomplished by projecting onto the waveforms {ql,ml} associated with the first user. For a discussion of centralized detection which uses information about all users, we refer the reader to [7]. Recall that a subset of {ql,mz} provide a complete basis for the signal space. The key idea from the viewpoint of interference suppression is that of active and inactive subspaces, Sa and $ia, respectively. The active subspace, Sa, is defined as the span of the basis elements that lie within the the multipath-Doppler channel spread and form the diversity channels: Sa = span{ql,ml : l = 0, 1 , . . . , L1, m = - M 1 , . . . , M1}. The inactive subspace, $ia, is spanned by elements that lie outside the channel spread: $ia = span{ql,mZ : 1 ~ { 0 , 1 , . . . ,L1}, rn ~ { - M I , . . . ,M1}}. The N,-dimensional vector of correlator outputs corresponding to S, can be expressed as K
Za -- blQllhl + E bkQlkhk + Wa
=
blSl + ia + Wa
(13.2.10)
k=2
where the first terms represents the signal of interest the second term represents the interference corrupting third term represents background noise. The matrix between the active basis waveforms of the first and the denote the dimension of the inactive subspace. The corresponding to $i~ is given by
(as in the single-user case), the active subspace, and the Qlk denotes the correlation k th user. Let Ni, <_No - N~ vector of correlator outputs
K
2~ia -- E
bkQlkhk
2F Wia -- iia + Wia
(13.2.11)
k=2
where Qlk represents the correlation between the inactive basis waveforms of the first user and the active basis waveforms of the k th user. Note that Zia does not contain the signal of interest it only contains interference (iia) and noise (Wia). By the very nature of spread-spectrum signals, i~ is correlated with iia and the basic idea is to use ii~ to suppress i~ corrupting the active subspace. After interference suppression, the "cleaned-up" version of Za can be coherently combined as in the single-user case (see (13.2.8)) to decode the bits of the desired user. The dimension of $ia controls the receiver complexity and the level of MAI suppression. A linear receiver may be used to accomplish both interference suppression and diversity exploitation in the multiuser case. The bit estimate for bl is given by D1 -- sign{real[g Hz]} -- sign{real[gHaza + giHzia]}
(13.2.12)
Communication Over Linear Dispersive Channels
555
where the Na + Nia dimensional filter g processes the active and inactive correlator outputs. If g is chosen to minimize the mean-squared-error (MSE) between bl and bl, the linear filter admits an intuitively appealing decomposition: H gHz~ + gi~za -- gaH [Za _ C H Zia]. In this decomposition, the matrix filter C forms a minimum MSE (MMSE) estimate of Za from Zia, thereby suppressing the interference corrupting the active subspace. The filter g~ then further suppresses any residual interference and exploits multipath-Doppler diversity to combat fading. The processing of N~ dimensional active correlator outputs, z~, is necessary for maximal diversity exploitation. The inactive subspace serves the sole purpose of interference suppression. It's dimension can be progressively increased to suppress any desired number of interfering users up to No - Na. Thus, the notion of active/inactive time-frequency subspaces provides a natural framework for interference suppression and controlling the complexity of the multiuser time-frequency receiver.
13.2.3.4
An Illustrative Example
Recall that the key parameters are the signal space dimension, No ,~ T B , which is proportional to the spreading gain N = T / T c , and the products T m B and T B d that control the level of multipath and Doppler diversity, respectively. We consider a system with a spreading gain of N = 64 and a multipath spread of Tm = 2Tc. Consider first a single-user system to illustrate the effects of multipath-Doppler diversity. Suppose chip-rate sampling (B = 1/Tc) at the receiver so that L = 2, and 0 < T B d < 0.5 so that M = 1. This results in an Na = 9 dimensional active subspace. Assume uniform power in all the multipath channel components. Let p E (0, 1/3) denote the ratio of the power in each of the m = -t-1 Doppler components relative to the total power in the three (m = - 1 , 0, 1) components. As T B d ---* 0.5 (faster fading), p -~ 1/3 to yield maximum Doppler diversity. 1 Figure 13.2.1(a) shows the BEP of the coherence time-frequency RAKE receiver (13.2.8) as a function of SNR for different values of p. It is evident that time-frequency RAKE receiver yields significant gains over RAKE as p increases. We note that while typical values of T B d are relatively small in practice, 2 simple signaling schemes with longer symbol duration may be used to increase T B d to achieve significant Doppler diversity [4]. Now consider a multiuser system to illustrate the role of active/inactive subspaces in interference suppression. We consider a slow-fading system ( T B d = O) employing 4-fold oversampling (B = 4~To), resulting in a signal space dimension of No = 4N = 256 and Na = 9. There are are 27 interfering users at the same power as the desired user. The signal-to-interference-and-noise-ratio (SINR) at the output of the desired user's receiver is an appropriate performance metric in the multiuser case. Figure 13.2.1(b) shows the SINR of the MMSE receiver as a func1For a uniform Doppler power spectrum, the values p -- 0.0125,0.05, 0.25 are achieved at TB d ,~ 0.2, 0.4, 0.6, respectively [4]. 2For example, TBd ~, 0.013, at a carrier frequency of 1.8 GHz, data rate of 10 kHz, and maximum speed of 50 mph.
556
Chapter 13:
Time-Frequency Methods in Communications
Fig. 13.2.1: (a) Joint multipath-Doppler diversity gains achieved by a single-user coherent timefrequency RAKE receiver as a function of p. The performance of conventional RAKE corresponds to p = T B d = 0. (b) The monotonic increase in the output SINR of a multiuser time-frequency RAKE receiver with increasing dimension of the inactive time-frequency subspace. For Nia < 19, the SlNR saturates, whereas for Nia >_ 19, interference is completely suppressed so that output SlNR increases linearly with input SNR as in a single-user system.
tion of the transmitted SNR of the desired user for different values for the inactive subspace dimension, Nia. As evident from Figure 13.2.1(b), the receiver goes from an interference-limited regime to a noise-limited regime as ]Via increases, with the
Communication Over Linear Dispersive Channels
557
optimal cut-off value around N i a - - 2 8 - N a : 19. Thus, the dimension of the inactive subspace can be progressively increased to achieve a desired level of interference suppression with commensurate receiver complexity.
13.2.4
Summary and Conclusions
The time-frequency channel model (13.2.3) at the heart of the framework presented in this article was first developed by Bello in his classic paper [2] on WSSUS channels. The monograph by Kennedy [3] exploits the model to address several key questions relating to reliable communication over dispersive channels, including capacity and error exponents. Results relating to the use of the model in wireless communications are more recent and are necessarily incomplete due to ongoing investigations. As evident from our brief discussion on diversity and interference suppression, the insights offered by the time-frequency perspective are very useful in efficient transceiver design. While not discussed in this article, similar ideas can be fruitfully exploited in the design of efficient transmission schemes for reaping the capacity of linear dispersive channels, including the design of orthogonal frequency division multiplexing (OFDM) schemes that are strong candidates for high rate wireless communication. Furthermore, the same basic principles apply in underwater acoustical communications as well.
References [1] J. G. Proakis, Digital Communications. New York: McGraw-Hill, 3rd ed., 1995. [2] P. A. Bello, "Characterization of randomly time-variant linear channels," IEEE Trans. Communication Systems, vol. 11, pp. 360-393, December 1963. [3] R. S. Kennedy, Fading dispersive communication channels. New York: Wiley, 1969. [4] A. M. Sayeed and B. Aazhang, "Joint multipath-Doppler diversity in mobile wireless communications," IEEE Trans. Communications, vol. 47, pp. 123-132, January 1999. [5] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-HM1, 1995. [6] S. Bhashyam, A. M. Sayeed, and B. Aazhang, "Time-selective signaling and reception for communication over multipath fading channels," IEEE Trans. Communications, vol. 48, pp. 83-94, January 2000. [7] A. M. Sayeed, A. Sendonaris, and B. Aazhang, "Multiuser detection in fast fading multipath environments," IEEE J. on Selected Areas in Communications, vol. 16, pp. 16911701, December 1998. [8] T. A. Kadous and A. M. Sayeed, "Decentralized multiuser detection for time-varying multipath channels," IEEE Trans. Communications, vol. 48, pp. 1840-1852, November 2000.
558 13.3
Chapter 13:
Time-FrequencyMethods in Communications
EIGENFUNCTIONS OF UNDERSPREAD LINEAR C O M M U N I C A T I O N SYSTEMS 0
The knowledge of the eigenfunctions of a linear system is a fundamental issue both from the theoretical as well as from the applications point of view. Nonetheless, no analytic solution is available for the eigenfunctions of a general linear system. There are two important classes of contributions suggesting analytic expressions for the eigenfunctions of slowly-varying operators: [1] and the references therein, where it was proved that the eigenfunctions of underspread operators can be approximated by signals whose time-frequency distribution (TFD) is well localized in the timefrequency plane, and [2] where a strict relationship between the instantaneous frequency of the channel eigenfunctions and the contour lines of the Wigner Transform of the operator kernel (or Weyl symbol) was derived for Hermitian slowly-varying operators. (See also Article 4.7.) In this article, following an approach similar to [2], we will show that the eigenfunctions can be found exactly for systems whose spread function is concentrated along a straight line and they can be found in approximate sense for those systems whose spread function is maximally concentrated in regions of the Doppler-delay plane whose area is smaller than one.
13.3.1
Eigenfunctions of Time-Varying Systems
The input/output relationship of a continuous-time (CT) linear system is [3]:
y(t) -
F
h(t, T)x(t -- T)dT
(13.3.1)
oo
where h(t, T) is the system impulse response. Although throughout this section we will use the terminology commonly adopted in the transit of signals through timevarying channels, it is worth pointing out that the mathematical formulation is much more general. For example, (13.3.1) can be used to describe the propagation of waves through non homogeneous media and in such a case the independent variables t and T are spatial coordinates. Following the same notation introduced by Bello [3], any linear time-varying (LTV) channel can be fully characterized by its impulse response h(t, T), or equivalently by the delay-Doppler spread function (or simply spread function) S(V,T) "= f _ ~ h(t,T)e-J2~tdt, or by the time-varying transfer
function H(t, f) := f-~c~ h(t, T ) e - J 2 r f r d T
.
Since the kernels of LTV systems in general are not self-adjoint, it is not possible to define the eigenfunctions of a linear system, but we can introduce the so called left and right singular functions (in the following we will use the term eigenfunction only for simplicity, meaning generically the left and right singular functions). In ~ Sergio Barbarossa, INFOCOM Department, University of Rome "La Sapienza", Via Eudossiana 18, 00184 Rome, Italy (sergio~infocom.uniromal.it). Reviewers: G. Matz and A. M. Sayeed.
Eigenfunctions
of
Underspread Linear Communication Systems
559
fact, if the system impulse response is square-integrable, i.e.
f:r
j-
oo
f~
[h(t,T)[ 2 dt dr < oc,
(13.3.2)
J - oo
then there exists a countable set of singular values Ai and two classes of orthonormal functions vi(t) and ui(t), named right and left singular functions, such that the following system of integral equations holds true
A u (t)
h(t, t -
-
/:
(3O
A~v~(r) -
h* (t, t - r)u~(t)dt.
(13.3.4)
cx)
Inserting (13.3.3)in (13.3.4), we have
h*(t,t - r)h(t,O)vi(t -- O)dOdt.
A~vi(7) oo
(13.3.5)
oo
so that vi(7) is the eigenfunction of the system whose kernel is
h(~-, O) "-
/?
h* (t, t - r)h(t, t -- O)dt.
(13.3.6)
oo
In practice, there are at least two quite common situations where h(t, T) is not square-integrable: (i) linear time-invariant (LTI) channels, where h(t, T) is constant along t 1; and (ii) multipath channels with specular reflections, where h(t, T) contains Dirac pulses. However, to avoid unnecessary complications with different notations as a function of the integrability assumption, in the following we will keep assuming (13.3.2), considering the aforementioned exceptions as limiting cases, as in [4, sec. 8].
13.3.2
Systems with Spread Function Confined to a Straight Line
If the spread function is confined to a line, i.e. S(~, T) = g ( T ) 5 ( u -
fo -- #T),
(13.3.8)
the singular functions are chirp signals, i.e. vi(t)
--
eJTrttt2eJ27rfit
-
(13.a. o)
:
1T h e LTI case as well as a large class of t i m e - v a r y i n g s y s t e m s e x h i b i t i n g s o m e sort of s t a t i o n a r i t y can be dealt with by requiring the following integrability condition lim
--
T-----* cx:~ T
instead of (13.3.2).
j-T/2
dt
Ih(t,r)12d'r c~
< ~,
(13.3.7)
560
Chapter 13:
Time-Frequency Methods in Communications
In fact, the impulse response corresponding to (13.3.8) is
h(t, T) = g(T)eJ2~t'~te j2~l~ and, substituting
(13.3.11)
(13.3.11) and (13.3.9)in (13.3.3)we get
Aiui(t) = eY2'~f~
y'~t't~Gt,(fi) = G~,(fi)d2~I~
(13.3.12)
where a.(f) is the Fourier transform (FT) of g•(t)"- g(t)e j~"t2. We can verify that (~3.3.12) is satisfied if ui(t) is given by and ,~i = Gt,(fi). It is also straightforward to check that the two classes of functions vi(t) and ui(t) are orthogonal. Interestingly, the contour lines of IH(t, f)l coincide with the instantaneous frequency of the eigenfunctions. In fact, the transfer function associated to (13.3.8) is H(t, f) = G(f - #t)d 2~f~ where G(f) denotes the F T of g(7), so that IH(t, f)l is constant along lines of equation f = pt + fi, which coincides with the instantaneous frequency of the right singular functions. Furthermore, if f0 = 0, i.e. S(u, 7-) is maximally concentrated along a line passing through the origin, the left and right singular functions are simply proportional to each other and we can talk of eigenfunctions and eigenvalues. Finally, it is worth noticing that if the spread function is mainly concentrated inside a rectangle of a r e a B m a x T m a x ~ 1, thus FtT2ax << 1 and IN(t, fi + pt)l = fG(f~)l ~ IG.(f~)f, so that the modulus of the i-th eigenvalue coincides approximately with the absolute value of the channel transfer function evaluated over the curve given by the eigenfunctions' instantaneous frequency. In the following, we will show how these results can be generalized, albeit in approximate sense, to the more challenging case where the spread function is not confined to a straight line. But, before considering the more general case, it is worthwhile to remark that the model (13.3.8) encompasses three examples of systems commonly encountered in the applications, namely (i) time-invariant systems, where S(L,, T) = g(T)5(L,), which corresponds to p = 0 and thus to having, as well known, sinusoidal eigenfunctions; (ii)multiplicative systems, where S(L,,7-) = C(u)5(T), which corresponds to # = c~ and thus to Dirac pulses as eigenfunctions; (iii) communication channels affected by two-ray multipath propagation, each ray having its own delay and Doppler frequency shift, i.e. 1
S(t/, T) -- ~
q=0
1
hqS(~
-
~q)5(w
-
Tq) or h(t, T)
--
~
hqej27rfqtS(T
--
Tq). (13.3.13)
q=0
In such a case, the eigenfunctions are chirp signals having different initial frequencies, but all with the same sweep rate p = (fl - fo)/(T1 -- ~-0), which depends on the channel delay and Doppler parameters.
13.3.3
Analytic Models for Eigenfunctions of Underspread Channels
We extend now the analysis to systems whose spread function has a support, in the delay-Doppler domain, with small, but, differently from the previous case, non-null
Eigenfunctions of Underspread Linear Communication Systems
561
area. Interestingly, this case encompasses all current digital communication systems. The aim of the ensuing analysis is to show that if S(v, 7) is mainly concentrated around the origin of the Doppler-delay plane, along one of the two axes but not along both, the main result derived above can be generalized, even though only in approximate sense. First of all, proceeding as in [1], we define the absolute moments of S(v, T) as
rn(k,l) .
=
f - ~ f - ~ Ivlkl~-IllS( u,r)ldvdT f_o~ f _ ~ IS(v ' T)ldvd~. 9
(13.3.14)
We say that a system is underspread if all the products rn(~'j) rn (k't) of order i + j + k + l > 2, where the indices are such that there is at least the product of a non null moment along 7- times a non null moment along v, are "small". This definition is not rigorous, but its meaning will be clarified within the proof of the main statement of this section. Since the partial derivatives of H(t, f) can be upper bounded as follows
OHk+t(t,f) OtkOft
< (27r)k+trn(kt)
-
'
~ oo
oo
IS(v,
T)ldvdv,
(13.3.15)
if S(~, T) has small moments, H(t, f) must be a smooth function in at least one direction. In the following we show that, if the system is underspread, the singular function associated to the i-th singular value can be approximated by the following analytic function Ki(t)
vi(t) "- E
Ki(t)
vi,m(t)"- E
m--1
where (i) the instantaneous phase r neous frequency fi,m(t) := $i,m(t)/27r of
ai,m(t)eJ4)~'m(t)'
(13.3.16)
m--1
is such that the corresponding instanta-
vi,m(t) is one of the real solutions of
]H(t, fi,m(t))l 2 = A2, Vm;
(13.3.17)
(ii) the amplitude a~,m(t) is approximately constant and different from zero only within the time interval where IH(t, f/,m(t))l 2 = A2 admits a real solution, and its value is such that vi(t) has unit norm; Ki(t) is the number of solutions of (13.3.17), for each Ai and t. The existence of a real solution for fi,m(t) of implies that the singular values A~ must be bounded in the following interval: m i n t , / I H ( t , f)l < A~ < m a x t , / I H ( t , f)l. Between these two boundaries, not all values of Ai are possible: The only admissible values are the ones that allow the eigenfunctions to be orthonormal and respect Heisenberg's uncertainty principle, similarly to the area rule suggested in [2]. From (13.3.17) we notice that the instantaneous frequencies of the system eigenfunctions coincide with the contour lines of IH(t, f)l. Typically, the contour plots
Chapter
562
13:
Time-Frequency Methods in Communications
are closed curves and then Ki(t) is usually an even integer. In general, we have verified numerically that if there are more closed curves corresponding to the same eigenvalue Ai, the multiplicity of the eigenvalue is equal to the number of closed curves corresponding to Ai, with each closed curve giving rise to one eigenfunction. We show now under which approximations, the function Vi(T), as given in (13.3.16), is a solution of (13.3.5). Exploiting the system linearity, we compute the output yi,m(t) corresponding to each rn-th component Vi,m(t) in (13.3.16) and then we exploit the superposition principle to derive the output corresponding to vi(t). In our proof, we assume that the support of h(t, T) along T is small. 2 As a consequence, for each value of T, the product h*(t, t - r)h(t, O) in (13.3.5) assumes significant values only for small values of both t - T and 9. We can thus expand Vi,m(t - 9) in (13.3.5), around T and keep only the lower order components
v~,m(t - O) ~ ai,m(T)eJr
(13.3.18)
having used a first order approximation for r O) and a zero-th order approximation for ai,m(t- 0). Substituting (13.3.18) into (13.3.5) and invoking the principle of stationary phase [5] to derive an approximate analytic expression of the integral, we get the m-th output term
Yi,m(T) "~ ai,m(~')e jr
co co
h * ( t , t - ~')h(t,O)e-J$~,m(r)eeJ(~','~(r)(t-r)dOdt co
h * ( t , t - T)H(t, fi,m(T))eJr
= a~,m(~-)ejr
(13.3.19)
co
After a few algebraic manipulations involving the Taylor series expansion of both h(t, t--T) and H(t, f) about T in their first argument and summing over m, we get
Ki
co ~(k,O)(T, fi,m(T))D~(l,k+l)'(T, fi,m(T) )
m=l k,t=o
(--j2~) k+zk! l!
ai,m
,
k,t#O (13.3.20) where ~'t,fTJ(k'(t, l) f) := ok+lH(t, f ) / o t k o f I. This equation shows that Vi(T ), as given in (13.3.16), is (approximately) the eigenfunction associated to the eigenvalue A2 if the perturbation, given by the second term of the right-hand side of (13.3.20), is small with respect to A2vi(v). From (13.3.20), we notice that the perturbation is equal to the sum of complex functions given by the product of the partial derivatives of the system transfer function, evaluated along the curve where the modulus of the transfer function is constant. Furthermore, each term in the perturbation contains at least the first order derivative with respect to both time and frequency. Therefore, 2If this assumption is not true, to respect our main assumption about the concentration of the spread function, the spread of S(~, r ) along ~ must be very small. In such a case, using duality arguments, we can derive equivalent results working with the spectrum of the eigenfunctions.
Eigenfunctions of Underspread Linear Communication Systems
563
the perturbation is small with respect to the first term in (13.3.20) if the transfer function is smooth in at least one direction, i.e. time or frequency. Hence, the analytic model (13.3.16) is valid only for underspread systems, i.e. systems whose transfer function has small partial derivatives, by virtue of (13.3.15), in at least one direction. Furthermore, since the energy of the first term is A~, the approximation error is smaller for the highest eigenvalues. Since so many approximations have been used to justify the analytic model (13.3.16), it is necessary to check the validity of such approximations. Given the crucial role played by the instantaneous frequency in the definition of the system eigenfunction and the interplay of time and frequency, the analysis of the timefrequency distribution (TFD) of the system eigenfunctions plays a fundamental role as a validation tool. Since the validation is necessarily numerical, we start deriving the equivalent discrete-time (DT) system corresponding to the continuoustime (CT) relationship (13.3.1). Specifically, we consider the system obtained by windowing h(t, T) in time and in frequency. Assuming that the input signal x(t) has a spectrum confined within the bandwidth [-1/2Ts, 1/2Ts], we can express x(t) as oo
x(t) -
~
x[k]sinc(Tr(t - kTs)/Ts),
(13.3.21)
k=--oO
where x[k] := x(kTs) andl/Ts is the sampling rate. Sampling the continuous time system output y(t) at the same rate 1/Ts ,3 we get the equivalent discrete-time I / O relationship oo
k----oo
where h[n, k] denotes the equivalent DT impulse responses, defined as
h[n, n - k] "-
/?j? oo
oo
sinc(Tr(nWs - 0)/Ts)sinc(Tr(0 - T -- kTs)/Ws)h(0, T)dTdO.
(13.3.23)
Equation (13.3.22) is the DT counterpart of (13.3.1). Assuming that h[n,k] has finite support over k, i.e. the channel is FIR of order L, and parsing the input sequence into consecutive blocks of size R, the discrete-time model leads directly to the matrix I / O relationship y(n) = H(n)x(n), where H(n) is the P x R channel matrix, with P = R + L, relative to the n-th transmitted block, whose (i, j) entry is {H(n)}i,j = h[nP + i , i - j], whereas x(n) := (x[nR],...,x[R + R - 1]) T and y(n) := (y[nP],..., y[nP + P - 1]) r are the input and output blocks. The discrete time counterpart of (13.3.3) and (13.3.4) is the singular value decomposition (SVD) of the channel matrix g ( n ) , i.e. H(n) = U(n)A(n)VH(n), that allows us to write 3We assume that 1/Ts is large enough to respect the Nyquist principle for the system output y(t); this means that, if we take into account the b a n d w i d t h increase due to the transit through a time-varying system, 1/Ts is strictly larger than the b a nd w i d t h of x(t).
Chapter 13:
564
U(n)A(n) = H(n)V(n),
Time-Frequency Methods in Communications
or HH(n)U(n) = V(n)A(n),
(13.3.24)
where the columns of U(n) and V(n) are the left and right channel singular vectors associated to the singular values contained in the diagonal matrix A(n). To check the validity of model (13.3.16), we proceed through the following steps. Given the impulse response h(t, T) of the CT system, (i) we build the channel matrix H(n) of the equivalent DT system; (ii) we compute the SVD of H(n); (iii) we compute the TFD of the right and left singular vectors associated to the generic singular value Ai; and (iv) we compare the energy distribution of these TFD with the contour plot of IH(t,f)l corresponding to level A~. We used as a basic tool to analyze the signals in the time-frequency domain the Smoothed Pseudo-Wigner-Ville Distribution (SPWVD) with reassignment, presented in [6] and Article 7.2, for its property of having low cross terms without degrading the resolution. We considered as a test system a communication channel affected by multipath propagation, thus described by the CT impulse response Q-1 h(t,
-
-
q--0
where each path is characterized by the triplet of amplitude hq, delay Tq and Doppler shift fq. We generated the amplitudes hq as independent identically distributed (lid) complex Gaussian random variables with zero mean and unit variance (the Rayleigh fading model), and the variables Tq and fq as lid random variables with uniform distribution within the intervals [0, AT] and [-Af/2, Af/2], respectively. An example, relative to a multipath channel composed of Q = 12 paths, with AT = 4Ts and A f = 4/NTs, N = 128, is reported in Fig. 13.3.1 where we show: (a) the mesh plot of IH(t, f)l, (b) two contour plots of IH(t, f)l corresponding to the levels/~16 (dashed line) and )~32 (solid line); (c) the contour plot of the SPWVD of v16; (d) the contour plot of the SPWVD of v32. It is worth noticing how, in spite of the rather peculiar structure of the contour plots of IH(t, f)l, the SPWVDs of the two singular functions are strongly concentrated along curves coinciding with the contour lines of IH(t, f)l corresponding to the associated singular values, as predicted by the theory. It is also interesting to observe the bubble-like structure of the two SPWVDs. Indeed this behavior is quite common, because in general the contour lines of the time-varying transfer function are typically closed curves. Before concluding this section, it is also important to provide some physical insight to justify the rather peculiar behavior of the channel eigenfunctions. Indeed, the bubble-like structure is perfectly functional to guaranteeing two of the fundamental properties of the eigenfunctions, namely orthogonality and system modes excitation. In fact, by construction, (13.3.16) and (13.3.17) insure that the instantaneous frequency curves of singular functions associated to distinct eigenvalues do not intersect. Therefore, if the WVDs of the eigenfunctions associated to distinct eigenvalues are well concentrated along their instantaneous frequency curve (i.e. if their amplitude modulation is negligible), the scalar product of their WVDs is null
Ei&enfunctions of Underspread Linear Communication Systems
565
Fig. 13.3.1: Comparison between contour lines of IH(t, f)l and TFDs of channel singular vectors: (a) 3D plot of IH(t, f)l; (b) contour lines of IH(t, f)l corresponding to levels ~16 (dashed line) and /~32 (solid line); (c) SPWVD of v16; (d) SPWVD of v32.
and thus, by virtue of Moyal's formula, the eigenfunctions are orthogonal, as required. Considering now the modes of the system, we know that the unit energy input signal that maximizes the output energy is the right singular function associated to the highest singular value. Now, if we combine this basic property with the model given in (13.3.16) and (13.3.17), we can conclude that, not surprisingly, the input signal which maximizes the output energy is the signal whose energy is concentrated in the time-frequency region where the channel time-varying transfer function is maximum.
566
Chapter 13:
13.3.4
Time-FrequencyMethods in Communications
Optimal Waveforms for Digital Communications through LTV Channels
Let us consider one of the most interesting applications of the theory described above, i.e. the transmission of information symbols s[k] through an LTV channel. In Article 13.2, for example, it is shown how to convert the channel dispersiveness, possibly in both time and frequency domains, into a useful source of diversity to be exploited to enhance the SNR at the receiver. Here we show that if the transmitter is able to predict the channel time-varying transfer function, at least within the next time slot where one is going to transmit, it is possible to optimize the transmission strategy and take full advantage of the diversity offered by the channel dispersiveness (see e.g. [7] for more details). Considering a channel with approximately finite impulse response of order L, we can parse the input sequence in consecutive blocks of K symbols and insert null guard intervals of length L between successive blocks to avoid inter-block interference. If the symbol rate is 1/Ts, the time necessary to transmit each block is KTs. For each i-th block, we must consider the channel hi(t, T) obtained by windowing h(t, 7) in time, in order to retain only the interval [igTs, (i + 1)KTs], and in frequency, keeping only the band [-1/2T~, 1/2T~]. The optimal strategy for transmitting a set of symbols si[k] :- s[iK + k], k = 0 , . . . , K - 1, in the presence of additive white Gaussian noise (AWGN), is to send the signal [4, sea. 8] K-1
9~(t) = ~
~,k~[k]v~,~(t)
(13.3.25)
k=O
where vi,k(t) is the right singular function associated to the k-th eigenvalue of the channel response hi(t, T) in the i-th transmit interval and Ci,k are coefficients used to allocate the available power among the transmitted symbols according to some optimization criterion [7]. Using (13.3.3), the received signal is thus
yi(t)
-
//
hi(t, ~')z~(t - ~)d~" + w(t) - ~
c~
c~,ka~,ks~[k]u~,k(t) + w(t),
(la.a.26)
k
ui,k(t) is the left singular function associated to the k-th singular value of h~(t, T) and w(t) is AWGN. Hence, by exploiting the orthonormality of the functions ui,k(t), the transmitted symbols can be estimated by simply taking the scalar products of y(t) with the left singular functions, i.e. where
1 ii[m] - ,Xi,~ci,m
//
oo y(t)ui*m(t)dt = s[m] + w~[ml,
(13.3.27)
where the noise samples sequence wi [m] "= f-~c~ w(t)ui*m (t)dt constitutes a sequence of iid Gaussian random variables. In this way, the initial LTV channel, possibly dispersive in both time and frequency domains, has been converted into a set of parallel independent non-dispersive subchannels, with no intersymbol interference, and the symbol-by-symbol decision is also the maximum likelihood detector.
Eigenfunctions of Underspread Linear Communication Systems
557
Most current transmission schemes turn out to be simple examples of the general framework illustrated above. For example, in communications through flat fading multiplicative channels, whose eigenfunctions are Dirac pulses, the optimal strategy is time division multiplexing. By duality, the optimal strategy for linear time-invariant channels is orthogonal frequency division multiplexing (OFDM). Interestingly, in the most general case (of underspread channels), the optimal strategy consists in sending symbols through channel-dependent bubble-carriers.
13.3.5
Summary and Conclusions
The analytic model for the eigenfunctions of underspread linear operators shown in this article, although approximate, shows that the energy of the system eigenfunctions is mainly concentrated along curves coinciding with level curves of the system transfer function. This property, for whose validation the analysis of the system eigenfunctions' T F D plays a fundamental role, gives a general framework for interpreting some current data transmission schemes and, most importantly, gives a new perspective on the optimal waveforms for transmissions over time-varying channels. Pictorially speaking, if we draw a parallelism between time-frequency representations and musical scores, we may say that the eigenfunctions of underspread systems give rise to a polyphonic texture which reduces to monophonic lines only in the simple case of systems whose spread function is concentrated on a straight line. In the most general case, we have a polyphony of ascending and descending melodic lines which run in order to create bubbles whose shape is dictated by the contour lines of the system transfer function.
References [1] G. Matz and F. Hlawatsch, "Time-frequency transfer function calculus (symbolic calculus) of linear time-varying systems (linear operators) based on a generalized underspread theory," J. of Mathematical Physics, vol. 39, pp. 4041-4070, August 1998. Special Issue on Wavelet and Time-Frequency Analysis. [2] L. Sirovich and B. W. Knight, "On the eigentheory of operators which exhibit a slow variation," Quarterly of Applied Mathematics, vol. 38, pp. 469-488, 1980. [3] P. A. Bello, "Characterization of randomly time-variant linear channels," IEEE Trans. Communication Systems, vol. 11, pp. 360-393, December 1963. [4] R. G. Gallager, Information Theory and Reliable Communication. New York: Wiley, 1968. [5] A. Papoulis, Signal analysis. New York: McGraw-Hill, 1977. [6] F. Auger and P. Flandrin, "Improving the readability of time-frequency and time-scale representations by the reassignment method," IEEE Trans. Signal Processing, vol. 43, pp. 1068-1089, May 1995. [7] S. Barbarossa and A. Scaglione, "Time-varying fading channels," in Signal Processing Advances in Wireless and Mobile Communications (G. B. Giannakis, Y. Hua, P. Stoica, and L. Tong, eds.), vol. 2: "Trends in Single- and Multi-User Systems", ch. 1, Upper Saddle River, N J: Prentice-Hall, 2001.
568
13.4
Chapter 13:
Time-Frequency Methods in Communications
F R A C T I O N A L A U T O C O R R E L A T I O N FOR D E T E C T I O N IN C O M M U N I C A T I O N S 0
Linear frequency modulated (FM) signals, also called chirps, are commonly used in communications systems. For example, they are employed in modulation schemes that allow better resilience against interferences. Due to their frequent occurrence in communications, detection of chirp signals and estimation of their parameters have been of great interest. In this article, a method based on fractional autocorrelation is proposed for detection and parameter estimation of chirps. To that end, after briefly introducing the fractional Fourier transform (FRFT), formulations of fractional convolution and correlation are presented. Then, a detection statistic based on fractional autocorrelation is proposed and its performance is demonstrated through simulations.
13.4.1
Fractional Fourier Transform
The F R F T is a linear, energy preserving signal transformation that generalizes the conventional Fourier transform (FT) via an angle parameter r [1,2]. For r - ~, the F R F T simplifies to the conventional FT. The conventional FT of a time domain signal s(t) can be interpreted as a counterclockwise rotation by ~ radians in the time-frequency plane. The F R F T generalizes this rotation property by defining rotations associated with angles other than -~ 2" The F R F T of a time domain signal s(t) is defined [1,2] as -
V/1--ycotr
-
e'~r2c~162
'Trt2 c~ r
e-227rtrcscr r
s(r).)(_r
r = 2n~,
,
(13.4.1)
r
where ]F r is the F R F T operator associated with angle r Sr denotes the frac7r tional Fourier transformed signal and n is an integer. For r y, the conventional FT is obtained. Similarly, for r - 0, one obtains the time domain representation s(t) [1]. Furthermore, for r - ~, the F R F T reduces to a time reversal operation [1], (IF ~s)(t) - s(-t).
(13.4.2)
One fundamental property of the F R F T relates it to the WD. The WD of the time domain signal s(t) is defined (in Section 2.1.3) as
Ws(t, f) =
s(t + -~) s*(t - -~) e--22~fTdT.
(13.4.3)
~ Olcay Akay, Department of Electrical and Electronics Engineering, Dokuz Eyliil University, Izmir, Turkey (olcay.akay@eee.deu.edu.tr), and G. Faye B o u d r e a u x - B a r t e l s , Department of Electrical Engineering, University of Rhode Island, Kingston, RI 02881, USA (boud@ele.uri.edu). Reviewers: M. J. Bastiaans and X.-G. Xia.
569
Fractional Autocorrelation for Detection in Communications
The relationship between the F R F T and the WD can be stated [1,2] as Ws~ (t, f) =
W~(t cos r
- f sin r t sin r + f cos r
(13.4.4)
where Ws~(t,f) denotes the WD of the FRET signal, Sr whereas Ws(t,f) is the WD of the original time domain signal, s(t). According to (13.4.4), the WD of the fractional Fourier transformed signal Sr can be obtained by rotating the WD of the original signal s(t) in the clockwise direction by r
13.4.2
Fractional Convolution and Correlation
Definitions of fractional convolution and correlation operations are obtained using the unitary fractional-shift operator R~, which is defined [3] as (RCs)(t) -
s(t - pcosr e -227r~- c~
r + J27rtpsin
(13.4.5)
where p is the fractional lag variable. For r = 0, the unitary fractional-shift operator, R r simplifies to the well-known unitary time-shift operator, (R~ (TTs)(t) = s ( t - 7), with ~- denoting the time lag. Analogously, as the special case 7r for r - 2, the unitary frequency-shift operator is obtained, (R7 s)(t) = (F~s)(t) = e32~ts(t), with u denoting the frequency lag. Fractional convolution of two signals s(t) and h(t) is calculated [4] by evaluating the inner product 1 of the first signal s(t) with the axis-reversed, complex-conjugated and fractionally-shifted version of the second signal h(t), -
_
*)
_
-
~2 r e327r-ycosCsin
(/3) h ( r c o s r
Z)e-327r/3rsinCd/3.
(13.4.6)
In (13.4.6), ,~ denotes fractional convolution associated with angle r h(t) = h * ( - t ) , and IF" is the axis-reversal operator in (13.4.2). For r = 0, fractional convolution simplifies to linear time invariant (LTI) convolution given as
(s *o h)(t) - {s, TtlF~h*} - {s, Tth) - / s(fl) h(t -/~) dZ.
(13.4.7)
Similarly, for r - ~, fractional convolution in (13.4.6) reduces to linear frequency invariant (LFrI) convolution which is obtained [4] by
(s ,~ h)(f) - (s Ff
iF
71"
h*)
m
(s, Fi[t )
-Is(Z) h(-Z)e-Y2~f fldZ =JS~ (Z) H~ where S~ (f) and H ~ (f) denote the conventional FTs of
[ - ( f -/3)]dZ
s(t)
and
h(t),
(13.4.8)
respectively.
1The inner product of two functions g(t) and h(t) is defined as {g, h} - f g(t) h* (t) dt.
Chapter 13:
570
Time-FrequencyMethods in Communications
T a b l e 13.4.1: Definitions of fractional convolution and correlation at angle r
Convolution Cross-corr. Autocorr.
r2
(s .r h)(r) = eg27rY cosCsin r
/
s(/3)h(rcosr - f~)e-327rf~r sin Cdf/
(s *r h)(p) = e227r~ cos r sin r
h*(f~ - p cos r
-32Trypsin Cd~
(s .r s)(p) - e327r~ cos Csin r
s*(fl - pcos r
-327rflpsin Cd~
Table 13.4.2" Linear time invariant (LTI) and frequency invariant (LFrl) convolution and correlation. LTI
LFrI
Convolution
(s *o h)(t) - / s ( # ) h(t-#)d#
Cross-corr.
(s *o h)(T) = /s(#) h*(#--T) d#
Autocorr.
(s *o S)(T) = /S(~) S*(~--T) d~
[
(~.~ h)(f) = / S ~ ( # ) H ~ [ - ( f - ~ ) ] d f i
(s.~ h)(~,) =
S~ (~) H~ (~-u
a~.
[
Analogously, fractional cross-correlation, denoted by .r of s(t) and h(t) is cal-~ culated [4] using the fractional-shift operator, R r within an inner product, (s .r h)(p) - (s, RCh) - eg27r~2 cosCsin r
- pcosr
e - g27rflp sin Cdfl.
(13.4.9) Fractional autocorrelation at angle r is similarly defined [4] by replacing the second function h(t) in (13.4.9)with the function s(t),
(s ,r s)(p) = e327r~c~ r sin r 2
(/3) s* (/3 - p c o s r
-221r~P sin Cd~.
(13.4.10)
For the special cases of r - 0 and r - y, r fractional correlation operations in (13.4.9) and (13.4.10) reduce to the corresponding LTI and LFrI correlation operations, respectively. Table 13.4.1 displays the definitions of fractional convolution and correlation operations. Definitions of LTI and LFrI convolution and correlation operations are listed in Table 13.4.2. 13.4.2.1
Alternative Formulations of Fractional Convolution and Correlation
In this subsection, alternative formulations for fractional convolution and correlation are presented. Having alternative equivalent formulations allows one to gain additional insight about these fractional operations. Furthermore, these alternative formulations suggest efficient ways for computer implementation.
571
Fractional Autocorrelation for Detection in Communications
The first alternative formulation of fractional convolution is given [4] as (s ,r
h)(r) - f
Sr162
H-r
(13.4.11)
- ~ ) d ~ = (S r *o H - r
where Sr = (IFCs)(r) and H - r = (IF-r One can see from (13.4.11) that fractional convolution at angle r can equivalently be calculated by a conventional LTI convolution of S4)(r) and H - r Another alternative formulation can be obtained by applying the convolution property of the conventional FT [4] to the right hand side of (13.4.11) and is given by (s ,r
h)(r) -
(F-~
{SO+r162
(r).
(13.4.12)
According to this formulation, fractional convolution associated with angle r can be computed by multiplying the F R F T at angle ~ + r of the first signal, S~+r 71" 7r with the FRFT at angle ~ - r of the second signal, H ~ - r and then taking a conventional inverse FT. This formulation suggests a way for efficient computer implementation of fractional convolution using the fast Fourier transform (FFT) along with the fast FRFT algorithm [5]. An alternative formula for fractional autocorrelation in (13.4.10) is given as [4] (s ,r
s)(p) - f
Sr162
- p)]*d/~ = (S r *o Sr
(13.4.13)
Thus, as can be seen from (13.4.13), fractional autocorrelation can also be calculated via the LTI autocorrelation of Sr By applying the autocorrelation property of the conventional FT to the right hand side of (13.4.13), another alternative formulation of fractional autocorrelation is obtained as (13.4.14) In (13.4.14), one FRFT with angle r + 771" and one conventional inverse FT are used to compute fractional autocorrelation. This formulation suggests a fast discretetime approximation of fractional autocorrelation. The discrete F R F T algorithm proposed in [5] has a computational load of O(N log N) for a discrete-time signal of length N. Thus, utilizing the discrete F R F T and F F T algorithms together, a discrete-time approximation of fractional autocorrelation via (13.4.14) can be computed efficiently with a computational load of O ( 2 N l o g N + N) [4]. If one wants to calculate fractional autocorrelation for M different angles, then the total computational load is O[M(2N log N + N)}. By computing the conventional FT of both sides of (13.4.14), one obtains
{(s ,+
IS ++(u)l
(13.4.15)
Eq. (13.4.15) can be thought as the fractional autocorrelation theorem. For r = 0, it simplifies to the autocorrelation theorem of the conventional FT [4], (IF~ {(s *o S)(T)}) (f) = IS~ (f)l 2.
(13.4.16)
Chapter 13:
572
Time-FrequencyMethods in Communications
T a b l e 13.4.3: Alternative formulations of the fractional convolution and correlation operations associated with the fractional domain at angle r The cross-AF, AFs,h(T,U) = f s(t Jr ~ ) h * ( t -
~)e-32~Utdt, Fractional Convolution
(~,~ h)(~) =
Cross-corr.
simplifies to the auto AF in (13.4.17) when
/
alternative I
(s .~/, ~)(p) =
Sr
Also, h(t) =
alternative II
(IF-~ { S~+r162
Sr162
./S r (~)[H r ( 3 -
Autocorr.
h(t)= s(t).
h*(-t).
alternative III
} )(r)
P)]*d3
AFs,h(r cos r
r sin r
AFs,h (p cos r
psin r
AFs (p cos r
r ( 3 - P)]*d/3
p sin r
The alternative formulations of fractional cross-correlation are similarly obtained by replacing the second s(t) in (13.4.13) and (13.4.14)with h(t). All alternative formulations of fractional convolution and correlation operations reported in this subsection are listed in the second and third columns of Table 13.4.3.
13.4.3
Fractional Autocorrelation and the Ambiguity Function
One of the fundamental properties of fractional autocorrelation is its relation to the ambiguity function (AF). The AF of s(t) can be defined [6] as
AF~(T,U)
-
(F_~T_~s,F~T~s>
-
T s * ( t - r -~) e_22~utd t s(t + -~)
(13.4.17)
where (T~s)(t) = s ( t - 7) is the unitary time-shift and (F~s)(t) = e32~Vts(t) is the unitary frequency-shift operator. A well-known property of the AF is that one can recover the LTI and LFrI autocorrelations in the fourth row of Table 13.4.2 by taking horizontal (r - 0) and vertical (r - ~) cross sections of the AF, respectively [6];
(s *o S)(T) -- AFs(T, 0) and
(s ,~ s)(u) - AFs(O, u).
( 3.4.1s)
The relationship between the AF and fractional autocorrelation is given [4] as (s , r
s)(p) = AFs(pcosr psin r
This relationship generalizes the properties given in (13.4.18) for time (r = 0) and frequency (r - 2) domain autocorrelations into other orientations of the AF. One can see from (13.4.19) that fractional autocorrelation at angle r can also be recovered by taking a radial slice of the AF at angle r in the ambiguity plane. Similarly, fractional convolution and fractional cross-correlation can be related to radial slices of the cross-AF as can be seen in the fourth column of Table 13.4.3.
573
Fractional Autocorrelation for Detection in Communications
13.4.4
Detection and Chirp Rate Parameter Estimation of Chirps
In this section a detection statistic is proposed for detection and chirp rate parameter estimation of chirp signals. A continuous-time chirp can be formulated as e32~r(r t)t, where rh and ~ are the unknown chirp rate and initial frequency parameters, respectively. The AF of all chirps has a linear region of support that passes through the origin of the ambiguity plane [6, 7]. Since fractional autocorrelation corresponds to radial slices of the AF as in (13.4.19), a detection statistic based on fractional autocorrelation can be used for detection of chirps. By expressing the integral of the AF magnitude along the radial line with angle r the following detection statistic is derived [4],
L(rn)
/
1(8 "karctan(m )
S)(p) I dp.
(13.4.20)
Eq. (13.4.20) states that integration of the modulus of fractional autocorrelation with angle r = arctan(m) of the received signal s(t) can be used for detection of chirp signals [4]. If the detection statistic, L ( m ) , exceeds a threshold for a certain chirp rate value, ~h, then the detection statistic determines that a linear FM signal, with that particular chirp rate, is present in the received signal. The fast fractional autocorrelation algorithm via (13.4.14) does not require the calculation of the AF, and hence, is computationally efficient. In fact, if the number of chirp rates M, for which the detection statistic is calculated, is much less than the number of signal samples N, (M < < N), then, computational savings provided by (13.4.20) over any detection statistic which requires calculation of the AF is considerable.
13.4.4.1
Simulation Examples
In this subsection, we test the detection statistic in (13.4.20) for detection and chirp rate parameter estimation of monocomponent and multicomponent chirp signals. E x a m p l e 1: In this simulation example, a discrete-time monocomponent chirp signal with chirp rate rh = 0.35 is corrupted with complex additive noise, w[k], generated by two independent, zero-mean, Gaussian random processes of equal variance; ^
~ k2
s[k] - e3ml--O-~
+ w[k],
k - 1, 2 . . . , 512.
(13.4.21)
The normalized 2 detection statistic is calculated using (13.4.20) for different chirp rate values mt - 0 . 1 + (0.4 2---0-6)l, l - 0 , . . . , 199. The experiment was realized for the noise-free case, and for additive complex noise cases o f - 6 dB and - 9 dB signal-tonoise ratio (SNR) values. Fig. 13.4.1 shows how the normalized detection statistic behaves in different levels of noise. It can be seen in Fig. 13.4.1 that the algorithm is able to detect the monocomponent chirp signal with the correct chirp rate parameter value, rh - 0.35, even in significant levels of noise. 2We normalize the detection statistic by the area under the received signal magnitude, i.e. L(m) so that the detection threshold level is less affected by the noise level.
E ~=1 Is[k]l'
Chapter 13:
574
Time-Frequency Methods in Communications
F i g . 13.4.1: Normalized (See Footnote 2) detection statistic. SNR - - 6 dB. Solid line: SNR = - 9 dB.
Dotted line: noise-free. Dashed line:
E x a m p l e 2: In this example, a discrete-time, multicomponent linear FM signal with initial frequency and chirp rate parameter values ~0 - 6/1024, ~1 - 3/1024, ~2 - 2/1024, ~3 : 1/1024, rh0 = 0.10, rhl = 0.20, rh2 - 0.22 and ?~t 3 --- 0.45 is corrupted with complex additive noise, w[k], generated by two independent, zeromean, Gaussian random processes of equal variance; 3
s[k] - ~
e212u~i + ~ h i ~ k ] k
+ w[k],
k - 1 , 2 . . . , 2048.
(13.4.22)
i=0
The normalized detection statistic is calculated via (13.4.20) with respect to different --~ chirp rates rnz - (0.5)l, 1 - 0, . . . , 199. The experiment was done for the noise-free case, and for additive complex noise cases o f - 5 dB and - 1 0 dB SNR values. As can be seen from Fig. 13.4.2, all four chirps are detected with their correct chirp rates. Two closely placed chirps with rhl = 0.20 and rh2 = 0.22 are also resolved well.
13.4.5
Summary and Conclusions
In this article, a computationally efficient method based on fractional autocorrelation is proposed for detection and chirp rate parameter estimation of chirp signals that are utilized frequently in various applications of communications. Formulations of the fractional convolution and correlation operations associated with the fractional domains of the time-frequency plane are also presented. Those fractional domains are defined by the recently developed fractional Fourier transform ( F R F T ) . The effectiveness of the proposed method is illustrated with simulation examples.
Fractional Autocorrelation for Detection in Communications
575
Fig. 13.4.2: Normalized (See Footnote 2) detection statistic. Dotted line: noise-free. Dashed line: SNR = - 5 dB. Solid line: SNR = - 1 0 dB.
Alternative presentations of the fractional Fourier transform are given in Articles 4.8 and 5.8. References
[1] L. B. Almeida, "The fractional Fourier transform and time-frequency representations," IEEE Trans. Signal Processing, vol. 42, pp. 3084-3091, November 1994. [2] H. M. Ozaktas, B. Barshan, D. Mendlovic, and L. Onural, "Convolution, filtering, and multiplexing in fractional Fourier domains and their relationship to chirp and wavelet transforms," J. Optical Soc. of America A, vol. 11, pp. 547-559, February 1994. [3] O. Akay and G. F. Boudreaux-Bartels, "Unitary and Hermitian fractional operators and their relation to the fractional Fourier transform," IEEE Signal Processing Letters, vol. 5, pp. 312-314, December 1998. [4] O. Akay and G. F. Boudreaux-Bartels, "Fractional convolution and correlation via operator methods and an application to detection of linear FM signals," IEEE Trans. Signal Processing, vol. 49, pp. 979-993, May 2001. [5] H. M. Ozaktas, O. Ankan, M. A. Kutay, and G. Bozda~l, "Digital computation of the fractional Fourier transform," IEEE Trans. Signal Processing, vol. 44, pp. 2141-2150, September 1996. [6] F. Hlawatsch and G. F. Boudreaux-Bartels, "Linear and quadratic time-frequency signal representations," IEEE Signal Processing Magazine, vol. 9, pp. 21-67, April 1992. [7] B. Ristic and B. Boashash, "Kernel design for time-frequency signal analysis using the Radon transform," IEEE Trans. Signal Processing, vol. 41, pp. 1996-2008, May 1993.
This Page Intentionally Left Blank
Chapter 14
Time-Frequency Methods in Radar, Sonar & Acoustics The fields of radar and sonar are traditionally key application areas and testing grounds for advances in signal processing. Time-frequency methodologies have made significant inroads already in these fields; their usefulness is demonstrated in five articles with appropriate internal cross-referencing to this and other chapters. A baseband Doppler radar return from a helicopter target is an example of a persistent non-stationary signal. A linear time-frequency representation provides a high resolution suitable for preserving the full dynamic range of such complicated signals (Article 14.1). The synthetic aperture principle allows the combination of range resolution, achieved by the use of linear FM signals, with cross-range. For long observation intervals, the phase cannot be assumed to be a linear function of time; then time-frequency based imaging can obtain improvements in focus of the synthetic-aperture image (14.2). When a propeller-driven aircraft or a helicopter passes overhead, it produces a Doppler effect which allows the estimation of flight parameters by using IF estimation and the interference patterns formed in TFDs (14.3). To track a theater ballistic missiles launch, the WVD can be used effectively. Its peak provides a direct estimate of the instantaneous Doppler law giving the accelerating target dynamics (14.4). In sonar, there is a clear rationale for using time-frequency processing of returns to provide useful information about targets such as ships (14.5).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
577
578
14.1 14.1.1
Chapter 14:
Time-FrequencyMethods in Radar, Sonar & Acoustics
SPECIAL TIME-FREQUENCY ANALYSIS OF HELICOPTER DOPPLER RADAR DATA~ Dynamic Range Considerations in TF Analysis
The classical technique for characterizing the time evolution behavior of sampled nonstationary signals has been the short-time Fourier transform (STFT), a timefrequency analysis technique that uses linear operations on sampled data. With momentary nonstationary signals of short duration, one often wishes to improve the time-vs-frequency resolving detail over that achievable with the STFT. Timefrequency distributions (TFDs) that use quadratic operations on sampled data, such as the Wigner-Ville distribution (WVD), are often able to achieve this improvement. For finite duration data records, this improvement in practice can be shown graphically to be approximately a factor of two in the time or frequency sharpening of the TFD response. However, this improvement is achieved in the presence of significant cross-term artifacts and with the sacrifice of detectable dynamic range (DNR). The cross terms, which are generated by multi-component signals (more than one signal present at an instant of time), additive noise, and analog-to-digital converter quantization effects, often obscure relevant signal components, particularly much weaker signal components (see Article 4.2). Numerous filtering and smoothing approaches have been developed to mitigate these cross-term artifacts, but this is always achieved at the expense of degrading the time-frequency sharpness of quadratic TFDs. Thus, quadratic TFDs tend to work best in mono-component signal situations. The DNR between strongest and weakest signal components in practice can be quite significant. For example, modern radar systems, sonar systems, and medical Doppler ultrasound cardiovascular imaging systems can have DNRs that exceed 70dB, 50dB, and 100dB, respectively. Signals from these systems almost always have multiple signal components present. Thus, quadratic TFD techniques are not good candidates for analyzing signals from such systems due to obscuring crossterm artifacts and dynamic range degradation. As this article will illustrate, it is still possible to achieve at least a factor of two improvement in the TFD response sharpness, without incurring the degradation caused by cross-term artifacts, by exploiting a pair of special two-dimensional (2-D) data arrays: the windowed data function (WDF) and the complex WDF transform (CWT). Application of variants of high-resolution 2-D spectral analysis techniques to the CWT will generate the desired high resolution TFD capability sought in quadratic TFDs, but without the cross-term artifacts.
14.1.2
Classical Linear and Quadratic TFDs
The baseline technique for comparing all time-frequency analysis methods is the STFT linear TFD. If x(t) is the signal to be analyzed, define the short-time win~ S. L a w r e n c e M a r p l e Jr., ORINCON Corporation, 9363 Towne Centre Drive, San Diego, California 92121 (lmarple@orincon.com). Reviewers: Joel M. Morris and Emin Tagluk.
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
dowed signal
xh(t, r) = x(T)h*(J
-
579
(14.1.1)
t),
in which h(T) is the analysis window centered about 7 = 0, * denotes complex conjugation, and t represents the center time of the analysis interval on x(t). The Fourier transform of the short-time windowed signal is therefore
Fh~(t,f)- ~ {Xh(t,T)}-r---~f
//
Xh(t,T) exp(--j27rfr)dT
.
(14.1.2)
oO
which is the classical STFT. The localized STFT spectrum, or spectrogram, is simply the magnitude of the STFT
sh(t, f) - IFh(t, f)l 2
(14.1.3)
which is then plotted to form the 2-D TFD gram. Trading off time-frequency concentration vs the resolution is achieved by the selection of the analysis window shape and duration. The classical quadratic-based TFD method is the Wigner-Ville distribution W~(t, f) [1]. Most alternatives to the WVD are typically derivatives in which weighting or smoothing kernels are added to the functional definition of the quadratic TFD to better handle finite duration signal records or to suppress cross-term artifacts (see Chapter 3 and Article 5.7). As explained in Section 3.2.1, the WVD is linked to three companion quadratic functions: the instantaneous autocorrelation function (IAF) Kx(t, T), the spectral correlation function (SCF) k=(u, f), and the complex ambiguity function (CAF) A=(u, T). The WVD can alternatively be computed as (1) the one-dimensional forward transform of the IAF, (2) the one-dimensional inverse transform of the SCF, or (3) the double transform of the CAF as follows
Wx(t,f) -
jz {K~(t, T)} r---' f
x(t+T/2) x*(t--T/2) e-J2'~f'dT
(14.1.4)
x(f+./2) x*(f-./2)e+~2~"~d~
(14.1.5)
Am(,, 7) eJ2~[Ir-~tldu dT-.
(14.1.6)
O0
= S {kx(., f)} -
////
oo
=
~
)r {Am(-, T)} -
V----+t T---+f
~
in which X ( f ) is the Fourier transform of x(t), t is the "localized" center time, r is the relative separation (shift) time displacement, f is the "localized" center frequency, and u is the relative separation (shift) frequency displacement. More details on these relationships may be found in Part I. Fig. 14.1.1 depicts the key mathematical definitions and relationships among these four quadratic functions. Fig. 14.1.1 also graphically depicts the support regions of the quadratic functions for a finite-duration signal of N samples x(nAT) = x[n] at AT second intervals. Note that the temporal and spectral correlation functions have diamond-shaped support regions with time-shift and frequency-shift axes, respectively, that are twice the
580
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
Flow diagram and key relationships and support domains among the four quadratic representations: Wigner-Ville distribution, complex ambiguity function, instantaneous autocorrelation function, and spectral correlation function. Fig. 14.1.1:
original signal time-width and its transform band-width, due to the correlation processes used to form the IAF and SCF. Having twice the temporal or spectral extents will yield, after transformation, WVD responses that are twice as sharp relative to the original signal's time extent or bandlimited frequency extent. Note that the temporal and frequency resolutions of the WVD are greatest at t = 0 and f = 0 and decrease as one advances to the outer time and frequency edges of the IAF or the SCF diamond-shaped regions of support. In order to mitigate the deleterious effects of cross terms, a number of optional smoothing and filtering operations, marked in Fig. 14.1.1 as kernels g, G, 7, G, have been developed to suppress the cross-term effects (see Articles 5.2 and 6.1 for more details). However, suppressing the cross terms will also degrade the time-frequency concentration; therefore, all kernel designs involve a tradeoff between minimizing
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
581
cross-term artifacts and maximizing time-frequency resolution 9
14.1.3
Alternative High-Resolution Linear TFD
Let us revisit the 2-D S T F T and develop a new perspective for creating the 2-D S T F T spectrogram from a I-D data record. Based on this new perspective, we will create an alternative 2-D matrix array using linear transform operations on data samples from which an opportunity to apply high-resolution 2-D spectral analysis procedures is generated. If one assumes a finite-width analysis window of M samples for a case of finite-duration data of N samples, and further assumes the analysis window center time is stepped through the data one sample at a time (multiple sample steps are also possible but not discussed here), then the operations to be performed on the data will start with the data matrix ,
9..
o .
.
o
x[i]
x[~]
~[2]
0
x [ ( M - 1)/2]
x[1] 0
x(t,~)
-
."
x[1]
x[NI 9 "
x [ N - ( M - 1)/2]
x[N]
0
.-
:
0
x[N-~]
z[N]
.
x[N]
0
...
0
(14.1.7)
0 ...
0
in which the analysis center time t = m A T corresponds to the row index m and the data samples within an analysis window interval are all the elements along a selected row, indexed by their column time ~- = nAT. Note that the above data matrix has N + M - 1 rows corresponding to the center times ranging from t = - ( M + 1)/2 to N + (M + 1)//2, which requires zero fill when the analysis window runs off the ends of the available data. Other ranges along the vertical axis are also possible to define such that zero fill is not required. Noting the analysis window values as h[1] to h[M], the resultant windowed data function (WDF) is obtained element-by-element as xh(t, ~-) - x(~-)h* (~- - t)
.
(14.1.8)
to form the W D F matrix H(t, 7) shown in Fig. 14.1.2. The S T F T is then formed by taking the one-dimensional Fourier transform of each row of the W D F
Fh(t, f) -- ~ {H(t, T)} r---,
f
(14.1.9)
582
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
Fig. 14.1.2: Flow diagram of key relationships among windowed data function (WDF), complex WDF transform (CWT), short-time Fourier transform (STFT), and short-time Fourier transform spectrogram. Insertion of alternative 2-D spectral technique possibilities is indicated.
as indicated in Fig. 14.1.2. Although counterintuitive, one could alternatively take the Fourier transform of each column (rather than each row) of the WDF to produce the complex WDF transform (CWT) C(v, T) = ,1: {H(t, T)} (14.1.10) as shown in Fig. 14.1.2. It may seem computationally disadvantageous to form the CWT, but the motivation is found in the similarity of the CWT to the CAF of Fig. 14.1.1. A double transform of the CWT yields the STFT as shown in Fig. 14.1.2, just as a double transform of the CAF yields the WVD in Fig. 14.1.1. By computing the CWT, we now open the possibility of creating a TFD gram via an alternative high-definition 2-D spectral analysis technique in lieu of the squared-magnitude STFT spectrogram. Because no cross terms are created in the formation of the CWT, there will be no cross-term artifacts introduced into the 2-D spectral analysis results. Examples of high-definition 2-D spectral estimation procedures that may be
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
583
applied here include the 2-D autoregressive [2] and 2-D minimum variance techniques [3]. The 2-D data set that is used as the input to these methods is the CWT array. The final spectral analysis formula for generating the time-vs-frequency analysis gram will be a modification of the normal 2-D spectral analysis formula in order to account for the time-vs-frequency units of the CWT (rather than time-vs-time). For example, the 2-D autoregressive spectral formula is modified to have the structure STFAR(t, f ) --
1
I~-~m~-~ a,,r[m, n] exp(-j27~[-mAFt + nATf])] 2
(14.1.11)
in which the two-dimensional autoregressive parameters a,,r[m,n] are estimated from the CWT array elements, AT is the time-increment along the CWT rows, A F is the frequency-increment along the CWT columns, t represents the center time of the analysis evaluation, and f represents the center frequency of the analysis evaluation. Note the opposite signs of the time and frequency variables in the above equation, which contrast with the usual 2-D formulae that have the same sign. The modified 2-D spectral procedures are more appropriately termed the 2-D time-vsfrequency autoregressive (TFAR) and the 2-D time-vs-frequency minimum variance (TFMV) methods for TFD gram creation from the CWT array. Fast computational algorithms for 2-D autoregressive parameter estimation may be found in Marple [2,4] and for the 2-D minimum variance spectral function in Jakobsson et al. [3].
14.1.4
Application to Simulated and Actual Data
To illustrate the benefit of the new 2-D high-resolution approach to time-frequency analysis, consider the various TFDs shown in Fig. 14.1.3 for a simulated complexvalued waveform (has in-phase I and quadrature-phase Q components) of 256 samples consisting of two criss-crossing chirp (linear frequency modulated) signals and a sinusoidal FM modulation. The traditional stationary processing approach would be to form the assumed stationary correlation estimate using the entire 256-point data record, followed by Fourier transformation of this estimated correlation to create the long-term Fourier transform (LTFT) spectrum shown at the bottom of Fig. 14.1.3(a) (note that a logarithmic scale in dB is used to capture the full dynamic range of the spectral response). The I and Q temporal plots of the signal are drawn on the left and right sides of Fig. 14.1.3(a), respectively. Note that the LTFT spectral shapes of the criss-crossing chirps and the sinusoidal FM component are essentially the same, and thus the two signals are almost indistinguishable if the LTFT alone is used for signal classification.The three signals are distinguishable when the short-term Fourier transform spectrogram is used, as illustrated in Fig. 14.1.3(a) using a color-coded linear plot and in Fig. 14.1.3(c) using a log plot of the top 50 dB. The nonstationary frequency-varying character of the three signal components is now apparent. The Wigner-Ville TFD estimate is plotted with linear units in Fig. 14.1.3(b) and with log units in Fig. 14.1.3(d). The promise of the WVD to increase sharpness in the time-vs-frequency localization is apparent, but this has
584
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
Fig. 14.1.3" Comparison of various time-frequency analysis techniques to extract the 2-D localized nonstationary features of a signal with three components: two criss-crossing chirps and one FM modulated waveform.
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
585
been achieved at the price of introducing significant cross-term artifacts, especially when emphasized by the logarithmic gram of Fig. 14.1.3(d). Four alternatives to S T F T and WVD characterizations of the nonstationary behavior of the three signals are illustrated in Figs. 14.1.3(e-h). These include the TFAR and TFMV. These last two alternatives produce results as sharp as those obtained with WVD without the cross-term artifacts. Consider the three TFD grams of Figs. 14.1.4-14.1.6 produced from a 330ms record of baseband Doppler radar returns from a helicopter target. This is a case of a persistent nonstationary signal, in contrast with the more typical momentary nonstationary signals that characterize much of the time-frequency analysis literature. We shall use a sliding analysis window in this case, computing a single line of the TFD with each increment of the sliding analysis window. The helicopter target illuminated by the X-band CW radar was a two-engine Eurocopter Deutschland BO-105, as depicted in the inset of Fig. 14.1.4. The radar signal consisted of samples of the complex-valued (I/Q) baseband (carrier frequency demodulated down to 0 Hz) waveform; the temporal waveform samples are plotted as vertical displays on the left and right gram sides in Figs. 14.1.4-14.1.6. The helicopter is in motion, so the radar signal will have Doppler components due to the forward motion of the fuselage, the main rotor and tail rotor rotations, and multipath components between the fuselage and rotors. A LTFT spectral estimate of the entire 330 ms data record produced the spectrum plotted beneath each TFD gram in Figs. 14.1.4-14.1.6. Note that there is at least 70 dB of dynamic range between strongest and weakest signal components in this radar signal. Fig. 14.1.4 is the result of a classical S T F T analysis of the data record which has been adjusted to use sliding 128-sample Hamming analysis windows to best capture the most rapidly-varying Doppler components (main rotor) of the radar signal. Note that positive frequencies represent Doppler components moving toward the radar and negative frequencies represent Doppler components moving away from the radar in this baseband signal. Also note the color coding assignments of the logarithmic dynamic range that is is displayed; approximately the top 70 dB of each T F R gram is displayed. The dominant signal component is the +2.8 kHz line that represents the Doppler component off the helicopter fuselage due to the constant velocity motion of the helicopter toward the radar. Other features that can be seen in the S T F T gram of Fig. 14.1.4 include the sinusoidal patterns bounded between -12kHz and +18 kHz that correspond to the Doppler pattern of the four main rotor blades and the periodic broadband horizontal "flashes" corresponding to the periodic alignment of the main or tail rotors to maximally reflect the radar beam. Spectral lines at +500 Hz also appear due to a 500 Hz modulated time tone that was added to the data tape. Using the same sliding 128-sample analysis window increments as the STFT, the WVD gram shown in Fig. 14.1.5 was produced. A Choi-Williams smoothing filter was employed using a exponential parameter factor of 0.3 to mitigate the cross-term artifacts. Although there is an improved sharpness relative to the S T F T spectrogram, the cross-term mitigation is insufficient as these terms are still much
586
Chapter 14:
Time-Frequency Methods in Radar, Sonar gz Acoustics
Fig. 14.1.4" Classical short-time Fourier transform (STFT) time-frequency analysis of 330 ms of complex (I/Q) baseband Doppler radar signal of a helicopter target in motion. Profile of B0-105 helicopter target is shown in lower right inset.
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
587
Fig. 14.1.5' Time-vs-frequency gram of Doppler radar signal using the Wigner-Ville distribution with sliding window and complex ambiguity function weighting using a Choi-Williams kernel.
588
Chapter 14:
Time-Frequency Methods in Radar, Sonar 8z Acoustics
Fig. 14.1.6: Time-vs-frequency gram of Doppler radar signal using the two-dimensional minimum variance method with sliding window.
Special Time-Frequency Analysis of Helicopter Doppler Radar Data
589
stronger than the weaker Doppler components that we are attempting to detect and to track. Fig. 14.1.6 is the T F D gram resulting from use of the T F M V technique.
14.1.5
Summary and Conclusions
The alternative high-resolution methods presented in this article clearly are capable of preserving the full dynamic range of features in complicated signals being analyzed while achieving sharp time-frequency responses comparable to those found in the WVD gram. These attributes are achieved without the artificial introduction of cross-term artifacts typical of some of the quadratic T F D methods.
References [1] T. A. C. M. Claasen and W. F. G. Mecklenbr~iuker, "The Wigner Distribution--A tool for time-frequency signal analysis," Philips J. of Research, vol. 35, pp. 217-250 (Part 1), 276-300 (Part 2) & 372-389 (Part 3), 1980. [2] S. L. Marple Jr., "Two-dimensional lattice linear prediction parameter estimation method and fast algorithm," IEEE Signal Processing Letters, vol. 7, pp. 164-168, June 2000. [3] A. Jakobsson, S. L. Marple Jr., and P. Stoica, "Computationally efficient twodimensional Capon spectrum analysis," IEEE Trans. Signal Processing, vol. 48, pp. 26512661, September 2000. [4] S. L. Marple Jr., Digital Spectral Analysis with Applications. Englewood Cliffs, N J: Prentice-Hall, 1987.
590 14.2
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
TIME-FREQUENCY MOTION COMPENSATION ALGORITHMS FOR ISAR IMAGING o
The synthetic aperture principle is now a well established concept and has given rise to several applications in high resolution microwave imaging. Among the most impressive applications is the imaging of Venus' surface provided by the synthetic aperture radar (SAR) aboard the spacecraft Magellan (see e.g. h t t p : / / w w w , j p l . n a s a . g o v / m a g e l l a n ) . Venus' surface could not be observed by optical telescopes because of the thick layers of clouds covering the planet. It was thus necessary to use lower frequency waves to penetrate the clouds, but this would have come at the expense of resolution. The radar aboard the Magellan orbiter made it possible to send and receive electromagnetic waves through Venus' atmosphere and still maintain good resolution: the range resolution was achieved by transmitting a wideband impulsive linear FM (chirp) signal, whereas the cross-range resolution was obtained by exploiting the synthetic aperture principle. The so-called s y n t h e t i c a p e r t u r e is formed by gathering the echoes collected by the radar platform during the interval in which the radar antenna beam insists on a specific site, and processing them coherently. The instantaneous phases of the echoes are compensated as to mimic the presence of a real antenna gathering the same echoes. If the phase compensation is perfect, we can consider the compensated signal as gathered by a synthetic aperture whose along-track length is equal to the distance traveled by the radar during the time interval within which each point on the ground is illuminated by the radar antenna. Clearly the relative motion between radar and observed scene, or target, 1 is fundamental to synthesizing an aperture wider than the real aperture. The synthesis of the wide aperture requires an accurate knowledge of the temporal evolution of the phase shift induced by the relative radar/target motion. In formulas, if the radar sends an unmodulated sinusoid of frequency f0, the echo from a point-like target whose distance from the radar is r(t) assumes the form y(t) -- a ( t ) e j2rrf~
-- a(t ) E j27rf~ e -j47rr(t)/A,
(14.2.1)
where c is the speed of light, A = c / f o is the transmission wavelength, and a(t) is the amplitude modulation. In the imaging of stationary scenes observed by a radar carried on a spacecraft or an aircraft, the variation of r(t) is entirely due to the radar and an initial coarse estimation of r(t) comes from the navigational system of the vehicle carrying the radar. However, in applications where the observed target is also moving, as in the so called Inverse SAR (ISAR), the distance r(t) is not known at the radar site. Furthermore, also in cases where the target is not moving, the accuracy provided by the ~ Sergio Barbarossa, INFOCOM Department, University of Rome "La Sapienza", Via Eudossiana 18, 00184 Rome, Italy (sergio@infocom.uniromal.it). Reviewers: M. Mesbah and G. R. Putland. : 1In this article, the "target" means the object of the radar imaging.
Time-Frequency Motion Compensation Algorithms for ISAR Imaging
591
navigational instruments may be insufficient, especially for high resolution imaging. In all these cases, it is necessary to estimate the phase history r := 47rr(t)/)~ directly from the received data [1]. This requires the presence of prominent scatterers on the target, whose radar echoes are sufficiently stronger than the background to allow a reliable estimate. The situation is complicated, however, when more dominant scatterers occupy the same range cell, as is likely in many practical circumstances. In fact, if no a priori knowledge of the relative motion is available, it is safer not to use a high range resolution at the beginning, simply to avoid the range migration problem. But the more we decrease the range resolution, the higher is the probability of observing more dominant scatterers in the same range cell. In these cases, as proposed in [2], an iterative procedure can be followed, based on the following steps: (i) start using low range resolution data, for example by smoothing along the range direction, to avoid the range resolution problem; (ii) estimate the instantaneous phase of the dominant scatterer; (iii) estimate the relative motion law r(t) of the dominant scatterer from the instantaneous phase; (iv) use the estimated motion law to compensate for the range migration occurring in the high range resolution data (hence the dominant scatterer automatically becomes the image center); (v) analyze the full range resolution data after compensation of the range migration. Clearly step (i) increases the probability of having more dominant scatterers in the same range cell and, if the observed signal is really multi-component, the analysis becomes more complicated because we must separate the components corresponding to each echo before estimating their instantaneous phases. The separation is simple if the components have linear phase. In such a case, in fact, an FFT-based approach may be sufficient. However, especially when long observation intervals are used to obtain high cross-range resolution, the phase cannot be assumed to be a linear function of time. Time-frequency representations of the observed signals thus provide an important analysis tool in such cases. In this article we illustrate a method for separating the signal components and estimating their instantaneous phases using the so called Smoothed Pseudo Wigner-Ville Distribution with Reassignment (RSPWVD), described in Article 7.2 and originally proposed in [3], followed by a parametric estimation method. Intuitively speaking, the time-frequency analysis is used to get an initial idea about the kind of received signals. Based on this preliminary analysis, we can assume a parametric model and then use parametric estimation methods to improve the estimation accuracy. One possible method for combining T F D s and parametric modeling for retrieving the parameters of multi-component signals, with instantaneous frequencies (IFs) modeled as sinusoids or hyperbolas, was suggested in [4]; but clearly many alternatives can be followed to optimize the performance of the estimation method.
14.2.1
Echo from a Rotating Rigid Body
We assume that the target is a rigid body in the far field of the radar antenna and is characterized by a certain number of dominant scatterers. For a transmission
592
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
frequency fo = c / A , the echo from the k-th scatterer is k = 0,..., K - 1
A k e j2rrf~176
(14.2.2)
where ro is the vector indicating the radar position, r~ (t) indicates the k-th scatterer and K is the number of scatterers. Under the far field hypothesis, the echo can be approximated as A k e j27rf~ e -j4~rR~ e j47ri'~ (14.2.3) where R o = Irol. The motion of a rigid body can always be expressed as the composition of the translation of one of its points plus the rotation of the body around that point. Every imaging or classification procedure must apply some kind of motion compensation and, in general, the translational motion is compensated first. This operation is performed by multiplying the radar echo by a reference signal matched to the echo from one dominant scatterer, and resampling the data in range to remove any range migration of the scatterer assumed as a reference. Taking as a reference the echo from the 0-th scatterer in (14.2.3) (setting k = 0 and A0 = 1), the signals after compensation take the form k -- 1 , . . . , K - 1,
A k e j47ri'~
(14.2.4)
where q k ( t ) "-- r'k(t ) -- r'o(t ). Under the rigid-body constraint, the vectors q k ( t ) can only rotate and the rotation matrix is the same for all points belonging to the target. The differential equation characterizing the rotation of the generic vector q k ( t ) is dqk(t) dt
(14.2.5)
= w(t) x qk(t)
where w ( t ) is the vector containing the instantaneous pitch, roll and yaw pulsations ( w p ( t ) , w~(t), w y ( t ) ) , and x denotes the vector (cross) product. Assuming a constant pulsation, i.e. w ( t ) -
w , and indicating by ~ its modulus, i.e. ~t - Cw 2 + w2 + W2~
and given an initial position qk(O) = qk of the vector at time to = 0, the solution of equation (14.2.5) is q k ( t ) - a k + bk cos(at) + g-~ sin(at)
(14.2.6)
where the vectors a k , bk, ck are a k --
(w.q~) ~2 w ,
bk -- qk --
(w.q~) ~2 w ,
Ck = W X qk.
(14.2.7)
Hence the echo from the generic k-th scatterer is - A k e j4~(mk co~(~t+r
Ak ej4~i~~
(14.2.8)
where m k --
j(
i'o.bk) 2 +
~2
,
Ck -- arctan
ai'o.bk
,
a k -- i ' o . a k .
(14.2.9)
Time-Frequency Motion Compensation Algorithms for ISAR Imaging
593
It is important to notice that the instantaneous phase of each echo contains a constant term plus a sinusoidal contribution having the same frequency for all the scatterers, but different amplitudes and initial phases. This is a consequence of the rigid-body constraint.
14.2.2
Signal Analysis based on Time-Frequency Representations
The general model for the frequency modulation induced by the relative radartarget motion can always be decomposed into the sum of a slow component, well approximated by a low-order polynomial, plus a possible fast component having a sinusoidal behavior. The echo can then be modeled as Sg(t) -- A c j
~M=~a.m,t" cJflcos(Ftt+O) ~_ w(t),
(14.2.10)
where w(t) is additive noise. The slow component is mainly due to the translation; the fast component depends on the rotation. If the sinusoidal component has a period (T = 27r/f~) much longer than the duration (To) of the observation interval, it can also be approximated as a low-order polynomial, so that the overall signal can be modeled as a polynomial-phase signal (PPS). Conversely, if T is much less than To, it is better to estimate the parameters of the sinusoidal component without any polynomial modeling. As the velocity of variation is not known a priori, it has to be estimated from the data using, at least initially, a non-parametric approach. We used, as a preliminary tool, the Reassigned Smoothed Pseudo Wigner-Ville Distribution (RSPWVD) for its good localization and low cross-terms (see Article 7.2). As an example, in Fig. 14.2.1 we report the R S P W V D of the echo from a rotating object with two dominant scatterers having the same backscattering coefficients. The two components are clearly visible and show the same period. We can use the RSPWVD to extract an initial estimate of the period ft and then exploit the parametric modeling, as shown in the following section. One possibility is to extract the peaks of the TFD, as in [5] and Articles 10.3 and 10.4, and then estimate the modulating frequency of the IF.
14.2.3
Parametric Estimation of Instantaneous Phases
With the aid of the echo modeling illustrated in Section 14.2.1, we can improve the performance of the estimation method over that obtainable from the simple use of the RSPWVD. In principle, considering signals expressed as in (14.2.8), we could smooth the WVD by integrating it over all possible sinusoids in the time-frequency plane, as a function of the sinusoidal parameters, using for example a generalized Hough Transform as in [4]. However this operation would be quite troublesome from the computational point of view. In fact such an operation is almost equivalent to computing the square modulus of the scalar product of the received signal and the signal model (14.2.8), with Ak = 1. Hence the detection and estimation of FM signals satisfying (14.2.8) can be carried out by searching for the peaks of the
594
Chapter 14:
Time-FrequencyMethods in Radar, Sonar & Acoustics
Fig. 14.2.1: Reassigned smoothed pseudo-Wigner-Ville distribution of the sum of two echoes from a
rotating object.
function
N-1 P(m.
r
=
n--0
(14.2.11)
x(n) e -j4 m
where N is the number of samples: if a peak exceeds a suitable threshold, a sinusoidal FM signal is detected and its parameters are estimated as the coordinates of the peak. Of course the straightforward application of (14.2.11) is also quite troublesome from the computational point of view. Observing, however, that all signal components have the same pulsation ~, we can estimate ~t from the RSPWVD and then use the estimated value ~ in (14.2.11) to compute the 2D function r
=
N-I E x(n)e-J4~mc~162
12"
(14.2.12)
n-O
An example is shown in Fig. 14.2.2 for the case where three sinusoidal FM signals occupy the same range cell. Fig. 14.2.2 shows the function P(m, r given in (14.2.12). We can clearly observe the presence of three peaks. Comparing P(m, r with a suitable threshold, we detect the presence of dominant scatterers and estimate, at the same time, their modulation parameters rnk and r as the coordinates of the peaks which exceed the threshold. A further simplification of the proposed procedure consists in analyzing time intervals smaller than the rotation period. In these sub-intervals, the instantaneous phases can be approximated by polynomials (the first terms of their Taylor series expansions). In this case we can use specific algorithms devised for the detection and parameter estimation of multicomponent polynomial phase signals embedded in noise, based on the so called high order ambiguity function [6].
Time-Frequency Motion Compensation Algorithms for ISAR Imaging
Fig. 14.2.2: P(m, r
595
of a three-component sinusoidal FM signal.
Fig. 14.2.3: Example of imaging: (a) Image defocused with a third-order polynomial instantaneous phase; (b) Focused image.
An example of application to real SAR images is given in Fig. 14.2.3, where we show a SAR image defocused by a third-order polynomial (left side) and the image focused by estimating the polynomial parameters and then compensating the instantaneous phase (right side). The time-frequency analysis coupled with the parametric method is especially important when the illuminated spot contains moving targets. In such a case, the conventional SAR processing would provide a focused image of the stationary background and a defocused image of the moving object. Indeed, the only way to discriminate moving targets echoes from the background is to analyze their instantaneous frequencies: a different motion law must give rise to a different frequency modulation. Some examples of applications of this idea were shown in [7].
596
14.2.4
Chapter 14:
Time-FrequencyMethods in Radar, 5onar & Acoustics
Summary and Conclusions
Estimating the frequency modulations of the radar echoes is a fundamental step in the formation of high-resolution images. The tool is especially important in the presence of targets moving on a stationary background, where the only possibility for discriminating moving from fixed targets involves the analysis of their instantaneous frequencies. As the observed signal often comes from multiple scatterers, the availability of TFDs with good localization and low cross-terms is particularly important. The other key idea is that, given a time-frequency representation of the signal of interest, we may derive a parametric model of the signal and then use parametric estimation techniques to improve the estimation accuracy.
References [1] W. G. Carrara, R. S. Goodman, and R. M. Majewski, Spotlight Synthetic Aperture Radar. Norwood, MA: Artech House, October 1995. [2] S. Barbarossa, "Detection and imaging of moving objects with synthetic aperture radarwPart 1: Optimal detection and parameter estimation theory," IEE Proc., Part F: Radar ~ Signal Processing, vol. 139, pp. 79-88, February 1992. [3] F. Auger and P. Flandrin, "Improving the readability of time-frequency and time-scale representations by the reassignment method," IEEE Trans. Signal Processing, vol. 43, pp. 1068-1089, May 1995. [4] S. Barbarossa and O. Lemoine, "Analysis of nonlinear FM signals by pattern recognition of their time-frequency representation," IEEE Signal Processing Letters, vol. 3, pp. 112115, April 1996. [5] B. Barkat and B. Boashash, "Instantaneous frequency estimation of polynomial FM signals using the peak of the PWVD: Statistical performance in the presence of additive Gaussian noise," IEEE Trans. Signal Processing, vol. 47, pp. 2480-2490, September 1999. [6] S. Barbarossa, A. Scaglione, and G. B. Giannakis, "Product high-order ambiguity function for multicomponent polynomial-phase signal modeling," IEEE Trans. Signal Processing, vol. 46, pp. 691-708, March 1998. [7] S. Barbarossa and A. Scaglione, "Autofocusing of SAR images based on the product high-order ambiguity function," IEE Proc.: Radar, Sonar ~ Navigation, vol. 145, pp. 269273, October 1998.
Flight Parameter Estimation using Doppler and Lloyd's Mirror Effects
14.3 14.3.1
FLIGHT PARAMETER ESTIMATION A N D LLOYD'S M I R R O R E F F E C T S ~
597
USING D O P P L E R
Acoustical Doppler Effect
In 1842, Doppler predicted the change in the frequency of a wave observed at a receiver whenever the source or receiver is moving relative to the other or to the wave propagation medium. The effect was first verified for sound waves by BuysBallot in 1845 from experiments conducted on a moving train. Nowadays, more striking examples of the acoustical Doppler effect can be found in the time-frequency distributions (TFDs) from acoustic sensors, whether in air or under water, when a propeller-driven aircraft or a helicopter passes overhead. The dominant feature in the acoustic spectrum of a turboprop aircraft is the spectral line corresponding to the propeller blade-passage frequency. In the case of a helicopter, the dominant line corresponds to the main (or tail) rotor blade-passage frequency. This line of constant frequency represents the signal emitted by the source. During an aircraft transit, the instantaneous frequency (IF) of this signal, when received by a stationary sensor, is observed to change with time due to the acoustical Doppler effect.
14.3.1.1
Time-Frequency Model: Microphone in Air
Consider an acoustic source moving along a straight line at a constant subsonic speed v so that at time Tc it is at the closest point of approach (CPA) to a stationary acoustic sensor at a separation distance (or slant range) of R c . Suppose that the source emits an acoustic tone of constant frequency f0 and that the isospeed of sound propagation in air is Ca. The IF of the signal received by the sensor at time t is given by [1] fa(t)
= Ol -}- ~ p ( t ;
(14.3.1)
7c, S)
where a
foC2a/(C2a - v 2) -- f o C a V / ( C
2 -- V2)
Rc
v
s p ( t ; T~, s )
(14.3.2)
--
-
-
(14.3.4)
?2Ca
t - 7~
+ (t-
(14.3.3)
.
(14.3.5)
Examples of time-frequency (t, f a ) curves computed by (14.3.1-14.3.5), with NLS-optimized parameters, are shown as solid lines in Fig. 14.3.1. ~ B r i a n G. F e r g u s o n and K a m W . Lo, Defence Science and Technology Organisation, P.O. Box 44, Pyrmont, NSW 2009, Australia (Brian.Ferguson@dsto.defence.gov.au, Kam.Lo@dsto.defence.gov.au). Reviewers: G. J. Frazer and M. Mesbah.
Chapter 14:
598
Time-Frequency Methods in Radar, Sonar & Acoustics
Fig. 14.3.1: (a) Variation with time of the instantaneous frequency estimates (circles) of the signal received by a microphone during the transit of a turboprop aircraft, and the nonlinear least squares (NLS) fit (solid line) to the observations. The actual and estimated values of the source parameters are tabled at the top of the diagram. (b) Similar to (a), but for the transit of a helicopter.
14.3.1.2
Time-Frequency Model" Hydrophone in Water
Now consider an airborne source traveling with a constant subsonic speed v at a constant altitude h in a constant direction so that its flight path passes directly over an acoustic sensor located at a depth d below the sea surface. Due to the propagation delay, the signal emitted by the source at time T (the source time) arrives at the sensor at a later time t (the sensor time), which is given by [2, 3] t - T+
la(T) Ca
lw(T)
+ ~
Cw
= T+
h Ca COS
+
__
d
(14.3.6)
Cw ~r - (cw ~Ca)2 sin 2 Ca (T)
where ca and cw are the respective isospeeds for sound propagation in air and water, Ca(T) is the angle of incidence (in air) at source time T, and la(T) and lw(T) are the respective travel distances in air and water. Assume the source emits an acoustic tone with a constant frequency f0. The IF of the signal received by the sensor at time t is given by [2, 3]
dT fo fw(t) -- fo d-~ - 1 + vsinr
fo = 1 + vsinr
(14.3.7)
where r is the angle of refraction (in water) at source time T. Note t h a t (14.3.6) and (14.3.7) constitute a set of parametric equations in Ca(T). Given the
Flight Parameter Estimation using Doppler and Lloyd's Mirror Effects
599
Fig. 14.3.2: (a) Variation with time of the instantaneous frequency estimates (circles) of the signal received by a hydrophone during the transit of a turboprop aircraft, and the nonlinear least squares fit (solid line) to the observations. The actual and source parameters are tabled at the top of the diagram. (b) Similar to (a), but for another transit of the turboprop aircraft.
source parameter set { f0, v, h, 7c } and sensor depth d, there is no explicit analytical expression for the IF fw as a function of the sensor time t. However, a theoretical curve of fw as a function of t can be obtained indirectly by first specifying the angle of incidence Ca(T) and then calculating the corresponding values for fw and t. Only acoustic ray paths for which the angle of incidence is less than the critical angle of incidence r will have (some) acoustic energy transmitted across the airsea interface into the underwater medium; i.e., ICa(~-)l must be less than the critical angle of incidence r which is equal to sin-l(ca/Cw). Ray paths with angles of incidence exceeding the critical angle have all their acoustic energy reflected from the sea surface. Examples of time-frequency (t, fw) curves computed by (14.3.6) and (14.3.7), with NLS-optimized parameters, are shown as solid lines in Fig. 14.3.2.
14.3.2
Acoustical Lloyd's Mirror Effect
When a broadband acoustic source (like a jet aircraft) is in motion, the T F D of the output of a sensor located above the ground displays a pattern of interference fringes known as the acoustical Lloyd's mirror effect. Fig. 14.3.3(a) shows a simulated T F D of the acoustic energy at the sensor output during the transit of an airborne source of broadband sound in an ideal environment. The resultant sound field at the sensor is the sum of the direct and ground-reflected sound fields. The reflected path is
600
Chapter 14:
-lime-Frequency Methods in Radar, Sonar gz Acoustics
Fig. 14.3.3: (a) Simulated time-frequency distribution of a microphone's output for the transit of an airborne broadband acoustic source in an ideal environment. (b) Spectrogram of real data recorded from a microphone's output during a jet aircraft transit. (c) Image enhancement of (b) through preprocessing. (d) Predicted paths (thinner lines) of 1st and 2nd destructive-interference curves using NLS method superimposed on observed paths (thicker lines) from (c).
longer than the direct path between the source and the sensor, and the path length difference results in a phase difference between the two fields. At certain frequencies, the phase difference is of the correct amount to cause destructive interference (or cancellation) of the two sound fields.
14.3.2.1
Destructive-Interference Frequency Model
Consider an airborne source traveling in a straight line with a constant subsonic speed v at a constant altitude ht over a hard ground. The acoustic sensor is located at a height h~ above the ground. The source is at the CPA to the sensor at time 7-~, with the ground range at CPA being dc. Suppose the isospeed of sound propagation in air is Ca. The temporal variation of the nth order destructive-interference
Flight Parameter Estimation using Doppler and Lloyd's Mirror Effects
601
frequency as a function of the four parameters {v~, vt, 7-c, 7} is modeled by [4] f ~ ( t ) _ 2 n - 14
2
c~
C2_V2
]
2(c~ -- v~)+ C~V 2 t2(t -- T~)2 --VrVt (t -- T~)
(14.3.8)
where vr = v / h r , vt = v / h t , 7 = V/1 + ( d c / h t ) 2, and cr = c a / h r . The four parameters {v~, vt, To, 7} are uniquely related to the four flight parameters {v, 7~, ht, dc}. Examples of destructive-interference frequency curves { f n ( t ) : n = 1,2} computed by (14.3.8), with NLS-optimized parameters, are shown as thinner lines in Fig. 14.3.3(d).
14.3.3
Time-Frequency Signal Analysis
Various techniques for estimating the IF of a non-stationary signal have been considered in the literature (see [5] and Chapter 10). A common approach is to locate the peak of a TFD of the signal. In this article, an IF estimate corresponds to the propeller blade rate of a turboprop aircraft (or main rotor blade rate of a helicopter) as observed at an acoustic sensor. Shown in Figs. 14.3.1(a) and 14.3.1(b) are two examples of using the peaks of the spectrogram (short-time Fourier transform) to estimate the temporal variation of the IF (denoted by circles) of the signal received by a microphone located just above the ground during the respective transits of a turboprop aircraft and a helicopter. Shown in Figs. 14.3.2(a) and 14.3.2(b) are two examples of using the peaks of the Wigner-Ville distribution [Section 2.1.4] to estimate the temporal variation of the IF (denoted by circles) of the signal received by a hydrophone located under water during the respective transits of a turboprop aircraft. Similarly, the temporal variation of the destructive-interference frequency of the signal received by an acoustic sensor located above the ground during the transit of an airborne source of broadband sound can be extracted by locating the minima of a TFD of the sensor's output. Shown in Fig. 14.3.3(d) is an example of using the minima of the spectrogram to estimate the temporal variation of the first and second destructive-interference frequencies (denoted by thicker lines) of the signal received by a microphone located above the ground during the transit of a jet aircraft.
14.3.4
Source Parameter Estimation: An Inverse Time-Frequency Problem
The inverse time-frequency problem is to estimate the source or flight parameters from the variation with time of the received signal's IF or destructive-interference frequency. The problem is solved by a nonlinear least-squares (NLS) method.
14.3.4.1
Narrowband Source in Level Flight with Constant Velocity: Microphone in Air
The source parameters { fo, v, 7c, Rc }, or equivalently {c~,fl, Tc, S}, are estimated by minimizing the sum of the squared deviations of the noisy IF estimates from their
602
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
predicted values [1]. Specifically, the NLS estimates of {c~, fl, Tc, S} are given by {&,/~, 7~c,g} = arg
[a' + fl'p(tk; T~c S') -- g(tk)] 2
min {~' ~',~',s'}
(14.3.9)
k--1
where g(tk) is the IF estimate at sensor time t = tk and K is the number of IF estimates. The four-dimensional minimization in (14.3.9) can be reduced to a twodimensional maximization [1]" {?~ g } -
max I Ek--1 K [g(tk) - ~0] p(tk)l 2
arg
K
Ek=l[p(tk)
_
Ek~l
(14.3.10)
-
[g(tk) - ~0]/~(tk)
(14.3.11) (14.3.12)
1 ~-~k p(tk) , P(tk) -- p(tk; "rc, s) , where ~0 - ~1 E k g(tk), p(tk) - p(tk; ~'~c,S'), P - -~ and/~ - ~ ~--~k~(tk). Solving (14.3.2) and (14.3.3) using the estimated values for c~ and ~ gives the estimates of the source speed v and source frequency f0 as
-(fl/&)ca
=
(14.3.13)
f0 - &(1 - ~2/c~).
(14.3.14)
From (14.3.4), the estimate of the CPA slant range Rc is given by -
(14.3.15)
-
The maximization in (14.3.10) is performed using the quasi-Newton method where the initial estimates of 7c and s are given by the method described in [1]. The results of applying the source parameter estimation method to experimental data (represented by the circles) are shown at the top of Figs. 14.3.1(a) and 14.3.1(b). The estimates closely match the actual values of the aircraft's speed, altitude, and propeller or main rotor blade rate.
14.3.4.2
Narrowband Source in Level Flight with Constant Velocity: Hydrophone in Water
Define the cost function
P f ( f ; , v',
I
, r~)
E
[g(tk)
!
foZk(V ' h', T')] 2
(14.3.16)
k
where g(tk) is the IF estimate at sensor time tk, {f~, v', h ! , Tc} ! are the hypothesized source parameters, and f~Zk(V', h', T~c) = fw(tk) is the modeled frequency at sensor time tk, which is computed using the approximate method found in [3]. Estimates
Flight Parameter Estimation using Doppler and Lloyd's Mirror Effects
603
of the source parameters, {f0, 9 h, "~c}, are obtained by finding the set of values of { f~, v', h', ~-c~} that minimizes Pf. It can be shown that {~), h,'~c} - a r g {
max {v',h',,'}
[~-~kg(tk)zk(v" h"T~)]2 h' Ek z~(v', h', T~) ,0 < V' < Ca, ]0 --
> 0
Ek g(tk) Zk(O,h, *c) .
}
(14.3.17) (14.3.18)
Numerical procedures for maximizing (14.3.17) use standard (constrained) optimization methods. The initial estimates for {v, h, To} are calculated using the procedure detailed in [3]. The results of applying the source parameter estimation method to experimental data (represented by the circles) are shown at the top of Figs. 14.3.2(a) and 14.3.2(b). The estimates closely match the actual values of the aircraft's speed, altitude and propeller blade rate. 14.3.4.3
Broadband Source in Level Flight with Constant Velocity: Microphone in Air
In a TFD of the sensor output, a destructive-interference curve is a sequence of connected points on the time-frequency plane with (locally) minimum amplitudes. The positions of these points, corresponding to the temporal variation of a particular order destructive-interference frequency, define the path (or shape) of the destructive-interference curve and their (logarithmic) magnitudes define the intensity of the curve. If the TFD is treated as an image, destructive-interference curves can be identified as dark fringes in the image - see Fig. 14.3.3(a). However, in practice, noise and background variations obscure the exact paths of these curves. Fig. 14.3.3(b) shows the spectrogram of a set of real data recorded from the output of a microphone during the transit of a jet aircraft. It is thus necessary that the timefrequency image be pre-processed to enhance the appearance of the destructiveinterference curves before extracting the flight parameters from the image. The image shown in Fig. 14.3.3(b) is first normalized to remove background variations, and then wavelet de-noising is applied to the normalized image to suppress noise [6]. The result is an enhanced image [Fig. 14.3.3(c)] with the noise suppressed, yet the destructive-interference curves are not degraded in appearance. Define the parameter vector z = [?)r,~)t,Tc,')']T. A cost function that measures the difference between the observed and predicted paths of the destructiveinterference curves is given by N
Kn
P(z) - ~ E [gn(tnk) - fn(tnk,Z)] 2 n=l
(14.3.19)
k=l
where g,~ (t) is the observed value of the nth order destructive-interference frequency at time t, which can be obtained by selecting the correct (local) minimum in the enhanced image, and fn(t,z)is the corresponding predicted value using (14.3.8). Minimizing P(z) produces the NLS estimate ~. = [~)r, ~)t, "~c,~/]T of Z. The speed, altitude
604
Chapter 14:
Time-FrequencyMethods in Radar, Sonar & Acoustics
and CPA ground range of the source are then estimated as ~)=hr~)r, ht = v / v t , and d c - l h t v / ~ 2 - 1 1 . The cost function is minimized using the Gauss-Newton method and the required initial estimate of z is obtained by the procedure detailed in [6]. The raw time-frequency image [Fig. 14.3.3(b)] is too noisy for flight parameter estimation. Indeed, direct application of the NLS method to this raw image produces very poor flight parameter estimates, necessitating the use of the enhanced image [Fig. 14.3.3(c)]. Fig. 14.3.3(d) shows the paths (thicker lines) of the first and second destructive-interference curves extracted from the enhanced image, which are then used in the NLS method to estimate the flight parameters. Only the first and second curves are used for flight parameter estimation, i.e., n _< 2 in (19), because the higher order (n > 2) destructive-interference curves are too noisy. The estimated and actual values of the flight parameters are shown at the top of Fig. 14.3.3(d). The estimates of the speed and altitude are in good agreement with the actual values.
14.3.5
Summary and Conclusions
The acoustical Doppler effect enables the flight parameters and blade-passage frequencies of turboprop and rotary-wing aircraft to be estimated from the timefrequency signal analysis of acoustic sensor data. Also, during the transit of a broadband acoustic source, the acoustical Lloyd's mirror effect manifests itself as an interference pattern in the time-frequency distribution of the output of an acoustic sensor positioned above the ground. The flight parameters of jet aircraft can be estimated from the destructive interference fringes.
References [1] B. G. Ferguson and B. G. Quinn, "Application of the short-time Fourier transform and the Wigner-Ville distribution to the acoustic localization of aircraft," J. Acoustical Soc. of America, vol. 96, pp. 821-827, 1994. [2] B. G. Ferguson, "Time-frequency signal analysis of hydrophone data," IEEE J. of Oceanic Engineering, vol. 21, pp. 537-544, October 1996. [3] B. G. Ferguson and K. W. Lo, "Transiting aircraft parameter estimation using underwater acoustic sensor data," IEEE J. of Oceanic Engineering, vol. 24, pp. 424-435, October 1999. [4] K. W. Lo and B. G. Ferguson, "Passive estimation of aircraft motion parameters using destructive interference between direct and ground-reflected sound waves," in Proc. Information Decision ~ Control 99, pp. 171-176, IEEE, Adelaide, Australia, February 1999. [5] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 2" Algorithms and applications," Proc. IEEE, vol. 80, pp. 540-568, April 1992. [6] K.W. Lo, S. W. Perry, and B. G. Ferguson, "Aircraft flight parameter estimation using acoustical Lloyd's mirror effect," IEEE Trans. Aerospace ~ Electronic Systems, vol. 38, pp. 137-151, January 2002.
WVD Analysis ot: HF Radar Measurements of a Surrogate -IBM
14.4
14,4.1
605
WIGNER-VILLE ANALYSIS OF HIGH FREQUENCY RADAR M E A S U R E M E N T S OF A SURROGATE T H E A T E R BALLISTIC MISSILE ~ Experiment Description
In the autumn of 1997 four surrogate theater ballistic missiles (TBM) were launched from a temporary launch site in northern West Australia with the objective of testing a variety of TBM launch detection sensors. The missiles comprised two stages and used Terrier and improved Orion motors together with a passive payload designed so the flight vehicle had the same in-flight length as a representative TBM. The Terrier first stage burnt for approximately 5s following launch and the Orion second stage ignited at approximately 18s following launch and burnt for approximately 24s. One of the sensors was an Australian Defence Science and Technology Organisation high frequency (HF) line-of-sight radar. The radar was positioned several tens of kilometers from the launch site and operated at a carrier frequency of either (approximately) 8MHz or 25MHz. The data reported here is from measurements made at 25MHz and is from the fourth surrogate TBM launch. The radar operated bistatically with the radar transmitter and receiver spaced approximately 20km apart to avoid the transmitter signal overloading the receiver via a direct line-ofsight or ground wave propagation path. This meant that there were no occluded regions in the range coverage. The transmitter floodlit the region of interest. The receiver used a '%" shaped array of 32 elements, with 16 elements on each arm of the "L" and each arm approximately 500m long. A 32 channel receiving system with analog de-ramping and digital output fed data to a real-time radar signal processor. Multiple directive beams covering the region of interest were formed using digital beamforming and both target elevation and azimuth could be determined. A full description of the radar is given in [1].
14.4.2
Signal Description
The radar used a linear frequency modulated continuous wave ( L F M C W ) waveform with a sweep (or waveform) repetition frequency (WRF) of 50Hz. A set of coherent measurements were collected, each of 256 sweeps or 5.12s duration. For hardware limitation reasons there was a short inter-dwell gap of approximately 12 sweeps (12. ~ s ) where no data was recorded between each coherent measurement interval. The radar signal processor pulse compressed each sweep using stretch processing then formed 20 digital beams (10 for each arm of the "L" array). Doppler analysis of the 256 sweeps in each successive coherent processing interval was performed for each range cell in each beam direction. The selection of W R F (50Hz) meant that Doppler measurements of what was a ~ G. J. Frazer, Defence Science and Technology Organisation (DSTO), PO Box 1500, Salisbury, SA 5108, Australia. Reviewers: B. Ristic and M. Turley (DSTO).
606
Chapter 14:
Time-FrequencyMethods in Radar, Sonar & Acoustics
F i g . 14.4.1: Range-Doppler map showing the accelerating target smeared in Doppler (from -22Hz to 0Hz) in range cells 6 and 7. The direct wave from the transmitter and ground clutter is visible surrounding 0Hz Doppler and centered in range cell 2. The coasting spent first stage of the two stage T B M can be seen at range cell 6 with 10Hz Doppler.
very high velocity target were ambiguous for most of the flight. The long (with respect to target kinematics) coherent integration time (CIT) increased radar sensitivity although the target acceleration decreased the coherent processing gain achieved and limited the accuracy of velocity measurements. The radar used existing software for range, beamforming and Doppler processing which assumed constant velocity targets and rather than modify the software to support accelerating targets it was considered that some form of post event acceleration analysis could be used to mitigate the accelerating target mismatch loss. Received radar data was displayed in real-time during the experiment and also recorded on tape for subsequent analysis. The data for off-line analysis was range processed (sweep compressed) and beamformed but not Doppler processed. It was organized into a sequence of dwells of data where each dwell contained two sets of 10 formed beams (one set for each arm of the "L" array) with a group of 20 range cells of interest and the complex signal corresponding to each of the 256 sweeps in a CIT for each range and beam. Sequences of dwells were collected into a single file. A typical range-Doppler map seen by the operator is shown in Figure 14.4.1. It corresponds to one beam in the direction of the target and 20 range cells stacked vertically. For each range cell the Doppler spectrum has been determined from the appropriate range samples of each of the 256 pulse compressed sweeps, with one spectrum per range cell. Although not shown, the operator actually sees 20 such figures per dwell where each one corresponds to a different beam direction. This particular example covers the time interval from launch time plus 16s to launch time plus 21s (T+16s : T+21s). The range-Doppler map shows the acceler-
607
WVD Analysis of HF Radar Measurements of a Surrogate "IBM
ating target smeared in Doppler (from-22Hz to 0Hz) in range cells 6 and 7. The target velocity is such that this is an ambiguous Doppler measurement. The coasting spent first stage of the two stage TBM can be seen at range cell 6 with 10Hz Doppler. The direct wave from the transmitter as well as ground clutter is visible surrounding 0Hz Doppler and centered in range cell 2. An injected calibration signal can be seen in range cells 2 and 3 at -25Hz and +25Hz. Receding targets appear to incorrectly have positive Doppler however this is a frequency inversion artifact of the hardware design of the radar. It is clearly difficult to determine the true time-varying velocity of the target since it changes significantly during the radar CIT. We are interested in determining the instantaneous Doppler law and hence the time-varying velocity of the target throughout this and all other dwells which contain the target during the powered segment of flight.
14.4.3
Signal Model
Let Zk,r,b,a,d be the complex sample output from the radar signal processor prior to Doppler processing corresponding to the k th sweep of the r th range cell and bth beam for the a th arm of the "L" shaped receive array, for the d th dwell. In the data k e [0,...,255], r e [1,...,20], b e [1,...,10], a e [1,2] and d e [1,...,10]. Now let the complex signal corresponding to the coherent radar return from all sweeps for the r th range cell, bth beam and a th arm in the d th dwell be Zr, b,d,a(k). This is the signal used for conventional Doppler processing. An example of the results of conventional Doppler processing applied to a set of range cells is shown in Figure 14.4.1. Consider now a signal model for the radar return from a particular range and beam cell in a dwell which contained the target and where now for clarity we drop the notational dependence on r, b, d, a. Preliminary analysis using a spectrogram timefrequency distribution [2] suggested that the following discrete-time signal model would be at least piece-wise appropriate z(t) = AeJ2~[f~162 t~l + c(t) + n ( t ) 1
(14.4.1)
K
for {t 90, W R F ' ' ' ' , W R F } and where A is the complex amplitude, fo, fl are the linear FM parameters, c(t) represents clutter, and n ( t ) represents noise. T C I T = K+I W R F is the coherent integration time (CIT) of the radar In the case of a bistatic HF line-of-sight radar, c(t) includes contributions such as; the direct signal from the transmitter, range sidelobes from the direct signal, additional targets, say from a booster stage in a multi-stage rocket and meteor and ionospheric scatter. In general, both the clutter and noise are unknown, although we assume the relative energy is such that
IAI2Tci
Ic(t)l 2 +
In(t)L 2
>> 1
(14.4.2)
and thereby consider z(t) as deterministic with unknown parameters, unknown
608
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
clutter and background noise, and with high signal to clutter plus noise energy ratio. The objective is to determine the instantaneous Doppler law as part of the task of establishing the accelerating target dynamics. At a given radar carrier frequency the instantaneous Doppler as parameterized by f0 and fl can be converted to instantaneous target velocity. In some cases the instantaneous Doppler law of the target is not linear, or is piece-wise linear only, nevertheless, we adopt the model in (14.4.1) and if necessary apply our analysis over appropriate subsets of {t " 0 < t < T c I T } .
14.4.4
Instantaneous Doppler Estimation
There are many methods which may have been applied to determine the instantaneous Doppler law (or f0 and/9) of the TBM target from the measured signal z(t). Some are based on various time-frequency distributions [3-5]. Others are based on some estimation procedure for the parameters f0 and fl or the Doppler law, e.g. a maximum-likelihood criteria estimator for f0 and fl, or a least-squares criteria polynomial-phase-law estimator for the instantaneous Doppler law. We have chosen to use a procedure based on the Wigner-Ville distribution (WVD) of the signal z(t). We have done this for several reasons. Firstly, the clutter term c(t) in (14.4.1) is unknown and at least early in the flight will include components due to the Terrier booster and range sidelobes from the direct signal from the transmitter. A simple phase difference based estimator we tried failed in these cases and more disturbingly provided no diagnostic as to the source of the error. On the other hand the WVD provided a clear visualization of the timefrequency law of the signal and since c(t) was not too complicated the WVD display did not become overwhelmed by cross-terms in the WVD. Secondly, the data is in batch form (a sequence of dwells) with comparatively few samples in each dwell (256) which simplifies selection of the analysis window size for the WVD. There are small temporal gaps between dwells so that each dwell must be examined in isolation. Finally, our analysis is off-line with no requirement to be fully automatic. It was quite permissible to manually inspect and interpret the WVD of any signal of interest. Full details on the WVD are given in [2] and the references therein, and in Chapters 2 and 3.
14.4.5
Results
Standard analysis of the measured data from launch four consisted of extracting sequences of peaks corresponding to the target from beam-range-Doppler maps for the sequence of dwells covering the time interval of power flight. Each dwell com1 5 12s CIT. A prised (256 + 12). 1 5.36s elapsed time and a 256. WRF single beam example of one of these maps from the dwell collected during second stage ignition is shown in Figure 14.4.1, the detail of which was discussed previously. The extracted Doppler measurements for the full period of powered flight, one per dwell, are shown in Figure 14.4.2. In this figure we have manually corrected for the -
W
R
F
-
- -
-
~
-
"
WVD Analysis of HF Radar Measurements of a Surrogate TBM
609
Fig. 14.4.2: The result of target Doppler law extraction based on conventional processing. One target Doppler measurement is determined for each dwell (5.12s).
discrete ambiguously measured Doppler caused by the selection of such a low W R F with respect to the target velocity. Although not presented, similar results were obtained for target azimuth, elevation, range, and signal and noise energy. Although" of no consequence here, determining azimuth, elevation and range required compensation for beam coning for the beams formed on each arm of the receive array and for range-Doppler coupling induced by our selection of waveform. It is clear from Figure 14.4.2 that the temporal resolution of our Doppler measurements is poor. In fact, we have only 9 measurements for the period of powered flight, and any attempt to predict the full target trajectory following motor burnout, including impact location, will certainly be less accurate than desired. Many of the peaks are very approximate since the target accelerated during the CIT and therefore had no single velocity or Doppler. Based on the approximate signal model in (14.4.1) and known localization properties of the WVD we have applied a procedure based on the WVD to improve upon the results derived from our standard analysis. Our objective was: to determine the target instantaneous Doppler law, to assist with distinguishing between accelerating and transient targets or scatterers, and to determine the instantaneous received target energy law and the processing loss caused by the target acceleration.
14.4.5.1
Instantaneous Doppler Law
The instantaneous Doppler law has been estimated from the peak of the WVD shown in Figure 14.4.3. Dwells prior to and later than this measurement interval have also been analyzed using the same procedure. The sequence of Doppler estimates, approximately one per sweep, is shown in Figure 14.4.4. There is an estimate every 20ms excluding the inter-dwell gap intervals and some missing estimates at the beginning and end of each CIT. It covers the period of powered flight and is an interval of approximately i0 CITs or some 50s. Contrasting Figures 14.4.2 and 14.4.4
610
Chapter 14:
Time-FrequencyMethods in Radar, Sonar &. Acoustics
F i g . 14.4.3: WVD of the signal containing the accelerating target from range cell 7 in Figure 14.4.1. Second stage ignition occurred at 18s.
F i g . 14.4.4: Doppler v. elapsed time. Computed using the WVD. The gaps are due to missing sweeps during the radar inter-dwell gap and some lost instantaneous Doppler detections at either end of individual radar CIT. The plot is a sequence of point measurements, one per radar sweep, and not a continuous line, as it appears at this scale.
shows the benefit of determining accurate instantaneous Doppler law estimates as compared with conventional Doppler processing. The WVD based procedure has a temporal resolution of approximately the sweep duration, as compared with the CIT for conventional Doppler processing (20ms v. 5.12s). Note that the target second stage motor ignited at approximately 18s and that an accurate time of ignition can be determined from the instantaneous Doppler law. Cross-terms are visible in the WVD shown in Figure 14.4.3 and are generated between the segment of the signal prior to second stage motor ignition and the segment which follows second stage ignition. The presence of the cross-terms did not impede the estimation of the instantaneous Doppler law. Further work not reported here showed that smoothing of the instantaneous
WVD Analysis of HF Radar Measurements of a Surrogate "IBM
011
Fig. 14.4.5: Range-Doppler map showing the transient meteor scatterer in range cell 19. It is difficult to distinguish between this smeared feature and the smeared accelerating target shown in Figure 14.4.1.
Doppler law estimates using polynomial models reduces estimate variance and allows Doppler law prediction into the inter-dwell intervals. The corresponding polynomial velocity law was analytically integrated to produce range law estimates which were consistent with the directly measured range measurements.
14.4.5.2
Accelerating Target v. Transient
Figures 14.4.1 and 14.4.5 show the range-Doppler (RD) maps generated for two separate beam steer directions measured during the same 5.12s dwell. In Figure 14.4.1 the accelerating target is visible as a large smear in Doppler at range cell 7. In Figure 14.4.5 a transient meteor scatterer is visible at range cell 19, which is also smeared in the Doppler domain. We seek improved discrimination between the accelerating and the transient scatterers. Figures 14.4.3 and 14.4.6 show the WVD computed from the signal corresponding to the mentioned range cells. The instantaneous Doppler law of the accelerating target is visible and so is the transient behavior of the meteor scatterer. The WVD provides a display whose features readily allows the two types of scatterers to be distinguished.
14.4.5.3
Instantaneous Energy Law
Knowledge of the instantaneous Doppler law can also be used to construct a demodulation reference signal, s(t). This signal has unity amplitude and instantaneous frequency law which is the conjugate of the estimated instantaneous Doppler law of the target, i.e. the instantaneous frequency law of z(t). s(t) can be used to
Chapter 14:
612
Time-Frequency Methods in Radar, Sonar 2r Acoustics
Fig. 14.4.6: WVD of the signal containing the transient meteor scatterer from range cell 19 in Figure 14.4.5.
demodulate
z(t)
giving the approximately constant Doppler frequency signal z'(t)
=
z'(t).
(14.4.3)
The instantaneous energy of the demodulated signal is ~(t) = Clz'(t)l 2
(14.4.4)
where G is some zero-phase smoothing operator. The instantaneous energy is shown in Figure 14.4.7, which shows three different levels of local smoothing, (i.e. different G). Regardless of the smoothing operator selected there is a significant reduction in the target return energy immediately following second stage ignition between 18s and 18.5s. The cause of this effect is unresolved.
14.4.5.4
Processing Loss due to Target Acceleration
The processing loss due to target acceleration compared with a comparable target of constant velocity can be determined. One contrasts standard Doppler processing applied to the signal z(t) and to the demodulated version z'(t). It can be seen from Figure 14.4.8 that the processing loss is approximately 10dB for this particular CIT. This is the worst case since the dwell considered includes the period immediately before and immediately after second stage ignition and target acceleration was greatest in this measurement interval.
14.4.6
Summary and Conclusions
The Wigner-Ville distribution has been applied to HF line-of-sight radar measurements of a surrogate TBM launch. A procedure based on the WVD has been used
WVD Analysis of HF Radar Measurements of a Surrogate T B M
613
Fig. 14.4.7: Instantaneous energy law v. elapsed time for the interval 16.5s to 21.5s after launch. The three curves correspond to i (..), i i (- -) and ig (-) sample zero phase moving average smoothing.
Fig. 14.4.8: The Doppler spectrum computed over the full CIT for the original signal ( - . ) and for the demodulated signal (-). The processing loss caused by assuming a constant velocity target is approximately 10dB in this case.
to estimate the Doppler law of the target at a temporal resolution of approximately 20ms compared with standard processing which had a temporal resolution of 5.12s and which contained error due to the target accelerating during the radar CIT. Examination of the display of the WVD has assisted with distinguishing between the accelerating target and interfering transient meteor scatterers. It has also al-
514
Chapter 14:
Time-FrequencyMethods in Radar, 5onar & Acoustics
lowed determination of the instantaneous energy law of the target, identifying a still unexplained reduction in received signal level immediately following stage two ignition. Finally, analysis based on the procedure using the WVD has provided an estimate of the processing loss when standard Doppler processing is applied for this particular accelerating target.
References [1] G. J. Frazer, "Project DUNDEE results: High frequency line-of-sight radar measurements and analysis," Tech. Rep. DSTO-TR-0734, Defence Science and Technology Organisation, 1998. AR 010-659. [2] L. Cohen, "Time-frequency distributions--A review," Proc. IEEE, vol. 77, pp. 941-981, July 1989. Invited paper. [3] B. Boashash, ed., Time-Frequency Signal Analysis: Methods and Applicationz. Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [4] B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals," Proc. IEEE, vol. 80, pp. 520-538, April 1992. [5] I. Djurovi(~ and L. Stankovi5, "Influence of high noise on the instantaneous frequency estimation using quadratic time-frequency distributions," IEEE Signal Processing Letters, vol. 7, pp. 317-319, November 2000. [6] B. Boashash, G. Jones, and P. J. O'Shea, "Instantaneous frequency of signals: Concepts, estimation techniques and applications," in Proc. SPIE: Advanced Algorithms and Architectures for Signal Processing IV, vol. 1152, pp. 382-400, Soc. of Photo-optical Instrumentation Engineers, San Diego, 8-10 August 1989.
615
Time-Frequency Methods in Sonar
14.5 14.5.1
TIME-FREQUENCY
METHODS
IN S O N A R 0
Principles of Sonar
S o n a r (for "Sound Navigation and Ranging", analogous to "Radar" for "Radio Detection and Ranging") was originally used for finding the distances and velocities of targets. It has evolved to systems that scan and are capable of localizing, identifying and classifying targets. Many other applications use sound under water, e.g. tracking and locating shoals of fish, identifying and tracking whales, sea-bottom surveying for mining, estimating depth or turbulence or temperature variations of the ocean, tracking aircraft flying above the ocean, vision systems for underwater robotic vehicles, etc. These tasks require the processing of a received signal to extract the relevant information. Some of these systems are active, that is, they transmit a sound signal and receive the reflected or backscattered signal. Others are passive and only receive sound emanating from other sources. The transmitters and receivers of sound in water are called hydrophones. Often arrays of hydrophones are employed to provide narrow beamwidths and increased sensitivity. In a c t i v e s o n a r , the transmitted signal is of known shape. The problem is to determine the delay in receiving the reflected or scattered signal. This delay is proportional to the target distance (i.e. range) and inversely proportional to the velocity of sound in water, which is about 1500m/s. The wavelength of a 15kHz component is therefore about 10 cm. In order to obtain better resolution, ultrasound at frequencies above 20 kHz is frequently employed in active sonar. The signal travels through water on its way to the target and back and is modified by the channel response. The received signal could be a superposition of scattered signals from several points on the target. It can also be a superposition of reflections from other objects in the environment (referred to as multipath). In addition, there may be noise. An active sonar signal can be modeled using linear systems theory [1]. In the general case, both the object and the channel are linear and time-varying. The channel impulse response c(~, t) and the insonified object's impulse response o(~', t) are functions of space and time. The received signal S~e~(r;t) is related to the transmitted signal Str(t) through superposition integrals, as follows: 8rec(~'; t)
C(~; t, T)V(~'; t, T) d7 + n(t)
:
(14.5.1)
oo
where ~'= [x, y, z] is the spatial position vector, t is the time, n(t) is the noise, and v(K; t)
-
/?
u(~'; t, T)O(~'; t, T)dT
(14.5.2)
C(r'; t, T) 8tr(T) dT.
(14.5.3)
oo
where U(~'; t) --
F
(x)
~ V. Chandran, Queensland University of Technology, Brisbane, Australia (v.chandran@qut.edu.au). Reviewers: P. O'Shea, K. Abed-Meraim, E. Palmer and B. Boashash.
616
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
The spatial and temporal coherence of the ocean medium determine the largest aperture that can be used for image formation and the longest waveform duration that can be used in sonar systems. It is usually assumed that the coherence time of the channel is much greater than the time taken by the signal to travel to the target and back. Then the phase shift remains nearly constant over one pulse period. It is also assumed that the signal bandwidth is much smaller than the coherence bandwidth of the channel, which makes the channel wideband and frequency nonselective. If the channel impulse response does not change over the duration of the signal, the system can be considered to be linear and time-invariant, and the superposition integrals are replaced by convolution integrals. Srec(t) -- [(Str(t) * C(t)) * o(t)] * C(t) zc rt(t)
(14.5.4)
where 9 represents convolution in the time domain. The channel impulse response consists of delayed impulses corresponding to various paths of propagation, such as echoes from the target and the sea-bottom. A typical channel impulse response that takes into account multiple reflections (but ignores point scatterers) is the discrete summation of impulses M c(t) - E ci 5(t - Ti) (14.5.5) i--1 where the ci represent the attenuation suffered by each reflection term. Usually, in active sonar, the return from the target of interest is the strongest. The ideal condition is when there is only the one reflection, i.e. when M = 1. In practice, there are not only multiple reflections from other objects in the environment (such as fish or debris) but also backscattering from the ocean floor. If the lags for which the returned signal is analyzed are restricted to those corresponding to reasonable expected distances of the target, bottom backscatter and reflections from cluttering objects outside this "ballpark" can be ignored. The problem then essentially becomes one of detecting and locating the principal echo in additive noise that may be nonstationary owing to clutter. Time-frequency analysis plays an important role in analyzing the received signal and estimating the range of the echo from the target because 9 the transmitted signal is often frequency-modulated; 9 there is a Doppler shift if the target is moving; 9 time-delay is the parameter of most interest; 9 noise level is high and signals are highly non-stationary, so that classical methods such as correlation or matched filters do not perform well, and 9 time-frequency displays are physically meaningful for human observers to interpret. In p a s s i v e s o n a r (or acoustic) systems, there is no transmitted pulse; the sound emanates from the target itself. However, there may still be multiple paths by
617
Time-Frequency Methods in Sonar
Fig. 14.5.1: Spectrogram of ship noise. Parabolic patterns are visible towards the right of the plot. The time (t) varies from 0 to 12 minutes and the frequency (f) from 0 to 280 Hz. (From ref. [21.)
which the sound travels to any hydrophone. In fact, the interference between the direct path and the bottom-returned path can be used to advantage in determining the distance and velocity of the source, and is particularly useful in detecting and tracking ships off the coast; it leaves parabolic patterns in the time-frequency plane as shown in [2]. If the channel is modeled as c(t) = 5(t) + c 5(t -
(14.5.6)
,
then the magnitude-squared frequency response is IC(w)l 2 - 1 + c 2 + 2Cl cos(wT). If h is the bottom depth, h - k the hydrophone depth, and d the distance from the hydrophone to the ship along the surface, then it can be inferred from the geometry [2] that the direct signal travels a distance (d 2 + ( h - k ) 2)1/2 and the bottom-reflected signal travels a distance (d 2 + (h + k)2) -1/2. 1
--1/2
The time delay
2) 1/2
between them is ~[(d 2 + (h + k) 2) - (d 2 + ( h - k ) ], where c is the velocity of sound in water. Because d is often far greater than h - k or h + k, the square roots can be approximated by first-order Taylor expansions to show that ~- ~ 2hk dc " Since the ship is in motion, 7- will vary with time. If a is the minimum range of the ship and v its velocity of passage, and if the time origin is at the moment of closest approach, then d 2 = a 2 -t-(vt) 2. Again, when vt is much larger than a, this becomes a2 d ~ vt + ~ , and the first interference maximum occurs at the angular frequency Wmax :
27r 7
'~
7r[vt + a 2 / ( 2 v t ) ] hk
"
(14.5.7)
Therefore, a time-frequency display will show parabolic patterns; see Fig. 14.5.1. Even if there is only flow noise from the vessel, these patterns can show up in long-time history averaged time-frequency representations. From these patterns, the speed of the vessel and its range can be estimated. The passive sonar problem could be looked upon as one of blind deconvolution where the emitted signal parameters and the channel parameters need to be
618
Chapter 14:
Time-FrequencyMethods in Radar, Sonar & Acoustics
simultaneously estimated (see also Article 8.1). Time-frequency methods are able to overcome the ill-conditioning typical of single sensor deterministic deconvolution techniques. Passive sonar can also be viewed as a transient signal detection problem in nonstationary noise when the signal-to-noise ratio is poor.
14.5.2 14.5.2.1
Classical Methods used in Sonar Matched Filter
The optimal detector for a signal of known shape, s(t), in additive Gaussian noise is the matched filter. If the received signal is Srec(t) and there is no Doppler shift, the matched filter can be formulated as a correlation, i.e. r/(T) =
F
Srec(t) s*(t -- 7)dr.
(14.5.8)
(x)
The greatest correlation between the delayed transmitted signal and the received signal will occur at the value of T which corresponds to the delay of the principal echo. Hence the ideal matched filter output exhibits a peak at this delay. The matched filter output is thresholded to make a detection decision. With increasing noise, dispersion in the medium and uncertainty in the transmitted signal shape, it becomes more and more difficult to obtain a discernible peak in the output at the correct location. When there is relative velocity between the source and receiver and a corresponding Doppler frequency r the matched filter can be used for range-Doppler processing as r](r 7) --
F
Srec(t) S* (t -- T) e -j2~r dt
(14.5.9)
c~
where the delay T corresponds to the range parameter. The above version of the matched filter is also known as the Woodward ambiguity function.
14.5.2.2
Hypothesis Tests
Hypotheses tests, based on energy in sliding window signal segments, spectral density correlations and other statistics, are employed to decide whether a segment around a particular location resulted from signal (echo of the transmitted pulse in active sonar or signature of some acoustic source of interest in passive sonar) plus noise, or only noise. The alternative hypotheses for the detection of an unknown signal s(t) in additive noise n(t) are H0: Srec(t) = n(t) H i : srec(t) = s(t) + n(t).
(14.5.10) (14.5.11)
A test (or decision) statistic rl is computed from the received signal 8rec(t) or one of its transformed representations. If the value of the statistic exceeds a certain
619
Time-Frequency Methods in Sonar
threshold, the hypothesis is accepted. A measure of goodness of the test statistic (r]) is the Signal-to-Noise (SNR) ratio [3] SNR-
[E(v/H1) - E(~/Ho)[
(14.5.12)
V/ 89[var(Tl/H1) + var(r//H0)] where E(...) denotes the expected value and var(...) the variance. This SNR can be used to compare test statistics. If the energy of the signal s(t) is A, and if the noise is additive, Gaussian and white with power spectral density No, then the SNR of the matched filter output as a decision statistic is given by v/A/No. For classification of the signal into one of several known categories rather than detection of one category, the transient event would need to be segmented and a set of features extracted from the segment. Instead of thresholding a likelihood, a comparison of likelihoods can be used (e.g. the maximum-likelihood Bayes classifier). Time-frequency methods are also used in hypothesis testing frameworks using a test statistic, and in classification frameworks using a set of features extracted from the time-frequency representation of the signal. The two-dimensional nature of the representation offers some potential for the selection of discriminating features that are also robust to noise.
14.5.3 14.5.3.1
Time-Frequency Approach to Sonar Quadratic and Adaptive TFDs in Sonar
For sonar signal detection with the Wigner-Ville distribution (WVD), energy in the time-frequency plane is used as a test statistic. The WVD (see Article 2.1) is expressed as
W(t, f) -
Zrec (t -- T/2) Zr*ec(t -t- 7"/2)e-j27rfrdT
(14.5.13)
oo
where Z~ec(t) is the analytic associate of Srec(t). The noise performance is given by [3] !
SNR
= V/A-~-
1
No v/l + No/A"
(14.5.14)
Thus the time-frequency representation is known to be sub-optimal and its performance degrades in high noise, due to its cross-terms. However, it provides the potential for time-varying filtering, and the effect of noise on the test statistic can be reduced by windowing the distribution. It is thus possible to obtain adaptive filtering prior to statistic computation or feature selection. The cross WVD can be applied when the reference signal waveform is known (see Subsection 3.1.2.3):
W(t, f) -
Zrec(t - 7/2) z* (t + T/2) e-J2~l'dT oo
(14.5.15)
620
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
where z(t) is the analytic associate of s(t). It is equivalent to the matched filter approach in this case, but with the added advantage of a two-dimensional timefrequency plane for adaptation and filtering. If the transient signal is monocomponent and of known instantaneous frequency (IF), a two-dimensional window in the time-frequency plane that preserves points in a neighborhood of the IF, while filtering out all others, can be used to provide a test statistic with improved SNR. If the transient signal is multicomponent, one can compute and add the cross WVDs of the received signal with the separate components of the transient signal, thereby eliminating the effect of cross-terms while retaining auto-terms in the time-frequency plane. The IF laws in this case would need to have been identified and are assumed known [3]. Other quadratic TFDs such as the c o n e - s h a p e d kernel time-frequency representation (CK-TFR) or ZAM distribution, given in Article 3.3, significantly attenuate the cross-terms and hence may be better suited for time-frequency displays of active sonar returns [4]. The a d a p t i v e o p t i m a l k e r n e l (AOK), described in Article 5.3, uses a radially Gaussian signal-dependent kernel. The shape of the kernel is changed to optimally smooth the TFD as discussed in Chapter 3 and Article 5.3. The optimal kernel is one that maximizes a quality measure for the time-frequency representation subject to constraints that force the kernel to suppress cross-terms and to satisfy marginal distributions. Performance comparisons of the AOK-based time-frequency representation with the STFT, Wigner and ZAM kernels on real multicomponent active-sonar signals are presented in [5]. Fig. 14.5.2 shows various time-frequency representations of a man-made underwater acoustic signal comprising three chirps in noise. The signal itself in plotted in the top box. The other boxes, labeled STFT, WIGNER (for WVD), ZAM and AOK, contain four time-frequency representation plots where the horizontal axis is time and the vertical axis is frequency. The chirps show up in the AOK plot as three line-like features decreasing in frequency with time. They overlap in time but not significantly in the time-frequency plane of the AOK TFR. Hence the chirps can be detected and delays estimated from this representation. Notice that the Wigner and ZAM distributions suffer more severely from cross-term artifacts while the resolution of the STFT is too poor to bring out the relevant features.
14.5.3.2
Gabor Expansion and Wavelets in Sonar
A C r o s s - t e r m D e l e t e d W i g n e r r e p r e s e n t a t i o n ( C D W R ) , based on Gabor expansions (see Article 11.2), is applied in [6] to sonar detection and classification, on data collected by the US Navy. The test data include hyperbolic FM chirp signals and continuous-wave signals for two events. The decision statistic is a crosscorrelation between the cross-CDWR of the received and replica signals and the auto-CDWR of the replica signal. This method achieves better detection accuracy than the square-law detector used by the US Navy. It is tailored to helicopter-
621
Time-Frequency Methods in Sonar
STFT
~ ~, ~ ~. , . ' ~ f
~f~~-.'~)
~ _9- ' ~ ~
f~',I
.~
9 -,).",'-~.~.
t
.... 9
,,,
9
t
AOK
ZAM
9
t
WIGNER
~.
.
~
t
Fig. 14.5.2: Time trace (top) and various TFRs (labeled) of a sonar signal comprising three chirps. The AOK representation resolves features (nearly linear with negative slopes, showing frequencies decreasing with time) corresponding to the three chirps. The frequency sweep would typically be about 10 kHz to 20 kHz and the transients would last tens of milliseconds. (Adapted from [5].)
deployed arrays of hydrophones, which are often close to the target but have wide beamwidths and consequently poor signal-to-noise ratios. The wavelet t r a n s f o r m and its squared magnitude (the scalogram) are compared with the Wigner-Ville distribution for automatic classification of active sonar data in [7]. Classification accuracy of up to 92 percent is achieved at - 4 dB SNR on synthetic data of acoustic scattering from high-aspect-ratio solids (length-to-width ratios of 5 and 10). The wavelet transform often outperforms the Wigner-Ville
522
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
distribution partly because it does not suffer from cross-terms. The scalogram, however, does suffer from such effects because it is a second-order representation. Wavelet-based features and artificial neural networks based on the Bienenstock, Cooper and Munro (BCM) theory are used to classify underwater mammal sounds in [8]. Accuracy is shown to be significantly greater than for the use of the same number of Fourier energy and frequency features. The improvement is largely owing to the wavelets' non-uniform tiling of the time-frequency plane, and owing to BCM feature extraction that performs unsupervised nonlinear dimension reduction and supervised discriminant pursuit.
14.5.4
Prony and Higher-Order Spectral Methods in Sonar
The methods discussed below are "indirect" because although they provide timefrequency representations (primarily by means of sliding window analysis) they are adapted from techniques first proposed for spectral analysis, higher-order spectral analysis or other signal processing techniques. They are not members of the quadratic class of time-frequency representations either, but are sufficiently important to warrant description in this article.
Prony model method: The modified Prony method of detection of short-duration transient signals in noise is based on complex exponential signal modeling. It employs forward and backward predictions using high prediction orders, and eliminates noise zeroes from the forward and backward characteristic polynomials by using the fact that zeroes of white, stationary noise do not exhibit conjugate reciprocal relationships in the z-plane as do true signal zeroes. Singular value decomposition is used to provide further noise enhancement. After these steps, an enhanced shortduration signal spectrum for the analysis window is computed. A sliding window ensures that a time-frequency representation is obtained from which features can be extracted for detection and/or classification. The performance of this technique is compared with the STFT and the Wigner distribution on underwater acoustical data (tuna and sperm whale clicks) in [9]. The data was sampled at 23 kHz and the transient events lasted 10 to 30 milliseconds. Significantly improved features were extracted by the Prony technique. Higher-Order Spectral methods: Gaussian noise immunity, translation invariance and other useful properties of higher order spectra are also used in obtaining robust time-frequency representations and in the feature-extraction stage after a representation. Higher-order spectra are Fourier representations of cumulants or moments of a stationary random process. They are functions of more than one frequency. The bispectrum is a function of two frequencies and is the Fourier transform of the third-order cumulant, which is a function of two lag variables. Unlike the power spectrum (which is the second order spectrum), higher-order spectra retain phase information. They are zero for Gaussian processes and can be used to detect non-Gaussian processes and identify nonlinear systems. For transient detection, as required in sonar, the stationarity assumption needs to be relaxed. Further, the
Time-Frequency Methods in Sonar
623
noise is often non-stationary as well and can contribute to non-zero higher-order spectral contributions. Time-varying versions of higher-order spectra include the third-order Wigner distribution called the Wigner Bispectrum [10], and a class of generalized Wigner-Ville distributions based on polynomial phase laws [11], which is applied to underwater acoustic signals. A sonar image processing technique based on higher-order spectral feature extraction, presented in [12], is also indirectly related to time-frequency representation. This method applies to sonar scans that result in two-dimensional images with azimuth and range as independent variables. Sonar0, Sonar1 and Sonar3 sea-mine databases from the US Naval Surface Warfare Center, Coastal Systems Station, Florida (reference in [12]) are used. The three databases together contain 122 seamines in the training set and 123 in the testing set. The processing stages comprise adaptive Wiener filtering, sliding window matched filtering and adaptive thresholding. These steps account for non-stationary background noise. In this method, the thresholding after matched filtering is used for removing outlying noise rather than for detection. The resulting image is then used to extract features--the principal ones being bispectral and trispectral features (references in [12]) that are designed to be invariant to scaling and translation. Features that are robust to scaling make the task of the classifier easier. A statistical multi-stage classifier is used. The method achieves close to 90% accuracy with about 10% false alarms for detection of sea-mines. Since higher-order spectra are evaluated over overlapping windows, this method is an indirect "space-frequency" representation approach applied to 2D input. It is capable of being trained to achieve robustness to intra-class variations and some channel variations.
14.5.5
Dispersion and Angle Frequency Representation
Different propagating modes through the ocean (viewed as a waveguide) can exhibit different dispersion (group velocity versus frequency) curves. Different frequencies travel with different velocities within the same mode and between different modes of propagation. As a result, it is even possible for a linear FM up-ramp pulse to be converted into a down-ramp pulse of a different frequency law and increased duration, in extreme cases. In [13] a study of this effect is presented along with a method for deconvolution of the channel response. Such deconvolution is sometimes necessary before feature extraction to reduce classification or parameter estimation errors. Dispersion can also be beneficial in classifying ocean bottoms. Bottom baekscattering is a function of the type of surface (such as sand, pebble, clay, rock) and of the frequency of sound and incident angle. Wideband sonar (typically 20 kHz to 140 kHz [14]) is used to characterize the frequency dependence of bottom backscatter and estimate the backscattering coefficient or the impulse response of the bottom. Conventional methods are not adequate for this. The directivity of the transducer and propagation attenuation cannot be ignored and must be deconvolved from the echo response. An angle frequency representation is then obtained which exhibits
624
Chapter 14:
Time-Frequency Methods in Radar, Sonar & Acoustics
features similar to time-frequency representations that can be used for classification. When the scale of the roughness is much smaller than the depth, it affects only the phase and not appreciably the magnitude of the response at any frequency. However, there is an incident angle dependence which can be used for classification. A sand bottom shows a decrease in the backscattering coefficient as the angle increases, whereas, for surfaces with greater roughness such as the pebble bottom, the backscattering coefficient is nearly independent of the angle.
14.5.6
Summary and Conclusions
Time-frequency analysis may be applied with considerable success to active and passive underwater sonar data. Although multipath interference is usually a problem in radar and sonar applications, it is interesting to note that one of the first applications of time-frequency analysis to underwater sonar used it beneficially for detecting and tracking ships. Reduced-interference quadratic time-frequency representations allow time-frequency characteristics of transient signals to be displayed and interpreted without difficulty even at low signal-to-noise ratios and even for multicomponent signals. Filtering in the time-frequency plane allows for robust feature selection and decision statistics of better quality. Adaptations of other techniques such as Prony's method of spectral analysis, wavelet transforms, scale-space analysis and higher-order spectral analysis also allow time-frequency type representation and feature extraction for underwater acoustic signal processing. Automated classification of underwater sources is still challenging. Multi-class problems with more than a few classes are yet to be solved satisfactorily, as the signatures often show significant intra-class variation due to changing channel conditions--multipath, dispersion, etc.--while the classification problem is coupled with the estimation of range, velocity and possibly other parameters. Two-dimensional and three-dimensional scans make the classification task easier through the exploitation of spatial correlations of the returned signal or the match filtered output. However, they introduce additional difficulties with realtime processing. The concept of first expanding a signal in dimensionality to an information-rich space, and then selectively reducing the dimensionality through projections, slices or segmentation, is a powerful one and is being exploited in many pattern recognition problems. Time-frequency representations are one example of such an approach. They can reveal the nature of the evolution of correlations in a transient signal and provide information on the appropriate subspaces to extract features from. Practical exploitation of this in an automated detection, classification and tracking system requires its interfacing with other powerful concepts from other areas such as feature extraction, feature selection, adaptive information processing, classifier selection and classifier fusion.
References [1] J. S. Jaffe, G. Chandran, and E. Reuss, "High frequency acoustic imaging in the ocean," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95),
Time-Frequency Methods in Sonar
625
vol. 5, pp. 2793-2796, Detroit, 9-12 May 1995. [2] J. G. Lourens, "Passive sonar detection of ships with spectro-grams," in Proc. 3rd South African IEEE Conference on Theoretical and Practical Work in Communications and Signal Processing, pp. 147-151, Johannesburg, June 1990. [3] B. Boashash and P. J. O'Shea, "A methodology for detection and classification of some underwater acoustic signals using time-frequency analysis techniques," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 38, pp. 1829-1841, November 1990. [4] W. L. J. Fox, J. C. Luby, J. W. Pitton, P. J. Loughlin, and L. E. Atlas, "Sonar and radar range-Doppler processing using a cone-shaped kernel time-frequency representation," in Proc. 2~th Asilomar Conf. on Signals, Systems, and Computers, vol. 2, pp. 1079-1083, Pacific Grove, CA, 5-7 November 1990. [5] R. Baraniuk, D. Jones, T. Brotherton, and S. L. Marple Jr., "Applications of adaptive time-frequency representations to underwater acoustic signal processing," in Proc. 25th Asilomar Conf. on Signals, Systems, and Computers, vol. 2, pp. 1109-1113, Pacific Grove, CA, 4-6 November 1991. [6] S. Kadambe and T. Adali, "Application of cross-term deleted Wigner representation (CDWR) for sonar target detection/classification," in Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, vol. 1, pp. 822-826, Pacific Grove, CA, 1-4 November 1998. [7] F. Lari and A. Zakhor, "Automatic classification of active sonar data using timefrequency transforms," in Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~r TimeScale Analysis, pp. 21-24, Victoria, BC, 4-6 October 1992. [8] Q. Q. Huynh, L. N. Cooper, N. Intrator, and H. Shouval, "Classification of underwater mammals using feature extraction based on time-frequency analysis and BCM theory," IEEE Trans. Signal Processing, vol. 46, pp. 1202-1207, May 1998. [9] S. L. Marple Jr. and T. Brotherton, "Detection and classification of short duration underwater acoustic signals by Prony's method," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'91), vol. 2, pp. 1309-1312, Toronto, 14-17 May 1991. [10] N. L. Gerr, "Introducing a third-order Wigner distribution," Proc. IEEE, vol. 76, pp. 290-292, March 1988. [11] B. Boashash and G. Frazer, "Time-varying higher-order spectra, generalised WignerVille distribution and the analysis of underwater acoustic data," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'92), vol. 5, pp. 193-196, San Francisco, 23-26 March 1992. [12] V. Chandran, S. Elgar, and A. Nguyen, "Detection of mines in acoustic images using higher order spectral features," IEEE J. of Oceanic Engineering, vol. 27, pp. 610-618, July 2002. [13] Z.-H. Michalopoulou, "Underwater transient signal processing: Marine mammal identification, localization, and source signal deconvolution," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), vol. 1, pp. 503-506, Munich, 21-24 April 1997. [14] N. Ma, D. Vray, P. Delachartre, and G. Gimenez, "Sea-bottom backscattering modeling with a wideband constant beamwidth sonar at normal incidence," in Proc. IEEE Ultrasonics Symposium, vol. 2, pp. 1077-1080, Seattle, 7-10 November 1995.
This Page Intentionally Left Blank
Chapter 15
Time-Frequency Diagnosis and Monitoring Time-frequency applications are now so widespread that they cannot be comprehensively covered in one volume. For this reason, this chapter attempts to further illustrate the time-frequency approach by selecting a few key generic applications of diagnosis and monitoring. The topic is represented by six articles covering a wide range of diverse applications. Electrical power quality is often severely affected by transient disturbances. It is necessary to detect and assess their effect on voltage and current stability. This is achieved by a time-localized frequency analysis where the instantaneous frequency (IF) allows us to assess disturbance propagation (Article 15.1). In the automotive industry, the treatment and prevention of knock is a major problem for internal combustion engines as knock may lead to engine damage. The Wigner-Ville distribution is used to optimize the position for placement of knock sensors (15.2). Some applications involve signals that have dispersive group delays governed by a power law, such as dispersive propagation of a shock wave in a steel beam, and cetacean mammal whistles. A power class of TFDs suitable for such applications is formulated and a methodology described (15.3). In other applications of image processing, image quality may be assessed using a WVD based measure correlated with subjective human evaluations. A new SNR based on the WVD is shown to outperform the conventional SNR measure (15.4). In an application involving neonatal care, monitoring and diagnosing newborns for seizures is possible using a time-frequency approach exploiting characteristic patterns in the time-frequency plane. These patterns are used as a basis for a time-frequency matched-filter automatic detection (15.5). Machine condition monitoring is a task crucial to the competitiveness of a wide range of industries. Detecting and diagnosing faults in machines is possible using time-frequency approaches such as the WVD, wavelets and wavelet packets
(15.6).
Time Frequency Signal Analysis and Processing Boualem Boashash (Ed.) 92003 Published by Elsevier Ltd.
627
Chapter 15:
628
15.1
Time-FrequencyDiagnosis and Monitoring
T I M E - F R E Q U E N C Y ANALYSIS OF ELECTRIC P O W E R DISTURBANCES ~
With the advent of various power electronic devices in modern power systems, power quality is becoming an important issue for both customers and service providers. Ideally the voltage and current signals in power systems are supposed to be constant in amplitude and frequency. However, the voltage and current frequently suffer distortions in amplitude, frequency and phase due to various sources of disturbances [1]. The effects of periodic distortion or harmonics have been treated with Fourier series and are characterized by a total harmonic distortion (THD) index [1]. Recently transient disturbances, which occur within less than one cycle, are of great interest since they also degrade power quality. Specifically, the detection, estimation, classification and assessment of transient disturbance signals have become an important aspect in power quality analysis. To overcome the inability of Fourier analysis to provide time-localized frequency information for the transient disturbance signals, advanced signal processing techniques such as the wavelet transform [2] and time-frequency analysis [3] have been introduced to provide time-localized frequency analysis of the disturbances. In this article, we focus on applications of time-frequency analysis to transient power quality events. A power system is mainly divided into distribution and transmission levels. We will provide one example from the distribution level for power quality assessment and the other from the transmission level for the propagation of disturbance waves. 15.1.1
Time-Frequency Analysis: Reduced Interference Distribution
As explained in Chapter 3, various types of time-frequency distributions may be expressed in terms of the following quadratic (or bilinear) distribution function with a kernel g(~, T) [4]:
pz(t,f) -
eJ2r~(u-t) g(~, ~-) z(u+ 2) z*(u-2) e-J2~f'd, dudT. (15.1.1)
This pz(t, f) is the time-frequency distribution of z(t). The arguments regarding the kernel selection and suppression of interference effects are also applicable to the case of the power quality analysis. As the disturbance signal is characterized by the presence of multiple frequency components over a short time, interference is also problematic and a high resolution time-frequency distribution is required. Among the various types of time-frequency distributions, the reduced interference distribution (RID) [4] has been shown to exhibit the most suitable properties for the analysis of power quality disturbance events (see [3] and Article 5.2). ~ E d w a r d J. P o w e r s , Y o n g J u n e S h i n and W i l l i a m M. G r a d y , University of Texas at Austin, TX (ejpowers@mail.utexas.edu, june@ece.utexas.edu, grady@ece.utexas.edu). Reviewers: W. J. Williams and G. J. Frazer.
Time-Frequency Analysis of Electric Power Disturbances
629
The advantage of the RID can be found in the motivation and philosophy of the RID. RID is a more general definition of the time-frequency distribution kernel that satisfies the following criteria: realness, time/frequency shift invariance, time/frequency marginal properties, instantaneous frequency (IF) / group delay (GD) availability, and time/frequency support. Various definitions of the RID kernel are available as a two dimensional low pass filter and the requirements listed above. Among the various types of the RID kernels, we employ the binomial distribution kernel provided in [4]. In the beginning of transient power quality signal analysis, the wavelet transform has been mainly utilized [2], because wavelet analysis provides time-localized properties. However, the time-frequency distribution, especially the RID, is a potentially more useful distribution [5] because it allows one not only to visualize the transient signal information but also to extract transient parameters useful in assessing the severity of various transient power quality events.
15.1.2
Power Quality Assessment via Time-Frequency Analysis
In this section, we present the application of time-frequency analysis to the assessment of power quality [3]. A capacitor switching disturbance waveform with its corresponding RID is provided in Fig. 15.1.1. The objective of capacitor switching in power systems is to correct the power factor and/or mitigate the effects of harmonics associated with nonlinearities. However, the switching capacitor event also generates undesirable disturbance waveforms as shown at the top of Fig. 15.1.1. At the bottom of Fig. 15.1.1 the corresponding RID is provided. The time-varying frequency content of the capacitor switching disturbance is clearly seen. Note that frequencies up to 3.5kHz appear to be associated with this particular event. This voltage signal was recorded in the field and is provided through the courtesy of EPRI (Electric Power Research Institute). Besides the visualization of the time-varying spectral characteristics of the disturbance provided by the binomial RID, one can characterize the frequency variation of the disturbance in terms of instantaneous frequency. For a time-frequency distribution pz(t, f) satisfying the IF property (see Section 3.1.1), the instantaneous frequency (IF) may be expressed as
fi(t) = f_o~ f . pz(t, f ) d f . f _ ~ pz(t, f ) d f
(15.1.2)
The instantaneous frequency is a normalized first order frequency moment for the time-frequency distribution and corresponds to the "average" frequency for given time t, where each frequency is weighted by the relative energy associated with that frequency at time t. Note that not all types of time-frequency distributions provide a reasonable estimate of the instantaneous frequency. For a reasonable estimate of the instantaneous frequency, a kernel must meet the requirements that g(t/, ~- - 0) -- 1 and that Og(~,,~-) ~ I~=o - 0, which are satisfied by the binomial RID kernel used in this article (see Tables 3.3.1 and 6.1.1).
630
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
Fig. 15.1.1: Capacitor switching disturbance time series (top) and its reduced interference distribution (bottom). From [3], (~ 1999 IEEE.
Fig. 15.1.2: Capacitor switching disturbance and corresponding instantaneous frequency based on the RID. From [3], Q 1999 IEEE.
As a result, the instantaneous frequency of the disturbance provides a quantitative assessment of the transient frequency distortion. The same capacitor switching disturbance waveform (zoomed for the disturbance portion) used in Fig. 15.1.1, and its corresponding instantaneous frequency, are plotted in Fig. 15.1.2. The peak of
Time-Frequency Analysis of Electric Power Disturbances
Fig. 15.1.3:
631
EMTP simulation circuit configuration. From [6], (~ 2000 IEEE.
the instantaneous frequency is estimated to be around 800 Hz. During the disturbance (between times 16 ms and 28 ms), the disturbance is reflected in the change of the instantaneous frequency. After the transient disturbance the instantaneous frequency returns to 60 Hz. 15.1.3
Application of Instantaneous Frequency for Disturbance Propagation
The time of arrival of disturbance signals on high voltage transmission lines is of great interest for relay and fault localization. Traditional fault localization in a transmission line network is based on a fault-study using voltage and current measurements. The traditional methodology is subject to inaccurate results, because the calculation depends on the rough assumption of the fault impedance and the type of fault. Recently, power system monitoring systems employ GPS (Global Positioning Systems) receivers to provide time synchronized data. GPS synchronized data enables one to solve the fault location problem based on time-of-arrivM of the disturbance waveforms. The propagation properties of high voltage transmission lines have been carefully treated and shown to be dispersive [6]. To treat the time synchronized disturbance data, an accurate estimation of the arrival time is critical. In this section, an application example is provided to show how the instantaneous frequency can be utilized for the arrival time estimation. In Fig. 15.1.3 a simulation circuit diagram is provided. For a long transmission line (345kV), there occurs a typical line-to-ground fault which is 84.6 km away from 'SEND' and 394.3 km away from 'RECV' as indicated in Fig. 15.1.3. For this transmission line configuration, EMTP (Electro-Magnetic Transient Program) simulates the voltage and current disturbances. The corresponding voltage waveforms at individual buses (X0005, SEND, X0041, RECV) are provided in Fig. 15.1.4. As the transmission line is characterized by frequency-dependent attenuation and dispersion, different frequencies suffer different amounts of attenuation and also propagate with different phase and group velocities; consequently, the wave-
Chapter 15:
632
(a) X0005
I x 1o' O~
1 x lO'
Time-FrequencyDiagnosis and Monitoring
~
I
I
I
I
I
I
!
!
|
|
|
|
|
|
|
|
|
|
I
I
I
I
I
I
!
I
I
t
I
I
|
i
!
|
|
|
|
|
|
|
|
|
|
I
I
I
I
I
(b) S E N D |
0
ix lO'
(c) X0041 |
0
1 x lO'
(d) R E C V
0 ,t "
F i g . 15.1.4:
time(ms.)
Disturbance voltage waveforms recorded at individual buses. From [6], (~) 2000 IEEE.
forms observed at different buses appear "distorted" or "dispersed" from the original waveform. Therefore, it is difficult to assign time-of-arrival for "distorted" signals. There are perhaps many ways to determine time-of-arrival; however, in this article we focus on one, namely instantaneous frequency. The corresponding zero sequence mode disturbance voltage is provided in Fig. 15.1.5. The zero sequence mode is a summation of the individual three-phase waveforms and is ideally zero for a balanced three-phase system. Thus it is very sensitive to a fault on any of the three phases as shown in Fig. 15.1.5. The reduced interference distribution has been calculated for the disturbance waveforms in zero sequence mode in order to generate the instantaneous frequency of the zero sequence disturbance signals. The instantaneous frequency, its peak value and time of arrival of the disturbance at various observation points are provided in Fig. 15.1.6. Note that the time axis in Fig. 15.1.6 is zoomed to within a 20-40 ms interval as indicated in Fig. 15.1.5. The time of arrival t arrival has been assigned as follows: tarrival = arg{mtax[fi(t)] }.
As the frequency bandwidth of the disturbance is broad since the disturbance is transient, the assignment of the arrival time via the peak instantaneous frequency is a reasonable approximation. The arrival times and peak values of the instantaneous frequency are presented in Table 15.1.1. To convert times to distance we utilize the results of the analysis presented in reference [6], where it was shown that for a range of peak frequencies appearing in Table 15.1.1, the corresponding zero-sequence group velocity is Vg = (2.6+0.2) x 10 s m/sec. The corresponding estimates of distance are compared to the
Time-Frequency Analysis of Electric Power Disturbances x lO 5
0 105
633
(a) X 0 0 0 5
!
210
!
410
|
100
120 .
(b) S
140 .
160 .
180
200
160
180
200
~
,
160
180
.
0 ~ I
5 -0 105 5~
I
2~0 I
I 410 I
I
I
i0
;0
. 100
,
~
,
. 120.
. 140.
~
,
0
.
60
.
.
.
120
I
I
i
J
I
J
!
I
l
20
40
60
80
time(ms.)
120
140
160
180
'
140
.
101 0
'
100
.
I
I
80
.
0 1
=
200
'
'
i
200
Fig. 15.1.5: Zero sequence disturbance voltage waveforms recorded at individual buses. From [6], (~ 2000 IEEE.
~ (a) xo005 (origin) ' '
I
I
I
30
;2
i (84.6 km)i
__
!
i
36
38
J
40
51.22 KHz @ 29.25 ms.
~1
i
I
i
34
/
i
c) X0041( 162.7 km )
t ....
I
100.36 KHz @ 28.90 ms. J..~.
(d) RECV ( 394.3 km)
I
159.60 KHz @ 28.60 ms.
28
t (b) S Ei N D
I
~ ,
i
!
i
50.8 KHz @ 30.20 ms.
....
I
I
I
.
1
Fig. 15.1.6: Instantaneous frequency estimation of the disturbance voltage waveforms in zero sequence. From [6], (~) 2000 IEEE.
true distance in Table 15.1.1. Note the range of estimated distance agrees quite well with the known true distance. Note, also, that the peak instantaneous frequency is lower for the larger propagation distance. This is due to the fact that higher frequencies associated with the disturbance suffer greater attenuation than lower
Chapter 15:
634
Table
15.1.1:
Summary of the zero
Time-FrequencyDiagnosis and Monitoring
sequencedisturbancevia
Arrival time
Peak IF
True distance
(unit)
(ms)
(KHz)
(Km)
(Km)
X0005
28.60
159.60
0.0
N/A
SEND
28.90
100.36
84.6
72 ~ 84
Bus name
RID.
Estimated distance
X0041
29.25
51.22
162.7
156 ~ 182
RECV
30.20
50.80
394.3
384 ~ 448
frequencies. Ongoing work in voltage-only distance localization involves refinement of the instantaneous frequency approach and consideration of the use of group delay.
15.1.4
Summary and Conclusions
In this article, we provided two examples of time-frequency analysis applied to power system disturbances. RID-based time-frequency analysis successfully provides a useful characterization of power system disturbance signals. In this article, we have also demonstrated that the concept of instantaneous frequency is quite useful in providing a simplified picture of the time-frequency features of a transient disturbance and in determining the arrival time of a disturbance observed in the zero-sequence mode of a three-phase system.
References [1] J. Arrillaga, M. H. J. Bollen, and N. R. Watson, "Power quality following deregulation," Proc. IEEE, vol. 88, pp. 246-261, February 2000. [2] S. Santoso, E. J. Powers, W. M. Grady, and P. Hofmann, "Power quality assessment via wavelet transform analysis," IEEE Trans. Power Delivery, vol. 11, pp. 924-930, April 1996. [3] Y. Shin, A. C. Parsons, E. J. Powers, and W. M. Grady, "Time-frequency analysis of power system disturbance signals for power quality," in Proc. IEEE Power Engineering Soc. Summer Meeting, vol. 1, pp. 402-407, Edmonton, AL (Canada), 18-22 July 1999. [4] W. J. Williams and J. Jeong, "Reduced interference time-frequency distributions," in Time-Frequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 3, pp. 74-97, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992. [5] L. Cohen, "Wavelet moments and time-frequency analysis," in Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations IX, vol. 3807, pp. 434445, Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 19-21 July 1999. [6] Y. Shin, E. J. Powers, W. M. Grady, and S. C. Bhatt, "Effects of dispersion on disturbance propagation on high voltage transmission lines," in Proc. IEEE Power Engineering Soc. Summer Meeting, vol. 2, pp. 851-854, Seattle, WA (USA), 16-20 July 2000.
Combustion Diagnosis by -IF Analysis of Car Engine Signals
635
C O M B U S T I O N DIAGNOSIS BY TF ANALYSIS OF CAR ENGINE SIGNALS~
15.2 15.2.1
Knocking Combustions
A permanent challenge for car manufacturers is to increase efficiency, reduce pollution, and prolong life of internal combustion engines. A restriction to these aims is the occurrence of knock. Knock is an undesired spontaneous auto-ignition of the end gas causing a sharp increase of pressure and temperature [1]. Generally, rare knock has no effect to engine performance but frequent or very strong knock can damage the engine. Knock excites combustion chamber acoustic resonances that can be measured by special pressure sensors. But they are too expensive for use in serial vehicles. Nowadays, acceleration sensors mounted on the engine housing measure structure-borne sound as a distorted version of pressure to detect knock. Time-frequency analysis can help to understand the nature of pressure and structure-borne sound signals and to improve knock detection. The following applications for combustion diagnosis make use of the high frequency part of the signals which means frequencies above 3 kHz. Therefore, all signals were high pass filtered. Fig. 15.2.1 shows two pressure signals and a structure-borne sound signal of a BMW engine recorded simultaneously. The sensor of pressure 1 is mounted in the spark plug, the sensor of pressure 2 in the cylinder head.
15.2.2
Signal Models
Considering homogeneous gas distribution, the frequencies of the resonances depend on the speed of sound and of the combustion chamber geometry. The latter can be approximated by an ideal cylinder [2], or more accurately using finite element methods (FEM) [3]. Fig. 15.2.2 shows the instantaneous frequencies of a BMW engine estimated by FEM simulations [4]. But they do neither yield information about time instant when knock occurs nor the resonance amplitudes. The time instant of excitation is random, usually between 5 ~ to 15 ~ crank angle after top dead center of the piston. The amplitudes are damped due to heat and friction losses and to increasing combustion chamber volume. Previous considerations motivate to model the undisturbed pressure signal in time domain by [5]
X(t) - E Ape-d'(t-t~ cos 2zr
fp(a(T))dT + (bp u(t -- to).
(15.2.1)
p=l
The pressure signal is a superposition of P resonances, index p refers to the pth component. Ap and (I)p are random initial amplitude and phase, e -dpt describes the damping of the oscillation, to a suitable time instant after excitation, u(t) the step ~ J o h a n n F. B h h m e , Department of Electrical Engineering and Information Science, Ruhr-Universit~t Bochum, 44780 Bochum, Germany (boehme@sth.ruhr-uni-bochum.de) and Shnke C a r s t e n s - B e h r e n s , Robert Bosch GmbH, 70049 Stuttgart, Germany (scb@ieee.org, scb@sth.ruhr-uni-bochum.de). Reviewers: LJ. Stankovi(~ and G. Matz.
Chapter 15:
636
Time-FrequencyDiagnosis and Monitoring
Fig. 15.2.1" Example of time signals of a knocking combustion.
function, and fp(a(t)) the instantaneous frequency at crank angle a(t). The engine housing transfer function is modeled as a linear, time-varying function h(z(t), T), whereas time variance stems only from piston position z(t) which is a function of crank angle. Merging pressure model and engine housing transfer function yields the model of the undisturbed structure-borne sound signal Y(t) -
F
h(z(t), 7 ) X ( t - 7)dT-
oo
P
(15.2.2) p--1
where Hp(z(t), w) -
F
oo
h(z(t), 7) e dp~ e - j ~ d~-.
(15.2.3)
Combustion Diagnosis by TF Analysis of Car Engine Signals
637
Fig. 15.2.2: Frequencies estimated by finite elements (solid lines) and estimated WVS of pressure and structure-borne sound signals of a BMW engine at 3000 rpm based on 96 realizations measured simultaneously.
The approximation is valid if the instantaneous frequencies fp(c~(t)) do not change fast. Thus, the structure-borne sound signal is a complex amplitude modulated version of the pressure signal.
15.2.3
Signal Analysis using Wigner-Ville Spectrum
Pressure and structure-borne sound signals consist of a superposition of amplitude and frequency modulated components. Therefore, identification of the components only in time domain or frequency domain is difficult: a suitable time-frequency representation is required. The Wigner-Ville spectrum (WVS) yields a high timefrequency resolution. For a stochastic process S(t) the WVS is defined by
Ws(t, f) -
rs
t + -~7, t - -~- e
dT
(15.2.4)
O0
where r s ( t , t ' ) = E{S(t)S*(t')} is the correlation function of S(t) [see Article 9.4]. In case of constant speed and load pressure and structure-borne sound signals are
Chapter 15:
638
Time-FrequencyDiagnosis and Monitoring
cyclo-stationary with low stochastic dependency from combustion to combustion [5]. Exploiting this fact, the correlation function can be estimated by
§
t') -
1
L
-~E
sz(t)sz(t')
(15.2.5)
/--1
where st(t) is observed pressure or structure-borne sound of the /th combustion, l = 1 , . . . ,L, with st(t) = 0 if t is not in the observation interval [0,T] and t - 0 corresponds to top dead center for each combustion cycle. Substituting the estimated correlation function in Eq. (15.2.4) yields the estimated WVS of pressure or structure-borne sound, respectively,
? ( O0
)
This WVS estimate has three advantages: it has a high time-frequency resolution, cross-terms are reduced effectively, and the algorithm is fast compared to other approaches that first estimate the WVS of each combustion and then take the mean over all combustions. Model verification Fig. 15.2.2 shows the estimated WVS of the signals of two differently located pressure sensors and a structure-borne sound signal of a B M W engine. In order to stress weak signal components, signal amplitudes are scaled logarithmically, so we have to be careful in interpreting gray scales as amplitudes. In addition, Fig. 15.2.2 shows the instantaneous frequencies estimated by F E M simulations. Obviously, the pressure sensors observe different resonances due to their positions and pressure nodes and anti-nodes locations. The frequencies of the visible pressure resonances coincide well with the frequencies estimated by F E M simulations. The structure-borne sound signal shows similar behavior but there are some strange components at approximately 6.5 kHz and between 10 and 12 kHz. They may be a consequence of the engine housing transfer function, additional noise, or of insufficiently reduced cross-terms of the estimated WVS. As mentioned before, we have to be careful in interpreting the gray scale amplitudes. Nevertheless, we can recognize for pressure and structure-borne sound signals that knock occurs around 10 ~ crank angle. Furthermore, signal power (and amplitude) is maximal at excitation of knock and decreases with increasing crank angle. In general, the results of the estimated WVS justify the decomposition of pressure and structure-borne sound signals into frequency and amplitude modulated components.
Knock s e n s o r position optimization The number of resonances being observed by a sensor depends strongly on its position. This holds for pressure as well as for structure-borne sound. Therefore, it is very important to find a good acceleration sensor position. This can be done at test bed engines by optical methods like laser holography or by assessing the position of several acceleration sensors with statistical analysis [6]. A very simple, intuitive, and fast approach is to use the estimated
Combustion Diagnosis by TF Analysis of Car Engine Signals
639
WVS. Since there are fast algorithms for its implementation, it can be calculated at the test bed while the engine is running. The operator compares the WVS of the pressure signal with those of the acceleration sensor signals. Using pressure as reference, structure-borne sound signals that contain the same resonances are more suitable for knock detection than signals with less or different resonances. As spinoff, observing the sensor signals by WVS is very useful for verifying measurement setup, a very important task at test bed measurements.
Non-equidistant sampling Knock detection usually bases on estimated resonance energies of structure-borne sound signals. As previous considerations have shown, the frequencies vary with crank angle. Optimum estimation of resonance energies can therefore be achieved by time-varying filtering. Alternatively, optimized non-equidistant sampling in time domain leads to resonances with approximately constant frequencies [7]. Using Wigner-Ville analysis, parameters can be found to optimally compensate the frequency modulation in a crank angle interval with significant resonance energy. Then, ordinary band pass filtering is sufficient to estimate the resonance energies well.
15.2.4
Signal Analysis using S-Method
The estimated WVS is a useful tool for analyzing a large number of combustions at once. But if the number of combustions is not large enough, cross-terms disturb the estimated spectrum. In many cases, we are interested in analyzing single combustions. There are a couple of good WVS estimators reducing or avoiding cross-terms; see e.g. [8, 9]. Among these, the S-method has been proven to be an appropriate tool to represent single knock signals in time-frequency domain [10]. Using the short-time Fourier transform of signal s(t) -
/?
+
(15.2.7)
oo
where
w(r)
is a lag window, the S-Method can by defined as
SM(t,w)- 1 T"
/? P(O)F~(t,w+O)F~'*(t,w-O)dO
(15.2.8)
oo
with an appropriate window P(O). The width of P(O) affects the behavior of the S-Method with regard to reduction of cross-terms significantly. For a detailed discussion of the S-Method see Article 6.2. Exploiting Fast Fourier Transform algorithms, the S-Method can be implemented very efficiently [11], so that real-time processing is possible on a fast DSP.
Observation of single combustions Fig. 15.2.3 shows the estimated WVS and the S-Method of a single knocking combustion. The estimated WVS seems to contain mainly one component. There are some more but it is very difficult to distinguish between auto-terms and cross-terms. The time-frequency concentration of the SMethod is worse but it is easy to identify at least four components. Therefore, the
640
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
F i g . 15.2.3: Left: estimated WVS of a single knocking combustions of a BMW engine at 3000 rpm observed by pressure sensor 2; middle: S-Method of the same signal; right: S-Method averaged over 96 realizations.
S-Method is much more suitable to observe single combustions than the estimated WVS. Normally, the WVS is estimated by averaging over a large number of realizations. Analogue, the S-Method of a large number of single knocking combustions was averaged in Fig. 15.2.3 (right). Comparing this representation to the according WVS in Fig. 15.2.2 (middle), we recognize the worse resolution of the averaged SMethod. But the amplitude and frequency modulation is observable. In contrast to the WVS, the averaged S-Method is able to resolve the component that starts at 15 kHz at 0 ~ crank angle which is the strongest component in pressure 1; see Fig. 15.2.2 (left). In the WVS this component is covered by cross-terms. Apparently, the number of realizations was too small to reduce cross-terms sufficiently.
Power and energy estimation As demonstrated before, the S-Method can be used to investigate single combustions. As shown in [10], it is possible to track the resonances and to estimate the power A2(t) of component p at each time instant t
Combustion Diagnosis by -IF Analysis of Car Engine Signals
641
by integrating along the frequency axis in the component's band Bp(t):
1 /B
SM(t,w) dw.
(15.2.9)
In the simplest case, the band width Bp(t) is constant. But considering knock signals, the band width decreases with crank angle; see Fig. 15.2.2. So the signal-tonoise ratio can be improved by adapting the band width. In Article 6.2 an adaption algorithm is proposed. Finally, integration of A2p(t) along the observation interval [0, T] yields the estimated resonance energy Ep of component p
#,p -
T
f0
Ap(t)dt.
(15.2.10)
Knock detection The main application of estimating resonance energies of internal combustion engines is the detection of knock. A knock detector in today's cars based on structural-borne sound usually estimates the signal energy in a wide band, e.g. 5 to 12 kHz. The energy is compared to a threshold which yields the knock detection. This is a simple, fast and successfully applied method. But it can be improved by taking into consideration that signal energy is the sum of the energies of single components. Since the engine housing transfer function may attenuate or amplify the components differently, we get a wrong idea of the energy distribution inside the cylinder which is used as reference at test bed engines. Among other time-frequency detectors like those presented in Article 12.4 or [12], [13], the S-method is a useful tool for estimating the resonance energies of single components and is therefore advantageous for detecting knock; see e.g. [14].
15.2.5
Summary and Conclusions
On the basis of the WVS and the S-method, it was demonstrated that timefrequency representations are suitable and very useful tool~ for the task of combustion diagnosis of internal combustion engines. They allow an intuitive insight in relevant signal ~)arameters and properties like amplitude and frequency modulation. Therefore, they simplify and improve investigations during the development and application of new engines. Furthermore, they help to improve signal processing for higher engine efficiency.
References [1] J. B. Heywood, Internal Combustion Engine Fundamentals. McGraw-Hill, 1988. [2] R. Hickling, D. A. Feldmaier, F. H. K. Chen, and J. S. Morel, "Cavity resonances in engine combustion chambers and some applications," J. Acoustical Soc. of America, vol. 73, pp. 1170-1178, April 1983. [3] D. Scholl, C. Davis, S~ Russ, and T. Barash, "The volume acoustic modes of sparkignited internal combustion chambers," in SI Engine Combustion, no. 1315 in SAE SPECIAL PUBLICATIONS, pp. 1-8, Soc. of Automotive Engineers, 1998. Proc. SAE Internat. Congress & Exposition, Detroit, MI, 23-26 February 1998. SAE paper no. 980893.
642
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
[4] S. Carstens-Behrens and A. Podvoiski, "Estimation of combustion chamber resonances for improved knock detection in spark ignition engines," in Proc. 18th CAD-FEM Users' Meeting, Internationale FEM-Technologietage, Friedrichshafen, Germany, 20-22 September 2000. [5] J. F. B6hme and D. K6nig, "Statistical processing of car engine signals for combustion diagnosis," in Proc. Seventh IEEE Workshop on Statistical Signal and Array Processing (SSAP-94), pp. 369-374, Quebec City, 26-29 June 1994. [6] A. M. Zoubir, "Bootstrap multiple tests: An application to optimum sensor location for knock detection," Applied Signal Processing, vol. 1, pp. 120-130, 1994. [7] D. K6nig, "Application of time-frequency analysis for optimum non-equidistant sampling of automotive signals captured at knock," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'96), vol. 5, pp. 2746-2749, Atlanta, GA, 7-10 May 1996. [8] H.-I. Choi and W. J. Williams, "Improved time-frequency representation of multicomponent signals using exponential kernels," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 37, pp. 862-871, June 1989. [9] Y. Zhao, L. E. Atlas, and R. J. Marks II, "The use of cone-shaped kernels for generalized time-frequency representations of non-stationary signals," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 38, pp. 1084-1091, July 1990. [10] L. Stankovi6 and J. F. BShme, "Time-frequency analysis of multiple resonances in combustion engine signals," Signal Processing, vol. 79, pp. 15-28, November 1999. [11] L. Stankovi6, "A method for time-frequency analysis," IEEE Trans. Signal Processing, vol. 42, pp. 225-229, January 1994. [12] G. Matz and F. Hlawatsch, "Time-frequency subspace detectors and application to knock detection," Archiv fiir Elektronik und Ubertragungstechnik (Internat. J. of Electronics s Communications), vol. 53, pp. 379-385, December 1999. [13] B. Samimy and G. Rizzoni, "Mechanical signature analysis using time-frequency signal processing: Application to internal combustion engine knock detection," Proc. IEEE, vol. 84, pp. 1130-1343, September 1996. [14] S. Carstens-Behrens, M. Wagner, and J. F. B6hme, "Detection of multiple resonances in noise," A rchiv fiir Elektronik und (]bertragungstechnik (Internat. J. of Electronics s Communications), vol. 52, no. 5, pp. 285-292, 1998.
Power Class Time-Frequency Representations and their Applications
15.3
15.3.1
643
POWER CLASS TIME-FREQUENCY REPRESENTATIONS AND THEIR APPLICATIONS o Power Class Quadratic Time-Frequency Representations
Various classes of quadratic time-frequency representations (QTFRs) are best suited for analyzing signals with certain types of time-frequency (TF) geometries. For example, when a signal has constant TF characteristics, Cohen's-class QTFRs (with signal-independent kernels; i.e. the quadratic class as defined on p. 68) [1,2] are most appropriate. The aforementioned QTFR classification is based on the grouping together of all QTFRs that satisfy the same two signal transformation covariance properties (see [3,4] and Articles 4.3 and 5.6). Specifically, Cohen's class [1, 2] consists of QTFRs that are covariant to constant (nondispersive) time shifts and frequency shifts of the signal whereas the affine class (see [2, 5, 6] and Article 7.1) consists of QTFRs that are covariant to scale changes (dilations) and constant time shifts. Furthermore, the hyperbolic class [7, 8] consists of QTFRs that are covariant to scale changes and hyperbolic dispersive time shifts and are best suited to analyze signals with hyperbolic (nonlinear) group delay. When the analysis signal has a group delay that is a power function of frequency, the aforementioned QTFRs do not provide an adequate representation as they do not match power TF characteristics. Thus, we designed power class QTFRs to successfully localize signals along their power law group delay functions [9, 10]. The importance of power QTFRs is pronounced by the fact that many applications involve signals with dispersive group delays governed by a power law that corresponds to some power parameter ~. Examples of such signals include the dispersive propagation of a shock wave in a steel beam (~ = 1/2), trans-ionospheric chirps measured by satellites (t~ = -1), acoustical waves reflected from a spherical shell immersed in water, various cetacean mammal whistles, and signal solutions of the diffusion equation (t~ = 1/2) (e.g., waves propagating along uniform distributed RC transmission lines). Power laws can also be used to roughly approximate other, more complex, group delays. References for these applications can be found in [9].
Localized signal analysis application. The type of signals found in the applications mentioned above constitute the family of power impulses that best typifies the power TF geometry. Power impulses are defined in the frequency domain as I(~)(f) ~- V/IT~(f)]
e -j2rcA~(~/y)
--
V/(It~l/fr)[f/fr[ ~-1 e -j2zccsgn(f)[f/f'['~ (15.3.1)
with monotonic phase spectrum A ~ ( f / f r ) = s g n ( f ) [ f / f r [ ~ and power group delay ~ Antonia Papandreou-Suppappola, Telecommunications Research Center, Department of Electrical Engineering, Arizona State University, Tempe, AZ 85287-7206 USA (papandreou@asu.edu), F r a n z H l a w a t s c h , Institute of Communications and Radio-Frequency Engineering, Vienna University of Technology, A-1040 Vienna, Austria (fhlawats@pop.tuwien.ac.at), and G. Faye B o u d r e a u x - B a r t e l s , Department of Electrical and Computer Engineering, University of Rhode Island, Kingston, RI 02881 USA (boud@ele.uri.edu). Reviewers: J. Bertrand and J. P. Ovarlez.
Chapter 15:
644
Time-FrequencyDiagnosis and Monitoring
Tg(f) = cT~(f) -- C ~ If /f~l ~-1 -- C ~d A ~ ( f / f r ) with f E ~. Here, sgn(f) provides the sign (+1) of the frequency variable f, and fr > 0 is a fixed reference frequency. For successful analysis, an ideal Q T F R T must be localized along the group delay Tg(f) of the power impulse in Equation (15.3.1). In particular,
TI~, (t, f) - IT~(f)I
5(t -
(15.3.2)
eTa(f))
is very desirable in many applications where information about the signal analyzed could be obtained from the localized curve t = C T,~(f) in the T F plane. For example, the localization could be used in applications such as signal classification or estimation of the parameter c in (15.3.1). As we will show next, some power class Q T F R s ideally provide the localized representation in (15.3.2) for analyzing power impulses as well as other signals with power group delay. The power law T F structure can also be observed in applications where a system can cause a time shift that varies dispersively in frequency to an input signal with Fourier transform X ( f ) yielding the output signal Y ( f ) = e -j2rcA~(f/f~) X ( f ) . Thus, power Q T F R s could be used successfully in applications where a signal with constant group delay passes through a system with power dispersive T F characteristics that transforms the signal's constant group delay to a power group delay [9]. For example, the ocean is a medium (system) that could cause power dispersive changes to an underwater communications information message and its echoes. These changes could be accounted for at the receiver when matching processing tools, like power QTFRs, are used for detection.
The power QTFR classes. Following the covariance-based classification method, for an analysis signal x(t) with Fourier spectrum X (f), we define all nth power class QTFRs, T(x~) (t, f), to satisfy two specific covariance properties [9, 10]. The first property is covariance to scale changes on x(t), i.e.,
Tc(~) a X (t, f )
~
T(x'~)(at , f /a)
(15.3.3)
where the scaling operator Ca is defined as (CaX)(f) - X ( f / a ) / ~ . The second property is covariance to power time shifts on x(t) that correspond to frequencydependent shifts, T~(f), in the signal's group delay. Specifically, T T~: (~)~)X (t, f) - T (~) (t - c~-~(f) f ) - T (~) (t - c(~//f,)lf//f,I '~-~, f )
(15.3.4)
for f e ~. The effect of the power time shift operator 19(~) is given by [9] -
z(f)
-
x(f).
(15.3.5)
Here, c E ~ and ~ c ~ (~ # 0) is the power parameter associated with each power class. The ~th power function A~(b) = sgn(b)Ibl ~, b e ~, corresponds 1 to a 1The definition An(b) = sgn(b)Ibl~ versus An(b) -- b~ extends the power function to b < 0 so that An(b) is an odd, strictly monotonic function constituting a one-to-one mapping from ~ to ~.
Power Class Time-Frequency Representations and their Applications
T~:(f)l
645
~-4(f)
9
"f
~ 89
~ 89
Fig. 15.3.1: The power group delay T,~(f) for various choices of the power parameter ~.
transformation of the phase spectrum of the signal as shown in Equation (15.3.5). The frequency-dependent time shift 7,~(f) in (15.3.4) corresponds to the derivative d A,~(f/fr) - ~~ I f / f r t '~-1 " Fig. 15.3.1 depicts of the power function, i.e., r , ~ ( f ) - -37 T~(f) in the TF plane for various choices of the power parameter ~. The importance of the ~th power class QTFRs is directly linked to the two covariances in (15.3.3) and (15.3.4). The power time shift covariance in (15.3.4) is useful in analyzing signals passing through dispersive systems with power law group delay or signals localized along power law curves in the TF plane. On the other hand, the scale covariance in (15.3.3) is important for multiresolution analysis. Power class formulation.
It was shown in [9] that any Q T F R of the ath power
class can be expressed as T(~)(t'f)-
1
~
/F ~oo
9d 2€200
oo
FT(fl/f, fe/f) X ( f l ) X * ( f 2 ) d f l df2,
(15.3.6)
where the two-dimensional (2-D) kernel FT(bl, b2) uniquely characterizes the QTFR. Specific choices of FT(bl, b2) define specific QTFRs T (~) in the ~th power class. Also note that a different power class is obtained by varying ~ in (15.3.6). When ~ = 1 we obtain the affine class [see Article 7.1], which is an important special case of the power classes corresponding to the constant (nondispersive) time shift 71 (f) =- 1~ft. The nth power class QTFR in (15.3.6) can also be obtained via a unitary warping operation (see Articles 4.5 & 5.6 and [9-11]). Specifically, if T(A)(t, f ) is a QTFR
Chapter 15:
646
Time-Frequency Diagnosis and Monitoring
of the affine class, then the corresponding ~th power class QTFR, T(x~) (t, f), can be obtained by warping the affine class QTFR, T(xA) (t, f), according to [9]
T(x~)(t, f) -
-
(
t
u~x frT~(f-----~'h A ~ ( f / f ~ ) u~x
(
)
t
)
~]f/frl~_l ,f~sgn(f)lf/f~] ~ .
Here, the unitary 2 frequency axis warping operator
frA_~l(f -_ ~ [ f (bl~X)(f)- ~/SrX(frA-~l(f /fi.)lT~(/fr))]
[9,10]/g~
/frl
(15.3.7)
is given by
X(frsgn(f)lf /frl 88
u
where
inverse function
s tisOes
=
= b. The
QTFRs of the affine class are defined as [2]
T(xA)(t, f) = ~1 ~o~ oorT(fl/f, f2/f)e r
Z(f~) X*(f2) df~df2
where FT(bl, b2) is a 2-D kernel characterizing the affine class QTFR (aft (15.3.6)). Note that T(xA)(t,I) = T(x~)(t,f)l~=l . The unitary warping relation in (15.3.7) preserves certain desirable characteristics of the affine class while transforming other ones to match the dispersive nature of the signals to be analyzed by power class QTFRs. For example, whereas both classes preserve scale changes of the signal, only the affine class preserves constant (nondispersive) time shifts. On the other hand, the warping in (15.3.7) transforms constant time shifts to power dispersive time shifts in the power class, and thus the constant time shift covariance of the ai~ine class is transformed into the power time shift covariance of the power classes. The warping also provides an efficient method for computing power class QTFRs when algorithms for computing affine class QTFRs are available [10]. Class members. Specific QTFRs of the power classes satisfy various desirable properties in addition to the covariance properties in (15.3.3)-(15.3.4) satisfied by all members of the power classes. Some power class QTFRs of particular importance include the power Wigner distribution, the powergram, the smoothed pseudo power Wigner distribution [9], and the Bertrand P~-distributions (see Article 7.1 and [6,9]). All these QTFRs have counterparts in the affine class by virtue of the power warping relation in (15.3.7). For example, the power Wigner distribution, W(x~)(t,f) =
f
X fA; 1 1+~
fA~
1-
~2 ~-1
= Wu~x(t/(frT~(f)), f~A~(S/f~)) 2Unitarity of the operator/d~ implies that/d~ preserves inner products. Specifically, the operator satisfies .f-~oo(U~X)(f) (ld~X)* (f) df = f _~176X ( f ) X* (f) dr.
Power Class" Time-Frequency Representations and their Applications
547
(cf. (15.3.7)), is the power warped version of the well-known Wigner distribution (WD) , W x ( t , f ) - f _ ~ X ( f + 2 ) X * ( f - 2 ) e j2rtv dv [1-3]. As such, it satisfies many properties such as a specific set of marginal properties and the perfect TF localization property for power impulses in (15.3.2). Just like the WD, the power WD contains oscillatory and partly negative cross terms when multicomponent signals are analyzed (see Article 4.2 and [10]). In order to suppress cross terms, a specific type of smoothing can be applied that is matched to the power TF geometry. The powergram and the smoothed pseudo power WD apply such a smoothing to the power WD, at the expense of the loss of some properties (such as the marginal properties) and the loss of TF resolution. The Bertrand P~-distributions (see Article 7.1 and [6]) are also perfectly localized for power impulses; moreover, they are the only power class QTFRs that preserve constant time shifts in addition to power dispersive time shifts. Power class members and their properties are discussed in detail in [9]. Next, we present examples with both synthetic and real data.
15.3.2
Power Class Applications
Synthetic data analysis example. The discrete implementation of power QTFRs
(outlined in [9, 10]) was applied to analyze a two-component signal consisting of two power impulses with power parameter nsignal -- 3. F o r computational purposes, the impulses are windowed in the frequency domain. Figs. 15.3.2(a) and 15.3.2(b) show the results obtained with the power WD and a smoothed pseudo power WD with a very short smoothing window. Both QTFRs have power parameter ~ = 3, matched to the power impulse parameter t%igna 1. The power WD in Fig. 15.3.2(a) has very good TF concentration but large cross terms [10] which are effectively suppressed in the smoothed pseudo power WD in Fig. 15.3.2(b) with hardly any loss of TF concentration. Also shown (in Figs. 15.3.2(c) and 15.3.2(d)) are the results obtained with the WD and an affine-smoothed pseudo WD, both members of the affine class [2] (i.e., both QTFRs have power parameter t~= 1 ~ t~signal). The WD in Fig. 15.3.2(c) is not matched to the power impulses, displaying complicated cross terms. The affine-smoothed pseudo WD in Fig. 15.3.2(d) does not suppress all the cross terms and has a larger loss of TF concentration than does the smoothed pseudo power WD in Fig. 15.3.2(b). Although all QTFRs in Fig. 15.3.2 are scale covariant, the results of the two power QTFRs with ~ = 3 in Figs. 15.3.2(a) and 15.3.2(b) are better than those of the two affine QTFRs with ~ = 1 in Figs. 15.3.2(c) and 15.3.2(d) because the former two are optimally matched to the ~signal----3 power law group delays of the power impulse signal components. In order to further demonstrate the effect of mismatch in the signal parameter ~signal and the Q T F R power parameter ~, Figs. 15.3.2(e) and 15.3.2(f) show the results obtained when analyzing the above signal using the power WD and a smoothed pseudo power WD with Q T F R power parameter ~ = 4. Note that in Figs. 15.3.2(e) and 15.3.2(f) the power parameter of the power class QTFRs, ~ = 4, is different from that of the signal, ~signal = 3. The smoothed pseudo power WD in Fig. 15.3.2(f) has better cross term suppression and better TF concentration along
548
Chapter 15:
Time-Frequency Diagnosis and Monitoring
Fig. 15.3.2: Power class analysis of a two-component analytic signal consisting of the sum of two windowed power impulses with signal power parameter ~signal -- 3. (a) Power WD with ~ - 3, (b) smoothed pseudo power WD with ~--3, (c) WD (~--1), (d) affine-smoothed pseudo WD ( ~ - 1 ) , (e) power WD with ~---4, and (f) smoothed pseudo power WD with ~--4.
the true group delay than the affine-smoothed pseudo WD in Fig. 15.3.2(d) since the power parameter mismatch in Fig. 15.3.2(f) is smaller than in Fig. 15.3.2(d).
Real data analysis example. Next, we demonstrate the use of power class QTFRs for analyzing real data with dispersive TF structure. Fig. 15.3.3 shows two power class QTFRs with n =0.35 and two affine (n = 1) QTFRs of the measured impulse response of a steel beam with rectangular cross section 3 [12]. The impulse response was obtained by lightly tapping one end of the steel beam in the direction orthogonal to the fiat side of the beam. Bending waves travel along the beam until they are reflected at the free end. They return to the point of impact, are reflected again, etc., thereby producing a series of echoes with increasing dispersion. The QTFRs in 3The d a t a was obtained by J. Woodhouse in an experiment at C a m b r i d g e University. We are grateful to D. Newland and J. Woodhouse for making this d a t a accessible to us.
Power Class Time-Frequency Representations and their Applications
649
Fig. 15.3.3: Power class analysis of a bandpass-filtered segment of the measured impulse response of a steel beam (sampling freq. 4,096 Hz). (a) Power WD with ~--0.35, (b) smoothed pseudo power WD with ~=0.35, (c) WD ( ~ = 1), and (d) affine-smoothed pseudo WD ( ~ = 1).
Fig. 15.3.3 display a bandpass-filtered segment of the measured impulse response. As can be seen, the smoothed pseudo power WD with ~ = 0.35 in Fig. 15.3.3(b) shows better resolution and/or cross term suppression than the other three Q T F R s depicted. The specific value of ~ = 0.35 was chosen empirically to match the T F curvature of the primary reflection.
15.3.3
Summary and Conclusions
This article presented Q T F R classes specifically matched to signals and systems with power law group delay characteristics. These power Q T F R s preserve scale changes and power law frequency-dependent time shifts of the signal under analysis. Thus, these QTFRs are potentially useful in applications where a propagation medium causes power dispersive time shifts as was demonstrated using a real data example. The implementation of power QTFRs can be based on a warping transformation that relates the ecth power class with the affine class. Successful application of power
650
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
class Q T F R s presupposes sufficient a priori knowledge about the signal to aid in choosing the appropriate power parameter ~.
References [1] L. Cohen, Time-Frequency Analysis. Englewood Cliffs, N J: Prentice-Hall, 1995. [2] P. Flandrin, Time-Frequency/Time-Scale Analysis. San Diego: Academic Press, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [3] F. Hlawatsch and G. F. Boudreaux-Bartels, "Linear and quadratic time-frequency signal representations," IEEE Signal Processing Magazine, vol. 9, pp. 21-67, April 1992. [4] A. Papandreou-Suppappola, time-frequency representations ance: A unified framework for Processing: A Review Journal,
F. Hlawatsch, and G. with scale covariance the affine, hyperbolic, vol. 8, no. 1, pp. 3-48,
F. Boudreaux-Bartels, "Quadratic and generalized time-shift covariand power classes," Digital Signal 1998.
[5] O. Rioul and P. Flandrin, "Time-scale energy distributions: A general class extending wavelet transforms," IEEE Trans. Signal Processing, vol. 40, pp. 1746-1757, July 1992. [6] J. Bertrand and P. Bertrand, "A class of affne Wigner functions with extended covariance properties," J. of Mathematical Physics, vol. 33, pp. 2515-2527, July 1992. [7] A. Papandreou, F. Hlawatsch, and G. F. Boudreaux-Bartels, "The hyperbolic class of quadratic time-frequency representations--Part I: Constant-Q warping, the hyperbolic paradigm, properties, and members," IEEE Trans. Signal Processing, vol. 41, pp. 34253444, December 1993. Special Issue on Wavelets and Signal Processing. [8] F. Hlawatsch, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels, "The hyperbolic class of quadratic time-frequency representationsuPart II: Subclasses, intersection with the affine and power classes, regularity, and unitarity," IEEE Trans. Signal Processing, vol. 45, pp. 303-315, February 1997. [9] F. Hlawatsch, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels, "The power classesuQuadratic time-frequency representations with scale covariance and dispersive time-shift covariance," IEEE Trans. Signal Processing, vol. 47, pp. 3067-3083, November 1999. [10] A. Papandreou-Suppappola, F. Hlawatsch, and G. F. Boudreaux-Bartels, "Power class time-frequency representations: Interference geometry, smoothing, and implementation," in Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pp. 193196, Paris, 18-21 June 1996. [11] R. G. Baraniuk and D. L. Jones, "Unitary equivalence: A new twist on signal processing," IEEE Trans. Signal Processing, vol. 43, pp. 2269-2282, October 1995. [12] D. E. Newland, "Time-frequency and time-scale analysis by harmonic wavelets," in Signal Analysis and Prediction (A. Prochazka, J. Uhlir, P. J. W. Rayner, and N. G. Kingsbury, eds.), ch. 1, Boston, MA: Birkh~user, 1998.
Image Distortion Analysis using the Wigner-Ville Distribution
15.4
651
IMAGE DISTORTION ANALYSIS USING THE WIGNER-VlLLE DISTRIBUTION 0
This article deals with the 2D Wigner-Ville distribution (WVD) in the context of image analysis applications. The properties that motivate the use of 2D WVD in image analysis are reviewed. The important issue of choosing the analytic image is emphasized, and practical implementation aspects are discussed. The usefulness of Wigner-Ville distribution for image analysis is demonstrated by an application to image dissimilarities measurement. The WVD-based measure is correlated with subjective human evaluation, which is the premise towards an image quality assessor developed on this principle. 15.4.1
Image Quality and Joint Spatial/Spatial-Frequency Representations
The Wigner-Ville distribution (see Section 2.1.4) has been proved to be a powerful tool for analyzing the time-frequency distribution of nonstationary signals. WVDs were successfully applied in capturing essential nonstationary image structures [1-3]. In this context, the properties of joint spatial/spatial-frequency representations of images led to other applications of WVD to image processing [3, 4]. This article aims to present the WVD from the point of view of image analysis, that is, to emphasis the desired properties and implementation considerations, and to present a specific application, namely analyzing and tracking image distortions for computing an image quality measure. While the WVD is widely used in applications involving 1D signals, the extension to multidimensional signals, in particular to 2D images, has not reached a similar development. With the increasing use of digital video compression and transmission systems image quality assessment has become a crucial issue. In the last decade, there have been proposed numerous methods for image distortion evaluation inspired from the findings on Human Visual System (HVS) mechanisms [5]. In the vision research community, it is generally acknowledged that the early visual processing stages involve the creation of a joint spatial/spatial-frequency representation [6]. This motivates the use of the Wigner-Ville distribution as a tool for analyzing the effects induced by applying a distortion to a given image. The simple Wigner-based distortion measure introduced in this article does not take into account the masking effect. This factor needs to be introduced in future research. The qualitative correlation of the WVD-based measure with subjective human evaluation is illustrated through experimental results. This measure could be used for image quality assessment, or as a criterion for image coder optimization, or for bilinear problems encountered in computer vision, or for image segmentation. ~ A. Beghdadi, L2TI-Institute Galilee, Universit~ Paris 13, FR-93430 Villetaneuse, France (beghdadi@12ti.univ-parisl3.fr), and R. Iordache, Signal Processing Laboratory, Tampete University of Technology, P.O. Box 553, FIN-33101 Tampere, Finland (riordache@noos.fr). Reviewers: B. Boashash, H. Wechsler and K. Abed-Meraim.
Chapter 15:
652
15.4.2
Time-FrequencyDiagnosis and Monitoring
Continuous 2D Wigner-Ville Distribution
The 2D Wigner-Ville distribution (WVD) of a 2D image
f(x, y)
is defined as [41:
W f ( x ' y ' u ' v ) - j f R2 f(x+2'y+-~) f*(x-2'Y--~) e-j2~(`~u+zv)dad~
(15.4.1)
where x and y are the spatial coordinates, u and v are the spatial frequencies, and the asterisk denotes complex conjugation. Historically, the function introduced in (15.4.1) is called the "Wigner distribution" of f, while the "Wigner distribution" of an analytic image associated to f is referred to as the "Wigner-Ville distribution" of f (see Chapter 2). For simplicity, in this article only "Wigner-Ville distribution" appellation is used, as defined in (15.4.1). To the 2D image f(x, y) corresponds a Wigner-Ville distribution W/(x, y, u, v), that provides a spatial/spatial-frequency representation of the image. To any point (x, y) of the image is assigned a 2D spatial-frequency spectrum. Applying Parseval's formula, the WVD can be written in terms of Fourier transform of f(x, y), F(u, v), as
(15.4.2) The image can be reconstructed up to a sign ambiguity from its WVD: f (x, y) f* (0, O) -
2 W/
u, v
dx dy .
(15.4.3)
Among the properties of 2D Wigner-Ville distribution, the most important for image processing applications is that it is always a real-valued function and, at the same time, contains the phase information. The 2D Wigner-Ville distribution has many interesting properties related to translation, modulation, scaling, and convolution. For an in-depth description the reader is referred to [4]. In the sequel, the properties relevant for image analysis tasks are reviewed. The Wigner-Ville distribution localizes the spatial/spatial-frequency structures of f(x, y). For instance, if the signal energy is concentrated in space around (Xo, y0) and in frequency around (uo, v0), then the energy of W/(x, y, u, v) is centered at (Xo, Y0, u0, v0) and has the same spatial and frequency spread as f(x, y). Actually, it can be shown that the spatial-frequency support of W/(x,y, u,v), is included in the support interval of F(u, v), for all (x, y), and that the spatial support of Wf(x, y, u, v) is included in the support interval of f(x, y) for all (u, v). The local power at a fixed position (x0, Y0) is equal to the marginal integral of the WVD over the spatial-frequency domain at the considered spatial position, while the spectral energy density at the spectral point (u0, v0) is equal to the marginal
Image Distortion Analysis using the Wigner-Wile Distribution
653
integral of the WVD over the spatial domain at the considered spectral point:
If(xo,yo)l 2 - L~ IF(uo,vo)[
2
-
s
WI (Xo, Yo, u, v) dudv
(15.4.4)
WI (x, y, uo, vo) dx dy .
(15.4.5)
As the result of these properties, the Wigner-Ville distribution is often thought as the image energy distribution in the joint spatial/spatial-frequency domain, which strongly encourages the use of 2D WVD in image analysis applications where the spatial/spatial-frequency features of images are of interest. The main problem of the WVD is the so-called cross-term interference (see Article 4.2 for the one-dimensional case). Due to its bilinearity, the Wigner-Ville distribution of the sum of two images fl and f2 introduces an interference term, usually regarded as undesirable artifacts in image analysis applications:
WI, +f: (x, y, u, v) = WI~ (x, y, u, v) + WI~ (x, y, u, v) + 2Real [Wf~,f~ (x, y, u, v)] (15.4.6) where the interference term W f l , f 2 is the cross-WVD of fl and f2:
fl x + ~ (15.4.7) The interference terms are highly oscillating and their magnitude can be two times larger than that of the auto-terms. The artifacts introduced by the interference can be classified into two categories [2]: (a) interference terms due to the interaction between the conjugate symmetric spectral components of real images, and (b) interference terms due to the interaction between any two spectral components of the image (real or complex) at different spatial frequencies. Obviously, the WVD of any real image suffers from the first type of artifacts, due to its conjugate symmetric spectrum. Moreover, if the real image is multicomponent, which is usually the case in most of the applications, both kinds of artifacts are present, making the spatial/spatial-frequency representation difficult to interpret. The following subsection presents a concept that may be used to eliminate the first type of interferences and to reduce the second.
15.4.2.1 Analytic Image If the spectrum of an image contains only positive (or only negative) frequency components, there is obviously no interference between the positive and negative frequencies in its WVD. The analytic image has such a spectral property, therefore a cleaner spatial/spatial-frequency representation of a real image, f ( x , y), may be obtained by computing the WVD of an analytic image, z(x, y) associated to it.
Chapter 15:
654
Time-Frequency Diagnosis and Monitoring
The analytic image is the extension of 1D analytic signal to 2D. Contrary to the 1D analytic signal, the analytic image is not unique. For a reliable spatial/spatialfrequency representation of the real image, the analytic image should be chosen so that: (a) the useful information from the 2D WVD of the real signal are found in the 2D WVD of the analytic image, and (b) the 2D WVD of the analytic image minimize the interference effect. As in 1D case where the analytic image is formed by suppressing the negative frequencies, the 2D analytic image may be constructed by setting to zero one half-plane of the spatial-frequency domain. By introducing a reference direction (cos 0, sin 0), a pair of frequency (u, v) is called positive if u cos0 + v sin 8 > 0, and is called negative if u cos 0 + v sin 0 < 0. According to this convention, the analytic signal is straightforward defined in the frequency domain. The Fourier transform, Z(e)(u, v), of the analytic image with respect to direction (cos0, sin 0), z(e)(x, y), corresponding to the real image f is [7]:
z(O)(u, v) = F(u, v) (1 + sign(u cos 0 + v sinO)).
(15.4.8)
The choice of 0 depends on the knowledge on the spectral characteristics of the real image. Unfortunately, Wz(o) contains interference cross-terms between the spectral components of the two quadrants in the positive frequency half-plane. To eliminate these interference cross-terms a second definition of the analytic image seems appropriate, as a complex signal with single-quadrant spectrum [8]. To avoid any loss of information, two analytic images are associated to the real image. The Fourier transforms Z1 (u, v) and Z2(u, v), of the analytic images with the single-quadrant spectra, zl (x, y) and z2(x, y), corresponding to a real image f are defined as:
Z1 (u, v) = F(u, v) (1 + sign(u)) (1 + sign(v)) Z2(u, v) = F(u, v) (1 + sign(u)) (1 - sign(v)).
(15.4.9) (15.4.10)
Using zl and Z2, the interference between frequencies from different quadrants is eliminated for any spectral distribution of the real signal. Wzl reflects the spatial/spatial-frequency distribution of f for uv > O, and Wz2 for uv < O.
15.4.2.2
Continuous 2D Windowed Wigner-Ville Distribution
In practical applications, the images are of finite support. Therefore it is appropriate to apply Wigner analysis to a windowed version of the infinite support images. As in the 1D case [9], the 2D windowed Wigner-Ville distribution is constructed as follows: To compute the windowed WVD at a spatial location (x0, Y0), a windowed version of f is considered, where the window w(x, y) is centered on (x0, Y0): f
(z, y; zo, yo) = f ( x , y)w(
-
y - yo).
(15.4.11)
Image Distortion Analysis using the Wigner-Ville Distribution
655
Then the windowed WVD at (Xo, Yo) is defined as the WVD of f~ at x = x0, y = yo: ~) (x, y, u, v) =
w -~, ~ w*
a~,
/~2
2
x f
(x
o y + -~) . f( + ~,
)
(15.4.12)
o y - -~ e-J2'~(~u+~V) dc~ dZ . x - ~,
The 2D windowed WVD can be shown to represent the convolution with respect to the spatial frequencies of WI and W~:
W~W) (x' y' u' v) - JfR2 W f ( x ' Y'~' ~)Ww(x' y' u - ~' v - ~)d~drl "
(15.4.13)
The effect of the windowing is to smear the WVD representation in the frequency plane only, so that the frequency resolution is decreased but the spatial resolution is unchanged.
15.4.3
Discrete 2D Wigner-Ville Distribution
In this article the discrete version of (15.4.1) is not discussed, as it does not provide any relevant information from the image analysis viewpoint. In this respect, it is more interesting to take into account the discrete version of the windowed WVD (15.4.13). For the sake of simplicity, square shaped images and analysis windows are considered. Let f ( m , n) be the discrete image obtained by sampling f(x, y), adopting the convention that the sampling period is normalized to unity in both directions. The following notation is made:
K(m,n,r,s)
= w(r,s)w*(-r,-s)f(m+r,n+s)f*(m-r,n-s).
(15.4.14)
The discrete space equivalent of the windowed Wigner-Ville distribution (15.4.13) is given by: L
W (w)(m, n, u, v) -
4 E
L
E
K ( m , n, r, s)e -j47r(ru+sv)
(15.4.15)
r=-L s=-L
where w(r, s) = 0, for ]r], > L C Z +. The properties of the discrete space WVD are similar to the continuous WVD, except for the periodicity in the frequency variables, which is one-half the sampling frequency in each direction: (15.4.16) Therefore, if f ( x , y) is a real image, it should be sampled at twice the Nyquist rate to avoid aliasing effects in WL, ' (m, n, u, v).
Chapter 15:
656
Time-FrequencyDiagnosis and Monitoring
There are three practical solutions to avoid aliasing, when the original continuous image is not available: (a) to filter out the frequency regions that cause aliasing (lowpass filtering), (b) to up-sample the image by a factor of 2 in both directions and then appropriately band-limit the image using an anti-aliasing filter [1], and (c) to compute the Wigner-Ville distribution of one or more analytic images associated to the real image, that eliminate or at least reduce the aliasing [2, 10]. To have a discrete spatial-frequency representation of the Wigner-Ville distribution, the frequency variables should be sampled. The 2D discrete windowed Wigner-Ville distribution of an image f(n, m), (n, m) c Z 2 is defined as"
Wr
L
L
Up, Vq) - 4 E
E
K(m'n'r's)WnP+Sq
(15.4.17)
r=-L s=-L
where N = (2L + 2), W4 = e -j47r/N, and the normalized spatial-frequency pair is (Up, Vq) = (p/N, q/N). By making a periodic extension of the kernel K(m, n, r, s), for fixed (re, n), (15.4.17) can be transformed to match the standard form of a 2D DFT, except that the twiddle factor is W4 instead of 1412 (see [9] for additional details for 1D case; the 2D construction is a direct extension). Thus standard FFT algorithms can be used to calculate the discrete W} w). The additional power of two represents a scaling along the frequency axes, and can be neglected in the calculations. As the real-scene images have rich frequency content, the interference crossterms may mask the useful components contribution. Therefore a commonly used method to reduce the interference in image analysis applications is to smooth the 2D discrete windowed WVD in the spatial domain using a smoothing window h(m, n). The price to pay is the spatial resolution reduction. The result is the so-called 2D discrete pseudo-Wigner distribution (PWD) [1], which is obtained from the windowed WVD: ,
/
M
P W f ( m , n , Up, V q ) -
M
(15.4.18)
E E k=-M g=-M
In the case of a symmetric frequency window, i.e. can be defined by: L
L
PWi(m, n, up, Vq) - 4 E
E
w(r, s) - w(-r,-s),
[w(r,s)12W~p+~q
the PWD
(15.4.19)
r=-L s---L M
x
E
M
E
h(k,g)f(m+k+r,n+g+s)f*(m+k-r,n+g-s).
k=-M g=-M
A very important aspect to take into account when using PWD is the choice of the two windows, w(r, s) and h(k, g). The size of the first window, w(r, s), is dictated
Image Distortion Analysis using the Wigner-Ville Distribution
657
Fig. 15.4.1: Examples of discrete analytic images. The spatial frequency is normalized with respect to the sampling frequency. If the case, the frequency supports of different images are marked by different motives. (a) Analytic image used in [2], which is the discrete version of (15.4.8) for 0 = 0. (b) The discrete analytic image pair proposed in [10]. (c) Single-quadrant discrete analytic image pair, which are the discrete version of (15.4.9) and (15.4.10).
by the resolution required in the spatial-frequency domain. The spectral shape of the window should be an approximation of the delta function that optimizes the compromise between the central lobe width and the side lobes height. A window that complies with these demands is the 2D extension of Kaiser window, which was used in [1]. The role of the second window, h(k, ~), is to allow spatial averaging. Its size determines the degree of smoothing. The larger the size is, lower the spatial resolution becomes. The common choice for this window is the rectangular window.
15.4.3.1
Choice of Analytic Image in Computing Discrete Wigner-Ville Distributions
In the discrete case, there is an additional specific requirement when choosing the analytic image: the elimination of the aliasing effect. Taking into account that all the information of the real image must by preserved in the analytic image, only one analytic image cannot fulfill both requirements. Therefore, either one analytic image is used and some aliasing is allowed, or more analytic images are employed which satisfy two restrictions: (a) the real image can be perfectly reconstructed from the analytic images, and (b) each analytic image is alias free with respect to WVD. The discrete version of the half-plane analytic image (15.4.8) was used for texture segmentation in [2]. While preserving all the information of the real image, it fails to produce a substantial reduction of the aliasing, as it does not prevent frequency fold-over along one direction (Fig. 15.4.1(a)). To avoid aliasing, a solution is to use two analytic images, obtained by splitting the region of support of the half-plane analytic image into two equal area subregions. This is the procedure presented in [10], where one analytic image contains the low positive frequencies and the other the high positive frequencies (Fig. 15.4.1(b)).
Chapter 15:
658
(d)
(e)
Time-FrequencyDiagnosis and Monitoring
(f)
Fig. 15.4.2: Full-domain WVD computation using a single-quadrant analytic image pair. (a) Spectrum of the real image. (b) Spectrum of the upper-right quadrant analytic image (see (15.4.9)). (c) Spectrum of the lower-right quadrant analytic image (see (15.4.10)). (d) Spatial-frequency support of WVD of (b). (e) Spatial-frequency support of WVD of (c). (f) Spatial-frequency support of the full-domain WVD obtained from (d) and (e).
Although this method requires the computation of two WVD, no aliasing artifacts appear and the WVD of the analytic images can be combined to produce a spatial/spatial-frequency representation of the real image having the same frequency resolution and support as the original real image. This approach was successfully applied in texture analysis and segmentation in [3]. In general, the energy of natural images is concentrated at the low frequencies, so most probable the low frequency analytic image contains most of the energy of the real image, and, therefore, its WVD would exhibit strong cross-term interferences. A more equilibrated energy sharing is done when discrete single-quadrant analytic images are used (Fig. 15.4.1(c)). They are the discrete version of (15.4.9) and (15.4.10). As the positive frequency domain is split along v-axis, the interquadrant cross-terms are eliminated. As in [10], a full-domain PWD of the real image f(m, n), FPW:f(m, n, Up, Vq), can be constructed from PWzl (m, n, Up, Vq) and PWz2 (m, n, Up, vq). In the spatialfrequency domain, the full-domain PWD is, by definition, of periodicity 1 and symmetric with respect to the origin, as the WVD of a real image. It is completely
Image Distortion Analysis using the Wigner-Ville Distribution
specified by: FPW~(m.~.~,.~)
-
F P W f (m, n, up, O) -
{
659
PW~.(.~.~.~..v~).
0 <_ u . <
i . 0 < v~ < ~1 1
P Wz2 (m, n, 'ltp, Vq), 0 ~ Up < "~, 0 > Vq ~
2
(15.4.20) 1
PWz, (m, n, up, O) + PWz~ (m, n, up, O), 0 < Up < -~
(15.4.21)
F P W I ( m , n Up, Vq) '
F P W f ( m , n,-Up,-Vq)
0 > Up, Vq > - '
-
1
(15.4.22)
2
F P W f ( m , n , up + k, vq + l) = F P W f ( m , n , up, Vq), Vk, l,p,q e Z. (15.4.23) Fig. 15.4.2 illustrates the construction of the full-domain PWD from the PWD of the single-quadrant analytic images. The same shading indicates identical regions. Letters are used to follow the mapping of frequency regions of the real image; for instance, the region labeled A in (f) represents the mapping of the region A in the real image spectrum (a) on the spatial-frequency domain of the full-domain PWD. In conclusion, the full-domain PWD provides information un-altered by aliasing artifacts on the spatial/spatial-frequency distribution of the real image over the entire frequency spectrum. A potential drawback of these approaches is that the additional sharp filtering boundaries may introduce ringing effects.
15.4.4
An Image Dissimilarity Measure based on the 2D Wigner-Ville Distribution
Structured distortions affecting an image, which are more annoying than the unstructured distortions, are usually highly concentrated in the spatial/spatialfrequency domain. Between two distortions with the same energy, i.e. same signalto-noise ratio (SNR), the more disturbing is the one having a peaked energy distribution in spatial/spatial-frequency plane. In terms of the effect on the WVD, the noise added to an image influences not only the coefficients in the positions where the noise has non-zero WVD coefficients, but also induces cross-interference terms (see (15.4.6)). Stronger the noise WVD coefficients are, more important the differences between the noisy image WVD and original image WVD become. WVD-based SNR of a distorted version g(m, n) of the original discrete image f ( m , n) is defined as: E m E n maXp,q (IFPWI(m, n, up, vq)l)
SNR W -
10 log~o E~n
En maXp,q (IFPWI(m, n, up, Vq) -
FPWg(m, n, up, ca)l) (15.4.24)
For each position (m, n) the highest energy WVD component is retained, as if the contribution of the other components are masked by it. Of course, the masking mechanisms are much more complex, but this coarse approximation leads to results which are more correlated to the HVS perception than SNR.
Chapter 15:
660
Time-FrequencyDiagnosis and Monitoring
The use of maximum difference power spectrum as a nonlinearity transformation is motivated and inspired by some findings on nonlinearities in the HVS. Similar transformations have been successfully used to model intra-cortical inhibition in the primary visual cortex in an HVS-based method for texture discrimination [11]. Let rh and r/2 be two degradations having the same energy. The first, 7/1 is additive white Gaussian noise, and the second, r12 is an interference pattern. While the energy of the noise is evenly spread in the spatial/spatial-frequency plane, the energy of the structured degradation is concentrated in the frequency band of the interference. Thus the WVD of r/2 contains terms which have absolute values larger than any term of WVD of rh, as the two degradations have the same energy. These peak terms induce larger local differences between WVD of g2 = f + r/2 and WVD of f, which are captured by 'max' operation in the denominator of (15.4.24) and lead to a smaller SNR W for g2.
15.4.4.1
Results and Discussion
To show the interest of the proposed image distortion measure as compared to the signal-to-noise ratio (SNR), three types of degradation are considered: white noise, interference pattern, JPEG coding (Fig. 15.4.3). They induce nearly the same SNR, whereas visual comparison clearly reveals different perceptual quality. The WVD-based distortion measure is proven to correlate with subjective quality evaluation done by five non-expert observers, which prefer the white noise distorted image to the interference perturbed image and to the JPEG coded image. The reason is that for random degradation the noise has the same effect in the entire spatial-frequency plane. Therefore, the maximum spectral difference at almost any spatial position is lower than the just noticeable perceptual difference. On the other hand, when the distortion is localized (as interference patterns or distortion induced by JPEG coding), the maximum spectral difference corresponding to an important proportion of the pixels has a significant value, much larger than the just noticeable perceptual difference.
15.4.5
Summary and Conclusions
This article considers the 2D WVD in the framework of image analysis. The advantages and drawbacks of this spatial/spatial-frequency analysis tool are recalled in the light of some pioneer and recent works in this field. The usefulness of the WVD in image analysis is demonstrated by considering a particular application, namely distortion analysis. In this respect, a new image distortion measure is defined. It is calculated using the spatial/spatial-frequency representation of images obtained using the 2D WVD. The efficiency of this measure is validated through experiments and informal visual quality assessment tests. It is shown that this measure represents a promising tool for objective measure of image quality, although the masking mechanisms are neglected. To improve the reliability and the performance of the proposed method, a refinement to include a masking model is imperatively needed.
Image Distortion Analysis using the Wigner-Ville Distribution
661
F i g . 15.4.3: Distorted versions of 256 • 256 pixel Parrot image, f : gi is obtained by adding white Gaussian noise on f; g2 is a JPEG reconstruction of f, with a quality factor of 88; g3 is the result of imposing a grid-like interference over f . The SNR and SNR w values are given in dB.
It can be concluded that, taking into consideration some basic, well-established knowledge on the HVS (the joint spatial/spatial-frequency representation, and nonlinear inhibition models), one can develop a simple image distortion measure correlated with the perceptual evaluation. Further details on the use of the Wigner-Ville distribution for image analysis and processing can be found in [12-14].
References [1] T. R. Reed and H. Wechsler, "Segmentation of textured images and Gestalt organization using spatial/spatial-frequency representations," IEEE Trans. Pattern Analysis
662
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
Machine Intelligence, vol. 12, pp. 1-12, January 1990. [2] Y. M. Zhu, R. Goutte, and M. Amiel, "On the use of two-dimensional Wigner-Ville distribution for texture segmentation," Signal Processing, vol. 30, pp. 329-353, February 1993. [3] G. Cristdbal and J. Hormigo, "Texture segmentation through eigen-analysis of the Pseudo-Wigner distribution," Pattern Recognition Letters, vol. 20, pp. 337-345, 1999. [4] L. Jacobson and H. Wechsler, "Joint spatial/spatial-frequency representation," Signal Processing, vol. 14, pp. 37-68, 1988. [5] C. J. van den Branden Lambrecht, ed., Signal Processing, vol. 70, no. 3. EURASIP/Elsevier, 30 November 1998. Special issue on image quality assessment. [6] M. A. Garcfa-P~rez and V. Sierra-Vs "Visual processing in the joint spatial/spatial-frequency domain," in Vision Models for Target Detection and Recognition: In Memory of Arthur Menendez (E. Peli, ed.), ch. 2, pp. 16-62, World Scientific Publishing, 1995. [7] T. Billow and G. Sommer, "A novel approach to the 2D analytic signal," in Proc. 8th Internat. Conf. on Computer Analysis of Images ~ Patterns (CAIP'99), Ljubljana, Slovenia, 1-3 September 1999 (F. Solina and A. Leonardis, eds.), no. 1689 in LECTURE NOTES IN COMPUTER SCIENCE, pp. 25-32, Springer, 1999. [8] S. L. Hahn, "Multidimensional complex signals with single-orthant spectra," Proc. IEEE, vol. 80, pp. 1287-1300, August 1992. [9] B. Boashash and P. J. Black, "An efficient real-time implementation of the Wigner-Ville distribution," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 35, pp. 1611-1618, November 1987. [10] J. Hormigo and G. Cristdbal, "High resolution spectral analysis of images using the pseudo-Wigner distribution," IEEE Trans. Signal Processing, vol. 46, pp. 1757-1763, June 1998. [11] J. Malik and P. Perona, "Preattentive texture discrimination with early vision mechanisms," J. Optical Soc. of America A, vol. 7, pp. 923-932, 1990. [12] H. O. Bartelt, K. H. Brenner, and A. W. Lohmann, "The Wigner distribution function and its optical production," Optics Communications, vol. 32, pp. 32-38, January 1980. [13] G. Cristdbal, J. Bescds, and J. Santamarfa, "Image analysis through the Wigner distribution function," Applied Optics, vol. 28, pp. 262-271, January 1989. [14] H. Suzuki and F. Kobayashi, "A method of two-dimensional spectral analysis using the Wigner distribution," Electronics ~ Communications in Japan, Part III: Fundamental Electronic Science, vol. 75, pp. 1006-1013, January 1992.
Time-Frequency Detection of EEG Abnormalities
15.5
663
T I M E - F R E Q U E N C Y D E T E C T I O N OF EEG A B N O R M A L I T I E S 0
This article presents an example of time-frequency methodology used for the detection of seizures in recorded EEG signals. The techniques used are adapted to the case of newborn EEGs, which exhibit some well defined features in the timefrequency domain that allow an efficient discrimination between abnormal EEGs and background.
15.5.1
EEG Abnormalities and Time-Frequency Processing
Neonatal seizures are usually the first signs of neurological abnormalities and can lead to permanent brain damage or even fatalities if not detected at the early stages. There are a number of disturbances underlying the seizure rather than a single identifiable cause making the identification process difficult. The problem in newborn is harder than in adults because the more obvious clinical symptoms such as muscle spasms, sporadic eye movements and drooling are often difficult to detect [1]. For this reason, Electroencephalogram (EEG) is practically the only tool available in seizure detection and characterization in newborns. Three major approaches have been used to detect seizures in newborns based on the assumption that the EEG signals are stationary or at least locally stationary [1,2]. However, a detailed examination of these signals shows that EEG signals exhibit significant non-stationary and multi-component features [see Fig. 15.5.1(a)]. making these three methods essentially invalid and at best only an approximation. This explains the relatively poor performance of these methods [2]. The non-stationarity and multicomponent nature of the EEG signal suggested the use of time-frequency (TF) signal processing to analyze and characterize the different newborn EEG patterns for developing a time-frequency seizure detection and classification [1,3].
15.5.2
EEG Seizures in Newborns
A seizure is defined to occur when there is an excessive synchronous discharge of neurons within the central nervous system. Its manifestation in the EEG, known as electrographic seizure, consists of a paroxysmal events which are trains of rhythmic repetitive sharp waves that emerge more or less abruptly from the ongoing background activities and have a distinct beginning and end. They may start with low voltages that increase usually as the discharge progresses. They often contain subharmonics and may have polyphasic contours or be sinusoidal. These discharges pattern can be divided into four categories: focal spike and sharp waves (> 2Hz), local low frequency discharges (around 1Hz), focal rhythmic discharge (0.5 H z - 15 Hz), and multifocal patterns (EEG discharge originating from two or more loci) [1]. ~ Boualem B o a s h a s h and M o s t e f a M e s b a h , Signal Processing Research Centre, Queensland University of Technology, Brisbane, Australia (b.boashash@qut.edu.au, m.mesbah@qut.edu.au), and Paul Colditz, Perinatal Research Centre, Royal Women's Hospital, Brisbane, Australia (p.colditz@uq.edu.au). Reviewers: V. Chandran and A. Belouchrani.
Chapter 15:
664
Time-FrequencyDiagnosis and Monitoring
The duration of rhythmic discharges is highly variable, from as short as 1 second to as long as 30 minutes. This fact contributed to the disagreement between the researchers about what constitutes a seizure. In order to consider an EEG discharge as a seizure, some researchers require that it must last at least 10 seconds, others require a minimum of 20 seconds, while a third group does not specify a time limit. Seizure patterns are occasionally corrupted by artifacts and some abnormal background patterns such as burst suppression (BS). The most noticeable artifacts are the ones caused by the heartbeat (ECG), the eye movement (EOG) and head and body movements (EMG)[4].
15.5.3
Data Acquisition
Electrical signals produced in the brain can be monitored in a non-invasive manner by measuring variations in potential on the scalp. This EEG measurement is achieved by strategically placing several small electrodes on the scalp. One electrode, usually at the base of the skull, acts as a reference (ground) signal, and various channels of data are created by measuring the voltage differences between neighboring electrodes. Five channels of EEG have been recorded in each session using the 10-20 International System of Electrode Placement. The EEG data has been recorded using a sampling frequency of 256 Hz. For artifact detection, three auxiliary signals representing electro-oculogram (EOG), electrocardiogram (ECG), and respiration are also recorded. Data used has been collected at the Royal Women's Hospital Perinatal Intensive Care Unit in Brisbane, Australia. The EEG signals containing seizures were obtained from two different newborn babies that have been clinically identified to have seizures. The gestational ages of the babies were 35 weeks and 40 weeks and 3 days. The recording lasted 137 minutes and 23 minutes respectively.
15.5.4
Selection of a Time-Frequency Distribution
The following characteristics were found to be typical of neonatal EEG signals [1]: non-stationary, occasionally multicomponent, low frequency signals in the range 0 to 5 Hz. These factors must be considered when selecting an optimal time-frequency distribution (TFD), as each TFD is more suited to representing signals with particular characteristics (see Chapter 3). Since neonatal EEG signals are non-stationary and occasionally multicomponent, a desirable time-frequency distribution should have a good spectral resolution and reduced cross-terms. The performance and characteristics of several TFDs were compared to find an optimal representation of real neonatal EEG data in the T F domain. The scope of this comparison study has encompassed seven TFDs [1]. Each TFD has been applied to epochs of real neonatal EEGs for various data window lengths and individual TFD parameter values. The performances were compared visually and using an objective quantitative measure criterion (see Article 7.4). Based on this criterion, the B-distribution (BD) with the smoothing parameter ~ equals to 0.01 has been selected as the most suitable representation of
Time-Frequency Detection of EEG Abnormalities
665
the EEG signals in the T F domain. The B-distribution is defined in terms of its time-lag kernel (see chapters 2 and 3) and may be expressed as
pz(t, f) -
I_']L(
cosh2(t)
z(t + 7/2)z* ( t - ~-/2)e -j2~/~ dud~'.
The parameter fl (0 < fl < 1) controls the sharpness of the cut-off of the twodimensional filter in the ambiguity domain. Hence, the EEG signals were represented in time frequency using the B-distribution with a smoothing parameter of 0.01, a window length of 127 samples, and a time resolution of 5 samples. The data has been resampled to 20 Hz for better representation of low frequency regions. The time-frequency analysis was performed using the commercial TFSA 5.2 MATLAB TM toolbox (http://www.sprc.qut.edu.au//tfsa/~index.html).
15.5.5
EEG Pattern Analysis
The visual analysis of the time-frequency EEG data led to divide the time-frequency EEG patterns into two classes: seizure and background. The seizure patterns can be characterized in the time-frequency domain by a main ridge (component) as either a linear FM law or a piecewise linear FM while the background patterns exhibit a low frequency burst activities or irregular activities with no clearly defined patterns [1]. These observations correlate well with clinical information related to EEGs [5]. Representative T F representations of each of the subclasses are detailed below.
15.5.6 15.5.6.1
Analysis of Time-Frequency Seizure Patterns Linear FM (LFM) Patterns
The EEG seizures analyzed in the T F domain that can be approximated by linear FMs with either fixed or time-varying amplitudes can be classified into the following sub-classes: L F M P a t t e r n s w i t h a Q u a s i - C o n s t a n t F r e q u e n c y : Fig. 15.5.1(b)shows a seizure that has a linear FM behavior with an almost constant frequency. The amplitude of the time-frequency seizure pattern increases at the onset and decreases toward the end. A major advantage of the T F representation is that we can easily distinguish the seizure from other phenomena such as burst activities as long as they occupy different T F regions. These unwanted signals can be removed from the EEG signal using a well designed T F filter without affecting much the seizure signal. L F M P a t t e r n s w i t h a D e c r e a s i n g F r e q u e n c y : Fig. 15.5.1(c) of this class differs from the one above by the fact that its frequency decreases with time [5]. By looking at the T F behavior of the seizure, we can easily deduce the precise non-stationary character of the seizure. The classical detection methods based on the stationarity assumption will most likely miss these patterns.
Chapter 15:
666
15.5.6.2
Time-FrequencyDiagnosis and Monitoring
Piecewise LFM Patterns
Most of the patterns analyzed so far can be approximated to a good degree of accuracy by piecewise linear FM as shown in Fig. 15.5.1(d). These types of seizures usually comprises the different stages of the seizure [4].
15.5.6.3
EEG Background Patterns
By background, we mean any signal that is not classified as seizure. Two distinct patterns have been noticed: Burst of activity and an irregular activity with no clear pattern. B u r s t of A c t i v i t y : Fig. 15.5.1(e) is an example of this class characterized by a burst of activity. These are a short period signals with a high energy lasting for few seconds and usually occurring at frequencies below 4 Hz. These features are characteristic of burst suppression. Burst suppression is defined as burst of high voltage activity lasting 1-10 seconds and composed of various patterns (delta (0 4 Hz) and theta (4 - 8 Hz) with superimposed and intermixed spikes, sharp waves, and faster activity) followed by a marked background attenuation [5].
Activity Lacking a Specific P a t t e r n : Fig. 15.5.1(f) is an example of an EEG epoch lacking a well-defined and consistent pattern. These type of activities are not constrained within the low frequency bands characterizing the EEG seizure. This time-frequency analysis indicates that a linear or piecewise linear instantaneous frequency (IF), obtained by taking the peak of the main component of a TFD, can be used as a critical feature of EEG seizure characteristics. These findings suggested to propose a TF-based seizure detector. This detector, called T F matched detector, performs a two dimensional correlation between the EEG signal and a reference template selected as a model in TF domain of the EEG seizure.
15.5.7
Time-Frequency Matched Detector
The matched filter is the simplest approach for constructing detectors and classifiers. It essentially reduces to a correlator receiver whose output is compared to a threshold. The threshold is chosen such that the probability of a false alarm is kept constant. The correlator receiver is implemented in time domain as a onedimensional correlation between the received noisy signal x(t) and a reference signal s(t) or using the corresponding spectral representations. To extend this detector to handle nonstationary signals, the one-dimensional correlation is replaced by a twodimensional correlation involving the TED p(t, f) of z(t) and s(t). The resulting test statistic is given by:
T(x)
] ] px(t, f)ps(t, f) dt df
(15.5.2)
This type of detector has been implemented using different quadratic time-frequency distributions such as the spectrogram [6], the Wigner-Ville, and cross Wigner-Ville
Time-Frequency Detection of EEG Abnormalities
Fig. 15.5.1: B-distributions of EEG signals.
667
668
Chapter 15:
Time-Frequency'Diagnosis and Monitoring
distributions [7] and the auto- and cross-ambiguity functions [8]. Using Moyal's formula, we get:
// px(t, f)ps(t, f) dt df =
z(t)s* (t)dt
This equality is only valid when the kernel filter is unimodular; that is its absolute value is equal to one all over the ambiguity domain. This is the case, for example, fbr the Wigner-Ville distribution and Rihaczek distribution [9]. Equation (15.5.3) is an alternative interpretation of the correlator receiver in terms of a correlation of the TFDs. Even though the B-distribution does not verify exactly Eq. (15.5.3), it has been used as the basis for the TF matched detector because of its superiority over the other TFDs in representing EEG signals as discussed in Section 15.5.4 (see also Articles 5.7 and 7.4). For the case of a deterministic signal in additive noise (even white Gaussian noise) the TF-based correlator is suboptimal due to the nonlinearity of the quadratic TFDs which accentuates the effects of noise by introducing artifacts. To use a correlator receiver, it is usually required that the wave shape of the reference signal (or other related information such as its TFD) as well as the noise statistics are known. Section 15.5.6 indicated that the EEG seizure could be characterized by a linear or a piecewise linear FM. To construct a TF-based matched detector, a representative TFD of a linear or piecewise linear FM, pref(t, f), is selected to serve as template (reference). The correlator statistic T(x) used is the two dimensional cross-correlation between the EEG signal TFD and the reference signal TFD, i.e.:
/ / pref(t, f)p*(t, f)dt df
(15.5.4)
where z is the analytic signal corresponding to the EEG signal under consideration.
15.5.7.1
Implementation of the Time-Frequency Matched Detector
The implementation of the TF matched detector and a description of its main components are described below. More details may be found in [10]. P r e p r o c e s s i n g : This stage includes artifact (such as ECG, EOG, and EMG) removal, noise filtering, and resampling the signal to comply with detector input specifications. A low pass filter along with an artifact removal algorithm using adaptive signal processing techniques were implemented for this purpose [4]. Signal R e s t r u c t u r i n g : EEG is segmented into an array of signals of fixed length (2 minutes) to be suitable for performing the cross-correlation. Shorter signal lengths led to higher rates of miss detections and false alarms. Once the full input EEG signal is divided into blocks of 2 minutes duration, each block is stored as a row of the newly formed array of signals. A protocol of 50% overlap of each block was adopted.
Time-Frequency Detection of EEG Abnormalities
669
D e t e c t i o n L o o p : The detection loop is executed until all the blocks of the input EEG signal have been processed. An offset value is maintained, giving a precise location in the original signal where abnormal events are detected. C r o s s - c o r r e l a t i o n : The cross-correlation between the T F array of the EEG signal and the template (mask) is obtained using the two-dimensional cross-correlation function given by Eq. (15.5.4). The most crucial process is the choice of the template, in this case (see Section 15.5.6) the TFD of a linear FM or a piecewise linear FM. The time duration of the FM signal is set to 20 seconds as discussed below. To find the optimum slopes of the FM signal IF, that is the ones that corresponds to the best detection rate, a testing stage is necessary [10]. A similar testing stage is also required to select an optimum threshold that realizes a good compromise between the rate of good detections and the rate of false alarms. A m p l i t u d e a n d L e n g t h C r i t e r i a : Ideally there will be one peak value in the output of the cross-correlation array, with its output amplitude determining the presence or absence of seizure. This proved to be unreliable, and it was decided to perform a search of sequential series of values over the amplitude threshold defined earlier. This proved to be successful, and a minimum ridge length of 20 seconds over the amplitude threshold was classified as a seizure. The 20-second length adopted is larger than the minimum 10-second length of EEG seizure adopted by many neurologists [5]. M a p S e i z u r e D e c i s i o n to R e a l T i m e L o c a t i o n : This stage simply ties all of the independent decisions on each block of processed signal (remapping any seizure decision to a time series function) of equivalent length to the input EEG signal. This output waveform consists of ones or zeros, where one indicates the presence of seizure at the corresponding time. In order to validate and calibrate the detection algorithm, simulated data generated by the EEG model [2]. The model generates an EEG like signal characterized in time frequency by a linear IF with a random slope in the range of [-0.07 0]. These values were reported in [5]. The B-distribution was used to generate the reference template and the T F D of the simulated EEG. The signal used in the construction of the reference template is a linear FM. The average detection obtained was 99.1% while the false alarm rate was 0.4%. These results confirm the validity of the methodology since the template is well adapted to the EEG model.
15.5.8
Summary and Conclusions
The patterns obtained by a TF analysis of newborn EEG seizure signals show a linear FM or piecewise linear FM characteristic. This suggests a method of seizure detection and classification in the TF domain. A TF detector is proposed that involves cross-correlating the TFD of the EEG signal with a template. The design of the template takes into account the TF characteristics of the EEG seizure extracted in the TF domain. The performance of this time-frequency detector was tested on synthetic signals, corresponding to one specific type of seizure pattern (LFM). At
670
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
the time of publication, the methodology was being extended to deal with LFM patterns of varying slopes, and with piecewise linear FM patterns. The procedure will then allow classification within the selected sub-classes. Another time-frequency approach to newborn EEG seizure detection is described in [11].
References [1] B. Boashash and M. Mesbah, "A time-frequency approach for newborn seizure detection," IEEE Engineering in Medicine #A Biology Magazine, vol. 20, pp. 54-64, September/October 2001. [2] M. Mesbah and B. Boashash, "Performance comparison of seizure detection methods using EEG of newborns for implementation of a DSP subsystem," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), Orlando, FL, 13-17 May 2002. Paper no. 1932. [3] B. Boashash, M. Mesbah, and P. Colditz, "Newborn EEG seizure pattern characterisation using time-frequency analysis," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), vol. 2, pp. 1041-1044, Salt Lake City, UT, 7-11 May 2001. [4] P. Celka, B. Boashash, and P. Colditz, "Preprocessing and time-frequency analysis of newborn EEG seizures," IEEE Engineering in Medicine ~ Biology Magazine, vol. 20, pp. 30-39, September/October 2001. [5] C. T. Lombroso, "Neonatal EEG polygraphy in normal and abnormal newborns," in Electroencephalography: Basic Principles, Clinical Applications, and Related Fields (E. Niedermeyer and F. H. Lopes da Silva, eds.), pp. 803-875, Baltimore, MD: Williams & Wilkins, 3rd ed., 1993. [6] R. A. Altes, "Detection, estimation and classification with spectrograms," J. Acoustical Soc. of America, vol. 67, pp. 1232-1246, April 1980. [7] B. Boashash and P. J. O'Shea, "A methodology for detection and classification of some underwater acoustic signals using time-frequency analysis techniques," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 38, pp. 1829-1841, November 1990. [8] S. Pon Varma, A. Papandreou-Suppappola, and S. B. Suppappola, "Detecting faults in structures using time-frequency techniques," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), vol. 6, pp. 3593-3596, Salt Lake City, UT, 7-11 May 2001. [9] P. Flandrin, "A time-frequency formulation of optimum detection," IEEE Trans. Acoustics, Speech, ~ Signal Processing, vol. 36, pp. 1377-1384, September 1988. [10] B. Boashash and M. Mesbah, "Time-frequency methodology for newborn EEG seizure detection," in Applications in Time-Frequency Signal Processing (A. PapandreouSuppappola, ed.), ch. 9, Boca Raton, FL: CRC Press, 2002. [11] H. Hassanpour, M. Mesbah, and B. Boashash, "Comparative performance of timefrequency based newborn EEG seizure detection using spike signatures," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'03), vol. 2, pp. 389392, Hong Kong, scheduled 6-10 April 2003.
671
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
15.6 15.6.1
TIME-FREQUENCY BASED M A C H I N E M O N I T O R I N G A N D FAULT D I A G N O S I S 0
CONDITION
Machine Condition Monitoring and Fault Diagnosis
Machine condition monitoring is the process of checking a machine for abnormal symptoms. Fault diagnosis, on the other hand, means deciding the nature and the cause of the fault by examining the symptoms [1]. The article aims at providing a methodology for potential users interested in implementing techniques pertaining to the area of machine condition monitoring using time-frequency analysis (TFA). It also provides three examples and some relevant references. Although this article focuses on one-dimensional time-domain signals, its methodology can be extended to images and image sequences.
15.6.1.1
Machine Condition Monitoring
In modern manufacturing, the quest for automation and flexibility has resulted in machines performing extremely complex processes. The performance of such processes highly depends on the trouble-free operation of all the components. When a fault occurs, it is critical to detect it, isolate the causes, and take appropriate maintenance action at an early stage. This helps prevent faults from developing into an eventual major machine failure and interrupting the production cycle. Consequently a number of techniques have been developed which monitor certain parameters within the machinery allowing its condition to be determined. These monitoring techniques have become known as machine condition monitoring. The predictive maintenance through condition monitoring and diagnosis can significantly improve product quality, improve worker safety, and reduce the costs of maintenance. This is achieved by (1) allowing the early detection of potentially catastrophic faults which could be expensive to repair, and (2) allowing the implementation of condition based maintenance rather than periodic or failure based maintenance. In these cases, significant savings can be made by delaying scheduled maintenance until it is more convenient or necessary. An efficient condition monitoring technique is capable of providing warning and predicting the faults at early stages by obtaining information about the machine in the form of primary data. Through signal processing (SP), the critical information from these data is captured and correlated to the condition of the machine. Effectiveness depends on matching the SP algorithms to the characteristics of the monitored signals. Two types of condition monitoring and diagnosis systems are widely used: offline and on-line. In an off-line (periodic) monitoring system, the monitored signal is measured at pre-selected time intervals. This approach is routinely used for fault diagnosis and trend analysis. In an on-line (permanent) monitoring system the ~ M. Mesbah, B. Boashash and J. Mathew, Queensland University of Technology, CIEAM and SPRC, GPO Box 2434, Brisbane, Q 4001, Australia (m.mesbah@qut.edu.au, b.boashash@qut.edu.au, j.mathew@qut.edu.au). Reviewers: D. R. Iskander and V. Sucic.
672
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
signal is continuously measured and compared with a reference level. This type of system is intended to protect machines and/or operators by providing a warning about a possible malfunction of the machine and/or an imminent shutdown to prevent catastrophic failure. Traditionally, human operators, using a combination of sight and sound, have performed machine condition monitoring. Recently, automatic techniques have been proposed to replace human operators. Some of these techniques rely on direct measurements while the majority depend on indirect measurements. Direct methods use sensing techniques that directly measure the extent of the deterioration, such as tool wear, in a machine. Indirect methods may rely on sensing different machine parameters such as forces, acoustic emission, temperature, vibration, current, voltage, torque, strain, and images of the tools in question. In techniques based on indirect measurement, features indicative of condition are extracted from these monitored signals and correlated to give a measure of the extent, the nature, and the location of the fault [2].
15.6.1.2
The Four Stages of Condition Monitoring and Diagnosis
In general, machine condition monitoring, as a pattern recognition problem, consists of four stages: data acquisition, feature extraction, feature selection, and decisionmaking. Data are acquired using transducers and normally recorded in either analog or digital form on magnetic tape or computer disk. (In simple systems it may be possible to perform the analysis in real-time). A critical step of condition monitoring and diagnosis is feature extraction. It is generally not practical to automatically determine the machine condition using the collected raw signals and therefore some transformation or processing is required. This transformation usually involves as a first step mapping the original data from time-domain to another domain, such as the frequency or time-frequency domains, where the differences between the normal and abnormal behaviors are much more apparent. In this new domain, features that best describe the characteristics of the process condition are extracted. Feature extraction techniques include statistical methods, power spectral methods, and time-frequency methods as detailed in Section 15.6.2. In some cases, where the dimension of the feature space (or the number of features) is high, the dimension can be further reduced by retaining only the most valuable features and eliminating those that give little or no additional information. This dimension reduction processes is called feature selection. The decision-making or classifier stage can be viewed as a process that automatically correlates the feature set, obtained from the previous stage, to the machine conditions [3]. It is usually done through supervised learning, where the operator instructs the computer of the possible patterns in the feature sets and relates them to the machine conditions. Sometimes it is difficult to generate data that reflects all uncertainties and differences within one class or group of faults in an experiment setring. In this case, an unsupervised learning strategy is used. Unsupervised learning is a task in which the number of classes is not known before classification and there
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
673
are no labeled training features available. The classifier itself should be capable of exploring the extracted features and deciding about the number of classes. Typical automatic or computer decision-making methods include pattern recognition, fuzzy logic, decision trees, and artificial neural networks. Ideally, there exists a one-to-one correlation between feature sets and machine conditions. An alternative approach is to monitor the features and spot trends in them and thus predict failure. The decision to replace the faulty part is often taken when the feature crosses a given threshold [1].
15.6.1.3
Classical Signal Analysis Methods for Feature Extraction
Classical methods used for feature extraction can be classified into time domain and frequency domain. T i m e D o m a i n M e t h o d s : Probably the simplest approach proposed for fault detection in the time domain is through the measurement of the energy (mean square value) of the monitored signal. The method relies on the fact that as the machine's condition deteriorates, the vibration energy is expected to increase. Another approach is to use statistical parameters for fault detection. By treating the monitored signal as random variable, higher-order statistical moments, cumulants, and measures such as crest factor can also be used as features. Nonlinear signal based techniques have also been used for condition monitoring and fault diagnosis. In [4], for example, the correlation dimension was extracted from raw time-series acceleration data (collected from a rolling-element bearing) and used as a feature for detecting faults. Other methods such as level crossing, bandpass filtering, shock pulse, and autoregressive modeling are used (see for example [5]). F r e q u e n c y D o m a i n M e t h o d s : The basic principle of spectral analysis is based on the fact that the spectrum of the monitored signal changes when faults occur. The nature and extent of the change depends of the nature of the fault and the machine being monitored. The condition of the machine is estimated through monitoring the change in the spectrum or a number of discriminating features extracted from the spectrum of the measured signal. These features are usually chosen as some specific frequency components that depend on the type of machine and the nature of the fault. They are compared to references established when the machine was known to work properly under similar conditions, and an appropriate decision is taken when the feature vector deviates from the reference by more than a predetermined threshold. In [6], the different changes in the vibration spectrum of rotating machines are surveyed and linked to different types of faults. Also, in [7, ch. 11] the most frequent failure modes are identified for the different machine-train components such as drives, steam turbines, gearboxes, and generators. For each component, a number of specific vibration frequencies are monitored for the diagnosis of incipient problems. These frequency-domain features, depending on the component and the nature of the failure, may include defect frequencies, the fundamental and harmonics of the rotational speed, the line frequency, the slip frequency, and
(574
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
the tooth-mesh frequencies and the sidebands that surround them. Higher-order spectra such as the bispectrum and trispectrum are also used as a basis for condition monitoring. In [8], the bispectrum is used to analyze the acceleration signal obtained from a stamping process and to extract features related to defective parts.
15.6.1.4
Nonstationary Signals in Machines
L i m i t a t i o n s of Classical M e t h o d s : Traditional time-domain and spectral analysis techniques have several shortcomings. For example, the Fourier transform is unable to accurately analyze and represent a signal that has non-periodic components such as a transient signal, as it is based on the assumption that the signal to be transformed is periodic. Another deficiency of the traditional spectral analysis is its inability to provide any information about the time dependency of the frequency content of non-stationary signals (see Article 1.1 for more details). Motor current, for example, is well known to be a nonstationary signal whose properties vary with respect to the time-varying normal operating conditions of the motor, particularly with load. Also, for the case of rotating machines, the presence of certain frequency components within the spectrum has been shown to be an indication of a fault condition. However, since some of these frequencies depend on the rotational speed, it is not possible using spectral analysis to determine these frequencies when the bearing runs at variable rotational speed. Recent works have stressed the importance of machine monitoring during the transient states--such as start-up, shutdown, and acceleration periods--because some machine failures happen during these types of transition periods. Transient signals can be a good source of information about machine condition that is not available during steady states. Fourier transform based methods are known to be inadequate in representing this type of signals since the transient event can hardly be approximated by sines and cosines. For these reasons, Fourier transform based methods are unsuitable for machine monitoring in the above-mentioned circumstances [9]. T h e N e e d for T i m e - F r e q u e n c y M e t h o d s : To overcome the shortcomings of the traditional spectral analysis techniques, nonstationary signal analysis approaches have been introduced. The most frequently used methods in the area of machine condition monitoring and diagnosis are quadratic time-frequency distributions (TFDs) and time-scale analysis (mainly the wavelet transforms (WT)). These methods represent the signals in a wider time-frequency space that allows easier and more precise discrimination between fault and normal machine conditions. Using time-frequency techniques, such as the Wigner-Ville distribution (WVD), a framework was developed that provided robust detection and classification schemes for helicopter gearbox faults [10]. This was achieved by showing that different faults produced different patterns in the time-frequency plane. The WVD-based patterns of vibration and acoustic signals were also used to detect faults in a number of machines and machine components such as engines [11] and gearboxes [12]. Other time-frequency distributions such as higher-order Wigner-Ville moment distributions [13] and reduced-interference time-frequency distributions (RIDs) [14] are used
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
675
for machine monitoring and diagnosis. Most of these methods, however, are visualbased detection/classification techniques which are meant to show the effectiveness of the respective T F D s for early detection of faults. The other methods are used as automatic feature extractors in an overall classification process. Some of the features extracted are amplitude values of the contour plots [12] and singular values of the T F D [14]. Due to their ability to represent nonstationary signals in general, and to detect and localize transient events in particular, wavelet transforms (both continuous and discrete) have been readily adopted in machine condition monitoring and diagnosis. They were used in detecting a large number of faults in different machines or machine components such as turning and drilling machines [1], gears or gear trains [15], and bearings [16]. As in the case of the TFDs, some of the proposed methods are used as feature extractors whose output is fed to a detector/classifier [1].
15.6.2
Time-Frequency Analysis Methods
Articles in Chapters 1 to 5 present detailed background on different time-frequency methods. The two most widely used time-frequency methodologies are the quadratic time-frequency distributions and the wavelet transforms. These two classes of representations are related through the S T F T and the Gabor transform (see Articles 2.3 and 2.7). T F D s are suitable for large BT signals (see Chapter 1) while W T s give best results when used with low BT and transient signals.
15.6.2.1
Quadratic Time-Frequency Distributions
For nonstationary signals, the Wiener-Khintchine theorem indicates that the timevarying power spectral density, S=(t, f), of a real random signal x(t) is related to the time-varying autocorrelation function, Rx(t, ~-), by a Fourier transform relation; that is
Sx(t, f) - E {W~(t, f)} -
Rx(t, T)e -j2~y" d~-.
(15.6.1)
oo
The expression S=(t, f) given by Eq. (15.6.1) is the Wigner-Ville spectrum (WVS), which is the expectation value of the Wigner distribution (WD) Wz(t,f). For practical reasons, x(t) is replaced by its analytic associate z(t) (see Sections 1.2.2, 1.2.3 and 2.1.4). It was shown that an estimate of Sz(t, f) can be obtained from the quadratic class of TFDs [17], which was expressed in Section 3.2.2 as
pz(t, f) - Wz(t, f) ~ } ~/(t, f)
(15.6.2)
where 7(t, f) is a two-dimensional kernel window which is application dependent, Wz(t, f) is the WVD, and ** indicates a double convolution. The example in Sec-
ti
tion 15.6.3.1 illustrates an application of the WVD to machine condition monitoring. The kernel window, 7(t, f), characterizes a particular time-frequency distribution and is generally chosen so as to obtain the best possible time-frequency resolution [14] (see Article 3.3 for more details).
Chapter 15:
676
15.5.2.2
Time-FrequencyDiagnosis and Monitoring
Wavelet Transforms
Wavelet transforms are the localized equivalent of the Fourier transform. They provide a powerful tool for representing local features of a signal. A finite-energy signal x(t) can be represented by its Fourier transform X ( f ) :
x(t) -
?
X ( f ) ej2€
df
X(f) =
where
co
F
x(t) e -j2€
dt.
(15.6.3)
oo
Thus, the FT decomposes the time-domain signal into linear combinations of harmonics ej27rft. The wavelet transform (WT) is defined in the similar manner except that the harmonics are replaced by a series of wavelet basis functions given by [18]
1
(t--T)
(15.6.4)
where T and s are the translation and dilation (scale) parameters respectively. The function ~(...) is the transformation function called the mother wavelet. Using wavelet bases, the time-domain signal can be represented as x(t) -
1F~o~
1
r
(t--T)
ds
7
where -
x(t)
cr is a constant that depends on the wavelet used and ~r S) is the continuous wavelet transform of the signal x(t). A number of mother wavelets have been proposed, such as the Mexican hat wavelet and the Morlet wavelet [18]. The discrete version of the W T is called discrete wavelet transform (DWT). It is realized by first discretizing the parameter scale s on a logarithmic grade. The time parameter is then discretized with respect to the scale parameter; that is a different sampling rate is used for every scale. In other words, the sampling is done on a dyadic sampling grid. With this sampling, a signal x(t) can be decomposed into orthogonal basis functions (scaled and shifted versions of the mother wavelet ~p); that is [18] x(t) - cr E E alk sol/2r -- kTo) (15.6.7) l
where
aZk --
F
k
x(t) Sot/2r
-- kTo) dt
(15.6.8)
(:x)
with To and so being positive constants usually taken as 1 and 2 respectively. The integer 1 describes the different levels of wavelets, and k covers the number of wavelets in each level.
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
677
The wavelet transform allows localization in both the time domain via translations of the mother wavelet, and in the scale (frequency) domain via dilations. The wavelet is irregular in shape and compactly supported, thus making it an ideal tool for analyzing signals of a transient nature. Irregularity of the wavelet basis lends it to analysis of signals with discontinuities or sharp changes, while the compactly supported nature of wavelets enables temporal localization of a signal's features. The dilation function of the discrete wavelet transform can be represented as a tree of low- and high-pass filters, with each step transforming the low-pass filter. The original signal is successively decomposed into components of lower resolution, while the high-frequency components are not analyzed any further. In contrast with the regular DWT, discrete wavelet packet analysis (DWPA) can significantly increase the versatility and power of the DWT. Unlike the DWT, DWPA utilizes both the low frequency components (approximations), and the highfrequency components (details). From this family of bases, a method for choosing the optimum scheme for a particular signal can be developed [18]. The two examples in Sections 15.6.3.2 and 15.6.3.3 illustrate the applications of DWT and DWPA to machine condition monitoring.
15.6.3
Examples of Condition Monitoring Using TFA
To illustrate how both time-frequency distributions and wavelet transforms are used in condition monitoring and diagnosis, we summarize three methods selected from the literature. These examples illustrate the time-frequency methodology adopted in this increasingly important area of engineering.
15.6.3.1
Gearbox Fault Detection Using Wigner-Ville Distribution
We consider the detection of a broken tooth in a spur gear using the WVD as the basis for feature extraction and pattern recognition techniques for classification [12]. D a t a A c q u i s i t i o n : The system considered is composed of a 24-tooth input gear driven by an electric motor and meshing with 16 teeth of a pinion whose rotational frequency is 37.5 Hz. The applied load was 70% of the maximum load. The study simulated five fault types, each involving the partial or total removal of one or more teeth. In particular, the faults were the removal of 25, 50, 75, and 100 percent of the face-width at a given radius, plus the same defect with 100% advancement on two pinion teeth. The acceleration vibration signal obtained from the above-mentioned system was low-pass filtered and sampled at a rate of 6.4 kHz. F e a t u r e E x t r a c t i o n : The vibration signal is synchronously averaged in order to remove any periodic events not exactly synchronous with the gear of interest and to reduce the effects of noise and vibration sources other than that of the gear. In an industrial environment, where the problem of noise may become critical, efficient time-frequency based signal-cleansing techniques such as time-frequency peak filtering [19] (see also Article 11.4) may be required. The averaged signal is then transformed to the time-frequency domain using the pseudo-WVD (discrete
678
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
Fig. 15.6.1: Weighted WVD of the residual signal: (a) normal condition of the spur gear; (b) one broken tooth with 50% fault advancement; (c) one broken tooth with 100% fault advancement.
WVD) with a Hamming window. The negative values of the WVD are set to zero and the resulting distribution is normalized. The results are displayed in the form of contour plots. To enhance the sidebands around the meshing frequencies, the residual signal is obtained by removing the meshing harmonics using a band-stop filter. The extracted features are the amplitude values of the contour plots (see Fig. 15.6.1). F e a t u r e Selection: To reduce the dimension of the feature vector, a selected number of WVD cross-sections at and around a chosen meshing frequency are selected. D e c i s i o n Making" Two classification approaches are considered: statistical and neural pattern recognition. In the first approach, to assign the feature vector from the last stage to one of the K classes considered, the Mahalanobis distance was chosen as the similarity measure. This measure is given by 1/2
where ~ is the feature vector and ~ and 2k are the mean vector and covariance matrix representing the k th class. The study considered only two classes; namely normal (no fault) and abnormal (fault) and used only one template representing the normal condition. In the second classification approach a neural network was trained in a supervised mode using the back-propagation algorithm [12].
15.6.3.2
Fault Diagnosis of Rotating Machinery Using Wavelet Transforms
In this example, the problem is to detect faults in a model drive-line consisting of various interconnected rotating parts that include a vehicle gearbox, two bearing housings, and an electric motor. All these parts are connected by flexible couplings and loaded by a disk brake as seen in Fig. 15.6.2 [20].
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
679
F i g . 15.6.2: Schematic presentation of the model drive-line.
(a) Envelope of the W T of a vibration signal representing a normal condition.
(b) Envelope of the W T of a vibration signal representing a faulty gear.
F i g . 15.6.3: WT of normal and faulty condition of the vibration signal.
D a t a A c q u i s i t i o n : Of the five gears (four forward and one reverse) only the reverse gear pinion is used in the experiment. On the gear pinion, two types of localized faults were simulated: a small "blip" of 2mm diameter on one tooth, and a triangular fracture on the face of one tooth. On the bearing housing, one fault was simulated by introducing a l m m fracture across the inner race (Fig. 15.6.2). This gave six combinations of conditions for the pinion and housing, five of which represented fault conditions. An accelerometer was used to obtain the vibration signal from the bearing housing. F e a t u r e E x t r a c t i o n : The vibration signals were transformed to the timescale domain using the Daubechies 4th-order wavelet (D4) (see Fig. 15.6.3). After the transformation of the whole signal into the wavelet domain, a threshold value was chosen. This value was selected to be above the dominant component of the reference (normal) signal. The 10 most dominant amplitudes of the signals above the threshold value were selected to represent half of the feature vector. The other half consists of the 10 corresponding wavelet numbers (indicating both time and scale). As the number of features (20) is not large, no feature selection was needed.
680
Chapter 15:
Time-Frequency Diagnosis and Monitoring
Fig. 15.6.4: DWPA representation of the vibration signal showing wavelet packets selected by ANFIS.
Decision M a k i n g : The classification was achieved using a two-layer neural network with sigmoid nodal function trained in a supervised mode using the backpropagation algorithm [20].
15.6.3.3
Extraction of Bearing Fault Transients Using DWPA
This example exploits the multiple band-pass filtering capability of the DWPA for the extraction of rolling-element bearing fault-related components. An algorithm is trained to recognize three types of localized faults; namely inner race, rolling element, and outer race faults [21]. D a t a Acquisition: The vibration signals are obtained from a rolling-element bearing test rig with a rolling-element fault and an operating speed of 60 rpm. F e a t u r e E x t r a c t i o n : The extraction of high-frequency transients due to bearing impact resonance is achieved via best-basis DWPA representation using the Daubechies wavelet of order 20 and an adaptive network-based fuzzy inference systern (ANFIS). ANFIS is a transformational model of integration where the final fuzzy inference system is optimized via artificial neural network training. Before the neuro-fuzzy network is trained (using wavelet packets extracted from vibration signals), suitable input parameters to train the network are selected. These parameters are kurtosis (a measure of spikiness) and the spectrum peak ratio (an indication of the presence of localized defects). The network is then trained using wavelet packets characterizing the above-mentioned types of faults. Fig. 15.6.4 illustrates how this method facilitates the extraction of bearing-fault-related components from a signal while rejecting the unwanted harmonics. The wavelet packets identified by ANFIS as containing bearing fault-related features are indicated on the figure [21].
Time-Frequency Based Machine Condition Monitoring and Fault Diagnosis
15.5.4
681
Summary and Conclusions
Time-frequency analysis methods are applicable to the area of machine condition monitoring and diagnosis. They are capable of efficiently and unambiguously characterizing a large number of faults. TFA methods are used for detection, classification, and monitoring the progression of the faults and wear with time. This enables prediction and prevention of catastrophic failures. Time-frequency analysis techniques, in the form of either T F D or W T , are used as both visual indicators of the presence of faults and as a feature extractor in a fully automated pattern recognition process. Articles 11.2 and 15.2 of this book describe two other time-frequency approaches to machine condition monitoring.
References [1] R. Du, "Engineering monitoring and diagnosis using wavelet transforms," in ComputerAided Design, Engineering, and Manufacturing: Systems Techniques and Applications-Vol. I: Systems Techniques and Computational Methods (C. T. Leondes, ed.), ch. 8, CRC Press, 2000. [2] D. E. Dimla Snr., "Sensor signals for tool-wear monitoring in metal cutting operations~ A review of methods," Internat. J. Machine Tools CJ Manufacture, vol. 40, pp. 1073-1098, June 2000. [3] S. R. Kulkarni, G. Lugosi, and S. S. Venkatesh, "Learning pattern classification--A survey," IEEE Trans. Information Theory, vol. 44, pp. 2178-2206, October 1998. [4] D. Logan and J. Mathew, "Using the correlation dimension for vibration fault diagnosis of rolling element bearings--I: Basic concepts," J. Mechanical Systems and Signal Processing, vol. 10, pp. 241-250, May 1996. [5] D. C. Baillie and J. Mathew, "A comparison of autoregressive modeling techniques for fault diagnosis of rolling element bearings," J. Mechanical Systems and Signal Processing, vol. 10, pp. 1-17, January 1996. [6] J. Courrech and R. L. Eshleman, "Condition monitoring of machinery," in Harris' Shock and Vibration Handbook (C. M. Harris and A. G. Piersol, eds.), ch. 16, New York: McGraw-Hill, 2002. [7] R. K. Mobley, Vibration Fundamentals. Boston: Newnes, 1999. [8] G. C. Zhang, M. Ge, H. Tong, Y. Xu, and R. Du, "Bispectral analysis for on-line monitoring of stamping operation," Engineering Applications of Artificial Intelligence, vol. 15, pp. 97-104, February 2002. [9] R. Burnett, J. F. Watson, and S. Elder, "The application of modern signal processing techniques for use in rotor fault detection and location within three-phase induction motors," Signal Processing, vol. 49, pp. 57-70, February 1996. [10] B. D. Forrester, "Time-frequency analysis in machine fault detection," in TimeFrequency Signal Analysis: Methods and Applications (B. Boashash, ed.), ch. 18, pp. 406423, Melbourne/N.Y.: Longman-Cheshire/Wiley, 1992.
682
Chapter 15:
Time-FrequencyDiagnosis and Monitoring
[11] B. Boashash and P. J. O'Shea, "A methodology for detection and classification of some underwater acoustic signals using time-frequency analysis techniques," IEEE Trans. Acoustics, Speech, 8J Signal Processing, vol. 38, pp. 1829-1841, November 1990. [12] W. J. Staszewski, K. Worden, and G. R. Tomlinson, "Time-frequency analysis in gearbox fault detection using the Wigner-Ville distribution and pattern recognition," J. Mechanical Systems and Signal Processing, vol. 11, pp. 673-692, September 1997. [13] S. K. Lee and P. R. White, "Higher-order time-frequency analysis and its application to fault detection in rotating machinery," J. Mechanical Systems and Signal Processing, vol. 11, pp. 637-650, July 1997. [14] S. Gu, J. Ni, and J. Yuan, "Non-stationary signal analysis and transient machining process condition monitoring," Internat. J. Machine Tools s Manufacture, vol. 42, pp. 4151, January 2002. [15] H. Zheng, Z. Li, and X. Chen, "Gear fault diagnosis based on continuous wavelet transform," J. Mechanical Systems and Signal Processing, vol. 16, pp. 447-457, MarchMay 2002. [16] S. Prabhakar, A. S. Sekhar, and A. R. Mohanty, "Detection and monitoring of cracks in a rotor-bearing system using wavelet transforms," J. Mechanical Systems and Signal Processing, vol. 15, pp. 447-450, March 2001. [17] B. Boashash and M. Mesbah, "A time-frequency approach for newborn seizure detection," IEEE Engineering in Medicine eJ Biology Magazine, vol. 20, pp. 54-64, September/October 2001. [18] S. G. Mallat, A Wavelet Tour of Signal Processing. San Diego / London: Academic Press, 2nd ed., 1999. [19] B. Boashash and M. Mesbah, "Signal enhancement by time-frequency peak filtering," IEEE Trans. Signal Processing, vol. 51, November 2003. [20] B. A. Paya, I. I. Esat, and M. N. M. Badi, "Artificial neural network based fault diagnostics of rotating machinery using wavelet transforms as a preprocessor," J. Mechanical Systems and Signal Processing, vol. 11, pp. 751-765, September 1997. [21] J. Altmann and J. Mathew, "DWPA multiple band-pass filtering for extraction of bearing fault transients from a strong continuous signal," in Proc. Asia-Pacific Vibration Conference, vol. 2, pp. 772-776, Hangzhou, China, October-November 2001.
Chapter 16
Other Applications There are a number of applications that may be of interest to the reader but that could not be included in the chapters for obvious space reasons. A list of references to selected applications is provided below to further assist the reader of this book. They represent the areas of biomedical engineering [1-7], speech [8-10], radar [11-13], telecommunications [14, 15], plasma science [16], image [17, 18] and tomography [19]. Further references can be obtained by the reader using available computer searches on the relevant databases (e.g. Elsevier, IEEE, etc.).
References [1] A. Monti, C. Medigue, and L. Mangin, "Instantaneous parameter estimation in cardiovascular time series by harmonic and time-frequency analysis," IEEE Trans. Biomedical Engineering, vol. 49, pp. 1547-1556, December 2002. [2] H. G. van Steenis, W. L. J. Martens, and J. H. M. Tulen, "Time-frequency parameters of heart-rate variability," IEEE Engineering in Medicine ~ Biology Magazine, vol. 21, pp. 46-58, July-August 2002. [3] R. Zou, W. A. Cupples, K. R. Yip, N. H. Holstein-Rathlou, and K. Chon, "Time-varying properties of renal autoregulatory mechanisms," IEEE Trans. Biomedical Engineering, vol. 49, pp. 1112-1120, October 2002. [4] Y. Hu, K. D. Luk, W. W. Lu, and J. C. Leong, "Comparison of time-frequency analysis techniques in intraoperative somatosensory evoked potential (SEP) monitoring," Computers in Biology ~ Medicine, vol. 32, pp. 13-23, January 2002. [5] P. Bonato, P. Boissy, U. D. Croce, and S. H. Roy, "Changes in the surface EMG signal and the biomechanics of motion during a repetitive lifting task," IEEE Trans. Neural System ~ Rehabilitation Engineering, vol. 10, pp. 38-47, March 2002. [6] L. Rankine and M. Mesbah, "Significant atom determination of basis pursuit decomposition," in Proc. Seventh Internat. Syrup. on Signal Processing and its Applications (ISSPA '03), pp. 577-580, Paris, 1-4 July 2003. [7] P. Zarjam, M. Mesbah, and B. Boashash, "Detection of newborn EEG seizure using optimal features based on discrete wavelet transform," in Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'03), vol. 2, pp. 265-268, Hong Kong, scheduled 6-10 April 2003. [8] I. Magrin-Chagnolleau, G. Durou, and F. Bimbot, "Application of time-frequency principal component analysis to text-independent speaker identification," IEEE Trans. on Speech ~ Audio Processing, vol. 10, pp. 371-378, September 2002. 683
684
Chapter 16:
OtherApplications
[9] S. Ghaemmaghami, M. Deriche, and B. Boashash, "Hierarchical approach to formant detection and tracking through instantaneous frequency estimation," IEE Electronics Letters, vol. 33, pp. 17-18, January 1997. [10] M. D. Riley, Speech Time-Frequency Representations. Kluwer, 1989. [11] X.-G. Xia, G. Wang, and V. Chen, "Quantitative SNR analysis for ISAR imaging using joint time-frequency analysis--Short time Fourier transform," IEEE Trans. Aerospace Electronic Systems, vol. 38, pp. 649-659, April 2002. [12] O. P. Kenny and B. Boashash, "Time-frequency analysis of backscattered signals from diffuse radar targets," IEE Proc., Part F: Radar ~ Signal Processing, vol. 140, pp. 198208, June 1993. [13] V. C. Chen and H. Ling, Time-Frequency Transforms for Radar Imaging and Signal Analysis. Scitech Publishing, 2002. [14] A. Z. Sha'ameri, B. Boashash, and I. Ismail, "Design of signal dependent kernel functions for digital modulated signals," in Proc. Fourth Internat. Syrup. on Signal Processing and its Applications (ISSPA '96), vol. 2, pp. 527-528, Gold Coast, Australia, 25-30 August 1996. [15] G. Azemi, B. Senadji, and B. Boashash, "A novel estimator for the velocity of a mobile base station in a micro-cellular system," in Proc. IEEE Internat. Symp. on Circuits and Systems (ISCAS 03), vol. 2, pp. 212-215, Bangkok, 25-28 May 2003. [16] A. C. A. Figueiredo and J. P. S. Bizarro, "Time-frequency images of magnetohydrodynamic phenomena in tokamak plasmas using a discrete-time Wigner distribution," IEEE Trans. Plasma Science, vol. 30, pp. 54-55, February 2002. [17] I. Djurovid and S. Stankovid, "Estimation of time-varying velocities of moving objects by time-frequency representations," IEEE Trans. Image Processing, vol. 12, pp. 550-562, May 2003. [18] S. L. Hahn and K. M. Snopek, "Double-dimensional distributions: Another approach to "quartic" distributions," IEEE Trans. Signal Processing, vol. 50, pp. 2987-2997, December 2002. [19] C. Richard, "Time-frequency-based detection using discrete-time discrete-frequency Wigner distributions," IEEE Trans. Signal Processing, vol. 50, pp. 2170-2176, September 2002.
Time-Frequency Bibliography Reference numbers in the text of this book refer to the local "References" list at the end of the current Chapter (in Part I and Chapter 16) or the current Article (elsewhere). The following "time-frequency" bibliography ~ is an alphabetical listing of all publicly-available sources that appear in the local "References" lists, excluding application-specific references not pertaining to the time-frequency field, and adding some other important references which, because of space and scope restrictions, are not cited in the text. It is intended as a snapshot, dated 2003, of the most important references in TFSAP and its applications. If the same source appears in two or more local "References" lists, the reference numbers generally differ, but the substantive details have been verified and harmonized. 1
[1] K. Abed-Meraim and B. Barkat. "Blind source separation using the time-frequency distribution of the mixture signal". In Proc. Second IEEE Internat. Symp. on Signal Processing and Information Technology (ISSPIT'02), volume 2, pages 663-666, Marrakech, Morocco, 18-21 December 2002. [2] K. Abed-Meraim, N. age processing approach In Proc. Third Internat. Rome, 18-20 September
Linh-Trung, V. Sucic, F. Tupin, and B. Boashash. "An imfor underdetermined blind separation of nonstationary sources".
Symp. on Image ~ Signal Processing and Analysis (ISPA-03), 2003.
[3] S. S. Abeysekera and B. Boashash. "Methods of signal classification using the images produced by the Wigner-Ville distribution". Pattern Recognition Letters, 12(11):717-729, November 1991. [4] R. Adelino and F. da Silva. "Atomic decomposition with evolutionary pursuit". Digital Signal Processing: A Review Journal, 13(2):317-337, April 2003. [5] O. Akay and G. F. Boudreaux-Bartels. "Unitary and Hermitian fractional operators and their relation to the fractional Fourier transform". IEEE Signal Processing Letters, 5(12):312-314, December 1998. [6] O. Akay and G. F. Boudreaux-Bartels. "Fractional convolution and correlation via operator methods and an application to detection of linear FM signals". IEEE Trans. Signal Processing, 49(5):979-993, May 2001. [7] L. B. Almeida. "The fractional Fourier transform and time-frequency representations". IEEE Trans. Signal Processing, 42(11):3084-3091, November 1994. [8] R. A. Altes. "Detection, estimation and classification with spectrograms". J. Acoustical Soc. of America, 67(4):1232-1246, April 1980. ~ Gavin Ft. P u t l a n d and Boualem Boashash, Signal Processing Research Centre, Queensland University of Technology, Brisbane, Australia. 1A consolidated bibliographic database was compiled from the references supplied by contributing authors. Local reference lists were regenerated from that database using the bibunits package (v. 2.0 by Thorsten Hansen). 685
686
Time-Frequency Bibliography
[9] R. A. Altes. "Wideband, proportional-bandwidth Wigner-Ville analysis". IEEE Trans. Acoustics, Speech, s Signal Processing, 38(6):1005-1012, June 1990. [10] R. A. Altes and E. L. Titlebaum. "Bat signals as optimally Doppler tolerant waveforms". J. Acoustical Soc. of America, 48:1014-1020, October 1970. [11] J. Altmann and J. Mathew. "DWPA multiple band-pass filtering for extraction of bearing fault transients from a strong continuous signal". In Proc. Asia-Pacific Vibration Conference, volume 2, pages 772-776, Hangzhou, China, October-November 2001. [12] M. G. Amin. "Time-frequency spectrum analysis and estimation for non-stationary random processes". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 9, pages 208-232. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [13] M. G. Amin. "Spectral decomposition of time-frequency distribution kernels". IEEE Trans. Signal Processing, 42(5):1156-1165, May 1994. [14] M. G. Amin. "Minimum-variance time-frequency distribution kernels for signals in additive noise". IEEE Trans. Signal Processing, 44(9):2352-2356, September 1996. [15] M. G. Amin. "Interference mitigation in spread-spectrum communication systems using time-frequency distributions". IEEE Trans. Signal Processing, 45(1):90-102, January 1997. [16] M. G. Amin and A. N. Akansu. "Time-frequency for interference excision in spreadspectrum communications". In G. B. Giannakis, editor, "Highlights of Signal Processing for Communications", IEEE Signal Processing Magazine, volume 16(2). IEEE, March 1999. [17] M. G. Amin, A. Belouchrani, and Y. Zhang. "The spatial ambiguity function and its applications". IEEE Signal Processing Letters, 7(6):138-140, June 2000. [18] M. G. Amin, C. Wang, and A. R. Lindsey. "Optimum interference excision in spreadspectrum communications using open-loop adaptive filters". IEEE Trans. Signal Processing, 47(7):1966-1976, July 1999. [19] M. G. Amin and Y. Zhang. "Direction finding based on spatial time-frequency distribution matrices". Digital Signal Processing: A Review Journal, 10(4):325-339, October 2000. [20] J. C. Andrieux, R. Feix, G. Mourgues, P. Bertrand, B. Izrar, and V. T. Nguyen. "Optimum smoothing of the Wigner-Ville distribution". IEEE Trans. Acoustics, Speech, eJ Signal Processing, 35(6):764-769, June 1987. [21] L. Angrisani and M. D'Arco. "A measurement method based on a modified version of the chirplet transform for instantaneous frequency estimation". IEEE Trans. Instrumentation ~J Measurement, 51(4):704-711, August 2002. [22] M. J. Arnold, M. Roessgen, and B. Boashash. "Filtering real signals through frequency modulation and peak detection in the time-frequency plane". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), volume 3, pages 345-348, Adelaide, Australia, 19-22 April 1994. [23] H. Art,s, G. Matz, and F. Hlawatsch. "Unbiased scattering function estimation during data transmission". In Proc. IEEE Vehicular Technology Conf. (VTC'99-Fall), pages 1535-1539, Amsterdam, 19-22 September 1999.
Time-Frequency Bibliography
687
[24] F. Auger and P. Flandrin. "Improving the readability of time-frequency and time-scale representations by the reassignment method". IEEE Trans. Signal Processing, 43(5):10681089, May 1995. [25] G. Azemi, B. Senadji, and B. Boashash. "A novel estimator for the velocity of a mobile base station in a micro-cellular system". In Proc. IEEE Internat. Symp. on Circuits and Systems (ISCAS 03), volume 2, pages 212-215, Bangkok, 25-28 May 2003. [26] J. S. Bach (1685-1750). St Matthew Passion, nos. 2-3 (ed. A. Van Ryckeghem). GMD Music Archive, http://www.gmd.de/Misc/Music/scores/Introduction.html, 2000. Continued by Werner Icking Music Archive, http://icking-music-archive.sunsite.dk/. [27] R. Baraniuk, D. Jones, T. Brotherton, and S. L. Marple Jr. "Applications of adaptive time-frequency representations to underwater acoustic signal processing". In Proc. 25th Asilomar Conf. on Signals, Systems, and Computers, volume 2, pages 1109-1113, Pacific Grove, CA, 4-6 November 1991. [28] R. G. Baraniuk. "Covariant time-frequency representations through unitary equivalence". IEEE Signal Processing Letters, 3(3):79-81, March 1996. [29] R. G. Baraniuk. "Beyond time-frequency analysis: Energy densities in one and many dimensions". IEEE Trans. Signal Processing, 46(9):2305-2314, September 1998. [30] R. G. Baraniuk. "Joint distributions of arbitrary variables made easy". J. of Multidimensional Systems ~J Signal Processing, 9(4):341-348, October 1998. Special issue on time-frequency analysis. [31] R. G. Baraniuk and L. Cohen. "On joint distributions of arbitrary variables". IEEE Signal Processing Letters, 2(1):10-12, January 1995. [32] R. G. Baraniuk, P. Flandrin, A. J. E. M. Janssen, and O. J. J. Michel. "Measuring time-frequency information content using the R@nyi entropies". IEEE Trans. Information Theory, 47(4):1391-1409, May 2001. [33] R. G. Baraniuk and D. L. Jones. "Signal-dependent time-frequency analysis using a radially Gaussian kernel". Signal Processing, 32(3):263-284, June 1993. [34] R. G. Baraniuk and D. L. Jones. "A signal-dependent time-frequency representation: Optimal kernel design". IEEE Trans. Signal Processing, 41(4):1589-1602, April 1993. [35] R. G. Baraniuk and D. L. Jones. "A signal-dependent time-frequency representation: Fast algorithm for optimal kernel design". IEEE Trans. Signal Processing, 42(1):134-146, January 1994. [36] R. G. Baraniuk and D. L. Jones. "Unitary equivalence: A new twist on signal processing". IEEE Trans. Signal Processing, 43(10):2269-2282, October 1995. [37] S. Barbarossa. "Detection and imaging of moving objects with synthetic aperture radar--Part 1: Optimal detection and parameter estimation theory". IEE Proc., Part F: Radar 8J Signal Processing, 139(1):79-88, February 1992. [38] S. Barbarossa and O. Lemoine. "Analysis of nonlinear FM signals by pattern recognition of their time-frequency representation". IEEE Signal Processing Letters, 3(4):112115, April 1996. [39] S. Barbarossa and A. Scaglione. "Autofocusing of SAR images based on the product high-order ambiguity function". IEE Proc.: Radar, Sonar ~J Navigation, 145(5):269-273, October 1998.
688
Time-Frequency Bibliography
[40] S. Barbarossa and A. Scaglione. "Adaptive time-varying cancellations of wideband interferences in spread-spectrum communications based on time-frequency distributions". IEEE Trans. Signal Processing, 47(4):957-965, April 1999. [41] S. Barbarossa and A. Scaglione. "On the capacity of linear time-varying channels". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'99), pages 2627-2630, Phoenix, AZ, 15-19 March 1999. [42] S. Barbarossa and A. Scaglione. "Time-varying fading channels". In G. B. Giannakis, Y. Hua, P. Stoica, and L. Tong, editors, Signal Processing Advances in Wireless and Mobile Communications, volume 2: "Trends in Single- and Multi-User Systems", chapter 1. Prentice-Hall, Upper Saddle River, N J, 2001. [43] S. Barbarossa, A. Scaglione, and G. B. Giannakis. "Product high-order ambiguity function for multicomponent polynomial-phase signal modeling". IEEE Trans. Signal Processing, 46(3):691-708, March 1998. [44] B. Barkat. "Instantaneous frequency estimation of nonlinear frequency-modulated signals in the presence of multiplicative and additive noise". IEEE Trans. Signal Processing, 49(10):2214-2222, October 2001. [45] B. Barkat and K. Abed-Meraim. "A blind components separation procedure for FM signal analysis". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), volume 2, pages 1425-1428, Orlando, FL, 13-17 May 2002. [46] B. Barkat and B. Boashash. "Higher order PWVD and Legendre based time-frequency distribution". In Proc. Sixth IEEE Internat. Workshop on Intelligent Signal Processing and Communication Systems (ISPACS'98), volume 2, pages 532-536, Melbourne, Australia, 5-6 November 1998. [47] B. Barkat and B. Boashash. "Design of higher order polynomial Wigner-Ville distributions". IEEE Trans. Signal Processing, 47(9):2608-2611, September 1999. [48] B. Barkat and B. Boashash. "Instantaneous frequency estimation of polynomial FM signals using the peak of the PWVD: Statistical performance in the presence of additive Gaussian noise". IEEE Trans. Signal Processing, 47(9):2480-2490, September 1999. [49] B. Barkat and B. Boashash. "A high-resolution quadratic time-frequency distribution for multicomponent signals analysis". IEEE Trans. Signal Processing, 49(10):2232-2239, October 2001. [50] H. O. Bartelt, K. H. Brenner, and A. W. Lohmann. "The Wigner distribution function and its optical production". Optics Communications, 32(1):32-38, January 1980. [51] M. J. Bastiaans. "Gabor's signal expansion and its relation to sampling of the slidingwindow spectrum". In R. J. Marks II, editor, Advanced Topics in Shannon Sampling and Interpolation Theory, pages 1-35. Springer, New York, 1993. [52] M. J. Bastiaans. "Gabor's expansion and the Zak transform for continuous-time and discrete-time signals: Critical sampling and rational oversampling". Research Report 95-E-295, Eindhoven University of Technology, Eindhoven, Netherlands, 1995. [53] M. J. Bastiaans, T. Alieva, and L. StankoviS. "On rotated time-frequency kernels". IEEE Signal Processing Letters, 9(11):378-381, November 2002. [54] M. J. Bastiaans and M. C. W. Geilen. "On the discrete Gabor transform and the discrete Zak transform". Signal Processing, 49(3):151-166, March 1996.
Time-Frequency Bibliography
689
[55] E. Bedrosian. "A product theorem for Hilbert transforms". Proc. IEEE, 51:686-689, 1963. [56] P. A. Bello. "Characterization of randomly time-variant linear channels". IEEE Trans. Communication Systems, 11 (4):360-393, December 1963. [57] A. Belouchrani and M. G. Amin. "New approach for blind source separation using time frequency distributions". In Proc. SPIE: Advanced Signal Processing Algorithms, Architectures and Implementations VI, volume 2846, pages 193-203. Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 4-9 August 1996. [58] A. Belouchrani and M. G. Amin. "Blind source separation using time-frequency distributions: Algorithm and asymptotic performance". In Proc. IEEE Inter'nat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), volume 5, pages 3469-3472, Munich, 21-24 April 1997. [59] A. Belouchrani and M. G. Amin. "Blind source separation based on time-frequency signal representation". IEEE Trans. Signal Processing, 46(11):2888-2898, November 1998. [60] A. Belouchrani and M. G. Amin. "Time-frequency MUSIC: A new array signal processing method based on time-frequency signal representation". IEEE Signal Processing Letters, 6(5):109-110, May 1999. [61] A. Belouchrani, M. G. Amin, and K. Abed-Meraim. "Direction finding in correlated noise fields based on joint block-diagonalization of spatio-temporal correlation matrices". IEEE Signal Processing Letters, 4(9):266-268, September 1997. [62] M. Benidir. "Characterization of polynomial functions and application to timefrequency analysis". IEEE Trans. Signal Processing, 45(5):1351-1354, May 1997. [63] M. Benidir and A. Ouldali. "Polynomial phase signal analysis based on the polynomial derivatives decompositions". IEEE Trans. Signal Processing, 47(7):1954-1965, July 1999. [64] J. Bertrand and P. Bertrand. "Representations temps-fr~quence des signaux". Comptes rendus de l'Acaddmie des Sciences, Paris, 299, Ser. 1:635-638, 1984. [65] J. Bertrand and P. Bertrand. "A tomographic approach to Wigner's function". Foundations of Physics, 17:397-405, 1987. [66] J. Bertrand and P. Bertrand. "Afline time-frequency distributions". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 5, pages 118140. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [67] J. Bertrand and P. Bertrand. "A class of affine Wigner functions with extended covariance properties". J. of Mathematical Physics, 33(7):2515-2527, July 1992. [68] J. Bertrand and P. Bertrand. "Symbolic calculus on the time-frequency half-plane". J. of Mathematical Physics, 39(8):4071-4090, August 1998. [69] J. Bertrand, P. Bertrand, and J. P. Ovarlez. "The Mellin transform". In A. D. Poularikas, editor, The Transforms and Applications Handbook, chapter 11, pages 829885. CRC Press, Boca Raton, FL, 1st edition, 1996. [70] S. Bhashyam, A. M. Sayeed, and B. Aazhang. "Time-selective signaling and reception for communication over multipath fading channels". IEEE Trans. Communications, 48:83-94, January 2000.
6g0
Time-Frequency Bibliography
[71] B. Boashash. "Wigner analysis of time-varying signals--Its application in seismic prospecting". In Proc. European Signal Processing Conf. (EUSIPCO-83), pages 703-706, Nfirnberg, September 1983. [72] B. Boashash. "Note on the use of the Wigner distribution for time-frequency signal analysis". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 36(9):1518-1521, September 1988. [73] B. Boashash. "Time-frequency signal analysis". In S. Haykin, editor, Advances in Spectrum Analysis and Array Processing, volume 1, chapter 9, pages 418-517. PrenticeHall, Englewood Cliffs, N J, 1991. [74] B. Boashash. "Estimating and interpreting the instantaneous frequency of a signal Part 1: Fundamentals". Proc. IEEE, 80(4):520-538, April 1992. [75] B. Boashash. "Estimating and interpreting the instantaneous frequency of a signal Part 1: Fundamentals; Part 2: Algorithms and applications". Proc. IEEE, 80(4):519-568, April 1992. With introductions by Michael J. Riezenman. [76] B. Boashash. "Estimating and interpreting the instantaneous frequency of a signal. Part 2: Algorithms and applications". Proc. IEEE, 80(4):540-568, April 1992. [77] B. Boashash, editor. Time-Frequency Signal Analysis: Methods and Applications. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [78] B. Boashash. "Time-Frequency Signal Analysis: Past, present and future trends". In C. T. Leondes, editor, Digital Control and Signal Processing Systems and Techniques, number 78 in CONTROL AND DYNAMIC SYSTEMS: ADVANCES IN THEORY AND APPLICATIONS, chapter 1, pages 1-69. Academic Press, San Diego, 1996. [79] B. Boashash, editor. Time-Frequency Signal Analysis and Processing: A Comprehensive Reference. Elsevier, Amsterdam, 2003. [80] B. Boashash and P. J. Black. "An efficient real-time implementation of the WignerVille distribution". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 35(11):16111618, November 1987. [81] B. Boashash and B. Escudie. "Wigner-Ville analysis of asymptotic signals and applications". Signal Processing, 8(3):315-327, June 1985. [82] B. Boashash and G. Frazer. "Time-varying higher-order spectra, generalised WignerVille distribution and the analysis of underwater acoustic data". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'92), volume 5, pages 193-196, San Francisco, 23-26 March 1992. [83] B. Boashash, G. Jones, and P. J. O'Shea. "Instantaneous frequency of signals: Concepts, estimation techniques and applications". In Proc. SPIE: Advanced Algorithms and Architectures for Signal Processing IV, volume 1152, pages 382-400. Soc. of Photo-optical Instrumentation Engineers, San Diego, 8-10 August 1989. [84] B. Boashash and M. Mesbah. "A time-frequency approach for newborn seizure detection". IEEE Engineering in Medicine ~ Biology Magazine, 20(5):54-64, September/October 2001. [85] B. Boashash and M. Mesbah. "Time-frequency methodology for newborn EEG seizure detection". In A. Papandreou-Suppappola, editor, Applications in Time-Frequency Signal Processing, chapter 9. CRC Press, Boca Raton, FL, 2002.
Time-Frequency Bibliography
691
[86] B. Boashash and M. Mesbah. "Signal enhancement by time-frequency peak filtering". IEEE Trans. Signal Processing, 51(11), November 2003. [87] B. Boashash, M. Mesbah, and P. Colditz. "Newborn EEG seizure pattern characterisation using time-frequency analysis". In Proc. IEEE Inter'nat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), volume 2, pages 1041-1044, Salt Lake City, UT, 7-11 May 2001. [88] B. Boashash and P. O'Shea. "Use of the cross Wigner-Ville distribution for estimation of instantaneous frequency". IEEE Trans. Signal Processing, 41(3):1439-1445, March 1993. [89] B. Boashash and P. J. O'Shea. "A methodology for detection and classification of some underwater acoustic signals using time-frequency analysis techniques". IEEE Trans. Acoustics, Speech, ~4 Signal Processing, 38(11):1829-1841, November 1990. [90] B. Boashash and P. J. O'Shea. "Polynomial Wigner-Ville distributions and their relationship to time-varying higher order spectra". IEEE Trans. Signal Processing, 42(1):216220, January 1994. [91] B. Boashash, P. J. O'Shea, and M. J. Arnold. "Algorithms for instantaneous frequency estimation: A comparative study". In Proc. SPIE: Advanced Signal-Processing Algorithms, Architectures, and Implementations, volume 1348, pages 126-148. Soc. of Photo-optical Instrumentation Engineers, San Diego, 10-12 July 1990. [92] B. Boashash and A. Reilly. "Algorithms for time-frequency signal analysis". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 7, pages 163-181. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [93] B. Boashash and B. Ristic. "Analysis of FM signals affected by Gaussian AM using the reduced Wigner-Ville trispectrum". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'93), volume IV, pages 408-411, Minneapolis, 27-30 April 1993. [94] B. Boashash and B. Ristic. "A time-frequency perspective of higher-order spectra as a tool for non-stationary signal analysis". In B. Boashash, E. J. Powers, and A. M. Zoubir, editors, Higher-Order Statistical Signal Processing, chapter 4, pages 111-149. Longman/Wiley, Melbourne/N.Y., 1995. [95] B. Boashash and B. Ristic. "Polynomial time-frequency distributions and time-varying higher order spectra: Application to the analysis of multicomponent FM signal and to the treatment of multiplicative noise". Signal Processing, 67(1):1-23, May 1998. [96] B. Boashash and V. Sucic. "Resolution measure criteria for the objective assessment of the performance of quadratic time-frequency distributions". IEEE Trans. Signal Processing, 51(5):1253-1263, May 2003. [97] B. Boashash, L. B. White, and J. Imberger. "Wigner-Ville analysis of non-stationary random signals (with application to turbulent microstructure signals)". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'86), volume 4, pages 2323-2326, Tokyo, 7-11 April 1986. [98] B. Boashash and H. J. Whitehouse. "High resolution Wigner-Ville analysis". In Eleventh GRETSI Symp. on Signal Processing and its Applications, pages 205-208, Nice, France, 1-5 June 1987.
692
Time-Frequency Bibliography
[99] H. B61cskei and F. Hlawatsch. "Oversampled modulated filter banks". In H. G. Feichtinger and T. Strohmer, editors, Gabor Analysis and Algorithms: Theory and Applications, chapter 9, pages 295-322. Birkh/~user, Berlin/Boston, 1998. [100] P. J. Boles and B. Boashash. "Application of the cross-Wigner-Ville distribution to seismic data processing". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 20, pages 445-466. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [101] P. Bonato, P. Boissy, U. D. Croce, and S. H. Roy. "Changes in the surface EMG signal and the biomechanics of motion during a repetitive lifting task". IEEE Trans. Neural System CJ Rehabilitation Engineering, 10(1):38-47, March 2002. [102] M. Born and P. Jordan. "Zur quantenmechanik". Zeitschrift fiir Physik., 34:858-888, 1925. [103] G. F. Boudreaux-Bartels. Time-frequency signal processing algorithms: Analysis and synthesis using Wigner distribution. PhD thesis, Rice University, 1983. [104] G. F. Boudreaux-Bartels. "Mixed time-frequency signal transformations". In A. D. Poularikas, editor, The Transforms and Applications Handbook, chapter 12. CRC Press, Boca Raton, FL, 1st edition, 1996. [105] G. F. Boudreaux-Bartels and T. W. Parks. "Time-varying filtering and signal estimation using Wigner distribution synthesis techniques". IEEE Trans. Acoustics, Speech, CASignal Processing, 34(3):442-451, June 1986. [106] R. Bourdier, J. F. Allard, and K. Trumpf. "Effective frequency response and signal replica generation for filtering algorithms using multiplicative modifications of the STFT". Signal Processing, 15:193-201, September 1988. [107] T. Billow and G. Sommer. "A novel approach to the 2D analytic signal". In F. Solina and A. Leonardis, editors, Proc. 8th Internat. Conf. on Computer Analysis of Images ~J Patterns (CAIP'99), Ljubljana, Slovenia, 1-3 September 1999, number 1689 in LECTURE NOTES IN COMPUTER SCIENCE, pages 25-32. Springer, 1999. [108] A. Bultan. "A four-parameter atomic decomposition of chirplets". IEEE Trans. Signal Processing, 47(3):731-745, March 1999. [109] R. Burnett, J. F. Watson, and S. Elder. "The application of modern signal processing techniques for use in rotor fault detection and location within three-phase induction motors". Signal Processing, 49(1):57-70, February 1996. [110] A. B. Carlson. Communication Systems. McGraw-Hill, Tokyo, 2nd edition, 1975. [111] R. Carmona, W. L. Hwang, and B. Torr@sani. Practical Time-Frequency Analysis: Gabor and Wavelet Transforms with an Implementation in S. Academic Press, San Diego, 1998. [112] W. G. Carrara, R. S. Goodman, and R. M. Majewski. Spotlight Synthetic Aperture Radar. Artech House, Norwood, MA, October 1995. [113] S. Carstens-Behrens, M. Wagner, and J. F. BShme. "Detection of multiple resonances in noise". A rchiv fiir Elektronik und Obertragungstechnik (Internat. J. of Electronics Communications), 52(5):285-292, 1998. [114] P. Celka, B. Boashash, and P. Colditz. "Preprocessing and time-frequency analysis of newborn EEG seizures". IEEE Engineering in Medicine eJ Biology Magazine, 20(5):30-39, September/October 2001.
Time-Frequency Bibliography
693
[115] V. Chandran, S. Elgar, and A. Nguyen. "Detection of mines in acoustic images using higher order spectral features". IEEE J. of Oceanic Engineering, 27(3):610-618, July 2002. [116] E. Chassande-Mottin, F. Auger, and P. Flandrin. "Supervised time-frequency reassignment". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency 6/Time-Scale Analysis, pages 517-520, Paris, 18-21 June 1996. [117] E. Chassande-Mottin, I. Daubechies, F. Auger, and P. Flandrin. "Differential reassignment". IEEE Signal Processing Letters, 4(10):293-294, October 1997. [118] E. Chassande-Mottin and P. Flandrin. "On the time-frequency detection of chirps". Applied C_f Computational Harmonic Analysis, 6(2):252-281, March 1999. [119] V. C. Chen and H. Ling. Time-Frequency Transforms for Radar Imaging and Signal Analysis. Scitech Publishing, 2002. [120] H.-I. Choi and W. J. Williams. "Improved time-frequency representation of multicomponent signals using exponential kernels". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 37(6):862-871, June 1989. [121] T. A. C. M. Claasen and W. F. G. Mecklenbrguker. "The Wigner Distribution-A tool for time-frequency signal analysis". Philips Y. of Research, 35:217-250 (Part 1), 276-300 (Part 2) & 372-389 (Part 3), 1980. [122] T. A. C. M. Claasen and W. F. G. Mecklenbrguker. "The Wigner Distribution~A tool for time-frequency signal analysis; Part 1: Continuous-time signals". Philips Y. of Research, 35(3):217-250, 1980. [123] T. A. C. M. Claasen and W. F. G. Mecklenbr~uker. "The Wigner Distribution~A tool for time-frequency signal analysis; Part 3: Relations with other time-frequency signal transformations". Philips J. of Research, 35(6):372-389, 1980. [124] M. Coates and W. J. Fitzgerald. "Time-frequency signal decomposition using energy mixture models". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), volume 2, pages 633-636, Istanbul, 5-9 June 2000. [125] F. S. Cohen, S. Kadambe, and G. F. Boudreaux-Bartels. "Tracking of unknown nonstationary chirp signals using unsupervised clustering in the Wigner distribution space". IEEE Trans. Signal Processing, 41(11):3085-3101, November 1993. [126] L. Cohen. "Generalized phase-space distribution functions". Physics, 7(5):781-786, May 1966.
J. of Mathematical
[127] L. Cohen. "Time-frequency distributions--A review". Proc. IEEE, 77(7):941-981, July 1989. Invited paper. [128] L. Cohen. "Distributions concentrated along the instantaneous frequency". In Proc. SPIE: Advanced Signal-Processing Algorithms, Architectures, and Implementations, volume 1348, pages 149-157. Soc. of Photo-optical Instrumentation Engineers, San Diego, 10-12 July 1990. [129] L. Cohen. "Introduction: A primer on time-frequency analysis". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 1, pages 342. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [130] L. Cohen. "The scale representation". IEEE Trans. Signal Processing, 41(12):32753292, December 1993.
694
Time-Frequency Bibliography
[131] L. Cohen. Time-Frequency Analysis. Prentice-Hall, Englewood Cliffs, NJ, 1995. [132] L. Cohen. "Wavelet moments and time-frequency analysis". In Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations IX, volume 3807, pages 434-445. Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 19-21 July 1999. [133] A. H. Costa and G. F. Boudreax-Bartels. "An overview of aliasing errors in discretetime formulations of time-frequency distributions". IEEE Trans. Signal Processing, 47(5):1463-1474, May 1999. [134] H. Cox. "Line array performance when the signal coherence is spatially dependent". J. Acoustical Soc. of America, 54:1743-1746, July 1973. [135] G. Cristdbal, J. BescSs, and J. Santamarfa. "Image analysis through the Wigner distribution function". Applied Optics, 28(2):262-271, January 1989. [136] G. Cristdbal and J. Hormigo. "Texture segmentation through eigen-analysis of the Pseudo-Wigner distribution". Pattern Recognition Letters, 20:337-345, 1999. [137] C. S. Cunningham and W. J. Williams. "Fast implementations of generalized discrete time-frequency distributions". IEEE Trans. Signal Processing, 42(6):1496-1508, June 1994. [138] G. S. Cunningham and W. J. Williams. "Kernel decomposition of time-frequency distributions". IEEE Trans. Signal Processing, 42(6):1425-1442, June 1994. [139] R. N. Czerwinski and D. L. Jones. "Adaptive cone-kernel time-frequency analysis". IEEE Trans. Signal Processing, 43(7):1715-1719, July 1995. [140] I. Daubechies. "Time-frequency localization operators: A geometric phase space approach". IEEE Trans. Information Theory, 34:605-612, July 1988. [141] I. Daubechies. "The wavelet transform, time-frequency localization and signal analysis". IEEE Trans. Information Theory, 36(5):961-1005, September 1990. [142] I. Daubechies. "The wavelet transform: A method for time-frequency localization". In S. Haykin, editor, Advances in Spectrum Analysis and Array Processing, volume 1, chapter 8, pages 366-417. Prentice-Hall, Englewood Cliffs, NJ, 1991. [143] M. Davy, C. Doncarli, and G. F. Boudreaux-Bartels. "Improved optimization of timefrequency-based signal classifiers". IEEE Signal Processing Letters, 8(2):52-57, February 2001. [144] M. Davy and A. Doucet. "Copulas: A new insight into positive time-frequency distributions". IEEE Signal Processing Letters, 10(7):215-218, July 2003. [145] N. G. de Bruijn. "A theory of generalized functions, with applications to Wigner distribution and Weyl correspondence". Nieuw Archief voor Wiskunde (3), 21:205-280, 1973. [146] S. R. Deans. "Radon and Abel transforms". In A. D. Poularikas, editor, The Transforms and Applications Handbook, chapter 8. CRC Press, Boca Raton, FL, 1st edition, 1996. [147] C. S. Detka and A. E1-Jaroudi. "The transitory evolutionary spectrum". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), volume 4, pages 289-292, Adelaide, Australia, 19-22 April 1994. [148] I. Djurovi(~, V. Katkovnik, and L. Stankovi5. "Median filter based realizations of the robust time-frequency distributions". Signal Processing, 81(8):1771-1776, August 2001.
Time-Frequency Bibliography
695
[149] I. Djurovi5 and L. Stankovi6. "Influence of high noise on the instantaneous frequency estimation using quadratic time-frequency distributions". IEEE Signal Processing Letters, 7(11):317-319, November 2000. [150] I. Djurovi5 and L. StankoviS. "Robust Wigner distribution with application to the instantaneous frequency estimation". IEEE Trans. Signal Processing, 49(12):2985-2993, December 2001. [151] I. Djurovid, L. Stankovi5, and J. F. B6hme. "Estimates of the Wigner distribution in Gaussian noise environment". Archiv fiir Elektronik und Obertragungstechnik (Internat. J. of Electronics ~ Communications), 56(5):337-340, 2002. [152] I. Djurovi6 and S. StankoviS. "Estimation of time-varying velocities of moving objects by time-frequency representations". IEEE Trans. Image Processing, 12(5):550-562, May 20O3. [153] R. Du. "Engineering monitoring and diagnosis using wavelet transforms". In C. T. Leondes, editor, Computer-Aided Design, Engineering, and Manufacturing: Systems
Techniques and A pplications--Vol. I: Systems Techniques and Computational Methods, chapter 8. CRC Press, 2000. [154] P. Duvaut and D. Declerq. "Statistical properties of the pseudo-Wigner-Ville representation of normal random processes". Signal Processing, 75(1):93-98, 5 January 1999. [155] G. Eichmann and N. M. Marinovich. "Scale-invariant Wigner distribution". In Proc. SPIE: Analog Optical Processing and Computing, volume 519, pages 18-25. Soc. of Photooptical Instrumentation Engineers, Cambridge, MA, 25-26 October 1984. [156] S. EI-Khamy, S. E. Shaaban, and E. A. Thabet. "Frequency-hopped multi-user chirp modulation (FH/M-CM) for multipath fading channels". In Proc. Sixteenth National Radio Science Conference (NRSC'99), pages C6/1-8, Ain Shams Univ., Cairo, 23-25 February 1999. [157] M. K. Emresoy and A. E1-Jaroudi. "Iterative instantaneous frequency estimation and adaptive matched spectrogram". Signal Processing, 64(2):157-65, January 1998. [158] S. Farkash and S. Raz. "Linear systems in Gabor time-frequency space". Trans. Signal Processing, 42(3):611-617, March 1994.
IEEE
[159] H. G. Feichtinger and T. Strohmer, editors. Gabor Analysis and Algorithms: Theory and Applications. Birkh/~user, Berlin/Boston, 1998. [160] H. G. Feichtinger and T. Strohmer. Advances in Gabor Analysis. Birkhafiser, 2002. [161] B. Ferguson. "A ground based narrow-band passive acoustic technique for estimating the altitude and speed of a propeller driven aircraft". J. Acoustical Soc. of America, 92(3):1403-1407, September 1992. [162] B. G. Ferguson. "Time-frequency signal analysis of hydrophone data". IEEE J. of Oceanic Engineering, 21(4):537-544, October 1996. [163] B. G. Ferguson and K. W. Lo. "Transiting aircraft parameter estimation using underwater acoustic sensor data". IEEE J. of Oceanic Engineering, 24(4):424-435, October 1999. [164] B. G. Ferguson and B. G. Quinn. "Application of the short-time Fourier transform and the Wigner-Ville distribution to the acoustic localization of aircraft". J. Acoustical Soc. of America, 96:821-827, 1994.
696
Time-Frequency Bibliography
[165] A. C. A. Figueiredo and J. P. S. Bizarro. "Time-frequency images of magnetohydrodynamic phenomena in tokamak plasmas using a discrete-time Wigner distribution". IEEE Trans. Plasma Science, 30(1):54-55, February 2002. [166] P. Flandrin. "Some features of time-frequency representations of multicomponent signals". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'8~), volume 3, pages 41B.4.1-41B.4.4, San Diego, 19-21 March 1984. [167] P. Flandrin. "A time-frequency formulation of optimum detection". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 36(9):1377-1384, September 1988. [168] P. Flandrin. "Time-dependent spectra for nonstationary stochastic processes". In C. Longo and B. Picinbono, editors, Time and Frequency Representation of Signals and Systems, pages 69-124. Springer, Vienna, 1989. [169] P. Flandrin. Time-Frequency//Time-Scale Analysis. Academic Press, San Diego, 1999. Original French edition: Temps-frdquence (Paris: Hermes, 1993). [170] P. Flandrin, R. G. Baraniuk, and O. Michel. "Time-frequency complexity and information". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), volume 3, pages 329-332, Adelaide, Australia, 19-22 April 1994. [171] P. Flandrin, E. Chassande-Mottin, and P. Abry. "Reassigned scalograms and their fast algorithms". In Proc. SPIE: Wavelet Applications in Signal and Image Processing III, volume 2569, pages 152-158. Soc. of Photo-optical Instrumentation Engineers, San Diego, CA, 12-14 July 1995. [172] P. Flandrin and P. Gonc;alv~s. "Geometry of affine time-frequency distributions". Applied ~ Computational Harmonic Analysis, 3:10-39, January 1996. [173] P. Flandrin and W. Martin. "The Wigner-Ville spectrum of nonstationary random signals". In W. Mecklenbr~uker and F. Hlawatsch, editors, The Wigner Distribution-Theory and Applications in Signal Processing, pages 211-267. Elsevier, Amsterdam, 1997. [174] G. B. Folland.
Harmonic Analysis in Phase Space. Number 122 in ANNALS OF
MATHEMATICS STUDIES. Princeton University Press, Princeton, N J, 1989.
[175] B. D. Forrester. "Time-frequency analysis in machine fault detection". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 18, pages 406-423. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [176] W. L. J. Fox, J. C. Luby, J. W. Pitton, P. J. Loughlin, and L. E. Atlas. "Sonar and radar range-Doppler processing using a cone-shaped kernel time-frequency representation". In Proc. 2~th Asilomar Conf. on Signals, Systems, and Computers, volume 2, pages 1079-1083, Pacific Grove, CA, 5-7 November 1990. [177] G. J. Frazer and B. Boashash. "Multiple window spectrogram and time-frequency distributions". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'9~), volume IV, pages 293-296, Adelaide, Australia, 19-22 April 1994. [178] D. Friedman. "Instantaneous frequency distribution vs. time: An interpretation of the phase structure of speech". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'85), pages 1121-1124, Tampa, FL, 26-29 March 1985. [179] D. Gabor. "Theory of communication", g. IEE, 93(III)(26):429-457, November 1946. [180] R. G. C-allager. Information Theory and Reliable Communication. Wiley, New York, 1968.
Time-Frequency Bibliography
697
[181] M. A. Garcfa-Pdrez and V. Sierra-Vgzquez. "Visual processing in the joint spatial/spatial-frequency domain". In E. Peli, editor, Vision Models for Target Detection and Recognition: In Memory of Arthur Menendez, chapter 2, pages 16-62. World Scientific Publishing, 1995. [182] W. A. Gardner, editor. Cyclostationarity in Communications and Signal Processing. IEEE Press, Piscataway, N J, 1995. [183] N. L. Gerr. "Introducing a third-order Wigner distribution". Proc. IEEE, 76(3):290292, March 1988. [184] A. B. Gershman and M. G. Amin. "Wideband direction-of-arrival estimation of multiple chirp signals using spatial time-frequency distributions". IEEE Signal Processing Letters, 7(6):152-155, June 2000. [185] A. B. Gershman, L. Stankovid, and V. Katkovnik. "Sensor array signal tracking using a data-driven window approach". Signal Processing, 80(12):2507-2515, December 2000. [186] A. Gersho and R. M. Gray. Vector Quantization and Signal Compression. Kluwer, 1991. [187] S. Ghaemmaghami, M. Deriche, and B. Boashash. "Hierarchical approach to formant detection and tracking through instantaneous frequency estimation", lEE Electronics Letters, 33(1):17-18, January 1997. [188] S.C. Glinski. "Diphone speech synthesis based on a pitch-adaptive short-time Fourier transform". Master's thesis, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 1981. [189] G. H. Golub and C. F. Van Loan. Matrix computations. Johns Hopkins University Press, Baltimore, MD, 1989. [190] P. Gon(;alv??s and R. G. Baraniuk. "Pseudo affine Wigner distributions: Definition and kernel formulation". IEEE Trans. Signal Processing, 46(6):1505-1517, June 1998. [191] E. Grall-Maes and P. Beauseroy. "Mutual information-based feature extraction on the time-frequency plane". IEEE Trans. Signal Processing, 50(4):779-790, April 2002. [192] S. Gu, J. Ni, and J. Yuan. "Non-stationary signal analysis and transient machining process condition monitoring". Internat. J. Machine Tools ~3 Manufacture, 42(1):41-51, January 2002. [193] P. Guillemain and R. Kronland-Martinet. "Horizontal and vertical ridges associated to continuous wavelet transforms". In Proc. IEEE-SP Internat. Symp. on Time-Frequency 83 Time-Scale Analysis, pages 63-66, Victoria, BC, 4-6 October 1992. [194] C. Gupta and A. Papandreou-Suppappola. "Wireless CDMA communications using time-varying signals". In Proc. Sixth Inter'nat. Syrup. on Signal Processing and its Applications (ISSPA '01), volume 1, pages 242-245, Kuala Lumpur, 13-16 August 2001. [195] S. L. Hahn. "Multidimensional complex signals with single-orthant spectra". Proc. IEEE, 80(8):1287-1300, August 1992. [196] S. L. Hahn and K. M. Snopek. "Double-dimensional distributions: Another approach to "quartic" distributions". IEEE Trans. Signal Processing, 50(12):2987-2997, December 2002.
(598
Time-Frequency Bibliography
[197] A. Hanssen and L. L. Scharf. "A theory of polyspectra for nonstationary stochastic processes". IEEE Trans. Signal Processing, 51(5):1243-1252, May 2003. [198] H. Hassanpour, M. Mesbah, and B. Boashash. "Comparative performance of timefrequency based newborn EEG seizure detection using spike signatures". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'03), volume 2, pages 389-392, Hong Kong, scheduled 6-10 April 2003. [199] S. B. Hearon and M. G. Amin. "Minimum-variance time-frequency distribution kernels". IEEE Trans. Signal Processing, 43(5):1258-1262, May 1995. [200] C. E. Hell and D. F. Walnut. "Continuous and discrete wavelet transforms". SIAM Review, 31(4):628-666, December 1989. [201] C. W. Helstrom. "An expansion of a signal in Gaussian elementary signals". IEEE Trans. Information Theory, 12:81-82, January 1966. [202] C.W. Helstrom. The Statistical Theory of Signal Detection. Oxford University Press, New York, 1968. [203] F. B. Hildebrand. Advanced Calculus for Engineers. Prentice-Hall, New York, 1949. [204] J. Hilgevoord and J. Uffink. "The mathematical expression of the uncertainty principle". In A. van der Merwe, G. Tarozzi, and F. Selleri, editors, Microphysical Reality and Quantum Formalism, number 25-26 in FUNDAMENTAL THEORIES OF PHYSICS, pages 91-114. Kluwer, Dordrecht, 1988. Proc. of the Conf. at Urbino, Italy, 25 Sep. to 3 Oct., 1985. [205] F. Hlawatsch. "Interference terms in the Wigner distribution". In Proc. Internat. Conf. on Digital Signal Processing, pages 363-367, Florence, Italy, 5-8 September 1984. [206] F. Hlawatsch. Time-Frequency Analysis and Synthesis of Linear Signal Spaces: Time-Frequency Filters, Signal Detection and Estimation, and Range-Doppler Estimation. Kluwer, Boston, 1998. [207] F. Hlawatsch and H. BSlcskei. "Unified theory of displacement-covariant timefrequency analysis". In Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ Time-Scale Analysis, pages 524-527, Philadelphia, PA, 25-28 October 1994. [208] F. Hlawatsch and G. F. Boudreaux-Bartels. "Linear and quadratic time-frequency signal representations". IEEE Signal Processing Magazine, 9(2):21-67, April 1992. [209] F. Hlawatsch and P. Flandrin. "The interference structure of the Wigner distribution and related time-frequency signal representations". In W. Mecklenbr~iuker and F. Hlawatsch, editors, The Wigner Distribution wTheory and Applications in Signal Processing, pages 59-133. Elsevier, Amsterdam, 1997. [210] F. Hlawatsch and G. Matz. "Quadratic time-frequency analysis of linear time-varying systems". In L. Debnath, editor, Wavelet Transforms and Time-Frequency Signal Analysis, chapter 9, pages 235-287. Birkh~iuser, Boston, 2001. [211] F. Hlawatsch, G. Matz, H. Kirchauer, and W. Kozek. "Time-frequency formulation, design, and implementation of time-varying optimal filters for signal estimation". IEEE Trans. Signal Processing, 48(5):1417-1432, May 2000. [212] F. Hlawatsch, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels. "The hyperbolic class of quadratic time-frequency representations--Part II: Subclasses, intersection with the affine and power classes, regularity, and unitarity". IEEE Trans. Signal Processing, 45(2):303-315, February 1997.
Time-Frequency Bibliography
699
[213] F. Hlawatsch, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels. "The power classes--Quadratic time-frequency representations with scale covariance and dispersive time-shift covariance". IEEE Trans. Signal Processing, 47(11):3067-3083, November 1999. [214] F. Hlawatsch, G. TaubSck, and T. Twaroch. "Covariant time-frequency analysis". In L. Debnath, editor, Wavelets and Signal Processing. Birkhguser, Boston, 2003. [215] F. Hlawatsch and T. Twaroch. "Covariant (c~,fl), time-frequency, and (a, b)representations". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pages 437-440, Paris, 18-21 June 1996. [216] F. Hlawatsch and R. L. Urbanke. "Bilinear time-frequency representations of signals: The shift-scale invariant class". IEEE Trans. Signal Processing, 42:357-366, 1994. [217] J. Hormigo and G. Cristdbal. "High resolution spectral analysis of images using the pseudo-Wigner distribution". IEEE Trans. Signal Processing, 46(6):1757-1763, June 1998. [218] C. Hory, N. Martin, and A. Chehikian. "Spectrogram segmentation by means of statistical features for non-stationary signal interpretation". IEEE Trans. Signal Processing, 50( 12):2915-2925, December 2002. [219] Y. Hu, K. D. Luk, W. W. Lu, and J. C. Leong. "Comparison of time-frequency analysis techniques in intraoperative somatosensory evoked potential (SEP) monitoring". Computers in Biology ~ Medicine, 32(1):13-23, January 2002. [220] P. J. Huber. Robust statistics. Wiley, 1981. [221] Z. M. Hussain and B. Boashash. "Adaptive instantaneous frequency estimation of multicomponent FM signals". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), volume II, pages 657-660, Istanbul, 5-9 June 2000. [222] Z. M. Hussain and B. Boashash. "Multi-component IF estimation". In Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pages 559-563, Pocono Manor, PA, 14-16 August 2000. [223] Z. M. Hussain and B. Boashash. "Design of time-frequency distributions for amplitude and IF estimation of multicomponent signals". In Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), volume 1, pages 339-342, Kuala Lumpur, 13-16 August 2001. [224] Z. M. Hussain and B. Boashash. "Adaptive instantaneous frequency estimation of multicomponent FM signals using quadratic time-frequency distributions". IEEE Trans. Signal Processing, 50(8):1866-1876, August 2002. [225] Q. Q. Huynh, L. N. Cooper, N. Intrator, and H. Shouval. "Classification of underwater mammals using feature extraction based on time-frequency analysis and BCM theory". IEEE Trans. Signal Processing, 46(5):1202-1207, May 1998. [226] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels. "New concepts in narrowband and wideband Weyl correspondence time-frequency techniques". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'98), volume 3, pages 1573-1576, Seattle, 12-15 May 1998. [227] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels. "A wideband time-frequency Weyl symbol and its generalization". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pages 29-32, Pittsburgh, PA, 6-9 October 1998.
700
Time-Frequency Bibliography
[228] B. G. Iem, A. Papandreou-Suppappola, and G. F. Boudreaux-Bartels. "Classes of smoothed Weyl symbols". IEEE Signal Processing Letters, 7(7):186-188, July 2000. [229] J. Imberger and B. Boashash. "Application of the Wigner-Ville distribution to temperature gradient microstructure: A new technique to study small-scale variations". J. of Physical Oceanography, 16:1997-2012, December 1986. [230] L. Jacobson and H. Wechsler. "Joint spatial/spatial-frequency representation". Signal Processing, 14:37-68, 1988. [231] J. S. Jaffe, G. Chandran, and E. Reuss. "High frequency acoustic imaging in the ocean". In Proc. IEEE Inter'nat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), volume 5, pages 2793-2796, Detroit, 9-12 May 1995. [232] A. Jakobsson, S. L. Marple Jr., and P. Stoica. "Computationally efficient twodimensional Capon spectrum analysis". IEEE Trans. Signal Processing, 48(9):2651-2661, September 2000. [233] J. Jeong and W. J. Williams. "Alias-free generalised discrete-time time-frequency distributions". IEEE Trans. Signal Processing, 40(11):2757-2765, November 1992. [234] J. Jeong and W. J. Williams. "Kernel design for reduced interference distributions". IEEE Trans. Signal Processing, 40(2):402-412, February 1992. [235] D. L. Jones and R. G. Baraniuk. "A simple scheme for adapting time-frequency representations". IEEE Trans. Signal Processing, 42(12):3530-3535, December 1994. [236] D. L. Jones and R. G. Baraniuk. "Adaptive optimal-kernel time-frequency representation". IEEE Trans. Signal Processing, 43(10):2361-2371, October 1995. [237] D. L. Jones and T. W. Parks. "A high-resolution data-adaptive time-frequency representation". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 38(12):2127-2135, December 1990. [238] D. L. Jones and T. W. Parks. "A resolution comparison of several time-frequency representations". IEEE Trans. Signal Processing, 40(2):413-420, February 1992. [239] G. Jones and B. Boashash. "Generalized instantaneous parameters and window matching in the time-frequency plane". IEEE Trans. Signal Processing, 45(5):1264-1275, May 1997. [240] S. M. Joshi and J. M. Morris. "Some results on product-function frames". Signal Processing, 80(4):737-740, April 2000. [241] S. Kadambe and T. Adali. "Application of cross-term deleted Wigner representation (CDWR) for sonar target detection/classification". In Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, volume 1, pages 822-826, Pacific Grove, CA, 1-4 November 1998. [242] T. A. Kadous and A. M. Sayeed. "Decentralized multiuser detection for time-varying multipath channels". IEEE Trans. Communications, 48:1840-1852, November 2000. [243] M. H. Kahaei, A. M. Zoubir, B. Boashash, and M. Deriche. "Tracking behaviour of lattice filters for linear and quadratic FM signals". In Wysocki et al. [506], pages 207-214. [244] S. A. Kassam. Signal detection in non-Gaussian noise. Springer, 1988. [245] S. A. Kassam and H. V. Poor. "Robust techniques for signal processing: A survey". Proc. IEEE, 73(3):433-481, March 1985.
Time-Frequency Bibliography [246] V. Katkovnik. "Robust M-periodogram". 46(11):3104-3109, November 1998.
701
IEEE Trans. Signal Processing,
[247] V. Katkovnik. "Robust M-estimates of the frequency and amplitude of a complexvalued harmonic". Signal Processing, 77(1):71-84, August 1999. [248] V. Katkovnik and L. Stankovi~. "Instantaneous frequency estimation using the Wigner distribution with varying and data-driven window length". IEEE Trans. Sighal Processing, 46(9):2315-2325, September 1998. [249] V. Katkovnik and L. StankoviS. "Periodogram with varying and data-driven window length". Signal Processing, 67(3):345-358, 30 June 1998. [250] J. Kay and R. Lerner. Lectures in Communications Theory. McGraw-Hill, 1961. [251] A. S. Kayhan and M. G. Amin. "Spatial evolutionary spectrum for DOA estimation and blind signal separation". IEEE Trans. Signal Processing, 48(3):791-798, March 2000. [252] E. J. Kelly and R. P. Wishner. "Matched filter theory for high-velocity, accelerating targets". IEEE Trans. Military Electronics, 9:56-59, 1965. [253] R. S. Kennedy. Fading dispersive communication channels. Wiley, New York, 1969. [254] O. P. Kenny and B. Boashash. "Time-frequency analysis of backscattered signals from diffuse radar targets". IEE Proc., Part F: Radar ~ Signal Processing, 140(3):198208, June 1993. [255] H. A. Khan and L. F. Chaparro. "Nonstationary Wiener filtering based on evolutionary spectral theory". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), volume 5, pages 3677-3680, Munich, 21-24 April 1997. [256] H. Kirchauer, F. Hlawatsch, and W. Kozek. "Time-frequency formulation and design of nonstationary Wiener filters". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), pages 1549-1552, Detroit, 9-12 May 1995. [257] J. G. Kirkwood. "Quantum statistics of almost classical ensembles". Physics Review, 44:31-37, 1933. [258] L. Knockaert. "A class of positive isentropic time-frequency distributions". IEEE Signal Processing Letters, 9(1):22-25, January 2002. [259] K. Kodera, C. de Villedary, and R. Gendrin. "A new method for the numerical analysis of nonstationary signals". Physics of the Earth ~ Planetary Interiors, 12:142150, 1976. [260] W. Koenig, H. K. Dunn, and L. Y. Lacy. "The sound spectrograph". J. Acoustical Soc. of America, 18(1):19-49, 1946. [261] D. KSnig. "Application of time-frequency analysis for optimum non-equidistant sampling of automotive signals captured at knock". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'96), volume 5, pages 2746-2749, Atlanta, CA, 7-10 May 1996. [262] P. J. Kootsookos, B. C. Lovell, and B. Boashash. "A unified approach to the STFT, TFDs, and instantaneous frequency". IEEE Trans. Signal Processing, 40(8):1971-82, August 1992. [263] W. Kozek. "On the generalized Weyl correspondence and its application to timefrequency analysis of linear time-varying systems". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pages 167-170, Victoria, BC, 4-6 October 1992.
702
Time-Frequency Bibliography
[264] W. Kozek. "Time-frequency signal processing based on the Wigner-Weyl framework". Signal Processing, 29(1):77-92, October 1992. [265] W. Kozek. "On the transfer function calculus for underspread LTV channels". IEEE Trans. Signal Processing, 45(1):219-223, January 1997. [266] W. Kozek, H. G. Feichtinger, and J. Scharinger. "Matched multiwindow methods for the estimation and filtering of nonstationary processes". In Proc. IEEE Internat. Syrup. on Circuits and Systems (ISCAS 96), volume 2, pages 509-512, Atlanta, GA, 12-15 May 1996. [267] W. Kozek and F. Hlawatsch. "A comparative study of linear and nonlinear timefrequency filters". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pages 163-166, Victoria, BC, 4-6 October 1992. [268] W. Kozek, F. Hlawatsch, H. Kirchauer, and U. Trautwein. "Correlative timefrequency analysis and classification of nonstationary random processes". In Proc. IEEESP Internat. Symp. on Time-Frequency gJ Time-Scale Analysis, pages 417-420, Philadelphia, PA, 25-28 October 1994. [269] W. Kozek and A. F. Molisch. "On the eigenstructure of underspread WSSUS channels". In Proc. IEEE-SP Workshop on Signal Processing Advances in Wireless Communications (SPA WC'97), pages 325-328, Paris, 16-18 April 1997. [270] W. Kozek and A. F. Molisch. "Nonorthogonal pulseshapes for multicarrier communications in doubly dispersive channels". IEEE J. on Selected Areas in Communications, 16:1579-1589, October 1998. [271] S. Krishnamachari and W. J. Williams. "Adaptive kernel design in the generalized marginals domain for time-frequency analysis". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'94), volume 3, pages 341-344, Adelaide, Australia, 19-22 April 1994. [272] S. R. Kulkarni, G. Lugosi, and S. S. Venkatesh. "Learning pattern classification--A survey". IEEE Trans. Information Theory, 44(6):2178-2206, October 1998. [273] R. Kumaresan and S. Verma. "On estimating the parameters of chirp signals using rank reduction techniques". In Proc. 21st Asilomar Conf. on Signals, Systems, and Computers, pages 555-558, Pacific Grove, CA, 2-4 November 1987. [274] S. Lach, M. G. Amin, and A. R. Lindsey. "Broadband nonstationary interference excision in spread-spectrum communications using time-frequency synthesis techniques". IEEE J. on Selected Areas in Communications, 17(4):704-714, April 1999. [275] F. Lari and A. Zakhor. "Automatic classification of active sonar data using timefrequency transforms". In Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ TimeScale Analysis, pages 21-24, Victoria, BC, 4-6 October 1992. [276] S. K. Lee and P. R. White. "Higher-order time-frequency analysis and its application to fault detection in rotating machinery". J. Mechanical Systems and Signal Processing, 11(4):637-650, July 1997. [277] M. J. Levin. "Instantaneous spectra and ambiguity functions". IEEE Trans. Information Theory, 10:95-97, January 1964. [278] A. R. Leyman, Z. M. Kamran, and K. Abed-Meraim. "Higher-order time frequencybased blind source separation technique". IEEE Signal Processing Letters, 7(7):193-196, July 2000.
Time-Frequency Bibliography
703
[279] S. Li and D. M. Healy, Jr. "A parametric class of discrete Gabor expansions". IEEE Trans. Signal Processing, 44(2):201-211, February 1996. [280] E. H. Lieb. "Integral bounds for radar ambiguity functions and Wigner distributions". J. of Mathematical Physics, 31(3):594-599, March 1990. [281] J. S. Lim and A. V. Oppenheim. "Enhancement and bandwidth compression of noisy speech". Proc. IEEE, 67(12):1586-1604, December 1979. [282] N. Linh-Trung, A. Belouchrani, K. Abed-Meraim, and B. Boashash. "Separating more sources than sensors using time-frequency distributions". In Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), volume 2, pages 583-586, Kuala Lumpur, 13-16 August 2001. [283] K. W. Lo and B. G. Ferguson. "Passive estimation of aircraft motion parameters using destructive interference between direct and ground-reflected sound waves". In Proc. Information Decision ~ Control 99, pages 171-176. IEEE, Adelaide, Australia, February 1999. [284] K. W. Lo, S. W. Perry, and B. G. Ferguson. "Aircraft flight parameter estimation using acoustical Lloyd's mirror effect". IEEE Trans. Aerospace ~ Electronic Systems, 38(1):137-151, January 2002. [285] A. W. Lohmann, D. Mendlovic, and Z. Zalevsky. "Fractional transformations in optics". In E. Wolf, editor, Progress in Optics m Vol. 38, chapter 4, pages 263-342. Elsevier, Amsterdam, 1998. [286] A. W. Lohmann and B. H. Softer. "Relationships between the Radon-Wigner and fractional Fourier transforms". J. Optical Soc. of America A, 11(6):1798-1801, June 1994. [287] C. T. Lombroso. "Neonatal EEG polygraphy in normal and abnormal newborns". In E. Niedermeyer and F. H. Lopes da Silva, editors, Electroencephalography: Basic Principles, Clinical Applications, and Related Fields, pages 803-875. Williams & Wilkins, Baltimore, MD, 3rd edition, 1993. [288] P. Loughlin and B. Tacer. "On the amplitude and frequency-modulation decomposition of signals". Y. Acoustical Soc. of America, 100:1594-1601, September 1996. [289] P. J. Loughlin, J. W. Pitton, and L. E. Atlas. "Bilinear time-frequency representations: New insights and properties". IEEE Trans. Signal Processing, 41:750-767, 1993. [290] J. G. Lourens. "Passive sonar detection of ships with spectro-grams". In Proc. 3rd South African IEEE Conference on Theoretical and Practical Work in Communications and Signal Processing, pages 147-151, Johannesburg, June 1990. [291] B. Lovell, R. C. Williamson, and B. Boashash. "The relationship between instantaneous frequency and time-frequency representations". IEEE Trans. Signal Processing, 41(3):1458-1461, March 1993. [292] N. Ma, D. Vray, P. Delachartre, and G. Gimenez. "Sea-bottom backscattering modeling with a wideband constant beamwidth sonar at normal incidence". In Proc. IEEE Ultrasonics Symposium, volume 2, pages 1077-1080, Seattle, 7-10 November 1995. [293] S. Maes. "The synchrosqueezed representation yields a new reading of the wavelet transform". In Proc. SPIE: Wavelet Applications II, volume 2491, pages 532-559. Soc. of Photo-optical Instrumentation Engineers, Orlando, FL, 17-21 April 1995.
704
Time-Frequency Bibliography
[294] I. Magrin-Chagnolleau, G. Durou, and F. Bimbot. "Application of time-frequency principal component analysis to text-independent speaker identification". IEEE Trans. on Speech ~ Audio Processing, 10(6):371-378, September 2002. [295] S. G. Mallat. A Wavelet Tour of Signal Processing. Academic Press, San Diego, 1st edition, 1998. [296] S. G. Mallat. A Wavelet Tour of Signal Processing. Academic Press, San Diego / London, 2nd edition, 1999. [297] S. G. Mallat and Z. Zhang. "Matching pursuits with time-frequency dictionaries". IEEE Trans. Signal Processing, 41(12):3397-3415, December 1993. [298] H. Margenau and R. N. Hill. "Correlation between measurements in quantum theory". Progress of Theoretical Physics, 26:722-738, 1961. [299] S. L. Marple Jr. Digital Spectral Analysis with Applications. Prentice-Hall, Englewood Cliffs, N J, 1987. [300] S. L. Marple Jr. "Computing the discrete-time "analytic" signal via FFT". IEEE Trans. Signal Processing, 47(9):2600-2603, September 1999. [301] S. L. Marple Jr. "Two-dimensional lattice linear prediction parameter estimation method and fast algorithm". IEEE Signal Processing Letters, 7(6):164-168, June 2000. [302] S. L. Marple Jr. and T. Brotherton. "Detection and classification of short duration underwater acoustic signals by Prony's method". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'91), volume 2, pages 1309-1312, Toronto, 14-17 May 1991. [303] W. Martin. "Time-frequency analysis of random signals". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'82), volume 3, pages 13251328, Paris, 3-5 May 1982. [304] W. Martin and P. Flandrin. "Wigner-Ville spectral analysis of nonstationary processes". IEEE Trans. Acoustics, Speech, ~J Signal Processing, 33(6):1461-1470, December 1985. [305] G. Matz and F. Hlawatsch. "Time-frequency formulation and design of optimal detectors". In Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ Time-Scale Analysis, pages 213-216, Paris, 18-21 June 1996. [306] G. Matz and F. Hlawatsch. "Time-frequency methods for signal detection with application to the detection of knock in car engines". In Proc. Ninth IEEE Workshop on Statistical Signal and Array Processing (SSAP-98), pages 196-199, Portland, OR, 14-16 September 1998. [307] G. Matz and F. Hlawatsch. "Time-frequency transfer function calculus (symbolic calculus) of linear time-varying systems (linear operators) based on a generalized underspread theory". J. of Mathematical Physics, 39(8):4041-4070, August 1998. Special Issue on Wavelet and Time-Frequency Analysis. [308] G. Matz and F. Hlawatsch. "Time-varying spectra for underspread and overspread nonstationary processes". In Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, pages 282-286, Pacific Grove, CA, 1-4 November 1998. [309] G. Matz and F. Hlawatsch. "Minimax robust time-frequency filters for nonstationary signal estimation". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'99), pages 1333-1336, Phoenix, AZ, 15-19 March 1999.
Time-Frequency Bibliography
705
[310] G. Matz and F. Hlawatsch. "Time-frequency subspace detectors and application to knock detection". Archiv fiir Elektronik und (]bertragungstechnik (Internat. J. of Electronics ~r Communications), 53(6):379-385, December 1999. [311] G. Matz and F. Hlawatsch. "Minimax robust nonstationary signal estimation based on a p-point uncertainty model". J. Franklin Institute, 337(4):403-419, July 2000. [312] G. Matz and F. Hlawatsch. "Linear time-frequency filters: On-line algorithms and applications". In A. Papandreou-Suppappola, editor, Applications in Time-Frequency Signal Processing, chapter 6, pages 205-271. CRC Press, Boca Raton, FL, 2002. [313] G. Matz and F. Hlawatsch. "Time-frequency projection filters: Online implementation, subspace tracking, and application to interference excision". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), pages 1213-1216, Orlando, FL, 13-17 May 2002. [314] G. Matz, F. Hlawatsch, and W. Kozek. "Generalized evolutionary spectral analysis and the Weyl spectrum of nonstationary random processes". IEEE Trans. Signal Processing, 45(6):1520-1534, June 1997. [315] G. Matz, F. Hlawatsch, and A. Raidl. "Signal-adaptive robust time-varying Wiener filters: Best subspace selection and statistical analysis". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), pages 3945-3948, Salt Lake City, UT, 7-11 May 2001. [316] G. Matz and A. Raidl. "Robust detection of nonstationary random signals belonging to p-point uncertainty classes". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'03), pages 641-644, Hong Kong, scheduled 6-10 April 2003. [317] A. C. McBride and F. H. Kerr. "On Namias's fractional Fourier transforms". IMA J. of Applied Mathematics, 39(2):159-175, 1987. [318] M. R. McClure and L. Carin. "Matching pursuits with a wave-based dictionary". IEEE Trans. Signal Processing, 45(12):2912-2927, December 1997. [319] W. Mecklenbr~uker and F. Hlawatsch, editors. The Wigner Distribution--Theory and Applications in Signal Processing. Elsevier, Amsterdam, 1997. [320] M. Mesbah and B. Boashash. "Reduced bias time-frequency peak filtering". In Proc. Sixth Internat. Syrup. on Signal Processing and its Applications (ISSPA '01), volume 1, pages 327-330, Kuala Lumpur, 13-16 August 2001. [321] M. Mesbah and B. Boashash. "Performance comparison of seizure detection methods using EEG of newborns for implementation of a DSP subsystem". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'02), Orlando, FL, 13-17 May 2002. Paper no. 1932. [322] Y. Meyer. Wavelets: Algorithms and applications. Soc. for Industrial and Applied Mathematics, Philadelphia, PA, 1993. Translated and revised by Robert D. Ryan. Original French title: Ondelettes et algorithems concurrents. [323] Z.-H. Michalopoulou. "Underwater transient signal processing: Marine mammal identification, localization, and source signal deconvolution". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), volume 1, pages 503506, Munich, 21-24 April 1997. [324] R. K. Mobley. Vibration Fundamentals. Newnes, Boston, 1999.
706
Time-Frequency Bibliography
[325] A. Monti, C. Medigue, and L. Mangin. "Instantaneous parameter estimation in cardiovascular time series by harmonic and time-frequency analysis". IEEE Trans. Biomedical Engineering, 49(12):1547-1556, December 2002. [326] M. R. Morelande, B. Barkat, and A. M. Zoubir. "Statistical performance comparison of a parametric and a non-parametric method for IF estimation of random amplitude linear FM signals in additive noise". In Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pages 262-266, Pocono Manor, PA, 14-16 August 2000. [327] D. R. Morgan and T. M. Smith. "Coherence effects on the detection performance of quadratic array processors with application to large-array matched-field beamforming". J. Acoustical Soc. of America, 87(2):737-747, February 1990. [328] J. M. Morris and Y. Lu. "Generalized Cabor expansions of discrete-time signals in 12(Z) via biorthogonal-like sequences". IEEE Trans. Signal Processing, 44(6):1378-1391, June 1996. [329] S. H. Nawab and T. F. Quatieri. "Short-time Fourier transform". In J. S. Lim and A. V. Oppenheim, editors, Advanced Topics in Signal Processing, chapter 6, pages 289-337. Prentice-Hall, Englewood Cliffs, N J, 1988. [330] A. W. Naylor and G. R. Sell. Linear Operator Theory in Engineering and Science. Springer, New York, 2nd edition, 1982. [331] D. E. Newland. "Time-frequency and time-scale analysis by harmonic wavelets". In A. Prochazka, J. Uhlir, P. J. W. Rayner, and N. G. Kingsbury, editors, Signal Analysis and Prediction, chapter 1. Birkh~user, Boston, MA, 1998. [332] R. M. Nickel, T.-H. Sang, and W. J. Williams. "A new signal adaptive approach to positive time-frequency distributions with suppressed interference terms". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'98), volume 3, pages 1777-1780, Seattle, 12-15 May 1998. [333] A. H. Nuttall. "On the quadrature approximation to the Hilbert transform of modulated signals". Proc. IEEE, 54:1458-1459, 1966. [334] A. H. Nuttall. "Efficient evaluation of polynomials and exponentials of polynomials for equispaced arguments". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 35(10):1486-1487, October 1987. [335] S. C. Olhede and A. T. Walden. "Generalized Morse wavelets". IEEE Trans. Signal Processing, 50(11):2661-2670, November 2002. [336] P. M. Oliveira and V. Barroso. "Uncertainty in the time-frequency plane". In Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pages 607-611, Pocono Manor, PA, 14-16 August 2000. [337] J. C. O'Neill and W. J. Williams. "Shift-covariant time-frequency distributions of discrete signals". IEEE Trans. Signal Processing, 47(1):133-146, January 1999. [338] P. O'Shea. "An iterative algorithm for estimating the parameters of polynomial phase signals". In Proc. Fourth Internat. Symp. on Signal Processing and its Applications (ISSPA '96), volume 2, pages 730-731, Gold Coast, Australia, 25-30 August 1996. [339] A. Ouldali and M. Benidir. "Statistical analysis of polynomial phase signals affected by multiplicative and additive noise". Signal Processing, 78(1):19-42, October 1999.
Time-Frequency Bibliography
707
[340] H. M. Ozaktas, O. Arlkan, M. A. Kutay, and G. Bozda~l. "Digital computation of the fractional Fourier transform". IEEE Trans. Signal Processing, 44(9):2141-2150, September 1996. [341] H. M. Ozaktas, B. Barshan, D. Mendlovic, and L. Onural. "Convolution, filtering, and multiplexing in fractional Fourier domains and their relationship to chirp and wavelet transforms". J. Optical Soc. of America A, 11:547-559, February 1994. [342] H. M. Ozaktas, M. A. Kutay, and D. Mendlovic. "Introduction to the fractional Fourier transform and its applications". In P. W. Hawkes, B. Kazan, and T. Mulvey, editors, Advances in Imaging ~4 Electron Physics--Vol. 106, chapter 4, pages 239-291. Academic Press, San Diego, February 1999. [343] C. H. Page. "Instantaneous power spectra". J. of Applied Physics, 23(1):103-106, January 1952. [344] A. Papandreou, F. Hlawatsch, and G. F. Boudreaux-Bartels. "The hyperbolic class of quadratic time-frequency representations--Part I: Constant-Q warping, the hyperbolic paradigm, properties, and members". IEEE Trans. Signal Processing, 41(12):3425-3444, December 1993. Special Issue on Wavelets and Signal Processing. [345] A. Papandreou-Suppappola. "Generalized time-shift covariant quadratic timefrequency representations with arbitrary group delays". In Proc. 29th Asilomar Conf. on Signals, Systems, and Computers, pages 553-557, Pacific Grove, CA, October-November 1995. [346] A. Papandreou-Suppappola, editor. Applications in Time-Frequency Signal Processing. CRC Press, Boca Raton, FL, 2002. [347] A. Papandreou-Suppappola, F. Hlawatsch, and G. F. Boudreaux-Bartels. "Power class time-frequency representations: Interference geometry, smoothing, and implementation". In Proc. IEEE-SP Internat. Syrup. on Time-Frequency ~ Time-Scale Analysis, pages 193-196, Paris, 18-21 June 1996. [348] A. Papandreou-Suppappola, F. Hlawatsch, and G. F. Boudreaux-Bartels. "Quadratic time-frequency representations with scale covariance and generalized time-shift covariance: A unified framework for the affine, hyperbolic, and power classes". Digital Signal Processing: A Review Journal, 8(1):3-48, 1998. [349] A. Papandreou-Suppappola, R. L. Murray, B. G. Iem, and G. F. Boudreaux-Bartels. "Group delay shift covariant quadratic time-frequency representations". IEEE Trans. Signal Processing, 49(11):2549-2564, November 2001. [350] A. Papandreou-Suppappola and S. B. Suppappola. "Adaptive time-frequency representations for multiple structures". In Proc. Tenth IEEE Workshop on Statistical Signal and Array Processing (SSAP-2000), pages 579-583, Pocono Manor, PA, 14-16 August 2000. [351] A. Papandreou-Suppappola and S. B. Suppappola. "Analysis and classification of time-varying signals with multiple time-frequency structures". IEEE Signal Processing Letters, 9(3):92-95, March 2002. [352] A. Papoulis. Signal analysis. McGraw-Hill, New York, 1977. [353] A. Papoulis. "Random modulation: A review". IEEE Trans. Acoustics, Speech, gJ Signal Processing, 31(1):96-105, February 1983.
708
Time-Frequency Bibliography
[354] A. Papoulis. Probability, Random Variables, and Stochastic Processes. McGraw-Hill, New York, 3rd edition, 1991. [355] T. W. Parks and R. G. Shenoy. "Time-frequency concentrated basis functions". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'90), pages 2459-2462, Albuquerque, NM, 3-6 April 1990. [356] L. Parra and C. Spence. "Convolutive blind separation of non-stationary sources". IEEE Trans. on Speech ~ Audio Processing, 8(3):320-327, May 2000. [357] J. D. Parsons. 1992.
The Mobile Radio Propagation Channel. Pentech Press, London,
[358] B. A. Paya, I. I. Esat, and M. N. M. Badi. "Artificial neural network based fault diagnostics of rotating machinery using wavelet transforms as a preprocessor". J. Mechanical Systems and Signal Processing, 11(5):751-765, September 1997. [359] S. Peleg and B. Friedlander. "Discrete polynomial-phase transform". IEEE Trans. Signal Processing, 43(8):1901-1914, August 1995. [360] S. Peleg and B. Friedlander. "Multicomponent signal analysis using the polynomialphase transform". IEEE Trans. Aerospace ~ Electronic Systems, 32(1):378-386, January 1996. [361] S. Peleg and B. Porat. "Estimation and classification of polynomial-phase signals". IEEE Trans. Information Theory, 37(2):422-430, March 1991. [362] S. Peleg and B. Porat. "Linear FM signal parameter estimation from discrete-time observations". IEEE Trans. Aerospace ~4 Electronic Systems, 27(4):607-616, July 1991. [363] A. Persson, T. Ottosson, and E. Strom. "Time-frequency localized CDMA for downlink multi-carrier systems". In Proc. Seventh Internat. Symp. on Spread Spectrum Techniques and Applications (ISSSTA-02), volume 1, pages 118-122, Prague, 2-5 September 2002. [364] B. Picinbono. "On Instantaneous Amplitude and Phase of Signals". IEEE Trans. Signal Processing, 45(3):552-560, March 1997. [365] B. Picinbono and W. Martin. "Reprdsentation des signaux par amplitude et phase instantan~es". Ann. Tdldcommunications, 38:179-190, 1983. [366] A. G. Piersol. "Power spectra measurements for space vibration data". J. Spacecraft and Rockets, 4:1613, December 1967. [367] S. Pon Varma, A. Papandreou-Suppappola, and S. B. Suppappola. "Detecting faults in structures using time-frequency techniques". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), volume 6, pages 3593-3596, Salt Lake City, UT, 7-11 May 2001. [368] S. Pon Varma, A. Papandreou-Suppappola, and S. B. Suppappola. "Matching pursuit classification for time-varying acoustic emissions". In Proc. 35th A silomar Conf. on Signals, Systems, and Computers, Pacific Grove, CA, 4-7 November 2001. Paper TA2-3. [369] H. V. Poor. An Introduction to Signal Detection and Estimation. Springer, New York, 1988. [370] B. Porat and B. Friedlander. "Asymptotic statistical analysis of the high-order ambiguity function for parameter estimation of polynomial phase signal". IEEE Trans. Information Theory, 42(3):995-1001, May 1996.
Time-Frequency Bibliography
709
[371] M. R. Portnoff. "Time-frequency representation of digital signals and systems based on short-time Fourier analysis". IEEE Trans. Acoustics, Speech, ~ Signal Processing, 28(1):55-69, February 1980. [372] S. Prabhakar, A. S. Sekhar, and A. R. Mohanty. "Detection and monitoring of cracks in a rotor-bearing system using wavelet transforms". J. Mechanical Systems and Signal Processing, 15(2):447-450, March 2001. [373] R. Price and E. M. Hofstetter. "Bounds on the volume and height distributions of the ambiguity function". IEEE Trans. Information Theory, 11:207-214, 1965. [374] M. B. Priestley. Spectral Analysis and Time Series--Part II. Academic Press, London, 1981. [375] M. B. Priestly. "Evolutionary spectra and non-stationary processes". J. Royal Statistical Soc. (Series B), 27(2):204-237, 1965. [376] J. G. Proakis. Digital Communications. McGraw-Hill, New York, 3rd edition, 1995. [377] G. R. Putland and B. Boashash. "Can a signal be both monocomponent and multicomponent?". In Third Australasian Workshop on Signal Processing Applications (WoSPA 2000), Brisbane, Australia, 14-15 December 2000. Paper no. 32. [378] S. Qian. Introduction to Time-frequency and Wavelet Transforms. Prentice-Hall, Englewood Cliffs, N J, 2002. [379] S. Qian and D. Chen. "Decomposition of the Wigner distribution and time-frequency distribution series". IEEE Trans. Signal Processing, 42(10):2836-2842, October 1994. [380] S. Qian and D. Chen. Joint Time-Frequency Analysis: Methods ~ Applications. Prentice-Hall, Upper Saddle River, N J, 1996. [381] S. Qian and D. Chen. "Joint time-frequency analysis". Magazine, 16(2):52-65, March 1999.
IEEE Signal Processing
[382] S. Qian, Y. Rao, and D. Chen. "A fast Gabor spectrogram". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), volume 2, pages 653656, Istanbul, 5-9 June 2000. [383] R. S. Ramineni, M. G. Amin, and A. R. Lindsey. "Performance analysis of subspace projection techniques for interference excision in DSSS communications". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), volume 5, pages 2825-2828, Istanbul, 5-9 June 2000. [384] L. Rankine and M. Mesbah. "Significant atom determination of basis pursuit decomposition". In Proc. Seventh Internat. Symp. on Signal Processing and its Applications (ISSPA '03), pages 577-580, Paris, 1-4 July 2003. [385] A. M. Rao and D. L. Jones. "Efficient structures for quadratic time-frequency and time-scale array processors". In Proc. IEEE-SP Internat. Symp. on Time-Frequency Time-Scale Analysis, pages 397-400, Pittsburgh, PA, 6-9 October 1998. [386] A. M. Rao and D. L. Jones. "Nonstationary array signal detection using timefrequency and time-scale representations". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'98), volume 4, pages 1989-1992, Seattle, 12-15 May 1998. [387] A. M. Rao and D. L. Jones. "A denoising approach to multisensor signal estimation". IEEE Trans. Signal Processing, 48(5):1225-1234, May 2000.
710
Time-Frequency Bibliography
[388] T. S. Rappaport. Wireless Communications: Principles ~4 Practice. Prentice-Hall, Upper Saddle River, NJ, 1996. [389] T. R. Reed and H. Wechsler. "Segmentation of textured images and Gestalt organization using spatial/spatial-frequency representations". IEEE Trans. Pattern Analysis Machine Intelligence, 12(1):1-12, January 1990. [390] D. C. Reid, A. M. Zoubir, and B. Boashash. "Aircraft flight parameter estimation based on passive acoustic techniques using the polynomial Wigner-Ville distribution". J. Acoustical Soc. of America, 102(1):207-23, July 1997. [391] A. Reilly, G. Frazer, and B. Boashash. "Analytic signal generation--Tips and traps". IEEE Trans. Signal Processing, 42(11):3241-3245, November 1994. [392] G. Reina and B. Porat. "Comparative performance analysis of two algorithms for instantaneous frequency estimation". In Proc. Eighth IEEE Workshop on Statistical Signal and Array Processing (SSAP-96), pages 448-451, Corfu, Greece, 24-26 June 1996. [393] C. Richard. "Time-frequency-based detection using discrete-time discrete-frequency Wigner distributions". IEEE Trans. Signal Processing, 50(9):2170-2176, September 2002. [394] C. Richard and R. Lengell~. "Data-driven design and complexity control of timefrequency detectors". Signal Processing, 77(1):37-48, August 1999. [395] M. S. Richman, T. W. Parks, and R. G. Shenoy. "Discrete-time, discrete-frequency time-frequency analysis". IEEE Trans. Signal Processing, 46(6):1517-1527, June 1998. [396] D. C. Rife and R. R. Boorstyn. "Single-tone parameter estimation from discrete-time observations". IEEE Trans. Information Theory, 20(5):591-598, September 1974. [397] A. W. Rihaczek. "Signal energy distribution in time and frequency". IEEE Trans. Information Theory, 14(3):369-374, May 1968. [398] A. W. Rihaczek. Principles of high-resolution radar. McGraw-Hill, New York, 1969. Reprinted Los Altos, CA: Peninsula Publishing, 1985. [399] M. D. Riley. Speech Time-Frequency Representations. Kluwer, 1989. [400] O. Rioul and P. Flandrin. "Time-scale energy distributions: A general class extending wavelet transforms". IEEE Trans. Signal Processing, 40(7):1746-1757, July 1992. [401] B. Ristic and B. Boashash. "Kernel design for time-frequency signal analysis using the Radon transform". IEEE Trans. Signal Processing, 41(5):1996-2008, May 1993. [402] B. Ristic and B. Boashash. "Relationship between the polynomial and higher order Wigner-Ville distribution". IEEE Signal Processing Letters, 2(12):227-229, December 1995. [403] B. Ristic and B. Boashash. "Instantaneous frequency estimation of quadratic and cubic polynomial FM signals using the cross polynomial Wigner-Ville distribution". IEEE Trans. Signal Processing, 44:1549-1553, 1996. [404] G. Roberts, A. M. Zoubir, and B. Boashash. "Non-stationary, narrowband Gaussian signal discrimination in time-frequency space". In Wysocki et al. [506], chapter 18, pages 159-166. [405] T. D. Rossing. The Science of Sound. Addison-Wesley, Reading, MA, 2nd edition, 1990. [406] W. Rudin. Real and complex analysis. McGraw-Hill, New York, 1987.
Time-Frequency Bibliography
711
[407] B. Samimy and G. Rizzoni. "Mechanical signature analysis using time-frequency signal processing: Application to internal combustion engine knock detection". Proc. IEEE, 84(9):1130-1343, September 1996. [408] T.-H. Sang and W. J. Williams. "R@nyi information and signal-dependent optimal kernel design". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'95), volume 2, pages 997-1000, Detroit, 9-12 May 1995. [409] N. Sang-Won and E. J. Powers. "Volterra series representation of time-frequency distributions". IEEE Trans. Signal Processing, 51(6):1532-1537, July 2003. [410] S. Santoso, E. J. Powers, W. M. Grady, and P. Hofmann. "Power quality assessment via wavelet transform analysis". IEEE Trans. Power Delivery, 11(2):924-930, April 1996. [411] G. J. Saulnier, M. J. Medley, and P. K. Das. "Wavelets and filter banks in spread spectrum communication systems". In A. N. Akansu and M. J. T. Smith, editors, Subband and Wavelet Transforms: Design and Applications, chapter 10, pages 309-346. Kluwer, Norwell, MA, 1996. [412] A. M. Sayeed and B. Aazhang. "Joint multipath-Doppler diversity in mobile wireless communications". IEEE Trans. Communications, 47:123-132, January 1999. [413] A. M. Sayeed and D. J. Jones. "Optimal detection using bilinear time-frequency and time-scale representations". IEEE Trans. Signal Processing, 43(12):2872-2883, December 1995. [414] A. M. Sayeed and D. L. Jones. "A canonical covariance-based method for generalized joint signal representations". IEEE Signal Processing Letters, 3(4):121-123, April 1996. [415] A. M. Sayeed and D. L. Jones. "Integral transforms covariant to unitary operators and their implications for joint signal representations". IEEE Trans. Signal Processing, 44(6):1365-1377, June 1996. [416] A. M. Sayeed and D. L. Jones. "Optimum quadratic detection and estimation using generalized joint signal representations". IEEE Trans. Signal Processing, 44(12):30313043, December 1996. [417] A. M. Sayeed, A. Sendonaris, and B. Aazhang. "Multiuser detection in fast fading multipath environments". IEEE J. on Selected Areas in Communications, 16:1691-1701, December 1998. [418] L. L. Scharf and B. Friedlander. "Toeplitz and Hankel kernels for estimating timevarying spectra of discrete-time random processes". IEEE Trans. Signal Processing, 49(1):179-189, January 2001. [419] M. O. Scully and L. Cohen. "Quasi-probability distributions for arbitrary operators". In Y. S. Kim and W. W. Zachary, editors, The Physics of Phase Space: Nonlinear Dynamics and Chaos, Geometric Quantization, and Wigner Functions, number 278 in LECTURE NOTES IN PHYSICS, pages 253-263. Springer, Berlin, 1987. Proc. First Internat. Conf. on the Physics of Phase Space, University of Maryland, College Park, MD, 20-23 May 1986. [420] J. P. Sessarego, J. Sageloli, P. Flandrin, and M. Zakharia. "Time-frequency WignerVille analysis of echoes scattered by a spherical shell". In J.-M. Combes, A. Grossmann, and P. Tchamitchian, editors, Wavelets: Time-frequency Methods and Phase Space, pages 147-153. Springer, 1989. Proc. of the Internat. Conf., Marseille, 14-18 December, 1987. 315pp.
712
Time-Frequency Bibliography
[421] A. Z. Sha'ameri, B. Boashash, and I. Ismail. "Design of signal dependent kernel functions for digital modulated signals". In Proc. Fourth Internat. Symp. on Signal Processing and its Applications (ISSPA '96), volume 2, pages 527-528, Gold Coast, Australia, 25-30 August 1996. [422] R. G. Shenoy and T. W. Parks. "The Weyl correspondence and time-frequency analysis". IEEE Trans. Signal Processing, 42(2):318-331, February 1994. [423] Y. Shin, A. C. Parsons, E. J. Powers, and W. M. Grady. "Time-frequency analysis of power system disturbance signals for power quality". In Proc. IEEE Power Engineering Soc. Summer Meeting, volume 1, pages 402-407, Edmonton, AL (Canada), 18-22 July 1999. [424] Y. Shin, E. J. Powers, W. M. Grady, and S. C. Bhatt. "Effects of dispersion on disturbance propagation on high voltage transmission lines". In Proc. IEEE Power Engineering Soc. Summer Meeting, volume 2, pages 851-854, Seattle, WA (USA), 16-20 July 2000. [425] W. M. Siebert. "Studies of Woodward's uncertainty function". Quarterly Progress Report (MIT Electronics Research Lab, Cambridge, MA), pages 90-94, 1958. [426] J. A. Sills and E. W. Kamen. "Time-varying matched filters". Circuits, Systems, Signal Processing, 15(5):609-630, 1996. [427] M. K. Simon, J. K. Omura, R. A. Scholtz, and B. K. Levitt. Spread Spectrum Communications (3 vols.). Computer Science Press, Rockville, MD, 1985. [428] L. Sirovich and B. W. Knight. "On the eigentheory of operators which exhibit a slow variation". Quarterly of Applied Mathematics, 38:469-488, 1980. [429] D. Slepian. "On bandwidth". Proc. IEEE, 64(3):292-300, March 1976. [430] D. T. Smithey, M. Beck, M. G. Raymer, and A. Faridani. "Measurement of the Wigner distribution and the density matrix of a light mode using optical homodyne tomography: Application to squeezed states and the vacuum". Physical Review Letters, 70:1244-1247, 1993. [431] T. SSdertrSm and P. Stoica. System Identification. Prentice-Hall, Englewood Cliffs. N J, 1989. [432] K. A. Sostrand. "Mathematics of the time-varying channel". In Proc. NATO Advanced Study Inst. on Signal Processing with Emphasis on Underwater Acoustics, volume 2, pages 25.1-25.20, 1968. [433] L. Stankovi5. "An analysis of some time-frequency and time-scale distributions". Ann. Tdldcommunications, 49(9/10):505-517, September/October 1994. [434] L. Stankovi5. "An analysis of Wigner higher order spectra of multicomponent sighals". Ann. Tdldcommunications, 49(3/4):132-136, March/April 1994. [435] L. StankoviS. "A method for time-frequency analysis". IEEE Trans. Signal Processing, 42(1):225-229, January 1994. [436] L. StankoviS. "Multitime definition of the Wigner higher order distribution: LWigner distribution". IEEE Signal Processing Letters, 1(7):106-109, July 1994. [437] L. Stankovi~. "A method for improved distribution concentration in the timefrequency analysis of the multicomponent signals using the L-Wigner distribution". IEEE Trans. Signal Processing, 43(5):1262-1268, May 1995.
Time-Frequency Bibliography
"713
[438] L. Stankovi6. "Auto-term representation by the reduced interference distributions: A procedure for kernel design". IEEE Trans. Signal Processing, 44(6):1557-1563, June 1996. [439] L. Stankovid. "Highly concentrated time-frequency distributions: Pseudo-quantum signal representation". IEEE Trans. Signal Processing, 45(3):543-551, March 1997. [440] L. Stankovid. "On the time-frequency analysis based filtering". Ann. Tdldcommunications, 55(5/6):216-225, May/June 2000. [441] L. StankoviS. "A measure of some time-frequency distributions concentration". Signal Processing, 81(3):621-631, March 2001. [442] L. StankoviS. "Analysis of noise in time-frequency distributions". IEEE Signal Processing Letters, 9(9):286-289, September 2002. [443] L. Stankovi5. "Time-frequency distributions with complex argument". IEEE Trans. Signal Processing, 50(3):475-486, March 2002. [444] L. Stankovi~, T. Alieva, and M. Bastiaans. "Fractional-Fourier-domain weighted Wigner distribution". In Proc. Eleventh IEEE Workshop on Statistical Signal Processing, pages 321-324, Singapore, 6-8 August 2001. [445] L. Stankovid and J. F. B6hme. "Time-frequency analysis of multiple resonances in combustion engine signals". Signal Processing, 79(1):15-28, November 1999. [446] L. Stankovid and V. Ivanovid. "Further results on the minimum variance timefrequency distributions kernels". IEEE Trans. Signal Processing, 45(6):1650-1655, June 1997. [447] L. Stankovid and V. Katkovnik. "The Wigner distribution of noisy signals with adaptive time-frequency varying window". IEEE Trans. Signal Processing, 47(4):10991108, April 1999. [448] L. Stankovi5 and V. Katkovnik. "Instantaneous frequency estimation using the higher order L-Wigner distributions with the data driven order and window length". IEEE Trans. Information Theory, 46(1):302-311, January 2000. [449] L. Stankovi5 and S. Stankovid. "On the Wigner distribution of the discrete-time noisy signals with application to the study of quantization effects". IEEE Trans. Signal Processing, 42(7):1863-1867, July 1994. [450] L. Stankovi~ and S. StankoviS. "An analysis of instantaneous frequency representation using time-frequency distributions--Generalized Wigner distribution". IEEE Trans. Signal Processing, 43(2):549-552, February 1995. [451] L. StankoviS, S. Stankovid, and I. DjuroviS. "Architecture for realization of the cross-terms free polynomial Wigner-Ville distribution". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'97), volume III, pages 2053-2056, Munich, 21-24 April 1997. [452] L. Stankovi6, S. Stankovid, and I. Djurovi6. "Space/spatial frequency based filtering". IEEE Trans. Signal Processing, 48(8):2343-2352, August 2000. [453] S. Stankovi6. "About time-variant filtering of speech signals with time-frequency distributions for hands-free telephone systems". Signal Processing, 80(9):1777-1785, September 2000.
714
Time-Frequency Bibliography
[454] S. Stankovi5 and L. Stankovid. "Approach to the polynomial Wigner distributions". In Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ Time-Scale Analysis, pages 153-156, Paris, 18-21 June 1996. [455] S. Stankovi5 and L. Stankovi5. "Introducing time-frequency distribution with a 'complex-time' argument". Electronics Letters, 32(14):1265-1267, July 1996. [456] S. Stankovk', L. Stankovi5, and Z. Uskokovi5. "On the local frequency, group shift, and cross-terms in some multidimensional time-frequency distributions: A method for multidimensional time-frequency analysis". IEEE Trans. Signal Processing, 43(7):17191725, July 1995. [457] S. Stankovi~ and J. Tilp. "Time-varying filtering of speech signals using linear prediction". Electronics Letters, 36(8):763-764, April 2000. [458] W. J. Staszewski, K. Worden, and G. R. Tomlinson. "Time-frequency analysis in gearbox fault detection using the Wigner-Ville distribution and pattern recognition". J. Mechanical Systems and Signal Processing, 11(5):673-692, September 1997. [459] N. Stevenson, E. Palmer, J. Smeathers, and B. Boashash. "The B T product as a signal dependent sample size estimate in hypothesis testing: An application to linear/nonlinear discrimination in bandwidth limited systems". In Proc. Seventh Internat. Symp. on Signal Processing and its Applications (ISSPA '03), pages 551-554, Paris, 1-4 July 2003. [460] L. R. O. Storey. "An investigation of whistling atmospherics". Phil. Trans. Roy. Soc., A246:113-141, 1953. [461] G. Strang and T. Q. Nguyen. Press, Wellesley, MA, 1996.
Wavelets and Filter Banks. Wellesley-Cambridge
[462] V. Sucic and B. Boashash. "Optimisation algorithm for selecting quadratic timefrequency distributions: Performance results and calibration". In Proc. Sixth Internat. Symp. on Signal Processing and its Applications (ISSPA '01), volume 1, pages 331-334, Kuala Lumpur, 13-16 August 2001. [463] V. Sucic and B. Boashash. "Parameter selection for optimising time-frequency distributions and measurements of time-frequency characteristics of non-stationary signals". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'01), volume 6, pages 3557-3560, Salt Lake City, UT, 7-11 May 2001. [464] V. Sucic and B. Boashash. "Selecting the optimal time-frequency distribution for reallife multicomponent signals under given constraints". In Proc. Eleventh European Signal Processing Conf. (EUSIPCO-02), volume 1, pages 141-144, Toulouse, 3-6 September 2002. [465] V. Sucic and B. Boashash. "An approach for selecting a real-life signal bestperforming time-frequency distribution". In Proc. Seventh Internat. Syrup. on Signal Processing and its Applications (ISSPA '03), volume 1, pages 100-104, Paris, 1-4 July 2003. [466] V. Sucic, B. Boashash, and K. Abed-Meraim. "A normalised performance measure for quadratic time-frequency distributions". In Proc. Second IEEE Internat. Symp. on Signal Processing and Information Technology (ISSPIT'02), pages 463-466, Marrakech, Morocco, 18-21 December 2002.
Time-Frequency Bibliography
715
[467] S. M. Sussman. "Least-squares synthesis of radar ambiguity functions". IRE Trans. Information Theory, 8:246-254, April 1962. [468] H. Suzuki and F. Kobayashi. "A method of two-dimensional spectral analysis using the Wigner distribution". Electronics ~ Communications in Japan, Part III: Fundamental Electronic Science, 75(1):1006-1013, January 1992. [469] H. H. Szu. "Two-dimensional optical processing of one-dimensional acoustic data". Optical Engineering, 21(5):804-813, September-October 1982. [470] M. T. Taner, F. Koehler, and R. E. Sheriff. "Complex seismic trace analysis". Geophysics, 44(6):1041-1063, June 1979. [471] S. Thangavelu. Lectures on Hermite and Laguerre Expansions. Princeton University Press, Princeton, N J, 1993. [472] S. A. Tretter. "Estimating the frequency of a noisy sinusoid by linear regression". IEEE Trans. Information Theory, 31(6):832-835, November 1985. [473] F. Tupin, H. Maitre, J.-F. Mangin, J.-M. Nicolas, and E. Pechersky. "Detection of linear features in SAR images: Application to road network extraction". IEEE Trans. Geoscience ~4 Remote Sensing, 36(2):434-453, March 1998. [474] D. Vakman. "On the analytic signal, the Teager-Kaiser energy algorithm, and other methods for defining amplitude and frequency". IEEE Trans. Signal Processing, 44(4):791-797, April 1996. [475] D. E. Vakman. Sophisticated Signals and the Uncertainty Principle in Radar. Springer, New York, 1968. Translated by K. N. Trirogoff; edited by E. Jacobs. [476] A. J. van Leest. Non-separable Gabor schemes: Their design and implementation. PhD thesis, Eindhoven University of Technology, Eindhoven, Netherlands, 2001. [477] H. G. van Steenis, W. L. J. Martens, and J. H. M. Tulen. "Time-frequency parameters of heart-rate variability". IEEE Engineering in Medicine ~4 Biology Magazine, 21(4):4658, July-August 2002. [478] H. L. L. Van Trees. Detection, Estimation, and Modulation Theory, volume I: "Detection, Estimation, and Linear Modulation Theory". Wiley, New York, 1968. Reprinted 2001. [479] H. L. L. Van Trees. Detection, Estimation, and Modulation Theory, volume III: "Radar-Sonar Signal Processing and Gaussian Signals in Noise". Wiley, New York, 1971. Reprinted Malabar, FL: Krieger, 1992. Reprinted New York: Wiley, 2001. [480] S. V. Vaseghi. Advanced Signal Processing and Digital Noise Reduction. Wiley and Teubner, 1996. [481] E. F. Velez and H. Garudadri. "Speech analysis based on smoothed Wigner-Ville distribution". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 15, pages 351-374. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [482] B . V . K . Vijaya Kumar and C. W. Carroll. "Performance of Wigner distribution function based detection methods". Optical Engineering, 23(6):732-737, November-December 1984. [483] J. Ville. "Th~orie et applications de la notion de signal analytique". Cables et Transmissions, 2A(1):61-74, 1948. In French. English translation: I. Selin, Theory and applications of the notion of complex signal, Rand Corporation Report T-92 (Santa Monica, CA, August 1958).
716
Time-Frequency Bibliography
[484] K. Vogel and H. Risken. "Determination of quasiprobability distributions in terms of probability distributions for the rotated quadrature phase". Physical Review A, 40:28472849, 1989. [485] H. Vold and J. Leuridan. "High resolution order tracking at extreme slew rates using Kalman tracking filters". In Proc. Noise ~ Vibration Conf. ~ Exposition. Soc. of Automotive Engineers, Traverse City, MI, 17-20 May 1993. SAE paper no. 931288. [486] C. Wang and M. G. Amin. "Performance analysis of instantaneous frequency based interference excision techniques in spread spectrum communications". IEEE Trans. Signal Processing, 46(1):70-83, January 1998. [487] W. Wang; and D. H. Johnson. "Computing linear transforms of symbolic signals". IEEE Trans. Signal Processing, 50(3):628-634, March 2002. [488] J. Wexler and S. Raz. "Discrete Gabor expansions". Signal Processing, 21(3):207-221, November 1990. [489] A. D. Whalen. Detection of Signals in Noise. Academic Press, 1971. [490] L. B. White. "Transition kernels for bilinear time-frequency signal representations". IEEE Trans. Signal Processing, 39(2):542-544, February 1991. [491] L. B. White and B. Boashash. "Cross spectral analysis of nonstationary processes". IEEE Trans. Information Theory, 36(4):830-835, July 1990. [492] H. J. Whitehouse, B. Boashash, and J. M. Speiser. "High-resolution processing techniques for temporal and spatial signals". In M. Bouvet and G. Bienvenu, editors, High-resolution methods in underwater acoustics, chapter 4, pages 127-176. Springer, Berlin, 1991. [493] E. P. Wigner. "On the quantum correction for thermodynamic equilibrium". Physics Review, 40:749-759, June 1932. [494] W. J. Williams. "Reduced interference distributions: Biological applications and interpretations". Proc. IEEE, 84(9):1264-1280, September 1996. [495] W. J. Williams. "Recent advances in time-frequency representations: Some theoretical foundation". In M. Akay, editor, Time Frequency and Wavelets in Biomedical Signal Processing, chapter 1. IEEE/Wiley, New York, 1998. [496] W. J. Williams. "Reduced interference time-frequency distributions: Scaled decompositions and interpretations". In L. Debnath, editor, Wavelet Transforms and TimeFrequency Signal Analysis, chapter 12. Birkh~iuser, Boston, 2001. [497] W. J. Williams and S. Aviyente. "Optimum window time-frequency distribution decompositions". In Proc. 32nd Asilomar Conf. on Signals, Systems, and Computers, pages 817-821, Pacific Grove, CA, 1-4 November 1998. [498] W. J. Williams and S. Aviyente. "Minimal-window time-frequency distributions". In Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations IX, volume 3807, pages 446-457. Soc. of Photo-optical Instrumentation Engineers, Denver, CO, 19-21 July 1999. [499] W. J. Williams, M. L. Brown, and A. O. Hero III. "Uncertainty, information and time-frequency distributions". In Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations II, volume 1566, pages 144-156. Soc. of Photo-optical Instrumentation Engineers, San Diego, 24-26 July 1991.
Time-Frequency Bibliography
717
[500] W. J. Williams and J. Jeong. "Reduced interference time-frequency distributions". In B. Boashash, editor, Time-Frequency Signal Analysis: Methods and Applications, chapter 3, pages 74-97. Longman-Cheshire/Wiley, Melbourne/N.Y., 1992. [501] W. J. Williams and J. C. O'Neill. "Decomposition of time-frequency distributions using scaled window spectrograms". In Proc. SPIE: Advanced Signal Processing Algorithms, volume 2563, pages 44-58. Soc. of Photo-optical Instrumentation Engineers, San Diego, CA, 10-12 July 1995. [502] W. J. Williams and T.-H. Sang. "Adaptive RID kernels which minimize timefrequency uncertainty". In Proc. IEEE-SP Internat. Symp. on Time-Frequency ~ TimeScale Analysis, pages 96-99, Philadelphia, PA, 25-28 October 1994. [503] W. J. Williams, T.-H. Sang, J. C. O'Neill, and E. J. Zalubas. "Wavelet windowed time-frequency distribution decompositions". In Proc. SPIE: Advanced Signal Processing Algorithms, Architectures, and Implementations VII, volume 3162, pages 149-160. Soc. of Photo-optical Instrumentation Engineers, San Diego, July-August 1997. [504] J. C. Wood and D. T. Barry. "Tomographic time-frequency analysis and its application toward time-varying filtering and adaptive kernel design for multicomponent linear FM signals". IEEE Trans. Signal Processing, 42:2094-2104, 1994. [505] P. M. Woodward. Probability and Information Theory with Applications to Radar. Pergamon, London, 1953. [506] T. Wysocki, H. Razavi, and B. Honary, editors. Digital Signal Processing for Communication Systems. Number 403 in KLUWER INTERNAT. SERIES IN ENGINEERING AND COMPUTER SCIENCE. Kluwer, Boston, MA, 1997. [507] X.-G. Xia. "System identification using chirp signals and time-variant filters in the joint time-frequency domain". IEEE Trans. Signal Processing, 45(8):2072-2084, August 1997. [508] X.-G. Xia and V. C. Chen. "A quantitative SNR analysis for the pseudo Wigner-Ville distribution". IEEE Trans. Signal Processing, 47(10):2891-2894, October 1999. [509] X.-G. Xia, Y. Owechko, B. H. Softer, and R. M. Matic. "On generalized-marginal time-frequency distributions". IEEE Trans. Signal Processing, 44(11):2882-2886, November 1996. [510] X.-G. Xia and S. Qian. "Convergence of an iterative time-variant filtering based on discrete Gabor transform". IEEE Trans. Signal Processing, 47(10):2894-2899, October 1999. [511] X.-G. Xia, G. Wang, and V. Chen. "Quantitative SNR analysis for ISAR imaging using joint time-frequency analysis--Short time Fourier transform". IEEE Trans. Aerospace Electronic Systems, 38(3):649-659, April 2002. [512] J. Yang. "Frequency domain noise suppression approaches in mobile telephone systems". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP'93), volume 2, pages 363-366, Minneapolis, 27-30 April 1993. [513] L. A. Zadeh. "Frequency analysis of variable networks". Proc. IRE, 3A-8:291-299, March 1950. [514] P. Zarjam, M. Mesbah, and B. Boashash. "Detection of newborn EEG seizure using optimal features based on discrete wavelet transform". In Proc. IEEE Internat. Conf. on
718
Time-Frequency Bibliography
Acoustics, Speech and Signal Processing (ICASSP'03), volume 2, pages 265-268, Hong Kong, scheduled 6-10 April 2003. [515] A. I. Zayed. "A class of fractional integral transforms: a generalization of the fractional Fourier transform". IEEE Trans. Signal Processing, 50(3):619-627, March 2002. [516] Y. Y. Zeevi, M. Zibulski, and M. Porat. "Multi-window Gabor schemes in signal and image representations". In H. G. Feichtinger and T. Strohmer, editors, Gabor Analysis and Algorithms: Theory and Applications, chapter 12, pages 381-407. Birkh~iuser, Berlin/Boston, 1998. [517] F. Zhang, G. Bi, and Y. Q. Chen. "Tomography time-frequency transform". IEEE Trans. Signal Processing, 50(6):1289-1297, June 2002. [518] Y. Zhang and M. G. Amin. "Blind separation of sources based on their time-frequency signatures". In Proc. IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2000), volume 5, pages 3132-3135, Istanbul, 5-9 June 2000. [519] Y. Zhang and M. G. Amin. "Spatial averaging of time-frequency distributions for signal recovery in uniform linear arrays". IEEE Trans. Signal Processing, 48(10):28922902, October 2000. [520] Y. Zhang, W. Mu, and M. C. Amin. "Time-frequency maximum likelihood methods for direction finding". J. Franklin Institute, 337(4):483-497, July 2000. [521] Y. Zhang, W. Mu, and M. G. Amin. "Subspace analysis of spatial time-frequency distribution matrices". IEEE Trans. Signal Processing, 49(4):747-759, April 2001. [522] Y. Zhao, L. E. Atlas, and R. J. Marks II. "The use of cone-shaped kernels for generalized time-frequency representations of non-stationary signals". IEEE Trans. Acoustics, Speech, ~4 Signal Processing, 38(7):1084-1091, July 1990. [523] H. Zheng, Z. Li, and X. Chen. "Gear fault diagnosis based on continuous wavelet transform". J. Mechanical Systems and Signal Processing, 16(2-3):447-457, March-May 2002. [524] Y. M. Zhu, R. Goutte, and M. Amiel. "On the use of two-dimensional Wigner-Ville distribution for texture segmentation". Signal Processing, 30(3):329-353, February 1993. [525] M. Zibulski and Y. Y. Zeevi. "Discrete multiwindow Gabor-type transforms". IEEE Trans. Signal Processing, 45(6):1428-1442, June 1997. [526] R. Zou, W. A. Cupples, K. R. Yip, N. H. Holstein-Rathlou, and K. Chon. "Timevarying properties of renal autoregulatory mechanisms". IEEE Trans. Biomedical Engineering, 49(10):1112-1120, October 2002.
Time- Frequency Index Symbols 1/0 kernel, 181 A AD-MUSIC, s e e MUSIC adaptive (modified) matching pursuit, 511 adaptive cone-shaped kernel, 182-183, 620 adaptive network-based fuzzy inference system (ANFIS), 680 adaptive notch filter, 544-547 adaptive optimal kernel (AOK), 182, 620-621 - with chirp signals, 620, 621 adaptive quadratic TFDs, 180-183 adaptive representation for multiple structures (ARMUS), 513 adaptive spectrogram, 178-180 adaptive STFT, 178-180 adaptive TFDs, 178-183 - of bat sound, 179 - with 1/0 kernel, 181 - with adaptive cone-shaped kernel, 182-183 - with adaptive optimal kernel (AOK), 182 - with radially Gaussian kernel, 182 - with reduced interference, 183 additive noise, 344, 345, 490 - and polynomial FM signals, 447, 455 - converted to multiplicative, 491 - IF estimation in, 437, 440 - polynomial phase estimation in, 198-201 - smearing the WVD, 492 adjoint system, 140 affine class, 106, 110, 643, 645-650
with reassignment, 293 affine group, 104, 280, 281, 288, 289 - defined, 280 affine transformation, 281 affine Wigner function, 283-289 unitarity of, 287-289 affine-smoothed pseudo-Wigner distribution, 647-649 aliasing - in discrete quadratic TFDs, 237, 241 - in discrete WVD, 233-234 in images, 655-659 ambiguity domain, 69, s e e Doppler-lag domain ambiguity function, 66, 69, 160-166 - and fractional autocorrelation, 572 and signal design, 165 - and TFDs, 161 - and the STFT, 162 - and the wavelet transform, 162 - and uncertainty, 164 - as time-frequency correlation, 163 computation of, 166 conflicting definitions of, 66n - filtered/generalized, s e e filtered ambiguity function higher-order, 594 - in radar, 69, 160 in sonar, 160, 618 narrowband, 160, 162-166 properties of, 162-166 (symmetrical), 69, 161, 169 time-localized, 182 weighted, s e e filtered ambiguity function -wideband, 161, 166 -
-
-
-
-
-
-
-
-
-
-
S
u
s
s
m
a
n
-
-
719
720
- Woodward, 161,618 ambiguity surface, 162 AMMP, see adaptive (modified) matching pursuit analytic associate, 14, 15 - and oversampling, 268 - computation of, 268-269 - used to reduce cross-terms, 96 analytic image, 651-654, 657, 658 - and aliasing, 656 - choice of, 657-659 analytic signal, 14, 86-93, see also analytic associate analytic signals, 13 - sum and product of, 87 ANFIS, see adaptive network-based fuzzy inference system angle of arrival, see direction-of-arrival... AOK, see adaptive optimal kernel ARMUS, see adaptive representation for multiple structures artifact, 62 asymptotic mean squared error - of PWVD-based IF estimate, 453-455 asymptotic reassignment points, 294 asymptotic signal, 18 asymptotic-phase signals, 92-93 atmospheric turbulence, 457 auto-terms, 64 - as superpositions of coherent cross-terms, 99 - in STFDs, 351,353 - location of, 70 - selection of, 360 autocorrelation function, 36, 637, 638 average frequency, 21 axis transformation, 123, 125 B B-distribution (BD), 51, 53, 55, 75-77, 217, 305, 441 - computation of, 270 - defined using time-lag kernel, 665 - discrete, 240, 241,271 - for component extraction, 361 - for component separation, 366
Time-Frequency Index - o f EEG signals, 664, 665, 667-669 backscattering, see scattering band-limited signal, 16, 35 bandpass signal, 23 bandwidth, 16, 18 - effective, 16 - relaxed measures of, 16 - Slepian, 17 bandwidth-duration ( B T ) product, 17, 18, 23, 152, 166, 503, 549, 550, 553 bat - large brown ( E p t e s i c u s f u s c u s ) , 179 bat sound, 156, 157 - adaptive TFDs of, 179 - Gabor spectrograms of, 157 - spectrogram and MBD of, 438 Bayesian detector, 508 - optimal, 507 BD, see also binomial distribution, see B-distribution Bedrosian's theorem, 88-89 Bertrand distribution, 209, 384, 646, 647 bias - o f IF estimation, 429-436, 439-440, 442, 443, 445, 490, 492-494 - of WVD, 372-373 bilinear time-frequency representations, see quadratic TFDs bilinear transformation, 62 binomial distribution, 175 - kernel decomposition, 266 - of electric power disturbances, 629 biorthogonal-like condition, 318 biorthogonality, 253, 258 bird song, 312, 313 BJD, see Born-Jordan distribution blind source separation - underdetermined, see underdetermined BSS blind source separation (BSS), 324-334, 339, 341-343, 617 - based on STFD, 326-331,349, 351, 357-368
Time-Frequency Index
conditions permitting, 324 - other names for, 324 - underdetermined, 328 blurring, s e e smoothing Born-Jordan distribution (BJD), 51, 53, 55, 74, 76, 77, 171 - computation of, 271-272 discrete, 240, 241,271 kernel decomposition, 266 - minimum kernel energy property, 376 BSS, s e e blind source separation B T product, s e e bandwidth-duration (BT) product burst (EEG pattern), 665-667 burst suppression (EEG pattern), 664, 666 butterfly function, 74, s e e cone-shaped kernel -
-
-
C canonical pair, 86-87 - characterization of, 88-93 - for regular phase signal, 89-91 - for singular and asymptotic phase signals, 92-93 with amplitude modulation, 88 Cauchy principal value, 15 CDMA (code-division multiple-access)..., s e e spread-spectrum... CDWR, s e e cross-term deleted Wigner representation center of gravity, 291 central finite-difference (CFD) approx., 31, 186 centroid, s e e local centroid C F D . . . , s e e central finite-difference... channel linear dispersive, 549-557 multiplicative, 567 - overspread, 414 - random, 410 time-varying, 410 - underspread, 414, 550 - WSSUS, s e e WSSUS channel characteristic function, 413, 492 Chebyshev polynomial, 190, 191 -
-
-
-
721
chirp, 4,
frequency modulation, linear FM, quadratic FM, hyperbolic FM and adaptive optimal kernel, 620, 621 detection by fractional autocorrelation, 573-574 - hyperbolic-phase, 92 parabolic-phase, 92 sweep rate estimation of, 573-574 Choi-Williams distribution (CWD), 51, 53, 54, 76, 77, 169-170 -computation of, 271-272 240, 241,271 invariance properties of, 172 of dolphin click, 176 sensitivity to noise, 379 CIT, s e e coherent integration time clustering of components, 361-362 - o f vectors, 357-363, 366-368 code-division multiple-access..., s e e spread-spectrum... Cohen's class, 68n, s e e quadratic class, quadratic TFDs coherent integration time (CIT), 606-613 communication tetherless underwater channel, 390 wireless channel, 384 complex envelope, 88 complex WDF transform (CWT), 578, 582-583 cross-terms avoided by, 582 complex-time form, s e e polynomial WVDs component clustering, 361-362 component extraction, 361-366 component separation, 364 components - resolution of, 307 computation latency of, 269 of analytic associate, 268-269 - of discrete kernels, 270-272 - of discrete quadratic TFDs, see also
-
-
-
-
-
-
-
-
-
-
-
-
-
-
d
i
s
c
r
e
t
e
,
Time-Frequency Index
722
268-278 of discrete WVD, 268 - of fractional Fourier transform, 152 - of modified B-distribution, 274-277 - of spectrogram, 277-278 - of windowed DWVD, 268 real-time, 269-270 throughput of, 269 concentration .~ and probabilities, 297 by analogy with duration, 299-30O - measured by R@nyi entropy, 298-299 measured by ratio of norms, 297-298 normalized measures, 299 - of S-method, 301-303 - of spectrogram, 301 optimization of, 302-304 condition monitoring, s e e machine condition monitoring cone-shaped kernel, 74, 171 - adaptive, 182-183, 620 conformity index of, 280-283, 285-287, 289 constant IF, 25 convolution invariance, 61 convolutive mixtures separation of, 328-332 correlation operator, 400, 529 covariance - in the group domain, 105 - in the time-frequency domain, 109 - o f bilinear/quadratic TFRs, 109 linear TFRs, 109 covariance property, 102, 203, 643, 644 covariance theory of time-frequency analysis, 102 Cramer-Rao bound, 164, 339, 490 CRB, s e e Cramer-Rao bound CRLS (cascade recursive least squares), s e e neural network cross polynomial WVD (XPWVD), 449 -
-
-
-
-
-
-
-
-
-
-
o
iterative IF estimation by, 449 cross-ambiguity function, 552 cross-correlation 2-D, 668 cross-spectrogram, 260, 263, 266 cross-term deleted Wigner representation (CDWR), 620-621, s e e a l s o Gabor expansion cross-terms, 62, 63, 94-101,620 - amplitude, 307 and localization, 98-100 - avoided by CWT, 582 - avoided by TFAR and TFMV, 585 - away from origin in Doppler-lag domain, 96-97, 170 - in polynomial TFDs, 191 - in STFDs, 341,351,353 - in the energy spectrum, 94 in the spectrogram, 94-95 - in the WVD, 94-96 location of, 70 reduced by using analytic associate, 96 - related to XWVDs, 64 statistical, 406 - suppressed by LI kernels, 217 suppression of, 64, 69 cross-Wigner-Ville distribution (XWVD), 155 - defined, 64 - in IF estimation, 422-423 cumulant, 492 CWD, s e e Choi-Williams distribution CWT, s e e complex W D F transform cyclo-stationarity - of engine signals, 638 cyclostationary random process, 414 -
-
-
-
-
-
-
-
f
D
DCT, s e e discrete cosine transform decay time, 41 dechirping, 34, 186 - in polynomial WVDs, 448 deflection, 345 delay power profile, 412 delay spread, 415
Time-Frequency Index
delay-Doppler spread, 139, 415, 506, 558, 560, 562 straight-line, 559-560, 567 denoising, 473, 522-524 experiments, 524 mask design for, 523-524 density distributions, 128 as fine organizational data, 129 detectable dynamic range (DNR), 578 detection, 531, s e e a l s o test statistic - as hypothesis testing, 344-345, 502-503, 618-619 - Bayesian, 507, 508 - by fractional Fourier transform, 151 - by quadratic TFDs, 503-509 - decentralized, 554 - of chirps, 573-574 - o f knock, s e e knock detection - of lines, 362 - optimal, 500-509 - quadratic, 344-347 DGT, s e e discrete Gabor transform D I . . . , s e e Doppler-independent... differential reassignment, 294 dilation invariance, 282-283, 293 direct-sequence spread-spectrum communication, 542 direction-of-arrival (DOA) estimation, 334-338, 340, 344-347, 349 discontinuities - well handled by wavelet transform, 677 discrete cosine transform (DCT), 318 - windowed, 318 discrete Gabor transform (DGT), 521 - inverse, 521 discrete PWVDs, 449-450 discrete SM, 245-248 - examples, 246-248 discrete spectrogram, 272-274, 277-278 - latency of, 274 - sampling for, 272-273 discrete TFDs, 232-241 - aliasing in, 233-234, 237, 241 - computation of, 268-278 - derived from standard kernels, -
-
-
-
723
240-241, 271 - Doppler-frequency form, 238 general quadratic, 235-238 - IF property for, 239 - properties related to kernel, 239-241 real-time computation of, 269-270 sampling for, 236-238 with Doppler-independent kernels, 240, 241,271 discrete T F P F bias-variance tradeoff, 494 iterative algorithm, 494 signal scaling, 493-494 - window length selection, 494 discrete wavelet packet analysis (DWPA), 677 in fault diagnosis, 680 discrete wavelet transform (DWT), 319-322, 676, 677, s e e a l s o scalogram in interference excision, 543 discrete WVD, 232-234, 240, 241, 271 ~ 2D, 655-659 aliasing in, 233-234 - as discrete quadratic TFD, 237 computation of, 268 sampling for, 232-234 - windowed, s e e windowed DWVD discrete WWVD, s e e windowed DWVD discrete-domain frames, 315-322 dispersion time shift, 205 time-frequency characteristics, 203 dispersive IF shift, 382 spreading function, 386 - system, 203, 382 transformation, 382, 643, 646 displacement function, 106 DNR, s e e detectable dynamic range D O A . . . , s e e direction-of-arrival... dolphin click - binomial distribution of, 266 - Choi~ distribution of, 176 dominant frequency, 21 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Time-Frequency Index
724
dominant time, 21 Doppler (~), 34 - in radar echo from helicopter, 585-589 Doppler effect - acoustical, 597-599 hydrophone in water, 598-599 microphone in air, 597-598 - vs. Doppler (~), 34n Doppler power profile, 412 Doppler shift, 102, 137 Doppler spread, 415, 550 Doppler spreading, 410 Doppler tolerance, 166 Doppler, Christian, 597 Doppler-delay spread, 139, 415, 558, 560, 562 - straight-line, 559-560, 567 Doppler-frequency kernel, 67, 70 Doppler-independent kernel, 52, 55, 63, 71, 72, 74, 76, 77, 214-215, 222, s e e a l s o windowed WVD - and TFD properties, 216-217 discrete, 240, 241,271 - giving windowed WVD, 215 smoothing effect of, 215 Doppler-invariant signal, 209 Doppler-lag domain, 69, 70 and cross-terms, 96-97, 170 Doppler-lag kernel, 67, 69 - and Moyal's formula, 668 double convolution, 38, 214 DSSS..., s e e direct-sequence spread-spectrum... dual functions, 42, 155 dual operator, 107 duration, 16, 18 - effective, 17 - Slepian, 17 duration-bandwidth product, s e e bandwidth-duration (BT) product DWT, s e e discrete wavelet transform DWVD, s e e discrete WVD dyadic sampling grid, 676 dynamic bandwidth, 26 -
-
-
-
-
E
EEG, see electroencephalogram effective analysis window, 37 effective bandwidth, 16 effective duration, 17 effective lag window, 35 electro-oculogram (EOG), 664 electrocardiogram (ECG), 664 electroencephalogram (EEG) analyzed by B-distribution, 664, 665, 667-669 - background patterns, 666-667 data acquisition for, 664 for seizure detection, 663 multicomponent nature of, 663 nonstationarity of, 663 seizure patterns, 665-666 time-frequency analysis of, 664-665, 668-669 - time-frequency peak filtering of, 495, 496 energy - concentration about IF law, 12, 76, 306 of signal, 5 energy atom, 155 of, 155-156 energy density, 45, 52 quasi-, 508 energy distributions, 128 - as smoothed density distributions, 128 as visual aids, 129 energy gradient, 45, 52 energy property, 60 energy spectrum, 7, 33 - cross-terms in, 94 engine pressure signal, 302, 303 engine sound, 476 - Gabor coefficients of, 476, 477 ensemble average, 37 ensemble view, 116 entropy - Leipnik measure, 300 - R~nyi, 298-300, 305 Shannon, 297n, 299-300 Zakai parameter, 300 -
-
-
-
-
-
-
-
-
-
-
-
-
o
s
c
i
l
l
a
t
i
o
n
725
Time-Frequency Index
estimation, 528 evolutionary spectrum, 37, 402 - generalized, 402 - transitory, 402 expected ambiguity function -generalized, 403, 414 exponential T-distribution, 443 extended displacement function, 106 extraction - of components, 361-366 F fading - in CDMA communication, 551, 555 - in wireless communication, 457 - Rayleigh, 551, 553, 564 - reduced by time-frequency processing, 553 fault diagnosis, 671,673 - in frequency domain, 673, 674 - localized, 679, 680 - nonlinear methods, 673 fault location - in electric power networks, 631-632 feature extraction - for machine condition monitoring, 672, 673, 675, 677, 679-681 feature selection, 672 - in gearbox fault detection, 678 feature space, 672 filter, s e e time-frequency filter filter bank, 42 - perfect reconstruction, 320, 321 filtered ambiguity function, 68, 69, 161, 170, 180, 181 - for separable kernel, 214 filtered WVD, 36 finite bandwidth, 16 finite duration, 17 finite-element method (FEM), 635, 638 Fisher information, 117 Fisher information matrix, 164 FM-like signals - underdetermined BSS for, 357-368 F M . . . , s e e frequency modulation...
Form I (of polynomial WVD), 189, 190 Form II (of polynomial WVD), 189, 190 Form III (of polynomial WVD), 189 Fourier series, 628 Fourier transform (FT), 7 fractional autocorrelation theorem, 571-572 fractional convolution, 569-572 alternative formulations, 570-572 - special cases of, 570 fractional correlation, 569-572 alternative formulations, 570-572 and ambiguity function, 572 - and chirp detection, 573-574 -
-
-
- auto-, 570 - computational complexity of, 571, 573 - cross-, 569-570 - special cases of, 570 fractional Fourier transform, 223-224, 568-569 - and Hermite-Gauss functions, 146 - applications of, 151-152 computational complexity of, 152 - defined, 146 - global moments of, 148-150 - invariance properties of, 149-150 - local moments of, 150-151 - of common functions, 148 - of linear FM signal, 151 - properties, 147 - rotation property, 146-147, 223, -
569 - special cases of, 223 fractional power spectrum - defined, 147-148 fractional-powers form, see polynomial WVDs frame, 315, 321 - cascaded, 319-321 - defined, 315 - discrete-domain, discrete-domain
see
- dual, 315
- pseudo-, 315 frame operator, 315 frequency covariance,
60,
frames
see
Time-Frequency Index
726
frequency-shift invariance frequency extent, 74 frequency marginal, 60 frequency modulation, s e e a l s o chirp, linear FM, quadratic FM, hyperbolic FM 21 - of multiple components, 510 - rate (nonlinear), 511 frequency resolution, 39 frequency support, 61, 74, 171 - discrete, 240 - strong, 46 frequency window, 35, 36, 70 frequency-dependent modulation function, 136 frequency-invariant system, 135 frequency-invariant WSSUS channel, 413 frequency-shift invariance, 60, 74, 172, 203 - of robust WD, 397 preserved by reassignment, 292 FRFT, s e e fractional Fourier transform FT, s e e Fourier transform -
l a w ,
4,
-
G Gabor coefficient, 155, 477-480 - of engine sound, 476, 477 Gabor expansion, 155, 620-621 - critical sampling in, 477 - discrete, 476-479 - dual window functions, 477-479 in time-varying filtering, 476-480 oversampling in, 252, 254-256, 477 - product form, 254 - sum-of-products form, 255 - with non-orthogonal sampling, 252, 257-259 - with rectangular lattice, 252-253 Gabor filter, 470 Gabor spectrogram, 153-158 - concept of, 154 - convergence to WVD, 156 defined, 154-156 - of bat sound, 156 Gabor transform, 42, 253, 316, 321,470 -
-
- discrete, s e e discrete Gabor transform in interference excision, 543 multi-window, 317 oversampling in, 254-256 - product form, 254 - sum-of-products form, 255 Gabor transform pair 478, 480 Gabor, Dennis, 17n, 41, 114, 252, 422 GAF, s e e generalized ambiguity function Gaussian process - complex circular, 460 Gaussian signal, 18, 22, 503 as basis function, 154 - in derivation of Gabor spectrogram, 154 GDS..., s e e group delay shift... generalized ambiguity function (GAF), 68, 195-196, s e e a l s o filtered ambiguity function - for polynomial phase estimation, 196 generalized evolutionary spectrum, 402 generalized expected ambiguity function, 403, 414 generalized impulse, 206, 207 generalized likelihood ratio test (GLRT), 346, 347, 502-504, 507-509, s e e a l s o likelihood ratio detector generalized marginals, 183, 224-228, 282 generalized Radon transform, 283 generalized spreading function, 137, 403, 411,469 generalized Weyl filter, 467 generalized Weyl symbol, 135, 400, 402, 408, 411,467, 528 generalized Wigner distribution (GWD), 136, 196-198, 400, 533, 623 application of, 197-198 discrete, 198 generalized Wigner-Ville spectrum, 400, 414, 528 generalized-marginal property, 225 -
-
- o r t h o g o n a l - l i k e ,
-
-
Time-Frequency Index
generalized-marginal TFDs, 225-228 - kernel examples, 227 - kernel properties, 227 Givens rotation, 354-355 global energy property, 60 GLRT, s e e generalized likelihood ratio test group, 103 group delay, 23, 107, 634, 644, s e e a l s o time delay - constant, 644 - dispersive, 644 - exponential, 210 - hyperbolic, 208, 283 - local, 291 - nonlinear, 203, 644 - power-function, 209, 643-645, 647, 649 group delay shift (GDS) covariance, 203-212 group velocity, 631,632 GWD, s e e generalized Wigner distribution H
Haar window, 260, 263 Hadamard product, 350 HAF, s e e higher-order ambiguity function halfband system, 472 Hamming window, 218 Hankel form, 243 Hanning window, 218, 301,373, 380 Heisenberg uncertainty relation, 41, 290, s e e a l s o uncertainty - and singular functions, 561 - applied to marginals, 114 - in quantum mechanics, 41n - misleading as limit on resolution, 114-115 helicopter - radar echo from, 585-586 - sound from, 597, 601 Hermite function expansions, 131-132 Hermite normal form, 257 Hermite-Gauss functions, 146 Hermitian operator, 122, 124 Hermitian symmetry, 13
727
Hessian, 117 higher-order ambiguity function (HAF), 452, 594 for multiple component analysis, 453 for single component analysis, 453 higher-order cumulants, 463 higher-order IAF, 187 higher-order moment, 463 higher-order spectra, 463, 622 - bispectrum, 622, 623 time-varying, 462, 463 - trispectrum, 328, 623 Hilbert space, 281,315 Hilbert transform, 14, 15 Huber M-estimates, 392, 393, 399 human visual system (HVS), 651, 659-661 HVS, s e e human visual system hyperbolic - class, 112, 643 FM signal, 62, 166, 282-283, 286-287, 389, 511,620 - frequency-shift operator, 110 - time-frequency structure, 387 - transformation, 387 - wavelet transform, 112 hyperbolic T-distribution (HTD), 441, s e e modified B-distribution hyperbolic-phase chirp, 92 -
-
-
-
I
IAF,
instantaneous autocorrelation function IDGT, s e e discrete Gabor transform (inverse) I F . . . , s e e instantaneous frequency... IFT, s e e inverse Fourier transform image dissimilarity - measured by 2D WVD, 659-660 image distortion, 651,659-661 image quality, 651,660 as signal-to-noise ratio, 659, 661 - time-frequency measures of, 659-661 impulse - generalized, 206, 207 impulse noise, 392, 393, 397, 399 see
-
728
Time-Frequency Index
- and TFDs, 392 impulse signal, 206 index of (dimensional) conformity, 280-283, 285-287, 289 index of significance, 281, 283, 289 information - spectral, 116-121 inner artifacts, 62, 73, 74, 76, 98 - suppressed by DI kernels, 217 - suppressed by polynomial WVDs, 190 inner-product invariance, 62, 282 innovations system, 401 instantaneous amplitude, 11, 20 instantaneous autocorrelation function (IAF), 33, 47, 66 - higher-order/polynomial, 187 spatial, 349 instantaneous bandwidth, 11,307 instantaneous cross-correlation function, 63 instantaneous Doppler, 607-613 instantaneous frequency (IF), 9, 11, 19, 20, 22, 25, 26, 76, 86-93, 107 adaptive estimation of, 429-446 - density of, 295 dispersive, 510 dispersive change of, 382 encoding signal as, 489-490 - estimated by matched spectrogram, 423-427 - estimated by maxima of MBD, 442-445 - estimated by maxima of quadratic TFDs, 438-441, 443 - estimated by maxima of spectrogram, 437-438, 445 - estimated by maxima of TFDs, 429 - estimated by polynomial WVD, 187-188, 453-455 estimated for noisy signals, 429 estimated for random signals, 422-427 - hyperbolic, 510 - hyperbolic shifts, 386 lag window affecting estimate of, -
-
-
-
-
-
-
-
439-440 local, 291 localization by WVS, 457 mean, 25 measurement of, 310 nonlinear, 510 of EEG signal, 666 - of EEG template, 669 - of electric power disturbance, 629-634 of engine noise, 637 of engine pressure signal, 635, 636, 638 power-function, 388 - property, 61, 74, 187-188, 239 unbiased estimation of, 76 uniqueness of, 86 visualization of, 72 instantaneous mixtures separation of, 325-328, 331 instantaneous phase, 20, 21, 26 parametric estimation of, 593-595 instantaneous power, 5, 33 integer-powers form, s e e polynomial WVDs interference traded against localization, 99-100 interference terms, 13 intermediate frequency (IF) definitions of, 422 interpolation, s e e oversampling inverse Fourier transform (IFT), 8 inverse Gabor transform, 42 inverse synthetic aperture radar (ISAR), 590 inverse time-frequency problems, 601-604 invertibility - of WVD, 61 invertible signal, 22 ISAR, s e e inverse synthetic aperture radar iterated projections distribution, 183 -
-
-
-
-
-
-
-
-
-
-
-
J
Jacobi diagonalization, 327, 330, 355 JAD, s e e joint anti-diagonalization
Time-Frequency Index
Jakes Doppler power profile, 413 jammer, 543, 546-547 Janssen's interference formula, 99 JBD, s e e joint block-diagonalization JD, s e e joint diagonalization jet aircraft - sound from, 599, 600 JOD, s e e joint off-diagonalization, joint anti-diagonalization joint anti-diagonalization (JAD), 327-328, s e e a l s o joint off-diagonalization joint block-diagonalization (JBD), 330-331 joint diagonalization (JD), 326-327, 351-352, 354-355 joint distributions - axis transformation approach, 123 by linked signal/axis transformations, 124-126 - construction of, 122-123 signal transformation approach, 124 joint off-diagonalization (JOD), 351-352, 354-355, s e e a l s o joint anti-diagonalization Joint Photographic Experts Group (JPEG) coding, 660, 661 joint spatial/spatial-frequency..., s e e spat ial/spat ial- frequency... -
-
K
Kaiser window - 2D extension of, 657 Karhunen-Lo~ve Transform (KLT), 261 kernel, 52, 60, s e e a l s o Doppler-frequency kernel, Doppler-lag kernel, kernel filter, time-frequency kernel, time-lag kernel - and variance of TFD, 377-379 - as filter, 70n, 71 - as in "signal kernel", 30-31 - computation of, 270-272 - decomposition into spectrogram kernels, 260-266, 502 -discrete, 240, 241, 270-272
729
Doppler-independent, 52, 55, 63, 71, 72, 76, 77, 214-215, 222, 240, 241,271 -energy of, 375-376 - for selected discrete quadratic TFDs, 240, 271 - generalized-marginal, 227 lag-independent, 55, 72, 215, 222 optimizing parameter of, 309, 312, 313 - separable, 55, 71-72, 97-98, 213-222, 292-293 singularities in, 271-272 - time-only, s e e lag-independent kernel transformations of, 67 kernel filter, 70, 71, s e e kernel Kirkwood-Rihaczek distribution, 45, s e e Rihaczek distribution Klauder wavelet, 288 KLT, s e e Karhunen-Lobve Transform (KLT) knife-edge ridge, 30 knock detection, 534, 635-637 by observing single combustions, 639-640 - by power and energy estimation, 640-641 - by S-method, 639-641 - position of sensors for, 638-639 - sampling for, 639 - signal models for, 635-638 Kohn-Nirenberg symbol, 136 -
-
-
-
-
-
L L-Wigner distribution (LWD), 248-249 optimal lag window for, 432 lag-independent kernel, 55, 72, 74, 215, 222 - and TFD properties, 216-217 smoothing effect of, 215 Laplacian distribution, 393 latency, 269 - of discrete spectrogram, 274 LD, s e e Levin distribution leakage, s e e spectral dispersion Leipnik entropy measure, 300 -
-
Time-Frequency Index
730
Levin distribution (LD), 49, 53, 54, 76, 77 - defined, 46 - discrete, 240, 241,271 LFM, s e e linear FM L I . . . , s e e lag-independent... Lie group, 104 likelihood ratio detector, 532, s e e a l s o generalized likelihood ratio test - time-frequency design, 534 - time-frequency formulation, 533 line detection, 362 linear FM signal, 6, 8, 29 as test signal, 4 - defined, 4 finite-duration, 34 - for frequency-hopped CDMA, 357 IF of, in multiplicative noise, 457-458 - optimal window duration for, 40 - TFDs of, 52 - Wigner-Ville spectrum of, 457-460 linear frequency sweep, 4 linear mixtures blind separation of, 324-333 linear phase, 23 linear time-frequency representations, 65, 102 linear time-varying channel, 410, 558 - optimal waveforms for, 566-567 linear time-varying system, 135, 402, 407, 410, 466, 528, 558 - nonlinear frequency shift in, 382 Lloyd's mirror effect, 599-601 - nodal frequencies, 600-601 local centroid, 291,292 local cross time-frequency product, 132 local energy, 72 local ergodicity, 37, 463 local frequency bandwidth, 132 local time bandwidth, 132 localization, 4 and cross-terms, 98-100 - as by-product of interference, 98, 99 -
-
-
-
-
-
4
traded against interference, 99-100 localized spectrum, s e e short-time Fourier transform logon, 41 loss function, 392-394, 396, 399 L T V . . . , s e e linear time-varying... LWD, s e e L-Wigner distribution -
M M-STFT, s e e robust S T F T machine condition monitoring, 671-672, 674, 677, 681 and diagnosis, 672-673 - data acquisition for, 672, 677, 679, 680 during transient states, 674-675 magnitude spectrum, 7, 9 Mahalanobis distance, 678 MAI, s e e multiaccess interference mainlobe amplitude, 306, 307 mainlobe bandwidth, 306 Margenau-Hill distribution, 46, s e e Levin distribution marginal median, 395 marginal properties, 33, 45, 114, 401, 402 - and the kernel, 224-225 discrete, 239 - violated by spectrogram, 73 marginals generalized, 183, 224-228, 282 Marinovich-Altes distribution, 126 masked WVD, 65 matched spectrogram, 422, 427 - c o m p u t a t i o n of, 423-427 - convergence of computation, 424-427 in IF estimation, 423-427 - proof of convergence, 424-427 - rate of convergence, 426-427 window of, 422, 426, 427 matching pursuit adaptive (modified), 511 adaptive algorithm, 512 - concept, 510 dictionary of waveforms, 511 iterative algorithm, 510 -
-
-
-
-
-
-
-
-
-
731
Time-Frequency' Index
code, 274, 275, 278, 349, 350, 352, 353, 355, 527, 665 maximum likelihood (ML), 393 MBD, s e e modified B-distribution mean IF, 25 mean instantaneous intensity, 400 mean squared error (MSE) - minimum (MMSE), 555 - o f IF estimation, 429, 430, 433 - of PWVD-based IF estimate, 453-455 mean value - of quadratic TFD, 374 median WD, 397-398 Mellin transform, 124, 283 Mellin-Fourier duality, 162 Mexican hat wavelet, 676 minimax Huber M-estimates, 392, 393, 399 minimax robust Wiener filter, 531 minimum description length (MDL), 360 missile tracking, 605-614 - and instantaneous energy, 611-614 mixing matrix, 325, 334 ML, s e e maximum likelihood mobile radio channel, 410 modified B-distribution (MBD), 51, 53, 75-77, 217-218, 310, 313 - computation of, 274-277 - discrete, 218, 240, 241, 271 - IF estimation by, 442-445 - lag-independent kernel of, 218, 441 - of bat sound, 438 - properties of, 441-442 - versus spectrogram, 439 modulation invariance, 61 moments of fractional Fourier transform, 148-151 monocomponent FM signal, 11 monocomponent linear FM signal, 76 monocomponent signal, 19, 306 imprecision of terminology, 98 optimal window duration for, 40 MATLAB TM
-
-
-
Monte Carlo method, 340, 460, 462, 535 Morlet wavelet, 676 moving targets discriminated from fixed, 595 Moyal's formula, 163, 287, 565, 668 and Rihaczek distribution, 668 related to Doppler-lag kernel, 668 - satisfied by WVD, 668 MSE, s e e mean squared error multiaccess interference (MAI), 551,554 multicomponent FM signal, 11 multicomponent signal, 19, 306, 357, 358 imprecision of terminology, 98 instantaneous frequencies of, 12, 437-441,443-446 - testing AMMP method, 515 time-frequency peak filtering of, 495, 496 - WVD of, 63 multipath propagation, 410 multipath spread, 550 multiple IFs, 12 multiplicative noise, 13, 457, 460 analyzed as additive, 380 polynomial phase estimation in, 200-201 multiplicative system, 560, 567 multiwindow Gabor filter, 470 multiwindow STFT, 530, 534 - filter based on, 468 MUSIC, 337-340, 343 - ambiguity-domain (AD-MUSIC), 337-338, 340 - time-frequency (t-f MUSIC), 338, 340-341 musical notation as time-frequency representation, 5 -
-
-
-
-
-
-
-
-
N negative frequency, 13 neural network, 622 in machine condition monitoring, 673, 678, 680 NLS, s e e nonlinear least-squares noise additive, s e e additive noise -
-
Time-Frequency Index
732
- analytic, 376 - complex white, 375, 377-380 impulse, s e e impulse noise - in quadratic TFDs, 374-376, 668 - in TFDs, 372-381 - in windowed WD, 372-374 - multiplicative, s e e multiplicative noise - nonstationary white, 374-375, 379-380 - real, 376 - stationary colored, 374-375, 380 - stationary white, 374, 377-379 - thresholding, 359, 364 noisy miner, 312, 313 noisy signals - polynomial WVD of, 448, 449 - quadratic TFDs of, 376-380 non-negativity, 73, s e e "positivity" non-orthogonal sampling, 256-257 nonlinear FM signal, 185 as basic atom, 511 - examples of, 185 - W V D of, 62 nonlinear frequency shift, 382 nonlinear least-squares (NLS) method, 598-604 nonstationary interference, 543, 546-547 nonstationary random process, 382, 400, 528 - cyclostationary, 414 -jointly underspread, 528, 529, 533 -overspread, 404, 406 - underspread, 404, 405, 528 - white, 401,402, 404 nonstationary signals, 5 - detection, 531 estimation, 528 nonstationary spectral correlation, 504, 505 notation standardization of, vii nuisance parameters, 500, 502, 503 - and covariance properties, 500, 504 - random, 507 Nyquist term, 269 -
-
-
-
O observation time optimal, 116-121 OFDM, s e e orthogonal frequency division multiplexing operator integral, 203, 383, 644 optimal signal detector, s e e likelihood ratio detector optimal signal estimator, s e e Wiener filter orthogonal frequency division multiplexing (OFDM), 417, 418, 557, 567 orthogonal-like functions, 522 orthogonal-like Gabor transform pair, 478, 480 orthogonality quasi-, 358 time-frequency, 358-360, 366 outer artifacts, 63, 73, 74, 76, 98 - suppressed by LI kernels, 217 oversampling rational, 254-256, 259 overspread process, 404, 406 overspread system, 139, 143, 529, 533 overspread WSSUS channel, 414 -
-
-
-
-
P p-point uncertainty class, 531 Page distribution, 50, 53, 54, 76, 77 - defined, 44-45 - discrete, 240, 241,271 parabolic-phase chirp, 92 parameters of signals, s e e signal models, signal parameters Parseval's relation, 163, 483, 652 - for fractional Fourier transform, 149 partial displacement operator, 104 pattern recognition in machine condition monitoring, 672, 677, 678, 681 PCA, s e e principal component analysis peak detection and tracking, 362, 364-366, 368 periodic auxiliary function, 478 -
-
Time-Frequency Index P F F s , s e e p r o d u c t - f u n c t i o n frames phase, 12 phase delay, 23 phase lag - of Hilbert transformer, 14 phase signals, 89 - regular, 89-91 phase s p e c t r u m , 7, 9 pitch, 5 polynomial FM signal, 185, s e e p o l y n o m i a l - p h a s e signal polynomial IAF, 187 polynomial phase estimation, 198-201, 450-452 - for c o n s t a n t a m p l i t u d e , additive noise, 198-200 - for multiplicative & additive noise, 200-201 - using the G A F , 196, 198-199 - using t h e G W D , 199-200 polynomial phase t r a n s f o r m ( P P T ) , 193 poly n o m i a l T F D s , 185-191, 193-202, see also polynomial W V D s - derivation of, 194-195 - m a t h e m a t i c a l foundations of, 193-194 polynomial W D , 187 polynomial W V D s , 185-191, 193, 197, 201,461-463 - coefficients of, 188-190 - cross, s e e cross p o l y n o m i a l W V D - cross-terms in, 191 - derivation of, 185-187 - discrete, 449-450 - for multiple c o m p o n e n t analysis, 453 - for single c o m p o n e n t analysis, 453 - general definition of, 187 - IF e s t i m a t i o n by, 438, 447-449, 453-455 - IF p r o p e r t y of, 187-188 - in time-frequency peak filtering, 493 - of m u l t i c o m p o n e n t signals, 191 - of noisy signals, 448, 449 - o p t i m a l lag window for, 432 - reducing to o r d i n a r y W V D ,
733 189-190 - S - m e t h o d for, 249-250 signal kernel of, 186, 448 - sixth-order, 190, 191, 462 - suppressing inner artifacts, 190 - s y m m e t r y a b o u t IF, 187-188 - with complex time, 189 - with fractional powers, 189-190 - with integer powers, 189-190 p o l y n o m i a l - p h a s e signal (PPS), 185, 460, 461 - detection of, 196 - e s t i m a t i o n of phase, s e e p o l y n o m i a l phase e s t i m a t i o n - in additive noise, 447-455 - in m o d e l of r a d a r echo, 593-595 - m a t c h e d s p e c t r o g r a m of, 427 positivity, 62, 73, 78 power class, 643-646 - applied to b e a m impulse response, 648-649 - applied to localized signal analysis, 643-644 - formulation of, 645-646 - i m p o r t a n c e of, 643 - ~th, 644-645 - m e m b e r s of, 646-647 - t e s t e d on s y n t h e t i c data, 647-648 power impulse, 643 power quality, 628-629 - time-frequency assessment of, 629-631 power spectral density (PSD), 36, 400 - frequency resolution in, 307 power s p e c t r u m , 153 - t i m e - d e p e n d e n t , 153 power time shift property, 644 power W i g n e r distribution, 646 - s m o o t h e d pseudo, 646-649 power-function - time-frequency represent at i on, 388 - t r a n s f o r m a t i o n , 388 powergram, 646, 647 P P S , s e e p o l y n o m i a l - p h a s e signal principal c o m p o n e n t analysis ( P C A ) , 261 -
Time-Frequency Index
734
product kernel, 71 product transform, 317, 321 product-function frames (PFFs), 321 - for non-periodic spaces, 317-319 - for periodic spaces, 316-317 projective group representation, 103 projectogram, 262 Prony method, 622, 624 PSD, s e e power spectral density pseudo-frame, 315 pseudo-Wigner distribution - affine-smoothed, 647-649 pseudo-WVD, s e e windowed WVD P W D (pseudo-Wigner distribution), s e e windowed WVD PWVDs, s e e polynomial WVDs
Q Q-distribution, 209, 387, 515 and the ambiguity function, 162 Q T F R s (quadratic time-frequency representations), s e e quadratic TFDs, bilinear/quadratic time-frequency representations quadratic class, 68, 102, 106, 110, 168, 170, 203, 501,643 quadratic FM signal IF estimation of, 462 quadratic TFDs, 67, 74, 102, 170 - adaptive, 180-183 - affine class of, 643, 645-650 - computation of, 268-278 - design of kernels for, 71, 170-173 - desirable properties of, 72-73, 76, 440-441 - detection by, 503-509 - discrete, 232-241 generalized-marginal, 225-228 - hyperbolic class of, 643 IF estimation by, 438-441 in machine condition monitoring, 674 - kernel decomposition, 260-263 noise in, 374-376, 668 of noisy signals, 376-380 - power-class, 643-650 -
-
-
-
-
-
-
properties related to kernel, 74-75, 216-217, 239-241 real-time computation of, 269-270 - S-method in, 242-245 - subclass P of, 73 - subclass P ' of, 73 table of, with properties, 77 time/frequency covariance of, 643 - variance of, 374-376, 381 with Doppler-independent kernels, 52, 55, 63, 71, 72, 76, 77, 214-215, 240, 241,271 with lag-independent kernels, 55, 72, 215 with reassignment, 292 - with separable kernels, 55, 71-72, 97-98, 213-222, 292-293 quadrature signal, 14 quasi-energy density, 508 quasi-orthogonality time-frequency, 358 -
-
-
-
-
-
-
-
R
radar - and the ambiguity function, 69, 160 - fluctuating target in, 457 HF line-of-sight, 605-608, 610, 612 radially Gaussian kernel, 182 Radon transform - generalized, 283 Radon-Wigner distribution - rotation of, 224 Radon-Wigner transform, 148 RAKE receiver, 551-552, 555-556 random time-frequency shift, 413 random time-varying channel, 410 range-velocity transformation, 160 rational oversampling, 254-256, 259 Rayleigh fading, 551,553, 564 RC, s e e reconstruction collection RD, s e e Rihaczek distribution real-time computation, 269-270 real-time filtering - of speech, 481,483 realness, 60, 74 -
735
Time-Frequency Index
exploitation of, 269-27O - of polynomial WD, 187 - o f robust TFDs, 398 - of robust WD, 396 reassigned smoothed pseudo Wigner-Ville distribution (RSPWVD), 292-293, 591, 593-594 reassignment points - asymptotic, 294 reassignment principle, 290-295 and bilinearity, 292 and time-frequency "squeezing", 292, 295 - and time/frequency shift invariance, 292 differential reassignment, 294 - fixed points, 295 - for component separation, 294 - for quadratic TFDs, 292 - for scalogram, 295 - for separable kernels, 292-293 for signal/noise discrimination, 294 - for spectrogram, 290-292 - for the affine class, 293-294 - histogram, 295 - supervised reassignment, 294 reconstruction collection, 315, 316, 318 reduced interference, 73, 74 reduced-interference distribution, 66, 76, 213 - adaptive, 183 - compared, 175 - computation of, 260 design of, 76 - design of kernels for, 170-173 - discrete kernels for, 173-175 - in Doppler-lag domain, 169-170 - of dolphin click, 176 - of electric power disturbances, 628-631,634 optimization of, 173 - support properties, 171 regular phase signals, 89, 90 - properties of, 91 -
-
-
-
-
c
o
m
p
u
t
a
t
i
o
n
a
l
relaxation time, 26, 40 R@nyi entropy, 298-300, 305 reproducing kernel, 162 residual spectrogram, 394, 398 resolution, 73 evaluation of, 309 - of components, 307 resolved components, 307 RID, s e e reduced-interference distribution Rihaczek distribution (RD), 48, 53, 76, 77 and Moyal's formula, 668 - defined, 45 discrete, 240, 241,271 Rihaczek spectrum, 401 ringing, 182, 659 road detection algorithm, 362, 367 road network tracking algorithm in underdetermined BSS, 362-366 robust spectrogram, 392-395, 397, 398 robust S T F T , 392-395 iterative realization, 394-395 - vector filter realization, 395 robust time-varying Wiener filter, 530 robust Wigner distribution, 396-398 - properties of, 396-397 rotation property, 223 R S P W V D , s e e reassigned smoothed pseudo Wigner-Ville distribution running energy spectrum, 44 running spectrum, 43 running transform, 44 RWT, s e e Radon-Wigner transform -
-
-
-
-
-
-
S
S-method, 242-251 affine form, 245 and L-Wigner distribution, 248-249 - and polynomial WVDs, 249-250 basic form, 242-244 - concentration of, 301-303 - cross-terms in, 243 discrete realization, 245-248 -examples, 246-248, 250 -
-
-
-
Time-Frequency Index
736
- forms in quadratic TFDs, 242-245 - fractional domain form, 244 - in spectral subtraction filtering, 487 - related to STFT, 639 sensitivity to noise, 379 - time direction form, 244 sampling - for discrete quadratic TFDs, 236-238 - for discrete spectrogram, 272-273 - for discrete WVD, 232-234 - for windowed DWVD, 235 ideal, 232 SAR, s e e synthetic aperture radar scalar product, s e e inner product scale covariance, 172, 210 scale modulation, 318-319, 321 scale operator, 124 scaling property, 644 scalogram - cross-terms in, 94, 622 - related to wavelet transform, 94, 294, 621 with reassignment, 295 scattering, 506, 615, 616, 621,623-624 - by rotating rigid body, 591-593 - in missile tracking, 607, 609, 611-613 scattering function, 411,506, 551 seizures definition of, 663-664 linear FM with decreasing frequency, 665, 667 - linear FM with quasi-constant frequency, 665, 667 - matched detection of, 666 - neonatal vs. adult, 663 piecewise-linear FM, 666-667 - time-frequency detection of, 666-669 time-frequency patterns of, 665-667 self-adjoint operator, 122 eigensystem of, 262, 264 separable kernel, 55, 71-72, 74, 77, -
-
-
-
-
-
-
-
97-98, 213-222 - and TFD properties, 216-217 - design examples, 217-218 - general theory of, 213-214 numerical examples, 218-221 separate convolutions in t and f, 71 separation - of components, 364 separation measure, 308 Shannon entropy, 297n, 299-300 short-time ambiguity function, 182 short-time Fourier transform (STFT), 102, 106, 110, 468, 477, 500 adaptive, 178-180 and S-method, 639 and the ambiguity function, 162 - defined, 38 in realization of higher-order TFDs, 248-250 - in realization of quadratic TFDs, 242-245 - in speech filtering, 483 multiwindow, 530, 534 - robust, 392-395 SIAF, s e e spatial instantaneous autocorrelation function sidelobe amplitude, 306, 307 Siebert's self-translation property, 164 signal classification, 510 signal decomposition by matching pursuit algorithm, 510, 513 signal detection, 531 signal enhancement, 528 signal estimation, 528 signal formulations, 12 signal kernel, 30-31, 194 - for polynomial WVD, 186, 448 signal measures - extended to two dimensions, 129-130 in one dimension, 129 signal models, 12 - parameters of, 12, 13 signal parameters, 310 signal spread, 41 signal transformation, 124-125 -
-
-
-
-
-
-
-
Time-Frequency Index significance - index of, 281, 283, 289 singular function, 558-561, 564-566 singular value, 559, 561, 564-566 singular-value decomposition (SVD), 563-564, 622 sinusoidal FM signal, 39 - defined, 4 IF of, Slepian bandwidth, 17 Slepian duration, 17 Slepian, David, 17 SM, s e e S-method smearing, 39n, s e e smoothing smoothed pseudo Wigner-Ville distribution, 97-98, 292-293 - has separable kernel, 213, 222 - reassigned ( R S P W V D ) , 292-293, 591, 593-594 smoothed SIAF, 349 smoothed WVD, 51,312 smoothing - as convolution, 48 - versus "squeezing", 292 sonar, 510, 615-618 - and the ambiguity function, 160, 618 sonogram, s e e sonograph sonograph, 42 - defined, 43 - related to spectrogram, 43 source parameter estimation, 601-604 - broadband, 603-604 - narrowband, in air, 601-602 - narrowband, in water, 602-603 - wavelet denoising for, 603 sparse decomposition, 366 spatial instantaneous autocorrelation function (SIAF), 349 - smoothed, 349 spatial T F D (STFD), 325, 334-343, 349-356 - auto-term points, 359 - auto-terms in, 351,353 - cross-terms in, 3 4 1 , 3 5 1 , 3 5 3 - for blind source separation, 326-331 -
4
737
- in blind source separation, 357-368 - matrices, 329-331 - quadratic, 334, 349 - structure under linear model, 351 spatial/spatial-frequency representations, 652-654, 658-661 image quality and, 651 spectral autocorrelation function, 35, 66 spectral complexity, 116 spectral disjointness, 15 spectral information, 116-121 for nonstationary signals, 118-121 for stationary signals, 116-118 spectral subtraction - SM-based, 487 485, 487 spectrogram, 39, 47, 53, 54, 76, 77, 501 - adaptive, 178-180 approximate decomposition into, 264-266, 506 as energy distribution, 128 - as special case of S-method, 243 - auto-, 260 - computation of, 277-278 concentration of, 301 260, 263, 266 cross-terms in, 94-95 decomposition into, 260-266, 502, 505 - defined, 38 -discrete, 240, 241, 271-274, 277-278 IF estimation by, 437-438 in spectral subtraction filtering, 485, 487 in speech filtering, 483-484 in time-varying Wiener filter, 484 limitations of, 153, 168 marginals violated by, 73 - matched, s e e matched spectrogram of bat sound, 438 optimal window duration for, 39, 40, 432 - related to sonograph, 43 -
-
-
-
s
p
e
c
t
r
s
-
,
-
-
-
-
-
-
-
-
-
-
-
-
-
-
c
r
o
s
o
g
r
a
m
-
b
a
s
e
d
,
Time-Frequency Index
738
residual, 394, 398 - robust, 392-395 sensitivity to noise, 379 - subsumed by quadratic TFDs, 77 - variance of, 381 - with reassignment, 290-292 speech time-varying filtering of, 481-486 speech enhancement, 473 spread function, s e e delay-Doppler spread spread-spectrum communication code-division multiple-access (CDMA), 357, 549, 551-557 - described, 542 - direct-sequence, 542 interference mitigation in, 542-547, 553-557 reduction of fading in, 553 spreading function, 550 dispersive, 386 generalized, 137, 403, 411,469 - hyperbolic, 387 - narrowband, 384 - power, 388 - wideband, 384 SPWVD, s e e smoothed pseudo Wigner-Ville distribution SS..., s e e spread-spectrum... statistical cross-terms, 406 steering vector, 334 STFD, s e e spatial TFD STFT, s e e short-time Fourier transform STFT filter, 468 strong frequency support, 46 strong time support, 46 supervised reassignment, 294 Sussman ambiguity function, s e e ambiguity function symmetrical ambiguity function, s e e ambiguity function synchrosqueezed plane, 295 synthetic aperture radar (SAR) basic principles, 590-591 - inverse (ISAR), 590 - RSPWVD used in, 591, 593-594 system -
-
-
-
-
-
-
-
-
- adjoint, 140 - approx. eigenfunctions/eigenvalues of, 141 - halfband, 472 innovations, 401 linear frequency-invariant, 135 linear time-invariant, 135 linear time-varying, 135, 402, 407, 410, 466, 528, 558 -overspread, 139, 143, 529, 533 - random time-varying, 410 - underspread, 138, 140, 407, 467, 528, 558-567 - wideband, 390 system identification, 519-527 using chirp signal, 519 using pseudo-random signal, 519 -
-
-
-
-
-
T t-f MUSIC, s e e MUSIC TBM (theater ballistic missile), s e e missile tracking TBP (time-bandwidth product), s e e bandwidth-duration (BT) product test statistic, 345-347, 502-508, 532, s e e a l s o detection -optimal, 345-347, 502 TFAR..., s e e time-vs-frequency autoregressive... TFDs, s e e time-frequency distributions T F M V . . . , s e e time-vs-frequency minimum-variance... TFPF, s e e time-frequency peak filtering, discrete TFPF TFRs, s e e time-frequency representations, time-frequency distributions TFSA package, 190, 278, 665 TFSP, s e e time-frequency signal processing thresholding constant, 523-524 time average, 37 time covariance, 60, s e e time-shift invariance time delay, 9, 22, 25, 26, 61 critique of terminology, 22n -
Time-Frequency Index
time delay p r o p e r t y - discrete, 239 time extent, 74 time marginal, 60 time of arrival - of electric power disturbances, 631-634 time resolution, 39 time support, 61, 74, 171 - discrete, 239 - strong, 46 time window, 35, 36, 70 time-advanced signal, 269 t i m e - b a n d w i d t h product (TBP), s e e bandwidth-duration (BT) product time-frequency analysis - matching, 206 time-frequency correlation, 403 time-frequency correlation function, 411 time-frequency correlation spread, 404 time-frequency correlator, 668 time-frequency displacement operator, 103 time-frequency distributions, 9 - adaptive, 178-183, 513 - affine, 209 - and ambiguity functions, 161 - comparison of, 305, 310, 312, 313 - concentration of, 297-304, see also concentration - density-class, 128 - desirable characteristics of, 11 detection by, 502-509, 619-621 - discrete, 232-241 - energy-class, 128 exponential, 210 - GDS-covariant, 203-212 generalized-marginal, 225-228 high-resolution linear, 581-589 - hyperbolic, 208 IF estimation by, 429-446 - of helicopter radar data, 578-589 - of hybrid e n e r g y / d e n s i t y class, 128 309, 310 - optimizing performance of, 309, -
-
-
-
-
-
o
p
t
i
m
a
l
,
739 310 performance criteria for, 306 - performance measure for, 309, 312 - performance of, 305 - polynomial, s e e polynomial TFDs, polynomial W V D s - poorly-performed, 309 power-class, 209, 643-650 - quadratic, s e e quadratic T F D s robust, 392-399, s e e robust spectrogram, robust S T F T , robust Wigner distribution, median W D selection of, 305, 309, 310, 312, 313 - well-performed, 309 time-frequency domain, 70 time-frequency filter, 466 - adaptive notch, 544-547 - applications of, 473 discrete-time formulation of, 472 explicit design of, 466 for noisy speech, 481-486 - Gabor, 470 - G a b o r expansion based, 476-480 - generalized Weyl, 467 implicit design of, 466 - iterative algorithm for, 522 - multiwindow Gabor, 470 - multiwindow S T F T , 468 - projection, 467, 546-547 - SM-based, 485-486 spectral subtraction, 482, 484-487 spectrogram-based, 486 - S T F T , 468 system identification using, 519-527 - Weyl, 467 - Zadeh, 467 time-frequency kernel, 51, 67, 74 time-frequency matched detector criteria for, 669 cross-correlation stage, 669 detection loop, 668 for E E G seizures, 668-669 o u t p u t of, 669 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Time-Frequency Index
740
- preprocessing stage, 668 signal restructuring stage, 668 time-frequency measures - generation of, 130 - properties & applications of, 132-134 - properties & interpretation of, 131-132 time-frequency peak filtering (TFPF), 489-496 - bias-variance tradeoff, 494 - concept, 489 definitions, 489-490 discrete signal scaling, 493-494 discrete window length selection, 494 iterative algorithm, 494 of EEG signal, 495, 496 of multicomponent signal, 495, 496 - principles, 490-491 - properties, 491-492 time-frequency projection filter, 467, 546-547 time-frequency pseudo-Wiener filter, 530 time-frequency reassignment, see reassignment principle time-frequency representations, see also time-frequency distributions linear, 65, 102 time-frequency scaling operator, 104, 110 time-frequency shift operator, 102, 136 -
-
-
-
-
-
-
-
T i m e - F r e q u e n c y Signal A n a l y s i s
package, 190, 278, 665 time-frequency signal detector, 534 time-frequency signal estimator, 530 time-frequency signal processing (TFSP), 11 time-frequency transfer function, 140 time-invariant system, 135 time-invariant WSSUS channel, 413 time-lag kernel, 48, 51, 65, 67, 69, 501 of B-distribution, 665 - of spectrogram, 501 time-limited signal, 16, 25, 35 -
time-Mellin distributions, 126 time-only kernel, 215, see lag-independent kernel time-scale analysis in machine condition monitoring, 674 time-scale distributions, 126, 508 time-shift invariance, 60, 74, 172, 203, 293 - of robust WD, 397 preserved by reassignment, 292 time-varying amplitude, 12 time-varying channel, 410 time-varying components, 12 time-varying frequency, 12 time-varying higher-order spectra, 462, 463 time-varying impulse response, 382 time-varying power spectrum, 400, 405, 4O8 time-varying spectrum, 37, 38 time-varying system, 135, 382, 402, 407, 410, 466, 528 eigenfunctions of, 558-559 time-varying transfer function, 382, 558, 564-566 time-varying Wiener filter, 485, 486, 529 - SM-based, 487 - spectrogram-based, 484, 487 time-vs-frequency autoregressive (TFAR) method, 583, 585 time-vs-frequency minimum-variance (TFMV) method, 583, 585, 589 Toeplitz factorization, 243 tomographic methods, 282-284 total harmonic distortion (THD), 628 transfer function approximations, 140 transformation, see axis transformation, signal transformation transient states - of machines, 674 transitory evolutionary spectrum, 402 tuning width, 41 turboprop aircraft - sound from, 597, 601 turbulence, 457 -
-
-
Time-Frequency Index
TV-HOS,
s e e time-varying higher-order spectra two-dimensional filtering, 70, 71
U uncertainty, s e e a l s o Heisenberg... - and the ambiguity function, 164 - entropic, 164 - in time-frequency plane, 114-116 - measures of, in (t, f) plane, 115-116 uncertainty class, 531 underdetermined blind source separation, s e e underdetermined BSS underdetermined BSS (UBSS) - for FM-like signals, 357-368 - peak detection and tracking, 364-366 - road network tracking, 362-366 - vector clustering, 358-361 underspread approximations, 140, 405, 415, 529, 533 underspread process, 404, 405, 528 underspread system, 138, 140, 407, 467, 528 - eigenfunctions of, 558-567 - linear, 558-567 underspread WSSUS channel, 414, 507 - approximate eigenfunctions and eigenvalues, 415 sampling approximation, 417 underwater acoustic channel, 410 uniform linear array (ULA), 344 unitarity - of affine Wigner function, 287-289 unitary equivalence, 124 unitary equivalence principle, 112 unitary group representation, 103 unitary transformation, 62, 205, 282, 327 unresolved components, 307 -
741
of IF estimation, 429, 431-433, 435, 440-443, 445, 458-460 - o f quadratic TFD, 374-376, 381 - of spectrogram, 381 - of windowed WVD, 373, 380 vector clustering, 357-363, 366-368 distance measure, 360 vector filter, 395 vector median, 395 Vibroseis signal, 4 Ville, J., 33, 161,422 volume invariance of ambiguity function, 163 -
-
-
W warp, 124-127 - dispersive, 205, 646 -operator, 124, 125, 205, 385, 646 - power, 646, 647 signal, 207, 386, 646 time-frequency representation, 207, 387, 512, 646 transformation, 124, 385, 646, 649 unitary, 646 warped Wigner distribution, 126 wavelet packet transform (WPT), 320, 321 wavelet transform, 106, 110, 294, 319, 621,622, 624, 676, 677, s e e a l s o scalogram - and discontinuities, 677 and the ambiguity function, 162 - and transient events, 675, 677 - discrete, s e e discrete wavelet transform in fault diagnosis of rotating machinery, 678-680 in machine condition monitoring, 674 of electric power disturbances, 628, 629 wavelets, 288 as basis functions, 676 Klauder, 288 Mexican hat, 676 - Morlet, 676 - mother, 676 - packets, 680 -
-
-
-
-
-
-
-
-
-
-
V variance - mean value of, 378
Time-Frequency Index
742
- theory of, 42 WDF, s e e windowed data function Weyl correspondence, 346, 501 Weyl filter, 467 Weyl spectrum, 402 Weyl symbol, 135, 346, 467, 501,506, 531,534, 558 - discrete-time, 472 dispersive, 385 generalized, 135, 400, 402, 408, 411,467, 528 - hyperbolic, 387 narrowband, 382 power, 388 wideband, 384 Weyl-Heisenberg group, 282 whale song, 133, 134 white nonstationary process, 401,402, 404 whitened STFD matrix, 326, 327 whitening, 326, 329-330, 351-353 wide-sense stationary uncorrelated scattering (WSSUS), 411,506, 551, 553, 557 Wiener filter, 482, 543 - robust, 530 - time-frequency design, 530 - time-frequency formulation, 529 - time-varying, 484-487, 529 Wiener-Khintchine theorem, 36, 403 Wigner bispectrum, 623 Wigner distribution, 31,207, 647 artifacts in, 32 generalized, 400, 533 nonlinearity of, 31 power, 646-649 Wigner, E. P., 31n Wigner-Ville distribution (WVD), 33, 48, 53, 54, 66, 76, 77, 500 2D, 651,659-660 2D continuous, 652-654 - 2D discrete, 655-659 - and Moyal's formula, 668 - as special case of polynomial WVD, 189-190 - as special case of S-method, 243 - cross-terms in, 94-96 -
-
-
-
-
-
-
-
-
-
-
- discrete, s e e discrete WVD discrete windowed, s e e windowed DWVD holographic nature of, 99 - in derivation of Gabor spectrogram, 155 in gearbox fault detection, 677-678 in missile tracking, 608-614 - in terms of the spectrum, 34 limitations of, 62, 153-154, 168 - local centroid of, 291 - masked, 65 median, 397-398 of linear FM signal, 33 - polynomial, s e e polynomial WVDs of, 60-62, 74, 153 - pseudo-, s e e windowed WVD robust, 396-398 - rotation of, 146-147, 223 satisfies a l l generalized marginals, 226 smoothing of, 71, 72, 311 - spectrogram decomposition, 262-263 - windowed, s e e windowed WVD Wigner-Ville spectrum, 37, 383, 401, 481-483, 491,501,504, 505, 531,534, 637, 639, 641 generalized, 400, 414, 528 - IF localization by, 457 in additive noise, 458-460 in multiplicative noise, 457-459 knock detection by, 639 of engine signal, 638 of single combustion, 639-640 of, 457-458 Wigner-Ville trispectrum, 328 window Hamming, 218 218, 301,373, 380 windowed data function (WDF), 578, 581-582 windowed DCT, 318 windowed DWVD, 234-235, 240, 241, 271 -
-
-
-
-
-
-
-
p
r
o
p
e
r
t
i
e
s
o
p
t
i
m
a
l
i
t
y
-
-
-
-
-
-
-
-
-
-
-
-
H
a
n
n
i
n
g
,
Time-Frequency Index
- as discrete quadratic TFD, 237 - computation of, 268 - sampling for, 235 windowed Levin distribution, 47, 50, 53, 54, 76, 77 - discrete, 240, 241,271 windowed Rihaczek distribution, 47, 49, 53, 76, 77 - discrete, 240, 241,271 windowed WVD, 36, 51, 53, 55, 63, 72, 76, 77, 339, s e e a l s o Doppler-independent kernel 2D continuous, 654-655 adaptive lag window for, 429, 431,433-436 - also called pseudo-WVD, 235 - discrete, s e e windowed D W V D - kernel of, 215 - noise in, 372-374 - optimal lag window for, 373-374, 429-436 sensitivity to noise, 379 - smoothed, 97-98, 292-293 - variance of, 373, 380 Woodward ambiguity function, s e e ambiguity function W P T , s e e wavelet packet transform WSSUS, s e e wide-sense stationary uncorrelated scattering WSSUS channel, 4 1 1 , 5 5 1 , 5 5 3 , 557 - delay power profile, 412 - delay spread, 415 - delay-Doppler spread, 415 - Doppler power profile, 412 - Doppler spread, 415 frequency correlation function, 413 frequency-invariant, 413 - overspread, 414 - random time-frequency shift, 413 scattering function, 411,551 statistical input-output relations, 414 time correlation function, 413 time-frequency correlation function, 411 time-invariant, 413 -
-
-
-
-
-
-
-
-
-
743
-
WT,
underspread, 414 wavelet transform
see
X X P W V D , s e e cross polynomial W V D XWVD, s e e cross-Wigner-Ville distribution Z
Zadeh's time-varying transfer function, 135, 467 Zak transform, 254, 255, 258, 259 and Fourier transform, 254 - defined, 254 Zakai entropy parameter, 300 Z A M . . . , s e e Zhao-Atlas-Marks... zero-padding, 234, 238, 268 Zhao-Atlas-Marks (ZAM) distribution, 51, 53, 55, 74, 76, 77, 171 - computation of, 270-271 - discrete, 240, 241,271 limitations of, 182-183 -
-
This Page Intentionally Left Blank