![]() |
|
TN201 |
|
The Fourier transform is frequently used to analyze and modify real world signals and waveforms. The Fourier transform is usually defined as:
When the independent variable corresponds to time, the transform variable corresponds to frequency, in which case X(f) is called the spectrum of x(t). The original x(t) may be recovered from X(f) by the inverse Fourier transform, defined by:
In the discrete-time, sampled-data domain of the digital computer, the Fourier transform becomes the discrete-time Fourier transform. The continuous-time variable t becomes the sample number n, and integrals become summations over a finite block of samples. The discrete-time Fourier transform (DFT) of a sequence of N samples is defined by:
As in the continuous-time case, we can recover the original sequence by an inverse transform. The inverse DFT (IDFT) is given by:
The summations above require on the order of N2 operations to evaluate, a number which grows impractically large as the number of samples increases. The fast discrete-time Fourier transform, or FFT, is a DFT algorithm whose computation time grows only as Nlog2N. For even modest values of N, the speed-up afforded by the FFT algorithm can be dramatic.
It should be stressed that the FFT is merely a fast algorithm for computing the DFT. It has exactly the same mathematical properties as the DFT. In the discussions below, we will usually refer to DFT with the understanding that it is computed using the FFT algorithm.
Applications
Probably the most common use of the DFT is to compute the frequency spectrum of a sampled time waveform, as might be obtained from a microphone or accelerometer output digitized by an analog-to-digital converter. Radar, sonar, and vibration analysis, for example, make extensive use of the DFT to extract and reveal signal characteristics that would otherwise be difficult or impossible to perceive from the time waveforms alone. A practical example using the Rabbit 2000 in vibration analysis is the monitoring of rotating machinery. By attaching one or more accelerometers to a piece of rotating machinery and computing the DFTs of their outputs, one can obtain an operating "signature" of the machinery. Changes in the signature over time can give early warning of mechanical problems or impending failure.
Because of its computational speed, the FFT is often used to perform fast discrete convolution. Discrete convolution normally requires on the order of N2 operations. This number can be reduced to the order of Nlog2N by first transforming the time-domain waveforms to the frequency domain, multiplying the transforms together, and then inverse transform back to the time domain. For large values of N, the speed-up can be substantial. Fast convolution is especially attractive when a sampled-data sequence must be convolved with a filter function whose FFT can be computed and stored ahead of time.
Another application of the FFT is in band-limited interpolation, in which a sequence's sample rate is increased by interpolating samples between original samples. For example, imagine that we have a sequence to convert to an analog waveform by a digital-to-analog converter. The waveform might be a stimulus for an experiment or a segment of speech. Such waveforms are often computed or stored at the minimum possible sample rate to save processor cycles and memory. Operating the digital to analog converter at the lowest possible rate complicates the design of the analog reconstruction filter used to smooth the DAC's stepped output. The requirements of the reconstruction filter can be relaxed considerably by increasing the sample rate into the converter. The DFT provides a convenient way to increase the sample rate without altering the waveform's character.
To see how the DFT can be used to increase the sample rate, suppose that the sequence to be converted has a sample rate of 2,500 samples per second and that we wish to increase the sample rate by a factor of 4 to 10,000 samples per second. We start with a convenient block of, say, 256 samples, and then compute its DFT. Since the samples represent a real signal, we use a version of the FFT that transforms 256 purely real samples into a 128-point complex spectrum that extends from zero frequency to 1.25 kHz. (The real-to-complex version of the FFT is about twice as fast as the complex-to-complex FFT.) Next we append 384 zero-valued points to the 128-point positive-frequency spectrum. This enlarged 512-point spectrum is precisely the spectrum that would have been computed had the original sequence been sampled at 10,000 samples/s. Performing a 512-point inverse DFT on the enlarged spectrum (using a complex-to-real version of the IFFT) produces a block of 1024 real samples that perfectly reproduce the waveform contained in the original 256-sample block.
Windowing
The DFT can be often viewed as a bank of digital filters, all having identical shape and having center frequencies whose absolute values range from zero to Fs/2, where Fs is the sampling frequency. The frequency response of these filters is described by sin(Npf)/sin(pf). Three adjacent responses are shown in Figure 1.
The bank of filters can be thought of as dividing the spectrum into a series of bins. Unfortunately the filters are often not so selective as one would like. First of all, the main lobes of adjacent filters overlap. As a result, a frequency component that falls in between the center frequencies of two filters contributes to the outputs of both filters. Second, because the filters have sizeable side lobes that extend far from the main lobe, a frequency component that falls in between the centers of two filters also contributes to the outputs of distant filters as well. Consequently, a given DFT output bin may contain misleading contributions from frequency components that lie far away from its center frequency.
While the overlap of the main lobes is unavoidable, the poor side lobe attenuation of the DFT can be improved by a process called "windowing." Windowing modifies the frequency response of the FFT bins by giving a lower weight to samples that lie closer to the ends of the block of input samples. A frequently used window function, and the one implemented by the Rabbit FFT library, is the Hann window, sometimes called the "raised cosine" window. It is defined as
Each input sample is multiplied by the corresponding window sample before being transformed. Figure 2 compares the Hann window frequency response to the basic response of the DFT. The improved side-lobe response is clearly evident. With the improved side-lobe response comes what is usually a minor drawback: the width of the main lobe produced by the Hann window is twice the width of the basic response.
(A note on implementation. The Hann window has a particularly simple, three-point DFT. In the Rabbit FFT library, the Hann window function convolves the three-point DFT with the FFT output spectrum. This approach has the advantage of not requiring the computation or storage of raised cosine curves for all possible FFT sizes.)
Real Signals
Practical applications of the DFT usually deal with real signals, as might be produced by an analog-to-digital converter. It is possible to use an N-point complex DFT to analyze a 2N-point real input sequence. The computed N-point complex spectrum will be the positive-frequency portion of the sequence's complex spectrum. No information is lost by computing only the positive-frequency portion of the spectrum because the spectrum of a real sequence is conjugate symmetric. This means that the negative-frequency portion of the spectrum can be computed from the positive-frequency spectrum via the relations:
Re X[-f ] = Re X[f ]
Im X[-f ] = - Im X[f ]In practice the negative-frequency portion of the spectrum is rarely needed.
The Rabbit FFT library contains both complex-in/complex-out and real-in/complex-out FFT and their inverse counterparts.
Power Spectrum
When the DFT is used to analyze a signal (either real or complex), one is often concerned not so much with the amplitude and phase of the signal's spectrum as with either its power or its magnitude. The power spectrum is given by:
Power[k] = (Re X[k])2 + (Im X[k])2When the magnitude is required, it can be computed by extracting the square root of each term in the power spectrum. The Rabbit FFT library provides a fast, full-precision function to compute the power spectrum from a complex spectrum.
THE RABBIT FFT LIBRARY
The Rabbit FFT library comprises seven functions:
fftcplx()
fftcplxinv()
fftreal()
fftrealinv()
hanncplx()
hannreal()
powerspectrum()These functions operate on a sequence or a spectrum stored in a single array. Data in the array are considered to be in the form of signed, 16-bit fractions with values that range from -1 to 1 - 215. The effect of interpreting the data as fractions shows up when two 16-bit numbers are multiplied together to produce a 32-bit result. The 16-bit fractional arithmetic retains the 16 more significant bits of the product, whereas 16-bit integer arithmetic retains the 16 less significant bits.
The best numerical accuracy is obtained when the data to be transformed (or inverse transformed) are multiplied by a scaling factor so as to be as large as possible and still be representable by 16 bits. This means that the data should be scaled so that the resulting values lie between -32768 and +32767 (which correspond to fraction values -1 and +32767/32768). For example, for best results a sequence whose values range from -2048 to +2047 should be scaled so that 2047 becomes 32767. In this case, the scaling factor is exactly 16. For a sequence whose values range from zero to +200, the optimum scaling factor is 32767/200 or 163. (One might be tempted to use a suboptimum scaling factor of 128 and accomplish the scaling by left-shifting seven positions. But because the Rabbit's multiply instruction is so fast, left-shifting seven places takes twice as long as multiplying.)
Depending on the particular function, the array data may be either complex or real. Complex values are stored such that real parts are contained in the even-numbered array positions and the imaginary parts are contained in the odd-numbered array positions.
The kth complex point in the array is given by
Re X[k] = x[ 2k ]
Im X[k] = x[2k + 1]Block floating point arithmetic is used to prevent arithmetic overflow. A binary exponent is associated with the entire array of data, viz,
xk = x[k] × 2blockexpDuring processing any addition or subtraction that would result in an arithmetic overflow causes the entire array to be scaled down by a factor of two, thereby preventing the overflow. Each time the array is scaled, its block exponent is increased by one to compensate for this scaling. The actual value of the exponent is arbitrary, and the calling program may set it to any convenient value.
The Complex FFT Functions
These functions are in
FFT.LIB.
fftcplx
void fftcplx(int *x, int N, int *blockexp);Description
Computes the complex DFT of the
N-point complex sequence contained in the arrayxand returns the complex result inx.Nmust be a power of 2 and lie between 4 and 1024. An invalidNcauses a RANGE exception. TheN-point complex sequence in arrayxis replaced with itsN-point complex spectrum. The value ofblockexpis increased by 1 each time arrayxhas to be scaled, to avoid arithmetic overflow.Parameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
fftcplxinv
void fftcplxinv(int *x, int N, int *blockexp);Description
Computes the inverse complex DFT of the
N-point complex spectrum contained in the arrayxand returns the complex result inx.Nmust be a power of 2 and lie between 4 and 1024. An invalidNcauses a RANGE exception. The value ofblockexpis increased by 1 each time arrayxhas to be scaled, to avoid arithmetic overflow. The value ofblockexpis also decreased by log2Nto include the 1/Nfactor in the definition of the inverse DFTParameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
The Real FFT Functions
The
fftreal()andfftrealinv()functions manipulate real sequences and positive-frequency spectra. The spectrum of a 2N-point real sequence has 2N complex points. However, it also has conjugate symmetry, so that its N + 1 positive-frequency components contain all the spectral information. The imaginary part of the zero-frequency, or dc term, and the imaginary part of the maximum-frequency, or fmax term, are always zero. The fmax term is the N + 1 term of the 2N-point spectrum, and so lies one position beyond the N-point positive-frequency complex spectrum. However, since the fmax term is purely real, we can store its value in the otherwise zero imaginary part of the dc term.These functions are in
FFT.LIB.
fftreal
void fftreal(int *x, int N, int *blockexp);Description
Computes the
N-point, positive-frequency complex spectrum of the 2N-point real sequence in arrayx. The 2N-point real sequence in arrayxis replaced with itsN-point positive-frequency complex spectrum. To avoid arithmetic overflow, the value ofblockexpis increased by 1 each time arrayxhas to be scaled,.The imaginary part of the X[0] term (stored in x[1]) is set to the real part of the fmax term.
The 2
N-point real sequence is stored in natural order. The zeroth element of the sequence is stored inx[0], the first element inx[1], and the kth element in x[k].
Nmust be a power of 2 and lie between 4 and 1024. An invalidNcauses a RANGE exception.Parameters
xPointer to 2
N-point sequence of real fractions.NNumber of complex elements in output spectrum
blockexpPointer to integer block exponent.
fftrealinv
void fftrealinv(int *x, int N, int *blockexp);Description
Computes the 2
N-point real sequence corresponding to theN-point, positive-frequency complex spectrum in arrayx. TheN-point, positive-frequency spectrum contained in arrayxis replaced with its corresponding 2N-point real sequence. The value ofblockexpis increased by 1 each time arrayxhas to be scaled, to avoid arithmetic overflow. The value ofblockexpis also decreased by log2Nto include the 1/Nfactor in the definition of the inverse DFT.The function expects to find the real part of the fmax term in the imaginary part of the zero-frequency
X[0]term (storedx[1]).The 2
N-point real sequence is stored in natural order. The zeroth element of the sequence is stored inx[0], the first element inx[1], and the kth element inx[k].
Nmust be a power of 2 and lie between 4 and 1024. An invalidNcauses a RANGE exception.Parameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
The Auxiliary Functions
These functions are in
FFT.LIB.
hanncplx
void hanncplx(int *x, int N, int *blockexp);Description
Convolves an
N-point complex spectrum with the three-point Hann kernel. The filtered spectrum replaces the original spectrum.The function produces the same results as would be obtained by multiplying the corresponding time sequence by the Hann raised-cosine window.
The zero-crossing width of the main lobe produced by the Hann window is 4 DFT bins. The adjacent sidelobes are 32 db below the main lobe. Sidelobes decay at an asymptotic rate of 18 db per octave.
Nmust be a power of 2 and lie between 4 and 1024. An invalidNcauses a RANGE exception.Parameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
hannreal
void hannreal(int *x, int N, int *blockexp);Description
Convolves an
N-point positive-frequency complex spectrum with the three-point Hann kernel. The function produces the same results as would be obtained by multiplying the corresponding time sequence by the Hann raised-cosine window.The zero-crossing width of the main lobe produced by the Hann window is 4 DFT bins. The adjacent sidelobes are 32 db below the main lobe. Sidelobes decay at an asymptotic rate of 18 db per octave.
The imaginary part of the dc term (stored in
x[1]) is considered to be the real part of the fmax term. The dc and fmax spectral components take part in the convolution along with the other spectral components. The real part of fmax component affects the real part of the X[N-1] component (and vice versa), and should not arbitrarily be set to zero unless these components are unimportant.Parameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
Return value
None. The filtered spectrum replaces the original spectrum.
powerspectrum
void powerspectrum(int *x, int N, *int blockexp);Description
Computes the power spectrum from a complex spectrum according to
Power[k] = (Re X[k])2 + (Im X[k])2The
N-point power spectrum replaces theN-point complex spectrum. The power of each complex spectral component is computed as a 32-bit fraction. Its more significant 16-bits replace the imaginary part of the component; its less significant 16-bits replace the real part.If the complex input spectrum is a positive-frequency spectrum computed by
fftreal(),the imaginary part of the X[0] term (storedx[1]) will contain the real part of the fmax term and will affect the calculation of the dc power. If the dc power or the fmax power is important, the fmax term should be retrieved fromx[1]andx[1]set to zero before callingpowerspectrum().The power of the kth term can be retrieved by:
P[k] = *(long*) & x[2k] * 2blockexpThe value of
blockexpis first doubled to reflect the squaring operation applied to all elements in arrayx. Then it is further increased by 1 to reflect an inherent division-by-two that occurs during the squaring operation.Parameters
xPointer to
N-element array of complex fractions.NNumber of complex elements in array
x.blockexpPointer to integer block exponent.
TIMING
The execution time of all functions other than
powerspectrum()depends somewhat on whether and by how much the data must be scaled to avoid arithmetic overflow. The following tables present the measured best case (no scaling) and worst case (maximum scaling) execution times for a 30 MHz Rabbit 2000 running under Dynamic C.
cplxfft cplxfftinv realfft realfftinv ffthanncplx ffthannreal powerspectrum
cplxfft cplxfftinv realfftN/A1
realfftinv ffthanncplx ffthannreal powerspectrum
1
realfftandrealfftinvare currently limited to 512 complex output points.REFERENCES
A.V. Oppenheim and R.W. Shaeffer, Digital Signal Processing, Prentice-Hall, Inc., 1975.
A good primer on digital signal processing with a good discussion of various forms of the FFT.A.V. Oppenheim and R.W. Shaeffer, Discrete-Time Signal Processing, Prentice-Hall, Inc., 1989.
E.O. Brigham, The Fast Fourier Transform, Prentice-Hall, 1974.
Gives a concise derivation of the real-input/complex-output FFT.W.H. Press, B.P. Flannery, S.A. Teukolsky, and W.T. Vetterling, Numerical Recipes in C, Cambridge University Press, 1988. (A second edition was published in 1997.)
Contains C-language examples of the FFT and of the bit-reversal function employed by Rabbit FFT.D.F. Elliot, Ed., Handbook of Digital Signal Processing, Academic Press, 1987.
The appendix to Chapter 3 describes and compares various window functions.
| 022-0048 Rev. C | |
| Z-World http://www.zworld.com Voice: (530) 757-3737 FAX: (530) 757-3792 sales@zworld.com |
Rabbit Semiconductor http://www.rabbitsemiconductor.com Voice: (530) 757-8400 FAX: (530) 757-8402 sales@rabbitsemiconductor.com |