In this section there is a detailed explanation of each
step.
3.2.1 Discrete Wavelet
Transform
The foundations of the DWT go back to 1976 when Croiser, Esteban, and
Galand devised a technique to decompose discrete time signals. Crochiere, Weber,
and Flanagan did a similar work on coding of speech signals in the same year.
They named their analysis scheme as subband coding. In 1983, Burt defined a
technique very similar to subband coding and named it pyramidal coding which is
also known as multiresolution analysis. Later in 1989, Vetterli and Le Gall made
some improvements to the subband coding scheme, removing the existing redundancy
in the pyramidal coding scheme. By all these studies, today subband coding has
come very popular[4].
3.2.1.1 The Subband Coding and the
Multiresolution Analysis
The Subband Coding and The Multiresolution Analysis, firstly, has
come with continues wavelet transform (CWT). CWT is not convenient in digital
environment. So discrete wavelet transform (DWT) was developed.
A time-scale representation of a digital signal is obtained using
digital filtering techniques. The continuous wavelet transform was computed by
changing the scale of the analysis window, shifting the window in time,
multiplying by the signal, and integrating over all times. In the discrete case,
filters of different cutoff frequencies are used to analyze the signal at
different scales. The signal is passed through a series of high pass filters to
analyze the high frequencies, and it is passed through a series of low pass
filters to analyze the low frequencies.
The resolution of the signal, which is a measure of the amount of
detail information in the signal, is changed by the filtering operations, and
the scale is changed by up-sampling and down-sampling (sub-sampling) operations.
Sub-sampling a signal corresponds to reducing the sampling rate, or removing
some of the samples of the signal. Up-sampling a signal corresponds to
increasing the sampling rate of a signal by adding new samples to the
signal.
Although it is not the only possible choice, DWT coefficients are
usually sampled from the CWT on a dyadic grid, i.e., s0 = 2 and
τ0 = 1, yielding s=2j and t =k*2j. Since the
signal is a discrete time function, the sequence will be denoted by x[n], where
n is an integer.
The procedure starts with passing this signal (sequence) through a
half band digital lowpass filter with impulse response h[n]. Filtering a signal
corresponds to the mathematical operation of convolution of the signal with the
impulse response of the filter. The convolution operation in discrete time is
defined as follows:
(3)
A half band low-pass filter removes all frequencies that are above
half of the highest frequency in the signal.
The unit of frequency is of particular importance at this time. In
discrete signals, frequency is expressed in terms of radians. Accordingly, the
sampling frequency of the signal is equal to 2π radians in terms of radial
frequency. Therefore, the highest frequency component that exists in a signal
will be π radians, if the signal is sampled at Nyquist’s rate (which is twice
the maximum frequency that exists in the signal); that is, the Nyquist’s rate
corresponds to π rad/s in the discrete frequency domain.
After passing the signal through a half band low-pass filter, half of
the samples can be eliminated according to the Nyquist’s rule, since the signal
now has a highest frequency of π/2 radians instead of π radians. Simply
discarding every other sample will sub-sample the signal by two, and the signal
will then have half the number of points. The scale of the signal is now
doubled. The low-pass filtering removes the high frequency information, but
leaves the scale unchanged. Only the sub-sampling process changes the scale.
Resolution, on the other hand, is related to the amount of information in the
signal, and therefore, it is affected by the filtering operations. Half band
low-pass filtering removes half of the frequencies, which can be interpreted as
losing half of the information. Therefore, the resolution is halved after the
filtering operation. However the sub-sampling operation after filtering does not
affect the resolution, since removing half of the spectral components from the
signal makes half the number of samples redundant anyway. Half the samples can
be discarded without any loss of information.
This procedure can mathematically be expressed as
(4)
The DWT analyzes the signal at different frequency bands with
different resolutions by decomposing the signal into a coarse approximation and
detail information. DWT employs two sets of functions, called scaling functions
and wavelet functions, which are associated with low pass and high-pass filters,
respectively. The decomposition of the signal into different frequency bands is
simply obtained by successive high-pass and low-pass filtering of the time
domain signal. The original signal x[n] is first passed through a half-band
high-pass filter g[n] and a low-pass filter h[n]. After the filtering, half of
the samples can be eliminated according to the Nyquist’s rule, since the signal
now has a highest frequency of π/2 radians instead of π. The signal can
therefore be sub-sampled by 2, simply by discarding every other sample. This
constitutes one level of decomposition and can mathematically be expressed as
follows:
(5)
where yhigh[k] and ylow[k] are the outputs of the high-pass
and low-pass filters, respectively, after sub-sampling by 2.
This decomposition halves the time resolution since only half the
number of samples characterizes the entire signal. However, this operation
doubles the frequency resolution, since the frequency band of the signal spans
only half the previous frequency band, effectively reducing the uncertainty in
the frequency by half. The above procedure, which is also known as the subband
coding, can be repeated for further decomposition. At every level, the filtering
and sub-sampling will result in half the number of samples (and hence half the
time resolution) and half the frequency band spanned (and hence doubles the
frequency resolution). Figure 3 illustrates this procedure, where x[n] is the
original signal to be decomposed, and h[n] and g[n] are low-pass and high-pass
filters, respectively. The bandwidth of the signal at every level is marked on
the figure as "f".
The frequencies that are most prominent in the original signal will
appear as high amplitudes in that region of the DWT signal that includes those
particular frequencies. The difference of this transform from the Fourier
transform is that the time localization of these frequencies will not be lost.
However, the time localization will have a resolution that depends on which
level they appear. If the main information of the signal lies in the high
frequencies, as happens most often, the time localization of these frequencies
will be more precise, since they are characterized by more number of samples. If
the main information lies only at very low frequencies, the time localization
will not be very precise, since few samples are used to express signal at these
frequencies. This procedure in effect offers a good time resolution at high
frequencies, and good frequency resolution at low frequencies. Most practical
signals encountered are of this type.

Figure 3. Wavelet transform procedure.
The frequency bands that are not very prominent in the original
signal will have very low amplitudes, and that part of the DWT signal can be
discarded without any major loss of information, allowing data reduction.
3.2.1.2 Inverse DWT
(Reconstruction)
The reconstruction of the signal is very easy since half-band filters
form orthonormal bases. The procedure on Figure 2. is followed in reverse order
for the reconstruction. The signals at every level are up-sampled by two, passed
through the synthesis filters g’[n], and h’[n] (high-pass and low-pass,
respectively), and then added. The interesting point here is that the analysis
and synthesis filters are identical to each other, except for a time reversal.
Therefore, the reconstruction formula becomes (for each layer)
(6)
However, if the filters are not ideal half-band, then perfect
reconstruction cannot be achieved. Although it is not possible to realize ideal
filters, under certain conditions it is possible to find filters that provide
perfect reconstruction. The most famous ones are the ones developed by Ingrid
Daubechies, and they are known as Daubechies’ wavelets.
Due to successive sub-sampling by 2, the signal length must be a
power of 2, or at least a multiple of power of 2, in order this scheme to be
efficient. The length of the signal determines the number of levels that the
signal can be decomposed to.
3.2.1.3 DWT on Images
One dimensional DWT transform is for audio signals or any other
signals. It can not be implemented directly to image but using one dimensional
DWT it is possible to transform an image to wavelet domain.
Two dimensional DWT is simple as one dimensional DWT. As seen in
Figure 4.a firstly each column is transformed independently to wavelet domain.
Using results of column DWT, all rows are transformed respectively.

(a)

(b)
Figure 4. 2D Discrete Wavelet Transform
Transforming the image into wavelet domain makes image to appear in
blocks. Figure 5 shows a transformed image for level 3. LL is the lowest
frequency band and HH1, HH2 and HH3 are the
highest frequency bands in corresponding levels indicating the different
characteristics of image in hierarchical order.

Figure 5. Wavelet Transformed image for level
3.
|