Matching pursuit (MP) is a sparse approximation algorithm which involves finding the "best matching" projections of multidimensional data onto the span of an over-complete (i.e., redundant) dictionary $D$. The basic idea is to approximately represent a signal $f$ from a Hilbert space $H$ as a weighted sum of finitely many functions $g_{\gamma_n}$ (called atoms) taken from $D$. An approximation with $N$ atoms has the form
$$f(t) \approx \hat{f}_N(t) := \sum_{n=1}^{N} a_n g_{\gamma_n}(t)$$
where $a_n$ is the scalar weighting factor (amplitude) for the atom $g_{\gamma_n} \in D$. Normally, not every atom in $D$ will be used in this sum. Instead, matching pursuit chooses the atoms one at a time in order to maximally (greedily) reduce the approximation error. This is achieved by finding the atom that has the biggest inner product with the signal (assuming the atoms are normalized), subtracting from the signal an approximation that uses only that one atom, and repeating the process until the signal is satisfactorily decomposed, i.e., the norm of the residual is small, where the residual after calculating $\gamma_N$ and $a_N$ is denoted by
$$R_{N+1} = f - \hat{f}_N.$$
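The reason the largest inner product gives the greatest error reduction can be made explicit. The short derivation below is a sketch that assumes a unit-norm atom $g$ and writes $R_n$ for the residual before the $n$th atom is chosen (so that $R_1 = f$); the symbols are introduced here for illustration.

```latex
\begin{aligned}
R_{n+1} &= R_n - \langle R_n, g \rangle\, g, \qquad \|g\| = 1, \\
\|R_{n+1}\|^2 &= \|R_n\|^2 - 2\,|\langle R_n, g \rangle|^2 + |\langle R_n, g \rangle|^2\, \|g\|^2 \\
              &= \|R_n\|^2 - |\langle R_n, g \rangle|^2 .
\end{aligned}
```

The squared error therefore drops by exactly $|\langle R_n, g \rangle|^2$ at each step, so picking the atom with the largest absolute inner product is the locally best choice.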
If $R_n$ converges quickly to zero, then only a few atoms are needed to get a good approximation to $f$. Such sparse representations are desirable for signal coding and compression. More precisely, the sparsity problem that matching pursuit is intended to approximately solve is
$$\min_x \| f - D x \|_2^2 \quad \text{subject to} \quad \|x\|_0 \le N,$$
with $\|x\|_0$ the $L_0$ pseudo-norm (i.e. the number of nonzero elements of $x$). In the previous notation, the nonzero entries of $x$ are $x_{\gamma_n} = a_n$, and the $\gamma_n$th column of the matrix $D$ is $g_{\gamma_n}$. Solving the sparsity problem exactly is NP-hard, which is why approximation methods like MP are used.
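To illustrate why the exact problem becomes expensive, the following sketch solves it by brute force for a tiny dictionary: it tries every support of size $N$ and fits the coefficients by least squares. The function name `best_subset`, the random dictionary and the test signal are assumptions made for illustration, not part of any reference implementation.

```python
from itertools import combinations

import numpy as np


def best_subset(f, D, N):
    """Exhaustively minimise ||f - D x||_2 over all x with at most N nonzeros."""
    n_atoms = D.shape[1]
    best_err, best_x = np.inf, None
    # Supports of size exactly N suffice: a larger support never fits worse.
    for support in combinations(range(n_atoms), N):
        cols = list(support)
        # Least-squares fit restricted to the chosen columns.
        coef, *_ = np.linalg.lstsq(D[:, cols], f, rcond=None)
        err = np.linalg.norm(f - D[:, cols] @ coef)
        if err < best_err:
            best_x = np.zeros(n_atoms)
            best_x[cols] = coef
            best_err = err
    return best_x, best_err


rng = np.random.default_rng(0)
D = rng.standard_normal((8, 12))
D /= np.linalg.norm(D, axis=0)        # normalise atoms to unit norm
f = 2.0 * D[:, 3] - 1.5 * D[:, 7]     # signal built from two atoms
x, err = best_subset(f, D, N=2)
```

The number of candidate supports grows combinatorially with the size of the dictionary, so this approach is only feasible for toy problems.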
For comparison, consider the Fourier series representation of a signal: this can be described in the terms given above, where the dictionary is built from sinusoidal basis functions (the smallest possible complete dictionary). The main disadvantage of Fourier analysis in signal processing is that it extracts only the global features of a signal and does not adapt to the analysed signal $f$. By taking an extremely redundant dictionary, we can look in it for atoms (functions) that best match a signal $f$.
The algorithm
If $D$ contains a large number of vectors, searching for the most sparse representation of $f$ is computationally unacceptable for practical applications. In 1993, Mallat and Zhang proposed a greedy solution that they named "Matching Pursuit." For any signal $f$ and any dictionary $D$, the algorithm iteratively generates a sorted list of atom indices and weighting scalars, which form a sub-optimal solution to the problem of sparse signal representation.
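The greedy iteration can be sketched in a few lines of NumPy. This is a minimal illustration, assuming a real-valued finite-dimensional signal and a dictionary stored as a matrix with unit-norm columns; the function name `matching_pursuit` and the parameters `max_atoms` and `tol` are hypothetical choices, not taken from the original paper.

```python
import numpy as np


def matching_pursuit(f, D, max_atoms=10, tol=1e-6):
    """Greedy matching pursuit; D must have unit-norm columns (atoms)."""
    residual = np.asarray(f, dtype=float).copy()
    indices, coeffs = [], []
    for _ in range(max_atoms):
        # Stop once the residual is small enough.
        if np.linalg.norm(residual) < tol:
            break
        inner = D.T @ residual                 # inner product with every atom
        k = int(np.argmax(np.abs(inner)))      # best-matching atom
        a = inner[k]                           # its amplitude
        residual = residual - a * D[:, k]      # remove its contribution
        indices.append(k)
        coeffs.append(a)
    return indices, coeffs, residual


rng = np.random.default_rng(0)
D = rng.standard_normal((16, 40))
D /= np.linalg.norm(D, axis=0)   # normalise atoms to unit norm
f = 3.0 * D[:, 5]                # signal made of a single atom
indices, coeffs, residual = matching_pursuit(f, D)
```

In this toy case the signal is a multiple of one atom, so the first iteration should select that atom and leave a residual that is zero up to rounding. Each iteration costs one matrix-vector product with the whole dictionary.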
The concept of matching pursuit in signal processing is related to statistical projection pursuit, in which "interesting" projections are sought; projections that deviate more from a normal distribution are considered more interesting.
Properties
- The algorithm converges (i.e. $R_n \to 0$) for any $f$ that is in the space spanned by the dictionary.
- The error $\|R_n\|$ decreases monotonically.
- As at each step the residual is orthogonal to the selected atom, the energy conservation equation is satisfied for each $N$:
  - $\|f\|^2 = \|R_{N+1}\|^2 + \sum_{n=1}^{N} |a_n|^2$.
- In the case that the vectors in $D$ are orthonormal, rather than being redundant, MP is a form of principal component analysis.
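The monotone decrease of the error and the energy conservation equation can be checked numerically. The sketch below assumes a random real dictionary with unit-norm columns and a random signal; the variable names `energy_gap` and `monotone` are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(1)
D = rng.standard_normal((20, 60))
D /= np.linalg.norm(D, axis=0)   # normalise atoms to unit norm
f = rng.standard_normal(20)

residual = f.copy()
coeffs = []
norms = [np.linalg.norm(residual)]   # residual norm before each step
for _ in range(15):
    inner = D.T @ residual
    k = int(np.argmax(np.abs(inner)))
    residual = residual - inner[k] * D[:, k]
    coeffs.append(inner[k])
    norms.append(np.linalg.norm(residual))

# ||f||^2 should equal ||R_{N+1}||^2 + sum_n |a_n|^2 up to rounding.
energy_gap = abs(f @ f - (residual @ residual + np.sum(np.square(coeffs))))
# Residual norms should never increase (small slack for rounding).
monotone = all(b <= a + 1e-12 for a, b in zip(norms, norms[1:]))
```

Both checks rely on the atoms being normalised; with unnormalised atoms the amplitude would have to be divided by the squared norm of the selected atom.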
Applications
Matching pursuit has been applied to signal, image and video coding, shape representation and recognition, 3D object coding, and in interdisciplinary applications like structural health monitoring. It has been shown that it performs better than DCT-based coding for low bit rates in both efficiency of coding and quality of image. The main problem with matching pursuit is the computational complexity of the encoder. In the basic version of the algorithm, the large dictionary has to be searched at each iteration. Improvements include the use of approximate dictionary representations and suboptimal ways of choosing the best match at each iteration (atom extraction). The matching pursuit algorithm is used in MP/SOFT, a method of simulating quantum dynamics.
MP is also used in dictionary learning. In this setting, atoms are learned from a database (typically of natural scenes, such as ordinary images) rather than chosen from generic dictionaries.
Extensions
A popular extension of Matching Pursuit (MP) is its orthogonal version: Orthogonal Matching Pursuit (OMP). The main difference from MP is that after every step, all the coefficients extracted so far are updated, by computing the orthogonal projection of the signal onto the set of atoms selected so far. This can lead to better results than standard MP, but requires more computation.
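The difference from plain MP can be sketched as follows: after each new atom is selected, all coefficients are refit by least squares on the atoms chosen so far. This is an illustrative sketch assuming a real-valued signal and unit-norm atoms; the function name `orthogonal_matching_pursuit` and its parameters are hypothetical, and production implementations typically update a factorization incrementally instead of calling a full solver each time.

```python
import numpy as np


def orthogonal_matching_pursuit(f, D, max_atoms=10, tol=1e-6):
    """OMP sketch: greedy selection plus a least-squares refit at every step."""
    f = np.asarray(f, dtype=float)
    residual = f.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(max_atoms):
        if np.linalg.norm(residual) < tol:
            break
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break   # no new atom helps (numerical stall)
        support.append(k)
        # Orthogonal projection of f onto the span of the selected atoms.
        coef, *_ = np.linalg.lstsq(D[:, support], f, rcond=None)
        residual = f - D[:, support] @ coef
    return support, coef, residual


rng = np.random.default_rng(2)
D = rng.standard_normal((6, 20))
D /= np.linalg.norm(D, axis=0)   # normalise atoms to unit norm
f = rng.standard_normal(6)
support, coef, residual = orthogonal_matching_pursuit(f, D, max_atoms=6)
```

Because of the projection, the residual is orthogonal to every selected atom, so no atom is chosen twice. The extra cost is the least-squares solve, which grows with the size of the support.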
Extensions such as Multichannel MP and Multichannel OMP allow one to process multicomponent signals. An obvious extension of Matching Pursuit is over multiple positions and scales, by augmenting the dictionary to be that of a wavelet basis. This can be done efficiently using the convolution operator without changing the core algorithm.
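A minimal sketch of the multi-position, multi-scale idea: the inner products of the residual with every shift of a kernel are computed in one call to `np.correlate`, and the selection step is otherwise unchanged. The function name `shift_invariant_mp` is hypothetical, and the Hann windows of two lengths are a stand-in for wavelets at two scales, chosen here only for illustration.

```python
import numpy as np


def shift_invariant_mp(signal, kernels, n_iter=5):
    """MP over every shift of each kernel; kernels are unit-norm 1-D arrays."""
    residual = np.asarray(signal, dtype=float).copy()
    picks = []   # entries are (kernel index, shift, amplitude)
    for _ in range(n_iter):
        best = None
        for j, g in enumerate(kernels):
            # corr[s] = <residual[s:s+len(g)], g> for every valid shift s
            corr = np.correlate(residual, g, mode="valid")
            s = int(np.argmax(np.abs(corr)))
            if best is None or abs(corr[s]) > abs(best[2]):
                best = (j, s, corr[s])
        j, s, a = best
        # Subtract the chosen shifted kernel from the residual.
        residual[s:s + len(kernels[j])] -= a * kernels[j]
        picks.append(best)
    return picks, residual


# Two "scales": Hann windows of length 8 and 16, normalised to unit norm.
kernels = [np.hanning(8), np.hanning(16)]
kernels = [g / np.linalg.norm(g) for g in kernels]

signal = np.zeros(64)
signal[20:28] += 2.0 * kernels[0]   # short kernel placed at shift 20
picks, residual = shift_invariant_mp(signal, kernels, n_iter=1)
```

For long signals the correlations are usually computed with FFT-based convolution, which is what makes the augmented dictionary practical.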
Matching pursuit is related to the field of compressed sensing and has been extended by researchers in that community. Notable extensions are Orthogonal Matching Pursuit (OMP), Stagewise OMP (StOMP), compressive sampling matching pursuit (CoSaMP), Generalized OMP (gOMP), and Multipath Matching Pursuit (MMP).
See also
- CLEAN algorithm
- Principal component analysis (PCA)
- Projection pursuit
- Image processing
- Signal processing
- Sparse approximation
References
Source of article: Wikipedia