Appendix 1.A Selected List of Books on Image Processing and Computer1.A.1 Selected List of Books on Signal Processing from Year 2000 141.A.2 Selected List of Books on Pattern Recognition
Trang 2IMAGE PROCESSING AND PATTERN
RECOGNITION
Fundamentals and Techniques
FRANK Y SHIH
Trang 4IMAGE PROCESSING AND PATTERN
RECOGNITION
Trang 5445 Hoes LanePiscataway, NJ 08854IEEE Press Editorial BoardLajos Hanzo, Editor in Chief
J Anderson B M Hammerli W Reeve
F Canavero M Lanzerotti T Samad
Kenneth Moore, Director of IEEE Book and Information Services (BIS)
ReviewersTim Newman
Ed Wong
Trang 6IMAGE PROCESSING AND PATTERN
RECOGNITION
Fundamentals and Techniques
FRANK Y SHIH
Trang 7Published by John Wiley & Sons, Inc., Hoboken, New Jersey.
Published simultaneously in Canada.
No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4470, or on the web at www.copyright.com Requests to the
Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/ permissions.
Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts
in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose No warranty may be created or extended by sales representatives or written sales materials The advice and strategies contained herein may not be suitable for your situation You should consult with a professional where appropriate Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.
For general information on our other products and services, or technical support, please contact our Customer Care Department within the United States at (800) 762-2974, outside the United States at (317) 572-3993 or fax (317) 572-4002.
Wiley also publishes its books in a variety of electronic formats Some content that appears in print may not be available in electronic books For more information about Wiley products, visit our web site at www.wiley.com
Library of Congress Cataloging-in-Publication Data is Available
Trang 8Appendix 1.A Selected List of Books on Image Processing and Computer
1.A.1 Selected List of Books on Signal Processing from Year 2000 141.A.2 Selected List of Books on Pattern Recognition from Year 2000 15
Trang 94.4.2 Grayscale Dilation Erosion Duality Theorem 75
4.5.9.1 The Simple Morphological Edge Operators 854.5.9.2 Blur-Minimum Morphological Edge Operator 87
4.6.4 Order-Statistic Soft Morphological (OSSM) Filters 99
4.6.6 Recursive Order-Statistic Soft Morphological Filters 104
5.3.3 The Gravitation External Force Field and The Greedy Algorithm 128
5.5 Edge Linking by Adaptive Mathematical Morphology 137
5.5.2 The Adaptive Morphological Edge-Linking Algorithm 140
Trang 105.6 Automatic Seeded Region Growing 1465.6.1 Overview of the Automatic Seeded Region Growing Algorithm 146
5.7.2 Overview of the TDRD-Based Image Segmentation 159
5.7.3 The Region Dividing and Subregion Examination Strategies 162
5.7.5 Potential Applications in Medical Image Analysis 173
6 DISTANCE TRANSFORMATION AND SHORTEST PATH PLANNING 179
6.2 Distance Transformation by Mathematical Morphology 184
6.4 Decomposition of Distance Structuring Element 1886.4.1 Decomposition of City-Block and Chessboard Distance Structuring
6.4.2 Decomposition of the Euclidean Distance Structuring Element 190
6.6.1 Acquiring Approaches for City-Block and Chessboard Distance
6.6.2 Acquiring Approach for Euclidean Distance Transformation 196
6.7.3 The Complexity of the Two-Scan Algorithm 203
6.8.1 A Problematic Case of Using the Acquiring Approaches 2046.8.2 Dynamically Rotational Mathematical Morphology 2056.8.3 The Algorithm for Shortest Path Planning 206
6.9 Forward and Backward Chain Codes for Motion Planning 209
Trang 117 IMAGE REPRESENTATION AND DESCRIPTION 219
7.4.2.4 The Fully Parallel Thinning Algorithm 2437.4.2.5 Experimental Results and Discussion 243
7.5.1.1 The Skeleton from Distance Function 253
7.5.3 The Skeletonization Algorithm and Connectivity Properties 256
7.6.1 Representation Framework: Formal Languages and Mathematical
Trang 128.6 Linear Discriminate Analysis 2918.7 Feature Reduction in Input and Feature Spaces 293
9.1.1 Pass 1: Cluster’s Mean Vector Establishment 308
9.4.3 Binary Morphological Operations by Logic Modules 3249.4.4 Multilayer Perceptron as Processing Modules 327
9.5.3.2 An Improved ART Model for Pattern Classification 3429.5.3.3 Experimental Results of the Improved Model 344
9.6.1 Role of Fuzzy Geometry in Image Analysis 346
PART II
APPLICATIONS
Trang 1310.2.1.1 Smoothing and Thresholding 37210.2.1.2 Tracing Head and Face Boundaries 374
10.2.2 Finding Facial Features Based on Geometric Face Model 375
10.2.2.2 Geometrical Face Model Based on Gabor Filter 377
10.3.1 Facial Action Coding System and Expression Database 379
10.4 Facial Expression Recognition in JAFFE Database 386
11.1.1 An Improved Two-Step Algorithm for Block Segmentation 399
11.5.2 Statistical Fuzzy Model for Classification 430
Trang 1412.1.3 Private Versus Public 446
12.2.1 Substitution Watermarking in the Spatial Domain 44812.2.2 Additive Watermarking in the Spatial Domain 450
12.3.1 Substitution Watermarking in the Frequency Domain 45212.3.2 Multiplicative Watermarking in the Frequency Domain 45312.3.3 Watermarking Based on Vector Quantization 455
12.4.2 Weakness of the Block-Based Fragile Watermark 45912.4.3 The Hierarchical Block-Based Fragile Watermark 460
12.6.3 The Watermarking in the Frequency Domain 465
13.5.1 Overview of the GA-Based Breaking Methodology 48213.5.2 The GA-Based Breaking Algorithm on SDSS 48513.5.2.1 Generating the Stego Image on the Visual Steganalytic
13.5.2.2 Generating the Stego Image on the IQM-Based
13.5.3 The GA-Based Breaking Algorithm on FDSS 487
Trang 1513.5.4.1 The GA-Based Breaking Algorithm on VSS 48913.5.4.2 The GA-Based Breaking Algorithm on IQM-SDSS 49013.5.4.3 The GA-Based Breaking Algorithm on JFDSS 491
14.1.1 Local Thresholding Based on Median Values 49714.1.2 Global Thresholding with Brightness and Area Normalization 501
14.3.3 Classification of Strong, Medium, and Weak CMEs 526
Trang 16P A R T I
FUNDAMENTALS
Trang 18is referred to its clarity and audibility A thermocouple can convey temperature, and a
pH meter can convey the acidity of a solution
A signal may take a form of time variations or a spatially varying pattern.Mathematically speaking, signals are represented as functions of one or moreindependent variables that can be either continuous or discrete Continuous-timesignals are defined at a continuum of the time variable Discrete-time signals aredefined at discrete instants of time Digital signals are those for which both time andamplitude are discrete The continuous-time and continuous-amplitude signals arecalled analog signals Analog signals that have been converted to digital forms can beprocessed by a computer or other digital devices
Signal processing is the process of extracting information from the signal.Digital signal processing (DSP) is concerned with the representation of signals bysequences of numbers or symbols and processing of these sequences It was initiated
in the seventeenth century and has become an important modern tool in thetremendously diverse fields of science and technology The purpose of such proces-sing is to estimate characteristic parameters of a signal or to transform a signal into aform that is more sensible to human beings DSP includes subfields such as digitalimage processing, video processing, statistical signal processing, signal processingfor communications, biomedical signal processing, audio and speech signal proces-sing, sonar and radar signal processing, sensor array processing, spectral estimation,and so on
Human beings possess a natural signal processing system “Seeing” takes place
in the visual system and “hearing” takes place in the auditory system Human visualsystem (HVS) plays an important role in navigation, identification, verification, gait,gesture, posture, communication, psychological interpretation, and so on Human
Image Processing and Pattern Recognition by Frank Shih
Copyright 2010 the Institute of Electrical and Electronics Engineers, Inc.
3
Trang 19auditory system converts sound waves into nerve impulses, to analyze auditoryevents, remember and recognize sound sources, and perceive acoustic sequences Asthe speed, capability, and economic advantages of modern signal processing devicescontinue to increase, there is simultaneously an increase in efforts aimed at devel-oping sophisticated, real-time automatic systems capable of emulating humanabilities Because of digital revolution, digital signals have been increasingly used.Most household electronic devices are based entirely or almost entirely upon digitalsignals The entire Internet is a network of digital signals, as is modern mobile phonecommunication.
1.1 THE WORLD OF SIGNALS
The world is filled with many kinds of signals; each has its own physical meaning.Sometimes the human body is incapable of receiving a special signal or interpreting(decoding) a signal, so the information that the signal intends to convey cannot becaptured Those signals are not to be said nonsense or insignificant, but converselythey are exactly what people are working very hard to understand The more we learnfrom the world’s signals, the better living environment we can provide Furthermore,some disaster or damage can be avoided if a warning signal can be sensed in advance.For example, it was recorded historically that animals, including rats, snakes, andweasels, deserted the Greek city of Helice in droves just days before a quake
usually behave strangely before earthquake by barking, whining, or showing signs ofnervousness and restlessness
The characteristics of a signal may be one of a broad range of shapes,amplitudes, time durations, and perhaps other physical properties Based on thesampling of time axis, signals can be divided into continuous-time and discrete-timesignals Based on the sampling of time and amplitude axes, signals can be divided intoanalog and digital signals If signals repeat in some period, they are called periodicsignals; otherwise, aperiodic or nonperiodic signals If each value of a signal is fixed
by a mathematical function, it is called a deterministic signal; otherwise, a randomsignal that has uncertainty about its behavior In the category of dimensionality,signals are divided into one-dimensional (1D), two-dimensional (2D), three-dimensional (3D), and multidimensional signals, which are further explained below
1.1.1 One-Dimensional Signals
A 1D signal is usually modeled as an ensemble of time waveforms, for example, xðtÞ
evident in such diverse fields as biomedical engineering, acoustics (Beranek, 2007),sonar (Sun et al., 2004), radar (Gini et al., 2001), seismology (Al-Alaoui, 2001),speech communication, and many others When we use a telephone, our voice isconverted to an electrical signal and through telecommunication systems circulatesaround the Earth The radio signals, which are propagated through free space and byradio receivers, are converted into sound In speech transmission and recognition, one
Trang 20may wish to extract some characteristic parameters of the linguistic messages,representing the temporal and spectral behavior of acoustical speech input Alter-natively, one may wish to remove interference, such as noise, from the signal or tomodify the signal to present it in a form more easily interpreted by an expert.
1.1.2 Two-Dimensional Signals
Signal processing problems are not confined to 1D signals A 2D signal is a function of
the functional behavior in the form of an intensity variation over the (x, y)-plane.Everyday scenes viewed by a human observer can be considered to be composed ofilluminated objects The light energy reflected from these objects can be considered toform a 2D intensity function, which is commonly referred to as an image
As a result of numerous applications, not least as a consequence of cheapcomputer technology, image processing now influences almost all areas of our dailylife: automated acquisition, processing and production of documents, industrialprocess automation, acquisition and automated analysis of medical images, enhance-ment and analysis of aerial photographs for detection of forest fires or crop damage,analysis of satellite weather photos, and enhancement of television transmission fromlunar and deep-space probes
1.1.3 Three-Dimensional Signals
Photographs of a still scene are the images that are functions of the (x, y)-plane Byadding a time variable, the 3D signals represent image sequences of a dynamic scenethat are called video signals Computer analysis of image sequences requires thedevelopment of internal representations for the entities in a depicted scene as well asfor discernible changes in appearance and configuration of such entities Morefundamental approaches result from efforts to improve application-oriented solu-tions Some illustrative examples are given as follows
Image sequences obtained from satellite sensors are routinely analyzed todetect and monitor changes Evaluation of image series recorded throughout thegrowth and harvest periods can result in more reliable cover type mapping as well asimproved estimates of crop field Very important is the determination of clouddisplacement vector fields These are used to estimate wind velocity distributionsthat in turn are employed for weather prediction and meteorological modeling(Desportes et al., 2007)
Biomedical applications are concerned with the study of growth, tion, and transport phenomena Angiocardiography, blood circulation, and studies ofmetabolism are the primary areas of medical interest for the evaluation of temporalimage sequences (Charalampidis et al., 2006) Architects who have to designpedestrian circulation areas would appreciate quantitative data about how pedestrianswalk in halls and corridors Efforts to extract such data from TV-frame sequencescould be considered as behavioral studies They might as well be assigned to aseparate topic such as object tracking (Qu and Schonfeld, 2007), which is of specialconcern in cases of traffic monitoring (Zhou et al., 2007), target tracking, and visual
Trang 21transforma-feedback for automated navigation (Negahdaripour and Xun, 2002; Xu andTso, 1999).
1.1.4 Multidimensional Signals
When a signal is represented in more than one dimension, it is often called amultidimensional signal As discussed in previous sections, an image is a two-dimensional signal, and a video is a three-dimensional signal A multidimensionalsignal is vector valued and may be a function of multiple relevant independentvariables One chooses the variable domain in which to process a signal by making aninformed guess as to which domain best represents the essential characteristics of thesignal Multidimensional signal processing is an innovative field interested indeveloping technology that can capture and analyze information in more thanone dimension Some of its applications include 3D face modeling (Roy-Chowdhury
et al., 2004), 3D object tracking (Wiles et al., 2001), and multidimensional signalfiltering
The need for a generally applicable artificial intelligence approach for optimaldimensionality selection in high-dimensional signal spaces is evident in problems
likely to fail if vision problems are handled by reducing the dimensionality by means
of throwing away almost certain available information in a basically ad hoc manner.Therefore, designing a system capable of learning the relevant information extractionmechanisms is critical
1.2 DIGITAL IMAGE PROCESSING
Images are produced by a variety of physical devices, including still and videocameras, scanners, X-ray devices, electron microscopes, radar, and ultrasound,and are used for a variety of purposes, including entertainment, medical, business,industrial, military, civil, security, and scientific The interests in digital imageprocessing stem from the improvement of pictorial information for humaninterpretation and the processing of scene data for autonomous machineperception
Webster’s Dictionary defines an image as: “An image is a representation,likeness, or imitation of an object or thing, a vivid or graphic description, somethingintroduced to represent something else.” The word “picture” is a restricted type ofimage Webster’s Dictionary defines a picture as: “A representation made by painting,drawing, or photography; a vivid, graphic, accurate description of an object or thing so
as to suggest a mental image or give an accurate idea of the thing itself.” In imageprocessing, the word “picture” is sometimes equivalent to “image.”
Digital image processing starts with one image and produces a modified version
of that image Webster’s Dictionary defines digital as: “The calculation by numericalmethods or discrete units,” defines a digital image as: “A numerical representation of
an object,” defines processing as: “The act of subjecting something to a process,” and
Trang 22defines a process as: “A series of actions or operations leading to a desired result.” Anexample of a process is car wash that changes an automobile from dirty to clean.Digital image analysis is a process that converts a digital image into somethingother than a digital image, such as a set of measurement data or a decision Imagedigitization is a process that converts a pictorial form to numerical data A digitalimage is an image f(x, y) that has been discretized in both spatial coordinates andbrightness (intensity) The image is divided into small regions called picture elements
or pixels (see Fig 1.1)
Image digitization includes image sampling (i.e., digitization of spatial ordinates (x, y)) and gray-level quantization (i.e., brightness amplitude digitization)
co-An image is represented by a rectangular array of integers The image sizes and thenumber of gray levels are usually integer powers of 2 The number at each pixelrepresents the brightness or darkness (generally called the intensity) of the image
Figure 1.1 Image digitization (Courtesy of Gonzalez and Woods, 2008)
Figure 1.2 A digital image and its numerical representation
Trang 23The quality of an image strongly depends upon the number of samples and graylevels; the more are these two, the better would be the quality of an image But, thiswill result in a large amount of storage space as well because the storage space for animage is the product of dimensions of an image and the number of bits required tostore gray levels At lower resolution, an image can result in checkerboard effect or
The visual quality of an image required depends upon its applications Toachieve the highest visual quality and at the same time the lowest memory require-ment, we can perform fine sampling of an image in the neighborhood of sharp gray-level transitions and coarse sampling in the smooth areas of an image This is known assampling based on the characteristics of an image (Damera-Venkata et al., 2000).Another method, known as tapered quantization, can be used for the distribution ofgray levels by computing the occurrence frequency of all allowed levels Quantizationlevel is finely spaced in the regions where gray levels occur frequently, but when graylevels occur rarely in other regions, the quantization level can be coarsely spaced.Images with large amounts of details can sometimes still enjoy a satisfactoryappearance despite possessing a relatively small number of gray levels This can
be seen by examining isopreference curves using a set of subjective tests for images inthe Nk-plane, where N is the number of samples and k is the number of gray levels(Huang, 1965)
In general, image processing operations can be categorized into four types:
1 Pixel operations: The output at a pixel depends only on the input at that pixel,independent of all other pixels in that image Thresholding, a process of makingthe corresponding input pixels above a certain threshold level white and othersblack, is simply a pixel operation Other examples include brightness addition/subtraction, contrast stretching, image inverting, log, and power law
2 Local (neighborhood) operations: The output at a pixel depends on the inputvalues in a neighborhood of that pixel Some examples are edge detection,smoothing filters (e.g., the averaging filter and the median filter), and sharpen-ing filters (e.g., the Laplacian filter and the gradient filter) This operation can beadaptive because results depend on the particular pixel values encountered ineach image region
3 Geometric operations: The output at a pixel depends only on the input levels atsome other pixels defined by geometric transformations Geometric operationsare different from global operations, such that the input is only from somespecific pixels based on geometric transformation They do not require the inputfrom all the pixels to make its transformation
4 Global operations: The output at a pixel depends on all the pixels in an image Itmay be independent of the pixel values in an image, or it may reflect statisticscalculated for all the pixels, but not a local subset of pixels A popular distancetransformation of an image, which assigns to each object pixel the minimumdistance from it to all the background pixels, belongs to a global operation
Trang 24Other examples include histogram equalization/specification, image warping,Hough transform, and connected components.
Nowadays, there is almost no area that is not impacted in some way by digital imageprocessing Its applications include
1 Remote sensing: Images acquired by satellites and other spacecrafts are useful
in tracking Earth’s resources, solar features, geographical mapping (Fig 1.3),and space image applications (Fig 1.4)
2 Image transmission and storage for business: Its applications include cast television, teleconferencing, transmission of facsimile images for officeautomation, communication over computer networks, security monitoringsystems, and military communications
broad-3 Medical processing: Its applications include X-ray, cineangiogram, transaxialtomography, and nuclear magnetic resonance (Fig 1.5) These images may be
Figure 1.3 Remote sensing images for tracking Earth’s climate and resources
Figure 1.4 Space image applications
Trang 25used for patient screening and monitoring or for detection of tumors or otherdiseases in patients.
4 Radar, sonar, and acoustic image processing: For example, the detection andrecognition of various types of targets and the maneuvering of aircraft(Fig 1.6)
5 Robot/machine vision: Its applications include the identification or description
of objects or industrial parts in 3D scenes (Fig 1.7)
Figure 1.5 Medical imaging applications
Figure 1.6 Radar imaging
Trang 261.3 ELEMENTS OF AN IMAGE PROCESSING SYSTEM
Elements of an image processing system include
1 Image acquisition: A physical device that is sensitive to a band in theelectromagnetic energy spectrum can produce an electrical signal output
A digitizer is used for converting the electrical signal output into a digitalform Digital images can be obtained by conversion of the analog images (such
as 35 mm prints, slides, transparencies, or reflective art) into digital images with
a scanner, or else by directly capturing the object or scene into digital forms bymeans of a digital camera or video-capturing device
2 Storage:
(a) Short-term storage for use during processes One of the means of providingshort-term storage is computer memory Another is a specialized board,called a frame buffer
(b) Online storage for relatively fast recall
(c) Archival storage characterized by infrequent access The term “archivalquality” is used to designate materials that are permanent and durable, andtherefore can be safely used for preservation purposes The objective ofarchival storage is the protection against tampering, deletion, viruses, anddisaster
3 Processing: Most image processing functions can be implemented in software,running on a host computer
4 Communication: Cable modem Internet services on average promise higherlevels of bandwidth than DSL Internet services, and this bandwidth roughlytranslates to communication speed Cable Internet theoretically runs faster thanDSL Cable technology can support approximately 30 Mbps (megabits persecond) of bandwidth, whereas most forms of DSL cannot reach 10 Mbps
5 Display: An image display device may take the form of an illuminating picturelamp providing means by which images may be illuminated by a light source on
a selectable and removably attachable basis Monochrome and color monitorsare the principal display devices used Other display media include randomaccess cathode ray tubes (CRTs) and printing devices
Figure 1.7 Robot and machine vision applications
Trang 27To illustrate the systematic procedures of an image processing system, we give anexample of human face identification (Adler and Schuckers, 2007) The problemdomain is the faces of people The objective is to associate the face with the identity ofthe person The output is a person’s unique identifier (e.g., social security number).The necessary procedures to achieve the goal of face identification could include
1 Image acquisition: The face image could be acquired through a high-resolutionstill digital camera and compressed to an image file
2 Preprocessing: The acquired image may be enhanced by improving contrast,sharpness, color, and so on
3 Segmentation: The image may first be cropped to only the facial area Then, theface may be segmented into eyes, mouth, nose, chin, and so on
4 Representation and description: In this step, each of the segmented areas may
be characterized by statistical data, for example, principal components lysis, texture, aspect ratios of eyes and nose, or the color of eyes
ana-5 Matching recognition and interpretation: This step may involve using thecharacteristics derived in the previous step to match each individually seg-mented area based on specific recognition algorithms For example, eyes may
be processed to determine, based on its features, what class of eye it is Then, all
of these interpretations are used to create a composite description of the
“ensemble,” perhaps in the form of a feature vector for the subject
6 Knowledge base: Finally, the feature vector above may be fed to a knowledgebase of all known subjects to associate it with one of the subjects in the database,thus returning perhaps the individual’s social security number or perhaps aconfidence score of the match
APPENDIX 1.A SELECTED LIST OF BOOKS ON IMAGE PROCESSING AND COMPUTER VISION FROM YEAR 2000
Acharya, T and Ray, A K., Image Processing: Principles and Applications, Wiley,Sept 2005
Barner, K E and Arce, G R., Nonlinear Signal and Image Processing: Theory,Methods, and Applications, CRC Press, 2004
Bhanu, B and Pavlidis, I., Computer Vision Beyond the Visible Spectrum, Springer,2004
Bose, T., Digital Signal and Image Processing, Wiley, Nov 2003
Burger, W and Burge, M J., Digital Image Processing: An Algorithmic IntroductionUsing Java, Springer, 2008
Chan, T F and Shen, J., Image Processing and Analysis: Variational, PDE, Wavelet,and Stochastic Methods, SIAM, 2005
Chen, C H., Image Processing for Remote Sensing, CRC Press, 2008
Cyganek, B and Siebert, J P., An Introduction to 3-D Computer Vision Techniquesand Algorithms, Wiley, Feb 2009
Trang 28Dhawan, A., Medical Image Analysis, Wiley, July 2003.
Fisher, R B., Dictionary of Computer Vision and Image Processing, Wiley, 2005.Forsyth, D A and Ponce, J., Computer Vision: A Modern Approach, Prentice Hall,2003
Gonzalez, R C and Woods, R E., Digital Image Processing, 3rd edition, PrenticeHall, 2008
Gonzalez, R C., Woods, R E., and Eddins, S L., Digital Image Processing UsingMATLAB, Prentice Hall, 2003
Hartley, R and Zisserman, A., Multiple View Geometry in Computer Vision,Cambridge University Press, 2003
Hornberg, A., Handbook of Machine Vision, Wiley, 2006
J€ahne, B., Digital Image Processing, Springer, 2005
Koschan, A and Abidi M., Digital Color Image Processing, Wiley, Apr 2008.Kulkarni, A D., Computer Vision and Fuzzy-Neural Systems, Prentice Hall, 2001.Lukac, R and Plataniotis, K N., Color Image Processing: Methods and Applications,CRC Press, 2007
Maıˆtre, H., Image Processing, Wiley, Aug 2008
Medioni, G and Kang, S B., Emerging Topics in Computer Vision, Prentice Hall,2005
McAndrew, A., Introduction to Digital Image Processing with Matlab, ThomsonCourse Technology, 2004
Mitra, S K and Sicuranza, G L., Nonlinear Image Processing, Academic Press,2001
Morris, T., Computer Vision and Image Processing, Palgrave Macmillan, 2003.Nixon, M S and Aguado, A S., Feature Extraction and Image Processing,Academic Press, 2008
Paragios, N., Chen, Y., and Faugeras, O., Handbook of Mathematical Models inComputer Vision, Springer, 2006
Petrou, M and Sevilla, P G., Image Processing: Dealing With Texture, Wiley, Mar.2006
Pitas, I., Digital Image Processing Algorithms and Applications, Wiley–IEEE, 2000.Pratt, W K., Digital Image Processing: PIKS Scientific Inside, 4th edition, Wiley, 2007.Ritter, G X and Wilson, J N., Handbook of Computer Vision Algorithms in ImageAlgebra, CRC Press, 2001
Russ, J C., The Image Processing Handbook, CRC Press, 2006
Sebe, N., Machine Learning in Computer Vision, Springer, 2005
Shapiro, L G and Stockman, G C., Computer Vision, Prentice Hall, 2001.Shih, F.Y., Image Processing and Mathematical Morphology: Fundamentals andApplications, CRC Press, 2009
Snyder, W E and Qi, H., Machine Vision, Cambridge University Press, 2004.Sonka, M., Hlavac, V., and Boyle, R., Image Processing, Analysis, and MachineVision, Thomson Wadsworth, 2007
Trang 29Tuzlukov, V P., Signal and Image Processing in Navigational Systems, CRC Press,2005.
Umbaugh, S E., Computer Imaging: Digital Image Analysis and Processing, CRCPress, 2005
Whelan, P F and Molloy, D., Machine Vision Algorithms in Java: Techniques andImplementation, Springer, 2001
Woods, J W., Multidimensional Signal, Image, and Video Processing and Coding,Academic Press, 2006
Zuech, N., Understanding and Applying Machine Vision, CRC Press, 2000
1.A.1 Selected List of Books on Signal Processing from Year 2000Blackledge, J M., Digital Signal Processing: Mathematical and ComputationalMethods, Software Development, and Applications, Horwood Publishing, 2006.Blanchet, G and Charbit, M., Digital Signal and Image Processing Using MATLAB,Wiley, May 2006
Garello, R., Two-Dimensional Signal Analysis, Wiley, Apr 2008
Gaydecki, P., Foundations of Digital Signal Processing: Theory, Algorithms, andHardware Design, IET, 2004
Gray, R M and Davisson, L D., An Introduction to Statistical Signal Processing,Cambridge University Press, 2004
Ifeachor, E C and Jervis, B W., Digital Signal Processing: A Practical Approach,Prentice Hall, 2002
Ingle, V K and Proakis, J G., Digital Signal Processing Using MATLAB, Brooks/Cole, 2000
Khan, A A., Digital Signal Processing Fundamentals, Da Vinci Engineering Press,2005
Mitra, S K., Digital Signal Processing: A Computer-Based Approach, McGraw-Hill,2001
Narasimhan, S V and Veena, S., Signal Processing: Principles and Implementation,Alpha Science Int’l Ltd., 2005
Proakis, J G and Manolakis, D G., Digital Signal Processing, Pearson Prentice Hall,2007
Quinquis, A., Digital Signal Processing Using Matlab, Wiley, Apr 2008
Rockmore, D N and Healy, D M., Modern Signal Processing, Cambridge UniversityPress, 2004
Sundararajan, D., Digital Signal Processing: Theory and Practice, World Scientific,2003
Wang, B.-C., Digital Signal Processing Techniques and Applications in RadarImage Processing, Wiley, Aug 2008
Telecommunications and Multimedia, Springer, 2005
Trang 301.A.2 Selected List of Books on Pattern Recognition from Year 2000Abe, S., Support Vector Machines for Pattern Classification, Springer, 2005.Bhagat, P., Pattern Recognition in Industry, Elsevier, 2005.
Bishop, C M., Pattern Recognition and Machine Learning, Springer, 2006.Bunke, H., Kandel, A., and Last, M., Applied Pattern Recognition, Springer, 2008Chen, D and Cheng, X., Pattern Recognition and String Matching, Springer, 2002.Corrochano, E B., Handbook of Geometric Computing: Applications in PatternRecognition, Computer Vision, Neuralcomputing, and Robotics, Springer, 2005.Duda, R O., Hart, P E., and Stork, D G., Pattern Classification, 2nd edition, Wiley,2003
Dunne, R A., A Statistical Approach to Neural Networks for Pattern Recognition,Wiley, July 2007
Gibson, W., Pattern Recognition, Penguin Group, 2005
Hand, D J., Mannila, H., and Smyth, P., Principles of Data Mining, MIT Press, Aug.2001
Hastie, T., Tibshirani, R., and Fridman, J., The Elements of Statistical Learning: DataMining, Inference, and Prediction, Springer, 2001
Raudys, S., Statistical and Neural Classifiers, Springer, 2001
Schlesinger, M I and Hlavac, V., Ten Lectures on Statistical and Structural PatternRecognition, Kluwer Academic Publishers, 2002
Suykens, A K., Horvath, G., Basu, S., Micchelli, C., and Vandewalle, J., Advances inLearning Theory: Methods, Models and Applications, NATO Science Series III:Computer & Systems Sciences, vol 190, IOS Press, Amsterdam, 2003
Theodoridis, S and Koutroumbas, K., Pattern Recognition, Academic Press, 2003.Webb, A R., Statistical Pattern Recognition, 2nd edition, Wiley, 2002
Trang 31Charalampidis, D., Pascotto, M., Kerut, E K., and Lindner, J R., “Anatomy and flow in normal and ischemic microvasculature based on a novel temporal fractal dimension analysis algorithm using contrast enhanced ultrasound,” IEEE Trans Med Imaging, vol 25, no 8, pp 1079–1086, Aug 2006 Damera-Venkata, N., Kite, T D., Geisler, W S., Evans, B L., and Bovik, A C., “Image quality assessment based on a degradation model,” IEEE Trans Image Process., vol 9, no 4, pp 636–650, Apr 2000 Desportes, C., Obligis, E., and Eymard, L., “On the wet tropospheric correction for altimetry in coastal regions,” IEEE Trans Geosci Remote Sens., vol 45, no 7, pp 2139–2149, July 2007.
Gini, F., Farina, A., and Greco, M., “Selected list of references on radar signal processing,” IEEE Trans Aerospace Electron Syst., vol 37, no 1, pp 329–359, Jan 2001.
Gonzalez, R C and Woods, R E., Digital Image Processing, 3rd edition, Prentice Hall, 2008 Huang, T S., “PCM picture transmission,” IEEE Spectrum, vol 2, no 12, pp 57–63, Dec 1965 Negahdaripour, S and Xun X., “Mosaic-based positioning and improved motion-estimation methods for automatic navigation of submersible vehicles,” IEEE J Oceanic Eng., vol 27, no 1, pp 79–99, Jan 2002.
Qu, W and Schonfeld, D., “Real-time decentralized articulated motion analysis and object tracking from videos,” IEEE Trans Image Process., vol 16, no 8, pp 2129–2138, Aug 2007.
Roy-Chowdhury, A K., Chellappa, R., and Keaton, T., “Wide baseline image registration with application
to 3-D face modeling,” IEEE Trans Multimedia, vol 6, no 3, pp 423–434, June 2004.
Sun, Y., Willett, P., and Lynch, R., “Waveform fusion in sonar signal processing,” IEEE Trans Aerospace Electron Syst., vol 40, no 2, pp 462–477, Apr 2004.
Wiles, C S., Maki, A., and Matsuda, N., “Hyperpatches for 3-D model acquisition and tracking,” IEEE Trans Pattern Anal Mach Intell., vol 23, no 12, pp 1391–1403, Dec 2001.
Xu, W L and Tso, S K., “Sensor-based fuzzy reactive navigation of a mobile robot through local target switching,” IEEE Trans Syst Man Cybernet C, vol 29, no 3, pp 451–459, Aug 1999.
Zhou, J., Gao, D., and Zhang, D., “Moving vehicle detection for automatic traffic monitoring,” IEEE Trans Veh Technol., vol 56, no 1, pp 51–59, Jan 2007.
Trang 32C H A P T E R 2
MATHEMATICAL PRELIMINARIES
Most images are recorded and processed in the time domain or spatial domain Thespatial domain refers to the aggregate of pixels composing an image, and the spatial-domain processing involves operations that apply directly on these pixels However, it
is sometimes convenient and efficient to process images in the frequency domainbecause edge pixels generally correspond to high-frequency components and interiorpixels of an object region correspond to low-frequency components In this chapter, themathematical preliminaries that are often used in image processing for converting animage from spatial domain to frequency domain are introduced These include Laplacetransform, Fourier transform, Z-transform, cosine transform, and wavelet transform
s þ j1, where s is any fixed real number for which s ¼ s is a point in the region ofabsolute convergence of X(s) Equation (2.1) is called the one-sided Laplace transform,
Image Processing and Pattern Recognition by Frank Shih
Copyright 2010 the Institute of Electrical and Electronics Engineers, Inc.
17
Trang 33which depends only on the values of the signal x(t) for t 0 and must satisfy thefollowing constraint:
not have a Laplace transform
transform of u(t) is given by
L e½ ¼ct 1
sc
written as the sum of two exponential functions:
sinðo0tÞ ¼ejo0tejo0t
2j
Trang 34The Laplace transform of sinðo0tÞ is given by
2.1.1 Properties of Laplace Transform
Laplace transform satisfies a number of properties that are useful in the transformationbetween frequency domain and time domain (Schiff, 1999; Graf, 2004) By usingthese properties, it is helpful in deriving many new transformation pairs from a basicset of pairs Therefore, x(t) and X(s) are called the Laplace transform pair A number
of often-used transform pairs are given as follows:
s2þ o2
Several fundamental properties of Laplace transform are given below
1 Linearity: axðtÞ þ byðtÞ $ aXðsÞ þ bYðsÞ
Except time shift, one can also perform frequency shift, which corresponds
to the multiplication by an exponential in time domain, as follows:
Trang 354 Time scale: xðctÞ $1
sc
Laplace transform of second-order or high-order derivatives of x(t) can also beexpressed in terms of X(s) and the initial conditions Let n be an arbitrarypositive integer
9 Convolution: Given two functions x(t) and h(t) whose values are equal to zero
frequency domain by Laplace transform Conversely, the convolution infrequency domain corresponds to a product in time domain
xðtÞ hðtÞ $ XðsÞHðsÞxðtÞhðtÞ $ XðsÞ HðsÞThe convolution in the 2D case can be represented as
respectively A simple change of variables produces
Trang 36Another operation similar to convolution, called correlation, can be ented as
where f*is the complex conjugate of f In image processing, the difference of correlation
as compared to convolution is that the function h is not folded about the origin
((
Answer: The convolution is calculated as follows It can be separated into four cases, as
Trang 37Determine the convolution and correlation of f by h Use the lower-left pixel of h (i.e.,the underlined pixel) as its origin and deal with the pixels outside the boundary ofimage f to be zeros.
and then overlaying the result The flipped h is
1 Separation of real and imaginary numbers:
xðtÞ ¼ Re½xðtÞ þ j Im½xðtÞ $ XðsÞ ¼ Re½XðsÞ þ j Im½XðsÞ
Trang 382.2 FOURIER TRANSFORM
In the early 1800s, French mathematician Joseph Fourier, with his studies of theproblem of heat flow, introduced Fourier series for the representation of continuous-time periodic signals He made a serious attempt to prove that any piecewise smoothfunction can be expressed into a trigonometric sum Fourier’s theory has significantapplications in signal processing The frequency spectrum of a signal can be generated
by representing the signal as a sum of sinusoids (or complex exponentials), called aFourier series This weighted sum represents the frequency content of a signal calledspectrum
When a signal becomes nonperiodic, its period becomes infinite and itsspectrum becomes continuous An image is considered as a spatially varying function.The Fourier transform decomposes such an image function into a set of orthogonalfunctions and converts the spatial intensity image into its frequency domain From thecontinuous form, one can digitize it for discrete-time images Signal processing inthe frequency domain simplifies computational complexity in filtering analysis; thus,the Fourier transform has played the leading role in signal processing and engineeringcontrol for a long time (Sneddon, 1995; Bracewell, 2000)
Let T be a fixed positive real number A continuous-time signal x(t) is said to beperiodic with a period T if
xðtÞ ¼ xðt þ TÞ for all t
That is, a periodic signal repeats every T seconds, the frequency of the signal is
parameter is often used According to Fourier’s theorem, x(t) can be expressed as
a linear sum of complex exponentials:
The trigonometric identity is applied as
A cosðnotÞB sinðnotÞ ¼pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiA2þ B2
Trang 39Therefore, the following is obtained:
n¼1
Equations (2.14) and (2.15) are called the Fourier transform pair and can be shown to
conditions are almost satisfied in practice The Fourier transform is generallycomplex:
XðoÞ
j j ¼ ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
R2ðoÞ þ I2ðoÞp
and
XðoÞ
Trang 40Example 2.8 Derive the Fourier transform of f(x) that is defined in the followingfigure:
35dx
35
respectively, and a and b are two arbitrary constants, then