1. Trang chủ
  2. » Công Nghệ Thông Tin

frank y. shih - image processing and pattern recognition

552 2,7K 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Tiêu đề Image Processing and Pattern Recognition
Tác giả Frank Y. Shih
Chuyên ngành Image Processing and Pattern Recognition
Thể loại Book
Thành phố Piscataway
Định dạng
Số trang 552
Dung lượng 8,66 MB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Appendix 1.A Selected List of Books on Image Processing and Computer1.A.1 Selected List of Books on Signal Processing from Year 2000 141.A.2 Selected List of Books on Pattern Recognition

Trang 2

IMAGE PROCESSING AND PATTERN

RECOGNITION

Fundamentals and Techniques

FRANK Y SHIH

Trang 4

IMAGE PROCESSING AND PATTERN

RECOGNITION

Trang 5

445 Hoes LanePiscataway, NJ 08854IEEE Press Editorial BoardLajos Hanzo, Editor in Chief

J Anderson B M Hammerli W Reeve

F Canavero M Lanzerotti T Samad

Kenneth Moore, Director of IEEE Book and Information Services (BIS)

ReviewersTim Newman

Ed Wong

Trang 6

IMAGE PROCESSING AND PATTERN

RECOGNITION

Fundamentals and Techniques

FRANK Y SHIH

Trang 7

Published by John Wiley & Sons, Inc., Hoboken, New Jersey.

Published simultaneously in Canada.

No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4470, or on the web at www.copyright.com Requests to the

Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/ permissions.

Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts

in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose No warranty may be created or extended by sales representatives or written sales materials The advice and strategies contained herein may not be suitable for your situation You should consult with a professional where appropriate Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.

For general information on our other products and services, or technical support, please contact our Customer Care Department within the United States at (800) 762-2974, outside the United States at (317) 572-3993 or fax (317) 572-4002.

Wiley also publishes its books in a variety of electronic formats Some content that appears in print may not be available in electronic books For more information about Wiley products, visit our web site at www.wiley.com

Library of Congress Cataloging-in-Publication Data is Available

Trang 8

Appendix 1.A Selected List of Books on Image Processing and Computer

1.A.1 Selected List of Books on Signal Processing from Year 2000 141.A.2 Selected List of Books on Pattern Recognition from Year 2000 15

Trang 9

4.4.2 Grayscale Dilation Erosion Duality Theorem 75

4.5.9.1 The Simple Morphological Edge Operators 854.5.9.2 Blur-Minimum Morphological Edge Operator 87

4.6.4 Order-Statistic Soft Morphological (OSSM) Filters 99

4.6.6 Recursive Order-Statistic Soft Morphological Filters 104

5.3.3 The Gravitation External Force Field and The Greedy Algorithm 128

5.5 Edge Linking by Adaptive Mathematical Morphology 137

5.5.2 The Adaptive Morphological Edge-Linking Algorithm 140

Trang 10

5.6 Automatic Seeded Region Growing 1465.6.1 Overview of the Automatic Seeded Region Growing Algorithm 146

5.7.2 Overview of the TDRD-Based Image Segmentation 159

5.7.3 The Region Dividing and Subregion Examination Strategies 162

5.7.5 Potential Applications in Medical Image Analysis 173

6 DISTANCE TRANSFORMATION AND SHORTEST PATH PLANNING 179

6.2 Distance Transformation by Mathematical Morphology 184

6.4 Decomposition of Distance Structuring Element 1886.4.1 Decomposition of City-Block and Chessboard Distance Structuring

6.4.2 Decomposition of the Euclidean Distance Structuring Element 190

6.6.1 Acquiring Approaches for City-Block and Chessboard Distance

6.6.2 Acquiring Approach for Euclidean Distance Transformation 196

6.7.3 The Complexity of the Two-Scan Algorithm 203

6.8.1 A Problematic Case of Using the Acquiring Approaches 2046.8.2 Dynamically Rotational Mathematical Morphology 2056.8.3 The Algorithm for Shortest Path Planning 206

6.9 Forward and Backward Chain Codes for Motion Planning 209

Trang 11

7 IMAGE REPRESENTATION AND DESCRIPTION 219

7.4.2.4 The Fully Parallel Thinning Algorithm 2437.4.2.5 Experimental Results and Discussion 243

7.5.1.1 The Skeleton from Distance Function 253

7.5.3 The Skeletonization Algorithm and Connectivity Properties 256

7.6.1 Representation Framework: Formal Languages and Mathematical

Trang 12

8.6 Linear Discriminate Analysis 2918.7 Feature Reduction in Input and Feature Spaces 293

9.1.1 Pass 1: Cluster’s Mean Vector Establishment 308

9.4.3 Binary Morphological Operations by Logic Modules 3249.4.4 Multilayer Perceptron as Processing Modules 327

9.5.3.2 An Improved ART Model for Pattern Classification 3429.5.3.3 Experimental Results of the Improved Model 344

9.6.1 Role of Fuzzy Geometry in Image Analysis 346

PART II

APPLICATIONS

Trang 13

10.2.1.1 Smoothing and Thresholding 37210.2.1.2 Tracing Head and Face Boundaries 374

10.2.2 Finding Facial Features Based on Geometric Face Model 375

10.2.2.2 Geometrical Face Model Based on Gabor Filter 377

10.3.1 Facial Action Coding System and Expression Database 379

10.4 Facial Expression Recognition in JAFFE Database 386

11.1.1 An Improved Two-Step Algorithm for Block Segmentation 399

11.5.2 Statistical Fuzzy Model for Classification 430

Trang 14

12.1.3 Private Versus Public 446

12.2.1 Substitution Watermarking in the Spatial Domain 44812.2.2 Additive Watermarking in the Spatial Domain 450

12.3.1 Substitution Watermarking in the Frequency Domain 45212.3.2 Multiplicative Watermarking in the Frequency Domain 45312.3.3 Watermarking Based on Vector Quantization 455

12.4.2 Weakness of the Block-Based Fragile Watermark 45912.4.3 The Hierarchical Block-Based Fragile Watermark 460

12.6.3 The Watermarking in the Frequency Domain 465

13.5.1 Overview of the GA-Based Breaking Methodology 48213.5.2 The GA-Based Breaking Algorithm on SDSS 48513.5.2.1 Generating the Stego Image on the Visual Steganalytic

13.5.2.2 Generating the Stego Image on the IQM-Based

13.5.3 The GA-Based Breaking Algorithm on FDSS 487

Trang 15

13.5.4.1 The GA-Based Breaking Algorithm on VSS 48913.5.4.2 The GA-Based Breaking Algorithm on IQM-SDSS 49013.5.4.3 The GA-Based Breaking Algorithm on JFDSS 491

14.1.1 Local Thresholding Based on Median Values 49714.1.2 Global Thresholding with Brightness and Area Normalization 501

14.3.3 Classification of Strong, Medium, and Weak CMEs 526

Trang 16

P A R T I

FUNDAMENTALS

Trang 18

is referred to its clarity and audibility A thermocouple can convey temperature, and a

pH meter can convey the acidity of a solution

A signal may take a form of time variations or a spatially varying pattern.Mathematically speaking, signals are represented as functions of one or moreindependent variables that can be either continuous or discrete Continuous-timesignals are defined at a continuum of the time variable Discrete-time signals aredefined at discrete instants of time Digital signals are those for which both time andamplitude are discrete The continuous-time and continuous-amplitude signals arecalled analog signals Analog signals that have been converted to digital forms can beprocessed by a computer or other digital devices

Signal processing is the process of extracting information from the signal.Digital signal processing (DSP) is concerned with the representation of signals bysequences of numbers or symbols and processing of these sequences It was initiated

in the seventeenth century and has become an important modern tool in thetremendously diverse fields of science and technology The purpose of such proces-sing is to estimate characteristic parameters of a signal or to transform a signal into aform that is more sensible to human beings DSP includes subfields such as digitalimage processing, video processing, statistical signal processing, signal processingfor communications, biomedical signal processing, audio and speech signal proces-sing, sonar and radar signal processing, sensor array processing, spectral estimation,and so on

Human beings possess a natural signal processing system “Seeing” takes place

in the visual system and “hearing” takes place in the auditory system Human visualsystem (HVS) plays an important role in navigation, identification, verification, gait,gesture, posture, communication, psychological interpretation, and so on Human

Image Processing and Pattern Recognition by Frank Shih

Copyright  2010 the Institute of Electrical and Electronics Engineers, Inc.

3

Trang 19

auditory system converts sound waves into nerve impulses, to analyze auditoryevents, remember and recognize sound sources, and perceive acoustic sequences Asthe speed, capability, and economic advantages of modern signal processing devicescontinue to increase, there is simultaneously an increase in efforts aimed at devel-oping sophisticated, real-time automatic systems capable of emulating humanabilities Because of digital revolution, digital signals have been increasingly used.Most household electronic devices are based entirely or almost entirely upon digitalsignals The entire Internet is a network of digital signals, as is modern mobile phonecommunication.

1.1 THE WORLD OF SIGNALS

The world is filled with many kinds of signals; each has its own physical meaning.Sometimes the human body is incapable of receiving a special signal or interpreting(decoding) a signal, so the information that the signal intends to convey cannot becaptured Those signals are not to be said nonsense or insignificant, but converselythey are exactly what people are working very hard to understand The more we learnfrom the world’s signals, the better living environment we can provide Furthermore,some disaster or damage can be avoided if a warning signal can be sensed in advance.For example, it was recorded historically that animals, including rats, snakes, andweasels, deserted the Greek city of Helice in droves just days before a quake

usually behave strangely before earthquake by barking, whining, or showing signs ofnervousness and restlessness

The characteristics of a signal may be one of a broad range of shapes,amplitudes, time durations, and perhaps other physical properties Based on thesampling of time axis, signals can be divided into continuous-time and discrete-timesignals Based on the sampling of time and amplitude axes, signals can be divided intoanalog and digital signals If signals repeat in some period, they are called periodicsignals; otherwise, aperiodic or nonperiodic signals If each value of a signal is fixed

by a mathematical function, it is called a deterministic signal; otherwise, a randomsignal that has uncertainty about its behavior In the category of dimensionality,signals are divided into one-dimensional (1D), two-dimensional (2D), three-dimensional (3D), and multidimensional signals, which are further explained below

1.1.1 One-Dimensional Signals

A 1D signal is usually modeled as an ensemble of time waveforms, for example, xðtÞ

evident in such diverse fields as biomedical engineering, acoustics (Beranek, 2007),sonar (Sun et al., 2004), radar (Gini et al., 2001), seismology (Al-Alaoui, 2001),speech communication, and many others When we use a telephone, our voice isconverted to an electrical signal and through telecommunication systems circulatesaround the Earth The radio signals, which are propagated through free space and byradio receivers, are converted into sound In speech transmission and recognition, one

Trang 20

may wish to extract some characteristic parameters of the linguistic messages,representing the temporal and spectral behavior of acoustical speech input Alter-natively, one may wish to remove interference, such as noise, from the signal or tomodify the signal to present it in a form more easily interpreted by an expert.

1.1.2 Two-Dimensional Signals

Signal processing problems are not confined to 1D signals A 2D signal is a function of

the functional behavior in the form of an intensity variation over the (x, y)-plane.Everyday scenes viewed by a human observer can be considered to be composed ofilluminated objects The light energy reflected from these objects can be considered toform a 2D intensity function, which is commonly referred to as an image

As a result of numerous applications, not least as a consequence of cheapcomputer technology, image processing now influences almost all areas of our dailylife: automated acquisition, processing and production of documents, industrialprocess automation, acquisition and automated analysis of medical images, enhance-ment and analysis of aerial photographs for detection of forest fires or crop damage,analysis of satellite weather photos, and enhancement of television transmission fromlunar and deep-space probes

1.1.3 Three-Dimensional Signals

Photographs of a still scene are the images that are functions of the (x, y)-plane Byadding a time variable, the 3D signals represent image sequences of a dynamic scenethat are called video signals Computer analysis of image sequences requires thedevelopment of internal representations for the entities in a depicted scene as well asfor discernible changes in appearance and configuration of such entities Morefundamental approaches result from efforts to improve application-oriented solu-tions Some illustrative examples are given as follows

Image sequences obtained from satellite sensors are routinely analyzed todetect and monitor changes Evaluation of image series recorded throughout thegrowth and harvest periods can result in more reliable cover type mapping as well asimproved estimates of crop field Very important is the determination of clouddisplacement vector fields These are used to estimate wind velocity distributionsthat in turn are employed for weather prediction and meteorological modeling(Desportes et al., 2007)

Biomedical applications are concerned with the study of growth, tion, and transport phenomena Angiocardiography, blood circulation, and studies ofmetabolism are the primary areas of medical interest for the evaluation of temporalimage sequences (Charalampidis et al., 2006) Architects who have to designpedestrian circulation areas would appreciate quantitative data about how pedestrianswalk in halls and corridors Efforts to extract such data from TV-frame sequencescould be considered as behavioral studies They might as well be assigned to aseparate topic such as object tracking (Qu and Schonfeld, 2007), which is of specialconcern in cases of traffic monitoring (Zhou et al., 2007), target tracking, and visual

Trang 21

transforma-feedback for automated navigation (Negahdaripour and Xun, 2002; Xu andTso, 1999).

1.1.4 Multidimensional Signals

When a signal is represented in more than one dimension, it is often called amultidimensional signal As discussed in previous sections, an image is a two-dimensional signal, and a video is a three-dimensional signal A multidimensionalsignal is vector valued and may be a function of multiple relevant independentvariables One chooses the variable domain in which to process a signal by making aninformed guess as to which domain best represents the essential characteristics of thesignal Multidimensional signal processing is an innovative field interested indeveloping technology that can capture and analyze information in more thanone dimension Some of its applications include 3D face modeling (Roy-Chowdhury

et al., 2004), 3D object tracking (Wiles et al., 2001), and multidimensional signalfiltering

The need for a generally applicable artificial intelligence approach for optimaldimensionality selection in high-dimensional signal spaces is evident in problems

likely to fail if vision problems are handled by reducing the dimensionality by means

of throwing away almost certain available information in a basically ad hoc manner.Therefore, designing a system capable of learning the relevant information extractionmechanisms is critical

1.2 DIGITAL IMAGE PROCESSING

Images are produced by a variety of physical devices, including still and videocameras, scanners, X-ray devices, electron microscopes, radar, and ultrasound,and are used for a variety of purposes, including entertainment, medical, business,industrial, military, civil, security, and scientific The interests in digital imageprocessing stem from the improvement of pictorial information for humaninterpretation and the processing of scene data for autonomous machineperception

Webster’s Dictionary defines an image as: “An image is a representation,likeness, or imitation of an object or thing, a vivid or graphic description, somethingintroduced to represent something else.” The word “picture” is a restricted type ofimage Webster’s Dictionary defines a picture as: “A representation made by painting,drawing, or photography; a vivid, graphic, accurate description of an object or thing so

as to suggest a mental image or give an accurate idea of the thing itself.” In imageprocessing, the word “picture” is sometimes equivalent to “image.”

Digital image processing starts with one image and produces a modified version

of that image Webster’s Dictionary defines digital as: “The calculation by numericalmethods or discrete units,” defines a digital image as: “A numerical representation of

an object,” defines processing as: “The act of subjecting something to a process,” and

Trang 22

defines a process as: “A series of actions or operations leading to a desired result.” Anexample of a process is car wash that changes an automobile from dirty to clean.Digital image analysis is a process that converts a digital image into somethingother than a digital image, such as a set of measurement data or a decision Imagedigitization is a process that converts a pictorial form to numerical data A digitalimage is an image f(x, y) that has been discretized in both spatial coordinates andbrightness (intensity) The image is divided into small regions called picture elements

or pixels (see Fig 1.1)

Image digitization includes image sampling (i.e., digitization of spatial ordinates (x, y)) and gray-level quantization (i.e., brightness amplitude digitization)

co-An image is represented by a rectangular array of integers The image sizes and thenumber of gray levels are usually integer powers of 2 The number at each pixelrepresents the brightness or darkness (generally called the intensity) of the image

Figure 1.1 Image digitization (Courtesy of Gonzalez and Woods, 2008)

Figure 1.2 A digital image and its numerical representation

Trang 23

The quality of an image strongly depends upon the number of samples and graylevels; the more are these two, the better would be the quality of an image But, thiswill result in a large amount of storage space as well because the storage space for animage is the product of dimensions of an image and the number of bits required tostore gray levels At lower resolution, an image can result in checkerboard effect or

The visual quality of an image required depends upon its applications Toachieve the highest visual quality and at the same time the lowest memory require-ment, we can perform fine sampling of an image in the neighborhood of sharp gray-level transitions and coarse sampling in the smooth areas of an image This is known assampling based on the characteristics of an image (Damera-Venkata et al., 2000).Another method, known as tapered quantization, can be used for the distribution ofgray levels by computing the occurrence frequency of all allowed levels Quantizationlevel is finely spaced in the regions where gray levels occur frequently, but when graylevels occur rarely in other regions, the quantization level can be coarsely spaced.Images with large amounts of details can sometimes still enjoy a satisfactoryappearance despite possessing a relatively small number of gray levels This can

be seen by examining isopreference curves using a set of subjective tests for images inthe Nk-plane, where N is the number of samples and k is the number of gray levels(Huang, 1965)

In general, image processing operations can be categorized into four types:

1 Pixel operations: The output at a pixel depends only on the input at that pixel,independent of all other pixels in that image Thresholding, a process of makingthe corresponding input pixels above a certain threshold level white and othersblack, is simply a pixel operation Other examples include brightness addition/subtraction, contrast stretching, image inverting, log, and power law

2 Local (neighborhood) operations: The output at a pixel depends on the inputvalues in a neighborhood of that pixel Some examples are edge detection,smoothing filters (e.g., the averaging filter and the median filter), and sharpen-ing filters (e.g., the Laplacian filter and the gradient filter) This operation can beadaptive because results depend on the particular pixel values encountered ineach image region

3 Geometric operations: The output at a pixel depends only on the input levels atsome other pixels defined by geometric transformations Geometric operationsare different from global operations, such that the input is only from somespecific pixels based on geometric transformation They do not require the inputfrom all the pixels to make its transformation

4 Global operations: The output at a pixel depends on all the pixels in an image Itmay be independent of the pixel values in an image, or it may reflect statisticscalculated for all the pixels, but not a local subset of pixels A popular distancetransformation of an image, which assigns to each object pixel the minimumdistance from it to all the background pixels, belongs to a global operation

Trang 24

Other examples include histogram equalization/specification, image warping,Hough transform, and connected components.

Nowadays, there is almost no area that is not impacted in some way by digital imageprocessing Its applications include

1 Remote sensing: Images acquired by satellites and other spacecrafts are useful

in tracking Earth’s resources, solar features, geographical mapping (Fig 1.3),and space image applications (Fig 1.4)

2 Image transmission and storage for business: Its applications include cast television, teleconferencing, transmission of facsimile images for officeautomation, communication over computer networks, security monitoringsystems, and military communications

broad-3 Medical processing: Its applications include X-ray, cineangiogram, transaxialtomography, and nuclear magnetic resonance (Fig 1.5) These images may be

Figure 1.3 Remote sensing images for tracking Earth’s climate and resources

Figure 1.4 Space image applications

Trang 25

used for patient screening and monitoring or for detection of tumors or otherdiseases in patients.

4 Radar, sonar, and acoustic image processing: For example, the detection andrecognition of various types of targets and the maneuvering of aircraft(Fig 1.6)

5 Robot/machine vision: Its applications include the identification or description

of objects or industrial parts in 3D scenes (Fig 1.7)

Figure 1.5 Medical imaging applications

Figure 1.6 Radar imaging

Trang 26

1.3 ELEMENTS OF AN IMAGE PROCESSING SYSTEM

Elements of an image processing system include

1 Image acquisition: A physical device that is sensitive to a band in theelectromagnetic energy spectrum can produce an electrical signal output

A digitizer is used for converting the electrical signal output into a digitalform Digital images can be obtained by conversion of the analog images (such

as 35 mm prints, slides, transparencies, or reflective art) into digital images with

a scanner, or else by directly capturing the object or scene into digital forms bymeans of a digital camera or video-capturing device

2 Storage:

(a) Short-term storage for use during processes One of the means of providingshort-term storage is computer memory Another is a specialized board,called a frame buffer

(b) Online storage for relatively fast recall

(c) Archival storage characterized by infrequent access The term “archivalquality” is used to designate materials that are permanent and durable, andtherefore can be safely used for preservation purposes The objective ofarchival storage is the protection against tampering, deletion, viruses, anddisaster

3 Processing: Most image processing functions can be implemented in software,running on a host computer

4 Communication: Cable modem Internet services on average promise higherlevels of bandwidth than DSL Internet services, and this bandwidth roughlytranslates to communication speed Cable Internet theoretically runs faster thanDSL Cable technology can support approximately 30 Mbps (megabits persecond) of bandwidth, whereas most forms of DSL cannot reach 10 Mbps

5 Display: An image display device may take the form of an illuminating picturelamp providing means by which images may be illuminated by a light source on

a selectable and removably attachable basis Monochrome and color monitorsare the principal display devices used Other display media include randomaccess cathode ray tubes (CRTs) and printing devices

Figure 1.7 Robot and machine vision applications

Trang 27

To illustrate the systematic procedures of an image processing system, we give anexample of human face identification (Adler and Schuckers, 2007) The problemdomain is the faces of people The objective is to associate the face with the identity ofthe person The output is a person’s unique identifier (e.g., social security number).The necessary procedures to achieve the goal of face identification could include

1 Image acquisition: The face image could be acquired through a high-resolutionstill digital camera and compressed to an image file

2 Preprocessing: The acquired image may be enhanced by improving contrast,sharpness, color, and so on

3 Segmentation: The image may first be cropped to only the facial area Then, theface may be segmented into eyes, mouth, nose, chin, and so on

4 Representation and description: In this step, each of the segmented areas may

be characterized by statistical data, for example, principal components lysis, texture, aspect ratios of eyes and nose, or the color of eyes

ana-5 Matching recognition and interpretation: This step may involve using thecharacteristics derived in the previous step to match each individually seg-mented area based on specific recognition algorithms For example, eyes may

be processed to determine, based on its features, what class of eye it is Then, all

of these interpretations are used to create a composite description of the

“ensemble,” perhaps in the form of a feature vector for the subject

6 Knowledge base: Finally, the feature vector above may be fed to a knowledgebase of all known subjects to associate it with one of the subjects in the database,thus returning perhaps the individual’s social security number or perhaps aconfidence score of the match

APPENDIX 1.A SELECTED LIST OF BOOKS ON IMAGE PROCESSING AND COMPUTER VISION FROM YEAR 2000

Acharya, T and Ray, A K., Image Processing: Principles and Applications, Wiley,Sept 2005

Barner, K E and Arce, G R., Nonlinear Signal and Image Processing: Theory,Methods, and Applications, CRC Press, 2004

Bhanu, B and Pavlidis, I., Computer Vision Beyond the Visible Spectrum, Springer,2004

Bose, T., Digital Signal and Image Processing, Wiley, Nov 2003

Burger, W and Burge, M J., Digital Image Processing: An Algorithmic IntroductionUsing Java, Springer, 2008

Chan, T F and Shen, J., Image Processing and Analysis: Variational, PDE, Wavelet,and Stochastic Methods, SIAM, 2005

Chen, C H., Image Processing for Remote Sensing, CRC Press, 2008

Cyganek, B and Siebert, J P., An Introduction to 3-D Computer Vision Techniquesand Algorithms, Wiley, Feb 2009

Trang 28

Dhawan, A., Medical Image Analysis, Wiley, July 2003.

Fisher, R B., Dictionary of Computer Vision and Image Processing, Wiley, 2005.Forsyth, D A and Ponce, J., Computer Vision: A Modern Approach, Prentice Hall,2003

Gonzalez, R C and Woods, R E., Digital Image Processing, 3rd edition, PrenticeHall, 2008

Gonzalez, R C., Woods, R E., and Eddins, S L., Digital Image Processing UsingMATLAB, Prentice Hall, 2003

Hartley, R and Zisserman, A., Multiple View Geometry in Computer Vision,Cambridge University Press, 2003

Hornberg, A., Handbook of Machine Vision, Wiley, 2006

J€ahne, B., Digital Image Processing, Springer, 2005

Koschan, A and Abidi M., Digital Color Image Processing, Wiley, Apr 2008.Kulkarni, A D., Computer Vision and Fuzzy-Neural Systems, Prentice Hall, 2001.Lukac, R and Plataniotis, K N., Color Image Processing: Methods and Applications,CRC Press, 2007

Maıˆtre, H., Image Processing, Wiley, Aug 2008

Medioni, G and Kang, S B., Emerging Topics in Computer Vision, Prentice Hall,2005

McAndrew, A., Introduction to Digital Image Processing with Matlab, ThomsonCourse Technology, 2004

Mitra, S K and Sicuranza, G L., Nonlinear Image Processing, Academic Press,2001

Morris, T., Computer Vision and Image Processing, Palgrave Macmillan, 2003.Nixon, M S and Aguado, A S., Feature Extraction and Image Processing,Academic Press, 2008

Paragios, N., Chen, Y., and Faugeras, O., Handbook of Mathematical Models inComputer Vision, Springer, 2006

Petrou, M and Sevilla, P G., Image Processing: Dealing With Texture, Wiley, Mar.2006

Pitas, I., Digital Image Processing Algorithms and Applications, Wiley–IEEE, 2000.Pratt, W K., Digital Image Processing: PIKS Scientific Inside, 4th edition, Wiley, 2007.Ritter, G X and Wilson, J N., Handbook of Computer Vision Algorithms in ImageAlgebra, CRC Press, 2001

Russ, J C., The Image Processing Handbook, CRC Press, 2006

Sebe, N., Machine Learning in Computer Vision, Springer, 2005

Shapiro, L G and Stockman, G C., Computer Vision, Prentice Hall, 2001.Shih, F.Y., Image Processing and Mathematical Morphology: Fundamentals andApplications, CRC Press, 2009

Snyder, W E and Qi, H., Machine Vision, Cambridge University Press, 2004.Sonka, M., Hlavac, V., and Boyle, R., Image Processing, Analysis, and MachineVision, Thomson Wadsworth, 2007

Trang 29

Tuzlukov, V P., Signal and Image Processing in Navigational Systems, CRC Press,2005.

Umbaugh, S E., Computer Imaging: Digital Image Analysis and Processing, CRCPress, 2005

Whelan, P F and Molloy, D., Machine Vision Algorithms in Java: Techniques andImplementation, Springer, 2001

Woods, J W., Multidimensional Signal, Image, and Video Processing and Coding,Academic Press, 2006

Zuech, N., Understanding and Applying Machine Vision, CRC Press, 2000

1.A.1 Selected List of Books on Signal Processing from Year 2000Blackledge, J M., Digital Signal Processing: Mathematical and ComputationalMethods, Software Development, and Applications, Horwood Publishing, 2006.Blanchet, G and Charbit, M., Digital Signal and Image Processing Using MATLAB,Wiley, May 2006

Garello, R., Two-Dimensional Signal Analysis, Wiley, Apr 2008

Gaydecki, P., Foundations of Digital Signal Processing: Theory, Algorithms, andHardware Design, IET, 2004

Gray, R M and Davisson, L D., An Introduction to Statistical Signal Processing,Cambridge University Press, 2004

Ifeachor, E C and Jervis, B W., Digital Signal Processing: A Practical Approach,Prentice Hall, 2002

Ingle, V K and Proakis, J G., Digital Signal Processing Using MATLAB, Brooks/Cole, 2000

Khan, A A., Digital Signal Processing Fundamentals, Da Vinci Engineering Press,2005

Mitra, S K., Digital Signal Processing: A Computer-Based Approach, McGraw-Hill,2001

Narasimhan, S V and Veena, S., Signal Processing: Principles and Implementation,Alpha Science Int’l Ltd., 2005

Proakis, J G and Manolakis, D G., Digital Signal Processing, Pearson Prentice Hall,2007

Quinquis, A., Digital Signal Processing Using Matlab, Wiley, Apr 2008

Rockmore, D N and Healy, D M., Modern Signal Processing, Cambridge UniversityPress, 2004

Sundararajan, D., Digital Signal Processing: Theory and Practice, World Scientific,2003

Wang, B.-C., Digital Signal Processing Techniques and Applications in RadarImage Processing, Wiley, Aug 2008

Telecommunications and Multimedia, Springer, 2005

Trang 30

1.A.2 Selected List of Books on Pattern Recognition from Year 2000Abe, S., Support Vector Machines for Pattern Classification, Springer, 2005.Bhagat, P., Pattern Recognition in Industry, Elsevier, 2005.

Bishop, C M., Pattern Recognition and Machine Learning, Springer, 2006.Bunke, H., Kandel, A., and Last, M., Applied Pattern Recognition, Springer, 2008Chen, D and Cheng, X., Pattern Recognition and String Matching, Springer, 2002.Corrochano, E B., Handbook of Geometric Computing: Applications in PatternRecognition, Computer Vision, Neuralcomputing, and Robotics, Springer, 2005.Duda, R O., Hart, P E., and Stork, D G., Pattern Classification, 2nd edition, Wiley,2003

Dunne, R A., A Statistical Approach to Neural Networks for Pattern Recognition,Wiley, July 2007

Gibson, W., Pattern Recognition, Penguin Group, 2005

Hand, D J., Mannila, H., and Smyth, P., Principles of Data Mining, MIT Press, Aug.2001

Hastie, T., Tibshirani, R., and Fridman, J., The Elements of Statistical Learning: DataMining, Inference, and Prediction, Springer, 2001

Raudys, S., Statistical and Neural Classifiers, Springer, 2001

Schlesinger, M I and Hlavac, V., Ten Lectures on Statistical and Structural PatternRecognition, Kluwer Academic Publishers, 2002

Suykens, A K., Horvath, G., Basu, S., Micchelli, C., and Vandewalle, J., Advances inLearning Theory: Methods, Models and Applications, NATO Science Series III:Computer & Systems Sciences, vol 190, IOS Press, Amsterdam, 2003

Theodoridis, S and Koutroumbas, K., Pattern Recognition, Academic Press, 2003.Webb, A R., Statistical Pattern Recognition, 2nd edition, Wiley, 2002

Trang 31

Charalampidis, D., Pascotto, M., Kerut, E K., and Lindner, J R., “Anatomy and flow in normal and ischemic microvasculature based on a novel temporal fractal dimension analysis algorithm using contrast enhanced ultrasound,” IEEE Trans Med Imaging, vol 25, no 8, pp 1079–1086, Aug 2006 Damera-Venkata, N., Kite, T D., Geisler, W S., Evans, B L., and Bovik, A C., “Image quality assessment based on a degradation model,” IEEE Trans Image Process., vol 9, no 4, pp 636–650, Apr 2000 Desportes, C., Obligis, E., and Eymard, L., “On the wet tropospheric correction for altimetry in coastal regions,” IEEE Trans Geosci Remote Sens., vol 45, no 7, pp 2139–2149, July 2007.

Gini, F., Farina, A., and Greco, M., “Selected list of references on radar signal processing,” IEEE Trans Aerospace Electron Syst., vol 37, no 1, pp 329–359, Jan 2001.

Gonzalez, R C and Woods, R E., Digital Image Processing, 3rd edition, Prentice Hall, 2008 Huang, T S., “PCM picture transmission,” IEEE Spectrum, vol 2, no 12, pp 57–63, Dec 1965 Negahdaripour, S and Xun X., “Mosaic-based positioning and improved motion-estimation methods for automatic navigation of submersible vehicles,” IEEE J Oceanic Eng., vol 27, no 1, pp 79–99, Jan 2002.

Qu, W and Schonfeld, D., “Real-time decentralized articulated motion analysis and object tracking from videos,” IEEE Trans Image Process., vol 16, no 8, pp 2129–2138, Aug 2007.

Roy-Chowdhury, A K., Chellappa, R., and Keaton, T., “Wide baseline image registration with application

to 3-D face modeling,” IEEE Trans Multimedia, vol 6, no 3, pp 423–434, June 2004.

Sun, Y., Willett, P., and Lynch, R., “Waveform fusion in sonar signal processing,” IEEE Trans Aerospace Electron Syst., vol 40, no 2, pp 462–477, Apr 2004.

Wiles, C S., Maki, A., and Matsuda, N., “Hyperpatches for 3-D model acquisition and tracking,” IEEE Trans Pattern Anal Mach Intell., vol 23, no 12, pp 1391–1403, Dec 2001.

Xu, W L and Tso, S K., “Sensor-based fuzzy reactive navigation of a mobile robot through local target switching,” IEEE Trans Syst Man Cybernet C, vol 29, no 3, pp 451–459, Aug 1999.

Zhou, J., Gao, D., and Zhang, D., “Moving vehicle detection for automatic traffic monitoring,” IEEE Trans Veh Technol., vol 56, no 1, pp 51–59, Jan 2007.

Trang 32

C H A P T E R 2

MATHEMATICAL PRELIMINARIES

Most images are recorded and processed in the time domain or spatial domain Thespatial domain refers to the aggregate of pixels composing an image, and the spatial-domain processing involves operations that apply directly on these pixels However, it

is sometimes convenient and efficient to process images in the frequency domainbecause edge pixels generally correspond to high-frequency components and interiorpixels of an object region correspond to low-frequency components In this chapter, themathematical preliminaries that are often used in image processing for converting animage from spatial domain to frequency domain are introduced These include Laplacetransform, Fourier transform, Z-transform, cosine transform, and wavelet transform

s þ j1, where s is any fixed real number for which s ¼ s is a point in the region ofabsolute convergence of X(s) Equation (2.1) is called the one-sided Laplace transform,

Image Processing and Pattern Recognition by Frank Shih

Copyright  2010 the Institute of Electrical and Electronics Engineers, Inc.

17

Trang 33

which depends only on the values of the signal x(t) for t 0 and must satisfy thefollowing constraint:

not have a Laplace transform

transform of u(t) is given by

L e½  ¼ct 1

sc

written as the sum of two exponential functions:

sinðo0tÞ ¼ejo0tejo0t

2j

Trang 34

The Laplace transform of sinðo0tÞ is given by

2.1.1 Properties of Laplace Transform

Laplace transform satisfies a number of properties that are useful in the transformationbetween frequency domain and time domain (Schiff, 1999; Graf, 2004) By usingthese properties, it is helpful in deriving many new transformation pairs from a basicset of pairs Therefore, x(t) and X(s) are called the Laplace transform pair A number

of often-used transform pairs are given as follows:

s2þ o2

Several fundamental properties of Laplace transform are given below

1 Linearity: axðtÞ þ byðtÞ $ aXðsÞ þ bYðsÞ

Except time shift, one can also perform frequency shift, which corresponds

to the multiplication by an exponential in time domain, as follows:

Trang 35

4 Time scale: xðctÞ $1

sc

 

Laplace transform of second-order or high-order derivatives of x(t) can also beexpressed in terms of X(s) and the initial conditions Let n be an arbitrarypositive integer

9 Convolution: Given two functions x(t) and h(t) whose values are equal to zero

frequency domain by Laplace transform Conversely, the convolution infrequency domain corresponds to a product in time domain

xðtÞ  hðtÞ $ XðsÞHðsÞxðtÞhðtÞ $ XðsÞ  HðsÞThe convolution in the 2D case can be represented as

respectively A simple change of variables produces

Trang 36

Another operation similar to convolution, called correlation, can be ented as

where f*is the complex conjugate of f In image processing, the difference of correlation

as compared to convolution is that the function h is not folded about the origin

((

Answer: The convolution is calculated as follows It can be separated into four cases, as

Trang 37

Determine the convolution and correlation of f by h Use the lower-left pixel of h (i.e.,the underlined pixel) as its origin and deal with the pixels outside the boundary ofimage f to be zeros.

and then overlaying the result The flipped h is

1 Separation of real and imaginary numbers:

xðtÞ ¼ Re½xðtÞ þ j Im½xðtÞ $ XðsÞ ¼ Re½XðsÞ þ j Im½XðsÞ

Trang 38

2.2 FOURIER TRANSFORM

In the early 1800s, French mathematician Joseph Fourier, with his studies of theproblem of heat flow, introduced Fourier series for the representation of continuous-time periodic signals He made a serious attempt to prove that any piecewise smoothfunction can be expressed into a trigonometric sum Fourier’s theory has significantapplications in signal processing The frequency spectrum of a signal can be generated

by representing the signal as a sum of sinusoids (or complex exponentials), called aFourier series This weighted sum represents the frequency content of a signal calledspectrum

When a signal becomes nonperiodic, its period becomes infinite and itsspectrum becomes continuous An image is considered as a spatially varying function.The Fourier transform decomposes such an image function into a set of orthogonalfunctions and converts the spatial intensity image into its frequency domain From thecontinuous form, one can digitize it for discrete-time images Signal processing inthe frequency domain simplifies computational complexity in filtering analysis; thus,the Fourier transform has played the leading role in signal processing and engineeringcontrol for a long time (Sneddon, 1995; Bracewell, 2000)

Let T be a fixed positive real number A continuous-time signal x(t) is said to beperiodic with a period T if

xðtÞ ¼ xðt þ TÞ for all t

That is, a periodic signal repeats every T seconds, the frequency of the signal is

parameter is often used According to Fourier’s theorem, x(t) can be expressed as

a linear sum of complex exponentials:

The trigonometric identity is applied as

A cosðnotÞB sinðnotÞ ¼pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiA2þ B2

Trang 39

Therefore, the following is obtained:

n¼1

Equations (2.14) and (2.15) are called the Fourier transform pair and can be shown to

conditions are almost satisfied in practice The Fourier transform is generallycomplex:

XðoÞ

j j ¼ ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi

R2ðoÞ þ I2ðoÞp

and

XðoÞ

Trang 40

Example 2.8 Derive the Fourier transform of f(x) that is defined in the followingfigure:

35dx

35

respectively, and a and b are two arbitrary constants, then

Ngày đăng: 05/06/2014, 12:04

TỪ KHÓA LIÊN QUAN