This document present introductions, what is computer vision, why study computer vision, why computer vision matters, how vision is used now...
Trang 1Department of Mechatronics
Chapter 1 Introduction to Computer Vision
James Hays, Brown University
Trang 2Department of Mechatronics
• Introductions
• What is Computer Vision?
• Why study computer vision?
• Why computer vision matters?
• How vision is used now?
Trang 3Department of Mechatronics
Brief history of computer vision
• 1966: Minsky assigns computer vision
as an undergrad summer project
• 1960’s: interpretation of synthetic
worlds
• 1970’s: some progress on interpreting
selected images
• 1980’s: ANNs come and go; shift toward
geometry and increased mathematical
rigor
• 1990’s: face recognition; statistical
analysis in vogue
• 2000’s: broader recognition; large
annotated datasets available; video
Trang 4Department of Mechatronics
Computer Vision
• Make computers understand images and video.
What kind of scene?
Where are the cars?
How far is the building?
Trang 5Human brains have ~10.000 computational power than computer brains
Lex Fridman, MIT
Trang 6Department of Mechatronics
Computer Vision and Nearby Fields
• Computer Graphics: Models to Images
• Computer Photography: Images to Images
• Computer Vision: Images to Models
Trang 7Department of Mechatronics
Related disciplines
Trang 8Department of Mechatronics
What is it related to?
Trang 9Department of Mechatronics
The goal of computer vision
• To bridge the gap between pixels and “meaning”.
Trang 10Department of Mechatronics
Trang 11Department of Mechatronics
Segmentation
Trang 12Department of Mechatronics
Perception
Trang 13Department of Mechatronics
Trang 14Department of Mechatronics
Trang 15Department of Mechatronics
What kind of information can we
extract from an image?
• Metric 3D information
• Sematic information
Trang 16Department of Mechatronics
Guess which bread is real?
Trang 17Department of Mechatronics
Left one is real
Trang 18Department of Mechatronics
Vision as measurement device
Trang 19Department of Mechatronics
Vision as a source of sematic information
Trang 20Department of Mechatronics
Why study computer vision?
• Vision is useful: images and video are everywhere.
Trang 21Why study computer vision?
In the field of artificial intelligence, the most difficult problems are informally known as
AI-complete or AI-hard, implying that the difficulty of these computational problems is
equivalent to solving the central artificial intelligence problem—making computers as intelligent as people, or strong AI.
AI-complete problems are hypothesised to include computer vision, natural language understanding, and dealing with unexpected circumstances while solving any real world problem.
Trang 22Department of Mechatronics
Vision is really hard
• Vision is an amazing feat of natural intelligence
brain
More human brain devoted to vision than anything else
Is that a queen or a bishop?
Trang 23Department of Mechatronics
Challenges: view variation
Trang 24Department of Mechatronics
Challenges: illumination
Trang 25Department of Mechatronics
Challenges: scale
Trang 26Department of Mechatronics
Challenges: deformation
Trang 27Department of Mechatronics
Challenges: occlusion
Magritte, 1957
Trang 28Department of Mechatronics
Challenges: background clutter
Trang 29Department of Mechatronics
Challenges: motion
Trang 30Department of Mechatronics
Challenges: object intra-class variation
Trang 31Department of Mechatronics
Challenges: local ambiguity
Trang 32Department of Mechatronics
Challenges or Opportunities?
• Images are confusing, but they also reveal the structure
of the world through numerous cues.
• Our job is to interpret the cues!
Trang 33Department of Mechatronics
Why computer vision matters
Trang 34Department of Mechatronics
How vision is used now
• Examples of state-of-the-art
Trang 35Department of Mechatronics
Optical character recognition (OCR)
Digit recognition, AT&T labs
http://www.research.att.com/~yann /
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR
software
License plate readers
http://en.wikipedia.org/wiki/Automatic_number_plate_recogni
tion
Trang 36Department of Mechatronics
Face detection
• Many new digital cameras now detect faces
• Canon, Sony, Fuji, …
Trang 37Department of Mechatronics
Smile detection
Sony Cyber-shot® T70 Digital Still Camera
Trang 38Department of Mechatronics
3D from thousands of images
Building Rome in a Day: Agarwal et al 2009
Trang 40Department of Mechatronics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story wikipedia
Trang 41Department of Mechatronics
Login without a password…
Fingerprint scanners on many new laptops, other devices
Face recognition systems now beginning to appear more widely
http://www.sensiblevision.com/
Trang 42Department of Mechatronics
Object recognition (in mobile phones)
Point & Find , Nokia Google
Trang 43Department of Mechatronics
The Matrix movies, ESC Entertainment, XYZRGB, NRC
Special effects: shape capture
Trang 44Department of Mechatronics
Pirates of the Carribean, Industrial Light and Magic
Special effects: motion capture
Trang 45Department of Mechatronics
Sports
Sportvision first downline Nice explanation on www.howstuffworks.com http://www.sportvision.com/video.html
Trang 47Department of Mechatronics
Advanced Driver Assistance Systems
ADAS applications
• Pedestrian Detection
• Blind Spot Detection
• Traffic Sign Detection
• Collision Avoidance
• Lane Detection
• Backup Obstacle Detection
• Occupant Sensing
www.uurmi.com
Trang 48Department of Mechatronics
Traffic Sign Recognition
Trang 49Department of Mechatronics
Google cars
Oct 9, 2010 "Google Cars Drive Themselves, in Traffic" The New York Times John Markoff
June 24, 2011 "Nevada state law paves the way for driverless
cars" Financial Post Christine Dobby
Aug 9, 2011, "Human error blamed after Google's driverless car sparks five-vehicle crash" The Star (Toronto)
Trang 50Department of Mechatronics
Google cars
• WAYMO – Self-driving car.
Trang 51Department of Mechatronics
Uber Self-Driving Car
Trang 52Department of Mechatronics
Interactive Games: Kinect
• Object Recognition:
http://www.youtube.com/watch?feature=iv&v=fQ59dX Oo63o
• Robot:
http://www.youtube.com/watch?v=w8BmgtMKFbY
Trang 53• Obstacle detection, position tracking
• For more, read “ Computer Vision on Mars ” by Matthies et al.
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007
Trang 54Department of Mechatronics
Industrial robots
Vision-guided robots position nut runners on wheels
Trang 56Department of Mechatronics
Medical imaging
Image guided surgery
Grimson et al., MIT
3D imaging MRI, CT