CS 766 - Computer Vision
Spring 1997
Office: 6397 CSS
Telephone: 262-1965
E-mail: dyer@cs.wisc.edu
Office Hours: 2 - 3 Tuesdays and Thursdays, and by appointment
Teaching Assistant: Bryan So
Office: 5364 CSS
Telephone: 262-5105
E-mail: so@cs.wisc.edu
Office Hours: 1 - 2 Mondays and Wednesdays, and by appointment
General Course Information
Broad overview of the basic concepts of computer vision.
Introduction to low-level image analysis methods, including image
formation, edge detection, feature detection, and segmentation.
Computer graphics techniques (e.g., warping, morphing, and mosaics)
for image synthesis. Principles of defining modules for
reconstructing three-dimensional scene information using techniques
such as shape from shading and depth from stereo. Active methods
for scene recovery, such as depth from focus and occluding contour
detection by viewpoint control. Motion detection and analysis,
including tracking. Model-based three-dimensional object
recognition.
- Schedule
9:30 - 10:45 a.m. Tuesdays and Thursdays in 3345 Engineering Hall
- Prerequisites
CS 540; fundamentals of calculus, probability theory, and linear
algebra; and C programming
- Grading
- Midterm Exam (about 2/3 through semester): 35%
- Homework assignments: 35%
- Project: 25%
- Class participation: 5%
- Syllabus
- Required Readings
- Selected parts of Machine Vision by R. Jain, R. Kasturi, and
B. G. Schunck, McGraw-Hill, New York, 1995
- A collection of readings from journals and conference
proceedings; to be sold at DOIT Documentation in small batches
- Supplementary Reading Sources
- Web Page: www.cs.wisc.edu/~dyer/766/
- Class E-Mail Alias: cs766-1list@cs.wisc.edu
Reading Assignments to Date
- Chapter 15
- Chapter 14
- Chapters 9, 11.1 - 11.3
- Handout #5 (4 papers)
- Chapters 6, 7.1 - 7.3, 13.6.2, and 15.6.1
- Chapters 4 and 5, Handout #3 (3 papers), and Handout #4 (5 papers)
- Chapters 2 and 3, plus Vista Documentation (Handout #2)
- Chapters 1 and 8 of JKS (Jain, Kasturi, and Schunck), and
Handout #1 (1 paper) (available at DOIT)
Homework Assignments
Computer Accounts
- Accounts
Course accounts are on the instructional Unix workstations (sols
and vegas) running Solaris. Use the color workstations if you are
displaying images. Each class account has a disk quota of 50MB so
that you can store images for homework and your project. To save
space, be sure to delete old images and compress the rest (see
gzip(1)).
- Email
Email sent to cs766-1list goes to everyone
in the class, including the instructor and TA.
- Printers
To print images, use one of the laser printers,
laser1 - laser4, which are located in room 1359.
Alternatively, the generic printer name laser sends output to
whichever of the four printers has the shortest queue.
Caution: Images take a long time to print, so check the queue
before sending any to the printer. If many jobs are waiting, it is
bad manners to add images to the queue. Be considerate!
- Vision Software
- Vista
The Vista programming environment will be used in the homework
assignments. The Solaris-compatible code is located in the
directory /p/vision/ip-tools/vista.solaris/. Man pages are
in /p/vision/ip-tools/vista.solaris/man/ and executables
are in /p/vision/ip-tools/vista.solaris/bin/. If you are
using a machine running SunOS, use the corresponding files in the
directory /p/vision/ip-tools/vista/ instead.
- Xv
xv(1) is an interactive image display program for the X
window system that is very useful for displaying images in a
variety of formats.
- ImgStar
70 basic image processing operations invoked using Unix-like
command lines. Code, executables and manual are in
/p/vision/ip-tools/imgstar/
- Khoros
The Khoros image processing software development environment
provides a set of basic image processing modules and a graphical
programming language interface for rapid prototyping of simple
image processing algorithms. The code is located in the directory
/p/vision/ip-tools/khoros. The executable that starts the
interactive environment is /p/vision/ip-tools/khoros/bin/cantata.
- Netpbm
A toolkit for converting images between a wide variety of
formats. Based on the Pbmplus package. Man pages are in
/p/vision/ip-tools/man/ and executables are in
/p/vision/ip-tools/bin/.
- Matlab
Matlab(1) is a numeric computation and visualization
environment. The signal processing and image processing toolboxes
are especially relevant.
- Test Images
Most test images will be put in the directory
/p/vision/images/, although they may require format
conversion before they can be used. Some other images may be put in
~cs766-1/public/images/. Numerous image databases are also
accessible via the WWW; for example, see the collection
of test images at CMU.
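The advice under Accounts (stay within the 50MB quota and compress images you are not actively using) can be sketched in a few lines of Python. This is only an illustration and not part of the course software: the function names, the choice of Netpbm suffixes (.pgm, .ppm), and the reading of 50MB as 50 * 2**20 bytes are all assumptions. On the workstations, gzip(1) itself does the same job.

```python
import gzip
import shutil
from pathlib import Path

# Assumption: the 50MB quota is interpreted as 50 * 2**20 bytes.
QUOTA_BYTES = 50 * 1024 * 1024


def directory_usage(root: Path) -> int:
    """Return the total size in bytes of all regular files under root."""
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())


def gzip_file(path: Path) -> Path:
    """Compress path to path.gz and remove the original file."""
    target = path.with_name(path.name + ".gz")
    with path.open("rb") as src, gzip.open(target, "wb") as dst:
        shutil.copyfileobj(src, dst)
    path.unlink()
    return target


def compress_images(root: Path, suffixes=(".pgm", ".ppm")) -> int:
    """Gzip every file under root whose suffix is listed; return bytes saved.

    The default suffixes are hypothetical; adjust them to the image
    formats you actually store.
    """
    before = directory_usage(root)
    # sorted() takes a snapshot, so newly created .gz files are not revisited.
    for p in sorted(root.rglob("*")):
        if p.is_file() and p.suffix in suffixes:
            gzip_file(p)
    return before - directory_usage(root)
```

For example, compress_images(Path.home() / "images") would gzip every matching file under a hypothetical ~/images directory and return the number of bytes freed, and directory_usage(Path.home()) <= QUOTA_BYTES checks whether the account is within the assumed quota.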
Examination
- There will be one examination on Thursday,
April 10, from 7:15 to 9:15 p.m. in Room 1325 CS. The exam
will cover all topics through that date, including readings in the
textbook, papers sold at DOIT, and homework assignments. You may
bring one (1) 8.5" x 11" sheet of paper into the exam, with any
notes you want on both sides. The exam will focus on main ideas and
algorithms, not proofs. See the old exams below for the types of
questions that will be asked.
- Old Exams
Links of Interest
Vision Demos and Projects
Other Computer Vision and Related Courses