|
|
Plenary Talks
Friday, January 13, 2006
Special Talk : Prof. Takeo Kanade, Carnegie Mellon University, Pittsburgh
16:00-17:00 Special Talk on "Factorization Methods in Computer Vision"
Saturday, January 14, 2006
Prof. Andrew Blake, Microsoft Research, Cambridge
08:30-10:00 Inauguration followed by Plenary Talk 1
Video Segmentation by Fusion of Colour, Contrast and Stereo
Technology advances mean that a stereo webcam could be manufactured and
sold for essentially the same price as a monocular one. There are two
outstanding advantages for teleconferencing in using stereo vision.
First, automatic control of pan/tilt/zoom, which is possible
monocularly, is particularly robust in stereo. Second privacy can be
protected by obscuring background elements and replacing them with safer
ones. For example a business conversation held at home could show only
the talking head, against a bland video background, with inappropriate
elements obscured. The first of these advantages is readily attainable
and we describe progress towards achieving the second.
An algorithm will be described that is capable of real-time segmentation
of foreground from background layers in stereo video sequences.
Automatic separation of layers from colour/contrast or from stereo alone
is known to be error-prone. Here, colour,contrast and stereo matching
information are fused to infer layers accurately and efficiently. The
"Layered Graph Cut" algorithm does not directly solve stereo. Instead it
marginalises the stereo match likelihood over foreground and background
hypotheses, and fuses it with a contrast-sensitive colour model that is
learned on the fly. Segmentation is then solved efficiently and exactly
by binary graph cut. The algorithm will be demonstrated in the
application of background substitution and shown to give good quality
composite video output.
Sunday, January 15, 2006
Prof. Andrew Zisserman, Oxford University
08:30-09:40 Plenary Talk 2
Category Recognition: Bags of Words and Beyond
There has been much recent research activity - and much recent success -
in recognizing visual object categories (such as cars, faces,
motorbikes) in images. The success has come from representing objects by
sets of local iconic image patches, where each patch may be thought of
as a "visual word" for describing part of the object. Surprisingly
object categories can be recognized without including the spatial
organization/location of the patches, and these models are referred to
as a "bag of words" in analogy with similar models in the statistical
text understanding literature.
In this talk I will describe two methods for learning bag of words
models: an unsupervised approach where object category models are learnt
from an unlabelled set of images; and a supervised approach where object
category models are learnt from images obtained from Google image
search. In both cases object category models are obtained by fitting
with probabilistic Latent Semantic Analysis (pLSA), a model originally
developed in the statistical text literature. The talk will conclude
with some illustrations of how spatial organization can be added to
these models.
Monday, January 16, 2006
Prof. Baba Vemuri, University of Florida
08:30-09:40 Plenary Talk 3
Information Theoretic Measures and Their Applications to Computer Vision and Medical Imaging
Information theory has played a fundamental role in many fields of
science and engineering including Computer Vision and Medical
Imaging. In this talk, I will introduce various information theoretic
measures that are used in achieving the goal of solving several
important problems in Computer Vision and Medical Imaging, namely, image
registration, point set registration, tensor field segmentation,
image/shape retrieval etc. Recently introduced information theoretic
measures such as, entropy defined on distributions, averages of Gaussian
distributions (computed as the minimizer of the sum of squared
J-divergences) and the well known Jensen-Shannon divergence etc. will be
presented. I will show how each of these measures are used in solving
the aforementioned application problems. The talk will be interspersed
with several examples from each of the aforementioned applications.
|