Main
Page
Call For
Papers Committee
Advance
Program
Keynote Speeches Special Session
Registration
Travel & Accommodation
Paper Submission
Sponsors
|
|

Advance
Program: Tutorials
Three tutorial sessions are currently offered:
Tutorial A:
|
| Title |
Streaming Video Profile in MPEG-4
|
| Speaker |
Weiping Li |
| Abstract |
Streaming Video Profile is the subject of Amendment 4 of MPEG-4. It is
developed in response to the growing need on a video coding standard for
streaming video over the Internet. It provides the capability to distribute
single-layered frame-based video over a wide range of bit-rates with high
coding efficiency. It also provides fine granularity scalability and its
combination with temporal scalability to address a variety of challenging
problems in delivering video over the Internet. This tutorial gives an
overview of the video coding techniques used in the Streaming Video Profile. |
| Vitae |
Weiping Li received his B.S. degree from University of Science and
Technology of China (USTC) in 1982, and his M.S. and Ph.D. degrees from
Stanford University in 1983 and 1988 respectively, all in electrical
engineering. In 1987, he joined the Faculty at Lehigh University, where he
is currently a Professor in the Department of Electrical Engineering and
Computer Science. Since 1998, he has taken a leave from Lehigh University to
work on network streaming video in Silicon Valley, California. He was with
Optivision, Inc. in Palo Alto, California, from 1998 to 1999, as the
Director of R&D. He is currently with WebCast Technologies, Inc., a start-up
company that he co-founded.
Weiping Li has been elected to Fellow of IEEE for contributions to image and
video coding algorithms, standards, and implementations. He is currently the
Editor-in-Chief for IEEE Transactions on Circuits and Systems for Video
Technology. He served as an Associate Editor of the same journal from 1995
to 1999. He is currently an Editor for Amendment-4 (Streaming Video Profile)
of MPEG-4 International Standard. He served as one of the Guest Editors for
a special issue of IEEE Proceedings on image and video compression (February
1995). He served as the Past-Chair (1998-1999) and Chair (1996-1998) of
Technical Committee on Visual Signal Processing and Communications of IEEE
Circuits and Systems Society. He is the Program Chair of the MPEG-4 Workshop
and Exhibition (2000). He was a Co-Chair (1999) and Chair (1997) of Technical Track on Multimedia and Communications in IEEE International
Symposium on Circuits and Systems. He served as the Chair of Best Student
Paper Award Committee for 1999 SPIE Visual Communications and Image Processing Conference. From 1997 to 1998, he served as the Chair of Working
Group on reaffirmation of IEEE Standard 1180 (Specifications for the
Implementation of 8X8 Inverse Discrete Cosine Transform). Since 1995, he has
been a member of Moving Picture Experts Group (MPEG) of International
Standard Organization (ISO).
Weiping Li received the Spira Award for Excellence in Teaching in 1992 at
Lehigh University and the Guo Mo-Ruo Prize for Outstanding Student in 1980
at University of Science and Technology of China.
|
Tutorial B:
|
| Title |
From Speechreading to Building Virtual Collaborative
Environments
|
| Speaker |
Tsuhan Chen |
| Abstract |
A human listener can use visual cues such as lip and tongue movements to
enhance the level of speech understanding. The process of combining the
audio modality and the visual modality is often referred to as
speechreading, or lipreading. Inspired by human speechreading, researchers
have been studying various ways to enable a computer to use speechreading
for automated speech recognition, with the goal of building a human-computer
interface via natural spoken language that works even in noisy environments.
In this tutorial, we will discuss recent development in the field of
automatic speechreading and provide technical details about the enabling
technologies for speechreading.
Related to speechreading, lip-synchronization technique that has wide
applications in movie dubbing, human-computer interface and cartoon animation, will also be
discussed in this tutorial. We will also survey recent effort in extending human-computer interfaces to human-to-human
interfaces, i.e., virtual collaborative environments. In a virtual collaborative environment, people can interact with each other as if they
are face-to-face in the same room. To accomplish this, head orientation,
eye contact, hand gestures, and the sense of direction and distance need to
be realistically reproduced in this environment. As a platform, we will
introduce NetICE (Networked Intelligent Collaborative Environment) that
integrates lip-synchronization with face and gesture tracking, directional
sound, animated avatars, and shared whiteboards. A related technology for
joint geometry/texture streaming of 3D objects will also be discussed. |
| Vitae |
Tsuhan Chen received the B.S. degree in electrical engineering from the
National Taiwan University in 1987, and the M.S. and Ph.D. degrees in
electrical engineering from the California Institute of Technology, Pasadena, California, in 1990 and 1993, respectively.
From January 1993 to July 1993, he was a part-time member of technical staff
at the Jet Propulsion Laboratory, Pasadena, CA. He was a postdoctoral
research fellow at the California Institute of Technology from July 1993 to
August 1993. From August 1993 to October 1997, he worked in the Visual
Communications Research Department, AT&T Bell Laboratories, Holmdel, New
Jersey, and later at AT&T Labs-Research, Red Bank, New Jersey, as a senior
technical staff member and then a principle technical staff member. Since
October 1997, he has been with the Electrical and Computer Engineering
Department, Carnegie Mellon University, as an Associate Professor. His
research interests include multimedia signal processing and communication,
audio-visual interaction, video coding and multimedia standards.
Dr. Chen co-founded and chaired the Multimedia Signal Processing Technical
Committee of the IEEE Signal Processing Society. He currently serves as
Associate Editor for IEEE Trans. on Image Processing, IEEE Trans. on
Multimedia, and IEEE Trans. on Signal Processing. He also served as
Associate Editor for IEEE Trans. on Circuits and Systems for Video Technology from 1996 to 1999. He serves on the Steering Committee of IEEE
Trans. on Multimedia and the Editorial Boards of IEEE Signal Processing
Magazine, Marcel Dekker Signal Processing Series, and Kluwer Journal of VLSI
Signal Processing Systems for Signal, Image, and Video Technology. Dr. Chen
is a member of the IEEE Circuits and Systems Society's Technical Committees
on Multimedia Systems and Applications and on Visual Signal Processing and
Communication. He co-chaired the first IEEE Workshop on Multimedia Signal
Processing in 1997, and is serving as a technical co-chair for the first
IEEE Conference on Multimedia and Exposition to be held in New York in 2000.
Dr. Chen has published tens of technical papers and holds seven U.S.
patents. He received the Charles Wilts Prize for outstanding independent
research in Electrical Engineering leading to a Ph.D. degree at the California Institute of Technology. He is a member of the Phi Tau Phi
Scholastic Honor Society. Dr. Chen is a recipient of the National Science
Foundation CAREER Award, titled "Multimodal and Multimedia Signal Processing," from 2000 to 2003.
|
Tutorial C:
|
| Title |
Signal Processing for 3rd Generation CDMA Systems
|
| Speaker |
Paul Alexander |
| Abstract |
Problems: Multiple Access Interference (Inter and Intra Cell),
Time-Varying Multipath Propagation, Receiver Algorithm Complexity.
Conventional Solutions: RAKE, Antenna Diversity
Current Proposals: Space-Time Coding, Iterative Processing at Base-Station,
Chip Level Equalization at Mobile-Station.
Abstract:
The service requirements placed on the 3G air-interface represent a significant advance relative to those applicable in 2G. Although the
quality required may not have changed much the service scope is much
broader. We will begin by outlining the difficulties in meeting these
service objectives and show that techniques applied in the 2G context
may not suffice in 3G. We will then describe signal processing techniques that are designed to meet the 3G criteria.
|
| Vitae |
Paul D. Alexander received his B.E. and M.Eng.Sc. degrees from the
University of Adelaide, South Australia in 1991 and 1995 respectively. He graduated with his Ph.D. in March 1997 from the
University of South Australia. His Ph.D. dissertation was titled
"Coded Multiuser CDMA". After thesis submission, he was the Post Doctoral Fellow in the Mobile Communications Research Centre at the
University of South Australia were he worked on mobile broadband systems incorporating CDMA. He continued work in this area at the
Centre for Wireless Communications at the National University of Singapore until 1999. He is currently with Southern Poro
Communications of Sydney Australia and is working on UMTS compliant systems.
Publication efforts include 10 journal publications, 2 book chapters
and 29 conference papers (4 invited). He has been co-inventor on two
Australian Provisional Patents for work on CDMA receiver structures and has Singapore and US patents for his work on channel estimation in
CDMA with antenna arrays. He has won a DEETYA travel grant and a ARC
SPRIT grant with colleagues at the University of South Australia where
he currently holds an Adjunct Lecturer position.
|
|