IEEE Circuits and Systems Society, IEEE Signal Processing Society,
and The Signal & Multimedia Processing Lab. - University of Sydney

The First IEEE Pacific-Rim Conference on Multimedia 

(2000 International Symposium on Multimedia Information Processing)

December 13-15, 2000, 
University of Sydney, Australia



Main Page

Call For Papers

Committee

Advance Program

Keynote Speeches

Special Session

Registration

Travel & Accommodation

Paper Submission

Sponsors

Advance Program: Tutorials

 

Three tutorial sessions are currently offered:

Tutorial A:

Title

Streaming Video Profile in MPEG-4

Speaker Weiping Li
Abstract Streaming Video Profile is the subject of Amendment 4 of MPEG-4. It is developed in response to the growing need on a video coding standard for streaming video over the Internet. It provides the capability to distribute single-layered frame-based video over a wide range of bit-rates with high coding efficiency. It also provides fine granularity scalability and its combination with temporal scalability to address a variety of challenging problems in delivering video over the Internet. This tutorial gives an overview of the video coding techniques used in the Streaming Video Profile.
Vitae Weiping Li received his B.S. degree from University of Science and Technology of China (USTC) in 1982, and his M.S. and Ph.D. degrees from Stanford University in 1983 and 1988 respectively, all in electrical engineering. In 1987, he joined the Faculty at Lehigh University, where he is currently a Professor in the Department of Electrical Engineering and Computer Science. Since 1998, he has taken a leave from Lehigh University to work on network streaming video in Silicon Valley, California. He was with Optivision, Inc. in Palo Alto, California, from 1998 to 1999, as the Director of R&D. He is currently with WebCast Technologies, Inc., a start-up company that he co-founded.

Weiping Li has been elected to Fellow of IEEE for contributions to image and video coding algorithms, standards, and implementations. He is currently the Editor-in-Chief for IEEE Transactions on Circuits and Systems for Video Technology. He served as an Associate Editor of the same journal from 1995 to 1999. He is currently an Editor for Amendment-4 (Streaming Video Profile) of MPEG-4 International Standard. He served as one of the Guest Editors for a special issue of IEEE Proceedings on image and video compression (February 1995). He served as the Past-Chair (1998-1999) and Chair (1996-1998) of Technical Committee on Visual Signal Processing and Communications of IEEE Circuits and Systems Society. He is the Program Chair of the MPEG-4 Workshop and Exhibition (2000). He was a Co-Chair (1999) and Chair (1997) of Technical Track on Multimedia and Communications in IEEE International Symposium on Circuits and Systems. He served as the Chair of Best Student Paper Award Committee for 1999 SPIE Visual Communications and Image Processing Conference. From 1997 to 1998, he served as the Chair of Working Group on reaffirmation of IEEE Standard 1180 (Specifications for the Implementation of 8X8 Inverse Discrete Cosine Transform). Since 1995, he has been a member of Moving Picture Experts Group (MPEG) of International Standard Organization (ISO).

Weiping Li received the Spira Award for Excellence in Teaching in 1992 at Lehigh University and the Guo Mo-Ruo Prize for Outstanding Student in 1980
at University of Science and Technology of China.

 

Tutorial B:

Title

From Speechreading to Building Virtual Collaborative Environments

Speaker Tsuhan Chen
Abstract A human listener can use visual cues such as lip and tongue movements to enhance the level of speech understanding. The process of combining the audio modality and the visual modality is often referred to as speechreading, or lipreading. Inspired by human speechreading, researchers have been studying various ways to enable a computer to use speechreading for automated speech recognition, with the goal of building a human-computer interface via natural spoken language that works even in noisy environments. In this tutorial, we will discuss recent development in the field of automatic speechreading and provide technical details about the enabling technologies for speechreading.

Related to speechreading, lip-synchronization technique that has wide applications in movie dubbing, human-computer interface and cartoon animation, will also be discussed in this tutorial. We will also survey recent effort in extending human-computer interfaces to human-to-human interfaces, i.e., virtual collaborative environments. In a virtual collaborative environment, people can interact with each other as if they are face-to-face in the same room. To accomplish this, head orientation, eye contact, hand gestures, and the sense of direction and distance need to be realistically reproduced in this environment. As a platform, we will introduce NetICE (Networked Intelligent Collaborative Environment) that integrates lip-synchronization with face and gesture tracking, directional sound, animated avatars, and shared whiteboards. A related technology for joint geometry/texture streaming of 3D objects will also be discussed.
Vitae Tsuhan Chen received the B.S. degree in electrical engineering from the National Taiwan University in 1987, and the M.S. and Ph.D. degrees in electrical engineering from the California Institute of Technology, Pasadena, California, in 1990 and 1993, respectively.

From January 1993 to July 1993, he was a part-time member of technical staff
at the Jet Propulsion Laboratory, Pasadena, CA. He was a postdoctoral research fellow at the California Institute of Technology from July 1993 to August 1993. From August 1993 to October 1997, he worked in the Visual Communications Research Department, AT&T Bell Laboratories, Holmdel, New Jersey, and later at AT&T Labs-Research, Red Bank, New Jersey, as a senior technical staff member and then a principle technical staff member. Since October 1997, he has been with the Electrical and Computer Engineering Department, Carnegie Mellon University, as an Associate Professor. His research interests include multimedia signal processing and communication, audio-visual interaction, video coding and multimedia standards.

Dr. Chen co-founded and chaired the Multimedia Signal Processing Technical Committee of the IEEE Signal Processing Society. He currently serves as Associate Editor for IEEE Trans. on Image Processing, IEEE Trans. on Multimedia, and IEEE Trans. on Signal Processing. He also served as Associate Editor for IEEE Trans. on Circuits and Systems for Video Technology from 1996 to 1999. He serves on the Steering Committee of IEEE Trans. on Multimedia and the Editorial Boards of IEEE Signal Processing Magazine, Marcel Dekker Signal Processing Series, and Kluwer Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology. Dr. Chen is a member of the IEEE Circuits and Systems Society's Technical Committees on Multimedia Systems and Applications and on Visual Signal Processing and Communication. He co-chaired the first IEEE Workshop on Multimedia Signal Processing in 1997, and is serving as a technical co-chair for the first IEEE Conference on Multimedia and Exposition to be held in New York in 2000. Dr. Chen has published tens of technical papers and holds seven U.S. patents. He received the Charles Wilts Prize for outstanding independent research in Electrical Engineering leading to a Ph.D. degree at the California Institute of Technology. He is a member of the Phi Tau Phi Scholastic Honor Society. Dr. Chen is a recipient of the National Science Foundation CAREER Award, titled "Multimodal and Multimedia Signal Processing," from 2000 to 2003.

 

Tutorial C: 

Title

Signal Processing for 3rd Generation CDMA Systems

Speaker Paul Alexander
Abstract Problems: Multiple Access Interference (Inter and Intra Cell), Time-Varying Multipath Propagation, Receiver Algorithm Complexity.

Conventional Solutions: RAKE, Antenna Diversity

Current Proposals: Space-Time Coding, Iterative Processing at Base-Station, Chip Level Equalization at Mobile-Station.


Abstract:

The service requirements placed on the 3G air-interface represent a significant advance relative to those applicable in 2G. Although the quality required may not have changed much the service scope is much broader. We will begin by outlining the difficulties in meeting these service objectives and show that techniques applied in the 2G context
may not suffice in 3G. We will then describe signal processing techniques that are designed to meet the 3G criteria.

 

Vitae Paul D. Alexander received his B.E. and M.Eng.Sc. degrees from the University of Adelaide, South Australia in 1991 and 1995 respectively. He graduated with his Ph.D. in March 1997 from the University of South Australia. His Ph.D. dissertation was titled
"Coded Multiuser CDMA". After thesis submission, he was the Post Doctoral Fellow in the Mobile Communications Research Centre at the University of South Australia were he worked on mobile broadband systems incorporating CDMA. He continued work in this area at the Centre for Wireless Communications at the National University of Singapore until 1999. He is currently with Southern Poro Communications of Sydney Australia and is working on UMTS compliant systems.

Publication efforts include 10 journal publications, 2 book chapters and 29 conference papers (4 invited). He has been co-inventor on two Australian Provisional Patents for work on CDMA receiver structures and has Singapore and US patents for his work on channel estimation in CDMA with antenna arrays. He has won a DEETYA travel grant and a ARC SPRIT grant with colleagues at the University of South Australia where
he currently holds an Adjunct Lecturer position.