Technical Program Contents

ThA1LP -- Opening Ceremony and Plenary Lecture

Chairs: H. Timothy Bunnell, Alfred I. duPont Institute; and Richard A. Foulds, Alfred I. duPont Institute

ThA2L1 -- Large Vocabulary

Chair: Michael D. Riley, AT&T Labs - Research

ThA2L2 -- Multimodal ASR (Face and Lips)

Chair: Eric Petajan, Bell Labs - Lucent Technologies

ThA2L3 -- Perception of Words

Chair: Sharon Manuel, Emerson College and Massachusetts Institute of Technology

ThA2P1 -- Phonetics, Transcription, and Analysis

Chair: Jim Hieronymus, Bell Labs - Lucent Technologies

ThA2P2 -- Spoken Language Processing for Special Populations

Chair: Valerie Hazan, University College London

ThA2S1 -- Dialogue Special Session I

Chairs: James R. Glass, MIT Laboratory for Computer Science; and Yasunaga Niimi, Kyoto Institute of Technology

ThP1L1 -- Language Modeling I

Chair: Roberto Pieraccini, AT&T Labs - Research

ThP1L2 -- Feature Extraction for Speech Recognition I

Chair: Shubha Kadambe, Atlantic Aerospace Electronics Corp.

ThP1L3 -- Speech Production - Measurement and Modeling

Chair: Terrance M. Neary, University of Alberta

ThP1P1 -- Speech Coding / HMMs and NNs in ASR

Chair: Jean-Luc Gauvain, LIMSI-CNRS

ThP1S1 -- Dialogue Special Session II

Chairs: Patti Price, SRI International; and Akira Kurematsu, University of Electro-Communications

Combining the Detection and Correction of Speech Repairs Peter A. Heeman, Kyung-ho Loken-Kim, James F. Allen
Generating Spontaneous Elliptical Utterance Yuji Sagawa, Wataru Sugimoto, Noboru Ohnishi
Developing the Modelling of Swedish Prosody in Spontaneous Dialogue Gösta Bruce, Marcus Filipsson, Johan Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati
Spoken Language Generation in a Multimedia System Shimei Pan, Kathleen R. McKeown
Synthesizing Dialogue Speech of Japanese Based on the Quantitative Analysis of Prosodic Features Keikichi Hirose, Mayumi Sakata, Hiromichi Kawanami
Spoken Dialogue Interface in a Dual Task Situation Shuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko Shirai

ThP1S2 -- Neural Models of Speech Processing I

Chair: Eric D. Young, Johns Hopkins University

ThP2L1 -- Language Modeling II

Chair: Jerome R. Bellegarda, Apple Computer, Inc.

ThP2L2 -- Feature Extraction for Speech Recognition II

Chair: Shubha Kadambe, Atlantic Aerospace Electronics Corp.

ThP2L3 -- Vowels

Chair: John Ohala, University of California, Berkeley

ThP2P1 -- NNs and Stochastic Modeling

Chair: Wu Chou, Bell Labs - Lucent Technologies

ThP2S1 -- Dialogue Special Session III

Chairs: Paul Dalsgaard, Aalborg University; and Hiroya Fujisaki, Science University of Tokyo

A Dialogue Control Strategy Based on the Reliability of Speech Recognition Yasuhisa Niimi, Yutaka Kobayashi
SpeechWear: A Mobile Speech System Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer
WHEELS: A Conversational System in the Automobile Classifieds Domain Helen Meng, Senis Busayapongchai, James Glass, David Goddeau, Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue
Effective Human-computer Cooperative Spoken Dialogue: The AGS Demonstrator M.D. Sadek, A. Ferrieux, A. Cozannet, P. Bretier, F. Panaget, J. Simonin
Dialog in the RAILTEL Telephone-based System S.K. Bennacef, L. Devillers, S. Rosset, Lori Lamel
Dialogue Processing in a Conversational Speech Translation System Alon Lavie, Lori Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada

ThP2S2 -- Neural Models of Speech Processing II

Chair: Eric D. Young, Johns Hopkins University

FrA1L1 -- Utterance Verification and Word Spotting

Chair: Jay Wilpon, AT&T Labs - Research

FrA1L2 -- Acquisition/Learning Training L2 Learners

Chair: Grace H. Yeni-Komshian, University of Maryland

FrA1L3 -- Focus, Stress and Accent

Chair: Elizabeth Shriberg, SRI International

FrA1P1 -- Spoken Language Dialogue and Conversation

Chair: Alicia Abella, AT&T Labs - Research

FrA1P2 -- Speech Disorders

Chair: Don Jamieson, University of Western Ontario

FrA1S1 -- Vocal Tract Geometry I

Chair: Maureen Stone, University of Maryland at Baltimore

Human Palate and Related Structures: Their Articulatory Consequences Kiyoshi Honda, Shinji Maeda, Michiko Hashi, Jim Dembowski, John R. Westbury
A Continuum Mechanics Representation of Tongue Deformation Edward P. Davis, Andrew Douglas, Maureen Stone
From MRI and Acoustic Data to Articulatory Synthesis: A Case Study of the Lateral Approximants in American English Philbert Bangayan, Abeer Alwan, Shrikanth Narayanan
Liquids in Tamil Shrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan

FrA2L1 -- Prosody in ASR and Segmentation

Chair: Keikichi Hirose, University of Tokyo

FrA2L2 -- Acquisition and Learning by Machine

Chair: Allen L. Gorin, AT&T Labs - Research

FrA2L3 -- Dialogue Systems

Chair: Esther Levin, AT&T Labs - Research

FrA2P1 -- Speech Enhancement and Robust Processing

Chair: Richard Stern, Carnegie Mellon University

FrA2S1 -- Vocal Tract Geometry II

Chair: Maureen Stone, University of Maryland at Baltimore

FrP1L1 -- Speaker Adaptation and Normalization I

Chair: Chin-Hui Lee, Bell Labs - Lucent Technologies

FrP1L2 -- Spoken Language and NLP I

Chair: Adam L. Buchsbaum, AT&T Labs - Research

FrP1L3 -- Spoken Discourse Analysis/Synthesis

Chair: Jan P. van Santen, Bell Labs - Lucent Technologies

FrP1P1 -- Acoustic Modeling I

Chair: Ilija Zeljkovic, AT&T Labs - Research

FrP1S1 -- Physics and Simulation of the Vocal Tract I

Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology

FrP2L1 -- Speaker Adaptation and Normalization II

Chair: Aaron E. Rosenberg, AT&T Labs - Research

FrP2L2 -- Spoken Language and NLP II

Chair: David Roe, AT&T Labs - Research

FrP2L3 -- Duration and Rhythm

Chair: Dik J. Hermes, Institute for Perception Research / IPO

FrP2P1 -- Acoustic Analysis

Chair: Peggy Nelson, University of Maryland at Baltimore

FrP2S1 -- Physics and Simulation of the Vocal Tract II

Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology

SaA1L1 -- Speech Recognition Using HMMs and NNs

Chair: Nelson Morgan, ICSI and University of California, Berkeley

SaA1L2 -- Adverse Environments and Multiple Microphones

Chair: Tony Robinson, Cambridge University

SaA1L3 -- Prosodic Synthesis in Dialogue

Chair: Mark Steedman, University of Pennsylvania

SaA1P1 -- Speech Synthesis

Chair: Thierry Dutoit, Facilte Psytechnique De Mons - TCTS Laboratory

SaA1P2 -- Instructional Technology for Spoken Language

Chair: Reiko A. Yamada, ATR Human Information Processing Research Laboratories

SaA1S1 -- Multimodal Spoken Language Processing I

Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble

SaA2L1 -- Prosody - Phonological/Phonetic Measures

Chair: Paul Taylor, University of Edinburgh

SaA2L2 -- Phonetics and Perception

Chair: Christine Shadle, University of Southhampton

SaA2L3 -- Language Acquisition

Chair: Harriet S. Magen, Rhode Island College

The Acoustic Structure of Vowels in Mothers' Speech to Infants and Adults Jean E. Andruski, Patricia K. Kuhl
Acoustical Characteristics of Sound Production of Deaf and Normally Hearing Infants Chris J. Clement, Florien J. Koopmans-van Beinum, Louis C. W. Pols
Learning Non-native Vowel Categories John Kingston, Christine Bartels, José Benkí, Deanna Moore, Jeremy Rice, Rachel Thorburn, Neil Macmillan
Word Recognition by Japanese Infants P.A. Halle, Toshisada Deguchi, Yuji Tamekawa, B. Boysson-Bardies, Shigeru Kiritani
Investigations of the Word Segmentation Abilities of Infants Peter W. Jusczyk
Developmental Change in Perception of Clause Boundaries by 6- and 10-Month-old Japanese Infants Akiko Hayashi, Yuji Tamekawa, Toshisada Deguchi, Shigeru Kiritani

SaA2P1 -- Production and Prosody Posters

Chair: Carol Espy-Wilson, Boston University

SaA2S1 -- Multimodal Spoken Language Processing II

Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble

SaA2 -- Emotion in Recognition and Synthesis (Poster Preview)

Chair: Klaus R. Scherer, University of Geneva

SaP1L1 -- User-Machine Interfaces

Chair: Candy Kamm, AT&T Labs - Research

Evaluating Automatic Speech Recognition as a Component of a Multi-input Device Human-computer Interface B.A. Mellor, C. Baber, C. Tunley
Data Collection for the MASK Kiosk: WOz vs Prototype System A. Life, I. Salter, J.N. Temem, F. Bernard, S. Rosset, S.K. Bennacef, Lori Lamel
An Experimental Japanese/English Interpreting Video Phone System M. Karaorman, T.H. Applebaum, T. Itoh, M. Endo, Y. Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S. Pearson, J.-C. Janqua
User Participation and Compliance in Speech Automated Telecommunications Applications Sara Basson, Stephen Springer, Cynthia Fong, Hong Leung, Ed Man, Michele Olson, John Pitrelli, Ranvir Singh, Suk Wong
Embedding Speech in Web Interfaces Samuel Bayer
Voice-activated Home Banking System and its Field Trial Toshihiro Isobe, Masatoshi Morishima, Fuminori Yoshitani, Nobuo Koizumi, Ken'ya Murakami

SaP1L2 -- TTS Systems and Rules

Chair: Juergen Schroeter, AT&T Labs - Research

SaP1L3 -- Prosody and Labeling

Chair: Louis Boves, Nymegen University

Consistency in Transcription and Labelling of German Intonation with GToBI Martine Grice, Matthias Reyelt, Ralf Benzmüller, Jörg Mayer, Anton Batliner
Syntactic-prosodic Labeling of Large Spontaneous Speech Data-bases Anton Batliner, R. Kompe, A. Kiessling, H. Niemann, E. Nöth
Relationship Between Discourse Structure and Dynamic Speech Rate Florien J. Koopmans-van Beinum, Monique E. van Donzel
Using Prosodic Clues to Decide When to Produce Back-channel Utterances Nigel Ward
Dialog Act Classification with the Help of Prosody Marion Mast, Ralf Kompe, Stefan Harbeck, Andreas Kiessling, Heinrich Niemann, Elmar Nöth, E. G. Schukat-Talamazzini, V. Warnke
Using Lexical Stress in Continuous Speech Recognition for Dutch David van Kuijk, Henk van den Heuvel, Louis Boves

SaP1P1 -- Speaker/Language Identification and Verification

Chair: Sadaoki Furui, NTT Human Interface Lab

SaP1S1 -- Large Vocabulary Speech Recognition: The Switchboard Domain I

Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Facult'e Polytechnique De Mons

SaP1S2 -- Emotion in Recognition and Synthesis I

Chair: Klaus R. Scherer, University of Geneva

SaP2L1 -- Stochastic Techniques in Robust Speech Recognition

Chair: Richard Rose, AT&T Labs - Research

SaP2L2 -- Prosodic Synthesis in Text to Speech

Chair: Bernd Moebius, Bell Labs - Lucent Technologies

SaP2L3 -- Dialogue Events

Chair: David G. Novick, European Institute of Cognitive Sciences and Engineering

SaP2P1 -- Databases and Tools

Chair: Bruce M. Buntschuh, AT&T Labs - Research

SaP2S1 -- Large Vocabulary Speech Recognition: The Switchboard Domain II

Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Facult'e Polytechnique De Mons

SuA1L1 -- Robust Speech Processing

Chair: Mazin Rahim, AT&T Labs - Research

SuA1L2 -- Dialects and Speaking Styles

Chair: Jim Hieronymus, Bell Labs - Lucent Technologies

SuA1L3 -- Production and Perception of Prosody

Chair: Gunnar Fant, KTH

SuA1P1 -- Topics in ASR and Search

Chair: Enrico Bocchieri, AT&T Labs - Research

SuA1P2 -- Multimodal Dialogue/HCI

Chair: Donald Hindle, AT&T Labs - Research

An Investigation into the Generation of Mouth Shapes for a Talking Head A. P. Breen, E. Bowers, W. Welsh
A Text-to-audiovisual-speech Synthesizer for French Bertrand Le Goff, Christian Benoît
Analysis of Head Movements and its Role in Spoken Dialogue Yuri Iwano, Shioya Kageyama, Emi Morikawa, Shu Nakazato, Katsuhiko Shirai
RWC Multimodal Database for Interactions by Integration of Spoken Language and Visual Information Satoru Hayamizu, Osamu Hasegawa, Katunobu Itou, Katuhiko Sakaue, Kazuyo Tanaka, Shigeki Nagaya, Masayuki Nakazawa, T. Endoh, Fumio Togawa, Kenji Sakamoto, Kazuhiko Yamamoto
About the Relationship Between Eyebrow Movements and Fo Variations Christian Cavé, Isabelle Guaïtella, Roxane Bertrand, Serge Santi, Françoise Harlay, Robert Espesser
How Many Words is a Picture Really Worth? Laurel Fais, Kyung-ho Loken-Kim, Tsuyoshi Morimoto
Visual Synthesis of Source Acoustic Speech Through Kohonen Neural Networks A. Lagana`, F. Lavagetto, A. Storace
Audio-visual Speech Perception Without Speech Cues Helena M. Saldaña, David B. Pisoni, Jennifer M. Fellowes, Robert E. Remez

SuA1S1 -- Multilingual Speech Processing I

Chair: Alex Waibel, Carnegie Mellon University

Multilingual Speech Recognition at Dragon Systems Jim Barnett, A. Corrada, G. Gao, L. Gillick, Y. Ito, S. Lowe, L. Manganaro, B. Peskin
Multi-lingual Phoneme Recognition Exploiting Acoustic-phonetic Similarities of Sounds Joachim Köhler
Japanese Speech Databases for Robust Speech Recognition Atsushi Nakamura, Shoichi Matsunaga, Tohru Shimizu, Masahiro Tonomura, Yoshinori Sagisaka
Spoken Language Processing in a Multilingual Context Lori F. Lamel, M. Adda-Decker, Jean Luc Gauvain, G. Adda
Multilingual Human-computer Interactions: From Information Access to Language Learning Victor Zue, Stephanie Seneff, Joseph Polifroni, Helen Meng, James Glass
SpeeData: Multilingual Spoken Data Entry U. Ackermann, B. Angelini, F. Brugnara, M. Federico, D. Giuliani, R. Gretter, G. Lazzari, H. Niemann

SuA2L1 -- Acoustics in Synthesis

Chair: Michael Macon, Georgia Institute of Technology

SuA2L2 -- Pitch and Rate

Chair: David Talkin, Entropic Research Laboratory

SuA2L3 -- Acoustic Modeling II

Chair: Li Deng, University of Waterloo

SuA2P1 -- General ASR Posters

Chair: Lori Lamel, LIMSI-CNRS

SuA2S1 -- Multilingual Speech Processing II

Chair: Alex Waibel, Carnegie Mellon University

SuP1L1 -- Data-based Synthesis

Chair: Yoshinori Sagisaka, ATR Interpreting Telecommunications Research Laboratory

SuP1L2 -- Speaker Identification and Verification

Chair: Doug Reynolds, MIT Lincoln Laboratory

SuP1L3 -- Acoustic Phonetics

Chair: Nick Campbell, ATR Interpreting Telecommunications Research Laboratory

SuP1P1 -- Perception of Vowels and Consonants

Chair: Doug Whalen, Haskins Laboratories

SuP2LP -- Closing Ceremony and Plenary Lecture

Chairs: H. Timothy Bunnell, Alfred I. duPont Institute; and Richard A. Foulds, Alfred I. duPont Institute