Technical Program Contents
Chairs: H. Timothy Bunnell, Alfred I. duPont Institute; and Richard A. Foulds, Alfred I. duPont Institute
-
The Comparative Study of Spoken-Language Processing Anne Cutler
Chair: Michael D. Riley, AT&T Labs - Research
-
New Developments in the INRS Continuous Speech Recognition System Z. Li, M. Heon, Douglas
O'Shaughnessy
-
On Designing Pronunciation Lexicons for Large Vocabulary, Continuous Speech Recognition Lori Lamel, Gilles
Adda
-
Word Graph Rescoring Using Confidence Measures Pablo Fetter, Frédéric Dandurand, Peter
Regel-Brietzmann
-
A Bottom-up Approach for Handling Unseen Triphones in Large Vocabulary Continuous Speech Recognition
X.L. Aubert, Peter Beyerlein, Meinhard Ullrich
-
Discriminative Optimisation of Large Vocabulary Recognition Systems V. Valtchev, P.C. Woodland, S. J.
Young
-
Japanese Large-vocabulary Continuous-speech Recognition using a Business-newspaper Corpus Tatsuo
Matsuoka, Katsutoshi Ohtsuki, Takeshi Mori, Sadaoki Furui, Katsuhiko Shirai
-
Handling Compound Nouns in a Swedish Speech-understanding System David Carter, Jaan Kaja, Leonardo
Neumeyer, Manny Rayner, Fuliang Weng, Mats Wiren
-
Initial Evaluation of a Preselection Module for a Flexible Large Vocabulary Speech Recognition System in Telephone
Environment J. Macias-Guarasa, A. Gallardo, J. Ferreiros, Jose M. Pardo, L. Villarrubia
Chair: Eric Petajan, Bell Labs - Lucent Technologies
-
Asynchronous Integration of Visual Information in an Automatic Speech Recognition System Mamoun Alissali,
Paul Deleglise, Alexandrina Rogozan
-
Audiovisual Speech Recognition using Multiscale Nonlinear Image Decomposition. I.A. Matthews, J. Bangham,
S.J. Cox
-
Robust Audiovisual Integration using Semicontinuous Hidden Markov Models Qin Su, Peter L. Silsbee
-
The Effect of Visual Information on Word Initial Consonant Perception of Dysarthric Speech Richard P.
Schumeyer, Kenneth E. Barner
-
A Multiple Deformable Template Approach for Visual Speech Recognition Devi Chandramohan, Peter L.
Silsbee
-
Speaker Independent Bimodal Phonetic Recognition Experiments P. Cosi, E. Magno Caldognetto, F. Ferrero, M.
Dugatto, K. Vagges
-
Speechreading using Shape and Intensity Information Juergen Luettin, Neil A. Thacker, Steve W. Beet
-
Speaker Identification by Lipreading Juergen Luettin, Neil A. Thacker, Steve W. Beet
Chair: Sharon Manuel, Emerson College and Massachusetts Institute of Technology
-
How Word Onsets Drive Lexical Access and Segmentation: Evidence from Acoustics, Phonology and Processing
David W. Gow Jr., Janis Melvold, Sharon Manuel
-
RAW: A Real-speech Model for Human Word Recognition David van Kuijk, Peter Wittenburg, Ton
Dijkstra
-
How Facilitatory can Lexical Information Be During Word Recognition? Evidence from Moroccan Arabic Mehdi
Meftah, Sami Boudelaa
-
Effects of Frequency on the Auditory Perception of Open- Versus Closed-class Words Alette P. Haveman
-
Phonotactic and Metrical Influences on Adult Ratings of Spoken Nonsense Words Michael S. Vitevitch, Paul A.
Luce, Jan Charles-Luce, David Kemmerer
-
Lipreading Supplemented by Voice Fundamental Frequency: To What Extent Does the Addition of Voicing Increase Lexical
Uniqueness for the Lipreader? Edward T. Auer Jr., Lynne E. Bernstein
-
Strategies Used in Rhyme-Monitoring S. te Riele, S.G. Nooteboom, H. Quené
-
How do Dutch Listeners Process Words with Epenthetic Schwa? Wilma van Donselaar, Cecile Kuijpers, Anne
Cutler
Chair: Jim Hieronymus, Bell Labs - Lucent Technologies
-
Whole-word Phonetic Distances and the PGPfone Alphabet Patrick Juola, Philip Zimmermann
-
Automatic Vowel Quality Description using a Variable Mapping to an Eight Cardinal Vowel Reference Set
Shuping Ran, J. Bruce Millar, Phil Rose
-
Automatic Detection and Segmentation of Pronunciation Variants in German Speech Corpora Andreas Kipp,
Maria-Barbara Wesenick, Florian Schiel
-
ANGIE: A New Framework for Speech Analysis Based on Morpho-phonological Modelling Stephanie Seneff,
Raymond Lau, Helen Meng
-
Perceptual Contrast in the Korean and English Vowel System Normalized Byunggon Yang
-
On Phonetic Characteristics of Pause in the Korean Read Speech Yong-Ju Lee, Sook-hyang Lee
-
Cross-Language Effects of Lexical Stress in Word Recognition: The Case of Arabic English Bilinguals Sami
Boudelaa, Mehdi Meftah
-
Automatic Generation of German Pronunciation Variants Maria-Barbara Wesenick
-
Estimating the Quality of Phonetic Transcriptions and Segmentations of Speech Signals Maria-Barbara Wesenick,
Andreas Kipp
-
An Acoustic Analysis of Contemporary Vowels of the Standard Slovenian Language Bojan Petek, Rastislav
Sustarsic,Smiljana Komar
-
Using Decision Trees to Construct Optimal Acoustic Cues Sandrine Robbe, Anne Bonneau, Sylvie Coste, Yves
Laprie
-
Maximum Jaw Displacement in Contrastive Emphasis Donna Erickson, Osamu Fujimura
-
Subglottal Pressure and Final Lowering in English Rebecca Herman, Mary Beckman, Kiyoshi Honda
-
Phonological Variation: Epenthesis and Deletion of Schwa in Dutch Cecile Kuijpers, Wilma van Donselaar, Anne
Cutler
-
Can a Moraic Nasal Occur Word-initially in Japanese? Takashi Otake, Kiyoko Yoneyama
Chair: Valerie Hazan, University College London
-
Feedback Considerations for Speech Training Systems James J. Mahshie
-
Clinical Applications of Computer-Based Speech Training for Children with Hearing Impairment Anne-Marie
Öster
-
Enhancing Information-rich Regions of Natural VCV and Sentence Materials Presented in Noise Valerie Hazan,
Andrew Simpson
-
Speech Perceptual Abilities of Children with Specific Reading Difficulty (Dyslexia) Valerie Hazan, Alan
Adlard
-
Bimodal Perception of Spectrum Compressed Speech Larry D. Paarmann, Michael K. Wynne
-
Effect of Sentential Context on Syllabic Stress Perception by Hearing-impaired Listeners Dragana Barac-Cikoja,
Sally Revoile
-
Applications of Automatic Speech Recognition to Speech and Language Development in Young Children Martin
Russell, Catherine Brown, Adrian Skilling, Rob Series, Julie Wallace, Bill Bohnam, Paul Barker
-
Sub-band Adaptive Speech Enhancement for Hearing Aids D. R. Campbell
-
Adapting a TTS System to a Reading Machine for the Blind Thomas Portele, Juergen Kraemer
Chairs: James R. Glass, MIT Laboratory for Computer Science; and Yasunaga Niimi, Kyoto Institute of
Technology
-
Modeling of Spoken Dialogue with and without Visual Information Katsuhiko Shirai
-
Multimodal Discourse Modelling in a Multi-user Multi-domain Environment Stephanie Seneff, David Goddeau,
Christine Pao, Joseph Polifroni
-
Automatic Acquisition of Probabilistic Dialogue Models Kenji Kita, Yoshikazu Fukui, Masaaki Nagata, Tsuyoshi
Morimoto
-
Units of Dialogue Management: An Example Paul Heisterkamp, Scott McGlashan
-
Error Resolution During Multimodal Human-computer Interaction Sharon Oviatt, Robert VanGent
-
Improved Spontaneous Dialogue Recognition Using Dialogue and Utterance Triggers by Adaptive Probability Boosting
Ramesh R. Sarukkai, Dana H. Ballard
-
Speech Recognition for Spontaneously Spoken German Dialogues Kai Hübener, Uwe Jost, Henrik Heine
-
Using Prosodic Information to Constrain Language Models for Spoken Dialogue Paul Taylor, Hiroshi Shimodaira,
Stephen Isard, Simon King, Jacqueline Kowtko
Chair: Roberto Pieraccini, AT&T Labs - Research
-
Combination of Word-based and Category-based Language Models T.R. Niesler, P.C. Woodland
-
A Multi-level Lexical-semantics Based Language Model Design for Guided Integrated Continuous Speech Recognition
Francisco J. Valverde-Albacete, Jose M. Pardo
-
A Category Based Approach for Recognition of Out-of-Vocabulary Words Florian Gallwitz, Elmar Noeth,
Heinrich Niemann
-
Scalable Backoff Language Models Kristie Seymore, Ronald Rosenfeld
-
Modeling Long Distance Dependence in Language: Topic Mixtures vs. Dynamic Cache Models R. Iyer, Mari
Ostendorf
-
Bayesian Estimation Methods for N-Gram Language Model Adaptation Marcello Federico
Chair: Shubha Kadambe, Atlantic Aerospace Electronics Corp.
-
Feature Dimension Reduction Using Reduced-Rank Maximum Likelihood Estimation for Hidden Markov Models
Don X. Sun
-
Using Multi-Level Segmentation Coefficients to Improve HMM Speech Recognition Kai Hübener
-
A Comparative Study of Linear Feature Transformation Techniques for Automatic Speech Recognition T. Eisele,
R. Haeb-Umbach, D. Langmann
-
Inclusion of Temporal Information into Features for Speech Recognition Ben Milner
-
New Cepstral Representation using Wavelet Analysis and Spectral Transformation for Robust Speech Recognition
Hubert Wassner, Gérard Chollet
-
Wavelet Based Feature Extraction for Phoneme Recognition C.J. Long, S. Datta
Chair: Terrance M. Neary, University of Alberta
-
Extraction of Tongue Contours in X-ray Images with Minimal User Interaction Yves Laprie, Marie-Odile
Berger
-
Three-dimensional Measurement of the Vocal Tract by MRI Didier Demolin, Thierry Metens, Alain Soquet
-
Syllable Affiliation of Final Consonant Clusters Undergoes a Phase Transition Over Speaking Rates Philip
Gleason, Betty Tuller, J. A. Scott Kelso
-
Towards a Biomechanical Model of the Larynx Arthur Lobo, Michael O'Malley
-
Effects of Auditory Feedback on F0 Trajectory Generation Hideki Kawahara, Hiroko Kato, J. C. Williams
Chair: Jean-Luc Gauvain, LIMSI-CNRS
-
On the Effects of Accent and Language on Low Rate Speech Coders I. S. Burnett, J. J. Parry
-
VQ Codevector Index Assignment Using Genetic Algorithms for Noisy Channels J.S. Pan, Fergus R. McInnes,
Mervyn A. Jack
-
An Improved Vector Quantization Algorithm for Speech Transmission Over Noisy Channels Gavin C.
Cawley
-
Very Low Delay and High Quality Coding of 20 Hz-15 kHz Speech Signals at 64 kbit/s C. Murgia, G. Feng, A.
Le Guyader, C. Quinquis
-
Application of Speaker Modification Techniques to Phonetic Vocoding Carlos M. Ribeiro, Isabel M.
Trancoso
-
Entropy Coded Vector Quantization with Hidden Markov Models Tadashi Yonezaki, Kiyohiro Shikano
-
An Application of Recurrent Neural Networks to Low Bit Rate Speech Coding Minoru Kohata
-
CELP Coding System Based on Mel-Generalized Cepstral Analysis Kazuhito Koishida, Keiichi Tokuda, Takao
Kobayashi, Satoshi Imai
-
Wideband Re-synthesis of Narrowband CELP-coded Speech Using Multiband Excitation Model Cheung-Fat
Chan, Wai-Kwong Hui
-
Recurrent Neural Networks for Phoneme Recognition Takuya Koizumi, Mikio Mori, Shuji Taniguchi, Mitsutoshi
Maruya
-
A Model for the Acoustic Phonetic Structure of Arabic Language using a Single Ergodic Hidden Markov Model
M.A. Mokhtar, A. Zein-el-Abddin
-
Modelling Long Term Variability Information in Mixture Stochastic Trajectory Framework Yifan Gong, Irina Illina,
Jean-Paul Haton
-
Segmental Phonetic Features Recognition by means of Neural-fuzzy Networks and Integration in an N-best Solutions
Post-processing T. Moudenc, R. Sokol, G. Mercier
-
Stochastic Trajectory Model with State-Mixture for Continuous Speech Recognition Irina Illina, Yifan Gong
-
Recognition of Spelled Names over the Telephone Hermann Hild, Alex Waibel
-
Optimal Tying of HMM Mixture Densities using Decision Trees Gilles Boulianne, Patrick Kenny
-
Speech Recognition Using an Enhanced FVQ Based on a Codeword Dependent Distribution Normalization and Codeword
Weighting by Fuzzy Objective Function Hwan Jin Choi, Yung Hwan Oh
-
Using the Self-Organizing Map to Speed up the Probability Density Estimation for Speech Recognition with Mixture Density
HMMs Mikko Kurimo, Panu Somervuo
Chairs: Patti Price, SRI International; and Akira Kurematsu, University of Electro-Communications
-
Combining the Detection and Correction of Speech Repairs Peter A. Heeman, Kyung-ho Loken-Kim, James F.
Allen
-
Generating Spontaneous Elliptical Utterance Yuji Sagawa, Wataru Sugimoto, Noboru Ohnishi
-
Developing the Modelling of Swedish Prosody in Spontaneous Dialogue Gösta Bruce, Marcus Filipsson, Johan
Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati
-
Spoken Language Generation in a Multimedia System Shimei Pan, Kathleen R. McKeown
-
Synthesizing Dialogue Speech of Japanese Based on the Quantitative Analysis of Prosodic Features Keikichi
Hirose, Mayumi Sakata, Hiromichi Kawanami
-
Spoken Dialogue Interface in a Dual Task Situation Shuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko
Shirai
Chair: Eric D. Young, Johns Hopkins University
-
How is Information About Speech Encoded in the Peripheral Auditory System? Eric D. Young
-
Spectral Shape Analysis in the Central Auditory System Shihab Shamma
Chair: Jerome R. Bellegarda, Apple Computer, Inc.
-
Modeling Disfluencies in Conversational Speech Man-hung Siu, Mari Ostendorf
-
Evaluation of a Language Model using a Clustered Model Backoff John Miller, Fil Alleva
-
Language Modeling Using X-grams Antonio Bonafonte, José B. Mariño
-
Class Phrase Models For Language Modelling Klaus Ries, Finn Dag Buo, Alex Waibel
-
Introducing Linguistic Constraints into Statistical Language Modeling Petra Geutner
-
Language Modeling with Stochastic Automata Jianying Hu, William Turin, Michael K. Brown
Chair: Shubha Kadambe, Atlantic Aerospace Electronics Corp.
-
New Fast Wavelet Packet Transform Algorithms for Frame Synchronized Speech Processing Andrzej
Drygajlo
-
Frequency-Warping in Speech S. Umesh, L. Cohen, N. Marinovic, D. Nelson
-
Extracting Speech Features from Human Speech-like Noise Daisuke Kobayashi, Shoji Kajita, Kazuya Takeda,
Fumitada Itakura
-
Subband-Crosscorrelation Analysis for Robust Speech Recognition Shoji Kajita, Kazuya Takeda, Fumitada
Itakura
-
A New ASR Approach Based on Independent Processing and Recombination of Partial Frequency Bands Hervé
Bourlard, Stéphane Dupont
-
Frequency and Time Filtering of Filter-bank Energies for HMM Speech Recognition Climent Nadeu, José B.
Mariño, Javier Hernando, Albino Nogueiras
Chair: John Ohala, University of California, Berkeley
-
Temporal Cues for Vowels and Universals of Vowel Inventories Carrie E. Lang, John J. Ohala
-
Acoustic Variability in Spontaneous Conversational Speech of American English Talkers Ann K. Syrdal
-
Cross-language Speech Perception: Swedish, English, and Spanish Speakers' Perception of Front Rounded Vowels
Raquel Willerman, Patricia K. Kuhl
-
Inter-language Vowel Perception and Production by Korean and Japanese Listeners John C.L. Ingram,
See-Gyoon Park
-
Intelligibility and Acoustic Correlates of Japanese Accented English Vowels Diane Kewley-Port, Reiko
Akahane-Yamada, Kiyoaki Aikawa
-
Segmentation Strategies for Spoken Language Recognition: Evidence from Semi-bilingual Japanese Speakers of English
Kiyoko Yoneyama
Chair: Wu Chou, Bell Labs - Lucent Technologies
-
Integrating Connectionist, Statistical and Symbolic Approaches for Continuous Spoken Korean Processing
Geunbae Lee, Jong-Hyeok Lee, Kyubong Park, Byung-Chang Kim
-
Towards ASR on Partially Corrupted Speech Hynek Hermansky, Sangita Timberwala, Misha Pavel
-
Parametric Trajectory Models for Speech Recognition Herbert Gish, Kenney Ng
-
Use of Gaussian Selection in Large Vocabulary Continuous Speech Recognition Using HMMs K.M. Knill, M.J.F.
Gales, S. J. Young
-
Cross Phone State Clustering using Lexical Stress and Context J. Hogberg, K. Sjolander
-
Likelihood Ratio Decoding and Confidence Measures for Continuous Speech Recognition Eduardo Lleida,
Richard C. Rose
-
A Study on Continuous Chinese Speech Recognition Based on Stochastic Trajectory Models Xiaohui Ma, Yifan
Gong, Yuqing Fu, Jiren Lu, Jean-Paul Haton
-
A Proposal for a New Algorithm of Reference Interval-free Continuous DP for Real-time Speech or Text Retrieval
Yoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka
-
Language Modeling by String Pattern N-gram for Japanese Speech Recognition Akinori Ito, Masaki
Kohda
-
Statistical Language Modeling using a Variable Context Length Reinhard Kneser
-
A Comparison of Hybrid HMM Architectures Using Global Discriminative Training Finn Tore Johansen
-
Improved Probability Estimation with Neural Network Models Wei Wei, Etienne Barnard, Mark Fanty
-
A Neural Network Using Acoustic Sub-word Units for Continuous Speech Recognition Ha-Jin Yu, Yung-Hwan
Oh
-
On the Error Criteria in Neural Networks as a Tool for Human Classification Modelling Louis F. M. ten Bosch,
Roel Smits
-
A Non-linear Filtering Approach to Stochastic Training of the Articulatory-acoustic Mapping Using the EM Algorithm
Gordon Ramsay
-
A Tool for Automated Design of Language Models Y.P. Yang, J.R. Deller Jr.
-
Acoustic-phonetic Decoding Based on Elman Predictive Neural Networks F. Freitag, E. Monte
-
On Improving Discrimination Capability of an RNN Based Recognizer Tan Lee, P.C. Ching
-
An Evaluation of Statistical Language Modeling for Speech Recognition using a Mixed Category of Both Words and
Parts-of-speech Yumi Wakita, Jun Kawai, Hitoshi Iida
Chairs: Paul Dalsgaard, Aalborg University; and Hiroya Fujisaki, Science University of Tokyo
-
A Dialogue Control Strategy Based on the Reliability of Speech Recognition Yasuhisa Niimi, Yutaka
Kobayashi
-
SpeechWear: A Mobile Speech System Alexander I. Rudnicky, Stephen Reed, Eric H. Thayer
-
WHEELS: A Conversational System in the Automobile Classifieds Domain Helen Meng, Senis Busayapongchai,
James Glass, David Goddeau, Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor
Zue
-
Effective Human-computer Cooperative Spoken Dialogue: The AGS Demonstrator M.D. Sadek, A. Ferrieux, A.
Cozannet, P. Bretier, F. Panaget, J. Simonin
-
Dialog in the RAILTEL Telephone-based System S.K. Bennacef, L. Devillers, S. Rosset, Lori Lamel
-
Dialogue Processing in a Conversational Speech Translation System Alon Lavie, Lori Levin, Yan Qu, Alex
Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada
Chair: Eric D. Young, Johns Hopkins University
-
Novel Speech Processing Mechanism Derived from Auditory Neocortical Circuit Analysis Boris Aleksandrovsky,
James Whitson, Gretchen Andes, Gary Lynch, Richard Granger
-
Modeling Neurons in the Anteroventral Cochlear Nucleus for Amplitude Modulation (AM) Processing: Application to
Speech Sound Ping Tang, Jean Rouat
-
Noise Suppression and Loudness Normalization in an Auditory Model-based Acoustic Front-end Halewijn
Vereecken, Jean-Pierre Martens
-
A Psychoacoustic Model for the Noise Masking of Voiceless Plosive Bursts Jim Hant, Brian Strope, Abeer
Alwan
-
Training Machine Classifiers to Match the Performance of Human Listeners in a Natural Vowel Classification Task
Martin Hunke, Thomas Holton
-
A Neural Matrix Model for Active Tracking of Frequency-modulated Tones Kiyoaki Aikawa, Hideki
Kawahara, Minoru Tsuzaki
Chair: Jay Wilpon, AT&T Labs - Research
-
A User-Configurable System for Voice Label Recognition Richard C. Rose, Eduardo Lleida, G.W. Erhart, R.V.
Grubbe
-
Keyword Spotting Enhancement for Video Soundtrack Indexing Philippe Gelin, Chris. J. Wellekens
-
New Efficient Fillers for Unlimited Word Recognition and Keyword Spotting Rachida El Méliani, Douglas
O'Shaughnessy
-
Automatic Transcription of General Audio Data: Preliminary Analyses Michelle S. Spina, Victor Zue
-
Transcribing Radio News Francis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard
Schwartz
-
Correcting Recognition Errors via Discriminative Utterance Verification Anand R. Setlur, Rafid A. Sukkar, John
Jacob
Chair: Grace H. Yeni-Komshian, University of Maryland
-
Does Training in Speech Perception Modify Speech Production? Reiko Akahane-Yamada, Yoh'ichi Tohkura,
Ann R. Bradlow, David B. Pisoni
-
Phrase-Final Lengthening and Stress-Timed Shortening in the Speech of Native Speakers and Japanese Learners of
English Motoko Ueyama
-
Japanese Accentuations by Foreign Students and Japanese Speakers of Non-Tokyo Dialect Nobuko
Yamada
-
Devoicing of Japanese Vowels by Taiwanese Learners of Japanese J. Kevin Varden, Tsutomu Sato
-
Fluency and Use of Segmental Dialect Features in the Acquisition of a Second Language (French) by English Speakers
Danièle Archambault, Catherine Foucher, Blagovesta Maneva
-
Estimating Child and Adolescent Formant Frequency Values From Adult Data P. Martland, S.P. Whiteside, Steve
W. Beet, L. Baghai-Ravary
Chair: Elizabeth Shriberg, SRI International
-
Acoustic Correlates of Linguistic Stress and Accent in Dutch and American English Agaath M.C. Sluijter, Vincent
J. van Heuven
-
On the Levels of Accentuation in Spoken Japanese Hiroya Fujisaki, Sumio Ohno, Osamu Tomita
-
Tonal Distinctions Between Emphatic Stress and Pretonic Lengthening in Quebec French Linda Thibault, Marise
Ouellet
-
Distinction Between 'Normal' Focus and 'Contrastive/Emphatic' Focus Anja (Petzold) Elsner
-
Perception of Tonal Accent by Americans Learning Japanese Yukihiro Nishinuma, Masako Arai, Takako
Ayusawa
-
Modeling Intra-Speaker Pitch Range Variation: Predicting F0 Targets when "Speaking Up" Elizabeth Shriberg, D.
Robert Ladd, Jacques Terken
Chair: Alicia Abella, AT&T Labs - Research
-
Predicting Dialogue Acts for a Speech-To-Speech Translation System Norbert Reithinger, Ralf Engel, Michael
Kipp, Martin Klesen
-
Automatic Speech Translation Based on the Semantic Structure Johannes Müller, Holger Stahl, Manfred
Lang
-
A Methodology for Application Development for Spoken Language Systems Lewis M. Norton, Carl E. Weir,
K.W. Scholz, Deborah A. Dahl, Ahmed Bouzid
-
A New Restaurant Guide Conversational System: Issues in Rapid Prototyping for Specialized Domains Stephanie
Seneff, Joseph Polifroni
-
Semantic Interpretation of a Japanese Complex Sentence in an Advisory Dialogue - Focused on the Postpositional Word
"KEDO,'' Which Works as a Conjunction Between Clauses Tadahiko Kumamoto, Akira Ito
-
A Korean Morphological Analyzer for Speech Translation System Youngkuk Hong, Myoung-Wan Koo, Gijoo
Yang
-
Generic and Domain-specific Aspects of the Waxholm NLP and Dialog Modules Rolf Carlson, Sheri
Hunnicutt
-
A Real-Time System for Summarizing Human-Human Spontaneous Spoken Dialogues Megumi Kameyama, Goh
Kawai, Isao Arima
-
Evaluation of Spoken Language Understanding and Dialogue Systems Bernd Hildebrandt, Heike Rautenstrauch,
Gerhard Sagerer
-
Inter-Speaker Interaction of F0 in Dialogs Kuniko Kakita
-
A Robust Dialogue System for Making an Appointment Hans Brandt-Pook, Gernot A. Fink, Bernd Hildebrandt,
Franz Kummert, Gerhard Sagerer
-
Segmentation of Spoken Dialogue by Interjections, Disfluent Utterances and Pauses Kazuyuki Takagi, Shuichi
Itahashi
-
A Form-Based Dialogue Manager for Spoken Language Applications David Goddeau, Helen Meng, Joe
Polifroni, Stephanie Seneff, Senis Busayapongchai
-
The Design of Complex Telephony Applications Using Large Vocabulary Speech Technology S.J. Whittaker, D.J.
Attwater
-
Building 10,000 Spoken Dialogue Systems Stephen Sutton, David G. Novick, Ronald A. Cole, Pieter Vermeulen,
Jacques de Villiers, Johan Schalkwyk, Mark Fanty
-
Speaker Intention Modeling for Large Vocabulary Mandarin Spoken Dialogues Yen-Ju Yang, Lee-Feng Chien,
Lin-Shan Lee
-
Hybrid Language Models and Spontaneous Legal Discourse P.E. Kenne, Mary O'Kane
-
Topic Change and Local Perplexity in Spoken Legal Dialogue P.E. Kenne, Mary O'Kane
-
Intonational Cues to Discourse Structure in Japanese Jennifer J. Venditti, Marc Swerts
-
Principles for the Design of Cooperative Spoken Human-Machine Dialogue Niels Ole Bernsen, Hans Dybkjær,
Laila Dybkjær
-
Development and Comparison of Three Syllable Stress Classifiers Karen L. Jenkin, Michael S. Scordilis
Chair: Don Jamieson, University of Western Ontario
-
Interaction of Speech Disorders with Speech Coders: Effects on Speech Intelligibility D.G. Jamieson, Li Deng, M.
Price, Vijay Parsa, J. Till
-
Detecting Arytenoid Cartilage Misplacement through Acoustic and Electroglottographic Jitter Analysis Maurílio N.
Vieira, Arnold G. D. Maran, Fergus R. McInnes, Mervyn A. Jack
-
Robust F0 and Jitter Estimation in Pathological Voices Maurílio N. Vieira, Fergus R. McInnes, Mervyn A.
Jack
-
Speech Monitoring of Infective Laryngitis F. Plante, H. Kessler, B.M.G. Cheetham, J. Earis
-
Searching for Nonlinear Relations in Whitened Jitter Time Series J. Schoentgen, R. De Guchteneere
-
Vocal Fold Pathology Assessment using AM Autocorrelation Analysis of the Teager Energy Operator Liliana
Gavidia-Ceballos, John H.L. Hansen, James F. Kaiser
-
Continuous Positive Airway Pressure (CPAP) in the Treatment of Hypernasality David P. Kuehn
-
Enhancement of Alaryngeal Speech by Adaptive Filtering Carol Y. Espy-Wilson, Venkatesh R. Chari, Caroline B.
Huang
-
Simulation of Disordered Speech Using a Frequency-Domain Vocal Tract Model Li Deng, Xuemin Shen, D.G.
Jamieson, J. Till
-
A Stochastic Model of Fundamental Period Perturbation and Its Application to Perception of Pathological Voice Quality
Yasuo Endo, Hideki Kasuya
-
A Screening Test for Speech Pathology Assessment Using Objective Quality Measures Eric J. Wallen, John H.L.
Hansen
-
Recent Advances in Hypernasal Speech Detection using the Nonlinear Teager Energy Operator Douglas A.
Cairns, John H.L. Hansen, James F. Kaiser
Chair: Maureen Stone, University of Maryland at Baltimore
-
Human Palate and Related Structures: Their Articulatory Consequences Kiyoshi Honda, Shinji Maeda, Michiko
Hashi, Jim Dembowski, John R. Westbury
-
A Continuum Mechanics Representation of Tongue Deformation Edward P. Davis, Andrew Douglas, Maureen
Stone
-
From MRI and Acoustic Data to Articulatory Synthesis: A Case Study of the Lateral Approximants in American English
Philbert Bangayan, Abeer Alwan, Shrikanth Narayanan
-
Liquids in Tamil Shrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan
Chair: Keikichi Hirose, University of Tokyo
-
Modeling Hyperarticulate Speech during Human-computer Error Resolution Sharon Oviatt, Gina-Anne Levow,
Margaret MacEachern, Karen Kuhn
-
Using Stress to Disambiguate Spoken Thai Sentences Containing Syntactic Ambiguity Siripong Potisuk, Mary P.
Harper, Jackson T. Gandour
-
Use of Prosodic Information to Integrate Acoustic and Linguistic Knowledge in Continuous Mandarin Speech Recognition
with Very Large Vocabulary Hung-yun Hsieh, Ren-yuan Lyu, Lin-shan Lee
-
Word Boundary Detection using Pitch Variations G.V. Ramana Rao, J. Srichand
-
Detection of Phrase Boundaries in Japanese by Low-Pass Filtering of Fundamental Frequency Contours Atsuhiro
Sakurai, Keikichi Hirose
-
A New Method for Speech Delexicalization, and its Application to the Perception of French Prosody V. Pagel,
N. Carbonell, Yves Laprie
Chair: Allen L. Gorin, AT&T Labs - Research
-
Task Adaptation for Dialogues Via Telephone Lines Udo Bub
-
The Influence of Bigram Constraints on Word Recognition by Humans: Implications for Computer Speech Recognition
Ronald A. Cole, Yonghong Yan, Troy Bailey
-
ALICE: Acquisition of Language In Conversational Environment - An Approach to Weakly Supervised Training of Spoken
Language System for Language Porting Tetsunori Kobayashi
-
Pitch Pattern Clustering of User Utterances in Human-Machine Dialogue Takashi Yoshimura, Satoru Hayamizu,
Hiroshi Ohmura, Kazuyo Tanaka
-
Simplifying Language through Error-correcting Decoding J.C. Amengual, E. Vidal, J.M. Benedí
-
A Mixed Approach to Speech Understanding Mauro Cettolo, Anna Corazza, Renato De Mori
Chair: Esther Levin, AT&T Labs - Research
-
Speech Recognition for an Information Kiosk J.L. Gauvain, J.J. Gangolf, L. Lamel
-
Localizing an Automatic Inquiry System for Public Transport Information Helmer Strik, Albert Russel, Henk van
den Heuvel, Catia Cucchiarini, Louis Boves
-
Prompt Constrained Natural Language - Evolving the Next Generation of Telephony Services Stephen M.
Marcus, Deborah W. Brown, Randy G. Goldberg, Max S. Schoeffler, William R. Wetzel, Richard R. Rosinski
-
Key-Phrase Detection and Verification for Flexible Speech Understanding Tatsuya Kawahara, Chin-Hui Lee,
Biing-Hwang Juang
-
Interactive Recovery from Speech Recognition Errors in Speech User Interfaces Bernhard Suhm, Brad Myers,
Alex Waibel
-
Estimation of Language Models for New Spoken Language Applications Sunil Issar
Chair: Richard Stern, Carnegie Mellon University
-
H-infinity Filtering for Speech Enhancement Xuemin Shen, Li Deng, Anisa Yasmin
-
A Comparitive Analysis of Channel-Robust Features and Channel Equalization Methods for Speech Recognition
Saeed V. Vaseghi, Ben Milner
-
Robust Speech Recognition Features Based on Temporal Trajectory Filtering of Frequency Band Spectrum Jia-lin
Shen, Wen-liang Hwang, Lin-shan Lee
-
Durational Modelling for Improved Connected Digit Recognition Kevin Power
-
Study on the Dereverberation of Speech Based on Temporal Envelope Filtering Carlos Avendano, Hynek
Hermansky
-
Estimating Markov Model Structures Thorsten Brants
-
A Fertility Channel Model for Post-Correction of Continuous Speech Recognition Eric K. Ringger, James F.
Allen
-
Restoration of Wide Band Signal from Telephone Speech using Linear Prediction Error Processing Hiroshi
Yasukawa
-
Smoothed Spectral Subtraction for a Frequency-Weighted HMM in Noisy Speech Recognition Hiroshi
Matsumoto, Noboru Naitoh
-
A Simple Architecture for using Multiple Cues in Sound Separation William S. Woods, Martin Hansen, Thomas
Wittkop, Birger Kollmeier
-
On the Robust Automatic Segmentation of Spontaneous Speech Bojan Petek, Ove Andersen, Paul
Dalsgaard
-
Bayesian Adaptation of Speech Recognizers to Field Speech Data C.G. Miglietta, C. Mokbel, D. Jouvet, J.
Monné
-
Sub-band Adaptive Filtering Applied to Speech Enhancement A. J. Darlington, D. J. Campbell
-
Noise Robust Estimate of Speech Dynamics for Speaker Recognition J.P. Openshaw, John S. Mason
-
Overview of Speech Enhancement Techniques for Automatic Speaker Recognition Javier Ortega-García, Joaquín
González-Rodríguez
-
Dynamic Features for Segmental Speech Recognition Naomi Harte, Saeed V. Vaseghi, Ben Milner
-
Speech Recognition Based on a Model of Human Auditory System Takuya Koizumi, Mikio Mori, Shuji
Taniguchi
-
APVQ Encoder Applied to Wideband Speech Coding J.M. Salavedra, E. Masgrau
-
Simple Fast Vector Quantization of the Line Spectral Frequencies Jin Zhou, Yair Shoham, Ali Akansu
Chair: Maureen Stone, University of Maryland at Baltimore
-
Speaker Individualities of Vocal Tract Shapes of Japanese Vowels Measured by Magnetic Resonance Images
Chang-Sheng Yang, Hideki Kasuya
-
Vocal Tract Acoustics Using the Transmission Line Matrix (TLM) Method S. El-Masri, X. Pelorson, P. Saguet,
P. Badin
-
Building Sensori-motor Prototypes from Audiovisual Exemplars Gérard Bailly
-
Parameterized VT Area Function Inversion Mats Båvegård, Gunnar Fant
-
An Improved Vocal Tract Model of Vowel Production Implementing Piriform Resonance and Transvelar Nasal
Coupling Jianwu Dang, Kiyoshi Honda
-
Pseudo-articulatory Speech Synthesis for Recognition using Automatic Feature Extraction from X-Ray Data C. S.
Blackburn, S. J. Young
Chair: Chin-Hui Lee, Bell Labs - Lucent Technologies
-
N-best-based Instantaneous Speaker Adaptation Method for Speech Recognition Tomoko Matsui, Sadaoki
Furui
-
Mixture Splitting Technic and Temporal Control in a HMM-based Recognition System C. Montacié, M.-J.
Caraty, C. Barras
-
A Unified Spectral Transformation Adaptation Approach for Robust Speech Recognition Lei Yao, Dong Yu,
Taiyi Huang
-
On-line Adaptive Learning of the Correlated Continuous Density Hidden Markov Models for Speech Recognition
Qiang Huo, Chin-Hui Lee
-
Speaker Adaptation by Modeling the Speaker Variation in a Continuous Speech Recognition System Nikko
Ström
-
An Enquiring System of Unknown Words in TV News by Spontaneous Repetition (Application of Speaker Normalization by
Speaker Subspace Projection) Yasuo Ariki, Shigeaki Tagashira
Chair: Adam L. Buchsbaum, AT&T Labs - Research
-
Language Understanding using Hidden Understanding Models Richard Schwartz, Scott Miller, David Stallard,
John Makhoul
-
Processing of Semantic Information in Fluently Spoken Language Allen L. Gorin
-
Automatic Linguistic Segmentation of Conversational Speech Andreas Stolcke, Elizabeth Shriberg
-
Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy M. Boros, W. Eckert,
Florian Gallwitz, G. Görz, G. Hanrieder, Heinrich Niemann
-
A Stochastic Case Frame Approach for Natural Language Understanding Wolfgang Minker, S.K. Bennacef, J.L.
Gauvain
-
Improving Speech Understanding by Incorporating Database Constraints and Dialogue History Frank Seide,
Bernhard Rüber, Andreas Kellner
Chair: Jan P. van Santen, Bell Labs - Lucent Technologies
-
A New Discourse Structure Model for Spontaneous Spoken Dialogue Tetsuro Chino, Hiroyuki Tsuboi
-
An Architecture for Spoken Dialogue Management David Duff, Barbara Gates, Susann LuperFoy
-
Pausing Strategies in Discourse in Dutch Monique E. van Donzel, Florien J. Koopmans-van Beinum
-
Filled Pauses as Markers of Discourse Structure Marc Swerts, Anne Wichmann, Robbert-Jan Beun
-
The Prosodic Analysis of Korean Dialogue Speech - Through a Comparative Study with Read Speech Cheol-jae
Seong, Minsoo Hahn
-
Changing the Topic: How Long Does it Take? Mary O'Kane, P.E. Kenne
Chair: Ilija Zeljkovic, AT&T Labs - Research
-
Learning Pronunciation Dictionary from Speech Data Christian-Michael Westendorf, Jens Jelitto
-
The Trended HMM with Discriminative Training for Phonetic Classification C. Rathinavelu, Li Deng
-
Improving Decision Trees for Acoustic Modeling Ariane Lazaridès, Yves Normandin, Roland Kuhn
-
An Improved Training Algorithm in HMM-based Speech Recognition Gongjun Li, Taiyi Huang
-
Speech Recognition Using a Strong Correlation Assumption for the Instantaneous Spectra J. Ming, P. O'Boyle, J.
McMahon, F. J. Smith
-
On Parameter Filtering in Continuous Subword-unit-based Speech Recognition Pau Pachès-Leal, Climent
Nadeu
-
Estimation of Statistical Phoneme Center Considering Phonemic Environments Shigeki Okawa, Katsuhiko
Shirai
-
Integration of Context-dependent Durational Knowledge into HMM-based Speech Recognition Xue Wang, Louis
F. M. ten Bosch, Louis C. W. Pols
-
Speech Recognition Based on Acoustically Derived Segment Units T. Fukada, M. Bacchiani, K.K. Paliwal,
Yoshinori Sagisaka
-
Robust Gender-dependent Acoustic-phonetic Modelling in Continuous Speech Recognition Based on a New Automatic
Male/Female Classification Rivarol Vergin, Azarshid Farhat, Douglas O'Shaughnessy
-
A Codebook Adaptation Algorithm for SCHMM Using Formant Distribution Tae Young Yang, Won Ho Shin,
Weon Goo Kim, Dae Hee Youn
-
Parameter Tying for Flexible Speech Recognition J. Simonin, S. Bodin, D. Jouvet, K. Bartkova
-
Word-spotting Based on Inter-word and Intra-word Diphone Models Tsuneo Nitta, Shin'ichi Tanaka, Yasuyuki
Masai, Hiroshi Matsu'ura
-
Duration Modeling with Expanded HMM Applied to Speech Recognition Antonio Bonafonte, Josep Vidal, Albino
Nogueiras
-
Different Strategies for Distribution Clustering using Discrete, Semicontinuous and Continuous HMMs in CSR
Ricardo de Córdoba, José M. Pardo
-
Improved HMM Phone and Triphone Models for Realtime ASR Telephony Applications Ilija Zeljkovic, Shrikanth
Narayanan
-
Improved Extended HMM Composition by Incorporating Power Variance Yasuhiro Minami, Sadaoki
Furui
-
Optimal Filtering and Smoothing for Speech Recognition using a Stochastic Target Model Gordon Ramsay, Li
Deng
-
Speech Recognition Using Syllable-Like Units Zhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A.
Cole
Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology
-
Search for Unexplored Effects in Speech Production C.H. Coker, M.H. Krane, B.Y. Reis, R.A. Kubli
-
Computational Models for Speech Generation S. Levinson
-
Articulatory Synthesis from X-rays and Inversion for an Adaptive Speech Robot P. Badin, C. Abry
Chair: Aaron E. Rosenberg, AT&T Labs - Research
-
Adaptive Recognition Method Based on Posterior Use of Distribution Pattern of Output Probabilities Jin-Song
Zhang, Beiqian Dai, Changfu Wang, Hingkeung Kwan, Keikichi Hirose
-
Iterative Unsupervised Adaptation Using Maximum Likelihood Linear Regression P.C. Woodland, D. Pye,
M.J.F. Gales
-
A Compact Model for Speaker-Adaptive Training Tasos Anastasakos, John McDonough, Richard Schwartz,
John Makhoul
-
Iterative Unsupervised Speaker Adaptation for Batch Dictation Shigeru Homma, Jun-ichi Takahashi, Shigeki
Sagayama
-
Rapid Unsupervised Adaptation to Children's Speech on a Connected-Digit Task Daniel C. Burnett, Mark
Fanty
-
Speaker Adaptation Using Tree Structured Shared-State HMMs Jun Ishii, Masahiro Tonomura, Shoichi
Matsunaga
Chair: David Roe, AT&T Labs - Research
-
Learning to Parse Spontaneous Speech Finn Dag Buo, Alex Waibel
-
Spontaneous Speech and Natural Language Processing ALPES: A Robust Semantic-led Parser Jean-Yves
Antoine
-
The Natural Language Processing Module for a Voice Assisted Operator at Telefónica I+D J. Alvarez-Cercadillo,
J. Caminero-Gil, C. Crespo-Casas, D. Tapias-Merino
-
Compound Words in Large-Vocabulary German Speech Recognition Systems André Berton, Pablo Fetter, Peter
Regel-Brietzmann
-
Prosody, Empty Categories and Parsing - A Success Story Anton Batliner, A. Feldhaus, S. Geissler, T. Kiss, Ralf
Kompe, Elmar Nöth
-
"Almost Parsing" Technique for Language Modeling B. Srinivas
Chair: Dik J. Hermes, Institute for Perception Research / IPO
-
From Segmental Duration Properties to Rhythmic Structure: A Study of Interactions Between High and Low Level
Constraints Marise Ouellet, Benoît Tardif
-
Analysis of Context-dependent Segmental Duration for Automatic Speech Recognition Xue Wang, Louis C. W.
Pols, Louis F. M. ten Bosch
-
The Role of the Rhythmic Groups in the Segmentation of Continuous French Speech Delphine Dahan
-
The Implications of Temporal Patterns for the Prosody of Boundary Signaling in Connected Speech Zita
McRobbie-Utasi
-
Experimental Phonetic Study of the Syllable Duration of Korean with Respect to the Positional Effect Hyunbok
Lee, Cheol-jae Seong
-
Timing of Pitch Movements and Accentuation of Syllables Dik J. Hermes
Chair: Peggy Nelson, University of Maryland at Baltimore
-
A Probabilistic Approach to AMDF Pitch Detection Goangshiuan S. Ying, Leah H. Jamieson, Carl D.
Michell
-
From Sagittal Cut to Area Function: An RMI Investigation Alain Soquet, Véronique Lecuit, Thierry Metens,
Didier Demolin
-
Pitch Detection and Voiced/Unvoiced Decision Algorithm Based on Wavelet Transforms Léonard Janer, Juan
José Bonet, Eduardo Lleida-Solano
-
Decomposition of Speech Signals into a Deterministic and a Stochastic Part Yannis Stylianou
-
Improved Glottal Closure Instant Detector based on Linear Prediction and Standard Pitch Concept Cheol-Woo
Jo, Ho-Gyun Bang, W.A. Ainsworth
-
Analysis of Speech Segments using Variable Spectral/Temporal Resolution Xihong Wang, Stephen A. Zahorian,
Stefan Auberg
-
Time-based Clustering for Phonetic Segmentation Brian Eberman, William Goldenthal
-
Formant Analysis Using Mixtures of Gaussians Parham Zolfaghari, Tony Robinson
-
Deriving Articulatory Representations from Speech with Various Excitation Modes Hywel B. Richards, John S.
Mason, Melvyn J. Hunt, John S. Bridle
-
"Blind" Speech Segmentation: Automatic Segmentation of Speech Without Linguistic Knowledge Manish Sharma,
Richard J. Mammone
-
Speech Synthesis Using a Nonlinear Energy Damping Model for the Vocal Folds Vibration Effect Hiroshi
Ohmura, Kazuyo Tanaka
-
Neural Networks Learning with L1 Criteria and Its Efficiency in Linear Prediction of Speech Signals Munehiro
Namba, Hiroyuki Kamata, Yoshihisa Ishida
-
Preprocessing and Neural Classification of English Stop Consonants [b,d,g,p,t,k] A. Esposito, C. E. Ezin, M.
Ceccarelli
-
A Comparison of Modified k-means(MKM) and NN based Real Time Adaptive Clustering Algorithms for Articulatory
Space Codebook Formation K.S. Ananthakrishnan
-
A Novel Approach to the Estimation of Voice Source and Vocal Tract Parameters from Speech Signals Wen
Ding, Hideki Kasuya
-
Syllable Detection in Read and Spontaneous Speech Hartmut R. Pfitzinger, Susanne Burger, Sebastian
Heid
-
Maximum Likelihood Learning of Auditory Feature Maps for Stationary Vowels Kuansan Wang, Chin-Hui Lee,
Biing-Hwang Juang
-
Explicit Segmentation of Speech using Gaussian Models Antonio Bonafonte, Albino Nogueiras, Antonio
Rodriguez-Garrido
-
A Comparison of Several Recent Methods of Fundamental Frequency and Voicing Decision Estimation E.
Mousset, W.A. Ainsworth, José A. R. Fonollosa
-
Robust Pitch Estimation with Harmonics Enhancement in Noisy Environments Based on Instantaneous Frequency
Toshihiko Abe, Takao Kobayashi, Satoshi Imai
-
Integrated Polispectrum on Speech Recognition Asunción Moreno, Miquel Rutllán
Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology
-
Analysis of Acoustic Properties of the Nasal Tract Using 3-D FEM Hisayoshi Suzuki, Takayoshi Nakai, Hirosi
Sakakibara
-
Experiments with Analysis By Synthesis of Glottal Airflow Johan Liljencrants
Chair: Nelson Morgan, ICSI and University of California, Berkeley
-
An Incremental Speaker-Adaptation Technique for Hybrid HMM-MLP Recognizer Joao P. Neto, Ciro A.
Martins, Luís B. Almeida
-
Phoneme Segmentation of Continuous Speech using Multi-layer Perceptron Youngjoo Suh, Youngjik Lee
-
Stochastic Perceptual Speech Models with Durational Dependence Jeff Bilmes, Nelson Morgan, Su-Lin Wu,
Hervé Bourlard
-
Boosting the Performance of Connectionist Large Vocabulary Speech Recognition G.D. Cook, A.J.
Robinson
-
HMMs and OWE Neural Network for Continuous Speech Recognition Nicolas Pican, Dominique Fohr,
Jean-François Mari
-
Smoothed Local Adaptation of Connectionist Systems Steve Waterhouse, Dan Kershaw, Tony Robinson
Chair: Tony Robinson, Cambridge University
-
Robust Speech Recognition with Speaker Localization by a Microphone Array Takeshi Yamada, Satoshi
Nakamura, Kiyohiro Shikano
-
Sound Source Localization in Reverberant Environments using an Outlier Elimination Algorithm Ea-Ee Jan, James
L. Flanagan
-
The 1995 Abbot LVCSR System for Multiple Unknown Microphones Dan Kershaw, Tony Robinson, Steve
Renals
-
Experiments of Speech Recognition in a Noisy and Reverberant Environment using a Microphone Array and HMM
Adaptation D. Giuliani, M. Omologo, P. Svaizer
-
Increasing Robustness in GMM Speaker Recognition Systems for Noisy and Reverberant Speech with Low Complexity
Microphone Arrays Joaquín González-Rodríguez, Javier Ortega-García, César Martin, Luis Hernández
-
Robust Automatic Speech Recognition Using a Multi-channel Signal Separation Front-End Kuan-Chieh Yen,
Yunxin Zhao
Chair: Mark Steedman, University of Pennsylvania
-
Prosody Generation in Text-to-Speech Conversion Using Dependency Graphs Anders Lindström, Ivan Bretan,
Mats Ljungqvist
-
Extraction Method of Non-restrictive Modification in Japanese as a Marked Factor of Prosody Hisako Asano,
Hisashi Ohara, Yoshifumi Ooyama
-
Modeling Contrast in the Generation and Synthesis of Spoken Language Scott Prevost
-
A Left-to-right Processing Model of Pausing in Japanese Based on Limited Syntactic Information Hajime
Tsukada
-
Modeling of Intonation Bearing Emphasis for TTS-Synthesis of Greek Dialogues D. Galanis, V. Darsinos, G.
Kokkinakis
-
Synthesizing Prosody: a Prominence-based Approach Barbara Heuft, Thomas Portele
Chair: Thierry Dutoit, Facilte Psytechnique De Mons - TCTS Laboratory
-
Multilingual Text Analysis for Text-to-Speech Synthesis Richard Sproat
-
Spoken-style Explanation Generator for Japanese Kanji using a Text-to-speech System Yoshifumi Ooyama,
Hisako Asano, Koji Matsuoka
-
A Method for Estimating Prosodic Symbol from Text for Japanese Text-To-Speech Synthesis Ken-ichi Magata,
Tomoki Hamagami, Mitsuo Komura
-
Statistical Methods in Data-driven Modeling of Spanish Prosody for Text to Speech E. López-Gonzalo, J.M.
Rodríguez-García
-
Intonation Processing for TTS Using Stylization and Neural Network Learning Method Jung-Chul Lee, Youngjik
Lee, Sang-Hun Kim, Minsoo Hahn
-
Generating F0 Contours from ToBI Labels using Linear Regression Alan W. Black, Andrew J. Hunt
-
The Broad Study of Homograph Disambiguity for Mandarin Speech Synthesis Wern-Jun Wang, Shaw-Hwa
Hwang, Sin-Horng Chen
-
The MBROLA project: Towards a Set of High Quality Speech Synthesizers Free of Use for Non Commercial Purposes
T. Dutoit, V. Pagel, N. Pierret, F. Bataille, O. Van der Vrecken
-
Training Data Selection for Voice Conversion Using Speaker Selection and Vector Field Smoothing Makoto
Hashimoto, Norio Higuchi
-
A New Voice Transformation Method Based on Both Linear and Nonlinear Prediction Analysis Ki Seung Lee,
Dae Hee Youn, Il Whan Cha
-
On the Transformation of the Speech Spectrum for Voice Conversion G. Baudoin, Yannis Stylianou
-
Spectral Analysis of Synthetic Speech and Natural Speech with Noise over the Telephone Line Cristina Delogu,
Andrea Paoloni, Susanna Ragazzini, Paola Ridolfi
-
A New Speech Synthesis System Based on the ARX Speech Production Model Weizhong Zhu, Hideki
Kasuya
-
Speech Synthesis Using the CELP Algorithm Geraldo Lino de Campos, Evandro Bacci Gouvêa
-
A Mandarin Text-to-Speech System Shaw-Hwa Hwang, Sin-Horng Chen, Yih-Ru Wang
-
Residual-based Speech Modification Algorithms for Text-to-Speech Synthesis M.D. Edgington, A. Lowry
-
A Generalized LR Parser for Text-to-speech Synthesis Per Olav Heggtveit
-
Enhanced Shape-invariant Pitch and Time-scale Modification for Concatenative Speech Synthesis M.P. Pollard,
B.M.G. Cheetham, C.C. Goodyear, M.D. Edgington, A. Lowry
-
An Excitation Synchronous Pitch Waveform Extraction Method and its Application to the VCV-concatenation Synthesis of
Japanese Spoken Words Yasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura, Takashi Honda
-
A New Chinese Text-to-Speech System with High Naturalness Ren-Hua Wang, Qinfeng Liu, Difei Tang
-
Voice Conversion Based on Topological Feature Maps and Time-variant Filtering Ansgar Rinscheid
Chair: Reiko A. Yamada, ATR Human Information Processing Research Laboratories
-
Language Training System Utilizing Speech Modification Meron Yoram, Keikichi Hirose
-
Perception of English /r/ and /l/ Speech Contrasts by Native Korean Listeners with Extensive English-language
Experience D.G. Jamieson, K. Yu
-
Automatic Text-independent Pronunciation Scoring of Foreign Language Student Speech Leonardo Neumeyer,
Horacio Franco, Mitchel Weintraub, Patti Price
-
Assessing the Contribution of Instructional Technology in the Teaching of Pronunciation Antônio Simoes
-
Detection of Foreign Speakers' Pronunciation Errors for Second Language Training - Preliminary Results Maxine
Eskenazi
-
Foreign Accent in Intonation Patterns - A Contrastive Study Applying a Quantitative Model of the F0 Contour
Hansjörg Mixdorff
-
Input Modality Effects in Foreign Accent Duncan J. Markham, Yasuko Nagano-Madsen
Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble
-
For Speech Perception by Humans or Machines, Three Senses are Better than One Lynne E. Bernstein, Christian
Benoît
-
A Few Factors Which Affect the Degree of Incorporating Lip-read Information into Speech Perception Kaoru
Sekiyama, Yoh'ichi Tohkura, Michio Umeda
-
Characterizing Audiovisual Information During Speech E. Vatikiotis-Bateson, K.G. Munhall, Y. Kasahara, F.
Garcia, H. Yehia
-
The Implications of the Tadoma Method of Speechreading for Spoken Language Processing Charlotte M.
Reed
-
Seeing Speech in Space and Time: Psychological and Neurological Findings Ruth Campbell
Chair: Paul Taylor, University of Edinburgh
-
What's in the "Pure" Prosody? Volker Strom, Christina Widera
-
F0 Declination in Read-aloud and Spontaneous Speech Marc Swerts, Eva Strangert, Mattias Heldner
-
Prediction of Prosodic Phrase Boundaries Considering Variable Speaking Rate Yeon-jun Kim, Yung-hwan
Oh
-
Prediction of F0 Parameter of Contextualized Utterances in Dialogue Yoichi Yamashita, Riichiro Mizoguchi
-
The Production and Perception of Potentially Ambiguous Intonation Contours by Speakers of Russian and Japanese
V. Makarova, J. Matsui
-
What is Invariant and What is Optional in the Realization of a FOCUSED Word? A Cross-dialectal Study of Swedish
Sentences With Moving Focus Robert Eklund
Chair: Christine Shadle, University of Southhampton
-
Quantifying Spectral Characteristics of Fricatives Christine H. Shadle, Sheila J. Mair
-
Acoustic Characteristics of Ejectives in Ingush Natasha Warner
-
An Acoustic Profile of Consonant Reduction R.J.J.H. van Son, Louis C. W. Pols
-
Devoicing in Post-vocalic Canadian-French Obstruants Danièle Archambault, Blagovesta Maneva
-
Paying Attention to Speaking Rate Alexander L. Francis, Howard C. Nusbaum
-
The Lack of Invariance Problem and the Goal of Speech Perception Irene Appelbaum
Chair: Harriet S. Magen, Rhode Island College
-
The Acoustic Structure of Vowels in Mothers' Speech to Infants and Adults Jean E. Andruski, Patricia K.
Kuhl
-
Acoustical Characteristics of Sound Production of Deaf and Normally Hearing Infants Chris J. Clement, Florien J.
Koopmans-van Beinum, Louis C. W. Pols
-
Learning Non-native Vowel Categories John Kingston, Christine Bartels, José Benkí, Deanna Moore, Jeremy
Rice, Rachel Thorburn, Neil Macmillan
-
Word Recognition by Japanese Infants P.A. Halle, Toshisada Deguchi, Yuji Tamekawa, B. Boysson-Bardies,
Shigeru Kiritani
-
Investigations of the Word Segmentation Abilities of Infants Peter W. Jusczyk
-
Developmental Change in Perception of Clause Boundaries by 6- and 10-Month-old Japanese Infants Akiko
Hayashi, Yuji Tamekawa, Toshisada Deguchi, Shigeru Kiritani
Chair: Carol Espy-Wilson, Boston University
-
A Frequency Domain Method for Parametrization of the Voice Source Paavo Alku, Erkki Vilkman
-
Glottal Correlates of the Word Stress and the Tense/Lax Opposition in German Krzysztof Marasek
-
Coarticulatory Stability in American English /r/ Suzanne Boyce, Carol Y. Espy-Wilson
-
An MRI-based Analysis of the English /r/ and /l/ Articulations Shinobu Masaki, Reiko Akahane-Yamada, Mark
K. Tiede, Yasuhiro Shimada, Ichiro Fujimoto
-
Does Lexical Stress or Metrical Stress Better Predict Word Boundaries in Dutch? David van Kuijk
-
Optopalatograph (OPG): A New Apparatus for Speech Production Analysis A. A. Wrench, A. D. McIntosh, W.
J. Hardcastle
-
Prediction of Vowel Systems using a Deductive Approach René Carré
-
Distinctions Between [t] and [tch] using Electropalatography Data Sheila J. Mair, Celia Scully, Christine H.
Shadle
-
Relating Formants and Articulation in Intelligibility Test Words Michiko Hashi, Raymond D. Kent, John R.
Westbury, Mary J. Lindstrom
-
The Role of Coarticulation in the Perception of Vowel Quality in Modern Standard Arabic Imad Znagui,
Mohamed Yeou
-
Updating the Reading EPG Simon Arnfield, Wilf Jones
-
Lexical Stress Detection on Stress-minimal Word Pairs Goangshiuan S. Ying, Leah H. Jamieson, Ruxin Chen,
Carl D. Mitchell
-
An Acoustic Study of the Interaction Between Stressed and Unstressed Syllables in Spoken Mandarin Jing
Wang
-
Automatic Detection of Accent Nuclei at the Head of Words for Speech Recognition Nobuaki Minematsu, Seiichi
Nakagawa
-
Automatic Generation of Prosodic Structure for High Quality Mandarin Speech Synthesis Fu-chiang Chou,
Chiu-yu Tseng, Lin-shan Lee
-
A Study on Japanese Prosodic Pattern and its Modeling in Restricted Speech Tomoki Hamagami, Ken-ichi
Magata, Mitsuo Komura
-
A Phonetic Study of Focus in Intransitive Verb Sentences Steve Hoskins
-
Variation in Vocal Fold Vibration Associated with Prosodic Conditions Shigeru Kiritani, Hiroshi Imagawa, Seiji
Niimi
-
Goethe for Prosody Stefan Rapp
-
Prosodic Cues in Syntactically Ambiguous Strings; An Interactive Speech Planning Mechanism K.A.
Straub
-
A Functional Model for Generation of the Local Components of F0 Contours in Chinese Jinfu Ni, Ren-Hua
Wang, Deyu Xia
-
The Acquisition of Voiceless Stops in the Interlanguage of Second Language Learners of English and Spanish
Marie Fellbaum
-
Jaw Contribution to Timing Control of "Guttural" Consonants Production Ahmed M. Elgendy
Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble
-
Studies of the McGurk Effect: Implications for Theories of Speech Perception Kerry P. Green
-
Using the Visual Component in Automatic Speech Recognition N. M. Brooke
-
Perceptual Organization of Speech in One and Several Modalities: Common Functions, Common Resources
Robert E. Remez
-
Multi-modal Encoding of Speech in Memory: A First Report David B. Pisoni, Helena M. Saldaña, Sonya M.
Sheffert
Chair: Klaus R. Scherer, University of Geneva
-
Word Class Driven Synthesis of Prosodic Annotations Simon Arnfield
-
Dynamical Modelling of Vowel Sounds as a Synthesis Tool M. Banbrook, S. McLaughlin
-
Emotional Speech Elicited using Computer Games Tom Johnstone
-
Automatic Statistical Analysis of the Signal and Prosodic Signs of Emotion in Speech Roddy Cowie, Ellen
Douglas-Cowie
-
Recognizing Emotion in Speech Frank Dellaert, Thomas Polzin, Alex Waibel
-
Emotions in Time Domain Synthesis Barbara Heuft, Thomas Portele, Monika Rauth
Chair: Candy Kamm, AT&T Labs - Research
-
Evaluating Automatic Speech Recognition as a Component of a Multi-input Device Human-computer Interface
B.A. Mellor, C. Baber, C. Tunley
-
Data Collection for the MASK Kiosk: WOz vs Prototype System A. Life, I. Salter, J.N. Temem, F. Bernard, S.
Rosset, S.K. Bennacef, Lori Lamel
-
An Experimental Japanese/English Interpreting Video Phone System M. Karaorman, T.H. Applebaum, T. Itoh,
M. Endo, Y. Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S. Pearson, J.-C. Janqua
-
User Participation and Compliance in Speech Automated Telecommunications Applications Sara Basson, Stephen
Springer, Cynthia Fong, Hong Leung, Ed Man, Michele Olson, John Pitrelli, Ranvir Singh, Suk Wong
-
Embedding Speech in Web Interfaces Samuel Bayer
-
Voice-activated Home Banking System and its Field Trial Toshihiro Isobe, Masatoshi Morishima, Fuminori
Yoshitani, Nobuo Koizumi, Ken'ya Murakami
Chair: Juergen Schroeter, AT&T Labs - Research
-
A Text Analyzer for Korean Text-to-Speech Systems Sangho Lee, Yung-Hwan Oh
-
Design and Evaluation of a Phonological Phrase Parser for Spanish Text-to-Speech Helen E. Karn
-
Comparison of Two Tree-Structured Approaches for Grapheme-to-Phoneme Conversion Ove Andersen, Roland
Kuhn, Ariane Lazaridès, Paul Dalsgaard, Jürgen Haas, Elmar Nöth
-
A Recurrent Network that Learns to Pronounce English Text M.J. Adamson, R.I. Damper
-
Archisegment-based Letter-to-Phone Conversion for Concatenative Speech Synthesis in Portuguese Eleonora
Cavalcante Albano, Agnaldo Antonio Moreira
-
A New Method of Generating Speech Synthesis Units Based on Phonological Knowledge and Clustering Technique
Yuki Yoshida, Shin'ya Nakajima, Kazuo Hakoda, Tomohisa Hirokawa
Chair: Louis Boves, Nymegen University
-
Consistency in Transcription and Labelling of German Intonation with GToBI Martine Grice, Matthias Reyelt, Ralf
Benzmüller, Jörg Mayer, Anton Batliner
-
Syntactic-prosodic Labeling of Large Spontaneous Speech Data-bases Anton Batliner, R. Kompe, A. Kiessling,
H. Niemann, E. Nöth
-
Relationship Between Discourse Structure and Dynamic Speech Rate Florien J. Koopmans-van Beinum, Monique
E. van Donzel
-
Using Prosodic Clues to Decide When to Produce Back-channel Utterances Nigel Ward
-
Dialog Act Classification with the Help of Prosody Marion Mast, Ralf Kompe, Stefan Harbeck, Andreas
Kiessling, Heinrich Niemann, Elmar Nöth, E. G. Schukat-Talamazzini, V. Warnke
-
Using Lexical Stress in Continuous Speech Recognition for Dutch David van Kuijk, Henk van den Heuvel, Louis
Boves
Chair: Sadaoki Furui, NTT Human Interface Lab
-
Automatic Accent Classification of Foreign Accented Australian English Speech Karsten Kumpf, Robin W.
King
-
Discriminative Adaptation for Speaker Verification F. Korkmazskiy, Biing-Hwang Juang
-
Perceptual Features of Unknown Foreign Languages as Revealed by Multi-dimensional Scaling V. Stockmal, D.
Muljani, Z.S. Bond
-
On-line Incremental Adaptation for Speaker Verification using Maximum Likelihood Estimates of CDHMM Parameters
Kin Yu, John S. Mason
-
Combining Methods to Improve Speaker Verification Decision Dominique Genoud, Frédéric Bimbot, Guillaume
Gravier, Gérard Chollet
-
Incremental Speaker Adaptation with Minimum Error Discriminative Training for Speaker Identification Cesar
Martín del Alamo, J. Alvarez, C. de la Torre, F.J. Poyatos, L. Hernández
-
Frame Level Likelihood Normalization for Text-independent Speaker Identification using Gaussian Mixture Models
Konstantin P. Markov, Seiichi Nakagawa
-
On Using Prosodic Cues in Automatic Language Identification Ann E. Thymé-Gobbel, Sandra E. Hutchins
-
Speaker Recognition Model using Two-dimensional Mel-Cepstrum and Predictive Neural Network Tadashi
Kitamura, Shinsai Takei
-
Unknown Language Rejection in Language Identification System Hingkeung Kwan, Keikichi Hirose
-
Spoken Language Identification using Large Vocabulary Speech Recognition James L. Hieronymus, Shubha
Kadambe
-
Accent Identification Carlos Teixeira, Isabel M. Trancoso, António Serralheiro
-
Comparison of Text-independent Speaker Recognition Methods on Telephone Speech with Acoustic Mismatch
Sarel van Vuuren
-
On the Sources of Inter- and Intra-speaker Variability in the Acoustic Dynamics of Speech Xue Yang, J. Bruce
Millar, Iain Macleod
-
Language Identification with Inaccurate String Matching Kay M. Berkling, Etienne Barnard
-
Robust Prosodic Features for Speaker Identification M.J. Carey, E.S. Parris, H. Lloyd-Thomas, S.J.
Bennett
-
Text Independent Speaker Identification on Noisy Environments by Means of Self Organizing Maps E. Monte, J.
Hernando, X. Miró, A. Adolf
-
Language-identification Using Language-dependent Phonemes and Language-independent Speech Units Paul
Dalsgaard, Ove Andersen, Hanne Hesselager, Bojan Petek
Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Facult'e Polytechnique De
Mons
-
Introduction to SWB Jorden Cohen
-
Disfluencies in SWB Elizabeth Shriberg
-
Error Analysis and Disfluency Modeling Ronald Rosenfeld
-
Fast Sparse Data Training/Portability Andreas Stolcke
-
Phrase Structure Language Models Salim Roukos
-
Language Modeling Issues for Spanish Herbert Gish
-
SRI Speaking Mode Experiments Andreas Stolcke
Chair: Klaus R. Scherer, University of Geneva
-
Adding the Affective Dimension: A New Look in Speech Analysis and Synthesis Klaus R. Scherer
-
Ethological Theory and the Expression of Emotion in the Voice John J. Ohala
-
Synthesizing Emotions in Speech: Is it Time to Get Excited? Iain R. Murray, John L. Arnott
Chair: Richard Rose, AT&T Labs - Research
-
A Study on Task-independent Subword Selection and Modeling for Speech Recognition Chin-Hui Lee,
Biing-Hwang Juang, Wu Chou, J.J. Molina-Perez
-
Simultaneous ANN Feature and HMM Recognizer Design using String-based Minimum Classification Error (MCE)
Training Mazin G. Rahim, Chin-Hui Lee
-
Quantizing Mixture-weights in a Tied-mixture HMM Sunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen
-
Variance Compensation within the MLLR Framework for Robust Speech Recognition and Speaker Adaptation
M.J.F. Gales, D. Pye, P.C. Woodland
-
Maximum-likelihood Stochastic Matching Approach to Non-linear Equalization for Robust Speech Recognition
A.C. Surendran, Chin-Hui Lee, Mazin G. Rahim
-
Estimation of Channel Bias for Telephone Speech Recognition Jen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min
Lee
Chair: Bernd Moebius, Bell Labs - Lucent Technologies
-
Synthesis of English Intonation using Explicit Models of Reading and Spontaneous Speech M. E. Johnson
-
Generating Intonation by Superposing Gestures Yann Morlec, Gérard Bailly, Vèronique Aubergé
-
Implementation and Evaluation of a Model for Synthesis of Swedish Intonation Merle Horne, Marcus
Filipsson
-
Natural Prosody Generation for Domain Specific Text-to-Speech Systems Nobuyuki Katae, Shinta Kimura
-
Improving Text-to-Speech Synthesis Mark Tatham, Eric Lewis
-
Synthesis of Stressed Speech from Isolated Neutral Speech Using HMM-based Models Sahar E. Bou-Ghazale,
John H.L. Hansen
-
Modeling Segment Intonation for Slovene TTS System Ales Dobnikar
Chair: David G. Novick, European Institute of Cognitive Sciences and Engineering
-
Word Predictability After Hesitations: A Corpus-based Study Elizabeth Shriberg, Andreas Stolcke
-
Interruptions and Intonation Li-chiung Yang
-
On not Recognizing Disfluencies in Dialogue Robin J. Lickley, Ellen Gurman Bard
-
A Theory of Word Frequencies and its Application to Dialogue Move Recognition Phil Garner, Sue Browning,
Roger Moore, Martin Russell
-
Utterance Units and Grounding in Spoken Dialogue David R. Traum, Peter A. Heeman
-
Coordinating Turn-taking with Gaze David G. Novick, Brian Hansen, Karen Ward
Chair: Bruce M. Buntschuh, AT&T Labs - Research
-
BABEL: An Eastern European Multi-language Database Peter Roach, Simon Arnfield, W. Barry, J. Baltova, M.
Boldea, A. Fourcin, W. Gonet, R. Gubrynowicz, E. Hallum, L. Lamel, K. Marasek, A. Marchal, E. Meister, K. Vicsi
-
USTC95---A Putonghua Corpus Ren-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu
-
Telephone Data Collection using the World Wide Web Edward Hurley, Joseph Polifroni, James Glass
-
The "SIVA" Speech Database for Speaker Verification: Description and Evaluation M. Falcone, A. Gallo
-
A Multi-level Description of Date Expressions in German Telephone Speech Christoph Draxler
-
Viterbi Search Visualization Using Vista: A Generic Performance Visualization Tool Robert H. Halstead Jr., Ben
Serridge, Jean-Manuel Van Thong, William Goldenthal
-
A Multilingual Phonetic Representation and Analysis System for Different Speech Databases Toomas Altosaar,
Matti Karjalainen, Martti Vainio
-
FRESCO: The French Telephone Speech Data Collection - Part of the European SpeechDat(M) Project D.
Langmann, R. Haeb-Umbach, Louis Boves, E. den Os
-
Predicting the Out-of-Vocabulary Rate and the Required Vocabulary Size for Speech Processing Applications
Johannes Müller, Holger Stahl, Manfred Lang
-
AMULET: Automatic MUltisensor Speech Labelling and Event Tracking: Study of the Spatio-temporal Correlations in
Voiceless Plosive Production Nathalie Parlangeau, Alain Marchal
-
Constructing Multi-level Speech Database for Spontaneous Speech Processing Minsoo Hahn, Sanghun Kim,
Jung-Chul Lee, Yong-Ju Lee
-
Preliminaries to a Romanian Speech Database Marian Boldea, Alin Doroga, Tiberiu Dumitrescu, Maria
Pescaru
-
Labelled Data Bank of Spoken Standard German The Kiel Corpus of Read/Spontaneous Speech Klaus J.
Kohler
-
SAPPHIRE: An Extensible Speech Analysis and Recognition Tool Based on Tcl/Tk Lee Hetherington, Michael
McCandless
-
Automatic Detection of Topic Boundaries and Keywords in Arbitrary Speech Using Incremental Reference Interval-free
Continuous DP Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka
-
Very-large-vocabulary Mandarin Voice Message File Retrieval using Speech Queries Bo-Ren Bai, Lee-Feng
Chien, Lin-Shan Lee
-
Gandalf - A Swedish Telephone Speaker Verification Database H. Melin
-
The DCIEM Map Task Corpus: Spontaneous Dialogue Under Sleep Deprivation and Drug Treatment Ellen
Gurman Bard, C. Sotillo, A. H. Anderson, M. M. Taylor
-
The Nemours Database of Dysarthric Speech Xavier Menéndez-Pidal, James B. Polikoff, Shirley M. Peters,
Jennie E. Leonzio, H.T. Bunnell
-
POST: Parallel Object-oriented Speech Toolkit Jean Hennebert, Dijana Petrovska Delacrétaz
Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Facult'e Polytechnique De
Mons
-
Insights into Spoken Language Gleaned from Phonetic Transcription of the Switchboard Corpus Steven
Greenberg
-
Automatic Learning of Word Pronunciation from Data Eric Fosler
-
Modeling Systematic Variations in Pronunciation Bill Byrne
-
Speech Data Modeling Nelson Morgan
-
Linguistic Dependency Modeling Andreas Stolcke
-
Summary, Observations, and Plans for the Future Fred Jelinek
Chair: Mazin Rahim, AT&T Labs - Research
-
Channel and Noise Normalization Using Affine Transformed Cepstrum Xiaoyu Zhang, Richard J.
Mammone
-
Spectral Estimation and Normalisation for Robust Speech Recognition Tom Claes, Fei Xie, Dirk Van
Compernolle
-
Trellis Encoded Vector Quantization for Robust Speech Recognition Wu Chou, Nambi Seshadri, Mazin
Rahim
-
Phone Clustering using the Bhattacharyya Distance Brian Mak, Etienne Barnard
-
Variability of Lombard Effects Under Different Noise Conditions Atsushi Wakao, Kazuya Takeda, Fumitada
Itakura
-
Lombard Effect Compensation and Noise Suppression for Noisy Lombard Speech Recognition Sang-mun Chi,
Yung-Hwan Oh
Chair: Jim Hieronymus, Bell Labs - Lucent Technologies
-
The Use of Shibboleth Words for Automatically Classifying Speakers by Dialect A.W.F. Huggins, Yogen
Patel
-
The Organization of Dialect Diversity in North America William Labov
-
Data Collection of Japanese Dialects and its Influence into Speech Recognition Ikuo Kudo, Takao Nakama,
Tomoko Watanabe, Reiko Kameyama
-
Statistical Dialect Classification Based on Mean Phonetic Features David R. Miller, James Trischitta
-
Norwegian Numerals: a Challenge to Automatic Speech Recognition Knut Kvale
-
Evaluation of the Telefónica I+D Natural Numbers Recognizer over Different Dialects of Spanish from Spain and
America C. de la Torre, J. Caminero-Gil, J. Alvarez, C. Martín del Alamo, L. Hernández-Gómez
Chair: Gunnar Fant, KTH
-
Rhythmic Constraints on English Stress Timing Fred Cummins, Robert F. Port
-
On the Interaction of Clash, Focus and Phonological Phrasing Irene Vogel, Steve Hoskins
-
On the Quantal Nature of Speech Timing Gunnar Fant, Anita Kruckenberg
-
Differential Perception of Tonal Contours Through the Syllable David House
-
Pitch, Loudness, and Segmental Duration Correlates: Towards a Model for the Phonetic Aspects of Finnish Prosody
Martti Vainio, Toomas Altosaar
-
Prosodic Manipulation System of Speech Material for Perceptual Experiments Nobuaki Minematsu, Seiichi
Nakagawa, Keikichi Hirose
Chair: Enrico Bocchieri, AT&T Labs - Research
-
Clustered Language Models with Context-Equivalent States J.P. Ueberla, I. R. Gransden
-
Modeling of Contextual Effects and its Application to Word Spotting Yuji Yonezawa, Masato Akagi
-
A New Keyword Spotting Algorithm with Pre-calculated Optimal Thresholds J. Junkawitsch, L. Neubauer, H.
Höge, G. Ruske
-
Detection of Ambiguous Portions of Signal Corresponding to OOV Words or Misrecognized Portions of Input
Roxane Lacouture, Yves Normandin
-
Techniques for Approximating a Trigram Language Model Fabio Brugnara, Marcello Federico
-
Unsupervised and Incremental Speaker Adaptation under Adverse Environmental Conditions Keizaburo Takagi,
Koichi Shinoda, Hiroaki Hattori, Takao Watanabe
-
An Adaptive-Beam Pruning Technique for Continuous Speech Recognition Hugo Van hamme, Filip Van
Aelten
-
Data Based Filter Design for RASTA-like Channel Normalization in ASR Carlos Avendano, Sarel van Vuuren,
Hynek Hermansky
-
A Comparison of Time Conditioned and Word Conditioned Search Techniques for Large Vocabulary Speech
Recognition S. Ortmanns, H. Ney, Frank Seide, I. Lindam
-
Language-model Look-ahead for Large Vocabulary Speech Recognition S. Ortmanns, H. Ney, A. Eiden
-
A New Search Algorithm in Segmentation Lattices of Speech Signals Jean-Luc Husson, Yves Laprie
-
LR-Parser-driven Viterbi Search with Hypotheses Merging Mechanism Using Context-dependent Phone Models
Tomokazu Yamada, Shigeki Sagayama
-
Discrete-Utterance Recognition with a Fast Match Based on Total Data Reduction Jan Nouza
-
On-line Garbage Modeling with Discriminant Analysis for Utterance Verification J. Caminero, C. de la Torre, L.
Villarrubia, C. Martín, L. Hernández
-
Cheating with Imperfect Transcripts Paul Placeway, John Lafferty
-
Novel Training Method for Classifiers used in Speaker Adaptation Naoto Iwahashi
-
Large Vocabulary Word Recognition based on a Graph-structured Dictionary Katsuki Minamino
-
A Word Graph Based N-Best Search in Continuous Speech Recognition Bach-Hiep Tran, Frank Seide, Volker
Steinbiss
-
Viterbi Beam Search with Layered Bigrams David M. Goblirsch
-
A Wave Decoder for Continuous Speech Recognition Eric Burhke, Wu Chou, Qiru Zhou
-
Long Term On-line Speaker Adaptation for Large Vocabulary Dictation Eric Thelen
-
Incremental Generation of Word Graphs Gerhard Sagerer, Heike Rautenstrauch, G. A. Fink, Bernd Hildebrandt,
A. Jusek, Franz Kummert
-
Improvement in N-Best Search for Continuous Speech Recognition Irina Illina, Yifan Gong
-
Sethos: The UPC Speech Understanding System Antonio Bonafonte, José B. Mariño, Albino Nogueiras
-
Segmental Search for Continuous Speech Recognition Pietro Laface, Luciano Fissore, A. Maro, Franco
Ravera
Chair: Donald Hindle, AT&T Labs - Research
-
An Investigation into the Generation of Mouth Shapes for a Talking Head A. P. Breen, E. Bowers, W.
Welsh
-
A Text-to-audiovisual-speech Synthesizer for French Bertrand Le Goff, Christian Benoît
-
Analysis of Head Movements and its Role in Spoken Dialogue Yuri Iwano, Shioya Kageyama, Emi Morikawa,
Shu Nakazato, Katsuhiko Shirai
-
RWC Multimodal Database for Interactions by Integration of Spoken Language and Visual Information Satoru
Hayamizu, Osamu Hasegawa, Katunobu Itou, Katuhiko Sakaue, Kazuyo Tanaka, Shigeki Nagaya, Masayuki Nakazawa, T.
Endoh, Fumio Togawa, Kenji Sakamoto, Kazuhiko Yamamoto
-
About the Relationship Between Eyebrow Movements and Fo Variations Christian Cavé, Isabelle Guaïtella,
Roxane Bertrand, Serge Santi, Françoise Harlay, Robert Espesser
-
How Many Words is a Picture Really Worth? Laurel Fais, Kyung-ho Loken-Kim, Tsuyoshi Morimoto
-
Visual Synthesis of Source Acoustic Speech Through Kohonen Neural Networks A. Lagana`, F. Lavagetto, A.
Storace
-
Audio-visual Speech Perception Without Speech Cues Helena M. Saldaña, David B. Pisoni, Jennifer M.
Fellowes, Robert E. Remez
Chair: Alex Waibel, Carnegie Mellon University
-
Multilingual Speech Recognition at Dragon Systems Jim Barnett, A. Corrada, G. Gao, L. Gillick, Y. Ito, S. Lowe,
L. Manganaro, B. Peskin
-
Multi-lingual Phoneme Recognition Exploiting Acoustic-phonetic Similarities of Sounds Joachim Köhler
-
Japanese Speech Databases for Robust Speech Recognition Atsushi Nakamura, Shoichi Matsunaga, Tohru
Shimizu, Masahiro Tonomura, Yoshinori Sagisaka
-
Spoken Language Processing in a Multilingual Context Lori F. Lamel, M. Adda-Decker, Jean Luc Gauvain, G.
Adda
-
Multilingual Human-computer Interactions: From Information Access to Language Learning Victor Zue, Stephanie
Seneff, Joseph Polifroni, Helen Meng, James Glass
-
SpeeData: Multilingual Spoken Data Entry U. Ackermann, B. Angelini, F. Brugnara, M. Federico, D. Giuliani, R.
Gretter, G. Lazzari, H. Niemann
Chair: Michael Macon, Georgia Institute of Technology
-
Pseudo-articulatory Representations in Speech Synthesis and Recognition William H. Edmondson, Jon P. Iles,
Dorota J. Iskra
-
Synthesis of Initial (/s/-) Stop-liquid Clusters using HLsyn David R. Williams
-
Synthesis of Trill Chilin Shih
-
Phone-based Speech Synthesis with Neural Network and Articulatory Control W.K. Lo, P.C. Ching
-
Analysis of Ten Vowel Sounds Across Gender and Regional/Cultural Accent P. Martland, S.P. Whiteside, Steve
W. Beet, L. Baghai-Ravary
-
Speech Morphing by Gradually Changing Spectrum Parameter and Fundamental Frequency Masanobu
Abe
Chair: David Talkin, Entropic Research Laboratory
-
The Multi-Lag-Window Method for Robust Extended-range F0 Determination Edouard Geoffrois
-
Nonlinear Estimation of DEGG Signals with Applications to Speech Pitch Detection Kenneth E. Barner
-
Pitch Analysis Methods for Cross-Speaker Comparison John. A. Maidment, M. Luisa Garcia-Lecumberri
-
Continuous Adaptation of Linear Models with Impulsive Excitation Steve W. Beet, L. Baghai-Ravary
-
Quantitative Analysis of the Local Speech Rate and its Application to Speech Synthesis Sumio Ohno, Masamichi
Fukumiya, Hiroya Fujisaki
-
A Fast and Reliable Rate of Speech Detector Jan P. Verhasselt, Jean-Pierre Martens
Chair: Li Deng, University of Waterloo
-
Context Modeling and Clustering in Continuous Speech Recognition Jean-Claude Junqua, Lorenzo
Vassallo
-
Hierarchical Partition of the Articulatory State Space for Overlapping-feature Based Speech Recognition Li Deng,
Jim Jian-Xiong Wu
-
A Fuzzy Acoustic-phonetic Decoder for Speech Recognition Olivier Oppizzi, David Fournier, Philippe Gilles,
Henri Méloni
-
Syllable-level Desynchronisation of Phonetic Features for Speech Recognition Katrin Kirchhoff
-
A Probabilistic Framework for Feature-based Speech Recognition James Glass, Jane Chang, Michael
McCandless
-
Modeling Context-dependent Phonetic Units in a Continuous Speech Recognition System for Mandarin Chinese
Jim Jian-Xiong Wu, Li Deng, Jacky Chan
Chair: Lori Lamel, LIMSI-CNRS
-
JANUS-II: Towards Spontaneous Spanish Speech Recognition Puming Zhan, Klaus Ries, Marsal Gavaldà,
Donna Gates, Alon Lavie, Alex Waibel
-
Reduced Semi-continuous Models for Large Vocabulary Continuous Speech Recognition in Dutch Kris
Demuynck, Jacques Duchateau, Dirk Van Compernolle
-
Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar Databases
Andrei Constantinescu, Olivier Bornet, Gilles Caloz, Gérard Chollet
-
Use of a Reliability Coefficient in Noise Cancelling by Neural Net and Weighted Matching Algorithms Nestor
Becérra Yoma, Fergus R. McInnes, Mervyn A. Jack
-
Likelihood Normalization Using an Ergodic HMM for Continuous Speech Recognition Kazuhiko Ozeki
-
Dynamic Control of a Production Model Laurence Candille, Henri Méloni
-
Speech Recognition Using Sub-word Units Dependent On Phonetic Contexts Of Both Training and Recognition
Vocabularies Hiroaki Hattori, Eiko Yamada
-
Hidden Markov Models Merging Acoustic and Articulatory Information to Automatic Speech Recognition Bruno
Jacob, Christine Senac
-
Creation of Unseen Triphones from Diphones and Monophones using a Speech Production Approach Mats
Blomberg, Kjell Elenius
-
Speaker-independent Dictation of Chinese Speech with 32K Vocabulary Bo Xu, Bing Ma, Shuwu Zhang, Fei
Qu, Taiyi Huang
-
Using Accent-specific Pronunciation Modelling for Robust Speech Recognition J.J. Humphries, P.C. Woodland,
D. Pearce
-
Dictionary Learning for Spontaneous Speech Recognition Tilo Sloboda, Alex Waibel
-
Comparison of Channel Normalisation Techniques for Automatic Speech Recognition Over the Phone Johan de
Veth, Louis Boves
-
Anchor Point Detection for Continuous Speech Recognition in Spanish: The Spotting of Phonetic Events Manuel
A. Leandro, Jose M. Pardo
-
Cepstral Compensation by Polynomial Approximation for Environment-independent Speech Recognition Bhiksha
Raj, Evandro B. Gouvêa, Pedro J. Moreno, Richard M. Stern
-
Effect of Speech Coders on Speech Recognition Performance B.T. Lilly, K.K. Paliwal
-
Wavelet Transforms For Non-uniform Speech Recogntion Systems Léonard Janer, Josep Martí, Climent Nadeu,
Eduardo Lleida-Solano
-
A Binaural Model as a Front-end for Isolated Word Recognition Tsuyoshi Usagawa, Markus Bodden, Klaus
Rateitschek
-
A New Speech Enhancement: Speech Stream Segregation Hiroshi G. Okuno, Tomohiro Nakatani, Takeshi
Kawabata
Chair: Alex Waibel, Carnegie Mellon University
-
Head Automata for Speech Translation Hiyan Alshawi
-
Word Clustering with Parallel Spoken Language Corpora Ye-Yi Wang, John Lafferty, Alex Waibel
-
Toward Translating Korean Speech Into Other Languages Jae-Woo Yang, Youngjik Lee
-
VERBMOBIL: The Evolution of a Complex Large Speech-to-Speech Translation System Thomas Bub, Johannes
Schwinn
-
Translation of Conversational Speech with JANUS-II Alon Lavie, Alex Waibel, Lori Levin, Donna Gates, Marsal
Gavaldà, Torsten Zeppenfeld, Puming Zhan, Oren Glickman
Chair: Yoshinori Sagisaka, ATR Interpreting Telecommunications Research Laboratory
-
Non-segmental Analysis and Synthesis Based on a Speech Database Andrew Slater, John Coleman
-
Microsegment Synthesis - Economic Principles in a Low-cost Solution Ralf Benzmüller, William J. Barry
-
Whistler: A Trainable Text-to-Speech System X.D. Huang, A. Acero, J. Adcock, H.W. Hon, J. Goldsmith, J.
Liu, Mike Plumpe
-
Generation of Multiple Synthesis Inventories by a Bootstrapping Procedure Thomas Portele, Karl-Heinz Stöber,
Horst Meyer, Wolfgang Hess
-
Modeling Segmental Duration in German Text-to-Speech Synthesis Bernd Möbius, Jan P.H. van Santen
-
Autolabelling Japanese ToBI Nick Campbell
Chair: Doug Reynolds, MIT Lincoln Laboratory
-
General Phrase Speaker Verification Using Sub-word Background Models and Likelihood-ratio Scoring S.
Parthasarathy, A.E. Rosenberg
-
Unknown-Multiple Signal Source Clustering Problem Using Ergodic HMM and Applied to Speaker Classification
J. Murakami, M. Sugiyama, H. Watanabe
-
GMM and ARVM Cooperation and Competition for Text-independent Speaker Recognition on Telephone Speech
J.-L. Le Floch, C. Montacié, M.-J. Caraty
-
Selective use of the Speech Spectrum and a VQGMM Method for Speaker Identification Qiguang Lin, Ea-Ee
Jan, ChiWei Che, Dong-Suk Yuk, James L. Flanagan
-
Speaker Verification through Large Vocabulary Continuous Speech Recognition Michael Newman, Larry Gillick,
Yoshiko Ito, Don McAllaster, Barbara Peskin
-
Predictive Neural Networks in Text Independent Speaker Verification: an Evaluation on the SIVA Database
Andrea Paoloni, Susanna Ragazzini, G. Ravaioli
Chair: Nick Campbell, ATR Interpreting Telecommunications Research Laboratory
-
Durational Characterstics of Hindi Consonant Clusters Nisheeth Shrotriya, Rajesh Verma, S.K. Gupta, S.S.
Agrawal
-
The Use of Wavelet Transforms in Phoneme Recognition Beng T. Tan, Minyue Fu, Andrew Spray, Phillip
Dermody
-
Acoustic Properties of Phonemes in Continuous Speech for Different Speaking Rate Hisao Kuwabara
-
Prosodic Parameterization of Spoken Japanese Based on a Model of the Generation Process of F0 Contours
Hiroya Fujisaki, Sumio Ohno
-
A Logistic Regression Model for Detecting Prominences Arman Maghbouleh
-
High-quality Prosodic Modification of Speech Signals Beat Pfister
Chair: Doug Whalen, Haskins Laboratories
-
On the Syllable Structures of Chinese Relating to Speech Recognition Jialu Zhang
-
Perceptual Assimilation of American English Vowels by Japanese Listeners W. Strange, Reiko Akahane-Yamada,
B.H. Fitzgerald, R. Kubo
-
Context and Speaker Effects in the Perceptual Assimilation of German Vowels by American Listeners W.
Strange, O.-S. Bohn, S. A. Trent, M.C. McNair, K.C. Bielec
-
Examination of a Perceptual Non-native Speech Contrast: Pharyngealized/Non-pharyngealized Discrimination by
French-speaking Adults Mohamed Zahid
-
Context-dependent Relevance of Burst and Transitions for Perceived Place in Stops: It's in Production, not Perception
Roel Smits
-
The Perception of Morae in Long Vowels Comparison Among Japanese, Korean and English Speakers Ryoji
Baba, Kaori Omuro, Hiromitsu Miyazono, Tsuyoshi Usagawa, Masahiko Higuchi
-
Juncture Cues to Disfluency Robin J. Lickley
-
Effects of Duration and Formant Movement on Vowel Perception James R. Sawusch
-
Benchmarking Human Performance for Continuous Speech Recognition N. Deshmukh, R.J. Duncan, A.
Ganapathiraju, J. Picone
-
Intelligibility of Speech with Filtered Time Trajectories of Spectral Envelopes Takayuki Arai, Misha Pavel, Hynek
Hermansky, Carlos Avendano
-
Perceptual Use of Vowel and Speaker Information in Breath Sounds D. H. Whalen, Sonya M. Sheffert
-
The Role of Neighborhood Relative Frequency in Spoken Word Recognition Philippe Mousty, Monique Radeau,
Ronald Peereman, Paul Bertelson
-
Transitional Probability and Phoneme Monitoring James M. McQueen, Mark A. Pitt
-
Identification of Vowel Features from French Stop Bursts Anne Bonneau
-
Listening in a Second Language Z.S. Bond, Thomas J. Moore, Beverley Gable
-
Acoustic Correlates to the Effects of Talker Variability on the Perception of English /r/ and /l/ by Japanese Listeners
James S. Magnuson, Reiko Akahane-Yamada
-
Perception of Lexical Tone Across Languages: Evidence for a Linguistic Mode of Processing Denis Burnham,
Elizabeth Francis, Di Webster, Sudaporn Luksaneeyanawin, Chayada Attapaiboon, Francisco Lacerda, Peter Keller
Chairs: H. Timothy Bunnell, Alfred I. duPont Institute; and Richard A. Foulds, Alfred I. duPont Institute
-
Natural Communication with Machines - Progress and Challenge James L. Flanagan