Volume 3 Contents
Chair: Nelson Morgan, ICSI and University of California, Berkeley
- An Incremental Speaker-Adaptation Technique for Hybrid HMM-MLP Recognizer
Joao P. Neto, Ciro A. Martins, Luís B. Almeida
- Phoneme Segmentation of Continuous Speech using Multi-layer Perceptron
Youngjoo Suh, Youngjik Lee
- Stochastic Perceptual Speech Models with Durational Dependence
Jeff Bilmes, Nelson Morgan, Su-Lin Wu, Hervé Bourlard
- Boosting the Performance of Connectionist Large Vocabulary Speech Recognition
G.D. Cook, A.J. Robinson
- HMMs and OWE Neural Network for Continuous Speech Recognition
Nicolas Pican, Dominique Fohr, Jean-François Mari
- Smoothed Local Adaptation of Connectionist Systems
Steve Waterhouse, Dan Kershaw, Tony Robinson
Chair: Tony Robinson, Cambridge University
- Robust Speech Recognition with Speaker Localization by a Microphone Array
Takeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano
- Sound Source Localization in Reverberant Environments using an Outlier Elimination Algorithm
Ea-Ee Jan, James L. Flanagan
- The 1995 Abbot LVCSR System for Multiple Unknown Microphones
Dan Kershaw, Tony Robinson, Steve Renals
- Experiments of Speech Recognition in a Noisy and Reverberant Environment using a Microphone Array and HMM Adaptation
D. Giuliani, M. Omologo, P. Svaizer
- Increasing Robustness in GMM Speaker Recognition Systems for Noisy and Reverberant Speech with Low Complexity Microphone Arrays
Joaquín González-Rodríguez, Javier Ortega-García, César Martin, Luis Hernández
- Robust Automatic Speech Recognition Using a Multi-channel Signal Separation Front-End
Kuan-Chieh Yen, Yunxin Zhao
Chair: Mark Steedman, University of Pennsylvania
- Prosody Generation in Text-to-Speech Conversion Using Dependency Graphs
Anders Lindström, Ivan Bretan, Mats Ljungqvist
- Extraction Method of Non-restrictive Modification in Japanese as a Marked Factor of Prosody
Hisako Asano, Hisashi Ohara, Yoshifumi Ooyama
- Modeling Contrast in the Generation and Synthesis of Spoken Language
Scott Prevost
- A Left-to-right Processing Model of Pausing in Japanese Based on Limited Syntactic Information
Hajime Tsukada
- Modeling of Intonation Bearing Emphasis for TTS-Synthesis of Greek Dialogues
D. Galanis, V. Darsinos, G. Kokkinakis
- Synthesizing Prosody: a Prominence-based Approach
Barbara Heuft, Thomas Portele
Chair: Thierry Dutoit, Faculté Polytechnique de Mons
- Multilingual Text Analysis for Text-to-Speech Synthesis
Richard Sproat
- Spoken-style Explanation Generator for Japanese Kanji using a Text-to-speech System
Yoshifumi Ooyama, Hisako Asano, Koji Matsuoka
- A Method for Estimating Prosodic Symbol from Text for Japanese Text-To-Speech Synthesis
Ken-ichi Magata, Tomoki Hamagami, Mitsuo Komura
- Statistical Methods in Data-driven Modeling of Spanish Prosody for Text to Speech
E. López-Gonzalo, J.M. Rodríguez-García
- Intonation Processing for TTS Using Stylization and Neural Network Learning Method
Jung-Chul Lee, Youngjik Lee, Sang-Hun Kim, Minsoo Hahn
- Generating F0 Contours from ToBI Labels using Linear Regression
Alan W. Black, Andrew J. Hunt
- The Broad Study of Homograph Disambiguity for Mandarin Speech Synthesis
Wern-Jun Wang, Shaw-Hwa Hwang, Sin-Horng Chen
- The MBROLA project: Towards a Set of High Quality Speech Synthesizers Free of Use for Non Commercial Purposes
T. Dutoit, V. Pagel, N. Pierret, F. Bataille, O. Van der Vrecken
- Training Data Selection for Voice Conversion Using Speaker Selection and Vector Field Smoothing
Makoto Hashimoto, Norio Higuchi
- A New Voice Transformation Method Based on Both Linear and Nonlinear Prediction Analysis
Ki Seung Lee, Dae Hee Youn, Il Whan Cha
- On the Transformation of the Speech Spectrum for Voice Conversion
G. Baudoin, Yannis Stylianou
- Spectral Analysis of Synthetic Speech and Natural Speech with Noise over the Telephone Line
Cristina Delogu, Andrea Paoloni, Susanna Ragazzini, Paola Ridolfi
- A New Speech Synthesis System Based on the ARX Speech Production Model
Weizhong Zhu, Hideki Kasuya
- Speech Synthesis Using the CELP Algorithm
Geraldo Lino de Campos, Evandro Bacci Gouvêa
- A Mandarin Text-to-Speech System
Shaw-Hwa Hwang, Sin-Horng Chen, Yih-Ru Wang
- Residual-based Speech Modification Algorithms for Text-to-Speech Synthesis
M.D. Edgington, A. Lowry
- A Generalized LR Parser for Text-to-speech Synthesis
Per Olav Heggtveit
- Enhanced Shape-invariant Pitch and Time-scale Modification for Concatenative Speech Synthesis
M.P. Pollard, B.M.G. Cheetham, C.C. Goodyear, M.D. Edgington, A. Lowry
- An Excitation Synchronous Pitch Waveform Extraction Method and its Application to the VCV-concatenation Synthesis of Japanese Spoken Words
Yasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura, Takashi Honda
- A New Chinese Text-to-Speech System with High Naturalness
Ren-Hua Wang, Qinfeng Liu, Difei Tang
- Voice Conversion Based on Topological Feature Maps and Time-variant Filtering
Ansgar Rinscheid
Chair: Reiko A. Yamada, ATR Human Information Processing Research Laboratories
- Language Training System Utilizing Speech Modification
Meron Yoram, Keikichi Hirose
- Perception of English /r/ and /l/ Speech Contrasts by Native Korean Listeners with Extensive English-language Experience
D.G. Jamieson, K. Yu
- Automatic Text-independent Pronunciation Scoring of Foreign Language Student Speech
Leonardo Neumeyer, Horacio Franco, Mitchel Weintraub, Patti Price
- Assessing the Contribution of Instructional Technology in the Teaching of Pronunciation
Antônio Simoes
- Detection of Foreign Speakers' Pronunciation Errors for Second Language Training - Preliminary Results
Maxine Eskenazi
- Foreign Accent in Intonation Patterns - A Contrastive Study Applying a Quantitative Model of the F0 Contour
Hansjörg Mixdorff
- Input Modality Effects in Foreign Accent
Duncan J. Markham, Yasuko Nagano-Madsen
Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble
- For Speech Perception by Humans or Machines, Three Senses are Better than One
Lynne E. Bernstein, Christian Benoît
- A Few Factors Which Affect the Degree of Incorporating Lip-read Information into Speech Perception
Kaoru Sekiyama, Yoh'ichi Tohkura, Michio Umeda
- Characterizing Audiovisual Information During Speech
E. Vatikiotis-Bateson, K.G. Munhall, Y. Kasahara, F. Garcia, H. Yehia
- The Implications of the Tadoma Method of Speechreading for Spoken Language Processing
Charlotte M. Reed
- Seeing Speech in Space and Time: Psychological and Neurological Findings
Ruth Campbell
Chair: Paul Taylor, University of Edinburgh
- What's in the "Pure" Prosody?
Volker Strom, Christina Widera
- F0 Declination in Read-aloud and Spontaneous Speech
Marc Swerts, Eva Strangert, Mattias Heldner
- Prediction of Prosodic Phrase Boundaries Considering Variable Speaking Rate
Yeon-jun Kim, Yung-hwan Oh
- Prediction of F0 Parameter of Contextualized Utterances in Dialogue
Yoichi Yamashita, Riichiro Mizoguchi
- The Production and Perception of Potentially Ambiguous Intonation Contours by Speakers of Russian and Japanese
V. Makarova, J. Matsui
- What is Invariant and What is Optional in the Realization of a FOCUSED Word? A Cross-dialectal Study of Swedish Sentences With Moving Focus
Robert Eklund
Chair: Christine Shadle, University of Southampton
- Quantifying Spectral Characteristics of Fricatives
Christine H. Shadle, Sheila J. Mair
- Acoustic Characteristics of Ejectives in Ingush
Natasha Warner
- An Acoustic Profile of Consonant Reduction
R.J.J.H. van Son, Louis C. W. Pols
- Devoicing in Post-vocalic Canadian-French Obstruants
Danièle Archambault, Blagovesta Maneva
- Paying Attention to Speaking Rate
Alexander L. Francis, Howard C. Nusbaum
- The Lack of Invariance Problem and the Goal of Speech Perception
Irene Appelbaum
Chair: Harriet S. Magen, Rhode Island College
- The Acoustic Structure of Vowels in Mothers' Speech to Infants and Adults
Jean E. Andruski, Patricia K. Kuhl
- Acoustical Characteristics of Sound Production of Deaf and Normally Hearing Infants
Chris J. Clement, Florien J. Koopmans-van Beinum, Louis C. W. Pols
- Learning Non-native Vowel Categories
John Kingston, Christine Bartels, José Benkí, Deanna Moore, Jeremy Rice, Rachel Thorburn, Neil Macmillan
- Word Recognition by Japanese Infants
P.A. Halle, Toshisada Deguchi, Yuji Tamekawa, B. Boysson-Bardies, Shigeru Kiritani
- Investigations of the Word Segmentation Abilities of Infants
Peter W. Jusczyk
- Developmental Change in Perception of Clause Boundaries by 6- and 10-Month-old Japanese Infants
Akiko Hayashi, Yuji Tamekawa, Toshisada Deguchi, Shigeru Kiritani
Chair: Carol Espy-Wilson, Boston University
- A Frequency Domain Method for Parametrization of the Voice Source
Paavo Alku, Erkki Vilkman
- Glottal Correlates of the Word Stress and the Tense/Lax Opposition in German
Krzysztof Marasek
- Coarticulatory Stability in American English /r/
Suzanne Boyce, Carol Y. Espy-Wilson
- An MRI-based Analysis of the English /r/ and /l/ Articulations
Shinobu Masaki, Reiko Akahane-Yamada, Mark K. Tiede, Yasuhiro Shimada, Ichiro Fujimoto
- Does Lexical Stress or Metrical Stress Better Predict Word Boundaries in Dutch?
David van Kuijk
- Optopalatograph (OPG): A New Apparatus for Speech Production Analysis
A. A. Wrench, A. D. McIntosh, W. J. Hardcastle
- Prediction of Vowel Systems using a Deductive Approach
René Carré
- Distinctions Between [t] and [tch] using Electropalatography Data
Sheila J. Mair, Celia Scully, Christine H. Shadle
- Relating Formants and Articulation in Intelligibility Test Words
Michiko Hashi, Raymond D. Kent, John R. Westbury, Mary J. Lindstrom
- The Role of Coarticulation in the Perception of Vowel Quality in Modern Standard Arabic
Imad Znagui, Mohamed Yeou
- Updating the Reading EPG
Simon Arnfield, Wilf Jones
- Lexical Stress Detection on Stress-minimal Word Pairs
Goangshiuan S. Ying, Leah H. Jamieson, Ruxin Chen, Carl D. Mitchell
- An Acoustic Study of the Interaction Between Stressed and Unstressed Syllables in Spoken Mandarin
Jing Wang
- Automatic Detection of Accent Nuclei at the Head of Words for Speech Recognition
Nobuaki Minematsu, Seiichi Nakagawa
- Automatic Generation of Prosodic Structure for High Quality Mandarin Speech Synthesis
Fu-chiang Chou, Chiu-yu Tseng, Lin-shan Lee
- A Study on Japanese Prosodic Pattern and its Modeling in Restricted Speech
Tomoki Hamagami, Ken-ichi Magata, Mitsuo Komura
- A Phonetic Study of Focus in Intransitive Verb Sentences
Steve Hoskins
- Variation in Vocal Fold Vibration Associated with Prosodic Conditions
Shigeru Kiritani, Hiroshi Imagawa, Seiji Niimi
- Goethe for Prosody
Stefan Rapp
- Prosodic Cues in Syntactically Ambiguous Strings; An Interactive Speech Planning Mechanism
K.A. Straub
- A Functional Model for Generation of the Local Components of F0 Contours in Chinese
Jinfu Ni, Ren-Hua Wang, Deyu Xia
- The Acquisition of Voiceless Stops in the Interlanguage of Second Language Learners of English and Spanish
Marie Fellbaum
- Jaw Contribution to Timing Control of "Guttural" Consonants Production
Ahmed M. Elgendy
Chairs: Lynne E. Bernstein, House Ear Institute; and Christian Benoît, ICP-Grenoble
- Studies of the McGurk Effect: Implications for Theories of Speech Perception
Kerry P. Green
- Using the Visual Component in Automatic Speech Recognition
N. M. Brooke
- Perceptual Organization of Speech in One and Several Modalities: Common Functions, Common Resources
Robert E. Remez
- Multi-modal Encoding of Speech in Memory: A First Report
David B. Pisoni, Helena M. Saldaña, Sonya M. Sheffert
Chair: Klaus R. Scherer, University of Geneva
- Word Class Driven Synthesis of Prosodic Annotations
Simon Arnfield
- Dynamical Modelling of Vowel Sounds as a Synthesis Tool
M. Banbrook, S. McLaughlin
- Emotional Speech Elicited using Computer Games
Tom Johnstone
- Automatic Statistical Analysis of the Signal and Prosodic Signs of Emotion in Speech
Roddy Cowie, Ellen Douglas-Cowie
- Recognizing Emotion in Speech
Frank Dellaert, Thomas Polzin, Alex Waibel
- Emotions in Time Domain Synthesis
Barbara Heuft, Thomas Portele, Monika Rauth
Chair: Candy Kamm, AT&T Labs - Research
- Evaluating Automatic Speech Recognition as a Component of a Multi-input Device Human-computer Interface
B.A. Mellor, C. Baber, C. Tunley
- Data Collection for the MASK Kiosk: WOz vs Prototype System
A. Life, I. Salter, J.N. Temem, F. Bernard, S. Rosset, S.K. Bennacef, Lori Lamel
- An Experimental Japanese/English Interpreting Video Phone System
M. Karaorman, T.H. Applebaum, T. Itoh, M. Endo, Y. Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S. Pearson, J.-C. Junqua
- User Participation and Compliance in Speech Automated Telecommunications Applications
Sara Basson, Stephen Springer, Cynthia Fong, Hong Leung, Ed Man, Michele Olson, John Pitrelli, Ranvir Singh, Suk Wong
- Embedding Speech in Web Interfaces
Samuel Bayer
- Voice-activated Home Banking System and its Field Trial
Toshihiro Isobe, Masatoshi Morishima, Fuminori Yoshitani, Nobuo Koizumi, Ken'ya Murakami
Chair: Juergen Schroeter, AT&T Labs - Research
- A Text Analyzer for Korean Text-to-Speech Systems
Sangho Lee, Yung-Hwan Oh
- Design and Evaluation of a Phonological Phrase Parser for Spanish Text-to-Speech
Helen E. Karn
- Comparison of Two Tree-Structured Approaches for Grapheme-to-Phoneme Conversion
Ove Andersen, Roland Kuhn, Ariane Lazaridès, Paul Dalsgaard, Jürgen Haas, Elmar Nöth
- A Recurrent Network that Learns to Pronounce English Text
M.J. Adamson, R.I. Damper
- Archisegment-based Letter-to-Phone Conversion for Concatenative Speech Synthesis in Portuguese
Eleonora Cavalcante Albano, Agnaldo Antonio Moreira
- A New Method of Generating Speech Synthesis Units Based on Phonological Knowledge and Clustering Technique
Yuki Yoshida, Shin'ya Nakajima, Kazuo Hakoda, Tomohisa Hirokawa
Chair: Louis Boves, Nijmegen University
- Consistency in Transcription and Labelling of German Intonation with GToBI
Martine Grice, Matthias Reyelt, Ralf Benzmüller, Jörg Mayer, Anton Batliner
- Syntactic-prosodic Labeling of Large Spontaneous Speech Data-bases
Anton Batliner, R. Kompe, A. Kiessling, H. Niemann, E. Nöth
- Relationship Between Discourse Structure and Dynamic Speech Rate
Florien J. Koopmans-van Beinum, Monique E. van Donzel
- Using Prosodic Clues to Decide When to Produce Back-channel Utterances
Nigel Ward
- Dialog Act Classification with the Help of Prosody
Marion Mast, Ralf Kompe, Stefan Harbeck, Andreas Kiessling, Heinrich Niemann, Elmar Nöth, E. G. Schukat-Talamazzini, V. Warnke
- Using Lexical Stress in Continuous Speech Recognition for Dutch
David van Kuijk, Henk van den Heuvel, Louis Boves
Chair: Sadaoki Furui, NTT Human Interface Lab
- Automatic Accent Classification of Foreign Accented Australian English Speech
Karsten Kumpf, Robin W. King
- Discriminative Adaptation for Speaker Verification
F. Korkmazskiy, Biing-Hwang Juang
- Perceptual Features of Unknown Foreign Languages as Revealed by Multi-dimensional Scaling
V. Stockmal, D. Muljani, Z.S. Bond
- On-line Incremental Adaptation for Speaker Verification using Maximum Likelihood Estimates of CDHMM Parameters
Kin Yu, John S. Mason
- Combining Methods to Improve Speaker Verification Decision
Dominique Genoud, Frédéric Bimbot, Guillaume Gravier, Gérard Chollet
- Incremental Speaker Adaptation with Minimum Error Discriminative Training for Speaker Identification
Cesar Martín del Alamo, J. Alvarez, C. de la Torre, F.J. Poyatos, L. Hernández
- Frame Level Likelihood Normalization for Text-independent Speaker Identification using Gaussian Mixture Models
Konstantin P. Markov, Seiichi Nakagawa
- On Using Prosodic Cues in Automatic Language Identification
Ann E. Thymé-Gobbel, Sandra E. Hutchins
- Speaker Recognition Model using Two-dimensional Mel-Cepstrum and Predictive Neural Network
Tadashi Kitamura, Shinsai Takei
- Unknown Language Rejection in Language Identification System
Hingkeung Kwan, Keikichi Hirose
- Spoken Language Identification using Large Vocabulary Speech Recognition
James L. Hieronymus, Shubha Kadambe
- Accent Identification
Carlos Teixeira, Isabel M. Trancoso, António Serralheiro
- Comparison of Text-independent Speaker Recognition Methods on Telephone Speech with Acoustic Mismatch
Sarel van Vuuren
- On the Sources of Inter- and Intra-speaker Variability in the Acoustic Dynamics of Speech
Xue Yang, J. Bruce Millar, Iain Macleod
- Language Identification with Inaccurate String Matching
Kay M. Berkling, Etienne Barnard
- Robust Prosodic Features for Speaker Identification
M.J. Carey, E.S. Parris, H. Lloyd-Thomas, S.J. Bennett
- Text Independent Speaker Identification on Noisy Environments by Means of Self Organizing Maps
E. Monte, J. Hernando, X. Miró, A. Adolf
- Language-identification Using Language-dependent Phonemes and Language-independent Speech Units
Paul Dalsgaard, Ove Andersen, Hanne Hesselager, Bojan Petek
Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Faculté Polytechnique De Mons
- Introduction to SWB
Jordan Cohen
- Disfluencies in SWB
Elizabeth Shriberg
- Error Analysis and Disfluency Modeling
Ronald Rosenfeld
- Fast Sparse Data Training/Portability
Andreas Stolcke
- Phrase Structure Language Models
Salim Roukos
- Language Modeling Issues for Spanish
Herbert Gish
- SRI Speaking Mode Experiments
Andreas Stolcke
Chair: Klaus R. Scherer, University of Geneva
- Adding the Affective Dimension: A New Look in Speech Analysis and Synthesis
Klaus R. Scherer
- Ethological Theory and the Expression of Emotion in the Voice
John J. Ohala
- Synthesizing Emotions in Speech: Is it Time to Get Excited?
Iain R. Murray, John L. Arnott
Chair: Richard Rose, AT&T Labs - Research
- A Study on Task-independent Subword Selection and Modeling for Speech Recognition
Chin-Hui Lee, Biing-Hwang Juang, Wu Chou, J.J. Molina-Perez
- Simultaneous ANN Feature and HMM Recognizer Design using String-based Minimum Classification Error (MCE) Training
Mazin G. Rahim, Chin-Hui Lee
- Quantizing Mixture-weights in a Tied-mixture HMM
Sunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen
- Variance Compensation within the MLLR Framework for Robust Speech Recognition and Speaker Adaptation
M.J.F. Gales, D. Pye, P.C. Woodland
- Maximum-likelihood Stochastic Matching Approach to Non-linear Equalization for Robust Speech Recognition
A.C. Surendran, Chin-Hui Lee, Mazin G. Rahim
- Estimation of Channel Bias for Telephone Speech Recognition
Jen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min Lee
Chair: Bernd Moebius, Bell Labs - Lucent Technologies
- Synthesis of English Intonation using Explicit Models of Reading and Spontaneous Speech
M. E. Johnson
- Generating Intonation by Superposing Gestures
Yann Morlec, Gérard Bailly, Véronique Aubergé
- Implementation and Evaluation of a Model for Synthesis of Swedish Intonation
Merle Horne, Marcus Filipsson
- Natural Prosody Generation for Domain Specific Text-to-Speech Systems
Nobuyuki Katae, Shinta Kimura
- Improving Text-to-Speech Synthesis
Mark Tatham, Eric Lewis
- Synthesis of Stressed Speech from Isolated Neutral Speech Using HMM-based Models
Sahar E. Bou-Ghazale, John H.L. Hansen
- Modeling Segment Intonation for Slovene TTS System
Ales Dobnikar
Chair: David G. Novick, European Institute of Cognitive Sciences and Engineering
- Word Predictability After Hesitations: A Corpus-based Study
Elizabeth Shriberg, Andreas Stolcke
- Interruptions and Intonation
Li-chiung Yang
- On not Recognizing Disfluencies in Dialogue
Robin J. Lickley, Ellen Gurman Bard
- A Theory of Word Frequencies and its Application to Dialogue Move Recognition
Phil Garner, Sue Browning, Roger Moore, Martin Russell
- Utterance Units and Grounding in Spoken Dialogue
David R. Traum, Peter A. Heeman
- Coordinating Turn-taking with Gaze
David G. Novick, Brian Hansen, Karen Ward
Chair: Bruce M. Buntschuh, AT&T Labs - Research
- BABEL: An Eastern European Multi-language Database
Peter Roach, Simon Arnfield, W. Barry, J. Baltova, M. Boldea, A. Fourcin, W. Gonet, R. Gubrynowicz, E. Hallum, L. Lamel, K. Marasek, A. Marchal, E. Meister, K. Vicsi
- USTC95---A Putonghua Corpus
Ren-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu
- Telephone Data Collection using the World Wide Web
Edward Hurley, Joseph Polifroni, James Glass
- The "SIVA" Speech Database for Speaker Verification: Description and Evaluation
M. Falcone, A. Gallo
- A Multi-level Description of Date Expressions in German Telephone Speech
Christoph Draxler
- Viterbi Search Visualization Using Vista: A Generic Performance Visualization Tool
Robert H. Halstead Jr., Ben Serridge, Jean-Manuel Van Thong, William Goldenthal
- A Multilingual Phonetic Representation and Analysis System for Different Speech Databases
Toomas Altosaar, Matti Karjalainen, Martti Vainio
- FRESCO: The French Telephone Speech Data Collection - Part of the European SpeechDat(M) Project
D. Langmann, R. Haeb-Umbach, Louis Boves, E. den Os
- Predicting the Out-of-Vocabulary Rate and the Required Vocabulary Size for Speech Processing Applications
Johannes Müller, Holger Stahl, Manfred Lang
- AMULET: Automatic MUltisensor Speech Labelling and Event Tracking: Study of the Spatio-temporal Correlations in Voiceless Plosive Production
Nathalie Parlangeau, Alain Marchal
- Constructing Multi-level Speech Database for Spontaneous Speech Processing
Minsoo Hahn, Sanghun Kim, Jung-Chul Lee, Yong-Ju Lee
- Preliminaries to a Romanian Speech Database
Marian Boldea, Alin Doroga, Tiberiu Dumitrescu, Maria Pescaru
- Labelled Data Bank of Spoken Standard German: The Kiel Corpus of Read/Spontaneous Speech
Klaus J. Kohler
- SAPPHIRE: An Extensible Speech Analysis and Recognition Tool Based on Tcl/Tk
Lee Hetherington, Michael McCandless
- Automatic Detection of Topic Boundaries and Keywords in Arbitrary Speech Using Incremental Reference Interval-free Continuous DP
Jiro Kiyama, Yoshiaki Itoh, Ryuichi Oka
- Very-large-vocabulary Mandarin Voice Message File Retrieval using Speech Queries
Bo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee
- Gandalf - A Swedish Telephone Speaker Verification Database
H. Melin
- The DCIEM Map Task Corpus: Spontaneous Dialogue Under Sleep Deprivation and Drug Treatment
Ellen Gurman Bard, C. Sotillo, A. H. Anderson, M. M. Taylor
- The Nemours Database of Dysarthric Speech
Xavier Menéndez-Pidal, James B. Polikoff, Shirley M. Peters, Jennie E. Leonzio, H.T. Bunnell
- POST: Parallel Object-oriented Speech Toolkit
Jean Hennebert, Dijana Petrovska Delacrétaz
Chairs: Ronald Rosenfeld, Carnegie Mellon University; and Hervé Bourlard, Faculté Polytechnique De Mons
- Insights into Spoken Language Gleaned from Phonetic Transcription of the Switchboard Corpus
Steven Greenberg
- Automatic Learning of Word Pronunciation from Data
Eric Fosler
- Modeling Systematic Variations in Pronunciation
Bill Byrne
- Speech Data Modeling
Nelson Morgan
- Linguistic Dependency Modeling
Andreas Stolcke
- Summary, Observations, and Plans for the Future
Fred Jelinek
Chair: Klaus R. Scherer, University of Geneva
- Discussion Period
Klaus R. Scherer