Volume 2 Contents



FrA1L1 -- Utterance Verification and Word Spotting

Chair: Jay Wilpon, AT&T Labs - Research
  1. A User-Configurable System for Voice Label Recognition Richard C. Rose, Eduardo Lleida, G.W. Erhart, R.V. Grubbe
  2. Keyword Spotting Enhancement for Video Soundtrack Indexing Philippe Gelin, Chris. J. Wellekens
  3. New Efficient Fillers for Unlimited Word Recognition and Keyword Spotting Rachida El Méliani, Douglas O'Shaughnessy
  4. Automatic Transcription of General Audio Data: Preliminary Analyses Michelle S. Spina, Victor Zue
  5. Transcribing Radio News Francis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard Schwartz
  6. Correcting Recognition Errors via Discriminative Utterance Verification Anand R. Setlur, Rafid A. Sukkar, John Jacob


FrA1L2 -- Acquisition/Learning Training L2 Learners

Chair: Grace H. Yeni-Komshian, University of Maryland
  1. Does Training in Speech Perception Modify Speech Production? Reiko Akahane-Yamada, Yoh'ichi Tohkura, Ann R. Bradlow, David B. Pisoni
  2. Phrase-Final Lengthening and Stress-Timed Shortening in the Speech of Native Speakers and Japanese Learners of English Motoko Ueyama
  3. Japanese Accentuations by Foreign Students and Japanese Speakers of Non-Tokyo Dialect Nobuko Yamada
  4. Devoicing of Japanese Vowels by Taiwanese Learners of Japanese J. Kevin Varden, Tsutomu Sato
  5. Fluency and Use of Segmental Dialect Features in the Acquisition of a Second Language (French) by English Speakers Danièle Archambault, Catherine Foucher, Blagovesta Maneva
  6. Estimating Child and Adolescent Formant Frequency Values From Adult Data P. Martland, S.P. Whiteside, Steve W. Beet, L. Baghai-Ravary


FrA1L3 -- Focus, Stress and Accent

Chair: Elizabeth Shriberg, SRI International
  1. Acoustic Correlates of Linguistic Stress and Accent in Dutch and American English Agaath M.C. Sluijter, Vincent J. van Heuven
  2. On the Levels of Accentuation in Spoken Japanese Hiroya Fujisaki, Sumio Ohno, Osamu Tomita
  3. Tonal Distinctions Between Emphatic Stress and Pretonic Lengthening in Quebec French Linda Thibault, Marise Ouellet
  4. Distinction Between 'Normal' Focus and 'Contrastive/Emphatic' Focus Anja (Petzold) Elsner
  5. Perception of Tonal Accent by Americans Learning Japanese Yukihiro Nishinuma, Masako Arai, Takako Ayusawa
  6. Modeling Intra-Speaker Pitch Range Variation: Predicting F0 Targets when "Speaking Up" Elizabeth Shriberg, D. Robert Ladd, Jacques Terken


FrA1P1 -- Spoken Language Dialogue and Conversation

Chair: Alicia Abella, AT&T Labs - Research
  1. Predicting Dialogue Acts for a Speech-To-Speech Translation System Norbert Reithinger, Ralf Engel, Michael Kipp, Martin Klesen
  2. Automatic Speech Translation Based on the Semantic Structure Johannes Müller, Holger Stahl, Manfred Lang
  3. A Methodology for Application Development for Spoken Language Systems Lewis M. Norton, Carl E. Weir, K.W. Scholz, Deborah A. Dahl, Ahmed Bouzid
  4. A New Restaurant Guide Conversational System: Issues in Rapid Prototyping for Specialized Domains Stephanie Seneff, Joseph Polifroni
  5. Semantic Interpretation of a Japanese Complex Sentence in an Advisory Dialogue - Focused on the Postpositional Word "KEDO,'' Which Works as a Conjunction Between Clauses Tadahiko Kumamoto, Akira Ito
  6. A Korean Morphological Analyzer for Speech Translation System Youngkuk Hong, Myoung-Wan Koo, Gijoo Yang
  7. Generic and Domain-specific Aspects of the Waxholm NLP and Dialog Modules Rolf Carlson, Sheri Hunnicutt
  8. A Real-Time System for Summarizing Human-Human Spontaneous Spoken Dialogues Megumi Kameyama, Goh Kawai, Isao Arima
  9. Evaluation of Spoken Language Understanding and Dialogue Systems Bernd Hildebrandt, Heike Rautenstrauch, Gerhard Sagerer
  10. Inter-Speaker Interaction of F0 in Dialogs Kuniko Kakita
  11. A Robust Dialogue System for Making an Appointment Hans Brandt-Pook, Gernot A. Fink, Bernd Hildebrandt, Franz Kummert, Gerhard Sagerer
  12. Segmentation of Spoken Dialogue by Interjections, Disfluent Utterances and Pauses Kazuyuki Takagi, Shuichi Itahashi
  13. A Form-Based Dialogue Manager for Spoken Language Applications David Goddeau, Helen Meng, Joe Polifroni, Stephanie Seneff, Senis Busayapongchai
  14. The Design of Complex Telephony Applications Using Large Vocabulary Speech Technology S.J. Whittaker, D.J. Attwater
  15. Building 10,000 Spoken Dialogue Systems Stephen Sutton, David G. Novick, Ronald A. Cole, Pieter Vermeulen, Jacques de Villiers, Johan Schalkwyk, Mark Fanty
  16. Speaker Intention Modeling for Large Vocabulary Mandarin Spoken Dialogues Yen-Ju Yang, Lee-Feng Chien, Lin-Shan Lee
  17. Hybrid Language Models and Spontaneous Legal Discourse P.E. Kenne, Mary O'Kane
  18. Topic Change and Local Perplexity in Spoken Legal Dialogue P.E. Kenne, Mary O'Kane
  19. Intonational Cues to Discourse Structure in Japanese Jennifer J. Venditti, Marc Swerts
  20. Principles for the Design of Cooperative Spoken Human-Machine Dialogue Niels Ole Bernsen, Hans Dybkjær, Laila Dybkjær
  21. Development and Comparison of Three Syllable Stress Classifiers Karen L. Jenkin, Michael S. Scordilis


FrA1P2 -- Speech Disorders

Chair: Don Jamieson, University of Western Ontario
  1. Interaction of Speech Disorders with Speech Coders: Effects on Speech Intelligibility D.G. Jamieson, Li Deng, M. Price, Vijay Parsa, J. Till
  2. Detecting Arytenoid Cartilage Misplacement through Acoustic and Electroglottographic Jitter Analysis Maurílio N. Vieira, Arnold G. D. Maran, Fergus R. McInnes, Mervyn A. Jack
  3. Robust F0 and Jitter Estimation in Pathological Voices Maurílio N. Vieira, Fergus R. McInnes, Mervyn A. Jack
  4. Speech Monitoring of Infective Laryngitis F. Plante, H. Kessler, B.M.G. Cheetham, J. Earis
  5. Searching for Nonlinear Relations in Whitened Jitter Time Series J. Schoentgen, R. De Guchteneere
  6. Vocal Fold Pathology Assessment using AM Autocorrelation Analysis of the Teager Energy Operator Liliana Gavidia-Ceballos, John H.L. Hansen, James F. Kaiser
  7. Continuous Positive Airway Pressure (CPAP) in the Treatment of Hypernasality David P. Kuehn
  8. Enhancement of Alaryngeal Speech by Adaptive Filtering Carol Y. Espy-Wilson, Venkatesh R. Chari, Caroline B. Huang
  9. Simulation of Disordered Speech Using a Frequency-Domain Vocal Tract Model Li Deng, Xuemin Shen, D.G. Jamieson, J. Till
  10. A Stochastic Model of Fundamental Period Perturbation and Its Application to Perception of Pathological Voice Quality Yasuo Endo, Hideki Kasuya
  11. A Screening Test for Speech Pathology Assessment Using Objective Quality Measures Eric J. Wallen, John H.L. Hansen
  12. Recent Advances in Hypernasal Speech Detection using the Nonlinear Teager Energy Operator Douglas A. Cairns, John H.L. Hansen, James F. Kaiser


FrA1S1 -- Vocal Tract Geometry I

Chair: Maureen Stone, University of Maryland at Baltimore
  1. Human Palate and Related Structures: Their Articulatory Consequences Kiyoshi Honda, Shinji Maeda, Michiko Hashi, Jim Dembowski, John R. Westbury
  2. A Continuum Mechanics Representation of Tongue Deformation Edward P. Davis, Andrew Douglas, Maureen Stone
  3. From MRI and Acoustic Data to Articulatory Synthesis: A Case Study of the Lateral Approximants in American English Philbert Bangayan, Abeer Alwan, Shrikanth Narayanan
  4. Liquids in Tamil Shrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan


FrA2L1 -- Prosody in ASR and Segmentation

Chair: Keikichi Hirose, University of Tokyo
  1. Modeling Hyperarticulate Speech during Human-computer Error Resolution Sharon Oviatt, Gina-Anne Levow, Margaret MacEachern, Karen Kuhn
  2. Using Stress to Disambiguate Spoken Thai Sentences Containing Syntactic Ambiguity Siripong Potisuk, Mary P. Harper, Jackson T. Gandour
  3. Use of Prosodic Information to Integrate Acoustic and Linguistic Knowledge in Continuous Mandarin Speech Recognition with Very Large Vocabulary Hung-yun Hsieh, Ren-yuan Lyu, Lin-shan Lee
  4. Word Boundary Detection using Pitch Variations G.V. Ramana Rao, J. Srichand
  5. Detection of Phrase Boundaries in Japanese by Low-Pass Filtering of Fundamental Frequency Contours Atsuhiro Sakurai, Keikichi Hirose
  6. A New Method for Speech Delexicalization, and its Application to the Perception of French Prosody V. Pagel, N. Carbonell, Yves Laprie


FrA2L2 -- Acquisition and Learning by Machine

Chair: Allen L. Gorin, AT&T Labs - Research
  1. Task Adaptation for Dialogues Via Telephone Lines Udo Bub
  2. The Influence of Bigram Constraints on Word Recognition by Humans: Implications for Computer Speech Recognition Ronald A. Cole, Yonghong Yan, Troy Bailey
  3. ALICE: Acquisition of Language In Conversational Environment - An Approach to Weakly Supervised Training of Spoken Language System for Language Porting Tetsunori Kobayashi
  4. Pitch Pattern Clustering of User Utterances in Human-Machine Dialogue Takashi Yoshimura, Satoru Hayamizu, Hiroshi Ohmura, Kazuyo Tanaka
  5. Simplifying Language through Error-correcting Decoding J.C. Amengual, E. Vidal, J.M. Benedí
  6. A Mixed Approach to Speech Understanding Mauro Cettolo, Anna Corazza, Renato De Mori


FrA2L3 -- Dialogue Systems

Chair: Esther Levin, AT&T Labs - Research
  1. Speech Recognition for an Information Kiosk J.L. Gauvain, J.J. Gangolf, L. Lamel
  2. Localizing an Automatic Inquiry System for Public Transport Information Helmer Strik, Albert Russel, Henk van den Heuvel, Catia Cucchiarini, Louis Boves
  3. Prompt Constrained Natural Language - Evolving the Next Generation of Telephony Services Stephen M. Marcus, Deborah W. Brown, Randy G. Goldberg, Max S. Schoeffler, William R. Wetzel, Richard R. Rosinski
  4. Key-Phrase Detection and Verification for Flexible Speech Understanding Tatsuya Kawahara, Chin-Hui Lee, Biing-Hwang Juang
  5. Interactive Recovery from Speech Recognition Errors in Speech User Interfaces Bernhard Suhm, Brad Myers, Alex Waibel
  6. Estimation of Language Models for New Spoken Language Applications Sunil Issar


FrA2P1 -- Speech Enhancement and Robust Processing

Chair: Richard Stern, Carnegie Mellon University
  1. H-infinity Filtering for Speech Enhancement Xuemin Shen, Li Deng, Anisa Yasmin
  2. A Comparitive Analysis of Channel-Robust Features and Channel Equalization Methods for Speech Recognition Saeed V. Vaseghi, Ben Milner
  3. Robust Speech Recognition Features Based on Temporal Trajectory Filtering of Frequency Band Spectrum Jia-lin Shen, Wen-liang Hwang, Lin-shan Lee
  4. Durational Modelling for Improved Connected Digit Recognition Kevin Power
  5. Study on the Dereverberation of Speech Based on Temporal Envelope Filtering Carlos Avendano, Hynek Hermansky
  6. Estimating Markov Model Structures Thorsten Brants
  7. A Fertility Channel Model for Post-Correction of Continuous Speech Recognition Eric K. Ringger, James F. Allen
  8. Restoration of Wide Band Signal from Telephone Speech using Linear Prediction Error Processing Hiroshi Yasukawa
  9. Smoothed Spectral Subtraction for a Frequency-Weighted HMM in Noisy Speech Recognition Hiroshi Matsumoto, Noboru Naitoh
  10. A Simple Architecture for using Multiple Cues in Sound Separation William S. Woods, Martin Hansen, Thomas Wittkop, Birger Kollmeier
  11. On the Robust Automatic Segmentation of Spontaneous Speech Bojan Petek, Ove Andersen, Paul Dalsgaard
  12. Bayesian Adaptation of Speech Recognizers to Field Speech Data C.G. Miglietta, C. Mokbel, D. Jouvet, J. Monné
  13. Sub-band Adaptive Filtering Applied to Speech Enhancement A. J. Darlington, D. J. Campbell
  14. Noise Robust Estimate of Speech Dynamics for Speaker Recognition J.P. Openshaw, John S. Mason
  15. Overview of Speech Enhancement Techniques for Automatic Speaker Recognition Javier Ortega-García, Joaquín González-Rodríguez
  16. Dynamic Features for Segmental Speech Recognition Naomi Harte, Saeed V. Vaseghi, Ben Milner
  17. Speech Recognition Based on a Model of Human Auditory System Takuya Koizumi, Mikio Mori, Shuji Taniguchi
  18. APVQ Encoder Applied to Wideband Speech Coding J.M. Salavedra, E. Masgrau
  19. Simple Fast Vector Quantization of the Line Spectral Frequencies Jin Zhou, Yair Shoham, Ali Akansu


FrA2S1 -- Vocal Tract Geometry II

Chair: Maureen Stone, University of Maryland at Baltimore
  1. Speaker Individualities of Vocal Tract Shapes of Japanese Vowels Measured by Magnetic Resonance Images Chang-Sheng Yang, Hideki Kasuya
  2. Vocal Tract Acoustics Using the Transmission Line Matrix (TLM) Method S. El-Masri, X. Pelorson, P. Saguet, P. Badin
  3. Building Sensori-motor Prototypes from Audiovisual Exemplars Gérard Bailly
  4. Parameterized VT Area Function Inversion Mats Båvegård, Gunnar Fant
  5. An Improved Vocal Tract Model of Vowel Production Implementing Piriform Resonance and Transvelar Nasal Coupling Jianwu Dang, Kiyoshi Honda
  6. Pseudo-articulatory Speech Synthesis for Recognition using Automatic Feature Extraction from X-Ray Data C. S. Blackburn, S. J. Young


FrP1L1 -- Speaker Adaptation and Normalization I

Chair: Chin-Hui Lee, Bell Labs - Lucent Technologies
  1. N-best-based Instantaneous Speaker Adaptation Method for Speech Recognition Tomoko Matsui, Sadaoki Furui
  2. Mixture Splitting Technic and Temporal Control in a HMM-based Recognition System C. Montacié, M.-J. Caraty, C. Barras
  3. A Unified Spectral Transformation Adaptation Approach for Robust Speech Recognition Lei Yao, Dong Yu, Taiyi Huang
  4. On-line Adaptive Learning of the Correlated Continuous Density Hidden Markov Models for Speech Recognition Qiang Huo, Chin-Hui Lee
  5. Speaker Adaptation by Modeling the Speaker Variation in a Continuous Speech Recognition System Nikko Ström
  6. An Enquiring System of Unknown Words in TV News by Spontaneous Repetition (Application of Speaker Normalization by Speaker Subspace Projection) Yasuo Ariki, Shigeaki Tagashira


FrP1L2 -- Spoken Language and NLP I

Chair: Adam L. Buchsbaum, AT&T Labs - Research
  1. Language Understanding using Hidden Understanding Models Richard Schwartz, Scott Miller, David Stallard, John Makhoul
  2. Processing of Semantic Information in Fluently Spoken Language Allen L. Gorin
  3. Automatic Linguistic Segmentation of Conversational Speech Andreas Stolcke, Elizabeth Shriberg
  4. Towards Understanding Spontaneous Speech: Word Accuracy vs. Concept Accuracy M. Boros, W. Eckert, Florian Gallwitz, G. Görz, G. Hanrieder, Heinrich Niemann
  5. A Stochastic Case Frame Approach for Natural Language Understanding Wolfgang Minker, S.K. Bennacef, J.L. Gauvain
  6. Improving Speech Understanding by Incorporating Database Constraints and Dialogue History Frank Seide, Bernhard Rüber, Andreas Kellner


FrP1L3 -- Spoken Discourse Analysis/Synthesis

Chair: Jan P. van Santen, Bell Labs - Lucent Technologies
  1. A New Discourse Structure Model for Spontaneous Spoken Dialogue Tetsuro Chino, Hiroyuki Tsuboi
  2. An Architecture for Spoken Dialogue Management David Duff, Barbara Gates, Susann LuperFoy
  3. Pausing Strategies in Discourse in Dutch Monique E. van Donzel, Florien J. Koopmans-van Beinum
  4. Filled Pauses as Markers of Discourse Structure Marc Swerts, Anne Wichmann, Robbert-Jan Beun
  5. The Prosodic Analysis of Korean Dialogue Speech - Through a Comparative Study with Read Speech Cheol-jae Seong, Minsoo Hahn
  6. Changing the Topic: How Long Does it Take? Mary O'Kane, P.E. Kenne


FrP1P1 -- Acoustic Modeling I

Chair: Ilija Zeljkovic, AT&T Labs - Research
  1. Learning Pronunciation Dictionary from Speech Data Christian-Michael Westendorf, Jens Jelitto
  2. The Trended HMM with Discriminative Training for Phonetic Classification C. Rathinavelu, Li Deng
  3. Improving Decision Trees for Acoustic Modeling Ariane Lazaridès, Yves Normandin, Roland Kuhn
  4. An Improved Training Algorithm in HMM-based Speech Recognition Gongjun Li, Taiyi Huang
  5. Speech Recognition Using a Strong Correlation Assumption for the Instantaneous Spectra J. Ming, P. O'Boyle, J. McMahon, F. J. Smith
  6. On Parameter Filtering in Continuous Subword-unit-based Speech Recognition Pau Pachès-Leal, Climent Nadeu
  7. Estimation of Statistical Phoneme Center Considering Phonemic Environments Shigeki Okawa, Katsuhiko Shirai
  8. Integration of Context-dependent Durational Knowledge into HMM-based Speech Recognition Xue Wang, Louis F. M. ten Bosch, Louis C. W. Pols
  9. Speech Recognition Based on Acoustically Derived Segment Units T. Fukada, M. Bacchiani, K.K. Paliwal, Yoshinori Sagisaka
  10. Robust Gender-dependent Acoustic-phonetic Modelling in Continuous Speech Recognition Based on a New Automatic Male/Female Classification Rivarol Vergin, Azarshid Farhat, Douglas O'Shaughnessy
  11. A Codebook Adaptation Algorithm for SCHMM Using Formant Distribution Tae Young Yang, Won Ho Shin, Weon Goo Kim, Dae Hee Youn
  12. Parameter Tying for Flexible Speech Recognition J. Simonin, S. Bodin, D. Jouvet, K. Bartkova
  13. Word-spotting Based on Inter-word and Intra-word Diphone Models Tsuneo Nitta, Shin'ichi Tanaka, Yasuyuki Masai, Hiroshi Matsu'ura
  14. Duration Modeling with Expanded HMM Applied to Speech Recognition Antonio Bonafonte, Josep Vidal, Albino Nogueiras
  15. Different Strategies for Distribution Clustering using Discrete, Semicontinuous and Continuous HMMs in CSR Ricardo de Córdoba, José M. Pardo
  16. Improved HMM Phone and Triphone Models for Realtime ASR Telephony Applications Ilija Zeljkovic, Shrikanth Narayanan
  17. Improved Extended HMM Composition by Incorporating Power Variance Yasuhiro Minami, Sadaoki Furui
  18. Optimal Filtering and Smoothing for Speech Recognition using a Stochastic Target Model Gordon Ramsay, Li Deng
  19. Speech Recognition Using Syllable-Like Units Zhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A. Cole


FrP1S1 -- Physics and Simulation of the Vocal Tract I

Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology
  1. Search for Unexplored Effects in Speech Production C.H. Coker, M.H. Krane, B.Y. Reis, R.A. Kubli
  2. Computational Models for Speech Generation S. Levinson
  3. Articulatory Synthesis from X-rays and Inversion for an Adaptive Speech Robot P. Badin, C. Abry


FrP2L1 -- Speaker Adaptation and Normalization II

Chair: Aaron E. Rosenberg, AT&T Labs - Research
  1. Adaptive Recognition Method Based on Posterior Use of Distribution Pattern of Output Probabilities Jin-Song Zhang, Beiqian Dai, Changfu Wang, Hingkeung Kwan, Keikichi Hirose
  2. Iterative Unsupervised Adaptation Using Maximum Likelihood Linear Regression P.C. Woodland, D. Pye, M.J.F. Gales
  3. A Compact Model for Speaker-Adaptive Training Tasos Anastasakos, John McDonough, Richard Schwartz, John Makhoul
  4. Iterative Unsupervised Speaker Adaptation for Batch Dictation Shigeru Homma, Jun-ichi Takahashi, Shigeki Sagayama
  5. Rapid Unsupervised Adaptation to Children's Speech on a Connected-Digit Task Daniel C. Burnett, Mark Fanty
  6. Speaker Adaptation Using Tree Structured Shared-State HMMs Jun Ishii, Masahiro Tonomura, Shoichi Matsunaga


FrP2L2 -- Spoken Language and NLP II

Chair: David Roe, AT&T Labs - Research
  1. Learning to Parse Spontaneous Speech Finn Dag Buo, Alex Waibel
  2. Spontaneous Speech and Natural Language Processing ALPES: A Robust Semantic-led Parser Jean-Yves Antoine
  3. The Natural Language Processing Module for a Voice Assisted Operator at Telefónica I+D J. Alvarez-Cercadillo, J. Caminero-Gil, C. Crespo-Casas, D. Tapias-Merino
  4. Compound Words in Large-Vocabulary German Speech Recognition Systems André Berton, Pablo Fetter, Peter Regel-Brietzmann
  5. Prosody, Empty Categories and Parsing - A Success Story Anton Batliner, A. Feldhaus, S. Geissler, T. Kiss, Ralf Kompe, Elmar Nöth
  6. "Almost Parsing" Technique for Language Modeling B. Srinivas


FrP2L3 -- Duration and Rhythm

Chair: Dik J. Hermes, Institute for Perception Research / IPO
  1. From Segmental Duration Properties to Rhythmic Structure: A Study of Interactions Between High and Low Level Constraints Marise Ouellet, Benoît Tardif
  2. Analysis of Context-dependent Segmental Duration for Automatic Speech Recognition Xue Wang, Louis C. W. Pols, Louis F. M. ten Bosch
  3. The Role of the Rhythmic Groups in the Segmentation of Continuous French Speech Delphine Dahan
  4. The Implications of Temporal Patterns for the Prosody of Boundary Signaling in Connected Speech Zita McRobbie-Utasi
  5. Experimental Phonetic Study of the Syllable Duration of Korean with Respect to the Positional Effect Hyunbok Lee, Cheol-jae Seong
  6. Timing of Pitch Movements and Accentuation of Syllables Dik J. Hermes


FrP2P1 -- Acoustic Analysis

Chair: Peggy Nelson, University of Maryland at Baltimore
  1. A Probabilistic Approach to AMDF Pitch Detection Goangshiuan S. Ying, Leah H. Jamieson, Carl D. Michell
  2. From Sagittal Cut to Area Function: An RMI Investigation Alain Soquet, Véronique Lecuit, Thierry Metens, Didier Demolin
  3. Pitch Detection and Voiced/Unvoiced Decision Algorithm Based on Wavelet Transforms Léonard Janer, Juan José Bonet, Eduardo Lleida-Solano
  4. Decomposition of Speech Signals into a Deterministic and a Stochastic Part Yannis Stylianou
  5. Improved Glottal Closure Instant Detector based on Linear Prediction and Standard Pitch Concept Cheol-Woo Jo, Ho-Gyun Bang, W.A. Ainsworth
  6. Analysis of Speech Segments using Variable Spectral/Temporal Resolution Xihong Wang, Stephen A. Zahorian, Stefan Auberg
  7. Time-based Clustering for Phonetic Segmentation Brian Eberman, William Goldenthal
  8. Formant Analysis Using Mixtures of Gaussians Parham Zolfaghari, Tony Robinson
  9. Deriving Articulatory Representations from Speech with Various Excitation Modes Hywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle
  10. "Blind" Speech Segmentation: Automatic Segmentation of Speech Without Linguistic Knowledge Manish Sharma, Richard J. Mammone
  11. Speech Synthesis Using a Nonlinear Energy Damping Model for the Vocal Folds Vibration Effect Hiroshi Ohmura, Kazuyo Tanaka
  12. Neural Networks Learning with L1 Criteria and Its Efficiency in Linear Prediction of Speech Signals Munehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida
  13. Preprocessing and Neural Classification of English Stop Consonants [b,d,g,p,t,k] A. Esposito, C. E. Ezin, M. Ceccarelli
  14. A Comparison of Modified k-means(MKM) and NN based Real Time Adaptive Clustering Algorithms for Articulatory Space Codebook Formation K.S. Ananthakrishnan
  15. A Novel Approach to the Estimation of Voice Source and Vocal Tract Parameters from Speech Signals Wen Ding, Hideki Kasuya
  16. Syllable Detection in Read and Spontaneous Speech Hartmut R. Pfitzinger, Susanne Burger, Sebastian Heid
  17. Maximum Likelihood Learning of Auditory Feature Maps for Stationary Vowels Kuansan Wang, Chin-Hui Lee, Biing-Hwang Juang
  18. Explicit Segmentation of Speech using Gaussian Models Antonio Bonafonte, Albino Nogueiras, Antonio Rodriguez-Garrido
  19. A Comparison of Several Recent Methods of Fundamental Frequency and Voicing Decision Estimation E. Mousset, W.A. Ainsworth, José A. R. Fonollosa
  20. Robust Pitch Estimation with Harmonics Enhancement in Noisy Environments Based on Instantaneous Frequency Toshihiko Abe, Takao Kobayashi, Satoshi Imai
  21. Integrated Polispectrum on Speech Recognition Asunción Moreno, Miquel Rutllán


FrP2S1 -- Physics and Simulation of the Vocal Tract II

Chairs: Qiguang Lin, IBM Watson Research; and Johan Liljencrants, Royal Institute of Technology
  1. Analysis of Acoustic Properties of the Nasal Tract Using 3-D FEM Hisayoshi Suzuki, Takayoshi Nakai, Hirosi Sakakibara
  2. Experiments with Analysis By Synthesis of Glottal Airflow Johan Liljencrants