Thumbnail
Access Restriction
Open

Author Wrench, Alan A. ♦ Richmond, Korin
Source CiteSeerX
Content type Text
File Format PDF
Language English
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Timit Corpus ♦ Acoustic Signal ♦ Significant Improvement ♦ Phone Accuracy ♦ Acoustic Input ♦ Phone Recognition Accuracy ♦ Continuous Speech Recognition ♦ Useful Supplementary Information ♦ Preliminary Attempt ♦ Articulatory Data ♦ Measurable Information ♦ Articulatory Feature Space ♦ Speaker Dependent Phone Recognition Task ♦ Principal Component Analysis ♦ Articulatory System
Description In this paper we show that there is measurable information in the articulatory system which can help to disambiguate the acoustic signal. We measure directly the movement of the lips, tongue, jaw, velum and larynx and parameterise this articulatory feature space using principal components analysis. The parameterisation is developed and evaluated using a speaker dependent phone recognition task on a specially recorded TIMIT corpus of 460 sentences. The results show that there is useful supplementary information contained in the articulatory data which yields a small but significant improvement in phone recognition accuracy of 2%. However, preliminary attempts to estimate the articulatory data from the acoustic signal and use this to supplement the acoustic input have not yielded any significant improvement in phone accuracy. 1.
in Proc. ICSLP 2000
Educational Role Student ♦ Teacher
Age Range above 22 year
Educational Use Research
Education Level UG and PG ♦ Career/Technical Study
Learning Resource Type Article
Publisher Date 2000-01-01