Thumbnail
Access Restriction
Open

Source CiteSeerX
Content type Text
File Format PDF
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Probability Model ♦ Formant Param ♦ Formant Parameter Vector ♦ Formant Parameter ♦ Formant Vector ♦ State Gaussian Mixture Density ♦ Non-uniform Frequency ♦ Phoneme-dependent Two-dimensional Hidden Markov Model ♦ Alternative Method ♦ Linear Prediction ♦ Formant Trajectory ♦ Spectral Mapping ♦ Voice Morphing ♦ Source Towards ♦ Warping Ratio ♦ Formant Distribution ♦ Experimental Evaluation ♦ Mapping Function ♦ Voice Conversion
Abstract This paper explores the estimation and mapping of probability models of formant parameter vectors for voice conversion. The formant parameter vectors consist of the frequency, bandwidth and intensity of resonance at formants. Formant parameters are derived from the coefficients of a linear prediction (LP) model of speech. The formant distributions are modelled with phoneme-dependent two-dimensional hidden Markov models with state Gaussian mixture densities. The HMMs are subsequently used for re-estimation of the formant trajectories of speech. Two alternative methods are explored for voice morphing. The first is a non-uniform frequency warping method and the second is based on spectral mapping via rotation of the formant vectors of the source towards those of the target. Both methods transform all formant parameters (Frequency, Bandwidth and Intensity). In addition, the factors that affect the selection of the warping ratios for the mapping function are presented. Experimental evaluation of voice morphing examples is presented. 1.
Educational Role Student ♦ Teacher
Age Range above 22 year
Educational Use Research
Education Level UG and PG ♦ Career/Technical Study
Learning Resource Type Article