F0 and Segment Duration in Formant Synthesis of Speaker Age
Author
Summary, in English
This paper describes how F0 and segment duration were handled in the development of a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words, spoken by four differently aged female speakers of the same dialect and family, and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a first crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses and suggested further improvements. F0 and duration performed better than most other parameters.
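The weighted linear interpolation mentioned in the summary can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the two-speaker setup, and every age, F0, and duration value below are hypothetical and chosen only to show the arithmetic. The sketch assumes that each parameter of a target age is a weighted mix of the corresponding parameters of two reference speakers, with weights set by how close the target age is to each reference age.

```python
def interpolate_parameter(target_age, age_a, value_a, age_b, value_b):
    """Linearly interpolate one parameter between two reference speakers.

    The weight of each speaker grows as the target age approaches that
    speaker's age. All names here are illustrative assumptions.
    """
    if age_a == age_b:
        raise ValueError("reference ages must differ")
    weight_b = (target_age - age_a) / (age_b - age_a)
    weight_a = 1.0 - weight_b
    return weight_a * value_a + weight_b * value_b


def interpolate_speaker(target_age, speaker_a, speaker_b):
    """Interpolate every parameter shared by two speaker dictionaries."""
    shared = (set(speaker_a) & set(speaker_b)) - {"age"}
    return {
        name: interpolate_parameter(
            target_age,
            speaker_a["age"], speaker_a[name],
            speaker_b["age"], speaker_b[name],
        )
        for name in shared
    }


# Hypothetical reference speakers (values invented for illustration only).
younger = {"age": 20, "f0_hz": 200.0, "duration_ms": 80.0}
older = {"age": 40, "f0_hz": 180.0, "duration_ms": 100.0}

# A target age halfway between the two references gets equal weights.
target = interpolate_speaker(30, younger, older)
print(target)
```

In this sketch a target age outside the range of the two reference ages would extrapolate rather than interpolate; clamping the weights to the interval from 0 to 1 is one way to avoid that.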
Department/s
Publishing year
2006
Language
English
Pages
515-518
Publication/Series
Proc. of Speech Prosody
Document type
Conference paper
Publisher
Dresden
Topic
- General Language Studies and Linguistics
Conference name
Speech Prosody 2006
Conference date
2006-05-02 - 2006-05-05
Conference place
Dresden, Germany
Status
Published