Automatic Pronunciation Assessment of Non-native English based on Phonological Analysis
- Authors:
- Escobar Grisales, Daniel
Ríos Urrego, Cristian David
Moreno Acevedo, Santiago Andrés
Pérez Toro, Paula Andrea
Noth, Elmar
Orozco Arroyave, Juan Rafael
- Resource type:
- http://purl.org/coar/resource_type/c_5794
- Publication date:
- 2023
- Institution:
- Universidad de Antioquia
- Repository:
- Repositorio UdeA
- Language:
- eng
- OAI Identifier:
- oai:bibliotecadigital.udea.edu.co:10495/37640
- Online access:
- https://hdl.handle.net/10495/37640
- Keywords:
- Speech
English language - pronunciation
Speech acts (linguistics)
English language
Phonetics
- Rights
- openAccess
- License
- https://creativecommons.org/licenses/by-nc-sa/4.0/
| Summary: | The rapid development of speech recognition systems has motivated the community to work on accent classification, considerably improving the performance of these systems. However, only a few works or tools have analyzed in depth both the accent and the pronunciation level of a person learning a non-native language. Our study aims to evaluate the pronunciation skills of non-native English speakers whose first language is Arabic, Chinese, Spanish, or French. We train a system on speech from native English speakers to compute posterior probabilities of phonological classes, and then evaluate whether these posteriors can discriminate between native and non-native English speakers. The posteriors of each phonological class are considered both separately and in combination. Phonemes with low posteriors are used to give the speaker feedback on which phonemes should be improved. The results suggest that it is possible to distinguish each of the non-native groups from native English speakers, with accuracies between 67.6% and 80.6%. According to our observations, the most discriminant phonological classes are alveolar, lateral, velar, and front. Finally, the paper introduces a graphical way to interpret the results phoneme by phoneme, so that speakers receive feedback about their pronunciation performance. |
|---|---|
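
The feedback step described in the summary (flagging phonemes with low posteriors) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function name `phoneme_feedback`, the input format (pairs of phoneme label and posterior value), and the 0.5 threshold are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean


def phoneme_feedback(frames, threshold=0.5):
    """Average posteriors per phoneme and flag the weak ones.

    frames: iterable of (phoneme, posterior) pairs, where posterior is
            assumed to be a value in [0, 1] (hypothetical input format).
    threshold: assumed cut-off below which a phoneme is flagged.
    """
    scores = defaultdict(list)
    for phoneme, posterior in frames:
        scores[phoneme].append(posterior)

    # Mean posterior per phoneme across all of its occurrences.
    averages = {p: mean(values) for p, values in scores.items()}

    # Phonemes to practice, weakest first.
    flagged = sorted(
        (p for p, avg in averages.items() if avg < threshold),
        key=averages.get,
    )
    return averages, flagged


# Toy data: made-up posteriors for three phonemes.
example_frames = [("t", 0.9), ("t", 0.7), ("l", 0.2), ("l", 0.4), ("k", 0.45)]
averages, flagged = phoneme_feedback(example_frames)
```

With the toy data above, `flagged` lists the phonemes whose average posterior falls under the assumed threshold, ordered from lowest to highest average, which is the kind of per-phoneme signal the summary says is returned to the speaker.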
