Pronunciation Tutor for Deaf Children based on ASR

Authors

  • Yunling Bai

DOI:

https://doi.org/10.54097/hset.v24i.3903

Keywords:

ASR; Deaf Children Education; Machine Learning.

Abstract

ASR, whose full name is Automated Speech Recognition, is a technology that converts human speech into text. Speech recognition, a multidisciplinary field, is closely related to acoustics, phonetics, linguistics, digital signal processing theory, information theory, computer science and other disciplines. ASR has been applied in educational technology such as deaf children's education in this day and age. This paper makes a preview of a project in which a computer-aided tutor for deaf children instruction based on the speech recognition technology. This tutor utilizes three effective models and is combined with data mining technology. Two evaluation approaches and overview of embedded experiment are also detailed in this paper.

Downloads

Download data is not yet available.

References

C. Y. Hin, A. Yu On Lam and A. Wong Yiu Leung, "Translanguaging in Hong Kong Deaf Signers: Translating Meaning from Written Chinese," Sign Language Studies, vol. 22, (3), pp. 430-483, 2022.

G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, et al., "Deep Neural Networks for Acoustic Modeling in Speech Recognition", IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.

T. N. Sainath, O. Vinyals, A. Senior and H. Sak, "Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks," 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015, pp. 4580-4584, doi: 10.1109/ICASSP.2015.7178838.

T. N. Sainath, A. Mohamed, B. Kingsbury and B. Ramabhad-ran, "Deep Convolutional Neural Networks for LVCSR", Proc. ICASSP, 2013.

H. Sak, A. Senior and F. Beaufays, "Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling", Proc. Interspeech, 2014.

Chen, P. H., Lin, C. J., & Schölkopf, B. (2005). A tutorial on ν‐support vector machines. Applied Stochastic Models in Business and Industry, 21(2), 111-136.

Tam, Yik-Cheung & Mostow, Jack & Beck, Joseph & Banerjee, Satanjeev. (2003). Training a confidence measure for a reading tutor that listens. 10.21437/Eurospeech.2003-790.

J. Mostow, S. Roth, A. G. Hauptmann, and M. Kane, “A Prototype Reading Coach that Listens [AAAI-94 Outstanding Paper Award],” in Proceedings of the Twelfth National Conference on Artificial Intelligence, pp. 785–792, 1994. Seattle, WA: American Association for Artificial Intellige.

J. MOSTOW and J. BECK, "Some useful tactics to modify, map and mine data from intelligent tutors," Natural Language Engineering, vol. 12, (2), pp. 195-208, 2006.

Beck, Joseph E., et al. "Does help help? Introducing the Bayesian Evaluation and Assessment methodology." International conference on intelligent tutoring systems. Springer, Berlin, Heidelberg, 2008

Mostow, Jack, et al. "Evaluation of an automated Reading Tutor that listens: Comparison to human tutoring and classroom instruction." Journal of Educational Computing Research 29.1 (2003): 61-117.

Downloads

Published

27-12-2022

How to Cite

Bai, Y. (2022). Pronunciation Tutor for Deaf Children based on ASR. Highlights in Science, Engineering and Technology, 24, 119-124. https://doi.org/10.54097/hset.v24i.3903