Visualizing melody with multiple acoustic and tagging values using the visualization module of the Oralstats tool

Adrián Cabedo Nebot

Authors

Adrián Cabedo Nebot https://orcid.org/0000-0002-3881-9308

Keywords:

Visualization tools, Melody, Combined factors, R tool, Github

Abstract

This paper presents a way to visualize pitch patterns combiningmacoustic features (F0, intensity or duration) with other variables, like a basic notation on ToBI (Tone and Break Indices) or the projection of acoustic transformations, following MAS (Melodic Analysis of Speech) model. This visualization and the previous data transformation leading to it have been carried out with Oralstats, a tool developed in R that is conceived to merge speech transcriptions with prosodic, linguistic, and other variables. Here, the multiple melodic visualizations available in Oralstats are exemplified with intonational phrases taken from a corpus of YouTubers. The complete interactive dashboard is freely available on Github.

References

Boersma, P., & Weenink, D. (2021). Praat, version 6.1.53. Computer program. Retrieved from: http://www.Praat.org/

Bigi, B. (2015). SPPAS: Multi-lingual approaches to the automatic annotation of speech. The Phonetician: International Society of Phonetic Sciences, 111-112, 54-69.

Cabedo, A. (2021). Oralstats. https://github.com/acabedo/oralstats

Cantero, F. J. (2002). Teoría y análisis de la entonación. Universitat de Barcelona.

Cantero, F. J. (2019). Análisis prosódico del habla: Más allá de la melodía. In M. R. Álvarez Silva, A. Muñoz Alvarado, & L. Ruiz (Eds.), Comunicación social: Lingüística, medios masivos, arte, etnología, folclor y otras ciencias afines (pp. 485-498). Centro de Lingüística Aplicada.

Cantero, F. J., & Font-Rotchés, D. (2009). Melodic analysis of speech method (MAS) applied to Spanish and Catalan. Phonica, 5, 33-47.

Cantero, F. J., & Mateo Ruiz, M. (2011). Análisis melódico del habla: Complejidad y entonación en el discurso. Oralia, 14, 105-128.

Chang, W., Cheng, J., Allaire, J. J., Sievert, C., Schloerke, B., Xie, Y., Allen, J., McPherson, J., Dipert, A., & Borges, B. (2021). Shiny: Web application framework for r. https://CRAN.R-project.org/package=shiny

Domínguez, M., Latorre, I., Farrús, M., CodinaFilbà, J., & Wanner, L. (2016). Praat on the Web: an upgrade of Praat for semiautomatic speech annotation. In Y. Matsumoto, & R. Prasad (Eds.), Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016), Osaka, Japan (pp. 218-222). The COLING 2016 Organizing Committee.

Elvira-García, W., Roseano, P., Fernández-Planas, A. M., & Martínez-Celdrán, E. (2016). A tool for automatic transcription of intonation: Eti_ToBI; a ToBI transcriber for Spanish and Catalan. Language Resources and Evaluation, 50(4), 767-792.

Estebas, E., & Prieto, P. (2008). La notación prosódica del español: Una revisión del SpToBI. Estudios de Fonética Experimental, 17, 263-283.

Garrido, J. M. (2003). La escuela holandesa: El modelo IPO. In P. Prieto (Ed.), Teorías de la entonación (pp. 97-122). Ariel.

Garrido, J. M. (2012). Análisis fonético de los patrones melódicos locales en español: Patrones entonativos. Revista Española de Lingüística, 42(2), 95-126.

Garrido, J. M. (2018). Using large corpora and computational tools to describe prosody: An exciting challenge for the future with some (important) pending problems to solve. In I. Feldhausen, J. Fliessbach, & M. M. Vanrell (Eds.), Methods in prosody: A Romance language perspective (pp. 3-43). Language Science Press.

Hidalgo, A. (2019). Sistema y uso de la entonación en español hablado. Universidad Andrés Hurtado.

Hirst, D. (2007). A Praat plugin for Momel and INTSINT with improved algorithms for modelling and coding intonation. In J. Trouvain (Ed.), Proceedings of the 16th International Congress of Phonetic Sciences (ICPhS XVI), Saarbrücken, Germany. Universität des Saarlandes.

Hirst, D. (2015). ProZed: A speech prosody editor for linguists, using analysis-by-synthesis. In H. Keikichi, & T. Jianhua (Eds.), Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis (pp. 3-17). Springer.

Hirst, D., Di Cristo, A., & Espesser, R. (2000). Levels of Representation and Levels of Analysis for the Description of Intonation Systems. In M. Horne (Ed.), Prosody: Theory and Experiment. Text, Speech and Language Technology, Springer.

Mateo Ruiz, M. (2010). Protocolo para la extracción de datos tonales y curva estándar en Análisis Melódico del Habla (AMH). Phonica, 6, 49-90.

Mateo Ruiz, M. (2013). De melodías y variedades del español. Phonica, 9, 14-18.

Mertens, P. (2004). The prosogram: Semi-automatic transcription of prosody based on a tonal perception model. In B. Bel, & I. Marlien (Eds.), Proceedings of Speech Prosody 2004, Nara, Japan. SProSIG.

Pierrehumbert, J. (1980). The phonology and phonetics of english intonation. Doctoral Dissertation. Massachusetts Institute of Technology, United States of America.

Quilis, A. (1999). Tratado de fonología y fonética españolas. Gredos.

Quilis, A., Cantarero, M., & Esgueva, M. (1993). El grupo fónico y el grupo de entonación en español hablado. Revista de Filología Española, 73, 55-65.

R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.r-project.org/

Rosenberg, A. (2010). AuToBI: A tool for automatic ToBI annotation. In K. Hirose (Ed.), Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), Makuhari, Chiba, Japan. ISCA.

Shriberg, E., Bates, R., Stolcke, A., Taylor, P., Jurafsky, D., Ries, K., Coccaro, N., Martin, R., Meteer, M., & van Ess-Dykema, C. (1998). Can prosody aid the automatic classification of dialog acts in conversational speech? Language and Speech, 41(3), 443-492.

’t Hart, J., Collier, R., & Cohen, A. (1990). A perceptual study of intonation: An experimental-phonetic approach to speech melody. Cambridge University Press.

Tench, P. (1996). The intonation systems of English. Cassell.

Vnijs, V. (2016). Radiant, business analytics using R and shiny. https://vnijs.Github.io/radiant/

Xu, Y. (2013). ProsodyPro: A tool for large-scale systematic prosody analysis. In B. Bigi, & D. Hirst (Eds.), Proceedings of Tools and Resources for the Analysis of Speech Prosody (TRASP 2013), Aix-en-Provence, France (pp. 7-10). Laboratoire Parole et Langage.

Visualizing melody with multiple acoustic and tagging values using the visualization module of the Oralstats tool

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Information

Make a Submission