Pitch range and identification of emotions in Spanish speech: A perceptual study

Authors

  • Juan María Garrido Almiñana
  • Juan Antonio Chica Sabariego

Keywords:

prosody, pitch, perception, emotion, Spanish

Abstract

 

This paper describes two perceptual experiments carried out to determine the contribution of pitch range to the identification of emotions in speech. Synthetic manipulated speech was used to observe the effect on emotion perception of two specific acoustic cues: global pitch range along utterances and local excursions at final rising-falling pitch movements. The obtained results suggest that increasing F0 (the main acoustic parameter related to pitch perception) in both cues, global and local pitch range, is associated by subjects to the expression of particular emotions, especially surprise (ESCP hypothesis), rather to a general expression of arousal (ACP hypothesis) or as a general cue for emotional content (EGP hypothesis), at least in Spanish.

References

BANZIGER, T. and SCHERER, K. R. (2005): «The role of intonation in emotional expressions», Speech Communication, 46, pp. 252-267.

BORRÀS-COMES, J.; VANRELL, M. M. and PRIETO, P. (2014): «The role of pitch range in establishing intonational contrasts», Journal of the International Phonetic Association, 44, pp. 1-20.

COWIE, R. and CORNELIUS, R. R. (2003): «Describing the emotional states that are expressed in speech», Speech Communication, 40 (1-2), pp. 5-32.

EKMAN, P. ; FRIESEN, W. V. and ELLSWORTH, P. (1982): «What emotion categories or dimensions can observers judge from facial behavior?», in P. Ekman (ed.): Emotion in the human face, New York, Cambridge University Press, pp. 39-55

FRANCISCO, V.; GERVÁS, P. and HERVÁS, R. (2005): «Análisis y síntesis de expresión emocional en cuentos leídos en voz alta», Procesamiento del Lenguaje Natural, 35, pp. 293-300.

GARRIDO, J. M. (1996): Modelling Spanish intonation for text-to-speech applications, tesis doctoral, Universitat Autònoma de Barcelona.

GARRIDO, J. M. (2001): «La estructura de las curvas melódicas del español: propuesta de modelización», Lingüística Española Actual, XXIII(2), pp. 173-209.

GARRIDO, J. M. (2010): «A tool for automatic F0 stylisation, annotation and modelling of large corpora», Speech Prosody 2010, Chicago, May 2010. http://speechprosody2010.illinois.edu/papers/100041.pdf [03/10/2017].

GARRIDO, J. M. (2011): «Análisis de las curvas melódicas del español en habla emotiva simulada», Estudios de Fonética Experimental, XX, pp. 205-255.

GARRIDO, J. M. (2013): «ModProso: A Praat-based tool for F0 prediction and modification», Proceedings of TRASP 2013, pp. 38-41. http://www.lpl-aix.fr/~trasp/Proceedings/19866-trasp2013.pdf [03/10/2017].

GARRIDO, J. M.; LAPLAZA, Y. and MARQUINA, M. (2012a): «On the use of melodic patterns as prosodic correlates of emotion in Spanish», Speech Prosody 2012, Shanghai. http://isle.illinois.edu/sprosig/sp2012/uploadfiles/ file/sp2012_submission_57.pdf [03/10/2017].

GARRIDO, J. M.; LAPLAZA, Y., MARQUINA, M., PEARMAN, A., ESCALADA, J. G., RODRÍGUEZ, M. A. and ARMENTA, A. (2012b): «The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech», LREC 2012 Proceedings, pp. 1197-1202. http://www.lrec-conf.org/proceedings/lrec2012/pdf/865_Paper.pdf [03/10/2017].

HOZJAN, V.; KACIC, Z., MORENO, A., BONAFONTE, A. and NOGUEIRAS, A. (2002): «Interface databases: Design and collection of a multilingual emotional speech database», in Proceedings of the Third Int. Conference on Language Resources and Evaluation (LREC'02), Las Palmas de Gran Canaria, pp. 2024-2028.

IRIONDO, I.; GUAUS, R., RODRÍGUEZ, A., LÁZARO, P., MONTOYA, N., BLANCO, J. M., BERNADAS, D., OLIVER, J. M., TENA, D. and LONGUI, L. (2000): «Validation of an acoustical modelling of emotional expression in Spanish using speech synthesis techniques», in Proceedings of the ISCA Workshop on Speech and Emotion, Newcastle, pp. 161-166.

JUSLIN, P. and LAUKKA, P. (2003): «Communication of emotions in vocal expression and music performance: Different channels, same code?», Psychological Bulletin, 129, pp. 770-814.

LADD, S.; SIVERMAN, K., BERGMANN, G. and SCHERER, K. (1985): «Evidence for independent function of intonation contour type, voice quality, and F0 in signalling speaker affect», Journal of the Acoustical Society of America, 78(2), pp. 435-444.

LAUKKA, P. (2004): Vocal expression of emotion, discrete-emotions and dimensional accounts, Uppsala, Uppsala University.

LAUKKA, P., JUSLIN, P. and BRESI, R. (2005): «A dimensional approach to vocal expression of emotion», Cognition and Emotion, 19(5), pp. 633-653.

LIEBERMAN, P. and MICHAELS, S. D. (1962): «Some aspects of fundamental frequency, envelope amplitude and the emotional content of speech», Journal of the Acoustical Society of America, 34, pp. 922-927.

MARTÍNEZ, H. and ROJAS, D. (2011): «Prosodia y emociones: datos acústicos, velocidad de habla y percepción de un corpus actuado», Lengua y Habla, 15, pp. 59-72.

MONTERO, J. M. (2003): Estrategias para la mejora de la naturalidad y la incorporación de variedad emocional a la conversión texto a voz en castellano, tesis doctoral, Universidad Politécnica de Madrid.

MONTERO, J. M.; GUTIÉRREZ-ARRIOLA, J., COLÁS, J., ENRÍQUEZ, E. and PARDO, J. M. (1999): «Analysis and modelling of emotional speech in Spanish», in Proceedings of the 14th International Conference of Phonetics, San Francisco, pp. 957-960.

MOZZICONACCI, S. J. (1995): «Pitch variations and emotions in speech», in Proceedings of the 13th International Congress of Phonetic Sciences, ICPhS-95, Stockholm, August 13-19, 1995, Vol. 1, pp. 178-181.

MOZZICONACCI, S. J. (1998): Speech Variability and Emotion: Production and Perception, tesis doctoral, Technische Universiteit Eindhoven.

MOZZICONACCI, S. J. and HERMES, D. J. (1999): «Role of intonation patterns in conveying emotion in speech», in Proceedings ICPhS99, San Francisco, pp. 2001-2004.

NAVARRO TOMÁS, T. (1944): Manual de entonación española, New York, Hispanic Institute on the United States.

PEREIRA, C. (2000): «Dimensions of emotional meaning in speech», in Proceedings of the ISCA ITRW on Speech and Emotion, Newcastle, 5-7 September 2000, pp. 25-28.

RODRÍGUEZ, A.; LAZARO, P., MONTOYA, N., BLANCO, J. M., BERNADAS, D., OLIVER, J. M. and LONGHI, L. (1999): «Modelización acústica de la expresión emocional en el español», Procesamiento del Lenguaje Natural, 25, pp. 159-166.

RUSSELL, J. A. (1980): «A circumplex model of affect», Journal of Personality and Social Psychology, 39(6), pp. 1161-1178.

SCHERER, K. R. (1979): «Personality markers in speech», in K. R. Scherer and H. Giles (eds.), Social Markers in Speech, Cambridge, Cambridge University Press, pp. 147-210.

SCHERER, K. R. (2003): «Vocal communication of emotion: A review of research paradigms», Speech Communication, 40, pp. 227-256.

SCHERER, K. R.; LADD, D. R. AND SILVERMAN, K. (1984): «Vocal cues to speaker affect: Testing two models», Journal of the Acoustical Society of America, 76, pp. 1346-1356.

SCHRÖDER, M.; COWIE, R., DOUGLAS-COWIE, E., WESTERDIJK, M. and GIELEN, S. (2001): «Acoustic correlates of emotion dimensions in view of speech synthesis», in Proceedings of Eurospeech 2001, Aalborg, pp. 87-90.

‘T HART, J.; COLLIER, R. and COHEN, A. (1990): A Perceptual Study of Intonation. An Experimental-phonetic Approach to Speech Melody, Cambridge, Cambridge University Press.

WHISSELL, C. (1989): «The dictionary of affect in language», in R. Plutchik and H. Kellerman (eds.): Emotion: Theory, Research and Experience, 4, New York, Academic Press, pp. 113-131.

Published

2018-02-09

How to Cite

Garrido Almiñana, J. M., & Chica Sabariego, J. A. (2018). Pitch range and identification of emotions in Spanish speech: A perceptual study. Journal of Experimental Phonetics, 27, 13–36. Retrieved from https://revistes.ub.edu/index.php/experimentalphonetics/article/view/44060

Issue

Section

Articles