Acoustic analysis of Spanish female deliberate creaky voice phonation

Authors

Keywords:

creaky, modal, fundamental frequency, harmonics, female, Spanish

Abstract

Creaky voice is a phonation type that can be produced by different laryngeal settings which are still under research. Forty young European Spanish female speakers with no previous training, recorded some samples in modal and creaky voice. Acoustic measures were extracted and analyzed using the Praat software in order to classify different phonatory strategies to produce a creaky voice. At first, we analyzed the F0 values for modal and creaky voice and secondly, we focused on the amplitude difference between the first and the second harmonic (H1-H2) and between the first harmonic and the one with the highest amplitude of the third formant (H1-A3). The results revealed: the female speakers produced the creaky voice phonation by lowering their modal F0, 28% of the speakers showed a positive amplitude between H1-H2 and 55% of speakers had also a positive result between H1-A3. These harmonics measures are contrary to the literature. Finally, considering the F0 rate decreasing value and H1-H2 amplitude difference, we discovered three different groups of phonatory strategies that must be analyzed conducting further experiments.

References

ABERCROMBIE, D. (1967): Elements of general phonetics, Edimburgo, Edinburgh University Press.

BECK, J. M. (2010): «Organic Variation of the Vocal Apparatus», en W. J. Hardcastle, J. Laver y F. E. Gibbon (eds.): The Handbook of Phonetic Sciences, Oxford, John Wiley & Sons, 2010, pp. 119-155.

BELOTEL-GRENIÉ, A. y M. GRENIÉ (2004): «The creaky voice phonation and the organization of Chinese discourse», en B. Bel (ed): Proceedings of Inter-national Symposium on Tonal Aspects of Languages. With Emphasis on Tone Languages, Pekín, The Institute of Linguistics in Chinese Academy of Social Sciences, pp. 5-8.

BLOMGREN, M.; Y. CHEN, M. L, NG y H. R. GILBERT (1998): «Acoustic, aerodynamic, physiologic, and perceptual properties of modal and vocal fry registers», The Journal of the Acoustical Society of America, 103, p. 2649.

BOERSMA, P. y D. WEENINK (2013): Praat: doing phonetics by computer. Versión 5.3.56.

http://www.praat.org [10/02/2015]

CAMARGO, Z.; S. MADUREIRA, A. N. PESSOA y L. C. RUSILO (2012): «Voice quality and gender: some insights on correlations between perceptual and acoustic dimensions», en Q. Ma, H. Ding y D. Hirst (eds): Abstract Book of 6th International Conference on Speech Prosody, Shanghai, Tongji University Press, 1, pp. 115-118.

CATFORD, J. C. (1964): «Phonation types: The classification of some laryngeal components of speech production» en D. Abercrombie, D. B. Fry, P. A. D. MacCarthy, N. C. Scott y J. L. L. Trim (eds.): In honour of Daniel Jones, Londres, Longman, pp. 26-37.

CATFORD, J. C. (1977): Fundamental problems in phonetics, Edimburgo, Edinburgh University Press, vol. 1, p. 977.

CRYSTAL, D. (1976): Prosodic systems and intonation in English, Cambridge, Cambridge University Archives, vol. 1.

COATES, J. (1986): Women, Men, and Language: A Sociolinguistic Account of Sex Differences, Londres, Longman.

COBETA, I.; F. NÚÑEZ y S. FERNÁNDEZ (2013): Patología de la voz, Marge books, Madrid.

CULLEN, A.; J. KANE, T. DRUGMAN y N. HARTE (2013): «Creaky voice and the classification of affect», en Proceedings of WASSS (Workshop on Affective Social Speech Signals), Grenoble.

DEJONCKERE, P. H.; P. BRADLEY, P. CLEMENTE, G. CORNUT, L. CREVIER-BUCHMAN, G. FRIEDRICH y V. WOISARD (2001): «A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques», European Archives of Oto-rhino-laryngology, 258(2), pp. 77-82.

DILLEY, L.; S. SHATTUCK-HUNAGEL Y M. OSTENDROF (1996): «Glottalization of word-initial vowels as a funcion of prosodic structure», Journal of Phonetics, 24, pp. 423-444.

DRAKE, O. J. (1937): «Toward an improved vocal quality», Quarterly Journal of Speech, 23(4), pp. 620-626.

EDMONSON, J. A. y J. H. ESLING (2006): «The valves of the throat and their functioning in tone, vocal register and stress: laryngoscopic case studies», Phonology, 23, 2, pp. 157-191.

ERICKSON, D.; T. SHOCHI, C. MENEZES, H. KAWAHARA y K. I. SAKAKIBARA (2008): «Some non-F0 cues to emotional speech: an experiment with morphing», Speech Prosody, pp. 6-9.

ESLING, J. H. (1983): «A laryngographic investigation of phonation type and laryngeal configurations», Working Papers of the Linguistics Circle, 3 (1), pp. 14-36.

ESLING, J. y J. HARRIS (2005): «States of the glottis: An articulatory phonetic model based on laryngoscopic observations» en W. Hardcastle y J. Beck (eds.): A Figure of Speech: A Festschrift for John Laver, Nueva Jersey, Lawerence Erlbaum Associates, pp. 247-383.

ESPOSITO, C. M. (2010): «Variation in contrastive phonation in Santa Ana del Valle Zapotec», Journal of the International Phonetic Association, 40, 2, pp. 181-198.

FAWCUS, M. (1986): Voice disorders and their management, Chapman & Hall, Londres.

FERNÁNDEZ-BAILLO GALLEGO DE LA SACRISTANA, R. (2013): Índice acústico de discapacidad vocal (IADV) en población adulta: diseño de la escala, resultados y correlatos anatómico-fisiológicos, tesis doctoral, Universidad Complutense, Madrid.

FIELD, A. (2013): Discovering statistics using IBM SPSS statistics, Londres, Sage.

FOUGHT, C. (2003): Chicano English in context, Nueva York, Palgrave Macmillan.

FRIČ, M.; F. ŠRAM y J. G. ŠVEC (2006): «Voice registers, vocal folds vibration patterns and their presentation in videokymography», en E. Rajcan y E. Borsíková (eds): Proceedings of ACOUSTICS High Tatras 06, 33rd International Acoustical Conference - EAA Symposium, Eslovaquia, Strbské Pleso, Slovak Acoustical Soc, pp. 42-45.

FUJIMOTO, M. y K. Maekawa (2003): «Variation in phonation types due to paralinguistic information: An analysis of high-speed video images», en M., J. Solé, D. Recasens y J. Romero (eds): Proceedings of the 15th International Congress of Phonetic Sciences, Barcelona, Causal Productions, pp. 2401-2404.

GERRATT, B. y J. KREIMAN (2001): «Toward a taxonomy of nonmodal phonation», Journal of Phonetics, 29, 4, pp. 365-381.

GERRAT, B. y J. KREIMAN (2004): «Perceptual evaluation of voice quality», en R. D. Kent (ed.): The MIT Encyclopedia of Communication Disorders, Cambridge, Massachusetts, The MIT Press, pp. 78-80.

GICK, B.; I. WILSON y D. DERRICK (2013): Articulatory Phonetics, Malden, Wiley-Blackwell.

GIL FERNÁNDEZ, J. (1988): Los sonidos del lenguaje, Madrid, Síntesis.

GIL FERNÁNDEZ, J. (2012): «La cualidad de voz y la comparación judicial de voces», presentación en II Jornadas (In)formativas de Lingüística Forense. Facultad de Filosofía y Letras, Universidad Autónoma de Madrid. http://liceu.uab.es/~joaquim/phonetics/fon_prosod/suprasegmentales_fonacion.html [20/12/2015]

GOBL, C. (1989): «A preliminary study of acoustic voice quality correlates», STL-QPSR, 4, pp. 9-21.

GOBL, C. y A. NÍ CHASAIDE (1992): «Acoustic characteristics of voice quality», Speech Communication, 11 (4), pp. 481-490.

GOBL, C. y A. NÍ CHASAIDE (2003): «The role of voice quality in communicating emotion, mood and attitude», Speech communication, 40, 1, pp. 189-212.

GOBL, C. y A. NÍ CHASAIDE (2010): «Voice source variation and its communicative functions», en W. J. Hardcastle, J. Laver y F. E. Gibbon (eds.): The Handbook of Phonetic Sciences, Oxford, Wiley-Blackwell, 20102, pp. 378-423.

GORDON, M. (2001): «Linguistic aspects of voice quality with special reference to Athabaskan», en S. Tuttle y G. Holton (eds): Proceedings of the 2001 Athabaskan Languages Conference, Los Ángeles, Ed. Alaska Native Language Center, pp. 163-178.

GORDON, M. y P. LADEFOGED (2001): «Phonation types: a cross-linguistic overview», Journal of Phonetics, 29 (4), pp. 383-406.

HAMMARBERG, B. (1999): «Voice research and clinical needs», Folia phoniatrica et logopaedica, 52, 1-3, pp. 93-102.

HAMMARBERG, B. y J. GAUFFIN (1995): «Perceptual and acoustic characteristics of quality differences in pathological voices as related to physiological aspects», en O. Fujimura y M. Hirano (eds.): Vocal Fold Physiology, Voice Quality Control, San Diego, Ed. Singular Pub. Group, pp. 283-303.

HANSON, H. M. (1995): Glottal characteristics of female speakers, tesis doctoral, Harvard University, Cambridge, Massachussets.

HANSON, H. M. y E. S. CHUANG (1999): «Glottal characteristics of male speakers: Acoustic correlates and comparison with female data», The Journal of the Acoustical Society of America, 106, 2, pp. 1064-1077.

HANSON, H. M.; K. N. STEVEN, H. K. J. KUO, M. Y. CHEN y J. SLIFKA (2001): «Towards models of phonation», Journal of Phonetics, 29, 4, pp. 451-480.

HEDELIN, P. y D. HUBER (1990): «Pitch period determination of aperiodic speech signals», en Proceedings of Acoustics, Speech, and Signal Processing, Albuquerque, Institute of Electrical and Electronic Engineers Service Center, pp. 361-364.

HENRICH, D. N. (2006): «Mirroring the voice from Garcia to the present day: Some insights into singing voice registers», Logopedics Phonatrics Vocology, 31, 1, pp. 3-14.

HENTON, C. y A. BLADON (1988): «Creak as a sociophonetic marker» en L. Hyman y C. N. Li, (eds.): Language, Speech, and Mind: studies in honour of Victoria Fromkin, Londres, Routledge, pp. 3-29.

HEWLETT, N. y J. M. BECK (2013): An introduction to the science of phonetics, Londres, Routledge.

HIRANO, M. y D. M. BLESS (1993): Videostroboscopic examination of the larynx, San Diego, Singular Publishing Group.

HIROSE, H. (1971): «Laryngeal Adjustments for Vowel Devoicing in Japanese: An Electromyographic Study», Haskins Laboratories Status Report on Speech Research SR-28, pp. 157-165.

HOLLIEN, H. (1974): «On vocal registers», Journal of Phonetics, 2, pp. 125-143.

HOLLIEN, H. y R. W. WENDAHL (1968): «Perceptual study of vocal fry», The Journal of the Acoustical Society of America, 43, p. 506.

HOLMBERG, E. B.; R. E. HILLMAN, J. S. PERKELL, P. C. GUIOD y S. L. GOLDMAN (1995): «Comparisons among aerodynamic, electroglottographic, and acoustic spectral measures of female voice», Journal of Speech, Language and Hearing Research, 38, 6, p. 1212.

HONIKMAN, B. (1964): «Articulatory settings» en D. Abercrombie, D. B. Fry, P. A. D. MacCarthy, N. C. Scott y J. L. L. Trim (eds.): In honor of Daniel Jones, Londres, Longmans, pp. 73-84.

ISELI, M.; Y. L. SHUE y A. ALWAN (2007): «Age, sex, and vowel dependencies of acoustic measures related to the voice source», The Journal of the Acoustical Society of America, 121, 4, pp. 2283-2295.

JAVKIN, H. R.; N. ANTOÑANZAS-BARROSO e I. MADDIESON (1987): «Digital Inverse Filtering for Linguistic Research», Journal of Speech, Language, and Hearing Research, 30, 1, pp. 122-129.

JIANG, D. N.; J. H. TAO y L. H. CAI (2002): «Voice Quality Analysis under the Pitch Effect», en Proceedings of the International Symposium on Chinese Spoken Language Processing, Taipei.

JOHNSON, K. (1997): Acoustic and Auditory Phonetics, Malden, Wiley- Blackwell, 20123.

JOHNSTONE, T. y K. R. SCHERER (1999): «The effects of emotions on voice quality», en J. Ohala (ed): Proceedings of the XIVth International Congress of Phonetic Sciences, San Francisco, University of California, pp. 2029-2032.

KEATING, P. A. y C. ESPOSITO (2006): «Linguistic voice quality», UCLA Working Papers in Phonetics, 105, pp. 85-91.

KEATING, P.; J. KUANG, C. ESPOSITO, M. GARELLEK y S. KHAN (2012): «Multi-dimensional phonetic space for phonation contrasts», póster presentado en LabPhon 13, Stuttgart, Alemania.

KEATING, P. A. y M. GARELLEK (2015): «Acoustic analysis of creaky voice», póster presentado en una sesión especial sobre voz creaky en el congreso anual de la Linguistic Society of America, Portland, Estados Unidos.

KEATING, P. A.; M. GARELLEK y J. KREIMAN (2015): «Acoustic properties of different kinds of creaky voice», en M. Wolters, J. Livingstone, B. Beattie, R. Smith, M. MacMahon, J. Stuart-Smith y J. Scobbie (eds): Proceedings of 18th International Congress of Phonetic Sciences, Glasgow, International Phonetic Association.

KIM, J. Y. (2013): The Use of Creaky Voice by Spanish Heritage Speakers in the US, póster presentado en el Seventh Heritage Language Research Institute, Chicago.

KREIMAN, J.; D. VANLANCKER-SIDTIS y B. R. GERRAT (2008). «Perception of voice quality», en D. B. Pisoni y R. E. Remez (eds.): Handbook of Speech Perception, Oxford, Blackwell Publishing, pp. 338-362.

KREIMAN, J. y D. SIDTIS (2011): Foundations of voice studies: An interdisciplinary approach to voice production and perception, Hoboken, Wiley-Blackwell.

KREIMAN, J.; Y. L. SHUE, G. CHEN, M. ISELI, B. R. GERRAT, J. NEUBAUER y A. ALWAN (2012): «Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation», Journal of the Acoustical Society of America, 132(4), pp. 2625-2632.

LADEFOGED, P. (1971): Preliminaries to linguistic phonetics, Chicago, University of Chicago Press.

LAVER, J. (1968): «Voice quality and indexical information», International Journal of Language & Communication Disorders, 3, 1, pp. 43-54.

LAVER, J. (1975): «Individual features in voice quality», tesis doctoral, Universidad de Edimburgo.

LAVER, J. (1980): The Phonetic Description of Voice Quality, Cambridge, Cambridge University Press.

LAVER, J.; S. WIRZ, J. MACKENZIE y S. M. HILLER (1981): «A perceptual protocol for the analysis of vocal profiles», Edinburgh University Department of Linguistics Work in Progress, 14, pp. 139-155.

LAVER, J. (1994): Principles of phonetics, Cambridge, Cambridge University Press.

LUO, J. (2012): Affective computing and intelligent interaction, vol. 137, Springer Science & Business Media.

MENDOZA-DENTON, N. (2011): «The semiotic hitchhiker's guide ot creaky voice: circulation and gendered hardcore in a Chicana/o gang persona», Journal of the Linguistic Anthropology, 21, 2, pp. 261-280.

MICHEL, J. F. (1968): «Fundamental frequency investigation of vocal fry and harshness», Journal of Speech, Language, and Hearing Research, 11 (3), pp. 590-594.

MILLER, R. (2004): Solutions for singers: Tools for performers and teachers, Oxford, Oxford University Press.

MOOSMÜLLER, S. (2007): «The influence of creaky voice on formant frequency changes», International Journal of Speech Language and the Law, 8, 1, pp. 100-112.

MORRIS, R. y A. B. HARMON (2010): «Describing voice disorders», en J. S. Damico, N. Müller y M. J. Ball (eds.): The handbook of language and speech disorders, Malden, John Wiley & Sons, pp. 455-473.

MOSER, H. M. (1942): «Symposium on Unique Cases of Speech Disorders; Presentation of a Case», Journal of Speech and Hearing Disorders, 7, 2, pp. 173-174.

PERROT, P.; G. AVERSANO y G. CHOLLET (2007): «Voice disguise and automatic detection: review and perspectives», en Y. Stylianou, M. Faúndez-Zanuy y A. Esposito (eds): Progress in nonlinear speech processing, Springer Berlin Heidelberg, pp. 101-117.

PODESVA, R. J. (2011): «Gender and the social meaning of non-modal phonation types», en C. Cathcart, I-H-Chen, G. Finley, S. Kang, C. S. Sandy y E. Stickles (eds): Proceedings of the Annual Meeting of the Berkeley Linguistics Society, Berkeley, Berkeley Linguistic Society, vol. 37, 1, pp. 427-448.

REDI, L. y S. SHATTUCK-HUFNAGEL (2001): «Variation in the realization of glottalization in normal speakers», Journal of Phonetics, 29 (4), pp. 407-429.

RODMAN, R. (1998): «Speaker recognition of Disguised Voices: A Program for research», en M. Demirekler, A. Saranli, H. Altincay y A. Paoloni (eds): Proceedings of the Consortium on Speech Technology in Conjunction with the Conference on Speaker Recognition by Man and Machine: Directions for Forensic Applications, Ankara, Publishing Arm, pp. 9-22.

SAN SEGUNDO, E.; H. ALVES y M. FERNÁNDEZ TRINIDAD (2013): «CIVIL Corpus: voice quality for speaker forensic comparison», Procedia-Social and Behavioral Sciences, 95, pp. 587-593.

SAPIR, E. (1927): «Speech as a personality trait», American Journal of Sociology, 32, 6, pp. 892-905.

SCHERER, K. R. (1989): «Vocal correlates of emotional arousal and affective disturbance», en H. E. Wagner y A. E. Manstead (eds.): Handbook of Social Psychophysiology, Chichester, John Wiley & Sons, pp. 165-197.

SCHERER, S.; J. KANE; C. GOBL y F. SCHWENKER (2013): «Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification», Computer Speech & Language, 27(1), pp. 263-287.

SEYFAHRT, S. y M. GARELLEK (2015): «Coda glottalization in American English», en M. Wolters, J. Livingstone, B. Beattie, R. Smith, M. macMahon, J. Stuart-Smith y J. Scobbie (eds): Proceedings of 18th International Congress of Phonetic Sciences, Glasgow, International Phonetic Association.

SHAW, F. y W. CROCKER (2015): «Creaky voice as a stylistic feature of young american female speech: an intraspeaker variation study of Scarlett Johansson», Lifespans and Styles, 1, pp. 21-27.

SPSS Inc. Released (2008): SPSS Statistics for Windows, Version 17.0, Chicago, SPSS Inc.

STEVENS, K. (1977): «Physics of laringeal behavior and larynx modes», Phonetica, 34, pp. 264-279.

STORY, B. H. y I. R. TITZE (2002): «A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function», Journal of Phonetics, 30 (3), pp. 485-509.

SUNDBERG, J. (2013). «The perception of singing», en D. Deutsch (ed.): The psychology of music, Amsterdam, Academic Press, pp. 59-98.

SVEČ, J. (2000): On vibration properties of human vocal folds: voice registers, bifurcations, resonance characteristics, development and application of videokymography, tesis doctoral, Universidad de Groningen.

TAJFEL, H. (1974): «Social identity and intergroup behavior», Social Science Information, 13, pp. 65-93.

TESHIGAWARA, M. (2003): «Voices in Japanese animation: a phonetic study of vocal stereotypes of heroes and villains in Japanese culture», tesis doctoral, Universidad de Victoria.

TITZE, I. R. (1990): «Interpretation of the electroglottographic signal», Journal of Voice, 4 (1), pp. 1-9.

TITZE, I. R. (1994). Principles of Voice Production, Englewood Cliffs, New Jersey. Prentice Hall.

TRASK, L. R. (1996): A dictionary of phonetics and phonology, Londres, Routledge.

YANUSHEVSKAYA, I.; C. GOBL y N.A. CHASAIDE (2005): «Voice quality and F0 cues for affect expression: implications for synthesis», en Proceedings of INTERSPEECH, Lisboa, Curran Associates Inc, pp. 1849-1852.

YUASA, I. P. (2010): «Creaky voice: A new feminine voice quality for young urban-oriented upwardly mobile American women?», American Speech, 85, pp. 315-337.

Published

2016-02-24

How to Cite

Lirio, P. (2016). Acoustic analysis of Spanish female deliberate creaky voice phonation. Journal of Experimental Phonetics, 25, 193–232. Retrieved from https://revistes.ub.edu/index.php/experimentalphonetics/article/view/44139

Issue

Section

Articles