How language-specific and cross-linguistic factors affect speech rhythm
Evidence from Bengali and English
DOI:
https://doi.org/10.1344/efe-2024-33-169-190
Keywords:
durational rhythmic metrics, Bengali (L1), English (L2), language-specific, cross-linguistic
Abstract
This study examined durational aspects of speech rhythm in Bengali (L1) and in English (L2) as spoken by educated Bengali speakers who are advanced learners of English, in order to assess how language-specific and cross-linguistic factors affect speech rhythm. Using the metrics rateSyl, ΔC, nPVI-C, nPVI-V, VarcoV, %V, and ΔPeakLn, the study found that L1 speech had a faster tempo, shorter consonants, and longer vowels relative to consonants, whereas L2 speech showed greater variability in consonant and vowel durations. These differences reflect language-specific factors: L1 is a syllable-timed language and L2 is a stress-timed language. Sonority patterns between syllables, however, appeared consistent across L1 and L2, suggesting a cross-linguistic influence of L1 on L2. The results also identified the proportion of vocalic intervals (%V) as the most effective metric for differentiating L1 and L2 rhythm.
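For readers unfamiliar with the durational metrics named in the abstract, the following is a minimal sketch of how several of them are conventionally computed from lists of interval durations in seconds. It is not the authors' analysis pipeline: the function names, the use of the population standard deviation, and the example durations are all assumptions made for illustration, and ΔPeakLn is left out because its computation is not described here.

```python
from statistics import mean, pstdev


def percent_v(vowels, consonants):
    """%V: vocalic share of total vocalic + consonantal duration, in percent."""
    total = sum(vowels) + sum(consonants)
    return 100.0 * sum(vowels) / total


def delta(durations):
    """Delta metric (e.g. ΔC): standard deviation of interval durations.

    Assumption: population SD; a given study may use the sample SD instead.
    """
    return pstdev(durations)


def varco(durations):
    """Varco metric (e.g. VarcoV): SD normalised by the mean, times 100."""
    return 100.0 * pstdev(durations) / mean(durations)


def npvi(durations):
    """nPVI: mean normalised difference between successive intervals, times 100."""
    diffs = [
        abs(a - b) / ((a + b) / 2.0)
        for a, b in zip(durations, durations[1:])
    ]
    return 100.0 * mean(diffs)


def rate_syl(n_syllables, total_seconds):
    """rateSyl: syllables per second."""
    return n_syllables / total_seconds


if __name__ == "__main__":
    # Hypothetical interval durations in seconds, not data from the study.
    vowels = [0.10, 0.12, 0.08, 0.10]
    consonants = [0.06, 0.14, 0.10]
    print("%V      =", round(percent_v(vowels, consonants), 2))
    print("ΔC      =", round(delta(consonants), 4))
    print("VarcoV  =", round(varco(vowels), 2))
    print("nPVI-C  =", round(npvi(consonants), 2))
    print("nPVI-V  =", round(npvi(vowels), 2))
    print("rateSyl =", round(rate_syl(4, 0.70), 2))
```

The Varco and nPVI variants normalise by local or overall mean duration, which is why they are often preferred when speech rate differs between conditions, as it does between the L1 and L2 speech reported here.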
License
Copyright (c) 2024 Nafiseh Taghva, Shouvik Chaudhuri
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
All articles published online by Estudios de Fonética Experimental are licensed under Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International (CC BY-NC-ND 4.0 DEED), unless otherwise noted. Estudios de Fonética Experimental is an open access journal. Estudios de Fonética Experimental is hosted by RCUB (Revistes Científiques de la Universitat de Barcelona), powered by Open Journal Systems (OJS) software. The copyright is not transferred to the journal: authors hold the copyright and publishing rights without restrictions. Authors are free to use and distribute preprint and postprint versions of their articles. However, preprint versions are regarded as work-in-progress versions used for internal communication with the authors, and we prefer to share postprint versions.