Framework for the development of articulatory characterization studies over MRI images

Authors

Keywords:

articulatory characterization, cine-MRI, speech production, Basque, Spanish

Abstract

In this paper an innovative framework is presented, designed and developed by our research team to enable the accomplishment of research works concerning the articulatory characterization of the sounds of a language from measures taken over MRI image sequences. As fundamental element there is the DicomPas software tool, developed by our team, which allows to carry out the measures of articulatory parameters over the MRI image sequences and the execution of ad hoc algorithms over such measures, facing the data processing, with the view to the subsequent extraction of knowledge, in the form of the generation of statistical or artificial intelligence inferences. This framework is currently being applied to the achievement of diverse studies in Basque and Spanish of the Basque Country. To do so, a database with two repositories of images taken in the midsagittal plane, corresponding to 18 different informants, is available.

References

ALWAN, A.; S. NARAYANAN y K. HAKER (1997): «Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part II. The rhotics», The Journal of the Acoustical Society of America, 101, 2, pp.1078-1089.

BADIN, P; G. Bailly; M. Raybaudi y C. Segebarth (1998): «A three-dimensional linear articulatory model based on MRI data», Proceedings of the Third ESCA/COCOSDA International Workshop on Speech Synthesis, Jenolan Caves House, Blue Mountains, NSW, Australia, pp. 249–254.

BADIN, P.; G. Bailly; L. Reveret; M. Baciu; C. Segebarth y C. Savariaux (2002): «Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images», Journal of Phonetics, 30, 3, pp.533–553.

BADIN, P. y A. SERRURIER (2006): «Three-dimensional modeling of speech or-gans: Articulatory data and models», Transactions on Technical Committee of Psychological and Physiological Acoustics, The Acoustical Society of Japan, 36, 5, pp.421–426.

BAER, T. (1991): «Analysis of vocal tract shape and dimensions using magnetic resonance imaging: Vowels», The Journal of the Acoustical Society of America, 90, 2, pp.799–828.

BEAUTEMPS, D.; P. BADIN y G. BAILLY (1996): «Evaluation of an articulatory-acoustic model based on a reference subject», Proceedings of the First ESCA Tutorial and Research Workshop on Speech Production Modeling - Fourth Speech Production Seminar, Autrans, Francia, pp.45-48

DONOHO, D. L. (2006): «Compressed sensing», IEEE Transactions on Information Theory, 52, 4, pp.1289–1306.

ELEJABEITIA, A.; A. IRIBAR y R. M. PAGOLA (2009): «El cine-MRI aplicado a la descripción de las sibilantes vascas», Estudios de fonética experimental, XVIII, pp.145–160.

ENGWALL, O. y P. BADIN (1999): «Quarterly Progress and Status Report Collecting and analysing two- and three-dimensional MRI data for Swedish», Dept. for Speech, Music and Hearing. Quarterly Progress and Status Report (TMH-QPSR), 40, 3-4, pp. 11–38.

ENGWALL, O. (2000): «Are static MRI measurements representative of dynamic speech? Results from a comparative study using MRI, EPG and EMA», en B. Yuan; T. Huang y X. Tang (eds.): Proceedings of the International Con-ference on Spoken Language Processing (ICSLP), Pekín, China, pp. 17-20.

ENGWALL, O. (2003a): «A revisit to the Application of MRI to the Analysis of Speech Production-Testing our assumptions», en S. Palethorpe y M. Tabain (eds): Proceedings of 6th International Seminar on Speech Production, Sydney, Australia, pp.43-48.

ENGWALL, O. (2003b): «Combining MRI, EMA and EPG measurements in a three -dimensional tongue model», Speech Communication, 41, 2-3, pp.303-329.

FERNÁNDEZ PLANAS, A. M. (2008): «La electropalatografía (EPG) en el estudio articulatorio del habla. El WinEPG de Articulate Instruments Ltd», Estudios de fonética experimental, XVII, pp.285–299.

FITCH, W. T. y J. GIEDD (1999): «Morphology and development of the human vocal tract: A study using magnetic resonance imaging», The Journal of the Acoustical Society of America, 106, 3, pp.1511–1522.

GURLEKIAN, J. A; N. ELISEI y M. ELETA (2004): «Caracterización articulatoria de los sonidos vocálicos del español de Buenos Aires mediante técnicas de resonancia magnética», Revista Fonoaudiológica, 50, 2, pp.7–14.

HERMAN, G. T. (1980): Fundamentals of Computerized Tomography, Londres, Springer-Verlag, 20092.

HOOLE, P. y C. MOOSHAMMER (2002): «Articulatory analysis of the German vowel system», en P. Auer; P. Gilles y H. Spiekerman (ed): Silbenschnitt und Tonakzente, Tubingen, pp. 129–152.

HORNAK, J. (1996): The Basics of MRI, ScientificCommons.

http://www.cis.rit.edu/htbooks/mri/ [5/2/2013]

IBM (2013): SPSS

http://www-01.ibm.com/software/analytics/spss/. [5/2/2013]

IRIBAR, A. (2013): «Apuntes para la caracterización articulatoria experimental del vocalismo del español», Estudios de fonética experimental. XXII, pp. 37-80.

IRIBAR, A. (2012): Caracterización fonética experimental del vocalismo vasco-románico, tesis doctoral. Universidad de Deusto.

IRIBAR, A.; R. M. PAGOLA e I. TÚRREZ (en prensa): «Observaciones sobre la articulación de la lateral alveolar en euskara y castellano», en Actas del V Congreso de Fonética Experimental, Cáceres 2011.

IRIBAR, A.; R. M. PAGOLA e I. TÚRREZ (2013): «Caracterización articulatoria de ele en español y euskara», Estudios de fonética experimental, XXII, pp.129-171.

KARTHIKESWARAN, D. y S. DINAKAR (2011): «Developing a scientific visualization tool for Inner articulators», en Proceedings of the 2011 International Conference on Emerging Trends in Electrical and Computer Technology, IEEE, Chunkankadai, Nargelcoil, India, pp. 480–488.

KIM, Y. C.; S. S. NARAYANAN y K. S. NAYAK (2009a): «Accelerated 3D MRI of vocal tract shaping using compressed sensing and parallel imaging», Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),Taipei, Taiwan, pp.389–392.

KIM, Y. C.; S. S. NARAYANAN y K. S. NAYAK (2009b): «Accelerated three-dimensional upper airway MRI using compressed sensing», Magnetic resonance in medicine : official journal of the Society of Magnetic Resonance in Medicine, 61, 6, pp.1434–40.

LUSTIG, M., D. DONOHO y J. M. PAULY (2007): «Sparse MRI: The application of compressed sensing for rapid MR imaging», Magnetic Resonance in Medi-cine : Official Journal of the Society of Magnetic Resonance in Medicine, 58, 6, pp.1182–95.

MARTINS, P.; I. Carbone; A. Pinto; A. Silva y A. Teixeira (2008): «European Portuguese MRI based speech production studies», Speech Communication, 50, 11-12, pp.925–952.

MICROSOFT (2013a): Excel, versión 2010

http://office.microsoft.com/en-us/excel/. [5/2/2013]

MICROSOFT (2013b): RTF, versión 1.9.1.

http://www.microsoft.com/en-us/download/details.aspx?id=10725. [5/2/2013]

NARAYANAN, S.; K. Navak; S. Lee y D. Byrd (2004): «An approach to real-time magnetic resonance imaging for speech production», The Journal of the Acoustical Society of America, 115, 4, pp.1771–1776.

NARAYANAN, S.; A. ALWAN y K. HAKER (1997): «Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals», The Journal of the Acoustical Society of America, 101, 2, pp.1064–1077.

NARAYANAN, S.; D. BYRD y A. KAUN (1999): «Geometry, kinematics, and acoustics of Tamil liquid consonants», The Journal of the Acoustical Society of America, 106, 4, pp.1993–2007.

NARAYANAN, S. S. y A. ALWAN (1995): «An articulatory study of fricative consonants using magnetic resonance imaging», The Journal of the Acoustical Society of America, 98, 3, pp.1325–1347.

NEMA (2013): DICOM specification.

http://medical.nema.org/dicom/.[5/2/2013]

ORACLE-SUN MICROSYSTEMS (2013): Java.

http:// java.sun.com. [5/2/2013]

PAGOLA, R. M. (1992): Euskal fonetika Nafarroan. Iruñea, Nafarroako Gobernua.

PAGOLA, R. M.; A. IRIBAR e I. TÚRREZ (2012): «La descripción articulatoria de los sonidos en euskara y castellano: el proyecto DAELPACE», en M. Acillona (ed.): Marcos interpretativos de la realidad social contemporánea, Bilbao, Universidad de Deusto, pp.107–118.

ROMANO, A. y P. BADIN (2009): «An MRI study of the articulatory properties of italian consonants», Estudios de fonética experimental, XVIII, pp.327–344.

ROMERO, J. (2008): «La electromagnetometría en el estudio de la producción del habla», Estudios de fonética experimental, XVII, pp.359–374.

SERRURIER, A. y P. BADIN (2005): «Towards a 3D articulatory model of velum based on MRI and CT imagesۚ», ZAS Papers in Linguistics (Speech production and perception: Experimental analyses and models), 40, 1, pp.195–211.

SERRURIER, A. y P. BADIN (2008): «A three-dimensional articulatory model of the velum and nasopharyngeal wall based on MRI and CT data», Journal of the Acoustical Society of America, 123, 4, pp. 2335–2355.

STORY, B. H.; I. R. TITZE y E. A. HOFFMAN (1996): «Vocal tract area functions from magnetic resonance imaging», The Journal of the Acoustical Society of America, 100, 1, pp.537–54.

TAKEMOTO, H.; T. Kitamura; H. Nishimoto y K. Honda (2004): «A method of tooth superimposition on MRI data for accurate measurement of vocal tract shape and dimensions», Acoustical Science and Technology, 25, 6, pp.468–474.

TAKEMOTO, H.; K. Honda; S. Masaki; Y. Shimada y I. Fujimoto (2006): «Measurement of temporal changes in vocal tract area function from 3D cine-MRI data», The Journal of the Acoustical Society of America, 119, 2, pp.1037–1049.

THE FREE DICTIONARY (2013a): Body planes.

http://medical-dictionary.thefreedictionary.com/coronal+planes. [5/2/2013]

THE FREE DICTIONARY (2013b): Midsagittal plane.

http://medical-dictionary.thefreedictionary.com/midsagittal+plane. [5/2/2013]

THE UNIVERSITY OF WAIKATO (2013): WEKA.

http://www.cs.waikato.ac.nz/ml/weka/.[5/2/2013]

TIEDE, M. K. S. MASAKI y E. VATIKIOTIS-BATESON (2000): «Contrasts in speech articulation observed in sitting and supine conditions», Proceedings of the Fifth Seminar on Speech Production: Models and Data, Kloster Seeon, Bavaria, Alemania, pp. 25–28.

TXILLARDEGI (1980): Euskal fonologia. Donostia, Ediciones Vascas.

U.S. NATIONAL INSTITUTES OF HEALTH (2013): ImageJ, versión 1.46.

http://rsbweb.nih.gov/ij/.[5/2/2013]

WORLD WIDE WEB CONSORTIUM (W3C) (2013a): JPEG.

http://www.w3.org/Graphics/JPEG/.[5/2/2013]

WORLD WIDE WEB CONSORTIUM (W3C) (2013b): XML

http://www.w3.org/XML/.[5/2/2013]

WORLD WIDE WEB CONSORTIUM (W3C) (2013c): XML Schema.

http://www.w3.org/XML/Schema. [5/2/2013]

YANG, B. (1999): «Measurement and synthesis of the vocal tract of Korean monophthongs by MRI», en J. Ohala; Y. Hasegawa; M. Ohala; D. Granville y A. C. Bailey (eds.): Proceedings of the XIVth International Congress of Phonetic Sciences (ICPhS) 1999, San Francisco, E.E.U.U, pp. 2005–2008.

ZHOU, X. (2009): An MRI-based articulatory and acoustic study of American English liquid sounds/r/and/l/, tesis doctoral. Universidad de Maryland, College Park.

ZHOU, X. et al. (2010): «An MRI-based articulatory and acoustic study of lateral sound in American English», Proceedings of the 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Dallas, E.E.U.U, pp. 4182–4185.

Published

2013-07-30

How to Cite

García Arroyo, J. L., García Zapirain, B. ., Oleagordia Ruiz, I., & Méndez Zorrilla, A. (2013). Framework for the development of articulatory characterization studies over MRI images. Journal of Experimental Phonetics, 22, 367–404. Retrieved from https://revistes.ub.edu/index.php/experimentalphonetics/article/view/44183

Issue

Section

Miscellaneous