Determination of Bahasa Melayu Word List From Friday Sermon Transcripts Using PHP and MySQL

Authors

  • Mokhtar Harun Fakulti Kejuruteraan Elektrik, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia
  • Muhammad ‘Aasim Asyafi’ie Ahmad UB2-1039, Kolej 10, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia
  • Siti Zaleha Abdul Hamid Fakulti Kejuruteraan Elektrik, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia
  • Fareha Abdul Rahman Fakulti Kejuruteraan Elektrik, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia
  • Puspa Inayat Khalid Fakulti Kejuruteraan Elektrik, Universiti Teknologi Malaysia, 81310 UTM Johor Bahru, Johor, Malaysia

DOI:

https://doi.org/10.11113/jt.v64.2071

Keywords:

Word list, Malay, speech intelligibility, mosque, languages

Abstract

Almost all of the speech intelligibility (SI) assessors today are from the West. The words used in the development of these SI assessors are mainly English. However, recent finding has found that word components that contribute to intelligibility of speech is different from one language to another. The purpose of this study is to determine Bahasa Melayu (BM) word list from Friday sermon transcripts. The project was undertaken by deriving BM words spoken during 52 Friday sermons read in mosques in Wilayah Persekutuan Kuala Lumpur. PHP is the programming language used to compute the frequency of words appears in the Friday sermon transcripts. These words were then stored into a database. The database used is MySQL. It has been found that the highest number of repeated word is 2509. The list is then truncated so that the numbers of repeated words range from 52 to 2509 times. The frequency of Bahasa Melayu words used in Friday sermons is an initial and important inventory that can be used in the development of SI assessor using BM words.

References

Yap, J. M. and Balota, D. A. 2009. Visual Word Recognition of Multisyllabic Words. Journal of Memory and Language. 60: 502–5.

Balota, D. A., Sergent-Marshall, S. D., Cortese, M. J. and Spieler, D. H. 2004. Visual Word Recognition of Single-Syllable Words. Journal of Experimental Psychology: General. 133(2): 283–316.

Jescheniak, J. D. and Levelt, W. J. M. 1994. Word-Frequency Effects in Speech Production: Retrieval of Syntactic Information and of Phonological Form. Journal of Experimental Psychology: Learning, Memory, and Cognition. 20: 824–843.

Thorndike, E. L. and Lorge, I. 1944. The Teacher’s Word Book of 30,000 Words. New York: Columbia University: Teachers College.

Brysbaert, M. and New, B. 2009. Moving Beyond KuÄera and Francis: A Critical Evaluation of Current Word Frequency Norms and the Introduction of a New and Improved Word Frequency Measure for American English. Behavior Research Methods. 41: 977–990.

Baayen, R. H., Piepenbrock, R. and Gulikers, L. 1995. The CELEX Lexical Database (CD-ROM). Linguistic Data Consortium. University of Pennsylvannia, Philadelphia.

New, B., Pallier C., Brysbaert, M. and Ferrand, L. 2004. Lexique 2: A New French Lexical Database. Behavior Research Methods, Instruments and Computers. 36(3): 516–524.

Fogerty, D. and Humes, L. E. 2012. The Role of Vowel and Consonant Fundamental Frequency, Envelope, and Temporal Fine Structure Cues to the Intelligibility of Words and Sentences. Journal of the Acoustical Society of America. 131(2): 1490–1501.

Gooskens, C., van Heuven, V. J., van Bezooijen, R. and Pacilly, J. J. A. 2010. Is Spoken Danish Less Intelligible than Swedish? Speech Communication. 52(11–12): 1022–1037.

Wong, L. L. N., Ho, A. H. S., Chua, E. W. W. and Soli, S. D. 2007. Development of the Cantonese speech intelligibility index. Journal of the Acoustical Society of America. 121(4): 2350–2361.

Granlund, S., Hazan, V. and Baker, R. 2012. An Acoustic–phonetic Comparison of the Clear Speaking Styles of Finnish–English Late Bilinguals. Journal of Phonetics. 40(3): 509–520.

Ahmad Khan Said, Mohamad Ngasri Dimon, and Mokhtar Harun. 2001. Measurements of A-Weighted SPL, RASTI and Alcons of Prayer Halls in Masjid Wilayah Persekutuan Kuala Lumpur. Technical Report. UT.CC-CONS-2.224 (58).

Broersma, M. and Scharenborg, O. 2010. Native and Non-native Listeners’ Perception of English Consonants in Different Types of Noise. Speech Communication. 52: 980–995.

Meyer, B. T., Jurgens, T., Wesker, T., Brand, T., & Kollmeier, B. 2010. Human Phoneme Recognition Depending on Speech-intrinsic Variability. The Journal of the Acoustical Society of America. 128(5): 3126–3141.

Sommerhoff, J. and Rosas, C. 2012. Logatom Corpus for the Assessment of the Intelligibility in Spanish Spekaing Environments and Its Relation with STI Measurements. Applied Acoustics. 73: 1190–1200.

Nielsen, J.B. and Dau, T. 2009. Development of a Danish Speech Intelligibility Test. International Journal of Audiology. 48: 729–741.

Tian, K. Q., Mukari, S. Z. M. S., Wahab, N. A. A., Razak, R. A., Omar, M. and Maamor, N. 2008. Malay Hearing in Noise Test. International Journal of Audiology. 47: 379–380.

Hällgren, M., Larsby, B. and Arlinger, S. 2006. A Swedish Version of the Hearing In Noise Test (HINT) for Measurement of Speech Recognition. International Journal of Audiology. 45(4): 227–37.

Nilsson, M., Soli, S.D. and Sullivan, J. A. 1994. Development of the Hearing In Noise Test for the Measurement of Speech Reception Thresholds in Quiet and in Noise. The Journal of the Acoustical Society of America. 95(2): 1085–1099.

Palaz, H., Bicil, Y., kanak, A. and Dogan, M. U. 2005. New Turkish Intelligibility Test for Assessing Speech Communication Systems. Speech Communication. 47(4): 411–423.

ANSI. 1989. Method for Measuring the Intelligibility of Speech over Communication Systems. United States, S3.2.

Kollmeier, B. and Wesselkamp, M. 1997. Development and Evaluation of a German Sentence Test. The Journal of Acoustical Society of America. 102(4): 2412–2421.

Tadmoor, U. 2009. The World’s Major Languages. 2nd ed. B. Comrie, ed. Abingdon, Oxon: Routledge.

Karim, N. S. 1995. Tatabahasa Dewan Edisi Baharu. Kuala Lumpur, Dewan Bahasa dan Pustaka.

Teoh, B. S. 1994. The Sound System of Malay Revisited. Kuala Lumpur, Dewan Bahasa dan Pustaka.

Downloads

Published

2013-09-15

Issue

Section

Science and Engineering

How to Cite

Determination of Bahasa Melayu Word List From Friday Sermon Transcripts Using PHP and MySQL. (2013). Jurnal Teknologi, 64(3). https://doi.org/10.11113/jt.v64.2071