SINGLE CHANNEL SPEECH ENHANCEMENT USING EVOLUTIONARY ALGORITHM WITH LOG-MMSE

Authors

  • Kalpana Ghorpade E &TC department, Faculty of Engineering, MKSSS’s Cummins College of Engineering for Women, Pune, Maharashtra, India
  • Arti Khaparde Department of ECE, Faculty of Engineering, Dr. Vishwanath Karad MIT World Peace University, Pune, Maharashtra, India

DOI:

https://doi.org/10.11113/aej.v12.16770

Keywords:

speech enhancement, evolutionary algorithms, log-MMSE, particle swarm optimization, speech intelligibility

Abstract

Additive noise degrades speech quality and intelligibility. Speech enhancement reduces this noise to make speech more pleasant and intelligible. It plays a significant role in speech recognition or speech-operated systems. In this paper, we propose a single-channel speech enhancement method in which the log-minimum mean square error method (log-MMSE) and modified accelerated particle swarm optimization algorithm are used to design a filter for improving the quality and intelligibility of noisy speech. Accelerated particle swarm optimization (APSO) algorithm is modified in which a single dimension of particle position is changed in a single iteration while obtaining the particle’s new position. Using this algorithm, a filter is designed with multiple passbands and notches for speech enhancement. The modified algorithm converges faster compared with standard particle swarm optimization algorithm (PSO) and APSO giving optimum filter coefficients. The designed filter is used to enhance the speech. The proposed speech enhancement method improves the perceptual estimation of speech quality (PESQ) by 17.05% for 5dB babble noise, 33.92 %  for 5dB car noise, 14.96 % for 5dB airport noise, and 39.13 % for 5dB exhibition noise. The average output PESQ for these four types of noise is improved compared to conventional methods of speech enhancement. There is an average of 7.58 dB improvement in segmental SNR for these noise types. The proposed method improves speech intelligibility with minimum speech distortion.

References

Kondaz, A. M., 2004. Digital speech coding for low bit rate communication systems, Second Edition, (John Wiley and Sons) DOI: https://doi.org/10.1002/0470870109

Loizou, P. C. 2013. Speech Enhancement: Theory and Practice, Second Edition CRC Press DOI: https://doi.org/10.1201/b14529

Hu, Yi and Loizou, P.C. 2006. Subjective Comparison of Speech Enhancement Algorithms. Department of Electrical Engineering, University of Texas at Dallas Richardson, Texas. 1-4244-0469-X/06 IEEE DOI: 10.1109/icassp.2006.1659980

Boll, S. 1979. Suppression of acoustic noise in speech using spectral subtraction, IEEE Transactions on acoustics, speech, and signal processing, 27(2): 113–120. DOI: https://doi.org/10.1109/TASSP.1979.1163209

Berouti,M. , Schwartz,R. , and Makhoul,J.1979.Ehancementof speech corrupted by acoutic noise,IEEE International Conference on Acoustics, Speech, and Signal Prcessing, ICASSP '79, 4: 208‐211

DOI: 10.1109/ICASSP.1979.1170788

Zadeh, L. 1950. Frequency analysis of variable networks, Institute of Radio Engineering. 38: 291‐299. DOI: https://doi.org/10.1109/JRPROC.1950.231083

Atlas,L. 2003.Modulation spectral transforms: Application to speech separation and modification, University of Washington, Washington, WA

Paliwal, K. Wojcicki, and Schwerin, B. 2010. Single-channel speech enhancement using spectral subtraction in the short-time modulation domain, Speech Communication, 52(5):450–475. DOI: https://doi.org/10.1016/j.specom.2010.02.004

Zang, Yi 2012. Modulation domain processing and speech phase spectrum in speech enhancement, A Dissertation Presented to the Faculty of the Graduate School at the University of Missouri-Columbia

Wang, Y. 2015. Speech enhancement in the modulation domain, PhD thesis, Imperial College London

Dionelis N. and Brookes M. 2017. Modulation domain speech enhancement using Kalman Filter with a Bayesian update of speech and noise in the log spectral domain, 978-1-5090-5925-6/IEEE Proceeding Hands-Free Speech Communication and Microphone Arrays, HSCMA. 111 - 115 DOI: https://doi.org/10.1109/HSCMA.2017.7895572

Wang, Y., and Brookes M. 2018. Model-Based Speech Enhancement in the Modulation Domain, IEEE/ACM Transaction on Audio, Speech and Language Processing, 26(3): 580–594. DOI: https://doi.org/10.1109/TASLP.2017.2786863

Widrow, B. and Stearns, S.D. 1985. Adaptive Signal Processing, Prentice-hall Englewood Cliffs, NJ.

Mohammed, J.R. 2007. A new simple adaptive noise cancellation scheme based on ale and NLMS filter, Proceedings of the 5th Annual Conference on Communication Networks and Services Research, May 14-17, IEEE Xplore Press, Frederlcton, NB, Canada, 245-254. DOI: https://doi.org/10.1109/CNSR.2007.4

Gorriz, J.M., Ramırez, J., Cruces-Alvarez, S., Puntonet, C.G. and Lang, E.W. et al.: “A novel LMS algorithm applied to adaptive noise cancellation, IEEE Signal Process. Lett., 16: 34-37.DOI: https://doi.org/10.1109/LSP.2008.2008584

Shynk, J. J. 1989. Adaptive IIR Filtering, IEEE ASSP Magazine, 4–21 DOI: https://doi.org/10.1109/53.29644

Krusicnski, D.J. and Jenkins, W.K. 2003. Adaptive Filtering Via Particle Swarm Optimization, Proc. 37’Asilomar Conf on Signals, Systems, and Computers. DOI: https://doi.org/10.1109/acssc.2003.1291975

Krusienski, D. J. and Jenkins, W. K. 2004. Particle Swarm Optimizationfor Adaptive IIR Filter Structures, 0-7803-8515-2/04/2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753). DOI: https://doi.org/10.1109/cec.2004.1330966

Yang X. S. 2008. Nature-Inspired Metaheuristic Algorithms, Luniver Press.

Yang, Xin-She, Deb, S. and Fong, S. 2011. Accelerated Particle Swarm Optimization and Support Vector Machine for Business Optimization and Applications, Networked Digital Technologies (NDT2011), Communications in Computer and Information Science, Vol. 136, Springer, 53–66.DOI: https://doi.org/10.1007/978-3-642-22185-9 6

Mandal, S., Ghoshal, S., Kar, R., Mandal, D. 2012. Design of optimal linear phase FIR high pass filter using craziness-based particle swarm optimization technique, Journal of King Saud University – Computer and Information Sciences, 24, 83–92. DOI: https://doi.org/10.1016/j.jksuci.2011.10.007

Mandal, S., Ghoshal, S., Kar R., Manda,l D. 2012.Craziness based Particle Swarm Optimization algorithm for FIR band stop filter design, Swarm and Evolutionary Computation, 7: 58–64. DOI: https://doi.org/10.1016/j.swevo.2012.05.002

Aggarwal, A., Rawat, T., Upadhyay,D. 2016. Design of optimal digital FIR filters using evolutionary and swarm optimization techniques, International Journal of Electronics and Communication (AEÜ), 70: 373–385.DOI: https://doi.org/10.1016/j.aeue.2015.12.012

Lim, W. H. and Nor A. M. I. 2015. Particle Swarm Optimization with Improved Learning Strategy, Journal of Engineering Science, 11: 27–48.

Zhao F. 2016. Optimized Algorithm for Particle Swarm Optimization, International Journal of Mathematical, Computational, Physical, Electrical and Computer Engineering, 10(3): 91-95. DOI: https://doi.org/10.1155/2016/3968324

Xu, G., Cui,Q., Shi,X., Ge, H., Zhan,Z., Lee, H. P., Liang,Y., Tai,R., Wu,C. 2019. Particle swarm optimization based on dimensional learning strategy, Swarm and Evolutionary Computation, 45: 33–51. DOI: https://doi.org/10.1016/j.swevo.2018.12.009

Fajr, R., and Bouroumi, A. 2017. An Improved Particle Swarm Optimization Algorithm for Global Multidimensional Optimization, Journal of Intelligent Systems, 29(1): 127–142. DOI: https://doi.org/10.1515/jisys-2017-0104

Zhang, Y., Wang,S., and Ji,G. 2015. Comprehensive Survey on Particle Swarm Optimization Algorithm and Its Applications, Mathematical Problems in Engineering, Article ID 931256, 38 pages. DOI: https://doi.org/10.1155/2015/931256

Prajna, K., Rao, G.S.B., Reddy, K. V. V. S. 2014. A New Dual Channel Speech Enhancement Approach Based on Accelerated Particle Swarm Optimization (APSO), International Journal of Intelligent Systems and Applications. DOI: https://doi.org/10.5815/ijisa.2014.04.01

Prajna, K., Rao, G.S.B., Reddy, K. V. V. S., Maheswari, R. U., 2015 .A new approach to dual channel speech enhancement based on hybrid PSOGSA, International Journal of Speech Technology, 18: 45–56. DOI: https://doi.org/10.1007/s10772-014-9245-5

Geravanchizadeh, M.,Osgouei S. G., 2015. A New Shuffled Sub-Swarm Particle Swarm Optimization Algorithm for Speech Enhancement, Journal of Advances in Computer Engineering and Technology, 1(1): 43-50

Sandeep Kumar, 2020.Directed Searching Optimization-Based Speech Enhancement Technique, Fluctuation and Noise Letters, 2050035, World Scientific Publishing Company. DOI: https://doi.org/10.1142/S0219477520500352

Selvi,R. S.,Suresh G.R. 2015: Hybridization of spectral filtering with particle swarm optimization for speech signal enhancement, International Journal of Speech Technology, 19(1): 19-31 DOI: https://doi.org/10.1007/S10772-015-9317-1

Lavanya T., Nagarajan T., and Vijayalakshmi P. 2020. Multi-level Single-Channel Speech Enhancement Using a Unified Framework for Estimating Magnitude and Phase spectra, IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28: 1315-1327. DOI: https://doi.org/10.1109/TASLP.2020.2986877

Kennedy J., and Eberhart R. 1995. Particle swarm optimization, Proceedings of the IEEE International Conference on Neural Networks, 4: 1942–1948.

Wei X., Anyu Li., Boya S., and Zhao J. 2018. A Novel Design of Sparse FIR Multiple Notch Filters with Tunable Notch Frequencies, Mathematical Problems in Engineering, 2018, Article ID 3490830. DOI: https://doi.org/10.1155/2018/3490830

Hu, Y. and Loizou, P. 2007. Subjective evaluation and comparison of speech enhancement algorithms, Speech Communication, 49: 588–601. DOI: https://doi.org/10.1016/j.specom.2006.12.006

Rix A.W., Beerends G. J., Holliar M.P. 2001. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs, IEEE International conference on Acoustic, Speech and Signal Processing proceedings (Cat. No.01CH37221). DOI: https://doi.org/10.1109/ICASSP2001.941023.

Downloads

Published

2022-02-28

How to Cite

Ghorpade, K. ., & Khaparde, A. . (2022). SINGLE CHANNEL SPEECH ENHANCEMENT USING EVOLUTIONARY ALGORITHM WITH LOG-MMSE. ASEAN Engineering Journal, 12(1), 83-91. https://doi.org/10.11113/aej.v12.16770

Issue

Section

Articles