摘要
Recently, lots of research has been directed towards natural language processing. However, the baby's cry, which serves as the primary means of communication for infants, has not yet been extensively explored, because it is not a language that can be easily understood. Since cry signals carry information about a babies' wellbeing and can be understood by experienced parents and experts to an extent, recognition and analysis of an infant's cry is not only possible, but also has profound medical and societal applications. In this paper, we obtain and analyze audio features of infant cry signals in time and frequency domains.Based on the related features, we can classify given cry signals to specific cry meanings for cry language recognition. Features extracted from audio feature space include linear predictive coding(LPC), linear predictive cepstral coefficients(LPCC),Bark frequency cepstral coefficients(BFCC), and Mel frequency cepstral coefficients(MFCC). Compressed sensing technique was used for classification and practical data were used to design and verify the proposed approaches. Experiments show that the proposed infant cry recognition approaches offer accurate and promising results.
Recently, lots of research has been directed towards natural language processing. However, the baby's cry, which serves as the primary means of communication for infants, has not yet been extensively explored, because it is not a language that can be easily understood. Since cry signals carry information about a babies' wellbeing and can be understood by experienced parents and experts to an extent, recognition and analysis of an infant's cry is not only possible, but also has profound medical and societal applications. In this paper, we obtain and analyze audio features of infant cry signals in time and frequency domains.Based on the related features, we can classify given cry signals to specific cry meanings for cry language recognition. Features extracted from audio feature space include linear predictive coding(LPC), linear predictive cepstral coefficients(LPCC),Bark frequency cepstral coefficients(BFCC), and Mel frequency cepstral coefficients(MFCC). Compressed sensing technique was used for classification and practical data were used to design and verify the proposed approaches. Experiments show that the proposed infant cry recognition approaches offer accurate and promising results.
引文
[1] H. Karp, The Happiest Baby on the Block; Fully Revised and Updated Second Edition:The New Way to Calm Crying, New York City, NY,USA, 2015.
[2] J. A. Green, P. G. Whitney, and M. Potegalb,"Screaming, yelling,whining and crying:categorical and intensity differences in vocal expressions of anger and sadness in children's tantrums,"Emotion, vol.5, no. 11, pp. 1124-1133, Oct. 2011.
[3] Y. Kheddache and C. Tadj,"Acoustic measures of the cry characteristics of healthy newborns and newborns with pathologies,"Journal of Biomedical Science and Engineering, vol. 6, no. 8, 9 pages, 2013.
[4] L. Liu, K. Kuo, and Sen M. Kuo,"Infant cry classification integrated ANC system for infant incubators,"in Proc. IEEE International Conf.on Networking, Sensing and Control, Paris, France, 2013, pp. 383-387.
[5] L. Liu and K. Kuo,"Active noise control systems integrated with infant cry detection and classification for infant incubators,"in Proc. Acoustic,pp. 1-6. 2012.
[6] L. LaGasse, A. Neal, and M. Lester,"Assessment of infant cry:acoustic cry analysis and parental perception,"Ment Retard Dev Disabil Res Rev..vol. 11, no. 1, pp. 83-93, 2005.
[7] Varallyay Jr. Gyorgy,"Future prospects of the application of the infant cry in the medicine,"Periodica Polytechnica Ser. El. Eng, vol. 50, no.1-2, pp. 47-62, 2006.
[8] G. Buonocore and C.V. Bellieni, Neonatal Pain,Suffering, Pain and Risk of Brain Damage in the Fetus and Newborn, Berlin, Germany, Springer,2008.
[9] L. L. LaGasse, R. Neal, and B. M. Lester."Assessment of infant cry:acoustic cry analysis and parental perception,"Mental Retardation and Developmental Disabilities Research Reviews, vol.11, no. 1. pp. 83-93,2005.
[10] L. Tan and J. Jiang, Digital Signal Processing:Fundamentals and Applications(3rd edition). Cambridge, MA, USA, Academic Press,2017.
[11] Z. Ren, K. Qian, Z. X. Zhang, V. Pandit, A. Baird, and B. Schuller,"Deep scalogram representations for acoustic scene classification,"IEEE/CAA J. Autom. Sinica, vol. 5, no. 3, pp. 662-669, May 2018.
[12] Dong Yu and Jinyu Li."Recent progresses in deep learning based acoustic models,"IEEE/CAA J. Autom. Sinica, vol. 4, no. 3, pp.396-409, April 2017
[13] B. Goldand N. Morgan, Speech and Audio Signal Processing. New York,NY, USA, John Wiley&Sons, 2011.
[14] V. R. Fisichelli, S. Karelitz, C. F. Z. Boukydis, and B. M. Lester,"The cry attencies of normal infants and those with brain damage,"Infant Crying, Plenum Press, 1985.
[15] C. F. Z. Boukydis and B. M. Lester, Infant Crying:Theoretical and Research Perspectives, Berlin, Germany, Springer Science and Bussiness Media, 2012.
[16] S. Ludington-Hoe, X. Cong, and F. Hashemi,"Infant crying:nature,physiologic consequences, and select interventions,"Neonatal Netw. vol.21, no. 2, pp. 29-36. Mar. 2002.
[17] P. Dunstan, Calm the Crying:The Secret Baby Language That Reveals the Hidden Meaning Behind an Infant's Cry, New York City, NY, USA,Avery, 2012.
[18] M. Sahidullah, and G. K. Saha,"Design analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition,"Speech Communication, vol. 54, no. 4, pp. 543-565, May2012.
[19] F. Katzberg, R. Mazur, M. Maass, P. Koch, and A. Mertins,"A compressed sensing framework for dynamic sound-field measurements;"IEEE/ACM Trans. Audio, Speech, and Language Processing, vol. 26, no.11,pp. 1962-1975, Jun. 2018.
[20] D. Needell and R. Ward,"Two-subspace projection method for coherent overdetermined systems,"Journal of Fourier Analysis and Applications,vol. 19, no. 2, pp. 256-269, April, 2013.
[21] C. Lau,"Development of suck and swallow mechanisms in infants,"Ann. Nutr. Metab., vol. 7, no. 5, pp. 7-14, July 2015.
[22] P. Runefors and E. Ambjonsson,"A sound spectrogram analysis of children's crying after painful stimuli during the first year of life,"Folia honiatr. Logop., vol. 2, no. 57, pp. 90-95, Mar-Apr. 2005.