Infant Cry Language Analysis and Recognition: An Experimental Approach
  • Authors: Lichuan Liu; Wei Li; Xianwen Wu; Benjamin X. Zhou
  • Affiliations: Department of Electrical Engineering, Northern Illinois University; Department of Biology, The College of New Jersey
  • Keywords: compressed sensing; feature extraction; infant cry signal; language recognition
  • Journal: IEEE/CAA Journal of Automatica Sinica (自动化学报(英文版))
  • Journal code: ZDHB
  • Publication date: 2019-05-15
  • Year: 2019
  • Volume: 6
  • Issue: 03
  • Funding: Supported by the Gerber Foundation and the Northern Illinois University Research Foundation
  • Language: English
  • Record ID: ZDHB201903017
  • Pages: 173-183
  • Page count: 11
  • CN: 10-1193/TP
Abstract
A great deal of recent research has been directed toward natural language processing. However, the infant cry, which serves as the primary means of communication for infants, has not yet been extensively explored, because it is not a language that can be easily understood. Since cry signals carry information about a baby's well-being and can be understood to an extent by experienced parents and experts, recognition and analysis of an infant's cry is not only possible but also has profound medical and societal applications. In this paper, we obtain and analyze audio features of infant cry signals in the time and frequency domains. Based on the related features, we can classify given cry signals into specific cry meanings for cry language recognition. Features extracted from the audio feature space include linear predictive coding (LPC), linear predictive cepstral coefficients (LPCC), Bark frequency cepstral coefficients (BFCC), and Mel frequency cepstral coefficients (MFCC). A compressed sensing technique was used for classification, and practical data were used to design and verify the proposed approaches. Experiments show that the proposed infant cry recognition approaches offer accurate and promising results.
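As a rough illustration of one of the features named in the abstract, the sketch below estimates LPC predictor coefficients for a single frame using the textbook autocorrelation method with the Levinson-Durbin recursion. It is not taken from the paper: the function names (`autocorrelation`, `lpc_coefficients`, `synth_ar2`), the frame handling, and the synthetic test signal are all assumptions made for this example, and the paper's own framing, windowing, and model order are not stated in this record.

```python
import random


def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one frame."""
    n = len(frame)
    return [
        sum(frame[t] * frame[t + k] for t in range(n - k))
        for k in range(max_lag + 1)
    ]


def lpc_coefficients(frame, order):
    """Estimate predictor coefficients a_1..a_order (hypothetical helper).

    The model is x[t] ~ a_1*x[t-1] + ... + a_order*x[t-order], solved with
    the Levinson-Durbin recursion on the autocorrelation sequence.
    """
    r = autocorrelation(frame, order)
    if r[0] == 0.0:
        # Silent frame: no signal energy, so return an all-zero predictor.
        return [0.0] * order
    a = [0.0] * order
    err = r[0]  # prediction error energy of the order-0 model
    for i in range(order):
        # Reflection coefficient for the step from order i to order i + 1.
        k = (r[i + 1] - sum(a[j] * r[i - j] for j in range(i))) / err
        updated = a[:]
        updated[i] = k
        for j in range(i):
            updated[j] = a[j] - k * a[i - 1 - j]
        a = updated
        err *= 1.0 - k * k
    return a


def synth_ar2(a1, a2, n, seed=0):
    """Generate a synthetic AR(2) signal driven by white Gaussian noise."""
    rng = random.Random(seed)
    x = []
    for t in range(n):
        value = rng.gauss(0.0, 1.0)
        if t >= 1:
            value += a1 * x[t - 1]
        if t >= 2:
            value += a2 * x[t - 2]
        x.append(value)
    return x


if __name__ == "__main__":
    # Illustrative only: recover the coefficients of a known AR(2) process.
    signal = synth_ar2(1.3, -0.6, 20000)
    print(lpc_coefficients(signal, 2))
```

On a synthetic signal with known coefficients, the estimate should land close to the values used to generate it, which gives a quick sanity check before applying the same routine to real cry recordings.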
References
[1] H. Karp, The Happiest Baby on the Block; Fully Revised and Updated Second Edition: The New Way to Calm Crying, New York City, NY, USA, 2015.
[2] J. A. Green, P. G. Whitney, and M. Potegal, "Screaming, yelling, whining and crying: categorical and intensity differences in vocal expressions of anger and sadness in children's tantrums," Emotion, vol. 5, no. 11, pp. 1124-1133, Oct. 2011.
[3] Y. Kheddache and C. Tadj, "Acoustic measures of the cry characteristics of healthy newborns and newborns with pathologies," Journal of Biomedical Science and Engineering, vol. 6, no. 8, 9 pages, 2013.
[4] L. Liu, K. Kuo, and S. M. Kuo, "Infant cry classification integrated ANC system for infant incubators," in Proc. IEEE International Conf. on Networking, Sensing and Control, Paris, France, 2013, pp. 383-387.
[5] L. Liu and K. Kuo, "Active noise control systems integrated with infant cry detection and classification for infant incubators," in Proc. Acoustic, pp. 1-6, 2012.
[6] L. LaGasse, A. Neal, and M. Lester, "Assessment of infant cry: acoustic cry analysis and parental perception," Ment. Retard. Dev. Disabil. Res. Rev., vol. 11, no. 1, pp. 83-93, 2005.
[7] G. Varallyay Jr., "Future prospects of the application of the infant cry in the medicine," Periodica Polytechnica Ser. El. Eng., vol. 50, no. 1-2, pp. 47-62, 2006.
[8] G. Buonocore and C. V. Bellieni, Neonatal Pain: Suffering, Pain and Risk of Brain Damage in the Fetus and Newborn, Berlin, Germany, Springer, 2008.
[9] L. L. LaGasse, R. Neal, and B. M. Lester, "Assessment of infant cry: acoustic cry analysis and parental perception," Mental Retardation and Developmental Disabilities Research Reviews, vol. 11, no. 1, pp. 83-93, 2005.
[10] L. Tan and J. Jiang, Digital Signal Processing: Fundamentals and Applications (3rd edition). Cambridge, MA, USA, Academic Press, 2017.
[11] Z. Ren, K. Qian, Z. X. Zhang, V. Pandit, A. Baird, and B. Schuller, "Deep scalogram representations for acoustic scene classification," IEEE/CAA J. Autom. Sinica, vol. 5, no. 3, pp. 662-669, May 2018.
[12] D. Yu and J. Li, "Recent progresses in deep learning based acoustic models," IEEE/CAA J. Autom. Sinica, vol. 4, no. 3, pp. 396-409, Apr. 2017.
[13] B. Gold and N. Morgan, Speech and Audio Signal Processing. New York, NY, USA, John Wiley & Sons, 2011.
[14] V. R. Fisichelli, S. Karelitz, C. F. Z. Boukydis, and B. M. Lester, "The cry latencies of normal infants and those with brain damage," Infant Crying, Plenum Press, 1985.
[15] C. F. Z. Boukydis and B. M. Lester, Infant Crying: Theoretical and Research Perspectives, Berlin, Germany, Springer Science and Business Media, 2012.
[16] S. Ludington-Hoe, X. Cong, and F. Hashemi, "Infant crying: nature, physiologic consequences, and select interventions," Neonatal Netw., vol. 21, no. 2, pp. 29-36, Mar. 2002.
[17] P. Dunstan, Calm the Crying: The Secret Baby Language That Reveals the Hidden Meaning Behind an Infant's Cry, New York City, NY, USA, Avery, 2012.
[18] M. Sahidullah and G. K. Saha, "Design analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition," Speech Communication, vol. 54, no. 4, pp. 543-565, May 2012.
[19] F. Katzberg, R. Mazur, M. Maass, P. Koch, and A. Mertins, "A compressed sensing framework for dynamic sound-field measurements," IEEE/ACM Trans. Audio, Speech, and Language Processing, vol. 26, no. 11, pp. 1962-1975, Jun. 2018.
[20] D. Needell and R. Ward, "Two-subspace projection method for coherent overdetermined systems," Journal of Fourier Analysis and Applications, vol. 19, no. 2, pp. 256-269, Apr. 2013.
[21] C. Lau, "Development of suck and swallow mechanisms in infants," Ann. Nutr. Metab., vol. 7, no. 5, pp. 7-14, July 2015.
[22] P. Runefors and E. Ambjonsson, "A sound spectrogram analysis of children's crying after painful stimuli during the first year of life," Folia Phoniatr. Logop., vol. 2, no. 57, pp. 90-95, Mar.-Apr. 2005.
