基于人脸视频的疲劳检测
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
疲劳驾驶是交通事故的一个重要原因,对财产和生命造成巨大的损失,驾驶员疲劳检测对提高交通安全具有非常重要的意义。驾驶员在疲劳时面部会出现许多特有的表现。本文提出了一个非侵入式系统,通过融合两个和疲劳相关的视觉特征来检测疲劳。文中提取了来自眼部、嘴部的视觉特征来分析疲劳,表现为疲劳时出现的眼睛闭合和打哈欠。
     人在疲劳时,眼睛闭合的时间和频率都会变大。为了判定眼睛是否闭合,本文首先在LBP处理的基础上,提取眼睛分区的直方图特征,然后用AdaBoost算法选择最具判别能力的直方图特征构建强分类器得到眼部的视觉特征。为了判定是否打哈欠,本文通过灰度投影定位左右嘴角位置,然后用Gabor小波提取左右嘴角的纹理特征,最后通过LDA分类判别得到嘴部的视觉特征。在决策层,本文提取的两个视觉特征通过信息融合的方法得到最终的疲劳判定结果。
     为了测试提出的算法,本文自建了一个人脸疲劳数据集合。在自然环境中,用摄像头采集了10人的面部视频数据。然后从中提取面部疲劳的图像,创建了人脸疲劳数据库。库中包括5女5男的1600幅图像。平均每个人有160幅图像(80幅正常图像和80幅疲劳的图像)。试验结果表明本文提出的方法比单个视觉特征可靠性和准确性都有比较大提高。
Driver fatigue is a major factor in the causes of automobile accidents.In recent years.Traffic transport project of our country is developing quickly, but the malign traffic transport is also ascending. 78.5 percent of deaths by all kinds of accidents is caused by traffic accident during 2002. The traffic accidents caused by driver fatigue took part about 20 percent, which exceeded 40 percent of the most serious traffic accidents. It is reported that traffic accidents caused by driver fatigue take large percent in America, Japanese and England and so on. It can be seen that driver fatigue has become the main hidden trouble of traffic accidents. Therefore, detecting driver fatigue is extremely important to improving transportation safety.
     Driver fatigue monitoring technologies can be divided into subjective methods and objective methods. Subjective methods get driver fatigue levels from subjective questionaries, note tables made by drivers, questionaries about the habit of sleep, and Stanford measurement tables of sleep. Objective methods detect driver fatigue from the parameters about drivers and cars. Research on fatigue monitoring can make the measures for the prevention of dangers caused by driver fatigue more convenient and direct. In the past decade, more and more attention has been paid to the technology research and equipment development for fatigue monitoring. We can divide the methods for driver fatigue monitoring into three categories: physiological parameters, individual characters of drivers and performance of cars under driver control based on the information source. When people are in fatigue state, their circadian characters vary much. Circadian signals have some metabolic rules. Investigators get people’s circadian signals such as electroencephalogram, cardiogram, electro-oculogram, electromyogram and so on through med equipment to detect drivers’fatigue in real time. Detecting fatigue based on drivers’characters is to get drivers’videos through cameras, analyze the videos using computer optic method and judge whether drivers are in fatigue state or not according the results of the characters information. When drivers are in fatigue state, the veracity, agility and smartness of their actions all will be influenced, so it is feasible to detect drivers’fatigue by how the cars are controlled by drivers.
     When a driver fatigues, he will take many special visual cues on his face. In this paper we present a novel framework based on computer vision for driver fatigue detection that combines various visual cues typically related to fatigue. We combine visual cues from mouths and eyes systematically which characterize eye closed and yawning to infer driver fatigue.
     Eyes’changes are the most important character to reflect peoples’fatigue state, when man is tired, his eyelids stir frequently and eyes close for longer time. Judge eyes’state through detecting eyes’character seen as a charater to detect fatigue. AdaBoost is used to extract the most discriminative features from the LBP features of eye areas and constructs a highly accurate classifier to get the eye visual cue. LBP arithmetic operator is not parameterization which describes local dimensional structure. LBP is a strong tool to describe texture. Because of strong texture-distinguish ability and smaller compute cost, LBP has been used in pattern recognition widely. Basic LBP arithmetic operator converts the 3*3 pixels around every pixel of the image into binary numeric according to the grey level of the centre and takes the result as a binary digit. LBP has 256 possible values, so LBP codes can denote 256 LBP modes, and every value denotes a LBP local mode. In this article, we use LBP8,2 arithmetic operator to distill character of every pixesl and compute the LBP value of every pixel, the original eye image and LBP image then come into being.
     The article uses Adaboost arithmetic to choose the LBP character of eyes and train strong categorizer. Adaboost arithmetic is the most popular Boosting arithmetic, and the AdaBoosting arithmetic advanced by Freund and Schapire gives a very simple and effective study method to choose character and not-linearly classfy.
     AdaBoost has achieved much success in detecting face and other application areas. AdaBoost is an ensemble study algorithm. It creates a series of weak categorizers. AdaBoost conclude advanced arithmetic and basic arithmetic, basic or weak arithmetic is actually to choose suitable discriminated function, advanced arithmetic is a iterative process, it chooses the weak categorizer according to their capabilities in the training gather, then a strong categorizer forms by the linear combination of weak categorizers. Yawning is also an important cue of driver fatigue. Fusion the cues of eye closed and mouth yawning can get a more accurate and robust fatigue detection. In the paper, to get the mouth cue, we propose to use gray projection to locate the left and right mouth corners, extract the texture features by Gabor wavelets, and apply LDA to classify the mouth features to detect yawning. When drivers yawn, the texture of the mouth has great change while the geometric feature changes. The texture of the mouth includes more information. Gabor wavelets are widely used in computer vision, texture analysis and object recognition. The two dimensional Gabor wavelets can get the best resolution in spatial field, temporal field and direction. In the experiments, we use a discrete form of the Gabor wavelets which include 5 frequencies and 8 orientations. The image convolves with the real and imaginary parts of the Gabor wavelets at the left and right moth corners. Then, at each moth corner, we get 40 complex Gabor wavelet coefficients. Through the Gabor transformation, we can represent the texture of the left and right mouth corners with a vector of 80 elements. The purpose of the Fisher linear discriminant is to find a feature projection space which minimizes the ratio of the within class scatter of the projected samples to the between class scatter. That is to say the samples of different classes are separated optimally i.e. the more the scatter between classes is, the better it is while the samples from the same class are congregated i.e. the less the scatter within the classed is, the better it is. So, the samples can be separated correctly in the projection space.
     The analysis and understanding of fatigue based on single character have some localization, and the dubious factors may influence veracity and dependability of some fatigue characters. Therefor, many researchers fuse many characters information to analyze fatigue state. Analyse of fatigue state through fusing many characters information is to distinguish whether fatigue brings or not through two or more of blink, staring direction, gesture of head, ship of mouth, and expression of face and so on. In the article, information amalgamation occurs in the decision-making level. The result through matching of eyes’clue and mouth’clue composites a character vector which is input into LDA, and the last decision-making comes from the matching of the LDA’s output and the stencil.
     The method advanced by the article is tested by ten people in real life, which contain changes of gestures and illumination. The result indicates that visual fatigue clues of the article have stronger dependability and veracity.
引文
[1] A.A.; Cosic, I.; Kumar, D.K.; Polus, B.; Da Costa, C.Power changes of EEG signals associated with muscle fatigue: the root mean square analysis of EEG bands.Abdul-latif,;Intelligent Sensors, Sensor Networks and Information Processing Conference, 2004. Proceedings of the 2004 14-17 Dec. 2004 Page(s):531 – 534.
    [2] Chin-Teng Lin; Ruei-Cheng Wu; Sheng-Fu Liang; Wen-Hung Chao; Yu-Jie Chen; Tzyy-Ping Jung;EEG-based drowsiness estimation for safety driving using independent component analysis. Circuits and Systems I: Regular Papers, IEEE Transactions on Volume 52, Issue 12, Dec. 2005 Page(s):2726 – 2738.
    [3] 王炳浩,魏建勤,吴永红.汽车驾驶员瞌睡状态脑电波特征的初步探索[J].汽车工程,2004,26(1):70~73.
    [4] Hayashi, K.; Ishihara, K.; Hashimoto, H.; Oguri, K.;Individualized drowsiness detection during driving by pulse wave analysis with neural network. Intelligent Transportation Systems, 2005. Proceedings. 2005 IEEE 13-15 Sept. 2005 Page(s):901 – 906.
    [5] 杨渝书,姚振强,李增勇等.心电图时频域指标在驾驶疲劳评价中的有效性研究[J].机械设计与制造,2002,(5):94~95.
    [6] Bonato P., Heng M.S.S., Gonzalez-Cueto J., Leardini A., O'Connor J., Roy S.H..EMG-based measures of fatigue during a repetitive squat exercise.Engineering in Medicine and Biology Magazine, IEEE Volume 20, Issue 6, Nov.-Dec. 2001 Page(s):133 – 143.
    [7] Chesler N., Durfee W..EMG As An Indicator Of Fatigue During Functional Electrical Stimulation. Engineering in Medicine and Biology Society, 1991. Vol.13: 1991., Proceedings of the Annual International Conference of the IEEE.31 Oct.-3 Nov. 1991 Page(s):926 – 927.
    [8] 郑培,宋正河,周一鸣. 机动车驾驶员驾驶疲劳测评方法的研究状况及发展趋势[J].中国农业大学学报,2001,6(6):101~105.
    [9] Wen-Bing Horng, Chih-Yuan Chen, Yi Chang.Driver Fatigue Detection Based on Eye Tracking and Dynamic'Template Matching.Proceedings of the 2004 IEEE International Confercncc on Nclvorkinp, Sensing & Control.Taipei, Taiwan, March 21-23, 2004.
    [10] Suzuki, S.; Yazu, Y.; Ninomija, S.P.;Character and application of winking pattern as alternative index for grouped α-wave.Engineering in Medicine and Biology Society, 1994. Engineering Advances: New Opportunities for Biomedical Engineers. Proceedings of the 16th Annual International Conference of the IEEE 3-6 Nov. 1994 Page(s):225 - 226 vol.1.
    [11] Brandt, T.; Stemmer, R.; Rakotonirainy, A.;Affordable visual driver monitoring system for fatigue and monotony.Systems, Man and Cybernetics, 2004 IEEE International Conference on Volume 7, 10-13 Oct. 2004 Page(s):6451 - 6456 vol.7.
    [12] De la Torre, F.; Rubio, C.J.G.; Martinez, E.;Subspace eyetracking for driver warning.Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on Volume 3, 14-17 Sept. 2003 Page(s):III - 329-32 vol.2.
    [13] Veeraraghavan, H.;Papanikolopoulos, N. P. Detecting driver fatigue through the use of advanced face monitoring techniques. ITS Institute Center for Transportation Studies USA.September 2001 CTS 01-05.
    [14] Wang Rang-ben ,Guo Ke-you, Shi Shu-ming, Chu Jiang-wei. A monitoring method of driver fatigue behavior based on machine vision. Proceedings on intelligent Vehicles Symposium. IEEE. 2003;110-113.
    [15] 周玉彬,俞梦孙,金章瑞,张洪金.驶员瞌睡的视频监测研究.北京生物医学工程第 22 卷第 1 期,2003 年 3 月.
    [16] Xia Liu; Youding Zhu; Fujimura, K..Real-time pose classification for driver monitoring .The IEEE 5th International Conference onIntelligent Transportation Systems, 2002. Proceedings. 2002 Page(s):174 – 178.
    [17] Qiang Ji; Zhiwei Zhu; Lan, P.;Real-time nonintrusive monitoring and prediction of driver fatigue.Vehicular Technology, IEEE Transactions on Volume 53, Issue 4, July 2004 Page(s):1052-1068.
    [18] Jie Zhu; Jie Yang;Subpixel eye gaze tracking.Automatic Face and Gesture Recognition, 2002. Proceedings. Fifth IEEE International Conference on 20-21 May 2002 Page(s):124 – 129.
    [19] Rongben Wang, Lie Guo, Bingliang Tong, Lisheng Jin, “Monitoring mouth movement for driver fatigue or distraction with one camera”, The 7th International IEEE Conference on Intelligent Transportation Systems, pp. 314-319, Oct. 2004.
    [20] Tiesheng Wang, Pengfei Shi. Yawning detection for determining driver drowsiness. Proceedings of 2005 IEEE International Workshop on VLSI Design and Video Technology, 2005. 28-30 May 2005 Page(s):373 – 376.
    [21] Dinges, D. F., Grace, R. PERCLOS: A Valid Psychophysiological Measure of Alertness As Assessed by Psychomotor Vigilance, US Department of Transportation, Federal Highway Administration. Publication Number FHWA-MCRT-98-006,1998.
    [22] Philip W Kithil , Roger D. Jones , John MacCuishi. Development of Driver Alertness Detection System Using Overhead Capacitive Sensor Array [A] . SAE Technical Paper Series 982292 , SAE International [C] ,1998. 2.
    [23] Thum Chia Chieh; Mustafa, M.M.; Hussain, A.; Zahedi, E.; Majlis, B.Y.;Driver fatigue detection using steering grip force.Research and Development, 2003. SCORED 2003. Proceedings. Student Conference on 25-26 Aug. 2003 Page(s):45 – 48.
    [24] Renner, G. Mehring, S. 1997, “Lane Departure and Drowsiness-Two major accident causes-One Safety System”, 4th World Congress on ITS, 21-24 October, 1997, ICC Berlin, Germany.
    [25] 石坚,吴远鹏,卓斌等.汽车驾驶员主动安全性因素的辨识与分析[J].上海交通大学学报. 2000 ,34(4) : 441~444.
    [26] Takei, Y.; Furukawa, Y.;Estimate of driver's fatigue through steering motion.Systems, Man and Cybernetics, 2005 IEEE International Conference on Volume 2, 10-12 Oct. 2005 Page(s):1765 - 1770 Vol. 2.
    [27] Jun Miao, Baocai Yin, Kongqiao Wang, Lansun Shen, Xuecun Chen, “A hierarchical multiscale and multiangle system for human face detection in a complex background using gravity-center template”, Pattern Recognition, Vol 32,No. 7, pp. 1237-1248, 1999.
    [28] Hamada, T.; Ito, T.; Adachi, K.; Nakano, T.; Yamamoto, S.;Detecting method for drivers' drowsiness applicable to individual features.Intelligent Transportation Systems, 2003. Proceedings. 2003 IEEE Volume 2, 12-15 Oct. 2003 Page(s):1405 - 1410 vol.2.
    [29] Wenhui Dong; Xiaojuan Wu;Fatigue detection based on the distance of eyelid. VLSI Design and Video Technology, 2005. Proceedings of 2005 IEEE International Workshop on28-30 May 2005 Page(s):365 – 368.
    [30] T. Ojala, M. Pietikainen, and D. Harwood, “A Comparative Study of Texture Measures with Classification Based on Feature Distributions,” Pattern Recognition, vol. 29, no. 1, pp. 51-59, Jan. 1996.
    [31] T. Ojala, M. Pietikainen, and T. Maenpaa, “Multiresolution Gray-Scale and Rotation Invariant Texture Classification Width Local Binary Patterns,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 971-987, July 2002.
    [32] Y. Freund and R.E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting”, Journal of Computer and System Sciences, 55(1):119–139, August 1997.
    [33] P. Viola and M. Jones, “Robust real time object detection”, In IEEE ICCV Workshop on Statistical and Computational Theories of Vision, Vancouver, Canada, July 13 2001.
    [34] Benoit, A.; Caplier, A.; “Hypovigilence analysis: open or closed eye or mouth? Blinking or yawning frequency?”, Advanced Video and Signal Based Surveillance, 2005. AVSS 2005. IEEE Conference on, 15-16 Sept. 2005 pp.207-212.
    [35] 杨英,盛敬,杨佳,周巍.基于神经网络的驾驶员觉醒水平双目标监测法.东北大学学报( 自然科学版),第 28 卷第 3 期,pp.418-421,2007 年 3 月.
    [36] Lu, Yufeng; Wang, Zengcai; “Detecting Driver Yawning in Successive Images”, Bioinformatics and Biomedical Engineering, 2007. ICBBE 2007. The 1st International Conference on. 6-8 July 2007 pp.581 -583.
    [37] L. Wiskott, J.M. Fellous, N. Kuiger, C. von der Malsburg, “Face recognition by elastic bunch graph matching”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol 19, No. 7, pp. 775–779, July 1997.
    [38] A.M. Martinez, A.C. Kak, “PCA versus LDA”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol 23, No. 2, pp. 228–233, Feb. 2001.
    [39] Fletcher, L.; Apostoloff, N.; Petersson, L.; Zelinsky, A.;Vision in and out of vehicles.Intelligent Systems, IEEE Volume 18, Issue 3, May-Jun 2003 Page(s):12 – 17.
    [40] Wilson, B.J.; Bracewell, T.D.;Alertness monitor using neural networks for EEG analysis.Neural Networks for Signal Processing X, 2000. Proceedings of the 2000 IEEE Signal Processing Society Workshop Volume 2, 11-13 Dec. 2000 Page(s):814 - 820 vol.2.
    [41] Wenchao Zhang, Shiguang Shan, Wen Gao, Yizheng Chang, Bo Cao, Peng Yang, “Information fusion in face identification”, Pattern Recognition, 2004, ICPR 2004, Proceedings of the 17th International Conference on, Volume 3, 23-26 Aug. 2004, pp. 950 - 953 Vol.3.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700