Early detection of gradual concept drifts by text categorization and Support Vector Machine techniques: The TRIO algorithm
详细信息    查看全文
文摘
During the normal operation of complex and risky industrial plants such as the nuclear or the aerospace ones, the safety heavily rests upon the capability of the diagnostic systems of detecting concept drifts which might imply incipient failures. In this paper we propound the TRIO algorithm for the online detection of signal drifts: the underlying idea is that a real signal may be categorized as correct or drifting by comparison with added sets of artificial signals known to be correct or drifted. More specifically, the TRIO algorithm is based on three performers, namely (i) a training set of artificial signals, (ii) the Text Categorization (TC) technique and (iii) the Support Vector Machine (SVM) technique. Initially, we construct an artificial training set constituted by one “correct” set of signals, embraced by two “suspect” sets of signals, the suspect-up and the suspect-down drifting signals. These signals are transformed in points within the signal space by the TC technique; then the SVM technique is applied for isolating the regions occupied by the suspect-up and by the suspect-down points. At this point the “artificial context” has been established and the real measurements come in. By resorting to the sliding window technique, at each epoch the actually measured data segment is analogously transformed into a point within the signal space and then declared correct or suspect (drifted) according to the region where it falls. In the latter case suitable actions must be taken by the plant operators. Numerical case-studies and a comparison with literature results are presented.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700