基于全方位优化算法的马田分类和排序评价方法研究及应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
数据挖掘是从数据中获取知识和信息,并进行决策的有效手段。分类是数据挖掘的重要任务,它根据样本的数据特征判定其所属类别。目前许多成熟有效且各具特点的分类方法得到了深入研究和广泛应用,但仍有各自的局限性和不足。在分类过程中,特征变量筛选通常能够起到简化问题、提高分类精度和效率的作用。
     马田系统(Mahalanobis-Taguchi System, MTS)是一种结合马氏距离、正交表和信噪比进行分类和诊断的新型模式识别方法。MTS具有的能够筛选重要特征变量、对数据分布不需要进行假设、分类速度快等优点使得它在工业生产、企业管理和模式识别等领域得到了广泛的应用。作为一种较新的分类方法,它在理论基础和方法使用上存在一些缺陷和严谨性问题,如:特征变量筛选方法有待改进、阈值确定主观性较强、局限于二类分类等。除了用于分类和诊断之外,MTS的特点使得它还可以用于排序(综合)评价。本文针对MTS的不足,以MTS改进为主线,以优化方法为主要手段,目标是发展MTS使其成为一种实用有效的分类和排序评价方法,并应用于现实问题。本文的研究工作有以下几个方面:
     (1)基于全方位优化算法的MTS二类分类方法研究
     针对传统MTS在分类过程中采用正交表和信噪比筛选特征变量以及采用损失函数法确定阈值存在的缺陷,使用多目标优化和全方位优化算法替代以进行改进。综合考虑分类精度、望大特性信噪比和降维程度等目标构建了多目标优化模型并用全方位优化算法求解以筛选特征变量和确定阈值;采用数据集实验和比较的方式对方法的有效性进行了验证和讨论;将方法用于产品质量检测的现实问题。研究结果表明,该方法不仅有很高的分类精度,还能有效的筛选特征变量,对传统MTS有了较大改进。
     (2)基于全方位优化算法和概率阈值模型的MTS不平衡数据分类方法研究
     类别不平衡是分类问题常常面临的挑战。MTS通过马氏距离构建一个连续测量尺度而非直接对训练样本进行学习的性质有望不受类别不平衡的影响,而阈值的合理确定对该问题更加重要。提出了一种概率阈值模型用于MTS不平衡数据分类阈值的确定;综合考虑不平衡数据分类性能评估指标的g/F值、望大型信噪比和降维程度等目标构建多目标优化模型并用全方位优化算法求解以筛选特征变量;采用数据集实验和比较的方式对方法的有效性进行检验和讨论。研究结果表明,该方法对不平衡数据有良好的分类能力,同时还能筛选重要特征变量。
     (3)MTS多类分类方法研究
     MTS是一种二类分类方法,不能直接用于多类分类。本文研究了两种MTS多类分类方法——二叉树MTS和多马氏空间特征变量筛选MTS。二叉树MTS通过二叉树与MTS的结合,将多类分类问题进行分解。研究了二叉树MTS的实施过程和步骤,以及二叉树的构建方案等。多马氏空间特征变量筛选MTS通过为每类构建马氏空间,运用距离判别思想构建分类器,同时优化特征空间。研究了多马氏空间特征变量筛选MTS的实施过程和步骤,综合考虑分类精度、改进的望大型信噪和降维程度等目标构建多目标优化模型并用全方位优化算法求解以筛选特征变量。采用数据集实验和比较的方式对两种方法的有效性进行了检验和讨论。最后将MTS多类分类方法应用于政府投融资平台企业的信用等级评价。研究结果表明,多马氏空间特征变量筛选MTS有着更高的分类精度和特征变量筛选效果,具有更高的应用价值。
     (4)MTS排序评价方法研究
     MTS目前主要用于分类问题。实际上,MTS可计算出样本相对于基准空间(马氏空间)的马氏距离,得出样本偏离的程度,从而对待测样本进行排序。本文研究MTS排序评价方法,具体包括:MTS排序评价方法的过程及步骤;基于全方位优化算法的评价指标筛选模型;采用算例和比较的方式对MTS排序评价的有效性进行分析和讨论。研究结果表明,MTS排序评价方法不需要确定指标权重,能够保持评价基准的一致,且能够筛选指标,是一种有效的评价方法,但基准空间的确定机制需要进一步研究和完善。
     综合以上研究工作,本文的主要贡献和创新点有:
     (1)识别不同的分类或排序评价目标,在MTS特征变量筛选这一核心问题中导入优化思想,创新性的提出和研究了特征变量筛选的多目标优化模型以替代传统MTS的正交表,并采用先进的全方位优化算法求解,是一种新的特征变量筛选方法。
     (2)根据不同的分类目的,采用优化或概率模型替代传统MTS的损失函数(或穷举法)来确定MTS进行分类时需要的阈值,这是MTS新的闽值确定办法。
     (3)通过概率闽值模型、二叉树和多马氏空间等手段,将MTS二类分类方法成功的扩展到了不平衡数据分类和多类分类,并验证了这些方法的有效性,是新的不平衡数据分类方法和多类分类方法。
Data mining is an effective means of accessing knowledge and information from the data for further decision-making. The classification problem is one of the main issues in data mining because it aims to extract a classifier which can be used to predict the classes of objects whose class labels are unknown based on the data characteristics. Currently, many mature and effective classification methods with distinct characteristics have been thoroughly studied and widely used, but still have their own limitations and shortcomings. Feature selection, which aims to reduce the number of features (dimensions), can play a role of improving the classification accuracy, simplifying the problem and cost-saving in process of classification.
     The Mahalanobis-Taguchi System (MTS) is a new classifying and diagnostic tech-nique of pattern recognition using a collection of methods of Mahalanobis distance (MD), orthogonal arrays (OAs) and signal-to-noise ratios (SNR). Advantages of MTS, such as can determine the important features, do not need to make assumptions about data distribution and high classification speed, etc., make it a wide range of applications in areas such as industrial production, business management and pattern recognition. As a relatively new classification method, it also has some shortcomings and rigorous problems about theoret-ical basis and usage, such as:lack of rigor for feature screening, subjectivity of determin-ing the threshold and confining to the two-class classification. In addition to be used for classification and diagnosis, the characteristics of MTS makes it also be used to sort evalu-ation. For shortages of the MTS, the paper aims to make the MTS an effective and practic-al classification and sort evaluation technique with the focusing on improving the theory of MTS, and apply it to real problems. These research works of the paper are the following:
     (1) Research on MTS based on omni-optimizer algorithm for two-class classification
     For the inadequacy of traditional MTS in feature selection by OAs and SNR as well as determining the threshold by loss function, multi-objective optimization and om-ni-optimizer algorithm are alternative for improvements. Classification accuracy and the-larger-the-better SNR as well as dimensionality reduction are considered as optimiza-tion goals to build a multi-objective optimization model, which is used for feature selection and threshold determining, is solved by omni-optimizer algorithm. For evaluating the ef-fectiveness of the proposed method, a dataset experiment for comparison is implemented. Finally, the proposed method is applied to a real case about product quality inspection. The results show that the proposed method outperforms other well-known algorithms not only on classification accuracy but also on feature selection efficiency.
     (2) Research on MTS based on omni-optimizer algorithm and probabilistic thre-sholding model for imbalanced data classification
     MTS establishes a classifier by constructing a continuous measurement scale rather than directly learning from the training set. Therefore, it is expected that the construction of an MTS model will not be influenced by data distribution, and this property is helpful to overcome the class imbalance problem. A probabilistic thresholding method is proposed to determine the classification threshold in MTS for imblanced data classification. Perfor-mance evaluation indicators of imbalanced data classification--g/F values and the-larger-the-better SNR as well as dimensionality reduction are considered as optimiza-tion goals to build a multi-objective optimization model, which is used for feature selection, is solved by omni-optimizer algorithm. For evaluating the effectiveness of the proposed method, a dataset experiment for comparison is implemented. The results show that the proposed method is effective not only on classification accuracy for imbalanced data but also on feature selection efficiency.
     (3) Research on multi-MTS for multi-class classification
     MTS is a method for two-class classification, which can not be directly used for mul-ti-class classification. Two multi-class MTS classification methods----binary tree MTS (BT-MTS) and multi-Mahalanobis-space(MS) feature-selection MTS (MF-MTS) are stu-died. BT-MTS decomposes multi-class problem by a combination of binary tree and MTS. The process and steps of implementation of BT-MTS and the construction programs for binary tree are studied. MF-MTS establishes classifier using distance criterion by estab-lishing an individual MS space for each class and can optimize the feature space at the same time. The process and steps of implementation of MF-MTS are studied. In MF-MTS, classification accuracy and the-larger-the-better SNR as well as dimensionality reduction are considered as optimization goals to build a multi-objective optimization model, which is used for feature selection, is solved by omni-optimizer algorithm. For evaluating the ef-fectiveness of the proposed two methods, a dataset experiment for comparison is imple-mented. Finally, the proposed methods are applied to a real case about credit rating of en-terprises on platform of government investment and financing. According to the results, we can conclude that the two multi-class MTS classification methods have high classification accuracy, but the MF-MTS is more valuable for application.
     (4) Research on MTS based on omni-optimizer algorithm for sort evaluation
     Currently, MTS is mainly used for classification problems. In fact, the MTS can sort the samples for evaluation by calculating the Mahalanobis distances of samples to refer-ence space (RS), which represents the degree of deviation from the RS. In this paper, MTS for sort evaluation is studied including:procedures of MTS for sort evaluation; variable screening model based on the omni-optimizer algorithm; validity and discussion of the MTS for sort evaluation by examples and comparative analysis. The results show that the MTS for sort evaluation method does not need to determine the index weight, can maintain consistent evaluation reference, and can screen index, which is an effective evaluation me-thod. But the mechanisms for identifying the reference space require further studies and perfect.
     Based on the above research work, the main contributions and innovations of this pa-per are as follows:
     (1) With identifying different goals for classification and sort evaluation, optimization ideas are imported in the core issue of MTS feature selection and multi-objective optimiza-tion model as an alternative to OAs is proposed and studied innovatively, which is solved by omni-optimizer algorithms. It is a new method for feature selection.
     (2) According to the different purposes of classification. optimization or probabilistic model is used to determine threshold of MTS classification, which replaces the loss of function or exhaustive search method. It is a new threshold determining way.
     (3) The MTS class classification method has been successfully extended to imba-lanced data classification and multi-class classification by means of probabilistic thresholding model, binary tree and multi-MS and the effectiveness of the methods are proved. The me-thods are new for imbalanced data classification and multi-class classification.
引文
[1]Liu H., Motoda H. Feature selection for knowledge discovery and data mining [M]. Boston:Kluwer Academic,1998
    [2]殷志伟.基于统计学习理论的分类方法研究[D].哈尔滨工程大学,博士论文,2009
    [3]Han J., Kamber M. Data mining:concepts and techniques [M]. San Francisco:Aca-demic Press,2001
    [4]Justin C., Victor R.J. Feature subset selection with a simulated annealing data mining algorithm [J]. Journal of Intelligent Information Systems,1997,9:57-81
    [5]Walczk B., Massart D.L. Rough sets theory [J]. Chemometrics and Intelligent Labora-tory Systems,1999,47:1-16
    [6]Johnson R.A., Wichern D.W. Applied multivariate statistical analysis [M]. Pren-tice-Hall,1998
    [7]Kim H., Koehler G.J. Theory and practice of decision tree induction [J]. Omega,1995, 23(6):637-652
    [8]Taguchi G., Jugulum R. New trends in multivariate diagnosis [J]. The Indian Journal of Statistics,2000,62B (2):233-248
    [9]Taguchi G., Chowdhury S., Wu Y. The Mahalanobis-Taguchi System [M]. McGraw-Hill, 2001
    [10]Taguchi G., Jugulum R. The Mahalanobis-Taguchi Strategy:A pattern technology sys-tem [M]. New York:John Wiley &Sons,2002
    [11]Ahmet S., Jagannathan S., Saygin C. Mahalanobis Taguchi System(MTS) as a prog-nostics tool for rolling element bearing failures [J]. Journal of Manufacturing Science and Engineering,2010,132(5):51014(1-12)
    [12]Khanzode, V., Maiti, J. Implementing Mahalanobis-Taguchi system to improve cast-ing quality in grey iron foundry [J]. International Journal of productivity and Quality Management,2008,3(4):444-456
    [13]Hadighi A., Mahdavi I. A new model for strategy formulation using Mahalano-bis-Taguchi System and clustering algorithm [J]. Intelligent Information Management, 2011,3:198-203
    [14]Wang Pa-Chun, Su Chao-Ton, Chen Kun-Huang, Chen Ning-Hung. The application of rough set and Mahalanobis distance to enhance the quality of OS A diagnosis[J]. Expert Systems with Applications,2011,38:7828-7836
    [15]Woodall W. H., Koudelik R., Tsui K. L., Kim S. B., Stoumbos Z. G., Carvounis C. P. A review and analysis of the Mahalanobis-Taguchi system [J]. Technometrics,2003,45(1): 1-15
    [16]Woodall W. H., Koudelik R., Tsui Kwok-Leung., Kim S. B., Stoumbos Z. G., Carvou-nis C. P. Response:A review and analysis of the Mahalanobis-Taguchi system [J]. Tech-nometrics,2003,45(1):29-30
    [17]Bovas A., Asokan Mulayath V. Discussion-A review and analysis of the Mahalano-bis-Taguchi system [J]. Technometrics,2003,45(1):22-25
    [18]Hawkins D. M. Discussion-A review and analysis of the Mahalanobis-Taguchi sys-tem [J]. Technometrics,2003,45(1):25-29
    [19]Rajesh J., Taguchi G., Taguchi S. Discussion-A review and analysis of the Mahala-nobis-Taguchi system [J]. Technometrics,2003,45(1):16-21
    [20]Pal, A., Maiti, J. Development of a hybrid methodology for dimensionality reduction in Mahalanobis-Taguchi system using Mahalanobis distance and binary particle swarm op-timization[J]. Expert Systems with Applications,2010,37:1286-1293
    [21]Taguchi G., Jugulum R., Taguchi S. Computer-based robust engineering:Essentials for DFSS [M]. Milwaukee, Wisconsin:ASQ Quality Press,2004
    [22]Nakatsugawa,M., Ohuchi,A. A study on determination of the threshold in MTS algo-rithm [J]. Transactions of the Institute of Electronics Information and Communication En-gineers,2001(4):519-527
    [23]Hong J., Cudney E. A., Taguchi G., Jugulum R., Paryani K., Ragsdell K. A comparison study of Mahalanobis-Taguchi system and neural network for multivariate pattern recogni-tion [C]. ASME IMECE Proceedings, Orlando, Florida,2005
    [24]Dimitris Liparas, Lefteris Angelis, Robert Feldt. Applying the Mahalanobis-Taguchi strategy for software defect diagnosis [J]. Autom Software Engineering,2012(19):141-165
    [25J Rai B., Chinnam R. B., Singh N. Prediction of drill-bit breakage from degradation signals using Mahalanobis-Taguchi system analysis [J]. International Journal of Industrial and Systems Engineering,2008,3 (2):134-148
    [26]Das P., Datta S. Exploring the effects of chemical composition in hot rolled steel product using Mahalanobis-Taguchi system [J]. Computational Materials Science,2007,38: 671-677
    [27]Cudney, E. A., Ragsdell K M., Paryani K. Identifying useful variables for vehicle braking using the adjoint matrix approach to the Mahalanobis-Taguchi system[C]. SAE World Congress & Exhibition, Detroit, MI, USA,2007
    [28]Cudney,E.A.,and Ragsdell,K.M. Forecasting using the Mahalanobis-Taguchi System in the presence of collinearity [C]. SAE World Congress & Exhibition, Detroit, MI, USA, 2006
    [29]Asada M. Wafer yield prediction by the Mahalanobis-Taguchi system [J]. IEEE Inter-national Workshop on Statistical Methodology,2001,6:25-28
    [30]Nakatsugawa M., Ohuchi A. A study on selection of the terms in MTS algorithm [J]. Transactions of the Institute of Electronics, Information and Communication Engineers A, 2002, J85-A(4):434-441
    [31]Prucha,T.E.,and Nath,R. New approach in non-destructive evaluation techniques for automotive castings [C]. Proceedings SAE World Congress, Detroit, Michigan,2003
    [32]Nagao M., Yamamoto M., Suzuki K., Ohuchi A. MTS approach to facial image rec-ognition[C]. Proc. IEEE Conf. On Systems, Man, and Cybernetics,1999, (4):937-942
    [33]Valarmathi B., Palanisamy V. Opinion mining of customer reviews using Mahalano-bis-Taguchi System [J]. European Journal of Scientific Research.2011,62(1):95-100
    [34]Mahalakshmi P., Ganesan K. Mahalanobis Taguchi system based criteria selection for shrimp aquaculture development [J]. Computers and Electronics in Agriculture,2009,65: 192-197
    [35]Srinivasaraghavan J.. Allada V. Application of Mahalanobis distance as a lean assess-ment metric [J]. International Journal of Advanced Manufacturing Technology,2006,29: 1159-1168
    [36]Aman H., Mochiduki N., Yamada H. A model for detecting cost-prone classes based on Mahalanobis-Taguchi method [J]. IEICE Transactions on Information and Systems, 2006, E89-D (4):1347-1358
    [37]Debnath R. M., Kumar S., Shankar R., Roy, R. K. Students'satisfaction in manage-ment education:study and insights [J]. Decision,2005,32(2):139-155
    [38]Morita H., Haba Y. Variable selection in data envelopment analysis based on external evaluation [C]. Proceedings of the Eighth Czech-Japan Seminar on Data Analysis and De-cision Making under Uncertainty,2005
    [39]李昭阳,韩之俊.一种新的判别预测方法——马田系统(MTS)[J].管理工程学报,2000,14(2):54-55
    [40]郑称德,韩之俊.MTS原理及其设计模型[J].管理工程学报,2000,14(3):43-47
    [41]郑称德.质量工程学的新进展[J].科技进步与对策,2002.4,101-104
    [42]曾江辉,曾凤章.基于模糊判别准则的马田系统基准空间优化[J].工业工程与管理,2008,3:52-55
    [43]曾江辉,曾凤章,陈嵩辉.马田系统与SVM相集成的模式识别技术研究[J].计算机工程与应用,2010,46(8):245-248
    [44]何桢,韩亚娟,李菊栋.马氏田口两种不同方法的比较研究[J].中国卫生统计,2007,24(5):531-535
    [45]何桢,韩亚娟.多元系统的马氏田口诊断与分析研究[J].数理统计与管理,2007,26(5):830-839
    [46]郑称德.MTS多类判别研究[J].管理工程学报,2003,17(4):106-109
    [47]许前,郑称德,韩之俊.MTS多类判别研究[J].南京理工大学学报,2002,26(1):92-95
    [48]王海燕,赵培标.实现CSI测评的P-M模糊测度空间的构建探析[J].预测,2003,22(4):69-71,31
    [49]王海燕.企业绩效管理模式的选择逻辑——基于CSI模糊识别模型的实证分析[J].管理世界,2006(9):94-100
    [50]薛跃,韩之俊,盛党红.MTS法用于上市公司财务质量评估初探[J].数理统计与管理,2005,24(1):81-85
    151]宗鹏,曾凤章.基于MTS的企业可持续发展评价体系研究[J].科学技术与工程,2006,6(8):1163-1166,1170
    [52]Hu Jinqiu, Zhang Laibin, Liang Wei, Wang Zhaohui. Incipient mechanical fault detec-tion based on multifractal and MTS methods [J]. Petroleum Science,2009, (6):208-216
    [53]钟晓芳,韩之俊.评价计测仪器精度的一种新方法[J].计量与测试技术,2004(6):27-28
    [54]王雪,李勇.零件形状误差的MTS测量识别方法[J].1电测与仪表,2004,41(461):7-10,6
    [55]Huang Ching-Lien, Hsu Tsung-Shin, Liu Chih-Ming. The Mahalanobis-Taguchi sys-tem-neural network algorithm for data-mining in dynamic environments [J]. Expert Sys-tems with Applications,2009,36:5475-5480
    [56]Su Chao-Ton, Hsiao, Yu-Hsiang. An evaluation of the robustness of MTS for imba-lanced data [J]. IEEE Transactions on Knowledge and Data Engineering,2007,19(10): 1321-1332
    [57]Wang H., Chiu C, Su C. Data classification using Mahalanobis-Taguchi system[J]. Journal of the Chinese Institute of Industrial Engineers,2004,21(6):606-618
    [58]Yang Taho, Cheng Yuan-Ting. The use of Mahalanobis-Taguchi System to improve flip-chip bumping height inspection efficiency [J]. Microelectronics Reliability,2010,50: 407-414
    [59]Lee Yu-Cheng, Teng Hsiao-Lin. Predicting the financial crisis by Mahalano-bis-Taguchi system:Examples of Taiwan's electronic sector [J]. Expert Systems with Ap-plications,2009,36:7469-7478
    [60]Hsiao Y. H., Su C. T. Multiclass MTS for saxophone timbre quality inspection using waveform-shape-based features [J]. IEEE Transactions on Systems, Man and Cybernetics, 2009,39(3):690-704
    [61]Niu Gang, Satnam Singh, Steven W. Holland, Michael Pecht. Health monitoring of electronic products based on Mahalanobis distance and Weibull decision metrics [J]. Mi-croelectronics Reliability,2011,51:279-284
    [62]Khan N. M., Ksantini R., Ahmad I. S., Boufama B. A novel SVM-NDA model for classification with an application to face recognition [J]. Pattern Recognition,2012,45(1): 66-79
    [63]Richard O. H., Peter E. S., David G. Pattern Classification [M]. Wiley-Interscience, 2000
    [64]Airolaa A., Pahikkalaa T., Waegemanc W., Baetsc B. D., Salakoskia T. An experimen-tal comparison of cross-validation techniques for estimating the area under the ROC curve [J]. Computational Statistics & Data Analysis,2011,55(4):1828-1844
    [65]Vapnik V. The nature of statistical learning theory[M]. New York:Springer,1995
    [66]邓乃扬,田英杰.支持向量机:理论、算法与拓展[M].北京:科学出版社,2009
    [67]Vapnik V N. Estimation of dependences based on empirical data [M]. Berlin:Springer, 1982
    [68]Osuna E, Freund R, Girosi F. Improved training algorithm for support vector ma-chines [M]. Los Alamitos, C A:IEEE Computer Society,1997
    [69]Wang Xiangyang, Chen Jingwei, Yang Hongying. A new integrated SVM classifiers for relevance feedback content-based image retrieval using EM parameter estimation [J]. Applied Soft Computing,2011,11(2):2787-2804
    [70]张仁杰,庄松林,臧道青.基于聚类分析与支持向量机模型的缸盖座圈图像判别[J].光学精密工程,2011,19(10):2478-2484
    [71]Karim S., Mojtaba K., Majid S., Khoshro. Fault detection and diagnosis of an indus-trial steam turbine using fusion of SVM (support vector machine) and ANFIS (adaptive neuro-fuzzy inference system) classifiers [J]. Energy,2010,35(12):5472-5482
    [72]张付志,周全强.一种融入可信度的集成SVM垃圾书签检测方法[J].模式识别 与人工智能,2011,24(8):591-596
    [73]吴娅辉,李新良,洪宝林,张大治.基于SVM和广义粗糙度特征的航空发动机振动故障诊断方法[J].航空动力学报,2011,26(11):2445-2449
    [74]徐图,何大可.超球体多类支持向量机理论[J].控制理论与应用,2009,26(11):1293-1297
    [75]刁华志,赵春江,郭新宇,陆声链.一种新的基于平衡决策树的SVM多类分类算法[J].控制与决策,2011,26(1):149-152
    [76]Sabbi H., Geman D., Perona P. A hierachy of support vector machines for pattern de-tection [J]. Journal of Machine Learning Research,2006,7(10):2087-2123
    [77]Lin H. T., Lin C J. A study on sigmoid kernels for SVM and the training of non-PSD kernels by smo-type methods [D]. TaiPei, Taiwan:National Taiwan University,2003
    [78]周晓剑,马义中,朱嘉钢.SMO算法的简化及其在非正定核条件下的应用[J].计算机研究与发展,2010,47(11):1962-1969
    179]崔建,李强,刘勇,宗大伟.基于决策树的快速SVM分类方法[J].系统工程与电子技术,2011,(11):2558-2563
    [80]Yao Ping, Lu Yongheng. Neighborhood rough set and SVM based hybrid credit scor-ing classifier [J]. Expert Systems with Applications,2011,38(9):11300-11304
    [81]Wu Qi, Law Rob. The complex fuzzy system forecasting model based on fuzzy SVM with triangular fuzzy number input and output [J]. Expert Systems with Applications,2011, 38(10):Pages 12085-12093
    [82]Chen Zhenyu, Li Jianping, Wei Liwei, Xu Weixuan, Shi Yong. Multiple-kernel SVM based multiple-task oriented data mining system for gene expression data analysis [J]. Ex-pert Systems with Applications,2011,38(10):12151-12159
    [83]Guyon I., Weston J., Barnhill S., Vapnik V. Gene selection for cancer classification using support vector machines [J]. Machine Learning,2002,46:389-422
    [84]Rakotomamonjy A. Variable selection using SVM-based criteria [J]. Journal of Ma-chine Learning Research,2003(3):1357-1370
    [85]Langley P., Sage S. Induction of selective bayesian classifiers [C]. Proceedings of the 10th Conference on Uncertainty in Artificial Intelligence. San Fransisco:Morgan Kauf-mann Publishers Inc,1994:339-406
    [86]赵军阳,张志利.基于最大互信息最大相关熵的特征选择方法[J].计算机应用研究,2009,26(1):233-235
    [87]赵文清.基于选择性贝叶斯分类器的变压器故障诊断[J].电力自动化设备,2011,31(2):44-47
    [88]程玉虎,仝瑶瑶,王雪松.类相关性影响可变选择性贝叶斯分类器[J].电子学报,2011,39(7):1628-1633
    [89]Chai Xiaoyong, Deng Lin, Yang Qiang. Test-cost sensitive naive bayes classification [C]. The Fourth IEEE International Conference on Data Mining,2004:51-58
    [90]胡邦辉,袁野,王学忠,丛爱丽.基于贝叶斯分类方法的雷暴预报[J].解放军理工大学学报,2010,11(5):578-584
    [91]Friedman N., Geiger D., Goldszmidt M. Bayesian network classifiers [J]. Machine Learning,1997,29(2/3):131-163
    [92]Lanterman A., Schwarz W., Rissanen. Intertwining themes in theories of model selec-tion[J]. International Statistical Review,2001,69(2):185-212
    [93]蔡志强,孙树栋,Bernard Yannou,司书宾.条件贝叶斯网络分类器及其在产品故障率等级分类中的应用[J].计算机集成制造系统,2010,16(2):417-422
    [94]马国普,沙基昌,陈俊良,陈超,姜鑫.动态贝叶斯网络与黑板机制结合的协同决策方法[J].系统工程与电子技术,2010,32(12):2590-2594
    [95]姜维,李一军.基于贝叶斯网络推理的导弹目标类型识别[J].计算机集成制造系统,2011,17(6):1264-1270
    [96]Gupta S., Kim H. W. Linking structural equation modeling to Bayesian networks:De-cision support for customer retention in virtual communities [J]. European Journal of Op-erational Research,2008,190:818-833
    [97]叶世伟,史忠植.神经网络原理[M].北京:机械工业出版社,2004
    [98]Xiao Zhi, Ye Shi-jie, Zhong Bo. BP neural network with rough set for short term load forecasting [J]. Expert Systems with Application,2009,36(1):273-279
    [99]管春,周雒维,卢伟国.基于多标签RBF神经网络的电能质量复合扰动分类方法[J].电工技术学报,2011,26(8):198-204
    [100]Li Ruqiang, Chen Jin, Wu Xing. Fault diagnosis of rotating machinery using know-ledge-based fuzzy neural network [J]. Applied Mathematics and Mechanics:English Edi-tion,2006,27(1):99-108
    [101]王崇倡,武文波,张建平.基于径向基函数神经网络的高光谱遥感图像分类[J].光谱学与光谱分析,2008,28(9):32-35
    [102]柯孔林,冯宗宪.基于粗糙集和神经网络集成的贷款风险5级分类[J].控制理论与应用,2008,25(4):759-763
    [103]李巍华,张盛刚.基于改进证据理论及多神经网络融合的故障分类[J].机械工程学报,2010,46(9):93-99
    [104]栾丽华,吉根林.决策树分类技术研究[J].计算机工程,2004,30(9):94-96
    [105]Aitkenhead M. J. A co-evolving decision tree classification method [J]. Expert Sys-tems with Applications,2008,34(1):18-25
    [106]Jesus F, Jorge T. A two-step approach to satellite image classification using fuzzy neural networks and the ID3 learning algorithm [J]. Expert Systems with Applications, 1998,14(1):211-218
    [107]Kemal P., Salih G. A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems [J]. Expert Systems with Applications,2009,36(2):1587-1592
    [108]Stuart L., Crawford. Extensions to the CART algorithm [J]. International Journal of Man-Machine Studies,1989,31(2):197-217
    [109]Chandra B., Varghese P. P. Moving towards efficient decision tree construction [J]. Information Sciences,2009,179(8):1059-1069
    [110]Li Xiaobai. A scalable decision tree system and its application in pattern recognition and intrusion detection [J]. Decision Support Systems,2005,41(1):112-130
    [111]Sun Weixiang, Chen Jin, Li Jiaqing. Decision tree and PCA-based fault diagnosis of rotating machinery[J]. Mechanical Systems and Signal Processing,2007,21(3): 1300-1317
    [112]Mevlut Ture, Fusun Tokatli, Imran Kurt. Using Kaplan-Meier analysis together with decision tree methods (C&RT, CHAID, QUEST, C4.5 and ID3) in determining recur-rence-free survival of breast cancer patients [J]. Expert Systems with Applications,2009, 36(2):2017-2026
    [113]Kim F. L., Jane W. M. Combining discriminant methods in solving classification problems in two-group discriminant analysis [J]. European Journal of Operational Re-search,2002,138(2):294-301
    [114]Shelley B. B., Carmen M., Celia M. T. G. A modified score function estimator for multinomial logistic regression in small samples [J]. Computational Statistics & Data Analysis,2002,39(1):57-74
    [115]Thanh N. T., Ron W., Lutgarde M. C., Buydens. KNN-kernel density-based cluster-ing for high-dimensional multivariate data [J]. Computational Statistics & Data Analysis, 2006,51(2):513-525
    [116]陈云芳,王汝传.基于免疫算法的分类器设计[J].计算机科学,2008,35(12):133-135
    [117]Zitzler E.,Thiele, L. Multiobjective evolutionary algorithms:a comparative case study and the strength pareto approach [J]. IEEE Transactions on Evolutionary Computa- tion,1999,3(4):257-271
    [118]Schaffer J. D. Multiple objective optimization with vector evaluated genetic algo-rithms[C]. In Proc.of Inter. Conf. Genetic Algorithms and Their Applications, Pittsburgh, PA,1985:93-100
    [119]Haimes Y. Y., Lasdon L. S., Wismer D. A. On a bicriterion formulation of the prob-lems of integrated systems identification and system optimization [J]. IEEE Transactions on System, Man, Cybernetics,1971,1(3):296-297
    [120]Zitzler E. SPEA-II:improving the strength Pareto evolutionary algorithm [J]. Swiss Federal Institute of Technology, Lausanne, Switzerland.Tech.Rep. TIK-Rep.2001:103
    [121]Deb K., Pratap A., Agrawal S., Meyarivan T. A fast and elitist multiobjective genetic algorithm:NSGA-II [J]. IEEE Transactions on Evolutionary Computation,2002,6(2): 182-197
    [122]Juergen B., Sanaz M. About selecting the personal best in multi-objective particle swarm optimization [J]. In Parallel Problem Solving from nature,2006,20(3):523-532
    [123]Molina J., Laguna M., Marti R., Caballero R. SSPMO:A scatter tabu search proce-dure for non-linear multiobjective Optimization [J]. Informs Journal on Computing,2007, 19(1):91-100
    [124]Deb K., Tiwari S. Omni-optimizer:A generic evolutionary algorithm for single and multi-objective optimization [J]. European Journal of Operational Research,2008(15): 1062-1087
    [125]Deb K., Agrawal R. B. Simulated binary crossover for continuous search space [J]. Complex Systems,1995,9(2):115-148
    [126]Deb K., Goyal M. A combined genetic adaptive search (geneas) for engineering de-sign [J]. Computer Science and Informatics,1996,26(4):30-45
    [127]Deb K., Mohan M., Mishra S. Evaluating the ε-domination based multi-objective evolutionary algorithm for a quick computation of pareto-optimal solutions [J]. Evolutio-nary Computation Journal.2005,13(4):501-525
    [128]Duda R., Hart P., Stork D. Pattern Classification [M]. Wiley,2001
    [129]Kalousis A., Prados J., Hilario M. Stability of feature selection algorithms [J]. Fifth IEEE International Conference on Data Mining,2005
    [130]Yu L., Ding C., Loscalzo S. Stable feature selection via dense feature groups [J]. Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
    [131]Luukka P. Similarity classifier using similarity measure derived from Yu's norms in classification of medical data sets [J]. Computers in Biology and Medicine,2007,37: 1133-1140
    [132]Brown G. Diversity in neural network ensembles [D]. Ph.D. dissertation, School of Computer Science, University of Birmingham,2003
    [133]Eggermont J., Kok J., Kosters W. A. Genetic programming for data classification: partitioning the search space [C].19th Annual ACM Symposium on Applied Computing (SAC'04),2004:1001-1005
    [134]Luukka P. Similarity classifier using similarities based on modified probabilistic equivalence relations [J]. Knowledge-Based Systems 2009,22:57-62
    [135]Shih L., Rennie J. D. M., Chang Y. H., Karger D. R. Text bundling:statistics-based data reduction [C]. Proc.20th Int'l Conf. Machine Learning (ICML),2003.
    [136]Li X.B. Data reduction via adaptive sampling [J]. Comm. Information and Systems, 2002,2(1):53-68
    [137]Liu H., Mtotda H. Instance selection and construction for data mining. Kluwer Aca-demic Publishers,2001.
    [138]Japkowicz N. Learning from imbalanced data sets:a comparison of various strate-gies [C]. Learning from Imbalanced Data Sets:The AAAI Workshop,2000:10-15
    [139]Kubat M., Holte R. C., Matwin S. Machine learning for the detection of oil spills in satellite radar images [J]. Machine Learning,1998,30(2-3):195-215
    [140]Phua C., Alahakoon D. Minority report in fraud detection:classication of skewed data [J]. SIGKDD Explorations,2004,6(1):50-59
    [141]Castillo M. D., Serrano J. I. A multistrategy approach for digital text categorization from imbalanced documents [J]. SIGKDD Explorations,2004,6(1):70-79
    [142]Zheng Zhao hui, WU X., Srihari R. K. Feature selection for text categorization on imbalanced data [J]. SIGKDD Explorations,2004,6(1):80-89
    1143] Japkowicz N., Stephen S. The class imbalance problem:A systematic study [J]. In-telligent Data Analysis,2002,6(5):203-231
    [144]Maloof M. A. Learning when data sets are imbalanced and when costs are unequal and unknown [C]. ICML-2003 Workshop on Learning from Imbalanced Data Sets II
    [145]Batista G., Prati R. C., Monard M. C. A study of the behavior of several methods for balancing machine learning training [J]. Data SIGKDD Explorations,2004,6(1):20-29
    [146]Guo H., Viktor H. L. Learning from imbalanced data sets With boosting and data generation:the databoost-IM approach [J]. SIGKDD Explorations,2004,6(1):30-39
    [147]Pendharkar P. C., Rodger J. Yaverbaum A., G. J., Herman N., Benner M. Association, statistical, mathematical and neural approaches for mining breast cancer patterns [J] Expert System with Applications,1993,17:223-232
    [148]Grzymala-Busse J. W., Stefanowski J., Wilk S. A comparison of two approaches to data mining from imbalanced data [J]. Lecture Notes in Computer Science,2004,3213: 757-763
    [149]Huang K., H Yang H., King I., Lyu M. Learning classifiers from imbalanced data based on biased minimax probability machine[C]. Proc. IEEE CS Conf. Computer Vision and Pattern Recognition (CVPR'04),2004,558-563
    [150]Chawla N.V., Bowyer K., Hall L., Kegelmeyer W. SMOTE:synthetic minority ovv-er-sampling Technique [J]. J. Artificial Intelligence Research,2002,16:231-357
    [151]曾志强,吴群,廖备水,高济.一种基于核SMOTE的非平衡数据集分类方法[J].电子学报,2009,37(11):2489-2495
    [152]Wu G., Chang E. Y. KB A:kernel boundary alignment considering imbalanced data distribution [J]. IEEE Transactions on Knowledge and Data Engineering,2005,17(6): 786-794
    [153]Wu G., Chang E. Adaptive feature-space conformal transformation for imbalanced data learning [C]. Proc.20th Int'l Conf. Machine Learning (ICML'03),2003:816-823
    [154]薛贞霞,刘三阳,刘万里.2v-SSPC——一种不平衡数据分类方法[J].系统工程与电子技术,2008,30(12):2471-2476
    [155]陈思,郭躬德,陈黎飞.基于聚类聚合的不平衡数据分类方法[J].模式识别与人工智能,2010,23(12):772-780
    [156]韩敏,朱新荣.不平衡数据分类的混合算法[J].控制理论与应用,2010,28(]0):1485-1489
    [157]Cun Y.L., Boser B., Denker J., Hendersen D., Howard R., Hubbard W., Jackel L Backpropagation applied to handwritten zip code recognition [J]. Neural Computation, 1989,1:541-551
    [158]Chapelle O., Haffner P., Vapnik V. N. Support vector machines for histogram-based image classification [J]. IEEE Transactions on Neural Networks,1999,10(5):1055-1064
    [159]Shami M., Verhelst W. An evaluation of the robustness of existing supervised ma-chine learning approaches to the classification of emotions in speech [J]. Speech Comm., 2007,49(3):201-212
    [160]Thamarai S., Arumugam S., Ganesan L. BIONET:An artificial neural network mod-el for diagnosis of diseases [J]. Pattern Recognition Letters,2000,21(8):721-740
    [161]Lam W., Ruiz M., Srinivasan P. Automatic text categorization and its application to text retrieval [J]. IEEE Transancations on Knowledge and Data Engineering,1999,11(6): 865-879
    [162]Hsu C. W., Lin C. J. A comparison of methods for multiclass support vector ma-chines [J]. IEEE Transanctions on Neural Networks,2002,13(2):415-425
    [163]Anand R., Mehrotra K., Mohan C. K., Ranka S. Efficient classification for multiclass problems using modular neural networks [J]. IEEE Transanctions on Neural Networks, 1995,6(1):117-124
    [164]王翼宁.我国地方政府融资平台的风险控制预警体系与市场化运作研究[J].现代管理科学,2010(11):33-34
    [165]孙继伟,王波.政府融资平台贷款的信用风险指标评价及实证研究[J].世界经济文汇,2011(3):98-109
    [166]杨建明.中国旅游业发展空间差异的综合评判[J].地理科学,2009,29(4):613-621
    [167]陆林,余风龙.中国旅游经济差异的空间特征分析[J].经济地理,2005,25(3):406-414

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700