参考文献:1. Todeschini, R, Consonni, V (2009) Molecular descriptors for chemoinformatics. Wiley-VCH, Weinheim 9783527628766" target="_blank" title="It opens in new window">CrossRef 2. Todeschini R, Consonni V, Pavan M (2002) DRAGON Software version 2.1. Milano Chemometric and QSAR Research Group. Milano 3. Guha R (1991) The CDK descriptor calculator, 0.94th edn. Indiana 4. Yap, CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32: pp. 1466-1474 CrossRef 5. Georg, H (2008) BlueDesc-molecular descriptor calculator. University of T眉bingen, T眉bingen 6. Liu, J, Feng, J, Brooks, A, Young, S (2005) PowerMV. National Institute of Statistical Sciences, Research Triangle Park 7. ADRIANA. Code (2011) Molecular Networks. Erlangen, Germany 8. Hong, H, Xie, Q, Ge, W, Qian, F, Fang, H, Shi, L, Su, Z, Perkins, R, Tong, W (2008) Mold2, molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics. J Chem Inf Comput Sci 48: pp. 1337-1344 CrossRef 9. Kellogg GE (2001) Molconn-Z 4.0 edn. eduSoft, Virginia 10. Liu H, Motoda H (2008) Less is More. In: Liu H, Motoda H (eds) Computational methods of feature selection. Data mining and knowledge discovery series. Taylor * Francis Group, Boca Raton, p 411 11. Wolpert, DH, Macready, WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1: pp. 67-82 9/4235.585893" target="_blank" title="It opens in new window">CrossRef 12. Venkatraman, V, Dalby, AR, Yang, ZR (2004) Evaluation of mutual information and genetic programming for feature selection in QSAR. J Chem Inf Comput Sci 44: pp. 1686-1692 9933v" target="_blank" title="It opens in new window">CrossRef 13. Yu L, Liu H (2003) Feature selection for high-dimensional data: a fast correlation-based filter solution. In: Proceedings of the Twentieth international conference on machine learning, Washington DC 14. Kira, K, Rendell, L (1992) The feature selection problem: traditional methods and a new algorithm. Association for the advancement of artificial intelligence. AAAI Press and MIT Press, Cambridge, pp. 129-134 15. Kullback, S, Leibler, RA (1951) On information and sufficiency. Ann Math Stat 22: pp. 79-86 9694" target="_blank" title="It opens in new window">CrossRef 16. Jeffreys, H (1946) An invariant form for the prior probability in estimation problems. Proc Roy Soc A 186: pp. 453-461 98/rspa.1946.0056" target="_blank" title="It opens in new window">CrossRef 17. Jennifer GD (2008) Unsupervised Feature Selection. In: Liu H, Motoda H (eds) Computational methods of feature selection. Data mining and knowledge discovery series. Taylor & Francis Group, Boca Raton, p 411 18. Varshavsky, R, Gottlieb, A, Linial, M, Horn, D (2006) Novel unsupervised feature filtering of biological data. Bioinformatics 22: pp. e507-e513 93/bioinformatics/btl214" target="_blank" title="It opens in new window">CrossRef 19. Maldonado, AG, Doucet, JP, Petitjean, M, Fan, B-T (2006) Molecular similarity and diversity in chemoinformatics: from theory to applications. Mol Divers 10: pp. 39-79 97-1" target="_blank" title="It opens in new window">CrossRef 20. Godden, JW, Stahura, FL (2000) Variability of molecular descriptors in compound databases revealed by Shannon entropy calculations. J Chem Inf Comput Sci 40: pp. 796-800 CrossRef 21. Godden, JW, Bajorath, J (2002) Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis. J Chem Inf Comput Sci 42: pp. 87-93 CrossRef 22. Barigye, SJ, Marrero-Ponce, Y, P茅rez-Gim茅nez, F, Bonchev, D (2014) Trends in information theory-based chemical structure codification. Mol Divers 18: pp. 673-686 9517-7" target="_blank" title="It opens in new window">CrossRef 23. Witten IH, Eibe F, Hall MA (2011) Data mining: practical machine learning tools and techniques. The Morgan Kaufmann series in data management systems, 3rd edn. Morgan Kaufmann, Burlington 24. Alter, O, Brown, PO, Botstein, D (2000) Singular value decomposition for genome-wide expression data processing and modeling. Proc Natl Acad Sci USA 97: pp. 10101-10106 97.18.10101" target="_blank" title="It opens in new window">CrossRef 25. Devakumari D, Thangavel K (2010) Unsupervised adaptive floating search feature selection based on contribution entropy. In: 2010 international conference on communication and computational intelligence (INCOCCI), pp 623鈥?27 26. Dash M, Choi K, Scheuermann P, Huan L (2002) Feature selection for clustering鈥攁 filter solution. In: Proceedings of the 2002 IEEE international conference on data mining (ICDM 2003), pp 115鈥?22. doi:10.1109/icdm.2002.1183893 27. Stahura, FL, Godden, JW, Bajorath, J (2002) Differential Shannon entropy analysis identifies molecular property descriptors that predict aqueous solubility of synthetic compounds with high accuracy in binary QSAR calculations. J Chem Inf Comput Sci 42: pp. 550-558 CrossRef 28. Wassermann, AM, Nisius, B, Vogt, M, Bajorath, J (2010) Identification of descriptors capturing compound class-specific features by mutual information analysis. J Chem Inf Model 50: pp. 1935-1940 9n" target="_blank" title="It opens in new window">CrossRef 29. Cover, TM, Thomas, JA (1991) Elements of Information theory. Wiley, New York CrossRef 30. Desurvire, E (2009) Classical and quantum information theory. Cambridge University Press, New York 9780511803758" target="_blank" title="It opens in new window">CrossRef 31. Quinlan JR (1983) Learning efficient classification procedures and their application to chess end games. In: Michalski R, Carbonell J, Mitchell T (eds) Machine learning. Symbolic computation. Springer, Berlin, pp 463鈥?82. doi:10.1007/978-3-662-12405-5_15 32. Press, WH, Flannery, BP, Teukolsky, SA, Vetterling, WT (1988) Numerical recipes in C: the art of scientific computing. Cambridge University Press, New York 33. Consonni, V, Todeschini, R, Pavan, M, Gramatica, P (2002) Structure/response correlations and similarity/diversity analysis by GETAWAY descriptors. Part 2. Application of the novel 3D molecular descriptors to QSAR/QSPR studies. J Chem Inf Comput Sci 42: pp. 693-705 CrossRef 34. P茅rez Gonz谩lez, M, Ter谩n, C, Teijeira, M, Gonz谩lez-Moa, MJ (2005) GETAWAY descriptors to predicting A2A adenosine receptors agonists. Eur J Med Chem 40: pp. 1080-1086 CrossRef 35. Saiz-Urra, L, P茅rez Gonz谩lez, M (2007) Quantitative structure-activity relationship studies of HIV-1 integrase inhibition.1. GETAWAY descriptors. Eur J Med Chem 42: pp. 64-70 CrossRef 36. Fedorowicz, A, Singh, H, Soderholm, S, Demchuk, E (2005) Structure鈥揳ctivity models for contact sensitization. Chem Res Toxicol 18: pp. 954-969 97806" target="_blank" title="It opens in new window">CrossRef 37. Saiz-Urra, L, P茅rez Gonz谩lez, M (2006) QSAR studies about cytotoxicity of benzophenazines with dual inhibition toward both topoisomerases I and II: 3D-MoRSE descriptors and statistical considerations about variable selection. Bioorg Med Chem 14: pp. 7347-7358 CrossRef 38. Gasteiger, J, Sadowski, J, Schuur, J, Selzer, P, Steinhauer, L, Steinhauer, V (1996) Chemical information in 3Dspace. J Chem Inf Comput Sci 36: pp. 1030-1037 960343+" target="_blank" title="It opens in new window">CrossRef 39. Gasteiger, J, Schuur, J, Selzer, P, Steinhauer, L, Steinhauer, V (1997) Finding the 3D structure of a molecule in its IR spectrum. Fresen J Anal Chem 359: pp. 50-55 CrossRef 40. Schuur, J, Selzer, P, Gasteiger, J (1996) The coding of the three-dimensional structure of molecules by molecular transforms and its application to structure-spectra correlations and studies of biological activity. J Chem Inf Comput Sci 36: pp. 334-344 950164c" target="_blank" title="It opens in new window">CrossRef 41. Baumann, K (1999) Uniform-length molecular descriptors for quantitative structure-property relationships (QSPR) and quantitative structure-activity relationships (QSAR): classification studies and similarity searching. TRAC 18: pp. 36-46 42. Jelcic, Z (2004) Solvent molecular descriptors on poly(D, L-lactide-co-glycolide) particle size in emulsification-diffusion process. Coll Surf A Physico-Chem Eng Asp 242: pp. 159-166 CrossRef 43. Todeschini, R, Bettiol, C, Giurin, G, Gramatica, P, Miana, P, Argese, E (1996) Modeling and prediction by using WHIM descriptors in QSAR studies. Submitochondrial particles (SMP) as toxicity biosensors of chlorophenols. Chemosphere 33: pp. 71-79 96)00153-1" target="_blank" title="It opens in new window">CrossRef 44. Randic, M (1995) Molecular profiles. Novel geometry-dependent molecular descriptors. New J Chem 19: pp. 781-791 45. Fayyad UM, Irani KB (1993) Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th international joint conference on artificial intelligence, pp 1022鈥?027. 93.html#FayyadI93" class="a-plus-plus">http://dblp.uni-trier.de/db/conf/ijcai/ijcai93.html#FayyadI93 46. Newman DJ, Hettich S, Blake CL, Merz CJ (1998) UCI repository of machine learning databases. University of California, Department of Information and Computer Science, Irvine, CA. http://www.ics.uci.edu/~mlearn/MLRepository.html 47. Guyon I, Gunn SR, Ben-Hur A, Dror G (2004) Result analysis of the NIPS 2003 feature selection challenge. In: Advances in neural information processing systems, Vancouver, BC, pp 545鈥?52. http://papers.nips.cc/paper/2728-result-analysis-of-the-nips-2003-feature-selection-challenge 48. Webb AR (2002) Statistical pattern recognition, 2nd edn. Wiley, Chichester 49. Cover, TM (1974) The best two independent measurements are not the two best. IEEE Trans Syst Man Cybern 4: pp. 116-117 9/TSMC.1974.5408535" target="_blank" title="It opens in new window">CrossRef