IMMAN: free software for information theory-based chemometric analysis
详细信息    查看全文
  • 作者:Ricardo W. Pino Urias (1) (2)
    Stephen J. Barigye (3)
    Yovani Marrero-Ponce (1) (4) (5)
    C茅sar R. Garc铆a-Jacas (1) (6)
    Jos茅 R. Valdes-Martin铆 (2)
    Facundo Perez-Gimenez (4)

    1. Unit of Computer-Aided Molecular 鈥淏iosilico鈥?Discovery and Bioinformatic Research (CAMD-BIR International)
    ; Cartagena de Indias ; Bol铆var ; Colombia
    2. Faculty of Mathematics Physics and Computation
    ; Universidad Central 鈥淢arta Abreu鈥?de Las Villas ; Santa Clara ; 54830 ; Villa Clara ; Cuba
    3. Departamento de Qu铆mica
    ; Universidade Federal de Lavras ; UFLA ; Caixa Postal 3037 ; 37200-000 ; Lavras ; MG ; Brazil
    4. Facultad de Farmacia
    ; Universitat de Val猫ncia ; Burjasot ; 46100 ; Val猫ncia ; Spain
    5. Grupo de Investigaci贸n en Estudios Qu铆micos y Biol贸gicos
    ; Facultad de Ciencias B谩sicas ; Universidad Tecnol贸gica de Bol铆var ; Cartagena de Indias ; Bol铆var ; Colombia
    6. Grupo de Investigaci贸n de Bioinform谩tica
    ; Centro de Estudio de Matem谩tica Computacional (CEMC) ; Universidad de las Ciencias Inform谩ticas ; La Habana ; Cuba
  • 关键词:Computational program ; Chemometric analysis ; IMMAN ; Information ; theoretic function ; Feature selection ; Classification
  • 刊名:Molecular Diversity
  • 出版年:2015
  • 出版时间:May 2015
  • 年:2015
  • 卷:19
  • 期:2
  • 页码:305-319
  • 全文大小:1,674 KB
  • 参考文献:1. Todeschini, R, Consonni, V (2009) Molecular descriptors for chemoinformatics. Wiley-VCH, Weinheim CrossRef
    2. Todeschini R, Consonni V, Pavan M (2002) DRAGON Software version 2.1. Milano Chemometric and QSAR Research Group. Milano
    3. Guha R (1991) The CDK descriptor calculator, 0.94th edn. Indiana
    4. Yap, CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32: pp. 1466-1474 CrossRef
    5. Georg, H (2008) BlueDesc-molecular descriptor calculator. University of T眉bingen, T眉bingen
    6. Liu, J, Feng, J, Brooks, A, Young, S (2005) PowerMV. National Institute of Statistical Sciences, Research Triangle Park
    7. ADRIANA. Code (2011) Molecular Networks. Erlangen, Germany
    8. Hong, H, Xie, Q, Ge, W, Qian, F, Fang, H, Shi, L, Su, Z, Perkins, R, Tong, W (2008) Mold2, molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics. J Chem Inf Comput Sci 48: pp. 1337-1344 CrossRef
    9. Kellogg GE (2001) Molconn-Z 4.0 edn. eduSoft, Virginia
    10. Liu H, Motoda H (2008) Less is More. In: Liu H, Motoda H (eds) Computational methods of feature selection. Data mining and knowledge discovery series. Taylor * Francis Group, Boca Raton, p 411
    11. Wolpert, DH, Macready, WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1: pp. 67-82 CrossRef
    12. Venkatraman, V, Dalby, AR, Yang, ZR (2004) Evaluation of mutual information and genetic programming for feature selection in QSAR. J Chem Inf Comput Sci 44: pp. 1686-1692 CrossRef
    13. Yu L, Liu H (2003) Feature selection for high-dimensional data: a fast correlation-based filter solution. In: Proceedings of the Twentieth international conference on machine learning, Washington DC
    14. Kira, K, Rendell, L (1992) The feature selection problem: traditional methods and a new algorithm. Association for the advancement of artificial intelligence. AAAI Press and MIT Press, Cambridge, pp. 129-134
    15. Kullback, S, Leibler, RA (1951) On information and sufficiency. Ann Math Stat 22: pp. 79-86 CrossRef
    16. Jeffreys, H (1946) An invariant form for the prior probability in estimation problems. Proc Roy Soc A 186: pp. 453-461 CrossRef
    17. Jennifer GD (2008) Unsupervised Feature Selection. In: Liu H, Motoda H (eds) Computational methods of feature selection. Data mining and knowledge discovery series. Taylor & Francis Group, Boca Raton, p 411
    18. Varshavsky, R, Gottlieb, A, Linial, M, Horn, D (2006) Novel unsupervised feature filtering of biological data. Bioinformatics 22: pp. e507-e513 CrossRef
    19. Maldonado, AG, Doucet, JP, Petitjean, M, Fan, B-T (2006) Molecular similarity and diversity in chemoinformatics: from theory to applications. Mol Divers 10: pp. 39-79 CrossRef
    20. Godden, JW, Stahura, FL (2000) Variability of molecular descriptors in compound databases revealed by Shannon entropy calculations. J Chem Inf Comput Sci 40: pp. 796-800 CrossRef
    21. Godden, JW, Bajorath, J (2002) Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis. J Chem Inf Comput Sci 42: pp. 87-93 CrossRef
    22. Barigye, SJ, Marrero-Ponce, Y, P茅rez-Gim茅nez, F, Bonchev, D (2014) Trends in information theory-based chemical structure codification. Mol Divers 18: pp. 673-686 CrossRef
    23. Witten IH, Eibe F, Hall MA (2011) Data mining: practical machine learning tools and techniques. The Morgan Kaufmann series in data management systems, 3rd edn. Morgan Kaufmann, Burlington
    24. Alter, O, Brown, PO, Botstein, D (2000) Singular value decomposition for genome-wide expression data processing and modeling. Proc Natl Acad Sci USA 97: pp. 10101-10106 CrossRef
    25. Devakumari D, Thangavel K (2010) Unsupervised adaptive floating search feature selection based on contribution entropy. In: 2010 international conference on communication and computational intelligence (INCOCCI), pp 623鈥?27
    26. Dash M, Choi K, Scheuermann P, Huan L (2002) Feature selection for clustering鈥攁 filter solution. In: Proceedings of the 2002 IEEE international conference on data mining (ICDM 2003), pp 115鈥?22. doi:10.1109/icdm.2002.1183893
    27. Stahura, FL, Godden, JW, Bajorath, J (2002) Differential Shannon entropy analysis identifies molecular property descriptors that predict aqueous solubility of synthetic compounds with high accuracy in binary QSAR calculations. J Chem Inf Comput Sci 42: pp. 550-558 CrossRef
    28. Wassermann, AM, Nisius, B, Vogt, M, Bajorath, J (2010) Identification of descriptors capturing compound class-specific features by mutual information analysis. J Chem Inf Model 50: pp. 1935-1940 CrossRef
    29. Cover, TM, Thomas, JA (1991) Elements of Information theory. Wiley, New York CrossRef
    30. Desurvire, E (2009) Classical and quantum information theory. Cambridge University Press, New York CrossRef
    31. Quinlan JR (1983) Learning efficient classification procedures and their application to chess end games. In: Michalski R, Carbonell J, Mitchell T (eds) Machine learning. Symbolic computation. Springer, Berlin, pp 463鈥?82. doi:10.1007/978-3-662-12405-5_15
    32. Press, WH, Flannery, BP, Teukolsky, SA, Vetterling, WT (1988) Numerical recipes in C: the art of scientific computing. Cambridge University Press, New York
    33. Consonni, V, Todeschini, R, Pavan, M, Gramatica, P (2002) Structure/response correlations and similarity/diversity analysis by GETAWAY descriptors. Part 2. Application of the novel 3D molecular descriptors to QSAR/QSPR studies. J Chem Inf Comput Sci 42: pp. 693-705 CrossRef
    34. P茅rez Gonz谩lez, M, Ter谩n, C, Teijeira, M, Gonz谩lez-Moa, MJ (2005) GETAWAY descriptors to predicting A2A adenosine receptors agonists. Eur J Med Chem 40: pp. 1080-1086 CrossRef
    35. Saiz-Urra, L, P茅rez Gonz谩lez, M (2007) Quantitative structure-activity relationship studies of HIV-1 integrase inhibition.1. GETAWAY descriptors. Eur J Med Chem 42: pp. 64-70 CrossRef
    36. Fedorowicz, A, Singh, H, Soderholm, S, Demchuk, E (2005) Structure鈥揳ctivity models for contact sensitization. Chem Res Toxicol 18: pp. 954-969 CrossRef
    37. Saiz-Urra, L, P茅rez Gonz谩lez, M (2006) QSAR studies about cytotoxicity of benzophenazines with dual inhibition toward both topoisomerases I and II: 3D-MoRSE descriptors and statistical considerations about variable selection. Bioorg Med Chem 14: pp. 7347-7358 CrossRef
    38. Gasteiger, J, Sadowski, J, Schuur, J, Selzer, P, Steinhauer, L, Steinhauer, V (1996) Chemical information in 3Dspace. J Chem Inf Comput Sci 36: pp. 1030-1037 CrossRef
    39. Gasteiger, J, Schuur, J, Selzer, P, Steinhauer, L, Steinhauer, V (1997) Finding the 3D structure of a molecule in its IR spectrum. Fresen J Anal Chem 359: pp. 50-55 CrossRef
    40. Schuur, J, Selzer, P, Gasteiger, J (1996) The coding of the three-dimensional structure of molecules by molecular transforms and its application to structure-spectra correlations and studies of biological activity. J Chem Inf Comput Sci 36: pp. 334-344 CrossRef
    41. Baumann, K (1999) Uniform-length molecular descriptors for quantitative structure-property relationships (QSPR) and quantitative structure-activity relationships (QSAR): classification studies and similarity searching. TRAC 18: pp. 36-46
    42. Jelcic, Z (2004) Solvent molecular descriptors on poly(D, L-lactide-co-glycolide) particle size in emulsification-diffusion process. Coll Surf A Physico-Chem Eng Asp 242: pp. 159-166 CrossRef
    43. Todeschini, R, Bettiol, C, Giurin, G, Gramatica, P, Miana, P, Argese, E (1996) Modeling and prediction by using WHIM descriptors in QSAR studies. Submitochondrial particles (SMP) as toxicity biosensors of chlorophenols. Chemosphere 33: pp. 71-79 CrossRef
    44. Randic, M (1995) Molecular profiles. Novel geometry-dependent molecular descriptors. New J Chem 19: pp. 781-791
    45. Fayyad UM, Irani KB (1993) Multi-interval discretization of continuous-valued attributes for classification learning. In: Proceedings of the 13th international joint conference on artificial intelligence, pp 1022鈥?027. http://dblp.uni-trier.de/db/conf/ijcai/ijcai93.html#FayyadI93
    46. Newman DJ, Hettich S, Blake CL, Merz CJ (1998) UCI repository of machine learning databases. University of California, Department of Information and Computer Science, Irvine, CA. http://www.ics.uci.edu/~mlearn/MLRepository.html
    47. Guyon I, Gunn SR, Ben-Hur A, Dror G (2004) Result analysis of the NIPS 2003 feature selection challenge. In: Advances in neural information processing systems, Vancouver, BC, pp 545鈥?52. http://papers.nips.cc/paper/2728-result-analysis-of-the-nips-2003-feature-selection-challenge
    48. Webb AR (2002) Statistical pattern recognition, 2nd edn. Wiley, Chichester
    49. Cover, TM (1974) The best two independent measurements are not the two best. IEEE Trans Syst Man Cybern 4: pp. 116-117 CrossRef
  • 刊物类别:Chemistry and Materials Science
  • 刊物主题:Chemistry
    Analytical Chemistry
    Polymer Sciences
    Organic Chemistry
    Pharmacy
  • 出版者:Springer Netherlands
  • ISSN:1573-501X
文摘
NGLC 2004-2010.National Geological Library of China All Rights Reserved.
Add:29 Xueyuan Rd,Haidian District,Beijing,PRC. Mail Add: 8324 mailbox 100083
For exchange or info please contact us via email.