A Novel Multivariate Mapping Method for Analyzing High-Dimensional Numerical Datasets
详细信息    查看全文
  • 关键词:Feature selection ; Dimensionality reduction ; Density estimation
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9728
  • 期:1
  • 页码:311-319
  • 全文大小:782 KB
  • 参考文献:1.Aldana-Bobadilla, E., Alfaro-Prez, C.: Finding the optimal sample based on shannon entropy and genetic algorithms. In: Sidorov, G., Galicia-Haro, S.N. (eds.) MICAI 2015. LNCS, vol. 9413, pp. 353–363. Springer, Heidelberg (2015)CrossRef
    2.Cox, K.A., Dante, H.M., Maher, R.J.: Product appearance inspection methods and apparatus employing low variance filter, 17 August 1993. US Patent 5,237,621
    3.Doane, D.P.: Aesthetic frequency classifications. Am. Stat. 30(4), 181–183 (1976)MathSciNet
    4.Gowda, K.C., Krishna, G.: The condensed nearest neighbor rule using the concept of mutual nearest neighborhood. IEEE Trans. Inf. Theor. 25(4), 488–490 (1979)CrossRef
    5.Hyndman, R.J.: The problem with sturges rule for constructing histograms. Monash University (1995)
    6.Hyndman, R.J., Fan, Y.: Sample quantiles in statistical packages. Am. Stat. 50(4), 361–365 (1996)
    7.Kalegele, K., Takahashi, H., Sveholm, J., Sasai, K., Kitagata, G., Kinoshita, T.: On-demand data numerosity reduction for learning artifacts. In: 2012 IEEE 26th International Conference on Advanced Information Networking and Applications (AINA), pp. 152–159. IEEE (2012)
    8.Lane, D.M.: Online statistics education: an interactive multimedia course of study (2015). http://​onlinestatbook.​com/​2/​graphing_​distributions/​histograms.​html . Accessed 03 Dec 2015
    9.Liu, H., Motoda, H.: Instance Selection and Construction for Data Mining, vol. 608. Springer, Heidelberg (2013)
    10.Reeves, C.R., Bush, D.R.: Using genetic algorithms for training data selection in RBF networks. In: Liu, H., Motoda, H. (eds.) Instance Selection and Construction for Data Mining, vol. 608, pp. 339–356. Springer, Heidelberg (2001)CrossRef
    11.Shlens, J.: A tutorial on principal component analysis (2014). arXiv preprint arXiv:​1404.​1100
    12.Skalak, D.B.: Prototype and feature selection by sampling and random mutation hill climbing algorithms. In: Proceedings of the Eleventh International Conference on Machine Learning, pp. 293–301 (1994)
    13.Randall Wilson, D., Martinez, T.R.: Reduction techniques for instance-based learning algorithms. Mach. Learn. 38(3), 257–286 (2000)CrossRef MATH
    14.Lei, Y., Liu, H.: Feature selection for high-dimensional data: a fast correlation-based filter solution. ICML 3, 856–863 (2003)
  • 作者单位:Edwin Aldana-Bobadilla (14)
    Alejandro Molina-Villegas (15)

    14. CINVESTAV-Unidad Tamaulipas, Ciudad Victoria, Mexico
    15. The National Commission for Knowledge and Use of Biodiversity, Mexico City, Mexico
  • 丛书名:Advances in Data Mining. Applications and Theoretical Aspects
  • ISBN:978-3-319-41561-1
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9728
文摘
In modern science, dealing with high dimensional datasets is a very common task due to the increasing availability of data. Multivariate data analysis represents challenges in both theoretical and empirical levels. Until now, several methods for dimensionality reduction like Principal Component Analysis, Low Variance Filter and High Correlated Columns has been proposed. However, sometimes the reduction achieved by existing methods is not accurate enough to analyze datasets where, for practical reasons, more reduction of the original dataset is required. In this paper, we propose a new method to transform high dimensional dataset into a one-dimensional. We show that such transformation preserves the properties of the original dataset and thus, it can be suitable for many applications where a high reduction is required.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700