A Novel Algorithm for Efficient Learning of Bayesian Networks from High-Dimensional Data and Prior Knowledge
详细信息   
  • 作者:Su ; Chengwei
  • 学历:Doctor
  • 年:2015
  • 关键词:Social sciences ; Biological sciences ; Applied scienc
  • 导师:Borsuk,Mark E.
  • 毕业院校:Dartmouth College
  • Department:Engineering
  • 专业:Genetics;Demography;Computer science
  • ISBN:9781321607550
  • CBH:3685145
  • Country:USA
  • 语种:English
  • FileSize:5186173
  • Pages:112
文摘
The primary objective of the research is to develop and validate the approach and algorithms for efficiently learning a complex web of relations from a combination of prior knowledge,published literature,and high-dimensional data. Algorithms for inferring the structure of Bayesian networks (BNs) from data have become an increasingly popular method for uncovering the direct and indirect influences among variables in complex systems. A Bayesian model averaging method,Markov Chain Monte Carlo (MCMC),is typically applied for BN structural learning from data. However,existing state-of-the-art MCMC-based learning algorithms are rather slow in mixing and convergence in high-dimensional domains. To address these challenges,we first developed and tested intelligent strategies for prioritizing the structural search space using prior information. Second,we present a novel Markov blanket resampling (MBR) scheme that intermittently reconstructs the entire Markov blanket of nodes,thus allowing the sampler to more effectively traverse low-probability regions between local maxima. Experiments across a range of network sizes show that the MBR scheme outperforms other state-of-the-art algorithms,both in terms of learning performance and convergence rate. In particular,MBR achieves better learning performance when the number of observations is relatively small and faster convergence when the number of variables in the network is large. It is anticipated that our methodology will be especially useful for deciphering how genes and the environment interact to determine cancer risk by allowing BNs to be extended to a genome-wide scale.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700