Causal discovery in the geosciences—Using synthetic data to learn how to interpret results

详细信息查看全文

作者：Imme Ebert-Uphoff^a ; ^{iebert@engr.colostate.edu} ; Yi Deng^b ; ^{yi.deng@eas.gatech.edu}
关键词：Probabilistic graphical model ; Structure learning ; Causal discovery ; Information flow ; Geoscience ; Climate ; Atmospheric science
刊名：Computers & Geosciences
出版年：2017
出版时间：February 2017
年：2017
卷：99
期：Complete
页码：50-60
全文大小：2402 K
卷排序：99

文摘

Causal discovery algorithms based on probabilistic graphical models have recently emerged in geoscience applications for the identification and visualization of dynamical processes. The key idea is to learn the structure of a graphical model from observed spatio-temporal data, thus finding pathways of interactions in the observed physical system. Studying those pathways allows geoscientists to learn subtle details about the underlying dynamical mechanisms governing our planet. Initial studies using this approach on real-world atmospheric data have shown great potential for scientific discovery. However, in these initial studies no ground truth was available, so that the resulting graphs have been evaluated only by whether a domain expert thinks they seemed physically plausible. The lack of ground truth is a typical problem when using causal discovery in the geosciences. Furthermore, while most of the connections found by this method match domain knowledge, we encountered one type of connection for which no explanation was found. To address both of these issues we developed a simulation framework that generates synthetic data of typical atmospheric processes (advection and diffusion). Applying the causal discovery algorithm to the synthetic data allowed us (1) to develop a better understanding of how these physical processes appear in the resulting connectivity graphs, and thus how to better interpret such connectivity graphs when obtained from real-world data; (2) to solve the mystery of the previously unexplained connections.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700