Comparison of classification-then-modelling and species-by-species modelling for predicting lake phytoplankton assemblages

详细信息	查看全文 \| 推荐本文 \|

作者：Simon Hallstan ; ^{simon.hallstan@slu.se} ; Richard K. Johnson ; Eva Willé ; n ; Ulf Grandin
关键词：Classification-then-modelling ; Environmental assessment ; Reference condition ; RIVPACS ; Species distribution model ; Species-by-species modelling
刊名：Ecological Modelling
出版年：2012
期刊代码：49_03043800
类别：cp
出版时间：24 April, 2012
卷：231
期：Complete
页码：11-19
文件大小：683 K

摘要

Species distribution models are used for a wide range of ecological applications, such as assessment of ecological status. For many such assessments, predictions of entire communities are preferred. When entire community compositions are modelled, two options are available: (1) to model all of the communities鈥?species individually and (2) to incorporate community information into the models. Here, we compared the accuracy of these two modelling approaches for predicting boreal lake phytoplankton assemblages and their ability to detect human impact. The modelling approaches tested were specifically classification-then-modelling (here a RIVPACS-type model, using random forest to predict biological group membership) and species-by-species modelling, using a random forest model for each species.

The species-by-species models performed better than the RIVPACS model according to the dissimilarity measure BC, the area under curve (AUC) and proportion of true positives. In contrast, the taxonomic completeness index (O/E), commonly used for freshwater assessments, indicated that the RIVPACS model performed better. However, we believe that O/E overestimates model performance, due to the index omitting false negative errors (i.e. errors where species are wrongly predicting as absent).

No support was found for our hypothesis that rare species would be better modelled by the RIVPACS model. Indeed, the RIVPACS model predicted common species significantly better than the species-by-species models, whilst the species-by-species models predicted rare species better than the RIVPACS model.

Both modelling methods were able to separate impaired sites (acidified and eutrophic) from reference sites.

We suggest that classification-then-modelling is evaluated using data-set containing more possible biological interactions, e.g. phytoplankton, zooplankton and fish. We also suggest that AUC is used as a complement to taxonomic completeness when evaluating models for reference condition taxa composition.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700