论文标题
具有不完善的高维空间数据检测的联合物种分布模型
Joint species distribution models with imperfect detection for high-dimensional spatial data
论文作者
论文摘要
确定物种和社区的空间分布是生态和保护的关键目标。联合物种分布模型使用多物种检测非探测数据来估计物种和社区分布。该数据的分析因物种,不完善的检测和空间自相关之间的残留相关性而变得复杂。尽管存在适应这些复杂性中每一个的方法,但文献中很少有同时解决和探索所有三个复杂性的例子。在这里,我们开发了一个空间因子多物种占用模型,以明确说明物种相关性,不完善的检测和空间自相关。提出的模型使用空间因子维度减小方法和最近的邻居高斯过程,以确保具有大量物种(例如> 100)和空间位置(例如100,000)的数据集的计算效率。我们将提出的模型性能与五个候选模型进行了比较,每个模型都涉及三个复杂性的子集。我们在Spocputancy软件中实现了拟议的和竞争的模型,旨在通过可访问,有据可查的开源R软件包来促进应用程序。使用仿真,我们发现当当前导致劣等模型预测性能时,忽略了三个复杂性。使用对美国大陆上98种鸟类物种的案例研究,空间因子多物种占用模型在候选模型中具有最高的预测性能。我们提出的框架及其在Spocputancy中的实现,是一种用户友好的工具,可以了解物种分布和生物多样性指标的空间变化,同时解决多物种检测 - 检测非探测数据中的共同复杂性。
Determining spatial distributions of species and communities are key objectives of ecology and conservation. Joint species distribution models use multi-species detection-nondetection data to estimate species and community distributions. The analysis of such data is complicated by residual correlations between species, imperfect detection, and spatial autocorrelation. While methods exist to accommodate each of these complexities, there are few examples in the literature that address and explore all three complexities simultaneously. Here we developed a spatial factor multi-species occupancy model to explicitly account for species correlations, imperfect detection, and spatial autocorrelation. The proposed model uses a spatial factor dimension reduction approach and Nearest Neighbor Gaussian Processes to ensure computational efficiency for data sets with both a large number of species (e.g., > 100) and spatial locations (e.g., 100,000). We compare the proposed model performance to five candidate models, each addressing a subset of the three complexities. We implemented the proposed and competing models in the spOccupancy software, designed to facilitate application via an accessible, well-documented, and open-source R package. Using simulations, we found ignoring the three complexities when present leads to inferior model predictive performance. Using a case study on 98 bird species across the continental US, the spatial factor multi-species occupancy model had the highest predictive performance among the candidate models. Our proposed framework, together with its implementation in spOccupancy, serves as a user-friendly tool to understand spatial variation in species distributions and biodiversity metrics while addressing common complexities in multi-species detection-nondetection data.