论文标题
通过反向模型推断文化景观
Inferring Cultural Landscapes with the Inverse Ising Model
论文作者
论文摘要
可能的人类文化的空间是巨大的,但是某些文化构型与认知和社会限制更一致。这导致了我们物种在文化进化中探索的可能性的``景观''。但是,这种健身格局构成和指导文化进化的状况如何?可以回答这些问题的机器学习算法通常是针对大型数据集开发的。在历史记录中发现的稀疏,不一致和不完整的数据的应用受到了较少的关注,标准建议可能导致对边缘化,研究不足或少数族裔文化的偏见。我们展示了如何将最小概率流算法和逆伊斯汀模型(一种由物理启发的机器学习手术试验)适应挑战。一系列的自然扩展(包括对缺失数据的动态估计以及与正则化的交叉验证)可靠地重建基本约束。我们展示了我们的宗教史数据库的策划子集的方法:从人类历史上407个宗教团体的记录,从青铜时代到今天。这揭示了一个复杂,坚固的景观,既有尖锐的,明确的山峰,又有国家认可的宗教倾向于集中精力,并且可以找到福音派宗教,非国家的精神实践和神秘宗教的文化洪泛区。
The space of possible human cultures is vast, but some cultural configurations are more consistent with cognitive and social constraints than others. This leads to a ``landscape'' of possibilities that our species has explored over millennia of cultural evolution. But what does this fitness landscape, which constrains and guides cultural evolution, look like? The machine-learning algorithms that can answer these questions are typically developed for large-scale datasets. Applications to the sparse, inconsistent, and incomplete data found in the historical record have received less attention, and standard recommendations can lead to bias against marginalized, under-studied, or minority cultures. We show how to adapt the Minimum Probability Flow algorithm and the Inverse Ising model, a physics-inspired workhorse of machine learning, to the challenge. A series of natural extensions -- including dynamical estimation of missing data, and cross-validation with regularization -- enables reliable reconstruction of the underlying constraints. We demonstrate our methods on a curated subset of the Database of Religious History: records from 407 religious groups throughout human history, ranging from the Bronze Age to the present day. This reveals a complex, rugged, landscape, with both sharp, well-defined peaks where state-endorsed religions tend to concentrate, and diffuse cultural floodplains where evangelical religions, non-state spiritual practices, and mystery religions can be found.