When Life Paths Cross: Extracting Human Interactions in Time and Space from Wikipedia

Preprint

ShanghaiTech University

Abstract

Interactions among notable individuals—whether examined individually, in groups, or as networks—often convey significant messages across cultural, economic, political, scientific, and historical perspectives. By analyzing the times and locations of these interactions, we can observe how dynamics unfold across regions over time.

To bridge the gap resulting from the scarcity of such interaction data, in this work, we mine millions of biography pages from Wikipedia through the proposed model FALCON, extracting 685,966 interaction records in the form of (Person1, Person2, Time, Location) interaction quadruplets.

We further conduct an empirical analysis of intra- and inter-party interactions among political figures to examine political polarization in the US, showcasing the potential of the extracted data from a perspective that may not be possible without this data.

Our code, the extracted interaction data, and the WikiInteraction benchmark of 4,507 labeled interaction quadruplets are publicly available.

Architecture of FALCON

To accurately associate individuals with their specific spatio-temporal interactions, especially when the information is scattered throughout a text, FALCON exerts a framework combining multitask and transfer learning. As shown in the figure, FALCON uses a modified AR-BERT model to extract features for two related tasks: a main interaction classification task and an auxiliary trajectory classification task. Meanwhile, it incorporates features from a separate, frozen AR-BERT model (pre-trained on trajectory data) through a feature fusion module.

Interactions of US Political Figures 🇺🇸

A New Lens on Political Polarization

To demonstrate our data's potential, we analyzed 14,084 interactions between 3,896 Republican and 3,995 Democratic figures from 1960 to 2024. This offers a new perspective on political polarization by focusing on direct, real-world interactions rather than polls or social media data.

The following bar chart reveals that the nature of inter-party interactions has dramatically shifted over time. Adversarial interactions have steadily increased to dominate by the 2020s, while Cooperative and Neutral interactions have sharply declined, indicating growing polarization.


Political Interaction Network

We then map all interactions from 1960-2024, with red nodes for Republicans and blue for Democrats; edges are weighted by interaction type. The two partisan groups are visually separated, with few cross-party connections, clearly illustrates the structure of political polarization.

Analysis of Polarization Trends

Statistics in (a) shows a clear upward trend in national polarization, with accelerated growth during the Obama and Trump presidencies. While in (b), the state-level data reveals diverse patterns, such as a steep rise in Washington D.C. and a recent decline in California.

BibTeX

@article{liu2025life,
      title={When Life Paths Cross: Extracting Human Interactions in Time and Space from Wikipedia},
      author={Liu, Zhongyang and Zhang, Ying and Xiao, Xiangyi and Liu, Wenting and Zha, Yuanting and Zhang, Haipeng},
      journal={arXiv preprint arXiv:2510.00019},
      year={2025}
}