https://doi.org/10.1140/epjds/s13688-026-00649-y
Research
Mapping violence perceptions through YouTube comments: a new approach to real-time violence monitoring
1
University of Sydney, Sydney, Australia
2
Institute for Economics & Peace, Sydney, Australia
3
Monash University, Melbourne, Australia
a
This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
12
November
2025
Accepted:
17
March
2026
Published online:
27
March
2026
Abstract
This paper introduces the Violence Perception Index (VPI), a novel methodology for quantifying violence-related discourse through geolocated YouTube comments. Utilizing the YouTube API and natural language processing techniques, the VPI measures public references to violence across 1.2 million unique geolocated videos in Mexico (2020–2024), extracting 14.8 million comments from over 500,000 videos with user engagement. This approach provides spatiotemporally granular data on violence-related discourse, which we treat as a proxy for violence perceptions, extending beyond traditional event-based datasets by capturing not only documented violence but also rumors, fears, and community discourse about violence, dimensions that influence community behavior and social stability independently of official records. Violence scores are constructed using a weighted Spanish-language dictionary developed through semantic network expansion from violence-related seed terms. The dictionary-based scoring approach demonstrates moderate-to-substantial agreement with large language model classifications across 700 stratified comments (75-81% agreement), validating the method’s capacity to systematically identify violence-related discourse at scale while maintaining computational efficiency for processing millions of comments. The VPI is benchmarked against established violence indicators including ACLED fatalities and official municipal homicide statistics through panel regression specifications incorporating comprehensive spatial and temporal fixed effects. Analysis reveals systematic geographic heterogeneity: the VPI correlates strongly with ACLED data in high-population areas but exhibits stronger correlation with official homicide records in low-population contexts. Rather than constituting a methodological limitation, this pattern demonstrates the VPI’s enhanced sensitivity in marginalized and remote regions where news-based datasets suffer from systematic reporting bias. The methodology is immediately scalable across languages and geographies, providing complementary intelligence for conflict monitoring, early warning systems, and policy interventions in precisely those underrepresented areas where traditional event-based monitoring systems provide incomplete coverage.
Key words: Violence perception / Social media analysis / Conflict monitoring / Geospatial data / Mexico / Social sensing / Computational social science
Supplementary Information The online version contains supplementary material available at https://doi.org/10.1140/epjds/s13688-026-00649-y.
Handling Editor: Johannes Wachs
© The Author(s) 2026
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

