https://doi.org/10.1140/epjds/s13688-025-00583-5
Research
Fusing content and social relationships: a multi-modal heterogeneous graph transformer approach for social bot detection
Department of Management Science and Engineering, Zhejiang Sci-tech University, Hangzhou, China
a
This email address is being protected from spambots. You need JavaScript enabled to view it.
Received:
18
February
2025
Accepted:
20
August
2025
Published online:
2
September
2025
Social bots pose a significant threat to online platforms, demanding robust methods to detect their increasingly complex behaviors. This paper introduces MM-HGT-Bot, a multi-modal framework that advances the field by operationalizing social network theory in a new way. Our core contribution is the deconstruction of social ties into two distinct, theoretically-grounded dimensions: information source selection (the following network) and potential influence (the follower network). Our architecture employs a Heterogeneous Graph Transformer (HGT) to learn the unique patterns emerging from these different relationship types. It then synergistically fuses these relational insights with context-aware representations of user-generated content. Extensive experiments on the widely-used Cresci-15 and Twibot-20 datasets demonstrate that our approach consistently outperforms state-of-the-art baselines. These findings highlight that a more fine-grained and theoretically-informed modeling of social relationships is crucial for building effective and robust bot detection systems.
Key words: Social bot detection / Heterogeneous graph transformers / Multi-modal learning / Social network analysis / Relational learning / Content analysis / Attention mechanism
© The Author(s) 2025
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

