IOHunter: Graph foundation model to uncover online information operations

Summary

IOHunter proposes a graph foundation model for detecting drivers of coordinated information operations (IOs) on social media. The authors argue that existing IO-detection pipelines are brittle: they are typically trained per-campaign, depend on scarce labels, and fail to transfer across operations originating from different state actors or platforms. By fusing a language model encoder with graph neural networks over user-interaction graphs, IOHunter learns representations that jointly capture content and relational structure, enabling detection that generalizes across heterogeneous campaigns. Evaluated on six state-sponsored campaigns on X, the model outperforms baselines and remains effective in out-of-distribution and few-shot regimes, positioning foundation-model approaches as a promising direction for IO defense.

Key Contributions

  • Introduces a graph foundation model paradigm for IO detection, extending the foundation-model framing from text and vision to graph-structured social data.
  • Provides a unified architecture that integrates language-model text embeddings with GNN-based structural representations of user activity.
  • Empirically demonstrates cross-campaign generalization across six state-sponsored IO datasets on X.
  • Shows that the approach supports few-shot detection, mitigating the chronic label scarcity problem in IO research.

Methods

  • Builds user-interaction graphs from X data covering six state-sponsored IO campaigns originating from different countries.
  • Encodes node-level textual signals with a language model and combines them with a GNN backbone to learn driver-level representations.
  • Trains and evaluates under three regimes: standard supervised, out-of-distribution (held-out campaigns), and few-shot.
  • Benchmarks against prior IO driver-detection baselines using classification metrics.

Findings

  • IOHunter outperforms baselines across the six campaign datasets in supervised settings.
  • It generalizes to unseen campaigns, suggesting transferable signatures of coordinated behavior across state actors.
  • The joint text + graph representation outperforms either modality on its own.
  • Performance remains strong with limited labels, supporting practical few-shot deployment.

Connections

This work fits within a growing body of machine-learning approaches to coordinated inauthentic behavior detection that move beyond per-campaign classifiers; it relates closely to Luceri2025-tr and Mannocci2025-ig on detection methodology, and to Gerard2025-br and Yang2025-iv on identifying coordinated actors at scale. Its emphasis on cross-campaign generalization and few-shot transfer connects it conceptually to broader methodological efforts in computational social science such as Bouchaud2026-lr and Kansaon2025-id that grapple with measuring coordination across diverse operations.