State media control influences large language models
Summary
This paper provides systematic empirical evidence that state control over media environments shapes the outputs of commercial large language models through a previously underexamined pathway: training data. Using China as a primary case and a 37-country cross-national audit as a generalization test, the authors trace a causal chain from state-coordinated media production, to its inclusion in open-source web corpora, to memorization by commercial LLMs, to measurable pro-regime tilts in model outputs—especially when prompted in languages exclusive to lower-press-freedom countries. The core argument is that LLMs can “launder” state-manipulated content into seemingly neutral responses, constituting an indirect form of institutional influence distinct from direct regulation of model design.
Key Contributions
- First systematic, multi-study empirical demonstration that state media control influences commercial LLM outputs via training data rather than direct regulatory pressure.
- A triangulation design combining corpus overlap analysis, memorization audits, controlled pretraining experiments, cross-language prompt audits, and a 37-country generalization audit.
- Bridges political communication research on authoritarian media systems with NLP work on multilingual training, memorization, and bias.
- Identifies a generalizable mechanism—language-exclusive media environments under institutional control—that extends beyond state actors and beyond China.
- Publicly releases replication materials and an interactive site updating results on newer models.
Methods
The authors run six interlocking studies. Study 1 matches ~189M Chinese CulturaX documents against scripted state media using five-word gram cosine similarity. Study 2 audits memorization in Claude Sonnet/Opus, GPT-3.5, GPT-4, and GPT-4o by prompting with 10-word prefixes of phrases identified (via lasso regression) as distinctively state-coordinated. Study 3 performs LoRA continued pretraining on Llama 2 13b (replicated on Llama 3.1) with scripted, non-scripted state, or random CulturaX corpora, then measures favorability with GPT-4o-as-judge across languages. Studies 4–5 are pre-registered English-vs-Chinese audits using both researcher-designed prompts (human-rated) and real-user prompts from WildChat, Baidu Zhidao, and Zhihu. Study 6 generalizes via 6,051 prompts across 37 language-exclusive countries, correlating outputs with World Press Freedom Index scores.
Findings
- ~1.64% of Chinese CulturaX documents substantially overlap with state-coordinated media; this rises to 3.28–23.98% for politically salient content (e.g., Xi Jinping, CCP, Party Congress).
- Commercial LLMs memorize state-coordinated 20-word phrases at 3–10% rates, equal to or exceeding memorization of common Chinese phrases.
- Just 6,400 additional pretraining examples of scripted news shift Llama 2 toward more pro-CCP outputs ~80% of the time; scripted state news has stronger effects than non-scripted state media or random web text.
- Pretraining effects spill over most to languages sharing tokens with simplified Chinese (traditional Chinese, Japanese, Korean).
- GPT-3.5 responses to Chinese prompts were rated more favorable to China 75.3% of the time; the gap grew with model scale (88.2% for Claude Opus).
- Real-user prompts from WildChat, Baidu, and Zhihu reproduce the cross-language favorability gap.
- Across 37 countries, lower press freedom predicts greater target-language favorability relative to English; high-press-freedom countries show flat or reversed patterns.
Connections
This paper sits alongside other audits of LLM political behavior and bias under the generative-AI register, including work on multilingual and ideological skew such as Tornberg2025-ir and Le-Mens2025-qz, and broader assessments of LLM persuasive capacity like Hackenburg2025-dj. It also extends the author’s earlier work on Chinese information control (Waight2025-al) into the AI domain, and complements concerns about AI-mediated information disorder raised in DeVerna2025-dl and Triedman2025-uy. The geopolitical framing of training-data influence makes it a natural counterpart to studies of state-aligned information operations and propaganda diffusion in the wider topic cluster.
Podcast
A research-radio episode discusses this paper: Listen