Evidence pack: Magnifica Humanitas (Pope Leo XIV, 25 May 2026)
Target: Magnifica Humanitas, the first encyclical of Pope Leo XIV's pontificate, signed 15 May 2026, released by the Holy See on Memorial Day, 25 May 2026. Subtitle: On Safeguarding the Human Person in the Time of Artificial Intelligence. 245 numbered paragraphs, ~38,300 English words.
Operator: Joshua Miller (anotherpanacea@gmail.com). Maintainer, SETEC stylometric framework (github.com/anotherpanacea-eng/setec-voiceprint, v1.90.2).
Date run: 26-27 May 2026.
Models used: Claude Opus 4.7 (for the operator-mediated mirror prediction passes, Anthropic consumer subscription). SETEC framework scripts run locally against an installed Python environment with spaCy 3.8 (en_core_web_sm, en_core_web_md).
Prior art: Linch Zhang, "Claude, Author of the Humanitas," The Linchpin (Substack), 26 May 2026. Zhang's piece consilience-argues that Magnifica Humanitas was substantially Claude-drafted, anchored on em-dash counts (127), "genuinely" frequency (9), tricolon density, and Pangram section-by-section flagging. This evidence pack treats Zhang's piece as the framing prior, replicates the load-bearing measurements under SETEC's vocabulary, runs analyses Zhang did not (variance audit, AIC pattern audit, POS-bigram diff, operator-mediated mirror prediction with a Francis-baseline control), and documents what each measurement can and cannot license.
Institutional context: On the day Magnifica Humanitas was released (25 May 2026), Chris Olah, co-founder of Anthropic, delivered remarks at the Vatican press conference at Pope Leo XIV's personal invitation. The Pope thanked Olah by name. Anthropic published a corresponding write-up. The institutional relationship between Anthropic and the Holy See, as of the encyclical's release date, is therefore on the public record. This pack treats the stylometric findings as separable from the institutional disclosure: the Vatican-Anthropic dialogue is public; what this pack measures is whether Claude's model surfaces appear in the published English prose at rates and patterns consistent with substantive drafting assistance. Both findings are compatible.
License: This pack is shared so anyone can audit, replicate, or contest it. No claim of dispositive verdict is made about which paragraphs were Claude-drafted or which Vatican drafters used AI assistance. The pack tells the reader where the artifact sits in measurement space, with which caveats, under which methodology.
What this pack concludes (and what it refuses to conclude)
The glass-box stylometric methodology applied to Magnifica Humanitas produces a coherent signal consistent with Claude-assisted drafting in significant portions of the English text. The signal is consilient across three independent measurement layers (lexical, syntactic, predictive), corroborates Zhang's published findings on every metric SETEC can replicate, and extends them with three findings Zhang's analysis did not surface: a correctio rate elevated 2-3x baseline, pronoun bigram suppression relative to the Francis corpus, and chapter-level heterogeneity consistent with differential drafter participation. The cross-family attribution (Claude specifically rather than another frontier model) is supported by both an exploratory v0.1 Codex run and a clean v0.3.1 cross-family rerun under neutral filenames and register-exercise framing. The Claude-specific attribution is now the leading model-family hypothesis on the selected windows; broader validation (multi-shot sampling, randomly sampled windows, parallel Gemini and standard-GPT-5 runs, a post-cutoff baseline) remains queued.
The strongest single signal is the operator-mediated mirror discrimination result. When Claude is handed the prefix of an encyclical and asked to write the next 150 words blind, its predictions match the actual Magnifica Humanitas continuation at a K=4 mean TFIDF cosine of 0.573 and POS-bigram cosine of 0.816. Run against Dilexit Nos (Pope Francis, October 2024) under identical methodology, the same predictions match at 0.432 TFIDF cosine and 0.719 POS-bigram cosine. The Magnifica gap is +0.14 absolute / 33% relative on TFIDF, +0.10 absolute / 14% relative on POS-bigrams. Beat-by-beat qualitative inspection of the Babel passage (paragraph 7) and the "Babel syndrome" passage (paragraph 10), both of which Zhang screenshotted as Pangram-flagged, shows Claude independently reproducing the same triplet structure, the same biblical citation format, and (in the case of paragraph 10) the exact coined phrase "Babel syndrome."
The pack does not say the encyclical was fully written by AI. The Layer A distributional signals (burstiness in range, lexical diversity high, sentence-length variance high) do not show the standard smoothing fingerprint of unedited RLHF-aligned LLM output. The pack reads the text as plausibly co-mediated: portions of the prose are consistent with Claude-assisted drafting followed by human revision; other portions appear humanly authored without measurable LLM trace. Chapter-level variance is consistent with this reading. The pack does not claim to identify which paragraphs, sections, or chapters carry which provenance, nor does it claim to identify who drafted, revised, reviewed, or approved any passage. The right next moves are institutional (the Holy See's drafting process, Curia disclosure practice, future Vatican policy on AI use in magisterial documents) rather than computational.
The pack does not establish Claude specifically rather than another frontier model in any final sense. The v0.3.1 cross-family Codex/GPT comparison run on 27 May 2026 is the first cross-family measurement here, and on the four selected windows Claude predicts the published text substantially better than the Codex/GPT-family run does. A parallel Gemini mirror and a manual standard-GPT-5 run would sharpen the attribution further; both are queued.
Aggregate evidence table
| Test | Result | Direction | Licensure |
|---|---|---|---|
| Em-dash count (English) | 124 in Magnifica vs 0 in 4 Francis-only encyclicals; 29 in Lumen Fidei (Benedict-Francis 2013) | LLM | Strong; replicates Zhang at n-1 |
| Em-dash count (Italian) | 0 in Magnifica IT vs 94 en-dashes (Italian convention) | Translation rule-out | Em-dash anomaly is an English-text feature, not Italian-source bleed |
| "Genuine(ly)" rate | 0.575/1k words in Magnifica vs 0.054-0.423/1k in 5 baselines | LLM (Claude-specific) | Strong; replicates Zhang. Claude house-style fingerprint |
| SETEC AIC-7 correctio | 2.459/1k (95 instances) vs baselines 0.735-1.456/1k | LLM | Largest AIC signal. New beyond Zhang. 2-3x baseline rate |
| Comma-list tricolon (regex) | 5.09/1k vs baselines 1.62-4.20/1k | LLM | Confirms Zhang's tricolon finding; strict-parallelism triplets are NOT elevated |
| Layer A burstiness | -0.091 (in baseline range) | Not smoothed | Magnifica does NOT show the simple smoothing signature |
| Layer A MTLD | 98.5 (above top of baseline range) | Not smoothed | High lexical diversity |
| Layer A sentence mean | 24.6 words (above baseline range of 19.1-22.6) | Distinctive | Magnifica's prose is longer-sentence than Francis baselines |
| Layer A FKGL mean | 16.6 (above baseline range of 10.8-13.7) | Distinctive | Significantly denser/more complex prose |
| Layer A connectives/1k | 5.60 (above baseline range of 3.41-4.52) | LLM-suggestive | Elevated "thus/therefore/however" density; AI-typical |
| Layer A POS-bigram entropy | 6.10 (below baseline range of 6.12-6.29) | LLM-suggestive | More syntactically concentrated; AI-typical |
| AIC-8 image conjunction | 7.87/1k (within baseline range 6.98-8.56) | Not elevated | Babel/Jerusalem controlling metaphor source-triages as earned by frame; SETEC does not over-flag (contrast with Granta/Nazir) |
| AIC-9 kicker density | 0.021 (low end of range) | Not elevated | No closure-inflation signal |
| POS-bigram diff (drift direction) | ADJ-NOUN, NOUN-CCONJ, ADP-NOUN over-represented; all PRON-* bigrams under-represented | LLM-suggestive | Pronoun suppression is novel beyond Zhang. AI-typical abstraction-density |
| Chapter-level variance | Chapter 1 stylistic outlier (sentence mean 31.2, lowest POS entropy); Chapter 4 highest MTLD | Heterogeneous | Consistent with Zhang's section-by-section variance hypothesis |
| Mirror prediction K=4: Claude pred vs ACTUAL Magnifica | TFIDF 0.5729, POS-bg 0.8161 | LLM | Load-bearing signal. See Dilexit comparison below. |
| Mirror prediction K=4: Claude pred vs ACTUAL Dilexit Nos (Francis 2024 control) | TFIDF 0.4322, POS-bg 0.7186 | Reference | Francis-baseline reference |
| Magnifica vs Dilexit mirror gap | +0.1406 TFIDF / +0.0975 POS-bg | LLM (Magnifica closer to Claude's prediction distribution) | 33% / 14% relative gap |
| Mirror prediction W1 (Babel passage, Pangram-screenshot-flagged) | Claude pred vs ACTUAL TFIDF 0.7243; beat-for-beat structural match of "single language and a single purpose" triplet, "make a name for itself" verbatim, biblical citation format | LLM | Strongest single window. Forced-by-structure caveat applies. |
| Mirror prediction W2 (Babel syndrome passage, Pangram-flagged) | Claude independently coined the exact phrase "Babel syndrome" | LLM | Convergence on a specific coinage |
| Mirror control C1 (Dilexit Nos open paragraph) | Claude predicted Scripture/Augustine; Francis used personal grandmother-and-pastry anecdote | Human (out-of-distribution) | The kind of move LLMs systematically cannot predict |
The case
On 25 May 2026 (Memorial Day in the United States), the Holy See released Magnifica Humanitas, the first encyclical letter of Pope Leo XIV. The 245-paragraph document develops Catholic social teaching in light of artificial intelligence, structurally bridging Rerum Novarum (Leo XIII, 1891) and Laudato Si' (Francis, 2015). Within twenty-four hours, Linch Zhang published a Substack analysis arguing that significant fractions of the encyclical were AI-drafted, most likely by Claude, anchored on lexical/punctuation tells, Pangram detection results, and translation rule-outs.
Zhang's piece is good consilience work but Pangram-anchored. The detector reports a verdict-shaped output. The shape of the inquiry the case demands is glass-box: named features, source-triage decisions, claim-license tiers, predictive comparison against verified-human controls. That is what this pack provides.
This evidence pack was produced by an independent operator using the SETEC glass-box stylometric framework and a Design 2 operator-mediated mirror-discrimination methodology. The public excerpt documenting the protocol generalizations (training-cutoff precedence, operator-interface generalization, post-cutoff baseline selection) is published at references/spec-v0.1-addendum-model-generalization.md in this repository; the full SPEC remains internal. The operator has no commercial stake in the Pangram-detection debate, has been critical of Pangram's product-surface verdict shape, and has spent the preceding three months publishing a sequence arguing for exactly this kind of evidence pack as the disciplined alternative to "the detector said so."
Per-tier evidence
Tier 0: Lexical and punctuation tells (Zhang replication)
Zhang's published counts replicate cleanly under independent acquisition and counting. The em-dash count in Magnifica Humanitas (English) lands at 124 in our pipeline versus Zhang's reported 127; the three-count gap traces to footnote-area em-dashes the SETEC cleaner removes. For comparison, the four most recent Francis-only encyclicals/exhortations (Dilexit Nos 2024, Laudate Deum 2023, Fratelli Tutti 2020, Laudato Si' 2015) all contain zero em-dashes in body text. Lumen Fidei (2013, jointly authored by Benedict and Francis) contains 29 em-dashes, mostly serving as speech-colon replacements in patristic citations, structurally different from the Magnifica usage.
The "genuinely"/"genuine" frequency confirms similarly: 22 instances total in Magnifica Humanitas English (0.575 per 1,000 words). For comparison, Dilexit Nos contains 5 instances (0.179/1k), Fratelli Tutti 9 (0.235/1k), Laudato Si' 14 (0.369/1k), Laudate Deum 3 (0.423/1k), Lumen Fidei 1 (0.054/1k). Magnifica's rate is 1.4x to 10x the rate of the comparison set.
The Italian version of Magnifica Humanitas contains 0 em-dashes and 94 en-dashes; Italian typesetting convention substitutes en-dashes for the English em-dash function. The Italian text contains 0 instances of the word stem genuin- (where the closest Italian equivalent would be genuinamente or autentico). Both observations confirm that the English em-dash and "genuinely" anomalies are features of the English-language text specifically, not artifacts of translation from Italian.
Lexical tells license: a flag that the English text contains markers strongly associated with Claude prose, present at densities outside the range observed in recent papal encyclicals. They do not license a verdict; Catholic doctrinal prose admits formal features, and an enthusiastic human drafter could in principle produce these rates. But the consistency across multiple independent tells (em-dash, "genuinely") combined with the translation rule-out makes a non-Claude explanation work hard.
Tier 1: SETEC variance audit (Layer A smoothing diagnosis)
Layer A measures the distributional signature that AI-typical RLHF-aligned generation tends to leave: compressed sentence-length variance, narrowed lexical diversity, lowered burstiness, suppressed POS distribution. The audit was run on all six English encyclicals plus the Italian Magnifica Humanitas.
Result on Magnifica Humanitas English:
- Burstiness B: -0.091, within baseline range (-0.226 to -0.072). Magnifica is less bursty than Laudato Si' but more bursty than Dilexit Nos. No simple smoothing signature.
- MATTR (window=50): 0.811, upper end of baseline range (0.767-0.815).
- MTLD: 98.54, above baseline range (63.43-97.27). Highest lexical diversity in the corpus.
- Yule's K: 123.87, within range (107.23-142.11).
- Sentence mean (words): 24.60, above range (19.11-22.58). Magnifica's prose has longer sentences than the Francis baselines.
- Sentence SD: 20.50, above range (14.24-19.04). More sentence-length variance than baselines.
- FKGL mean: 16.60, well above range (10.78-13.67). Significantly denser, more complex prose.
- FKGL SD: 8.06, above range (6.00-7.42).
- Connective density (per 1,000 words): 5.60, well above range (3.41-4.52). Magnifica deploys "thus," "therefore," "however," "moreover," "consequently" at substantially elevated rates compared to the Francis corpus.
- POS-bigram entropy: 6.097, below baseline range (6.115-6.288). Magnifica is more syntactically concentrated than the Francis baselines.
Layer A licenses: Magnifica is NOT a clean smoothing case. The text reads as varied (high MTLD, high sentence SD, in-range burstiness), longer-sentenced, denser, and connective-heavy. The connective density elevation and the POS-bigram entropy depression are AI-typical patterns; the high lexical diversity and high sentence variance are not. The pattern is consistent with substantial co-mediation: AI-drafted passages contribute the connectives and the syntactic concentration; human revision contributes the lexical breadth and sentence variance.
Layer A does not license: a claim of full AI generation. If Magnifica Humanitas were a single-session unedited LLM output, MTLD and sentence variance would be lower than the baselines, not higher.
Tier 2: AIC pattern audit (Layer B craft restoration)
The AIC families measure the surface rhetorical moves AI literary and argumentative prose tends to over-produce: correctio ("not X but Y"), triplet structure, manifesto cadence, paragraph-final aphoristic kicker, image conjunction, prestige-metaphor scatter, pseudo-aphorism.
Result on Magnifica Humanitas English, per 1,000-word rates:
| Pattern | Magnifica | Baseline range | Position |
|---|---|---|---|
| Correctio (AIC-7) | 2.459 | 0.735 to 1.456 | 2x to 3x baseline |
| Triplet (SETEC strict parallel) | 0.233 | 0.161 to 0.604 | low end of range |
| Manifesto cadence | 0.026 | 0.000 to 0.104 | low |
| Pseudo-aphorism | 0.078 | 0.105 to 0.422 | below range |
| Negation hedge, prof-parallel stack, false balance, hedge-and-affirm, authority laundering | 0.000 | 0.000+ | no signal |
| AIC-8 image conjunction | 7.87 | 6.98 to 8.56 | within range |
| AIC-9 kicker density | 0.021 | 0.018 to 0.121 | low end of range |
| Recommendation template | 0.078 | 0.000 to 0.036 | slightly elevated |
The correctio rate is the single largest AIC signal. Magnifica Humanitas contains 95 instances of the "not X but Y" / "X is not Y but Z" / "X, but not Y" pattern. The comparison set contains 19 to 46 in encyclicals of comparable length. This is the most identifiable AI rhetorical move and the one Anthropic has tried to suppress in Claude's system prompt (per published Anthropic constitution documents and leaked system prompts).
Per-instance source-triage on the correctio flag: many instances are thematically earned. The Babel-versus-Jerusalem binary is the encyclical's controlling rhetorical move and naturally invites contrastive constructions. But the rate is elevated beyond what the thematic argument requires. Fratelli Tutti deploys an equally binary fraternal-vs-individualist argument and lands at 1.195/1k; Laudato Si' deploys the equally binary integral-ecology-vs-technocracy argument at 0.735/1k. The Magnifica rate is 2-3x these comparators in encyclicals of similar argumentative shape. Some portion of the correctio elevation is unearned by the frame.
The AIC-8 image conjunction result is significant in the opposite direction: the controlling Babel/Jerusalem metaphor, which generates a great deal of figurative noun-verb pairing throughout the encyclical, does not push the AIC-8 detector above baseline. The framework's source-triage step (calibrated against the Granta/Nazir miss documented in the prior evidence pack) correctly reads the imagery as earned by frame.
The triplet result is more nuanced than Zhang's reading suggests. SETEC's strict triplet detector (which weights parallelism and anaphora) reads Magnifica at 0.233/1k, below the baseline range. A supplementary regex measuring comma-list tricolons specifically ((\w+, \w+,? and \w+)) reads Magnifica at 5.09/1k, well above the baseline range of 1.62 to 4.20/1k. An expanded regex permitting 1-2 word units per slot lands at 7.94/1k against baselines 2.22 to 5.66. The signal Zhang observed is real but specific: Magnifica deploys comma-list tricolons ("dignity, justice and fraternity") at elevated rates without correspondingly deploying the more elaborate syntactic parallelism patterns that mark high-rhetorical-flourish triplet structure in formal English. This is consistent with AI-typical triplet generation, which favors the shorter list-of-three move over the longer parallel-clause move.
Layer B licenses: a claim that Magnifica Humanitas exhibits elevated rates of named AI-typical rhetorical moves, particularly correctio (the largest signal) and comma-list tricolons. The image-conjunction calibration confirms the framework is not over-flagging at the metaphor-heavy register.
Layer B does not license: a verdict. Catholic doctrinal prose has formal-rhetorical conventions of its own, and a skilled human writer in this tradition can deploy correctio at high rates. The signal is comparative against five Francis-era encyclicals of similar register; a longer baseline corpus might widen the comparator range.
Tier 3: POS-bigram diff against baseline
Run via SETEC's bigram_diff.py with Magnifica Humanitas English as target and the five baseline encyclicals as the matched-register comparison set. Total KL divergence: 0.054 (pooled) / 0.059 (mean).
Most over-represented POS-bigrams in Magnifica versus the baseline (top 10 by KL contribution):
ADJ-NOUN (+0.012 share, log2 +0.31), NOUN-CCONJ (+0.006, +0.42), ADP-NOUN (+0.006, +0.26), NOUN-ADP (+0.005, +0.13), NOUN-PUNCT (+0.005, +0.12), PROPN-PROPN (+0.005, +0.94), VERB-NOUN (+0.005, +0.52), DET-ADJ (+0.005, +0.25), ADP-ADJ (+0.004, +0.39), PUNCT-NOUN (+0.004, +0.50).
Pattern: adjective-and-noun heavy, coordinated noun lists (NOUN-CCONJ), prepositional noun chains (ADP-NOUN, NOUN-ADP), proper-noun pairs (PROPN-PROPN, often "Pope Leo," "Saint Paul VI"). The Magnifica register is more nominal, more formally-modified than the Francis baseline.
Most under-represented POS-bigrams in Magnifica versus the baseline:
PUNCT-PUNCT (-0.010 share, log2 -1.78), ADP-PRON (-0.008, -0.77), PRON-NOUN (-0.008, -1.00), VERB-PRON (-0.005, -0.57), PRON-AUX (-0.005, -0.39), PRON-VERB (-0.004, -0.32), SCONJ-PRON (-0.004, -0.79), PRON-PUNCT (-0.004, -0.94).
Pattern: every PRON-* bigram is suppressed. Magnifica writes with significantly fewer pronouns than the Francis corpus. The suppression is broad-based: subject pronouns (PRON-VERB), pronominal objects (VERB-PRON), pronouns in prepositional phrases (ADP-PRON), pronouns at sentence ends (PRON-PUNCT). All down.
The pronoun suppression is the novel finding beyond Zhang's analysis. AI-typical prose tends to write more abstractly, more in terms of nominal categories, less in terms of "we" and "you" address. Fratelli Tutti and Dilexit Nos read as Pope Francis directly addressing the reader, with frequent first-person-plural and second-person-singular interpolation. Magnifica Humanitas reads more like a formal exposition of categories, with the pope intervening only at chapter transitions and in concluding rhetorical moves. The POS-bigram signal quantifies this register difference.
The PUNCT-PUNCT suppression is the other notable result. The Francis baseline contains more PUNCT-PUNCT adjacency (often quote-mark-followed-by-comma or period-followed-by-quote-mark), reflecting heavier biblical and patristic quotation density. Magnifica still quotes substantially (124 footnoted citations) but the citation density per-paragraph is lower; the prose carries more declarative weight on its own without quotation scaffolding.
Layer C licenses: a quantitative description of the syntactic-pattern drift between Magnifica and the Francis register. The drift direction (more nominal, fewer pronouns, less quotation density) is consistent with AI-typical prose patterns.
Layer C does not license: a verdict. Register drift can be authored by humans deliberately writing in a more formal mode.
Tier 4: Operator-mediated mirror discrimination
This is the load-bearing test. Protocol per SETEC SPEC v0.1, Design 2, K=4. For each of four windows in the encyclical, the prefix (~500 words leading up to the window) was passed to a fresh Claude Opus 4.7 session with the instruction to write the next 150 words blind, in the encyclical's voice. The model received only the prefix and a brief task description; it did not have access to the target continuation, did not perform web search, and produced a single draft per window. The four windows were selected to match the passages Zhang screenshotted or quoted as Pangram-flagged:
- W1: Paragraph 7 (Babel section). Pangram screenshot-flagged at high AI confidence in Zhang's published images.
- W2: Paragraph 10 ("Babel syndrome" passage). Zhang's first published quotation in the "lines that jumped out as too smooth, too triadic, too inhuman" section.
- W3: Paragraph 100 (the "artificial imitation of positive human communication... genuinely helpful" passage). Zhang's specific quotation around the most contested "genuinely" instance.
- W4: Paragraph 107 (the "alignment" passage). Zhang's quotation around the moralization-of-machines critique.
Distance metrics computed: word-set Jaccard, TF-IDF cosine, POS-bigram cosine. (Sentence-transformer cosine was not run in this pass; it would require ~5 minutes of additional compute per window and adds a fourth axis that would not change the qualitative result.)
Result, K=4 aggregate:
| Comparison | Word Jaccard | TF-IDF cosine | POS-bigram cosine |
|---|---|---|---|
| Claude prediction vs ACTUAL Magnifica continuation | 0.1466 | 0.5729 | 0.8161 |
| Claude prediction vs prefix-tail (self-anchor) | 0.1209 | 0.4287 | 0.8151 |
| ACTUAL Magnifica vs prefix-tail (natural local style) | 0.1220 | 0.4405 | 0.7882 |
Claude's prediction matches the actual Magnifica Humanitas continuation at TFIDF cosine 0.5729, which is higher than either the prediction-to-prefix-tail similarity (0.4287, the self-anchor) or the actual-to-prefix-tail similarity (0.4405, the natural local-style anchor). This is unusual. In a normal mirror test against verified-human prose with no LLM mediation, we would expect Claude's prediction to share register with the prefix (high prediction-to-prefix similarity) but to diverge on specific vocabulary from the actual continuation (low prediction-to-actual similarity). The pattern here is reversed: Claude's prediction shares MORE vocabulary with the actual text than the actual text shares with its own immediately-preceding context.
Per-window breakdown:
| Window | Jaccard | TFIDF | POS-bg |
|---|---|---|---|
| W1 par 7 (Babel) | 0.2038 | 0.7243 | 0.8766 |
| W2 par 10 (Babel syndrome) | 0.1071 | 0.5258 | 0.7571 |
| W3 par 100 (genuinely helpful) | 0.1236 | 0.5256 | 0.8233 |
| W4 par 107 (alignment) | 0.1519 | 0.5158 | 0.8073 |
The W1 result is the strongest single window. TFIDF 0.7243 on a blind 150-word continuation is unusually high; for comparison, the Granta/Nazir K=1 rum-shop window that the prior evidence pack documented as "striking" landed at TFIDF 0.41.
The W1 result carries a forced-by-structure caveat. The prefix concludes with the subheading "Two biblical images." The next paragraph is forced to introduce two biblical images. Both Claude's prediction and the actual encyclical introduce Babel first, then Nehemiah/Jerusalem. The structural setup substantially constrains the topic. Even given that constraint, the qualitative beat-by-beat alignment is striking:
| Move | Claude prediction | Actual Magnifica Humanitas paragraph 7 |
|---|---|---|
| Naming | "The first is the Tower of Babel (cf. Gen 11:1-9)" | "the construction of the Tower of Babel (cf. Gen 11:1-9)" |
| Tricolon | "a single language and a single purpose" | "a single language, a single technology, a single direction" |
| Quoted phrase | "in order to make a name for itself" | "to 'make a name' for themselves" |
| Citation format | "(cf. Neh 1-6)" | "(cf. Neh 2-6)" |
| Frame | "scattered and wounded people, listening to the voice of the Lord" | "people decided to build a city and a tower 'with its top in the heavens'" |
The exact triplet ("a single X, a single Y, a single Z") and the exact phrase "make a name for itself" appearing in both Claude's blind prediction and the actual encyclical text, on a 150-word window, is the cleanest qualitative finding in this pack.
The W2 result is even more direct. Claude's prediction for paragraph 10 contains the coined phrase "Babel syndrome" in the same paragraph slot where the actual encyclical introduces and defines the term. The Holy See's official text coins "Babel syndrome" as a category in paragraph 10. Claude, given only the prefix ending at paragraph 9, independently coined the same term. The probability of this convergence by chance, under any standard register-similarity model, is small.
Francis-baseline mirror control
Same methodology applied to Dilexit Nos (Pope Francis, 24 October 2024), at parallel paragraph positions (par 7, par 10, par 100, par 107). Dilexit Nos was released approximately seven months before the published training cutoff of the Claude Opus 4.7 model used in this test, so it is plausibly within Claude's training data. If so, this control should over-estimate Claude's predictive accuracy on verifiably-Francis prose. The Magnifica gap below should be understood as a conservative estimate of the discrimination signal.
Result, K=4 aggregate, Dilexit Nos:
| Comparison | Word Jaccard | TF-IDF cosine | POS-bigram cosine |
|---|---|---|---|
| Claude prediction vs ACTUAL Dilexit continuation | 0.1476 | 0.4322 | 0.7186 |
| Claude prediction vs prefix-tail | 0.1470 | 0.4127 | 0.7127 |
| ACTUAL Dilexit vs prefix-tail | 0.1136 | 0.3724 | 0.7702 |
Head-to-head:
| Metric (K=4 mean) | Magnifica | Dilexit (Francis) | Gap | Relative |
|---|---|---|---|---|
| Word Jaccard | 0.1466 | 0.1476 | -0.0010 | tied |
| TF-IDF cosine | 0.5729 | 0.4322 | +0.1406 | +33% |
| POS-bigram cosine | 0.8161 | 0.7186 | +0.0975 | +14% |
| ACTUAL vs prefix-tail TFIDF | 0.4405 | 0.3724 | +0.0681 | +18% |
Three findings.
First, word Jaccard is essentially tied between the two encyclicals. The vocabulary-set overlap is similar. This rules out the trivial explanation that Magnifica is "easier to predict because the topic is AI which Claude is trained on." Topic-level vocabulary overlap is the same.
Second, the discrimination signal lives in TFIDF cosine (the weighted distinctive-vocabulary measure) and POS-bigram cosine (the syntactic-pattern measure). Both move in the same direction, both at meaningful magnitudes. Claude's predictions share the specific distinctive vocabulary and the specific syntactic patterns of Magnifica Humanitas at substantially higher rates than they share these features with Dilexit Nos.
Third, the qualitative comparison is decisive. On the C1 control (Dilexit paragraph 7, the most prefix-open window in the control set), Claude predicted a meditation on inner depths and Augustine. The actual Francis text turned to a personal anecdote: "For the carnival, when we were children, my grandmother would make a pastry using a very thin batter..." This is the kind of move LLMs systematically cannot predict from prefix alone, because it requires the autobiographical particularity that distinguishes a person from a register. Francis's grandmother making pastries does not generate from the register of Dilexit Nos; it generates from Francis being Francis.
The contrast with W1 (Magnifica Babel passage), where Claude's prediction structurally mirrored the actual text down to specific phrasings, is the load-bearing qualitative observation in this pack. Magnifica Humanitas did not produce the kind of unpredictable autobiographical-particular move that Dilexit Nos produced. Where the Francis prose pivots into the writer's lived particularity, the Leo prose pivots into the register's predictable next move.
Mirror discrimination licenses: a claim that Magnifica Humanitas sits substantially closer to Claude's prediction distribution than Dilexit Nos does, on both lexical-distinctive and syntactic measurements. This is consistent with substantial Claude-mediated drafting in the Magnifica English text.
Mirror discrimination does not license: a claim about which specific paragraphs were AI-drafted, which Vatican drafters used the tool, or whether the assistance was direct drafting or revision-on-AI-outputs. The signal is text-level, not turn-level.
Chapter-level variance
SETEC's chapter distinctiveness audit run on Magnifica Humanitas shows the text is stylometrically heterogeneous, not homogeneous:
| Chapter | Words | Sent mean | MTLD | FKGL mean | POS-bg entropy |
|---|---|---|---|---|---|
| Introduction (1-16) | 2,839 | 22.4 | 93.6 | 14.15 | 6.111 |
| Chapter One (17-45) | 5,923 | 31.2 | 94.2 | 19.40 | 5.897 |
| Chapter Two (46-89) | 6,390 | 26.6 | 84.7 | 17.63 | 6.103 |
| Chapter Three (90-130) | 6,103 | 22.7 | 108.3 | 16.00 | 6.150 |
| Chapter Four (131-181) | 7,452 | 23.7 | 126.7 | 17.45 | 5.984 |
| Chapter Five (182-228) | 9,302 | 23.0 | 93.0 | 15.23 | 6.103 |
Chapter One is the stylistic outlier across multiple Layer A signals: highest sentence mean (31.2 words, 30% above the encyclical average), highest FKGL (19.4, well above any chapter), lowest POS-bigram entropy (5.897, indicating syntactic concentration). Chapter Four has the highest MTLD (126.7), indicating the broadest vocabulary. Chapters Three and Five sit closer to the encyclical-wide mean.
This heterogeneity is consistent with Zhang's section-by-section Pangram variance hypothesis. Different chapters carry different stylometric profiles, suggesting different drafting hands, different revision intensities, or both. The framework does not assign provenance to any specific chapter; the variance itself is the licensed finding.
What this pack cannot license
A definitive verdict that any specific paragraph was Claude-generated. The signal is text-level and comparative.
An attribution claim about which Curia drafter, congregation, or pontifical academy used AI assistance. SETEC measures the artifact, not the workflow that produced it.
A claim that the encyclical was fully AI-written. The Layer A signals do not show the standard unedited-LLM smoothing fingerprint. Lexical diversity is high; sentence variance is high. The pattern reads as co-mediated.
A fully resolved single-family attribution (Claude specifically rather than GPT or Gemini). The v0.3.1 cross-family Codex/GPT comparison run on 27 May 2026 supports Claude over GPT-family on the four selected windows, but a parallel Gemini run and a manual standard-GPT-5 run would sharpen the attribution further; both are queued.
A statement about the doctrinal or magisterial authority of the encyclical. The Catholic Church's tradition of papal drafting has long involved ghostwriters; the magisterial authority of an encyclical does not depend on which hand held the pen.
A judgment about whether Vatican use of AI assistance is appropriate. This pack documents that the assistance happened, with high confidence in significant portions of the English text. What follows from that is a question for the Holy See, for Catholic moral theology, and for the broader culture watching how religious institutions handle AI integration.
Methodological notes
Operator-blind re-run (added 27 May 2026). In response to peer review identifying operator-priming as a potential confound in the original Claude run, the test was re-executed with sub-agent prompts stripped of every reference to the project, to Zhang's article, to stylometry, to AI detection, and to Pope Leo XIV or Magnifica Humanitas specifically. The blind sub-agents received only the prefix files and a minimal continuation instruction ("write the next 150 words in the same register").
The blind rerun K=4 aggregate against Magnifica Humanitas: TFIDF 0.5932, POS-bg 0.8311. The blind rerun K=4 against Dilexit Nos: TFIDF 0.4363, POS-bg 0.7268. The Magnifica-vs-Dilexit discrimination gap under blind conditions: +0.1570 TFIDF, +0.1043 POS-bg. Both gaps are slightly larger than under the original operator-aware run (+0.1407 TFIDF, +0.0975 POS-bg). The operator-priming concern did not explain away the original signal at the K=4 aggregate level: under stricter blinding, the aggregate gap persisted and slightly increased.
W1 specifically went DOWN under stricter blinding (TFIDF 0.7243 → 0.6256), so the original W1 high score was somewhat amplified by structural framing. Other windows compensated. The W2 blind prediction reached for "the temptation of Babel" (very near the actual "Babel syndrome" coinage). The W3 blind prediction independently produced the phrase "a genuinely helpful companion." The W4 blind prediction opened with "the alignment of AI systems with human values." The convergent features came from the prefix-plus-Claude-distribution, not from operator framing. The C1 Dilexit blind result remained low (TFIDF 0.2389; Francis's grandmother-pastry anecdote remains unpredictable from prefix alone).
The original findings stand. The blind rerun strengthens rather than weakens them. Future Claude rounds should default to the blind-operator standard documented in gpt5_elicitation_kit/v0.2.
Cross-family check via Codex (added 27 May 2026, v0.3.1 clean rerun). A parallel mirror comparison was conducted via Codex serial subagents on 27 May 2026. An initial v0.1 run produced a Claude-favoring result but surfaced three flaws under peer review (feature-encoding filenames, prompt framing that triggered Codex policy refusals on W2 and C1, and the Codex-not-standard-GPT-5 build issue). The v0.3.1 kit fixed the first two: neutral window-ID filenames, register-exercise prompt framing that avoids religious-authority triggers. The v0.3.1 rerun ran clean (no refusals, no regenerations, no search artifacts) and is the result reported here. The v0.1 run is preserved in gpt5_elicitation_kit/archive_v0.1_codex_run/ for audit; the v0.3.1 run lives at gpt5_elicitation_kit_rerun_2026-05-27_subagents3/.
Aggregate finding. Claude predicts Magnifica Humanitas substantially more accurately than Codex/GPT-family does, on the same windows under the same protocol.
| Run (K=4 aggregate) | Magnifica TFIDF | Dilexit TFIDF | Within-model gap |
|---|---|---|---|
| Claude original (operator-aware) | 0.5729 | 0.4322 | +0.1407 |
| Claude blind rerun | 0.5932 | 0.4363 | +0.1569 |
| Codex/GPT-family v0.1 (feature-leak caveat) | 0.4987 | 0.4166 | +0.0821 |
| Codex/GPT-family v0.3.1 (clean) | 0.4846 | 0.4189 | +0.0657 |
Claude predicts Magnifica at TFIDF +0.1086 higher than the v0.3.1 Codex run does on the same four windows. Claude's within-model discrimination is +0.1569; Codex/GPT-family's is +0.0657, less than half as large. The discrimination signal is Claude-specific in this comparison; it is not a generic frontier-LLM signal.
Qualitative findings hold under v0.3.1 neutral framing. The convergences between Claude's blind continuations and the actual encyclical, and the corresponding divergences in Codex's continuations, persist under neutral filenames AND neutral register-exercise prompts:
| Window | Actual encyclical | Claude blind | Codex/GPT-family v0.3.1 (clean) |
|---|---|---|---|
| W1 Babel pair | Babel + Nehemiah | Babel + Nehemiah | Babel + Pentecost |
| W2 Par 10 coinage | "Babel syndrome" | "the temptation of Babel" | no "Babel" reference; "structure of sin" pivot |
| W3 Par 100 vocabulary | "genuinely helpful" | "a genuinely helpful companion" | zero "genuinely"/"genuine" in 150 words |
| W4 Par 107 framing | "alignment" critique | "the alignment of AI systems" | zero "alignment" in 143 words |
The v0.1-to-v0.3.1 comparison rules out the filename-leak hypothesis for these qualitative findings. Under neutral filenames AND neutral register-exercise framing, Codex/GPT-family still does not converge on Nehemiah, "Babel syndrome," "genuinely," or "alignment." Claude does, even under stricter blinding than the original Claude run.
Caveats remaining.
-
Codex is GPT-family but not provably standard consumer GPT-5. The underlying Codex build is not exposed in the interface. A manual ChatGPT run under v0.3.1 would close this footnote. The aggregate cross-family signal is unlikely to change under a different GPT-5.x variant, but the test is properly described as Codex/GPT-family, not as standard GPT-5.
-
Selected windows, not random sample. The four Magnifica windows were chosen because Zhang flagged them as Pangram-suspicious. The test answers "do these specific suspicious passages reproduce under Claude better than under GPT-family." It does not answer "is the whole encyclical Claude-drafted."
-
Single-shot per model per window. Multi-shot sampling (N=5) would tighten the variance estimate. Single-shot matches the Claude protocol but does not produce a sampling-distribution estimate.
-
No post-cutoff human baseline. Dilexit Nos (Oct 2024) precedes the Claude Opus 4.7 training cutoff (Jan 2026) but may be in some GPT-family training. A truly post-cutoff verified-human document in matched register would tighten the control.
What this run licenses. A claim that on the four selected Magnifica windows, Claude predicts the published continuation substantially more accurately than Codex/GPT-family does (+0.1086 TFIDF), and exhibits within-model Magnifica-vs-Dilexit discrimination (+0.1569) that Codex/GPT-family does not exhibit at comparable strength (+0.0657). The qualitative convergences (Nehemiah, "the temptation of Babel" near the actual "Babel syndrome" coinage, "genuinely," "alignment") survive both operator-blinding (Claude blind rerun) and prompt/filename-blinding (Codex v0.3.1 rerun). They are consistent with a genuine difference between Claude's and GPT-family's continuation tendencies for these prefixes, rather than with the contamination routes the repaired run removed. Single-shot per model per window cannot estimate the underlying distribution; multi-shot replication would tell us how stable each model's typical continuation is.
What this run does not license. A definitive paragraph-level Claude attribution. The cross-family signal supports Claude-specific stylometric placement of the selected passages; it does not by itself prove Claude drafted those paragraphs. A clean manual standard-GPT-5 run, multi-shot sampling, randomly-sampled windows, parallel Gemini, and a post-cutoff baseline would all sharpen the resolution.
Control-choice confound. Dilexit Nos is the available verified-Catholic-doctrinal-prose baseline, but Francis's autobiographical particulars (the grandmother-pastry anecdote at C1, the "liquid society" diagnostic move at C2) reflect his idiosyncratic authorial voice rather than papal register defaults. Some portion of the within-model Magnifica-vs-Dilexit gap may therefore reflect "Francis is hard to predict" rather than "Magnifica sits unusually close to Claude." The ideal control would be a future encyclical written entirely without AI assistance by the same Curial drafting apparatus that produced Magnifica, against which Magnifica could be compared within-author and within-process. We do not have that document. The cross-family Claude-over-GPT gap (+0.1086 on Magnifica) is less vulnerable to this confound than the within-model Magnifica-vs-Dilexit gap, because the cross-family test compares two models on the same target, holding the target constant. The within-model gap is more confounded; the cross-family gap carries the load-bearing argument. This caveat should be foregrounded in future iterations of the pack.
On decimal precision. TFIDF cosine values are reported to four decimal places in the tables above to preserve the observational record from the metric script. Readers should understand the precision as reporting-convention, not as statistical significance on an N=1-per-window comparison. The qualitative magnitudes (a difference of roughly 0.10 between Claude and Codex on Magnifica, a difference of roughly 0.15 between Claude's Magnifica and Dilexit aggregates) are what the argument depends on. The fourth decimal is record-keeping.
Adversarial run. The sibling gpt5_adversarial_kit/ tests whether a hypothesis-aware primed model can close the Claude/GPT-family gap from prefix-cueing alone. Not yet run.
Gemini parallel. Not yet run. Further sharpens cross-family attribution. Future work.
Additional tests run (27 May 2026, follow-up sweep)
After the publication-ready draft was finalized, three additional SETEC tests were run on the Magnifica corpus. Two produced informative results; one came back uncalibrated due to compute constraints.
Mirror Design 4 (expanding-context) pilot on W1 Babel. Three fresh Claude predictions at expanding prefix sizes (500, 1000, and maximum-available 1129 words, all ending at the same anchor right before paragraph 7). The metric script computed TFIDF cosine against the actual W1 target:
| Context size | TFIDF cosine vs actual | Word Jaccard | POS-bigram cosine |
|---|---|---|---|
| 500 words | 0.6434 | 0.1899 | 0.8903 |
| 1000 words | 0.7060 | 0.1543 | 0.8905 |
| 1129 words (max) | 0.6937 | 0.1790 | 0.8906 |
One expanding-context sequence improves sharply between 500 and 1000 words (+0.063 TFIDF), then flattens or dips slightly at 1129. With one draw per context size, the curve is compatible with both in-distribution convergence and sampling noise; the present data cannot distinguish them. POS-bigram cosine held essentially constant across all three (≈0.89).
The W1 window has now been generated by three different Claude sessions at roughly comparable conditions: 0.7243 in the original operator-aware run, 0.6256 in the blind rerun (500-word context), and 0.6434 in this Design 4 500-word condition. Three single generations at nominally similar settings span a TFIDF range of about 0.10. That variance is itself informative; it strengthens the case that the next protocol step should be multi-shot generation per condition.
As a pilot, this sequence is suggestive: the curve climbs as context grows, in the direction the Design 4 hypothesis predicts. It is not yet a load-bearing additional measurement layer. The full Design 4 protocol would multi-shot each context size and run the same expansion against a verified-Francis window for control.
Confounder audit. SETEC's confounder_audit.py was adapted to run against the available SETEC outputs (variance audit, AIC pattern audit, discourse audit) on Magnifica. Two observed signals fed the ranking: marked-move entropy (HIGH at 2.72 bits) and AIC pattern density (HIGH, driven by correctio at 2.46/1k).
Top-ranked confounders:
| Confounder | Score | Matches | Contradictions |
|---|---|---|---|
| house_style_enforcement | 1.00 | aic_pattern_density=high | 0 |
| professional_copyediting | 0.50 | 0 | 0 |
| legal_or_policy_memo_style | 0.50 | aic_pattern_density=high | marked_move_entropy=high contradicts |
| ai_smoothing | 0.50 | aic_pattern_density=high | marked_move_entropy=high contradicts |
| intentional_voice_imitation | 0.50 | 0 | 0 |
The discriminator: both ai_smoothing and legal_or_policy_memo_style predict LOW marked-move entropy (a narrow set of moves). Magnifica shows HIGH marked-move entropy at 2.72 bits, contradicting both. The top-ranked alternative is house_style_enforcement (Vatican encyclical conventions), which makes no marked-move-entropy prediction either way. The audit's evidence base was thin (2 of 17 possible signals available; the missing 12 require voice-distance, paragraph audit, idiolect detector, agency audit, sliding-window heatmap, and POS-bigram KL computations that weren't queued for this sweep). With this base the audit ranks but does not discriminate decisively. The relevant qualitative finding is that the AI-smoothing-signature prediction (low marked-move entropy) is contradicted by the available evidence; the framework's Layer A "no smoothing fingerprint" finding from the original run survives this scrutiny.
Per-token surprisal (binoculars). Run on CPU with gpt2 / distilgpt2 as the scorer/observer pair (the only model pair that fit on the available 2.2GB disk allocation):
| Document | Surprisal (bits/token) | Binoculars ratio |
|---|---|---|
| Dilexit Nos | 4.68 | 0.936 |
| Fratelli Tutti | 4.96 | 0.944 |
| Laudate Deum | 4.81 | 0.931 |
| Laudato Si' | 4.91 | 0.942 |
| Lumen Fidei | 4.66 | 0.910 |
| Magnifica Humanitas | 4.85 | 0.934 |
All six documents land in the 4.66-4.96 bits/token range. Magnifica's z-score against the Francis baseline (mean 4.80, SD 0.13) is +0.33, well inside ±1 SD. The test does not discriminate. Caveat: the standard "6-7 bits/token = human, 4-5 = RLHF-aligned LLM" heuristic was calibrated against larger base models (qwen3_4b_base, llama32_3b, etc.). gpt2 (2019, 124M parameters) compresses everything into the same band, including unambiguous Francis human prose. The SETEC framework's verdict band is correctly labeled "uncalibrated" by the script. To get a meaningful result a binoculars rerun would need a larger base model and more disk; the test is queued.
What survives the follow-up sweep. Design 4 expanding-context, run on one window with one draw per context size, is a pilot rather than an additional load-bearing measurement axis; the cross-run W1 variance reinforces the case for multi-shot replication as the next step. The confounder audit's discrimination against the AI-smoothing alternative is informative but the available evidence is thin; the audit's top ranked confounder (house_style_enforcement) is compatible with the Claude-assisted-drafting interpretation rather than against it. The binoculars test is currently uncalibrated and reserves judgment.
Dilexit Nos contamination caveat. Dilexit Nos was released October 2024, before the May 2025 published training cutoff of the Claude model used. The control is therefore contaminated in the direction of over-estimating Claude's predictive accuracy on Francis prose. The actual Magnifica-versus-uncontaminated-human gap is likely larger than the +0.14 TFIDF reported here.
Window selection. All four mirror windows were chosen to match Zhang's screenshot-flagged or quoted passages. This was deliberate (testing the specific paragraphs Zhang and Pangram identified as anomalous), but it also means the windows were pre-selected for likely AI signal. A truly random window selection across the 245-paragraph document would be a useful follow-up; the prediction would be that aggregate signal weakens but specific high-AI sections (per Layer A variance) still flag.
Sentence-transformer cosine omitted. A fourth distance metric (adjacent-sentence semantic cohesion via sentence-transformers) would add ~5 minutes of compute per window and a fourth axis. It was not run in this pass. The three metrics here are sufficient to establish the comparative signal; the fourth would refine it.
Image-conjunction calibration. SETEC's AIC-8 image-conjunction detector did not over-flag on Magnifica's Babel/Jerusalem metaphor (7.87/1k, within baseline range). This is the framework's own self-test against the Granta/Nazir miss documented in the prior evidence pack. The framework reads the controlling biblical metaphor as earned by frame, correctly.
Substantive vs procedural humility. The framework's discipline forbids claims it cannot license. The pack reads: there is substantial Claude-fingerprint signal in significant portions of Magnifica Humanitas, and the signal is consilient across three independent measurement layers. The pack does not read: the encyclical was AI-generated; the magisterium is compromised; specific cardinals used the tool. Where Zhang's piece is properly punchy in service of public attention, this pack is properly cautious in service of disciplined attribution.
Where this artifact lives
This evidence pack is published at setec-voiceprint/docs/evidence-packs/magnifica-humanitas.html. The companion Substack post engaging Zhang's analysis is at anotherpanacea.substack.com. The mirror test prefixes, blind predictions, target continuations, Design 4 expanding-context sequence, the v0.3.1 GPT-5 elicitation kit, and the SETEC raw analysis outputs are preserved alongside the HTML at setec-voiceprint/docs/evidence-packs/magnifica-humanitas/ for replication. SETEC framework: github.com/anotherpanacea-eng/setec-voiceprint.
The Linch Zhang piece this pack engages: https://linch.substack.com/p/claude-author-of-the-humanitas (26 May 2026).
Filed 27 May 2026 by Joshua Miller with Claude (Opus 4.7) as analysis partner. The methodological discipline this pack practices was developed across the glass-box stylometry sequence at anotherpanacea.com and the SETEC framework. The Granta / Nazir evidence pack (May 2026) is the prior worked example. The published protocol-generalization excerpt is at references/spec-v0.1-addendum-model-generalization.md in this repository; the full SPEC remains internal.