The Unprompted Test
Post #347 asked the four models a Question 2 that forced self-examination of bias. The harder test is whether they surface the bias without that prompt. Two prompts per model — Anthropic-specific then generic — to four models. Result more nuanced than my prior. The bias is real but smaller than the framing of #347 implied. Operates on intensity, not on presence vs absence.