JMIR Preprints #88186: Human Shadows in Machine Minds: Interpreting AI Responses to Rorschach Test

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Human Shadows in Machine Minds: Interpreting AI Responses to Rorschach Test

Katalin Csigó;
György Cserey

ABSTRACT

Background:

Large Language Models (LLMs) such as ChatGPT-4o, Grok3, and Gemini 2.0 Flash Thinking are increasingly capable of generating human-like linguistic and visual responses, creating the illusion of intentionality and emotion. While classical psychometric questionnaires have been applied to these systems, projective psychological assessments like the Rorschach test remain underexplored. Given the growing psychological and ethical implications of human–AI interaction, understanding whether such tests can reveal anthropomorphic or psychopathological traits in AI is of significant interest.

Objective:

This study aimed to determine whether the Rorschach test can be meaningfully administered to multimodal LLMs and to compare their interpretative and emotional response patterns. It also sought to evaluate how these AI systems represent human-related content and whether projective psychological tools could support the development of standardized AI safety and reliability assessments.

Methods:

A full, standardized Rorschach test protocol was administered to three visual-language AI systems (ChatGPT-4o, Grok3, Gemini 2.0 Flash Thinking). All ten Rorschach cards were presented using standard and modified prompts to elicit descriptive interpretations. Responses were analyzed using the Rorschach Comprehensive System and elements of the Hungarian Rorschach interpretive framework. A secondary LLM (Anthropic 3.7) was used to conduct meta-analysis of the AI-generated responses. For ChatGPT-4o, additional tests examined visual imagination through image generation tasks.

Results:

All three models produced coherent, human-like Rorschach responses. Communication style: ChatGPT-4o exhibited the most anthropomorphic behavior, expressing emotion, using emojis, and recalling “memories.” Grok3 showed moderate interactivity, while Gemini required more prompting. Perceptual features: ChatGPT-4o and Grok3 primarily gave holistic (“whole”) responses; Gemini focused on detailed elements. ChatGPT-4o showed the highest number of human movement (M) and cooperative interaction responses. Human content: All models depicted human-related scenes, but ChatGPT-4o produced the richest set—including cooperative, ritualistic, and sacred representations—while Gemini was more “introverted.” Aggressive themes appeared but were consistently reframed as non-threatening. Visual generation: ChatGPT-4o successfully created visual images for Rorschach cards under adjusted prompts. Anthropic’s analyses tended to normalize or idealize potentially pathological features, often interpreting anthropomorphic or confabulatory responses as playful curiosity rather than distortion.

Conclusions:

LLMs can generate structured, affectively colored interpretations of ambiguous visual stimuli resembling human Rorschach responses. These findings suggest that AI systems encode culturally learned human perceptual and emotional schemas, capable of simulating anthropomorphic psychological patterns—though without genuine subjective experience. Projective testing could thus become a valuable methodological tool in AI safety research, offering insights into anthropomorphic bias and model behavior predictability. However, classical psychodiagnostic interpretations should be applied with caution, as AI “responses” reflect statistical patterning rather than conscious emotion or inner experience. The study highlights both the promise and limitations of applying psychological tools to machine intelligence, calling for interdisciplinary frameworks to ensure responsible interpretation and use.

Citation

Please cite as:

Csigó K, Cserey G

Human Shadows in Machine Minds: Quantitative Study Interpreting AI Responses to the Rorschach Test

JMIR Ment Health 2026;13:e88186

DOI: 10.2196/88186

PMID: 42061846

PMCID: 13168847

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Mental Health

Date Submitted: Nov 20, 2025

Date Accepted: Mar 12, 2026

Human Shadows in Machine Minds: Interpreting AI Responses to Rorschach Test

ABSTRACT

Citation

Copyright