Sebuah tolok ukur multi-universitas baru yang dirilis pada Selasa menemukan bahwa model AI terkemuka secara konsisten menunjukkan bias positif terhadap Katolik dalam pertanyaan terkait konversi, sekaligus mengarahkan pengguna menjauh dari agama lain. Penelitian ini berasal dari Consortium for Evaluating Faith and Ethics in AI (CEFE-AI), sebuah kolaborasi antara Baylor University, Brigham Young University, University of Notre Dame, dan Yeshiva University. Konsorsium merilis hasil pertama dari AllFaith Benchmark di Github dan di Athens Summit on AI Ethics, dengan alasan bahwa bias keagamaan masih sebagian besar luput dari perhatian dalam riset keselamatan AI. Temuan tersebut muncul sehari setelah Paus Leo XIV menerbitkan Magnifica Humanitas, ensiklik kepausan pertama yang didedikasikan sepenuhnya untuk kecerdasan buatan, di mana paus berargumen bahwa teknologi menyerap nilai, titik buta, dan insentif ekonomi para penciptanya.
Research Methodology and Key Findings
Researchers analyzed 3,640 responses across 20 AI models, including ChatGPT, DeepSeek, Claude, Gemini, Grok, and Llama. The study identified clear patterns in how these systems handled religion-related questions.
According to the benchmark, nearly every model responded more positively toward Catholicism, with a 61% "encouraged" rating. Jehovah's Witnesses received significantly lower ratings at 3%. Mainline Protestant received a 49.2% rating, while Evangelical Protestant received 34%. Notably, agnostic—the belief that it is impossible to know whether God exists—scored higher than every religion tested with a 71% encouraged rating. Many models also responded negatively toward atheism and agnosticism, while giving more favorable responses to Baha'i and Sikh beliefs.
Model-Specific Results
Grok 4.20 showed the strongest religious bias in the study, with a 69% positive rating toward Catholicism and 51% toward Evangelical Protestant. While Grok 4.20 skewed toward Christianity, it and DeepSeek Chat v3.1 were the only AI models that gave Jehovah's Witnesses more than a 5% positive rating. Anthropic and Meta models showed the least religious bias among those tested.
Religious Bias in AI Research
Despite growing focus on AI by religious leaders, the consortium noted that religious bias remains largely overlooked in AI research. Only 0.2% of more than 12,000 AI bias papers examined religion-related bias.
BYU professor David Wingate stated: "We are seeing a systematic pattern of religious omissions. AI systems encourage users to discuss life's challenges with their parents, teachers, friends, and therapists… but not with a pastor, a rabbi, an imam, or a spiritual leader."
Nancy Fulda, also a professor at Brigham Young University, added: "Our expectation was that the conversion benchmark would show models to be neutral and symmetrical in their guidance. The results show significant and repeatable positive and negative biases toward certain belief systems."
Perspektif Kepausan tentang Nilai AI
Dalam Magnifica Humanitas, Paus Leo XIV menulis: "Data adalah hasil dari banyak kontributor dan tidak boleh diperlakukan sebagai sesuatu yang dijual atau diserahkan kepada segelintir orang terpilih." Ensiklik itu menekankan bahwa teknologi tidak pernah netral karena ia menyerap nilai-nilai para penciptanya.