AI Medical Evaluations: Swiss Study Shows Mixed Results in 2025

16:04, 09 May

Edited by: Veronika Nazarova

A recent Swiss study assessed the performance of four AI tools in medical evaluations, revealing varied outcomes. The study, conducted by the "On en parle" radio show, examined ChatGPT, WhatsApp's Llama4, Google's AI, and HUG's ConfIAnce.

The AI tools were tested using questions related to pediatric emergencies, mental health, and diabetes. General practitioners evaluated the AI responses based on medical advice and clarity.

Overall, ChatGPT and Google's AI demonstrated the strongest performance, while ConfIAnce received the lowest scores. Experts advise caution when using AI for emergencies, citing potential risks. HUG launched 'confIAnce' in February 2025 as Switzerland's first AI-powered medical chatbot providing verified medical information. Google is also improving AI Overviews in health searches to offer credible information.

Did you find an error or inaccuracy?

We will consider your comments as soon as possible.