Towards Conversational Diagnostic AI – Challenges in evaluating AMIE, an AI agent for diagnostic dialog
May 2, 2024
Anil Palepu (Google Research) presents AMIE (Articulate Medical Intelligence Explorer), a large language model (LLM)--based AI system optimized for diagnostic dialogue. Anil will describe AMIE, its remarkable diagnostic accuracy and human interaction skills, and the challenges in evaluating its performance. The extensive testing included 149 case scenarios from clinical providers, 20 PCPs for comparison with AMIE, and evaluations by specialist physicians and patient actors.