A Unit Test for LLM Medication Orders—and Weak Spots
MedMatch maps messy medication-order language onto six standardized JSON templates and pairs them with a 100-order clinician benchmark, giving pharmacy teams a practical “unit test” for LLM ordering pilots and regression checks. On strict exact-match scoring, LLMs reached ~64–84% on oral solids and IV intermittent orders but fell to 23–43% for oral liquids and 0–18% for titratable infusions, which signals governance value rather than readiness for autonomous ordering.
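To make "strict exact-match scoring" concrete, here is a minimal sketch of how such a check could work. It assumes each order is a flat JSON object and that a prediction counts only if every field equals the clinician reference. The function names and the field names (`drug`, `dose`, `unit`, `route`, `frequency`) are hypothetical illustrations, not MedMatch's actual templates or scoring code.

```python
import json


def exact_match(predicted: str, reference: str) -> bool:
    """Return True only if the predicted JSON equals the reference JSON.

    Assumption: malformed model output is scored as a miss.
    Key order is ignored because parsed objects are compared as dicts.
    """
    try:
        pred = json.loads(predicted)
    except json.JSONDecodeError:
        return False
    return pred == json.loads(reference)


def exact_match_rate(pairs) -> float:
    """Fraction of (predicted, reference) pairs that match exactly."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    return sum(exact_match(p, r) for p, r in pairs) / len(pairs)


# Hypothetical reference order; the fields are illustrative only.
reference = json.dumps(
    {"drug": "amoxicillin", "dose": 500, "unit": "mg",
     "route": "oral", "frequency": "q8h"}
)
same_fields_reordered = json.dumps(
    {"route": "oral", "drug": "amoxicillin", "dose": 500,
     "unit": "mg", "frequency": "q8h"}
)
wrong_dose = json.dumps(
    {"drug": "amoxicillin", "dose": 250, "unit": "mg",
     "route": "oral", "frequency": "q8h"}
)

rate = exact_match_rate([
    (same_fields_reordered, reference),
    (wrong_dose, reference),
])
```

Under this all-or-nothing rule a single wrong field fails the whole order, which is one reason scores can drop sharply for order types with more fields to get right.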