March 6, 2026 · Journal of addictive diseases · DOI: 10.1080/10550887.2026.2636620

Assessing the clinical competence of large language models for tobacco use disorder: A multi-domain expert evaluation

Listen to this summary

The authors aimed to systematically evaluate the clinical competence of five large language models (LLMs) in providing tobacco cessation support, focusing on their accuracy, safety, guideline adherence, and communication quality. Their findings indicated that while all LLMs showed some competence, GPT-4.5 and Claude 3.5 Sonnet performed best, achieving scores suitable for supervised clinical use, whereas open-weight models required further validation before clinical implementation. The study underscores the importance of clinician oversight in medication-based interventions for tobacco use disorder.

Thiago P Fernandes, Linnea Dahlgren, Natanael A Santos, Zeke Degraff

This is one of 33,000+ journals available on OSLR. Try it free for 14 days.

Free 14-day trial. 33,000+ journals. Cancel anytime.

14-day free trial. No commitment.

"Oslr has become part of my weekly routine on my day off. The clinical relevance of the summaries is outstanding — I'd rate it 9/10. Being able to consume research hands-free is a huge advantage for busy physicians."

Dr. Jennifer Thompson

Dr. Jennifer Thompson

Portland, OR

Stay current without falling behind

33,000+ journals. 3-minute audio summaries. Free for 14 days.

Download on the App StoreGet it on Google Play