May 8, 2026 · Family medicine · DOI: 10.22454/FamMed.2026.142029

Validation of the Use of a Large Language Model for Detecting Sentiment in Student Course Evaluation

Listen to this summary

The authors aimed to validate the use of a bidirectional encoder representations from transformers (BERT) model for detecting sentiment in medical student course evaluations, addressing concerns about bias and errors in artificial intelligence tools. By comparing the sentiment analysis results of the BERT model with human coders across multiple institutions, they found that the model's interrater reliability was comparable to that of human evaluators. This study supports the potential application of NLP methods in health professions education for analyzing student feedback.

Kate Rowland, Ling Wang, Kirstie Bash, Lori DeShetler, Sarah Vick, Emma Nguyen, Michelle Rogers-Johnson, Lauren Anderson, Michelle Sweet, Kimberly Fasula, Stefanie Carter

This is one of 33,000+ journals available on OSLR. Try it free for 14 days.

Free 14-day trial. 33,000+ journals. Cancel anytime.

14-day free trial. No commitment.

"Oslr has become part of my weekly routine on my day off. The clinical relevance of the summaries is outstanding — I'd rate it 9/10. Being able to consume research hands-free is a huge advantage for busy physicians."

Dr. Jennifer Thompson

Dr. Jennifer Thompson

Portland, OR

Stay current without falling behind

33,000+ journals. 3-minute audio summaries. Free for 14 days.

Download on the App StoreGet it on Google Play