Searched for: in-biosketch:true
person:burkrj01
Large Language Model-Based Assessment of Clinical Reasoning Documentation in the Electronic Health Record Across Two Institutions: Development and Validation Study
Schaye, Verity; DiTullio, David; Guzman, Benedict Vincent; Vennemeyer, Scott; Shih, Hanniel; Reinstein, Ilan; Weber, Danielle E; Goodman, Abbie; Wu, Danny T Y; Sartori, Daniel J; Santen, Sally A; Gruppen, Larry; Aphinyanaphongs, Yindalon; Burk-Rafel, Jesse
BACKGROUND:Clinical reasoning (CR) is an essential skill; yet, physicians often receive limited feedback. Artificial intelligence holds promise to fill this gap. OBJECTIVE:We report the development of named entity recognition (NER), logic-based and large language model (LLM)-based assessments of CR documentation in the electronic health record across 2 institutions (New York University Grossman School of Medicine [NYU] and University of Cincinnati College of Medicine [UC]). METHODS:-scores for the NER, logic-based model and area under the receiver operating characteristic curve (AUROC) and area under the precision-recall curve (AUPRC) for the LLMs. RESULTS:-scores 0.80, 0.74, and 0.80 for D0, D1, D2, respectively. The GatorTron LLM performed best for EA2 scores AUROC/AUPRC 0.75/ 0.69. CONCLUSIONS:This is the first multi-institutional study to apply LLMs for assessing CR documentation in the electronic health record. Such tools can enhance feedback on CR. Lessons learned by implementing these models at distinct institutions support the generalizability of this approach.
PMID: 40117575
ISSN: 1438-8871
CID: 5813782
Community Racial and Ethnic Representation Among Physicians in US Internal Medicine Residency Programs
Kim, Jung G; Lett, Elle; Boscardin, Christy K; Hauer, Karen E; Chen, Isabel L; Henderson, Mark C; Hogan, Sean O; Yamazaki, Kenji; Burk-Rafel, Jesse; Fancher, Tonya; Nguyen, Mytien; Holmboe, Eric S; McDade, William; Boatright, Dowin H
IMPORTANCE/UNASSIGNED:Increasing underrepresented in medicine (URIM) physicians among historically underserved communities helps reduce health disparities. The concordance of URIM physicians with their communities improves access to care, particularly for American Indian and Alaska Native, Black, and Hispanic or Latinx individuals. OBJECTIVES/UNASSIGNED:To explore county-level racial and ethnic representation of US internal medicine (IM) residents, examine racial and ethnic concordance between residents and their communities, and assess whether representation varies by presence of academic institutions or underserved settings. DESIGN, SETTING, AND PARTICIPANTS/UNASSIGNED:This retrospective cross-sectional study collected data from the Association of American Medical Colleges, Accreditation Council for Graduate Medical Education (ACGME), Area Health Resources Files, and US Department of Education data on ACGME-accredited US IM residency programs and their associated county populations. Self-reported racial and ethnic data from 2018 for 4848 residents in 393 IM programs in 205 counties were used. Data were analyzed between February 15 and September 20, 2024. EXPOSURE/UNASSIGNED:County-level presence for academic health centers (AHCs), minority-serving institutions (MSIs), health professional shortage areas (HPSAs), and rurality. MAIN OUTCOMES AND MEASURES/UNASSIGNED:Main outcomes were representation quotients (RQs) or the ratio of the proportion of IM residents and their concordant county-level racial and ethnic populations. Quantile linear regression models on median representation were used to identify the association with URIM, Asian, and White residents by US Census division and county-level AHCs, MSIs, HPSAs, and rurality. RESULTS/UNASSIGNED:Among 4848 residents, 4 (0.08%) self-identified as American Indian or Alaskan Native, 1709 (35.3%) as Asian, 289 (6.0%) as Black, 211 (4.4%) as Hispanic or Latinx, 2 (0.04%) as Native Hawaiian or Other Pacific Islander, and 2633 (54.3%) as White. A total of 761 (15.7%) were classified as URIM. Among URIM groups, American Indian and Alaska Native (mean [SE] RQ, 0.00 [0.04]), Black (mean [SE] RQ, 0.09 [0.20]), Hispanic and Latinx (mean [SE] RQ, 0.00 [0.04]), and Native Hawaiian and other Pacific Islander (mean [SE] RQ, 0.00 [0.26]) residents were grossly underrepresented compared with their training sites' county-level representation. Fifty-one of 205 counties (24.8%) with IM programs had no URIM residents. Black and Hispanic or Latinx residents had higher representation in counties with more MSIs (mean [SD] RQ, 0.19 [0.24]; P = .04; mean [SD] RQ, 0.15 [0.04]; P < .001, respectively), and Hispanic or Latinx residents were less represented in counties with more AHCs (mean [SD] RQ, 0.00 [0.06]; P < .001). Asian residents had lower RQs in counties with more MSIs (mean [SD] RQ, 6.00 [0.65]; P < .001), and White residents had higher representation in counties with greater presence of AHCs (mean [SD] RQ, 0.77 [0.04]; P = .007). CONCLUSIONS AND RELEVANCE/UNASSIGNED:In this cross-sectional study, URIM IM residents remained underrepresented compared with their program's county populations. These findings should inform racial and ethnic diversity policies to address the continuing underrepresentation among graduate medical education physicians, which adversely impacts the care of historically underserved communities.
PMCID:11783195
PMID: 39883461
ISSN: 2574-3805
CID: 5781162
Characterizing Residents' Clinical Experiences-A Step Toward Precision Education
Burk-Rafel, Jesse; Drake, Carolyn B; Sartori, Daniel J
PMID: 39693075
ISSN: 2574-3805
CID: 5764502
A Theoretical Foundation to Inform the Implementation of Precision Education and Assessment
Drake, Carolyn B; Heery, Lauren M; Burk-Rafel, Jesse; Triola, Marc M; Sartori, Daniel J
Precision education (PE) uses personalized educational interventions to empower trainees and improve learning outcomes. While PE has the potential to represent a paradigm shift in medical education, a theoretical foundation to guide the effective implementation of PE strategies has not yet been described. Here, the authors introduce a theoretical foundation for the implementation of PE, integrating key learning theories with the digital tools that allow them to be operationalized. Specifically, the authors describe how the master adaptive learner (MAL) model, transformative learning theory, and self-determination theory can be harnessed in conjunction with nudge strategies and audit and feedback dashboards to drive learning and meaningful behavior change. The authors also provide practical examples of these theories and tools in action by describing precision interventions already in use at one academic medical center, concretizing PE's potential in the current clinical environment. These examples illustrate how a firm theoretical grounding allows educators to most effectively tailor PE interventions to fit individual learners' needs and goals, facilitating efficient learning and, ultimately, improving patient and health system outcomes.
PMID: 38113440
ISSN: 1938-808x
CID: 5612362
Learner Assessment and Program Evaluation: Supporting Precision Education
Richardson, Judee; Santen, Sally A; Mejicano, George C; Fancher, Tonya; Holmboe, Eric; Hogan, Sean O; Marin, Marina; Burk-Rafel, Jesse
Precision education (PE) systematically leverages data and advanced analytics to inform educational interventions that, in turn, promote meaningful learner outcomes. PE does this by incorporating analytic results back into the education continuum through continuous feedback cycles. These data-informed sequences of planning, learning, assessing, and adjusting foster competence and adaptive expertise. PE cycles occur at individual (micro), program (meso), or system (macro) levels. This article focuses on program- and system-level PE.Data for PE come from a multitude of sources, including learner assessment and program evaluation. The authors describe the link between these data and the vital role evaluation plays in providing evidence of educational effectiveness. By including prior program evaluation research supporting this claim, the authors illustrate the link between training programs and patient outcomes. They also describe existing national reports providing feedback to programs and institutions, as well as 2 emerging, multiorganization program- and system-level PE efforts. The challenges encountered by those implementing PE and the continuing need to advance this work illuminate the necessity for increased cross-disciplinary collaborations and a national cross-organizational data-sharing effort.Finally, the authors propose practical approaches for funding a national initiative in PE as well as potential models for advancing the field of PE. Lessons learned from successes by others illustrate the promise of these recommendations.
PMID: 38166211
ISSN: 1938-808x
CID: 5736972
Leveraging Electronic Health Record Data and Measuring Interdependence in the Era of Precision Education and Assessment
Sebok-Syer, Stefanie S; Small, William R; Lingard, Lorelei; Glober, Nancy K; George, Brian C; Burk-Rafel, Jesse
PURPOSE:The era of precision education is increasingly leveraging electronic health record (EHR) data to assess residents' clinical performance. But precision in what the EHR-based resident performance metrics are truly assessing is not fully understood. For instance, there is limited understanding of how EHR-based measures account for the influence of the team on an individual's performance-or conversely how an individual contributes to team performances. This study aims to elaborate on how the theoretical understandings of supportive and collaborative interdependence are captured in residents' EHR-based metrics. METHOD:Using a mixed methods study design, the authors conducted a secondary analysis of 5 existing quantitative and qualitative datasets used in previous EHR studies to investigate how aspects of interdependence shape the ways that team-based care is provided to patients. RESULTS:Quantitative analyses of 16 EHR-based metrics found variability in faculty and resident performance (both between and within resident). Qualitative analyses revealed that faculty lack awareness of their own EHR-based performance metrics, which limits their ability to act interdependently with residents in an evidence-informed fashion. The lens of interdependence elucidates how resident practice patterns develop across residency training, shifting from supportive to collaborative interdependence over time. Joint displays merging the quantitative and qualitative analyses showed that residents are aware of variability in faculty's practice patterns and that viewing resident EHR-based measures without accounting for the interdependence of residents with faculty is problematic, particularly within the framework of precision education. CONCLUSIONS:To prepare for this new paradigm of precision education, educators need to develop and evaluate theoretically robust models that measure interdependence in EHR-based metrics, affording more nuanced interpretation of such metrics when assessing residents throughout training.
PMID: 38207084
ISSN: 1938-808x
CID: 5686572
Foreword: The Next Era of Assessment and Precision Education
Schumacher, Daniel J; Santen, Sally A; Pugh, Carla M; Burk-Rafel, Jesse
PMID: 38109655
ISSN: 1938-808x
CID: 5612462
Precision Education: The Future of Lifelong Learning in Medicine
Desai, Sanjay V; Burk-Rafel, Jesse; Lomis, Kimberly D; Caverzagie, Kelly; Richardson, Judee; O'Brien, Celia Laird; Andrews, John; Heckman, Kevin; Henderson, David; Prober, Charles G; Pugh, Carla M; Stern, Scott D; Triola, Marc M; Santen, Sally A
The goal of medical education is to produce a physician workforce capable of delivering high-quality equitable care to diverse patient populations and communities. To achieve this aim amidst explosive growth in medical knowledge and increasingly complex medical care, a system of personalized and continuous learning, assessment, and feedback for trainees and practicing physicians is urgently needed. In this perspective, the authors build on prior work to advance a conceptual framework for such a system: precision education (PE).PE is a system that uses data and technology to transform lifelong learning by improving personalization, efficiency, and agency at the individual, program, and organization levels. PE "cycles" start with data inputs proactively gathered from new and existing sources, including assessments, educational activities, electronic medical records, patient care outcomes, and clinical practice patterns. Through technology-enabled analytics, insights are generated to drive precision interventions. At the individual level, such interventions include personalized just-in-time educational programming. Coaching is essential to provide feedback and increase learner participation and personalization. Outcomes are measured using assessment and evaluation of interventions at the individual, program, and organizational level, with ongoing adjustment for repeated cycles of improvement. PE is rooted in patient, health system, and population data; promotes value-based care and health equity; and generates an adaptive learning culture.The authors suggest fundamental principles for PE, including promoting equity in structures and processes, learner agency, and integration with workflow (harmonization). Finally, the authors explore the immediate need to develop consensus-driven standards: rules of engagement between people, products, and entities that interact in these systems to ensure interoperability, data sharing, replicability, and scale of PE innovations.
PMID: 38277444
ISSN: 1938-808x
CID: 5625442
The Next Era of Assessment: Can Ensuring High-Quality, Equitable Patient Care Be the Defining Characteristic?
Schumacher, Daniel J; Kinnear, Benjamin; Burk-Rafel, Jesse; Santen, Sally A; Bullock, Justin L
Previous eras of assessment in medical education have been defined by how assessment is done, from knowledge exams popularized in the 1960s to the emergence of work-based assessment in the 1990s to current efforts to integrate multiple types and sources of performance data through programmatic assessment. Each of these eras was a response to why assessment was performed (e.g., assessing medical knowledge with exams; assessing communication, professionalism, and systems competencies with work-based assessment). Despite the evolution of assessment eras, current evidence highlights the graduation of trainees with foundational gaps in the ability to provide high-quality care to patients presenting with common problems, and training program leaders report they graduate trainees they would not trust to care for themselves or their loved ones. In this article, the authors argue that the next era of assessment should be defined by why assessment is done: to ensure high-quality, equitable care. Assessment should place focus on demanding graduates possess the knowledge, skills, attitudes, and adaptive expertise to meet the needs of all patients and ensuring that graduates are able to do this in an equitable fashion. The authors explore 2 patient-focused assessment approaches that could help realize the promise of this envisioned era: entrustable professional activities (EPAs) and resident sensitive quality measures (RSQMs)/TRainee Attributable and Automatable Care Evaluations in Real-time (TRACERs). These examples illustrate how the envisioned next era of assessment can leverage existing and new data to provide precision education assessment that focuses on providing formative and summative feedback to trainees in a manner that seeks to ensure their learning outcomes prepare them to ensure high-quality, equitable patient outcomes.
PMID: 38109659
ISSN: 1938-808x
CID: 5612472
A New Tool for Holistic Residency Application Review: Using Natural Language Processing of Applicant Experiences to Predict Interview Invitation
Mahtani, Arun Umesh; Reinstein, Ilan; Marin, Marina; Burk-Rafel, Jesse
PROBLEM:Reviewing residency application narrative components is time intensive and has contributed to nearly half of applications not receiving holistic review. The authors developed a natural language processing (NLP)-based tool to automate review of applicants' narrative experience entries and predict interview invitation. APPROACH:Experience entries (n = 188,500) were extracted from 6,403 residency applications across 3 application cycles (2017-2019) at 1 internal medicine program, combined at the applicant level, and paired with the interview invitation decision (n = 1,224 invitations). NLP identified important words (or word pairs) with term frequency-inverse document frequency, which were used to predict interview invitation using logistic regression with L1 regularization. Terms remaining in the model were analyzed thematically. Logistic regression models were also built using structured application data and a combination of NLP and structured data. Model performance was evaluated on never-before-seen data using area under the receiver operating characteristic and precision-recall curves (AUROC, AUPRC). OUTCOMES:The NLP model had an AUROC of 0.80 (vs chance decision of 0.50) and AUPRC of 0.49 (vs chance decision of 0.19), showing moderate predictive strength. Phrases indicating active leadership, research, or work in social justice and health disparities were associated with interview invitation. The model's detection of these key selection factors demonstrated face validity. Adding structured data to the model significantly improved prediction (AUROC 0.92, AUPRC 0.73), as expected given reliance on such metrics for interview invitation. NEXT STEPS:This model represents a first step in using NLP-based artificial intelligence tools to promote holistic residency application review. The authors are assessing the practical utility of using this model to identify applicants screened out using traditional metrics. Generalizability must be determined through model retraining and evaluation at other programs. Work is ongoing to thwart model "gaming," improve prediction, and remove unwanted biases introduced during model training.
PMID: 36940395
ISSN: 1938-808x
CID: 5708082