Searched for: in-biosketch:true
person:masura01
Evaluating Large Language Models in Extracting Cognitive Exam Dates and Scores
Zhang, Hao; Jethani, Neil; Jones, Simon; Genes, Nicholas; Major, Vincent J; Jaffe, Ian S; Cardillo, Anthony B; Heilenbach, Noah; Ali, Nadia Fazal; Bonanni, Luke J; Clayburn, Andrew J; Khera, Zain; Sadler, Erica C; Prasad, Jaideep; Schlacter, Jamie; Liu, Kevin; Silva, Benjamin; Montgomery, Sophie; Kim, Eric J; Lester, Jacob; Hill, Theodore M; Avoricani, Alba; Chervonski, Ethan; Davydov, James; Small, William; Chakravartty, Eesha; Grover, Himanshu; Dodson, John A; Brody, Abraham A; Aphinyanaphongs, Yindalon; Masurkar, Arjun; Razavian, Narges
IMPORTANCE/UNASSIGNED:Large language models (LLMs) are crucial for medical tasks. Ensuring their reliability is vital to avoid false results. Our study assesses two state-of-the-art LLMs (ChatGPT and LlaMA-2) for extracting clinical information, focusing on cognitive tests like MMSE and CDR. OBJECTIVE/UNASSIGNED:Evaluate ChatGPT and LlaMA-2 performance in extracting MMSE and CDR scores, including their associated dates. METHODS/UNASSIGNED:Our data consisted of 135,307 clinical notes (Jan 12th, 2010 to May 24th, 2023) mentioning MMSE, CDR, or MoCA. After applying inclusion criteria 34,465 notes remained, of which 765 underwent ChatGPT (GPT-4) and LlaMA-2, and 22 experts reviewed the responses. ChatGPT successfully extracted MMSE and CDR instances with dates from 742 notes. We used 20 notes for fine-tuning and training the reviewers. The remaining 722 were assigned to reviewers, with 309 each assigned to two reviewers simultaneously. Inter-rater-agreement (Fleiss' Kappa), precision, recall, true/false negative rates, and accuracy were calculated. Our study follows TRIPOD reporting guidelines for model validation. RESULTS/UNASSIGNED:For MMSE information extraction, ChatGPT (vs. LlaMA-2) achieved accuracy of 83% (vs. 66.4%), sensitivity of 89.7% (vs. 69.9%), true-negative rates of 96% (vs 60.0%), and precision of 82.7% (vs 62.2%). For CDR the results were lower overall, with accuracy of 87.1% (vs. 74.5%), sensitivity of 84.3% (vs. 39.7%), true-negative rates of 99.8% (98.4%), and precision of 48.3% (vs. 16.1%). We qualitatively evaluated the MMSE errors of ChatGPT and LlaMA-2 on double-reviewed notes. LlaMA-2 errors included 27 cases of total hallucination, 19 cases of reporting other scores instead of MMSE, 25 missed scores, and 23 cases of reporting only the wrong date. In comparison, ChatGPT's errors included only 3 cases of total hallucination, 17 cases of wrong test reported instead of MMSE, and 19 cases of reporting a wrong date. CONCLUSIONS/UNASSIGNED:In this diagnostic/prognostic study of ChatGPT and LlaMA-2 for extracting cognitive exam dates and scores from clinical notes, ChatGPT exhibited high accuracy, with better performance compared to LlaMA-2. The use of LLMs could benefit dementia research and clinical care, by identifying eligible patients for treatments initialization or clinical trial enrollments. Rigorous evaluation of LLMs is crucial to understanding their capabilities and limitations.
PMCID:10888985
PMID: 38405784
CID: 5722422
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE [Meeting Abstract]
Khreish, Maroun; Zambrano, Ronald; Lee, TingFang; Hu, Jiyuan; Martinez, Philip; Diamond, Julia L.; Toyos, Allison; Balcer, Laura J.; Masurkar, Arjun; Schuman, Joel S.; Wollstein, Gadi
ISI:001313316206082
ISSN: 0146-0404
CID: 5765622
Feasibility of measuring blood-brain barrier permeability using ultra-short echo time radial magnetic resonance imaging
Bae, Jonghyun; Qayyum, Sawwal; Zhang, Jin; Das, Ayesha; Reyes, Isabel; Aronowitz, Eric; Stavarache, Mihaela A; Kaplitt, Michael G; Masurkar, Arjun; Kim, Sungheon Gene
BACKGROUND AND PURPOSE/OBJECTIVE:The purpose of this study is to evaluate the feasibility of using 3-dimensional (3D) ultra-short echo time (UTE) radial imaging method for measurement of the permeability of the blood-brain barrier (BBB) to gadolinium-based contrast agent. In this study, we propose to use the golden-angle radial sparse parallel (GRASP) method with 3D center-out trajectories for UTE, hence named as 3D UTE-GRASP. We first examined the feasibility of using 3D UTE-GRASP dynamic contrast-enhanced (DCE)-magnetic resonance imaging (MRI) for differentiating subtle BBB disruptions induced by focused ultrasound (FUS). Then, we examined the BBB permeability changes in Alzheimer's disease (AD) pathology using Alzheimer's disease transgenic mice (5xFAD) at different ages. METHODS:For FUS experiments, we used four Sprague Dawley rats at similar ages where we compared BBB permeability of each rat receiving the FUS sonication with different acoustic power (0.4-1.0 MPa). For AD transgenic mice experiments, we included three 5xFAD mice (6, 12, and 16 months old) and three wild-type mice (4, 8, and 12 months old). RESULTS:The result from FUS experiments showed a progressive increase in BBB permeability with increase of acoustic power (p < .05), demonstrating the sensitivity of DCE-MRI method for detecting subtle changes in BBB disruption. Our AD transgenic mice experiments suggest an early BBB disruption in 5xFAD mice, which is further impaired with aging. CONCLUSION/CONCLUSIONS:The results in this study substantiate the feasibility of using the proposed 3D UTE-GRASP method for detecting subtle BBB permeability changes expected in neurodegenerative diseases, such as AD.
PMID: 38616297
ISSN: 1552-6569
CID: 5646042
Diffusion imaging markers of accelerated aging of the lower cingulum in subjective cognitive decline
Flaherty, Ryn; Sui, Yu Veronica; Masurkar, Arjun V; Betensky, Rebecca A; Rusinek, Henry; Lazar, Mariana
INTRODUCTION/UNASSIGNED:Alzheimer's Disease (AD) typically starts in the medial temporal lobe, then develops into a neurodegenerative cascade which spreads to other brain regions. People with subjective cognitive decline (SCD) are more likely to develop dementia, especially in the presence of amyloid pathology. Thus, we were interested in the white matter microstructure of the medial temporal lobe in SCD, specifically the lower cingulum bundle that leads into the hippocampus. Diffusion tensor imaging (DTI) has been shown to differentiate SCD participants who will progress to mild cognitive impairment from those who will not. However, the biology underlying these DTI metrics is unclear, and results in the medial temporal lobe have been inconsistent. METHODS/UNASSIGNED: = 325, 127 SCD). Diffusion MRI was processed to generate regional and voxel-wise diffusion tensor values in bilateral lower cingulum white matter, while T1-weighted MRI was processed to generate regional volume and cortical thickness in the medial temporal lobe white matter, entorhinal cortex, temporal pole, and hippocampus. RESULTS/UNASSIGNED:SCD participants had thinner cortex in bilateral entorhinal cortex and right temporal pole. No between-group differences were noted for any of the microstructural metrics of the lower cingulum. However, correlations with delayed story recall were significant for all diffusion microstructure metrics in the right lower cingulum in SCD, but not in controls, with a significant interaction effect. Additionally, the SCD group showed an accelerated aging effect in bilateral lower cingulum with MD, AxD, and RD. DISCUSSION/UNASSIGNED:The diffusion profiles observed in both interaction effects are suggestive of a mixed neuroinflammatory and neurodegenerative pathology. Left entorhinal cortical thinning correlated with decreased FA and increased RD, suggestive of demyelination. However, right entorhinal cortical thinning also correlated with increased AxD, suggestive of a mixed pathology. This may reflect combined pathologies implicated in early AD. DTI was more sensitive than cortical thickness to the associations between SCD, memory, and age. The combined effects of mixed pathology may increase the sensitivity of DTI metrics to variations with age and cognition.
PMCID:11111894
PMID: 38784911
ISSN: 1664-2295
CID: 5651982
Sensitivity of unconstrained quantitative magnetization transfer MRI to amyloid burden in preclinical Alzheimer's disease
Mao, Andrew; Flassbeck, Sebastian; Marchetto, Elisa; Masurkar, Arjun V; Rusinek, Henry; Assländer, Jakob
Magnetization transfer MRI is sensitive to semisolid macromolecules, includingamyloid beta, and has previously been used to discriminate Alzheimer'sdisease (AD) patients from controls. Here, we fit an unconstrained 2-poolquantitative MT (qMT) model, that is, without constraints on the longitudinalrelaxation rate
PMID: 40800521
ISSN: 2837-6056
CID: 5907312
Improving measurement of blood-brain barrier permeability with reduced scan time using deep-learning-derived capillary input function
Bae, Jonghyun; Li, Chenyang; Masurkar, Arjun; Ge, Yulin; Kim, Sungheon Gene
PURPOSE:In Dynamic contrast-enhanced MRI (DCE-MRI), Arterial Input Function (AIF) has been shown to be a significant contributor to uncertainty in the estimation of kinetic parameters. This study is to assess the feasibility of using a deep learning network to estimate local Capillary Input Function (CIF) to estimate blood-brain barrier (BBB) permeability, while reducing the required scan time. MATERIALS AND METHOD:-10min methods in estimating the PS values. RESULTS:-10min. We found a 75% increase of BBB permeability in the gray matter and a 35% increase in the white matter, when comparing the older group to the younger group. CONCLUSIONS:We demonstrated the feasibility of estimating the capillary-level input functions using a deep learning network. We also showed that this method can be used to estimate subtle age-related changes in BBB permeability with reduced scan time, without compromising accuracy. Moreover, the trained deep learning network can automatically select CIF, reducing the potential uncertainty resulting from manual user-intervention.
PMCID:10475161
PMID: 37507078
ISSN: 1095-9572
CID: 5591772
Impact of white matter hyperintensities on subjective cognitive decline phenotype in a diverse cohort of cognitively normal older adults
Rothstein, Aaron; Zhang, Yian; Briggs, Anthony Q; Bernard, Mark A; Shao, Yongzhao; Favilla, Christopher; Sloane, Kelly; Witsch, Jens; Masurkar, Arjun V
OBJECTIVES:Subjective cognitive decline (SCD) is a preclinical stage of AD. White matter hyperintensities (WMH), an MRI marker of cerebral small vessel disease, associate with AD biomarkers and progression. The impact of WMH on SCD phenotype is unclear. METHODS/DESIGN:A retrospective, cross-sectional analysis was conducted on a diverse cohort with SCD evaluated at the NYU Alzheimer's Disease Research Center between January 2017 and November 2021 (n = 234). The cohort was dichotomized into none-to-mild (n = 202) and moderate-to-severe (n = 32) WMH. Differences in SCD and neurocognitive assessments were evaluated via Wilcoxon or Fisher exact tests, with p-values adjusted for demographics using multivariable logistic regression. RESULTS:Moderate-to-severe WMH participants reported more difficulty with decision making on the Cognitive Change Index (1.5 SD 0.7 vs. 1.2 SD 0.5, p = 0.0187) and worse short-term memory (2.2 SD 0.4 vs. 1.9 SD 0.3, p = 0.0049) and higher SCD burden (9.5 SD 1.6 vs. 8.7 SD 1.7, p = 0.0411) on the Brief Cognitive Rating Scale. Moderate-to-severe WMH participants scored lower on the Mini-Mental State Examination (28.0 SD 1.6 vs. 28.5 SD 1.9, p = 0.0491), and on delayed paragraph (7.2 SD 2.0 vs. 8.8 SD 2.9, p = 0.0222) and designs recall (4.5 SD 2.3 vs. 6.1 SD 2.5, p = 0.0373) of the Guild Memory Test. CONCLUSIONS:In SCD, WMH impact overall symptom severity, specifically in executive and memory domains, as well as objective performance on global and domain-specific tests in verbal memory and visual working/associative memory.
PMID: 37291739
ISSN: 1099-1166
CID: 5605232
Vigorous, regular physical exercise may slow disease progression in Alzheimer's disease
Devanand, Davangere P; Masurkar, Arjun V; Wisniewski, Thomas
INTRODUCTION/BACKGROUND:Mild to moderate exercise may decrease Alzheimer's disease (AD) risk, but the effects of vigorous, regular physical exercise remain unclear. METHODS:Two patients with initial diagnoses of amnestic mild cognitive impairment (MCI) demonstrated positive AD biomarkers throughout 16 and 8 years of follow-up, with final diagnoses of mild AD and amnestic MCI, respectively. RESULTS:Patient 1 was diagnosed with amnestic MCI at age 64. Neuropsychological testing, magnetic resonance imaging (MRI), fluorodeoxyglucose-positron emission tomography (FDG-PET), amyloid imaging PET, and cerebrospinal fluid (CSF) biomarkers during follow-ups remained consistent with AD. By age 80, progression was minimal with Montreal Cognitive Assessment (MoCA) 26 of 30. Patient 2 was diagnosed with amnestic MCI at age 72. Neuropsychological testing, MRI, FDG-PET, and amyloid imaging PET during follow-ups remained consistent with AD. At age 80, MoCA was 27 of 30 with no clinical progression. Both patients regularly performed vigorous, regular exercise that increased after retirement/work reduction. DISCUSSION/CONCLUSIONS:Vigorous, regular exercise may slow disease progression in biomarker-positive amnestic MCI and mild AD.
PMID: 36722738
ISSN: 1552-5279
CID: 5426712
Intracranial artery stenosis is associated with cortical thinning in stroke-free individuals of two longitudinal cohorts
Yang, Dixon; Masurkar, Arjun V; Khasiyev, Farid; Rundek, Tatjana; Wright, Clinton B; Elkind, Mitchell S V; Sacco, Ralph L; Gutierrez, Jose
BACKGROUND:We examined the association between asymptomatic intracranial artery stenosis (aICAS) and cortical thickness using brain magnetic resonance morphometry in two cohorts. METHODS:This cross-sectional study included stroke-free participants from the Northern Manhattan Study (NOMAS) and the National Alzheimer's Coordinating Center (NACC). We represented the predictor aICAS in NOMAS as a continuous global stenosis score reflecting an overall burden of stenosis (possible range 0-44) assessed by magnetic resonance angiography and in NACC as a dichotomous autopsy-determined Circle of Willis (CoW) atherosclerosis (none-mild vs moderate-severe). The primary outcome of interest was total cortical thickness. We analyzed each dataset separately using multivariable linear regression. RESULTS:The analysis included 1209 NOMAS (46% had any stenosis, 5% had ≥70% stenosis of at least one vessel; stenosis score range 0-11) and 392 NACC (36% moderate-severe CoW atherosclerosis) participants. We found an inverse relationship between stenosis score and total cortical thickness (β-estimate [95% confidence interval (CI)]: -2.98 [-5.85, -0.11]) in adjusted models. We replicated these results in NACC (β-estimate [95% CI]: -0.06 [-0.11, -0.003]). Post-hoc, we segregated stenosis scores by location and only posterior circulation stenosis score was associated with total cortical thickness (anterior β-estimate [95% CI]: -0.90 [-5.16, 3.36], posterior β-estimate [95% CI]: -7.25 [-14.30, -0.20]). CONCLUSION/CONCLUSIONS:We found both radiographically and neuropathologically determined aICAS to be associated with global cortical thinning. Interestingly, posterior circulation stenoses appeared to drive this association with global cortical thinning, raising the possibility of pathophysiologic mechanisms for cortical thinning other than impaired hemodynamics.
PMCID:9880900
PMID: 36577280
ISSN: 1878-5883
CID: 5409642
On gaps of clinical diagnosis of dementia subtypes: A study of Alzheimer"™s disease and Lewy body disease
Wei, Hui; Masurkar, Arjun V.; Razavian, Narges
Introduction: Alzheimer"™s disease (AD) and Lewy body disease (LBD) are the two most common neurodegenerative dementias and can occur in combination (AD+LBD). Due to overlapping biomarkers and symptoms, clinical differentiation of these subtypes could be difficult. However, it is unclear how the magnitude of diagnostic uncertainty varies across dementia spectra and demographic variables. We aimed to compare clinical diagnosis and post-mortem autopsy-confirmed pathological results to assess the clinical subtype diagnosis quality across these factors. Methods: We studied data of 1,920 participants recorded by the National Alzheimer"™s Coordinating Center from 2005 to 2019. Selection criteria included autopsy-based neuropathological assessments for AD and LBD, and the initial visit with Clinical Dementia Rating (CDR) stage of normal, mild cognitive impairment, or mild dementia. Longitudinally, we analyzed the first visit at each subsequent CDR stage. This analysis included positive predictive values, specificity, sensitivity and false negative rates of clinical diagnosis, as well as disparities by sex, race, age, and education. If autopsy-confirmed AD and/or LBD was missed in the clinic, the alternative clinical diagnosis was analyzed. Findings: In our findings, clinical diagnosis of AD+LBD had poor sensitivities. Over 61% of participants with autopsy-confirmed AD+LBD were diagnosed clinically as AD. Clinical diagnosis of AD had a low sensitivity at the early dementia stage and low specificities at all stages. Among participants diagnosed as AD in the clinic, over 32% had concurrent LBD neuropathology at autopsy. Among participants diagnosed as LBD, 32% to 54% revealed concurrent autopsy-confirmed AD pathology. When three subtypes were missed by clinicians, "No cognitive impairment" and "primary progressive aphasia or behavioral variant frontotemporal dementia" were the leading primary etiologic clinical diagnoses. With increasing dementia stages, the clinical diagnosis accuracy of black participants became significantly worse than other races, and diagnosis quality significantly improved for males but not females. Discussion: These findings demonstrate that clinical diagnosis of AD, LBD, and AD+LBD are inaccurate and suffer from significant disparities on race and sex. They provide important implications for clinical management, anticipatory guidance, trial enrollment and applicability of potential therapies for AD, and promote research into better biomarker-based assessment of LBD pathology.
SCOPUS:85151542204
ISSN: 1663-4365
CID: 5460452