Try a new search

Format these results:

Searched for:

in-biosketch:true

person:oermae01

Total Results:

115


Self-improving generative foundation model for synthetic medical image generation and clinical applications

Wang, Jinzhuo; Wang, Kai; Yu, Yunfang; Lu, Yuxing; Xiao, Wenchao; Sun, Zhuo; Liu, Fei; Zou, Zixing; Gao, Yuanxu; Yang, Lei; Zhou, Hong-Yu; Miao, Hanpei; Zhao, Wenting; Huang, Lisha; Zeng, Lingchao; Guo, Rui; Chong, Ieng; Deng, Boyu; Cheng, Linling; Chen, Xiaoniao; Luo, Jing; Zhu, Meng-Hua; Baptista-Hon, Daniel; Monteiro, Olivia; Li, Ming; Ke, Yu; Li, Jiahui; Zeng, Simiao; Guan, Taihua; Zeng, Jin; Xue, Kanmin; Oermann, Eric; Luo, Huiyan; Yin, Yun; Zhang, Kang; Qu, Jia
In many clinical and research settings, the scarcity of high-quality medical imaging datasets has hampered the potential of artificial intelligence (AI) clinical applications. This issue is particularly pronounced in less common conditions, underrepresented populations and emerging imaging modalities, where the availability of diverse and comprehensive datasets is often inadequate. To address this challenge, we introduce a unified medical image-text generative model called MINIM that is capable of synthesizing medical images of various organs across various imaging modalities based on textual instructions. Clinician evaluations and rigorous objective measurements validate the high quality of MINIM's synthetic images. MINIM exhibits an enhanced generative capability when presented with previously unseen data domains, demonstrating its potential as a generalist medical AI (GMAI). Our findings show that MINIM's synthetic images effectively augment existing datasets, boosting performance across multiple medical applications such as diagnostics, report generation and self-supervised learning. On average, MINIM enhances performance by 12% for ophthalmic, 15% for chest, 13% for brain and 17% for breast-related tasks. Furthermore, we demonstrate MINIM's potential clinical utility in the accurate prediction of HER2-positive breast cancer from MRI images. Using a large retrospective simulation analysis, we demonstrate MINIM's clinical potential by accurately identifying targeted therapy-sensitive EGFR mutations using lung cancer computed tomography images, which could potentially lead to improved 5-year survival rates. Although these results are promising, further validation and refinement in more diverse and prospective settings would greatly enhance the model's generalizability and robustness.
PMID: 39663467
ISSN: 1546-170x
CID: 5762792

Generalizability of Kidney Transplant Data in Electronic Health Records - The Epic Cosmos Database versus the Scientific Registry of Transplant Recipients

Mankowski, Michal A; Bae, Sunjae; Strauss, Alexandra T; Lonze, Bonnie E; Orandi, Babak J; Stewart, Darren; Massie, Allan B; McAdams-DeMarco, Mara A; Oermann, Eric K; Habal, Marlena; Iturrate, Eduardo; Gentry, Sommer E; Segev, Dorry L; Axelrod, David
Developing real-world evidence from electronic health records (EHR) is vital to advance kidney transplantation (KT). We assessed the feasibility of studying KT using the Epic Cosmos aggregated EHR dataset, which includes 274 million unique individuals cared for in 238 U.S. health systems, by comparing it with the Scientific Registry of Transplant Recipients (SRTR). We identified 69,418 KT recipients transplanted between January 2014 and December 2022 in Cosmos (39.4% of all US KT transplants during this period). Demographics and clinical characteristics of recipients captured in Cosmos were consistent with the overall SRTR cohort. Survival estimates were generally comparable, although there were some differences in long-term survival. At 7 years post-transplant, patient survival was 80.4% in Cosmos and 77.8% in SRTR. Multivariable Cox regression showed consistent associations between clinical factors and mortality in both cohorts, with minor discrepancies in the associations between death and both age and race. In summary, Cosmos provides a reliable platform for KT research, allowing EHR-level clinical granularity not available with either the transplant registry or healthcare claims. Consequently, Cosmos will enable novel analyses to improve our understanding of KT management on a national scale.
PMID: 39550008
ISSN: 1600-6143
CID: 5754062

Economics and Equity of Large Language Models: Health Care Perspective

Nagarajan, Radha; Kondo, Midori; Salas, Franz; Sezgin, Emre; Yao, Yuan; Klotzman, Vanessa; Godambe, Sandip A; Khan, Naqi; Limon, Alfonso; Stephenson, Graham; Taraman, Sharief; Walton, Nephi; Ehwerhemuepha, Louis; Pandit, Jay; Pandita, Deepti; Weiss, Michael; Golden, Charles; Gold, Adam; Henderson, John; Shippy, Angela; Celi, Leo Anthony; Hogan, William R; Oermann, Eric K; Sanger, Terence; Martel, Steven
Large language models (LLMs) continue to exhibit noteworthy capabilities across a spectrum of areas, including emerging proficiencies across the health care continuum. Successful LLM implementation and adoption depend on digital readiness, modern infrastructure, a trained workforce, privacy, and an ethical regulatory landscape. These factors can vary significantly across health care ecosystems, dictating the choice of a particular LLM implementation pathway. This perspective discusses 3 LLM implementation pathways-training from scratch pathway (TSP), fine-tuned pathway (FTP), and out-of-the-box pathway (OBP)-as potential onboarding points for health systems while facilitating equitable adoption. The choice of a particular pathway is governed by needs as well as affordability. Therefore, the risks, benefits, and economics of these pathways across 4 major cloud service providers (Amazon, Microsoft, Google, and Oracle) are presented. While cost comparisons, such as on-demand and spot pricing across the cloud service providers for the 3 pathways, are presented for completeness, the usefulness of managed services and cloud enterprise tools is elucidated. Managed services can complement the traditional workforce and expertise, while enterprise tools, such as federated learning, can overcome sample size challenges when implementing LLMs using health care data. Of the 3 pathways, TSP is expected to be the most resource-intensive regarding infrastructure and workforce while providing maximum customization, enhanced transparency, and performance. Because TSP trains the LLM using enterprise health care data, it is expected to harness the digital signatures of the population served by the health care system with the potential to impact outcomes. The use of pretrained models in FTP is a limitation. It may impact its performance because the training data used in the pretrained model may have hidden bias and may not necessarily be health care-related. However, FTP provides a balance between customization, cost, and performance. While OBP can be rapidly deployed, it provides minimal customization and transparency without guaranteeing long-term availability. OBP may also present challenges in interfacing seamlessly with downstream applications in health care settings with variations in pricing and use over time. Lack of customization in OBP can significantly limit its ability to impact outcomes. Finally, potential applications of LLMs in health care, including conversational artificial intelligence, chatbots, summarization, and machine translation, are highlighted. While the 3 implementation pathways discussed in this perspective have the potential to facilitate equitable adoption and democratization of LLMs, transitions between them may be necessary as the needs of health systems evolve. Understanding the economics and trade-offs of these onboarding pathways can guide their strategic adoption and demonstrate value while impacting health care outcomes favorably.
PMID: 39541580
ISSN: 1438-8871
CID: 5753562

Economics and Equity of Large Language Models: Health Care Perspective

Nagarajan, Radha; Kondo, Midori; Salas, Franz; Sezgin, Emre; Yao, Yuan; Klotzman, Vanessa; Godambe, Sandip A; Khan, Naqi; Limon, Alfonso; Stephenson, Graham; Taraman, Sharief; Walton, Nephi; Ehwerhemuepha, Louis; Pandit, Jay; Pandita, Deepti; Weiss, Michael; Golden, Charles; Gold, Adam; Henderson, John; Shippy, Angela; Celi, Leo Anthony; Hogan, William R; Oermann, Eric K; Sanger, Terence; Martel, Steven
Large language models (LLMs) continue to exhibit noteworthy capabilities across a spectrum of areas, including emerging proficiencies across the health care continuum. Successful LLM implementation and adoption depend on digital readiness, modern infrastructure, a trained workforce, privacy, and an ethical regulatory landscape. These factors can vary significantly across health care ecosystems, dictating the choice of a particular LLM implementation pathway. This perspective discusses 3 LLM implementation pathways-training from scratch pathway (TSP), fine-tuned pathway (FTP), and out-of-the-box pathway (OBP)-as potential onboarding points for health systems while facilitating equitable adoption. The choice of a particular pathway is governed by needs as well as affordability. Therefore, the risks, benefits, and economics of these pathways across 4 major cloud service providers (Amazon, Microsoft, Google, and Oracle) are presented. While cost comparisons, such as on-demand and spot pricing across the cloud service providers for the 3 pathways, are presented for completeness, the usefulness of managed services and cloud enterprise tools is elucidated. Managed services can complement the traditional workforce and expertise, while enterprise tools, such as federated learning, can overcome sample size challenges when implementing LLMs using health care data. Of the 3 pathways, TSP is expected to be the most resource-intensive regarding infrastructure and workforce while providing maximum customization, enhanced transparency, and performance. Because TSP trains the LLM using enterprise health care data, it is expected to harness the digital signatures of the population served by the health care system with the potential to impact outcomes. The use of pretrained models in FTP is a limitation. It may impact its performance because the training data used in the pretrained model may have hidden bias and may not necessarily be health care-related. However, FTP provides a balance between customization, cost, and performance. While OBP can be rapidly deployed, it provides minimal customization and transparency without guaranteeing long-term availability. OBP may also present challenges in interfacing seamlessly with downstream applications in health care settings with variations in pricing and use over time. Lack of customization in OBP can significantly limit its ability to impact outcomes. Finally, potential applications of LLMs in health care, including conversational artificial intelligence, chatbots, summarization, and machine translation, are highlighted. While the 3 implementation pathways discussed in this perspective have the potential to facilitate equitable adoption and democratization of LLMs, transitions between them may be necessary as the needs of health systems evolve. Understanding the economics and trade-offs of these onboarding pathways can guide their strategic adoption and demonstrate value while impacting health care outcomes favorably.
PMID: 39541580
ISSN: 1438-8871
CID: 5753572

Trials and Tribulations: Responses of ChatGPT to Patient Questions About Kidney Transplantation

Xu, Jingzhi; Mankowski, Michal; Vanterpool, Karen B; Strauss, Alexandra T; Lonze, Bonnie E; Orandi, Babak J; Stewart, Darren; Bae, Sunjae; Ali, Nicole; Stern, Jeffrey; Mattoo, Aprajita; Robalino, Ryan; Soomro, Irfana; Weldon, Elaina; Oermann, Eric K; Aphinyanaphongs, Yin; Sidoti, Carolyn; McAdams-DeMarco, Mara; Massie, Allan B; Gentry, Sommer E; Segev, Dorry L; Levan, Macey L
PMID: 39477825
ISSN: 1534-6080
CID: 5747132

ChatGPT Solving Complex Kidney Transplant Cases: A Comparative Study With Human Respondents

Mankowski, Michal A; Jaffe, Ian S; Xu, Jingzhi; Bae, Sunjae; Oermann, Eric K; Aphinyanaphongs, Yindalon; McAdams-DeMarco, Mara A; Lonze, Bonnie E; Orandi, Babak J; Stewart, Darren; Levan, Macey; Massie, Allan; Gentry, Sommer; Segev, Dorry L
INTRODUCTION/BACKGROUND:ChatGPT has shown the ability to answer clinical questions in general medicine but may be constrained by the specialized nature of kidney transplantation. Thus, it is important to explore how ChatGPT can be used in kidney transplantation and how its knowledge compares to human respondents. METHODS:We prompted ChatGPT versions 3.5, 4, and 4 Visual (4 V) with 12 multiple-choice questions related to six kidney transplant cases from 2013 to 2015 American Society of Nephrology (ASN) fellowship program quizzes. We compared the performance of ChatGPT with US nephrology fellowship program directors, nephrology fellows, and the audience of the ASN's annual Kidney Week meeting. RESULTS:Overall, ChatGPT 4 V correctly answered 10 out of 12 questions, showing a performance level comparable to nephrology fellows (group majority correctly answered 9 of 12 questions) and training program directors (11 of 12). This surpassed ChatGPT 4 (7 of 12 correct) and 3.5 (5 of 12). All three ChatGPT versions failed to correctly answer questions where the consensus among human respondents was low. CONCLUSION/CONCLUSIONS:Each iterative version of ChatGPT performed better than the prior version, with version 4 V achieving performance on par with nephrology fellows and training program directors. While it shows promise in understanding and answering kidney transplantation questions, ChatGPT should be seen as a complementary tool to human expertise rather than a replacement.
PMCID:11441623
PMID: 39329220
ISSN: 1399-0012
CID: 5714092

Longitudinal deep neural networks for assessing metastatic brain cancer on a large open benchmark

Link, Katherine E; Schnurman, Zane; Liu, Chris; Kwon, Young Joon Fred; Jiang, Lavender Yao; Nasir-Moin, Mustafa; Neifert, Sean; Alzate, Juan Diego; Bernstein, Kenneth; Qu, Tanxia; Chen, Viola; Yang, Eunice; Golfinos, John G; Orringer, Daniel; Kondziolka, Douglas; Oermann, Eric Karl
The detection and tracking of metastatic cancer over the lifetime of a patient remains a major challenge in clinical trials and real-world care. Advances in deep learning combined with massive datasets may enable the development of tools that can address this challenge. We present NYUMets-Brain, the world's largest, longitudinal, real-world dataset of cancer consisting of the imaging, clinical follow-up, and medical management of 1,429 patients. Using this dataset we developed Segmentation-Through-Time, a deep neural network which explicitly utilizes the longitudinal structure of the data and obtained state-of-the-art results at small (<10 mm3) metastases detection and segmentation. We also demonstrate that the monthly rate of change of brain metastases over time are strongly predictive of overall survival (HR 1.27, 95%CI 1.18-1.38). We are releasing the dataset, codebase, and model weights for other cancer researchers to build upon these results and to serve as a public benchmark.
PMCID:11408643
PMID: 39289405
ISSN: 2041-1723
CID: 5720652

Longitudinal deep neural networks for assessing metastatic brain cancer on a large open benchmark

Link, Katherine E; Schnurman, Zane; Liu, Chris; Kwon, Young Joon Fred; Jiang, Lavender Yao; Nasir-Moin, Mustafa; Neifert, Sean; Alzate, Juan Diego; Bernstein, Kenneth; Qu, Tanxia; Chen, Viola; Yang, Eunice; Golfinos, John G; Orringer, Daniel; Kondziolka, Douglas; Oermann, Eric Karl
The detection and tracking of metastatic cancer over the lifetime of a patient remains a major challenge in clinical trials and real-world care. Advances in deep learning combined with massive datasets may enable the development of tools that can address this challenge. We present NYUMets-Brain, the world's largest, longitudinal, real-world dataset of cancer consisting of the imaging, clinical follow-up, and medical management of 1,429 patients. Using this dataset we developed Segmentation-Through-Time, a deep neural network which explicitly utilizes the longitudinal structure of the data and obtained state-of-the-art results at small (<10 mm3) metastases detection and segmentation. We also demonstrate that the monthly rate of change of brain metastases over time are strongly predictive of overall survival (HR 1.27, 95%CI 1.18-1.38). We are releasing the dataset, codebase, and model weights for other cancer researchers to build upon these results and to serve as a public benchmark.
PMCID:11408643
PMID: 39289405
ISSN: 2041-1723
CID: 5720662

Longitudinal deep neural networks for assessing metastatic brain cancer on a large open benchmark

Link, Katherine E; Schnurman, Zane; Liu, Chris; Kwon, Young Joon Fred; Jiang, Lavender Yao; Nasir-Moin, Mustafa; Neifert, Sean; Alzate, Juan Diego; Bernstein, Kenneth; Qu, Tanxia; Chen, Viola; Yang, Eunice; Golfinos, John G; Orringer, Daniel; Kondziolka, Douglas; Oermann, Eric Karl
The detection and tracking of metastatic cancer over the lifetime of a patient remains a major challenge in clinical trials and real-world care. Advances in deep learning combined with massive datasets may enable the development of tools that can address this challenge. We present NYUMets-Brain, the world's largest, longitudinal, real-world dataset of cancer consisting of the imaging, clinical follow-up, and medical management of 1,429 patients. Using this dataset we developed Segmentation-Through-Time, a deep neural network which explicitly utilizes the longitudinal structure of the data and obtained state-of-the-art results at small (<10 mm3) metastases detection and segmentation. We also demonstrate that the monthly rate of change of brain metastases over time are strongly predictive of overall survival (HR 1.27, 95%CI 1.18-1.38). We are releasing the dataset, codebase, and model weights for other cancer researchers to build upon these results and to serve as a public benchmark.
PMCID:11408643
PMID: 39289405
ISSN: 2041-1723
CID: 5720672

Assessing superficial temporal artery-middle cerebral artery anastomosis patency using FLOW 800 hemodynamics

Sangwon, Karl L; Nguyen, Matthew; Wiggan, Daniel D; Negash, Bruck; Alber, Daniel A; Liu, Xujin Chris; Liu, Albert; Rabbin-Birnbaum, Corinne; Sharashidze, Vera; Baranoski, Jacob; Raz, Eytan; Shapiro, Maksim; Rutledge, Caleb; Nelson, Peter Kim; Riina, Howard; Russin, Jonathan; Oermann, Eric K; Nossek, Erez
OBJECTIVE:The objective of this study was to investigate the use of indocyanine green videoangiography with FLOW 800 hemodynamic parameters intraoperatively during superficial temporal artery-middle cerebral artery (STA-MCA) bypass surgery to predict patency prior to anastomosis performance. METHODS:A retrospective and exploratory data analysis was conducted using FLOW 800 software prior to anastomosis to assess four regions of interest (ROIs; proximal and distal recipients and adjacent and remote gyri) for four hemodynamic parameters (speed, delay, rise time, and time to peak). Medical records were used to classify patients into flow and no-flow groups based on immediate or perioperative anastomosis patency. Hemodynamic parameters were compared using univariate and multivariate analyses. Principal component analysis was used to identify high risk of no flow (HRnf) and low risk of no flow (LRnf) groups, correlated with prospective angiographic follow-ups. Machine learning models were fitted to predict patency using FLOW 800 features, and the a posteriori effect of complication risk of those features was computed. RESULTS:A total of 39 cases underwent STA-MCA bypass surgery with complete FLOW 800 data collection. Thirty-five cases demonstrated flow after anastomosis revascularization and were compared with 4 cases with no flow after revascularization. Proximal and distal recipient speeds were significantly different between the no-flow and flow groups (proximal: 238.3 ± 120.8 and 138.5 ± 93.6, respectively [p < 0.001]; distal: 241.0 ± 117.0 and 142.1 ± 103.8, respectively [p < 0.05]). Based on principal component analysis, the HRnf group (n = 10) was characterized by high-flow speed (> 75th percentile) in all ROIs, whereas the LRnf group (n = 10) had contrasting patterns. In prospective long-term follow-up, 6 of 9 cases in the HRnf group, including the original no-flow cases, had no or low flow, whereas 8 of 8 cases in the LRnf group maintained robust flow. Machine learning models predicted patency failure with a mean F1 score of 0.930 and consistently relied on proximal recipient speed as the most important feature. Computation of posterior likelihood showed a 95.29% chance of patients having long-term patency given a lower proximal speed. CONCLUSIONS:These results suggest that a high proximal speed measured in the recipient vessel prior to anastomosis can elevate the risk of perioperative no flow and long-term reduction of flow. With an increased dataset size, continued FLOW 800-based ROI metric analysis could be used to guide intraoperative anastomosis site selection prior to anastomosis and predict patency outcome.
PMID: 39151199
ISSN: 1933-0693
CID: 5727032