NYUHSL Faculty Bibliography

Searched for:

in-biosketch:yes

person:aphiny01

Total Results:

101

BMJ health & care informatics. 2021:28(1).DOI: 10.1136/bmjhci-2020-100267

Validation of parsimonious prognostic models for patients infected with COVID-19

Harish, Keerthi; Zhang, Ben; Stella, Peter; Hauck, Kevin; Moussa, Marwa M; Adler, Nicole M; Horwitz, Leora I; Aphinyanaphongs, Yindalon

OBJECTIVES/OBJECTIVE:Predictive studies play important roles in the development of models informing care for patients with COVID-19. Our concern is that studies producing ill-performing models may lead to inappropriate clinical decision-making. Thus, our objective is to summarise and characterise performance of prognostic models for COVID-19 on external data. METHODS:We performed a validation of parsimonious prognostic models for patients with COVID-19 from a literature search for published and preprint articles. Ten models meeting inclusion criteria were either (a) externally validated with our data against the model variables and weights or (b) rebuilt using original features if no weights were provided. Nine studies had internally or externally validated models on cohorts of between 18 and 320 inpatients with COVID-19. One model used cross-validation. Our external validation cohort consisted of 4444 patients with COVID-19 hospitalised between 1 March and 27 May 2020. RESULTS:Most models failed validation when applied to our institution's data. Included studies reported an average validation area under the receiver-operator curve (AUROC) of 0.828. Models applied with reported features averaged an AUROC of 0.66 when validated on our data. Models rebuilt with the same features averaged an AUROC of 0.755 when validated on our data. In both cases, models did not validate against their studies' reported AUROC values. DISCUSSION/CONCLUSIONS:Published and preprint prognostic models for patients infected with COVID-19 performed substantially worse when applied to external data. Further inquiry is required to elucidate mechanisms underlying performance deviations. CONCLUSIONS:Clinicians should employ caution when applying models for clinical prediction without careful validation on local data.

PMCID:8421114

PMID: 34479962

ISSN: 2632-1009

CID: 5000192

JAMIA open. 2021:4(3).DOI: 10.1093/jamiaopen/ooab083

Predicting inpatient pharmacy order interventions using provider action data

Balestra, Martina; Chen, Ji; Iturrate, Eduardo; Aphinyanaphongs, Yindalon; Nov, Oded

Objective/UNASSIGNED:The widespread deployment of electronic health records (EHRs) has introduced new sources of error and inefficiencies to the process of ordering medications in the hospital setting. Existing work identifies orders that require pharmacy intervention by comparing them to a patient's medical records. In this work, we develop a machine learning model for identifying medication orders requiring intervention using only provider behavior and other contextual features that may reflect these new sources of inefficiencies. Materials and Methods/UNASSIGNED:Data on providers' actions in the EHR system and pharmacy orders were collected over a 2-week period in a major metropolitan hospital system. A classification model was then built to identify orders requiring pharmacist intervention. We tune the model to the context in which it would be deployed and evaluate global and local feature importance. Results/UNASSIGNED:The resultant model had an area under the receiver-operator characteristic curve of 0.91 and an area under the precision-recall curve of 0.44. Conclusions/UNASSIGNED:Providers' actions can serve as useful predictors in identifying medication orders that require pharmacy intervention. Careful model tuning for the clinical context in which the model is deployed can help to create an effective tool for improving health outcomes without using sensitive patient data.

PMCID:8490931

PMID: 34617009

ISSN: 2574-2531

CID: 5092072

NPJ digital medicine. 2021:4(1).DOI: 10.1038/s41746-021-00453-0

An artificial intelligence system for predicting the deterioration of COVID-19 patients in the emergency department

Shamout, Farah E; Shen, Yiqiu; Wu, Nan; Kaku, Aakash; Park, Jungkyu; Makino, Taro; JastrzÄ™bski, StanisÅ‚aw; Witowski, Jan; Wang, Duo; Zhang, Ben; Dogra, Siddhant; Cao, Meng; Razavian, Narges; Kudlowitz, David; Azour, Lea; Moore, William; Lui, Yvonne W; Aphinyanaphongs, Yindalon; Fernandez-Granda, Carlos; Geras, Krzysztof J

During the coronavirus disease 2019 (COVID-19) pandemic, rapid and accurate triage of patients at the emergency department is critical to inform decision-making. We propose a data-driven approach for automatic prediction of deterioration risk using a deep neural network that learns from chest X-ray images and a gradient boosting model that learns from routine clinical variables. Our AI prognosis system, trained using data from 3661 patients, achieves an area under the receiver operating characteristic curve (AUC) of 0.786 (95% CI: 0.745-0.830) when predicting deterioration within 96â€‰hours. The deep neural network extracts informative areas of chest X-ray images to assist clinicians in interpreting the predictions and performs comparably to two radiologists in a reader study. In order to verify performance in a real clinical setting, we silently deployed a preliminary version of the deep neural network at New York University Langone Health during the first wave of the pandemic, which produced accurate predictions in real-time. In summary, our findings demonstrate the potential of the proposed system for assisting front-line physicians in the triage of COVID-19 patients.

PMID: 33980980

ISSN: 2398-6352

CID: 4867572

Proceedings of machine learning research. 2021:130:1459-1467.DOI:

Have We Learned to Explain?: How Interpretability Methods Can Learn to Encode Predictions in their Interpretations

Jethani, Neil; Sudarshan, Mukund; Aphinyanaphongs, Yindalon; Ranganath, Rajesh

While the need for interpretable machine learning has been established, many common approaches are slow, lack fidelity, or hard to evaluate. Amortized explanation methods reduce the cost of providing interpretations by learning a global selector model that returns feature importances for a single instance of data. The selector model is trained to optimize the fidelity of the interpretations, as evaluated by a predictor model for the target. Popular methods learn the selector and predictor model in concert, which we show allows predictions to be encoded within interpretations. We introduce EVAL-X as a method to quantitatively evaluate interpretations and REAL-X as an amortized explanation method, which learn a predictor model that approximates the true data generating distribution given any subset of the input. We show EVAL-X can detect when predictions are encoded in interpretations and show the advantages of REAL-X through quantitative and radiologist evaluation.

PMCID:8096519

PMID: 33954293

ISSN: 2640-3498

CID: 4866542

Circulation. 2021:143(13):1338-1340.DOI: 10.1161/CIRCULATIONAHA.120.053311

Multiple Biomarker Approach to Risk Stratification in COVID-19 [Letter]

Smilowitz, Nathaniel R; Nguy, Vuthy; Aphinyanaphongs, Yindalon; Newman, Jonathan D; Xia, Yuhe; Reynolds, Harmony R; Hochman, Judith S; Fishman, Glenn I; Berger, Jeffrey S

PMID: 33587646

ISSN: 1524-4539

CID: 4786532

Communications of the ACM. 2021:64(3):46-48.DOI: 10.1145/3417518

The transformation of patient-clinician relationships with AI-based medical advice

Nov, Oded; Aphinyanaphongs, Yindalon; Lui, Yvonne W.; Mann, Devin; Porfiri, Maurizio; Riedl, Mark; Rizzo, John Ross; Wiesenfeld, Batia

The transformation of patient-clinician relationships with AI-based medical advice is discussed. many new tools are based on entirely new "˜black-box"™ AI-based technologies, whose inner workings are likely not fully understood by patients or clinicians. Most patients with Type 1 diabetes now use continuous glucose monitors and insulin pumps to tightly manage their disease. Their clinicians carefully review the data streams from both devices to recommend dosage adjustments. Recently new automated recommender systems to monitor and analyze food intake, insulin doses, physical activity, and other factors influencing glucose levels, and provide data-intensive, AI-based recommendations on how to titrate the regimen, are in different stages of FDA approval using "˜black box"™ technology, which is an alluring proposition for a clinical scenario that requires identification of meaningful patterns in complex and voluminous data.

SCOPUS:85101579091

ISSN: 0001-0782

CID: 4832842

NEJM catalyst. 2021:2.DOI: 10.1056/CAT.20.0655

Supporting Acute Advance Care Planning with Precise, Timely Mortality Risk Predictions

Wang, Erwin; Major, Vincent J; Adler, Nicole; Hauck, Kevin; Austrian, Jonathan; Aphinyanaphongs, Yindalon; Horwitz, Leora I

ORIGINAL:0015307

ISSN: n/a

CID: 5000212

arXiv. 2021.DOI:

COVID-19 Deterioration Prediction via Self-Supervised Representation Learning and Multi-Image Prediction [PrePrint]

Sriram, Anuroop; Muckley, Matthew; Sinha, Koustuv; Shamout, Farah; Pineau, Joelle; Geras, Krzysztof J; Azour, Lea; Aphinyanaphongs, Yindalon; Yakubova, Nafissa; Moore, William

The rapid spread of COVID-19 cases in recent months has strained hospital resources, making rapid and accurate triage of patients presenting to emergency departments a necessity. Machine learning techniques using clinical data such as chest X-rays have been used to predict which patients are most at risk of deterioration. We consider the task of predicting two types of patient deterioration based on chest X-rays: adverse event deterioration (i.e., transfer to the intensive care unit, intubation, or mortality) and increased oxygen requirements beyond 6 L per day. Due to the relative scarcity of COVID-19 patient data, existing solutions leverage supervised pretraining on related non-COVID images, but this is limited by the differences between the pretraining data and the target COVID-19 patient data. In this paper, we use self-supervised learning based on the momentum contrast (MoCo) method in the pretraining phase to learn more general image representations to use for downstream tasks. We present three results. The first is deterioration prediction from a single image, where our model achieves an area under receiver operating characteristic curve (AUC) of 0.742 for predicting an adverse event within 96 hours (compared to 0.703 with supervised pretraining) and an AUC of 0.765 for predicting oxygen requirements greater than 6 L a day at 24 hours (compared to 0.749 with supervised pretraining). We then propose a new transformer-based architecture that can process sequences of multiple images for prediction and show that this model can achieve an improved AUC of 0.786 for predicting an adverse event at 96 hours and an AUC of 0.848 for predicting mortalities at 96 hours. A small pilot clinical study suggested that the prediction accuracy of our model is comparable to that of experienced radiologists analyzing the same information.

PMCID:7814828

PMID: 33469559

ISSN: 2331-8422

CID: 4760552

Journal of general internal medicine. 2021:Conference:(2021).DOI:

Notesense: development of a machine learning algorithm for feedback on clinical reasoning documentation [Meeting Abstract]

Schaye, V; Guzman, B; Burk, Rafel J; Kudlowitz, D; Reinstein, I; Miller, L; Cocks, P; Chun, J; Aphinyanaphongs, Y; Marin, M

BACKGROUND: Clinical reasoning (CR) is a core component of medical training, yet residents often receive little feedback on their CR documentation. Here we describe the process of developing a machine learning (ML) algorithm for feedback on CR documentation to increase the frequency and quality of feedback in this domain.
METHOD(S): To create this algorithm, note quality first had to be rated by gold standard human rating. We selected the IDEA Assessment Tool-a note rating instrument across four domains (I=Interpretive summary, D=Differential diagnosis, E=Explanation of reasoning, A=Alternative diagnoses explained) that uses a 3-point Likert scale without descriptive anchors. To develop descriptive anchors we conducted an iterative process reviewing notes from the EHR written by medicine residents and validated the Revised-IDEA Assessment Tool using Messick's framework- content validity, response process, relation to other variables, internal structure, and consequences. Using the Hofstee standard setting method, cutoffs for high quality clinical reasoning for the IDEA and DEA scores were set. We then created a dataset of expertrated notes to create the ML algorithm. First, a natural language processing software was applied to the set of notes that enabled recognition and automatic encoding of clinical information as a diagnosis or disease (D's), a sign or symptom (E or A), or semantic qualifier (e.g. most likely). Input variables to the ML algorithm included counts of D's, E/A's, semantic qualifiers, and proximity of semantic qualifiers to disease/ diagnosis. ML output focused on DEA quality and was binarized to low or high quality CR. Finally, 200 notes were randomly selected for human validation review comparing ML output to human rated DEA score.
RESULT(S): The IDEA and DEA scores ranged from 0-10 and 0-6, respectively. IDEA score of >= 6.5 and a DEA score of >= 3 was deemed high quality. 252 notes were rated to create the dataset and 20% were rated by 3 raters with high intraclass correlation 0.84 (95% CI 0.74-0.90). 120 of these notes comprised the testing set for ML model development. The logistic regression model was the best performing model with an AUC 0.87 and a positive predictive value (PPV) of 0.65. 48 (40%) of the notes were high quality. There was substantial interrater reliability between ML output and human rating on the 200 note validation set with a Cohen's Kappa 0.64.
CONCLUSION(S): We have developed a ML algorithm for feedback on CR documentation that we hypothesize will increase the frequency and quality of feedback in this domain. We have subsequently developed a dashboard that will display the output of the ML model. Next steps will be to provide internal medicine residents' feedback on their CR documentation using this dashboard and assess the impact this has on their documentation quality. LEARNING OBJECTIVE #1: Describe the importance of high quality documentation of clinical reasoning. LEARNING OBJECTIVE #2: Identify machine learning as a novel assessment tool for feedback on clinical reasoning documentation

EMBASE:635796491

ISSN: 1525-1497

CID: 4985012

Frontiers in oral health. 2021:2.DOI: 10.3389/froh.2021.729144

Comparative Effects of E-Cigarette Aerosol on Periodontium of Periodontitis Patients

Xu, Fangxi; Aboseria, Eman; Janal, Malvin N; Pushalkar, Smruti; Bederoff, Maria V; Vasconcelos, Rebeca; Sapru, Sakshi; Paul, Bidisha; Queiroz, Erica; Makwana, Shreya; Solarewicz, Julia; Guo, Yuqi; Aguallo, Deanna; Gomez, Claudia; Shelly, Donna; Aphinyanaphongs, Yindalon; Gordon, Terry; Corby, Patricia M; Kamer, Angela R; Li, Xin; Saxena, Deepak

PMCID:8757783

PMID: 35048050

ISSN: 2673-4842

CID: 5131632