Try a new search

Format these results:

Searched for:

in-biosketch:yes

person:nsr3

Total Results:

72


Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning

Coudray, Nicolas; Ocampo, Paolo Santiago; Sakellaropoulos, Theodore; Narula, Navneet; Snuderl, Matija; Fenyö, David; Moreira, Andre L; Razavian, Narges; Tsirigos, Aristotelis
Visual inspection of histopathology slides is one of the main methods used by pathologists to assess the stage, type and subtype of lung tumors. Adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) are the most prevalent subtypes of lung cancer, and their distinction requires visual inspection by an experienced pathologist. In this study, we trained a deep convolutional neural network (inception v3) on whole-slide images obtained from The Cancer Genome Atlas to accurately and automatically classify them into LUAD, LUSC or normal lung tissue. The performance of our method is comparable to that of pathologists, with an average area under the curve (AUC) of 0.97. Our model was validated on independent datasets of frozen tissues, formalin-fixed paraffin-embedded tissues and biopsies. Furthermore, we trained the network to predict the ten most commonly mutated genes in LUAD. We found that six of them-STK11, EGFR, FAT1, SETBP1, KRAS and TP53-can be predicted from pathology images, with AUCs from 0.733 to 0.856 as measured on a held-out population. These findings suggest that deep-learning models can assist pathologists in the detection of cancer subtype or gene mutations. Our approach can be applied to any cancer type, and the code is available at https://github.com/ncoudray/DeepPATH .
ORIGINAL:0014811
ISSN: 1556-0864
CID: 4662042

Deep EHR: Chronic Disease Prediction Using Medical Notes [PrePrint]

Liu, Jingshu; Zhang, Zachariah; Razavian, Narges
Early detection of preventable diseases is important for better disease management, improved inter-ventions, and more efficient health-care resource allocation. Various machine learning approacheshave been developed to utilize information in Electronic Health Record (EHR) for this task. Majorityof previous attempts, however, focus on structured fields and lose the vast amount of information inthe unstructured notes. In this work we propose a general multi-task framework for disease onsetprediction that combines both free-text medical notes and structured information. We compareperformance of different deep learning architectures including CNN, LSTM and hierarchical this http URL contrast to traditional text-based prediction models, our approach does not require disease specificfeature engineering, and can handle negations and numerical values that exist in the text. Ourresults on a cohort of about 1 million patients show that models using text outperform modelsusing just structured data, and that models capable of using numerical values and negations in thetext, in addition to the raw text, further improve performance. Additionally, we compare differentvisualization methods for medical professionals to interpret model predictions
ORIGINAL:0014819
ISSN: 2331-8422
CID: 4662152

Determining EGFR and STK11 mutational status in lung adenocarcinoma histopathology images using deep learning [Meeting Abstract]

Coudray, Nicolas; Moreira, Andre L; Sakellaropoulos, Theodore; Fenyo, David; Razavian, Narges; Tsirigos, Aristotelis
ORIGINAL:0014812
ISSN: 1538-7445
CID: 4662052

Multi-task Prediction of Disease Onsets from Longitudinal Laboratory Tests

Razavian, Narges; Marcus, Jake; Sontag, David
Disparate areas of machine learning have benefited from models that can take raw data with little preprocessing as input and learn rich representations of that raw data in order to perform well on a given prediction task. We evaluate this approach in healthcare by using longitudinal measurements of lab tests, one of the more raw signals of a patient’s health state widely available in clinical data, to predict disease onsets. In particular, we train a Long Short-Term Memory (LSTM) recurrent neural network and two novel convolutional neural networks for multi-task prediction of disease onset for 133 conditions based on 18 common lab tests measured over time in a cohort of 298K patients derived from 8 years of administrative claims data. We compare the neural networks to a logistic regression with several hand-engineered, clinically relevant features. We find that the representation-based learning approaches significantly outperform this baseline. We believe that our work suggests a new avenue for patient risk stratification based solely on lab results
ORIGINAL:0012257
ISSN: 1938-7288
CID: 2706922

Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors

Razavian, Narges; Blecker, Saul; Schmidt, Ann Marie; Smith-McLallen, Aaron; Nigam, Somesh; Sontag, David
We present a new approach to population health, in which data-driven predictive models are learned for outcomes such as type 2 diabetes. Our approach enables risk assessment from readily available electronic claims data on large populations, without additional screening cost. Proposed model uncovers early and late-stage risk factors. Using administrative claims, pharmacy records, healthcare utilization, and laboratory results of 4.1 million individuals between 2005 and 2009, an initial set of 42,000 variables were derived that together describe the full health status and history of every individual. Machine learning was then used to methodically enhance predictive variable set and fit models predicting onset of type 2 diabetes in 2009-2011, 2010-2012, and 2011-2013. We compared the enhanced model with a parsimonious model consisting of known diabetes risk factors in a real-world environment, where missing values are common and prevalent. Furthermore, we analyzed novel and known risk factors emerging from the model at different age groups at different stages before the onset. Parsimonious model using 21 classic diabetes risk factors resulted in area under ROC curve (AUC) of 0.75 for diabetes prediction within a 2-year window following the baseline. The enhanced model increased the AUC to 0.80, with about 900 variables selected as predictive (p < 0.0001 for differences between AUCs). Similar improvements were observed for models predicting diabetes onset 1-3 years and 2-4 years after baseline. The enhanced model improved positive predictive value by at least 50% and identified novel surrogate risk factors for type 2 diabetes, such as chronic liver disease (odds ratio [OR] 3.71), high alanine aminotransferase (OR 2.26), esophageal reflux (OR 1.85), and history of acute bronchitis (OR 1.45). Liver risk factors emerge later in the process of diabetes development compared with obesity-related factors such as hypertension and high hemoglobin A1c. In conclusion, population-level risk prediction for type 2 diabetes using readily available administrative data is feasible and has better prediction performance than classical diabetes risk prediction algorithms on very large populations with missing data. The new model enables intervention allocation at national scale quickly and accurately and recovers potentially novel risk factors at different stages before the disease onset.
PMID: 27441408
ISSN: 2167-647x
CID: 2185492

PREDICTING CHRONIC COMORBID CONDITIONS OF TYPE 2 DIABETES IN NEWLY-DIAGNOSED DIABETIC PATIENTS [Meeting Abstract]

Razavian, N; Smith-McLallen, A; Nigam, S; Blecker, S; Schmidt, AM; Sontag, D
ISI:000354498500282
ISSN: 1524-4733
CID: 2333322

PREVALENCE AND TIMING OF COMORBID COMPLICATIONS OF TYPE 2 DIABETES IN LARGE COHORT OF INSURANCE SUBSCRIBERS [Meeting Abstract]

Razavian, N; Smith-McLallen, A; Nigam, S; Blecker, S; Schmidt, AM; Sontag, D
ISI:000354498500284
ISSN: 1524-4733
CID: 2333332

Population-level Prediction of Type 2 Diabetes from Insurance Claims and Analysis of Risk Factors [Meeting Abstract]

Razavian, Narges; Smith-Mclallen, Aaron; Nigam, Somesh; Blecker, Saul; Schmidt, Ann Marie; Sontag, David
ISI:000359482700153
ISSN: 1939-327x
CID: 2333342

Advancing the frontier of data-driven healthcare

Razavian, Narges
Suchi Saria of Johns Hopkins University shares how big data and machine learning can help improve the practice of healthcare, and how computing students can contribute
ORIGINAL:0012255
ISSN: 1528-4980
CID: 2706892

Continuous graphical models for static and dynamic distributions : application to structural biology

Razavian, Narges Sharif
Pittsburgh PA : Carnegie Mellon University, 2013
Extent: 153 p.
ISBN: n/a
CID: 2706952