Searched for: person:heacol01
in-biosketch:true
PACS-integrated machine learning breast density classifier: clinical validation
Lewin, John; Schoenherr, Sven; Seebass, Martin; Lin, MingDe; Philpotts, Liane; Etesami, Maryam; Butler, Reni; Durand, Melissa; Heller, Samantha; Heacock, Laura; Moy, Linda; Tocino, Irena; Westerhoff, Malte
OBJECTIVE:To test the performance of a novel machine learning-based breast density tool. The tool utilizes a convolutional neural network to predict the BI-RADS based density assessment of a study. The clinical density assessments of 33,000 mammographic examinations (164,000 images) from one academic medical center (Site A) were used for training. MATERIALS AND METHODS/METHODS:This was an IRB approved HIPAA compliant study performed at two academic medical centers. The validation data set was composed of 500 studies from one site (Site A) and 700 from another (Site B). At Site A, each study was assessed by three breast radiologists and the majority (consensus) assessment was used as truth. At Site B, if the tool agreed with the clinical reading, then it was considered to have correctly predicted the clinical reading. In cases where the tool and the clinical reading disagreed, then the study was evaluated by three radiologists and the consensus reading was used as the clinical reading. RESULTS:For the classification into the four categories of the Breast Imaging Reporting and Data System (BI-RADS®), the AI classifier had an accuracy of 84.6% at Site A and 89.7% at Site B. For binary classification (dense vs. non-dense), the AI classifier had an accuracy of 94.4% at Site A and 97.4% at Site B. In no case did the classifier disagree with the consensus reading by more than one category. CONCLUSIONS:The automated breast density tool showed high agreement with radiologists' assessments of breast density.
PMID: 37421715
ISSN: 1873-4499
CID: 5539562
Improving Information Extraction from Pathology Reports using Named Entity Recognition
Zeng, Ken G; Dutt, Tarun; Witowski, Jan; Kranthi Kiran, G V; Yeung, Frank; Kim, Michelle; Kim, Jesi; Pleasure, Mitchell; Moczulski, Christopher; Lopez, L Julian Lechuga; Zhang, Hao; Harbi, Mariam Al; Shamout, Farah E; Major, Vincent J; Heacock, Laura; Moy, Linda; Schnabel, Freya; Pak, Linda M; Shen, Yiqiu; Geras, Krzysztof J
Pathology reports are considered the gold standard in medical research due to their comprehensive and accurate diagnostic information. Natural language processing (NLP) techniques have been developed to automate information extraction from pathology reports. However, existing studies suffer from two significant limitations. First, they typically frame their tasks as report classification, which restricts the granularity of extracted information. Second, they often fail to generalize to unseen reports due to variations in language, negation, and human error. To overcome these challenges, we propose a BERT (bidirectional encoder representations from transformers) named entity recognition (NER) system to extract key diagnostic elements from pathology reports. We also introduce four data augmentation methods to improve the robustness of our model. Trained and evaluated on 1438 annotated breast pathology reports, acquired from a large medical center in the United States, our BERT model trained with data augmentation achieves an entity F1-score of 0.916 on an internal test set, surpassing the BERT baseline (0.843). We further assessed the model's generalizability using an external validation dataset from the United Arab Emirates, where our model maintained satisfactory performance (F1-score 0.860). Our findings demonstrate that our NER systems can effectively extract fine-grained information from widely diverse medical reports, offering the potential for large-scale information extraction in a wide range of medical and AI research. We publish our code at https://github.com/nyukat/pathology_extraction.
PMCID:10350195
PMID: 37461545
CID: 5588752
Women 75 Years Old or Older: To Screen or Not to Screen?
Lee, Cindy S; Lewin, Alana; Reig, Beatriu; Heacock, Laura; Gao, Yiming; Heller, Samantha; Moy, Linda
Breast cancer is the most common cancer in women, with the incidence rising substantially with age. Older women are a vulnerable population at increased risk of developing and dying from breast cancer. However, women aged 75 years and older were excluded from all randomized controlled screening trials, so the best available data regarding screening benefits and risks in this age group are from observational studies and modeling predictions. Benefits of screening in older women are the same as those in younger women: early detection of smaller lower-stage cancers, resulting in less invasive treatment and lower morbidity and mortality. Mammography performs significantly better in older women with higher sensitivity, specificity, cancer detection rate, and positive predictive values, accompanied by lower recall rates and false positives. The overdiagnosis rate is low, with benefits outweighing risks until age 90 years. Although there are conflicting national and international guidelines about whether to continue screening mammography in women beyond age 74 years, clinicians can use shared decision making to help women make decisions about screening and fully engage them in the screening process. For women aged 75 years and older in good health, continuing annual screening mammography will save the most lives. An informed discussion of the benefits and risks of screening mammography in older women needs to include each woman's individual values, overall health status, and comorbidities. This article will review the benefits, risks, and controversies surrounding screening mammography in women 75 years old and older and compare the current recommendations for screening this population from national and international professional organizations. ©RSNA, 2023 Quiz questions for this article are available through the Online Learning Center.
PMID: 37053102
ISSN: 1527-1323
CID: 5464252
New Horizons: Artificial Intelligence for Digital Breast Tomosynthesis
Goldberg, Julia E; Reig, Beatriu; Lewin, Alana A; Gao, Yiming; Heacock, Laura; Heller, Samantha L; Moy, Linda
The use of digital breast tomosynthesis (DBT) in breast cancer screening has become widely accepted, facilitating increased cancer detection and lower recall rates compared with those achieved by using full-field digital mammography (DM). However, the use of DBT, as compared with DM, raises new challenges, including a larger number of acquired images and thus longer interpretation times. While most current artificial intelligence (AI) applications are developed for DM, there are multiple potential opportunities for AI to augment the benefits of DBT. During the diagnostic steps of lesion detection, characterization, and classification, AI algorithms may not only assist in the detection of indeterminate or suspicious findings but also aid in predicting the likelihood of malignancy for a particular lesion. During image acquisition and processing, AI algorithms may help reduce radiation dose and improve lesion conspicuity on synthetic two-dimensional DM images. The use of AI algorithms may also improve workflow efficiency and decrease the radiologist's interpretation time. There has been significant growth in research that applies AI to DBT, with several algorithms approved by the U.S. Food and Drug Administration for clinical implementation. Further development of AI models for DBT has the potential to lead to improved practice efficiency and ultimately improved patient health outcomes of breast cancer screening and diagnostic evaluation. See the invited commentary by Bahl in this issue. ©RSNA, 2022.
PMID: 36331878
ISSN: 1527-1323
CID: 5356862
Improving breast cancer diagnostics with deep learning for MRI
Witowski, Jan; Heacock, Laura; Reig, Beatriu; Kang, Stella K; Lewin, Alana; Pysarenko, Kristine; Patel, Shalin; Samreen, Naziya; Rudnicki, Wojciech; ÅuczyÅ„ska, Elżbieta; Popiela, Tadeusz; Moy, Linda; Geras, Krzysztof J
Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has a high sensitivity in detecting breast cancer but often leads to unnecessary biopsies and patient workup. We used a deep learning (DL) system to improve the overall accuracy of breast cancer diagnosis and personalize management of patients undergoing DCE-MRI. On the internal test set (n = 3936 exams), our system achieved an area under the receiver operating characteristic curve (AUROC) of 0.92 (95% CI: 0.92 to 0.93). In a retrospective reader study, there was no statistically significant difference (P = 0.19) between five board-certified breast radiologists and the DL system (mean ΔAUROC, +0.04 in favor of the DL system). Radiologists' performance improved when their predictions were averaged with DL's predictions [mean ΔAUPRC (area under the precision-recall curve), +0.07]. We demonstrated the generalizability of the DL system using multiple datasets from Poland and the United States. An additional reader study on a Polish dataset showed that the DL system was as robust to distribution shift as radiologists. In subgroup analysis, we observed consistent results across different cancer subtypes and patient demographics. Using decision curve analysis, we showed that the DL system can reduce unnecessary biopsies in the range of clinically relevant risk thresholds. This would lead to avoiding biopsies yielding benign results in up to 20% of all patients with BI-RADS category 4 lesions. Last, we performed an error analysis, investigating situations where DL predictions were mostly incorrect. This exploratory work creates a foundation for deployment and prospective analysis of DL-based models for breast MRI.
PMID: 36170446
ISSN: 1946-6242
CID: 5334352
Estimation of the capillary level input function for dynamic contrast-enhanced MRI of the breast using a deep learning approach
Bae, Jonghyun; Huang, Zhengnan; Knoll, Florian; Geras, Krzysztof; Pandit Sood, Terlika; Feng, Li; Heacock, Laura; Moy, Linda; Kim, Sungheon Gene
PURPOSE/OBJECTIVE:To develop a deep learning approach to estimate the local capillary-level input function (CIF) for pharmacokinetic model analysis of DCE-MRI. METHODS:A deep convolutional network was trained with numerically simulated data to estimate the CIF. The trained network was tested using simulated lesion data and used to estimate voxel-wise CIF for pharmacokinetic model analysis of breast DCE-MRI data using an abbreviated protocol from women with malignant (n = 25) and benign (n = 28) lesions. The estimated parameters were used to build a logistic regression model to detect the malignancy. RESULT/RESULTS:The pharmacokinetic parameters estimated using the network-predicted CIF from our breast DCE data showed significant differences between the malignant and benign groups for all parameters. Testing the diagnostic performance with the estimated parameters, the conventional approach with arterial input function (AIF) showed an area under the curve (AUC) between 0.76 and 0.87, and the proposed approach with CIF demonstrated similar performance with an AUC between 0.79 and 0.81. CONCLUSION/CONCLUSIONS:This study shows the feasibility of estimating voxel-wise CIF using a deep neural network. The proposed approach could eliminate the need to measure AIF manually without compromising the diagnostic performance to detect the malignancy in the clinical setting.
PMID: 35001423
ISSN: 1522-2594
CID: 5118282
Differences between human and machine perception in medical diagnosis
Makino, Taro; Jastrzębski, Stanisław; Oleszkiewicz, Witold; Chacko, Celin; Ehrenpreis, Robin; Samreen, Naziya; Chhor, Chloe; Kim, Eric; Lee, Jiyon; Pysarenko, Kristine; Reig, Beatriu; Toth, Hildegard; Awal, Divya; Du, Linda; Kim, Alice; Park, James; Sodickson, Daniel K; Heacock, Laura; Moy, Linda; Cho, Kyunghyun; Geras, Krzysztof J
Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since they can fail for reasons unrelated to underlying pathology. Humans are less likely to make such superficial mistakes, since they use features that are grounded on medical science. It is therefore important to know whether DNNs use different features than humans. Towards this end, we propose a framework for comparing human and machine perception in medical diagnosis. We frame the comparison in terms of perturbation robustness, and mitigate Simpson's paradox by performing a subgroup analysis. The framework is demonstrated with a case study in breast cancer screening, where we separately analyze microcalcifications and soft tissue lesions. While it is inconclusive whether humans and DNNs use different features to detect microcalcifications, we find that for soft tissue lesions, DNNs rely on high frequency components ignored by radiologists. Moreover, these features are located outside of the region of the images found most suspicious by radiologists. This difference between humans and machines was only visible through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into the comparison.
PMCID:9046399
PMID: 35477730
ISSN: 2045-2322
CID: 5205672
Advances in Abbreviated Breast MRI and Ultrafast Imaging
Patel, Shalin; Heacock, Laura; Gao, Yiming; Elias, Kristin; Moy, Linda; Heller, Samantha
Abbreviated breast MRI is an emerging technique that is being incorporated into clinical practice for breast cancer imaging and screening. Conventional breast MRI includes barriers such as high examination cost and lengthy examination times which make its use in the screening setting challenging. Abbreviated MRI aims to address these pitfalls by reducing overall examination time and increasing accessibility to MRI while preserving diagnostic accuracy. Sequences selected for abbreviated MRI protocols allow for preserved accuracy in breast cancer detection and characterization. Novel techniques such as ultrafast imaging are being used to provide kinetic information from early post-contrast imaging.
PMID: 35523528
ISSN: 1558-4658
CID: 5213942
Biomarkers, Prognosis, and Prediction Factors
Chapter by: Reig, Beatriu; Moy, Linda; Sigmund, Eric E.; Heacock, Laura
in: Diffusion MRI of the Breast by
[S.l.] : Elsevier, 2022
pp. 49-70
ISBN: 9780323811026
CID: 5445962
Reducing False-Positive Biopsies using Deep Neural Networks that Utilize both Local and Global Image Context of Screening Mammograms
Wu, Nan; Huang, Zhe; Shen, Yiqiu; Park, Jungkyu; Phang, Jason; Makino, Taro; Gene Kim, S; Cho, Kyunghyun; Heacock, Laura; Moy, Linda; Geras, Krzysztof J
Breast cancer is the most common cancer in women, and hundreds of thousands of unnecessary biopsies are done around the world at a tremendous cost. It is crucial to reduce the rate of biopsies that turn out to be benign tissue. In this study, we build deep neural networks (DNNs) to classify biopsied lesions as being either malignant or benign, with the goal of using these networks as second readers serving radiologists to further reduce the number of false-positive findings. We enhance the performance of DNNs that are trained to learn from small image patches by integrating global context provided in the form of saliency maps learned from the entire image into their reasoning, similar to how radiologists consider global context when evaluating areas of interest. Our experiments are conducted on a dataset of 229,426 screening mammography examinations from 141,473 patients. We achieve an AUC of 0.8 on a test set consisting of 464 benign and 136 malignant lesions.
PMID: 34731338
ISSN: 1618-727x
CID: 5038152