Semi-supervised learning with natural language processing for right ventricle classification in echocardiography-a scalable approach.

Hagberg E, Hagerman D, Johansson R, Hosseini N, Liu J, Björnsson E, Alvén J, Hjelmgren O

Comput Biol Med 143 (-) 105282 [2022-02-15; online 2022-02-15]

We created a deep learning model, trained on text classified by natural language processing (NLP), to assess right ventricular (RV) size and function from echocardiographic images. We included 12,684 examinations with corresponding written reports for text classification. After manual annotation of 1489 reports, we trained an NLP model to classify the remaining 10,651 reports. A view classifier was developed to select the 4-chamber or RV-focused view from an echocardiographic examination (n = 539). The final models were two image classification models trained on the predicted labels from the combined manual annotation and NLP models and the corresponding echocardiographic view to assess RV function (training set n = 11,008) and size (training set n = 9951. The text classifier identified impaired RV function with 99% sensitivity and 98% specificity and RV enlargement with 98% sensitivity and 98% specificity. The view classification model identified the 4-chamber view with 92% accuracy and the RV-focused view with 73% accuracy. The image classification models identified impaired RV function with 93% sensitivity and 72% specificity and an enlarged RV with 80% sensitivity and 85% specificity; agreement with the written reports was substantial (both κ = 0.65). Our findings show that models for automatic image assessment can be trained to classify RV size and function by using model-annotated data from written echocardiography reports. This pipeline for auto-annotation of the echocardiographic images, using a NLP model with medical reports as input, can be used to train an image-assessment model without manual annotation of images and enables fast and inexpensive expansion of the training dataset when needed.

AIDA Data Hub [Service]

Bioinformatics (NBIS) [Service]

PubMed 35220074

DOI 10.1016/j.compbiomed.2022.105282

Crossref 10.1016/j.compbiomed.2022.105282

pii: S0010-4825(22)00074-9