Aligning, autoencoding and prompting large language models for novel disease reporting

Liu F.; Wu X.; Huang J.; Yang B.; Branson K.; Schwab P.; Clifton L.; Zhang P.; Luo J.; Zheng Y.; Clifton DA.

Aligning, autoencoding and prompting large language models for novel disease reporting

Liu F., Wu X., Huang J., Yang B., Branson K., Schwab P., Clifton L., Zhang P., Luo J., Zheng Y., Clifton DA.

Given radiology images, automatic radiology report generation aims to produce informative text that reports diseases. It can benefit current clinical practice in diagnostic radiology. Existing methods typically rely on large-scale medical datasets annotated by clinicians to train desirable models. However, for novel diseases, sufficient training data are typically not available. We propose a prompt-based deep learning framework, i.e., PromptLLM, to align, autoencode, and prompt the (large) language model to generate reports for novel diseases accurately and efficiently. Our method includes three major steps: (1) aligning visual images and textual reports to learn general knowledge across modalities from diseases where labeled data are sufficient, (2) autoencoding the LLM using unlabeled data of novel diseases to learn the specific knowledge and writing styles of the novel disease, and (3) prompting the LLM with learned knowledge and writing styles to report the novel diseases contained in the radiology images. Through the above three steps, with limited labels on novel diseases, we show that PromptLLM can rapidly learn the corresponding knowledge for accurate novel disease reporting. The experiments on COVID-19 and diverse thorax diseases show that our approach, utilizing 1% of the training data, achieves desirable performance compared to previous methods. It shows that our approach allows us to relax the reliance on labeled data that is common to existing methods. It could have a real-world impact on data analysis during the early stages of novel diseases. Our code and data are available at https://github.com/ai-in-health/PromptLLM.

Original publication

DOI

10.1109/tpami.2025.3534586

Type

Journal article

Journal

Ieee transactions on pattern analysis and machine intelligence

Publisher

IEEE

Publication Date

27/01/2025

Keywords

autoencoders, deep learning, thorax, dictionaries, diseases, COVID-19, radiology, training data, visualization, decoding

Cookies on this website

Aligning, autoencoding and prompting large language models for novel disease reporting

Liu F., Wu X., Huang J., Yang B., Branson K., Schwab P., Clifton L., Zhang P., Luo J., Zheng Y., Clifton DA.

DOI

Type

Journal

Publisher

Publication Date

Keywords