Protocol for a systematic review on the methodological and reporting quality of prediction model studies using machine learning techniques
Andaur Navarro CL., Damen JAAG., Takada T., Nijman SWJ., Dhiman P., Ma J., Collins GS., Bajpai R., Riley RD., Moons KGM., Hooft L.
<jats:sec><jats:title>Introduction</jats:title><jats:p>Studies addressing the development and/or validation of diagnostic and prognostic prediction models are abundant in most clinical domains. Systematic reviews have shown that the methodological and reporting quality of prediction model studies is suboptimal. Due to the increasing availability of larger, routinely collected and complex medical data, and the rising application of Artificial Intelligence (AI) or machine learning (ML) techniques, the number of prediction model studies is expected to increase even further. Prediction models developed using AI or ML techniques are often labelled as a ‘black box’ and little is known about their methodological and reporting quality. Therefore, this comprehensive systematic review aims to evaluate the reporting quality, the methodological conduct, and the risk of bias of prediction model studies that applied ML techniques for model development and/or validation.</jats:p></jats:sec><jats:sec><jats:title>Methods and analysis</jats:title><jats:p>A search will be performed in PubMed to identify studies developing and/or validating prediction models using any ML methodology and across all medical fields. Studies will be included if they were published between January 2018 and December 2019, predict patient-related outcomes, use any study design or data source, and available in English. Screening of search results and data extraction from included articles will be performed by two independent reviewers. The primary outcomes of this systematic review are: (1) the adherence of ML-based prediction model studies to the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD), and (2) the risk of bias in such studies as assessed using the Prediction model Risk Of Bias ASsessment Tool (PROBAST). A narrative synthesis will be conducted for all included studies. Findings will be stratified by study type, medical field and prevalent ML methods, and will inform necessary extensions or updates of TRIPOD and PROBAST to better address prediction model studies that used AI or ML techniques.</jats:p></jats:sec><jats:sec><jats:title>Ethics and dissemination</jats:title><jats:p>Ethical approval is not required for this study because only available published data will be analysed. Findings will be disseminated through peer-reviewed publications and scientific conferences.</jats:p></jats:sec><jats:sec><jats:title>Systematic review registration</jats:title><jats:p>PROSPERO, CRD42019161764.</jats:p></jats:sec>