Jie Ma

BSc, MSc

Medical Statistician

Statistician interested in how prognosis research is conducted

My research interests are focused on methodological aspects of developing and validating multivariable prediction models. Risk prediction models can predict how a person’s disease is likely to develop (their prognosis) or their probability of developing a disease in the future, using their current characteristics like weight and blood pressure. I use applied statistics, systematic review and simulation techniques to investigate how prediction models are usually developed and validated, and how we can improve this methodology for more useful, accurate models.

Prediction models are developed using information from a group of patients, selecting the important characteristics and working out how they are associated with the outcomes of interest. After the model has been developed, it is validated using information from another, similar group of patients, to check that it works for all relevant patients. I am using an independent external validation dataset to investigate how a risk model’s performance is affected by missing data in, and the sample size of, its development dataset. I am also looking at how missing data should be dealt with when developing and validating a model.

I am also interested in how well prediction model studies are reported in the scientific literature. When important details are left out of a research article, we cannot judge how well the study was carried out and how useful the resulting model is. The TRIPOD reporting guideline was developed to help researchers fully report their prediction modelling studies. We are investigating how well prediction model studies in diabetes follow TRIPOD, to give us a baseline assessment of the quality of reporting in this area.

Most recently, I have been looking at machine learning approaches and compare them to more traditional approaches for prediction.

I graduated with a BSc in Mathematics from the Royal Holloway University of London in 2013, followed by an MSc in Statistics (Medical) from UCL in 2014. My MSc thesis is entitled “How should we choose the shrinkage parameter when using penalized regression to develop risk models”. I joined the Centre for Statistics in Medicine in 2015.

Recent publications

Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review.
Journal article

Dhiman P. et al, (2023), Bmc med res methodol, 23
Poor handling of continuous predictors in clinical prediction models using logistic regression: a systematic review.
Journal article

Ma J. et al, (2023), J clin epidemiol
Systematic review highlights high risk of bias of clinical prediction models for blood transfusion in patients undergoing elective surgery.
Journal article

Dhiman P. et al, (2023), J clin epidemiol, 159, 10 - 30
Systematic review finds "Spin" practices and poor reporting standards in studies on machine learning-based prediction models.
Journal article

Andaur Navarro CL. et al, (2023), J clin epidemiol
Systematic review identifies the design and methodological conduct of studies on machine learning-based prediction models.
Journal article

Andaur Navarro CL. et al, (2022), J clin epidemiol

More publications

Cookies on this website

Contact information

Research groups

Jie Ma

Recent publications