Research groups
Jie Ma
BSc, MSc
Medical Statistician
Statistician interested in how prognosis research is conducted
My research interests are focused on methodological aspects of developing and validating multivariable prediction models. Risk prediction models can predict how a person’s disease is likely to develop (their prognosis) or their probability of developing a disease in the future, using their current characteristics like weight and blood pressure. I use applied statistics, systematic review and simulation techniques to investigate how prediction models are usually developed and validated, and how we can improve this methodology for more useful, accurate models.
Prediction models are developed using information from a group of patients, selecting the important characteristics and working out how they are associated with the outcomes of interest. After the model has been developed, it is validated using information from another, similar group of patients, to check that it works for all relevant patients. I am using an independent external validation dataset to investigate how a risk model’s performance is affected by missing data in, and the sample size of, its development dataset. I am also looking at how missing data should be dealt with when developing and validating a model.
I am also interested in how well prediction model studies are reported in the scientific literature. When important details are left out of a research article, we cannot judge how well the study was carried out and how useful the resulting model is. The TRIPOD reporting guideline was developed to help researchers fully report their prediction modelling studies. We are investigating how well prediction model studies in diabetes follow TRIPOD, to give us a baseline assessment of the quality of reporting in this area.
Most recently, I have been looking at machine learning approaches and compare them to more traditional approaches for prediction.
I graduated with a BSc in Mathematics from the Royal Holloway University of London in 2013, followed by an MSc in Statistics (Medical) from UCL in 2014. My MSc thesis is entitled “How should we choose the shrinkage parameter when using penalized regression to develop risk models”. I joined the Centre for Statistics in Medicine in 2015.
Recent publications
Peer review of prediction model studies in oncology needs improvement: A systematic review of open peer review reports from BMC journals.
Journal article
Ma J. et al, (2025), J Clin Epidemiol, 188
Peer review reports of randomized controlled trials in oncology can be short and superficial.
Journal article
Logullo P. et al, (2025), J Clin Epidemiol, 185
Development of a multicentre cohort study to understand the role of MRI and ultrasound in the diagnosis of acute haematogenous bone and joint infection in children (the PIC Bone study) : a study protocol.
Journal article
Nogaro M-C. et al, (2025), Bone Jt Open, 6, 677 - 684
Poor handling of continuous predictors in clinical prediction models using logistic regression: a systematic review.
Journal article
Ma J. et al, (2023), J Clin Epidemiol, 161, 140 - 151
Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review.
Journal article
Dhiman P. et al, (2023), BMC Med Res Methodol, 23