Neural Regeneration Research ›› 2018, Vol. 13 ›› Issue (6): 962-970.doi: 10.4103/1673-5374.233433

Previous Articles     Next Articles

How random is the random forest ? Random forest algorithm on the service of structural imaging biomarkers for Alzheimer’s disease: from Alzheimer’s disease neuroimaging initiative (ADNI) database

Stavros I. Dimitriadis1, 2, 3, 4, 5, 6, Dimitris Liparas7, 8, for the Alzheimer’s Disease Neuroimaging Initiative   

  1. 1 Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
    2 Cardiff University Brain Research Imaging Centre, School of Psychology, Cardiff University, Cardiff, UK
    3 School of Psychology, Cardiff University, Cardiff, UK
    4 Neuroinformatics Group, Cardiff University Brain Research Imaging Centre, School of Psychology, Cardiff University, Cardiff, UK
    5 Neuroscience and Mental Health Research Institute, Cardiff University, Cardiff, UK
    6 MRC Centre for Neuropsychiatric Genetics and Genomics, School of Medicine, Cardiff University, Cardiff, UK
    7 High Performance Computing Center Stuttgart (HLRS), University of Stuttgart, Stuttgart, Germany
    8 Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki, Greece
  • Received:2018-04-10 Online:2018-06-15 Published:2018-06-15
  • Contact: Stavros I. Dimitriadis,DimitriadisS@cardiff.ac.uk or stidimitriadis@gmail.com or dimitrios.liparas@gmail.com.
  • Supported by:

    This work was supported by Medical Research Council (MRC) grant MR/K004360/1 to SID, and MARIE CURIE COFUND EU-UK Research Fellowship to SID.

Abstract:

Neuroinformatics is a fascinating research field that applies computational models and analytical tools to high dimensional experimental neuroscience data for a better understanding of how the brain functions or dysfunctions in brain diseases. Neuroinformaticians work in the intersection of neuroscience and informatics supporting the integration of various sub-disciplines (behavioural neuroscience, genetics, cognitive psychology, etc.) working on brain research. Neuroinformaticians are the pathway of information exchange between informaticians and clinicians for a better understanding of the outcome of computational models and the clinical interpretation of the analysis. Machine learning is one of the most significant computational developments in the last decade giving tools to neuroinformaticians and finally to radiologists and clinicians for an automatic and early diagnosis-prognosis of a brain disease. Random forest (RF) algorithm has been successfully applied to high-dimensional neuroimaging data for feature reduction and also has been applied to classify the clinical label of a subject using single or multi-modal neuroimaging datasets. Our aim was to review the studies where RF was applied to correctly predict the Alzheimer’s disease (AD), the conversion from mild cognitive impairment (MCI) and its robustness to overfitting, outliers and handling of non-linear data. Finally, we described our RF-based model that gave us the 1st position in an international challenge for automated prediction of MCI from MRI data.

Key words: random forest, Alzheimer’s disease, mild cognitive impairment, neuroimaging, classification, machine learning, biomarker, magnetic resonance imaging