TY - JOUR
T1 - Not all roads lead to the immune system
T2 - the genetic basis of multiple sclerosis severity
AU - Jokubaitis, Vilija G.
AU - Campagna, Maria Pia
AU - Ibrahim, Omar
AU - Stankovich, Jim
AU - Kleinova, Pavlina
AU - Matesanz, Fuencisla
AU - Hui, Daniel
AU - Eichau, Sara
AU - Slee, Mark
AU - Lechner-Scott, Jeannette
AU - Lea, Rodney A.
AU - Kilpatrick, Trevor John
AU - Kalincik, Tomas
AU - De Jager, Philip Laurence
AU - Beecham, Ashley H
AU - McCauley, Jacob L
AU - Taylor, Bruce V.M.
AU - Vucic, Steve
AU - Laverick, Louise
AU - Vodehnalova, Karolina
AU - García-Sanchéz, Maria-Isabel
AU - Alcina, Antonio
AU - Van Der Walt, Anneke
AU - Havrdova, Eva Kubala
AU - Izquierdo, Guillermo
AU - Patsopoulos, Nikolaos A
AU - Horáková, Dana
AU - Butzkueven, Helmut
N1 - Funding Information:
This work was supported by a Research Fellowship awarded to V.G.J. from Multiple Sclerosis Research Australia (16–0206), and research grant support from the Royal Melbourne Hospital (MH2013–055), Charity Works for MS (2012 Project grant), the MSBase Foundation and Monash University.
Funding Information:
We thank all the people with multiple sclerosis who participated in this research without whom this work would not be possible. We would also like to acknowledge the patients and the Biobank Nodo Hospital Virgen Macarena (Biobanco del Sistema Sanitario Público de Andalucía) integrated in the Spanish National biobanks Network (PT20/00069) supported by ISCIII and FEDER funds, for their collaboration in this work. The authors would like to acknowledge research nurses Ms Jo Baker, Ms Jodi Haartsen, Ms Sandra Williams, Ms Lisa Taylor for assisting with blood collection for this study and Ms Malgorzata Krupa for research assistance. Further, the authors acknowledge the Center for Genome Technology within the University of Miami John P. Hussman Institute for Human Genomics for generating all the MegaEx array genotype data for this project and specifically Anna Konidari for overseeing the genotyping efforts and assistance with the Illumina custom design process. E.H., D.H. and P.K. are supported by the Czech Ministry of Education, project PROGRES Q27/LF1. F.M. and A.A. receive support from the Agencia Española de Investigación (AEI)-Fondos Europeos de Desarrollo Regional (FEDER) (PID2019–110487R-C21) and Junta de Andalucía (P18-RT-2623).
Publisher Copyright:
© 2023 Oxford University Press. All rights reserved.
PY - 2023/6
Y1 - 2023/6
N2 - Multiple sclerosis is a leading cause of neurological disability in adults. Heterogeneity in multiple sclerosis clinical presentation has posed a major challenge for identifying genetic variants associated with disease outcomes. To overcome this challenge, we used prospectively ascertained clinical outcomes data from the largest international multiple sclerosis registry, MSBase. We assembled a cohort of deeply phenotyped individuals of European ancestry with relapse-onset multiple sclerosis. We used unbiased genome-wide association study and machine learning approaches to assess the genetic contribution to longitudinally defined multiple sclerosis severity phenotypes in 1813 individuals. Our primary analyses did not identify any genetic variants of moderate to large effect sizes that met genome-wide significance thresholds. The strongest signal was associated with rs7289446 (β = −0.4882, P = 2.73 × 10−7), intronic to SEZ6L on chromosome 22. However, we demonstrate that clinical outcomes in relapse-onset multiple sclerosis are associated with multiple genetic loci of small effect sizes. Using a machine learning approach incorporating over 62 000 variants together with clinical and demographic variables available at multiple sclerosis disease onset, we could predict severity with an area under the receiver operator curve of 0.84 (95% CI 0.79–0.88). Our machine learning algorithm achieved positive predictive value for outcome assignation of 80% and negative predictive value of 88%. This outperformed our machine learning algorithm that contained clinical and demographic variables alone (area under the receiver operator curve 0.54, 95% CI 0.48–0.60). Secondary, sex-stratified analyses identified two genetic loci that met genome-wide significance thresholds. One in females (rs10967273; βfemale = 0.8289, P = 3.52 × 10−8), the other in males (rs698805; βmale = −1.5395, P = 4.35 × 10−8), providing some evidence for sex dimorphism in multiple sclerosis severity. Tissue enrichment and pathway analyses identified an overrepresentation of genes expressed in CNS compartments generally, and specifically in the cerebellum (P = 0.023). These involved mitochondrial function, synaptic plasticity, oligodendroglial biology, cellular senescence, calcium and G-protein receptor signalling pathways. We further identified six variants with strong evidence for regulating clinical outcomes, the strongest signal again intronic to SEZ6L (adjusted hazard ratio 0.72, P = 4.85 × 10−4). Here we report a milestone in our progress towards understanding the clinical heterogeneity of multiple sclerosis outcomes, implicating functionally distinct mechanisms to multiple sclerosis risk. Importantly, we demonstrate that machine learning using common single nucleotide variant clusters, together with clinical variables readily available at diagnosis can improve prognostic capabilities at diagnosis, and with further validation has the potential to translate to meaningful clinical practice change.
AB - Multiple sclerosis is a leading cause of neurological disability in adults. Heterogeneity in multiple sclerosis clinical presentation has posed a major challenge for identifying genetic variants associated with disease outcomes. To overcome this challenge, we used prospectively ascertained clinical outcomes data from the largest international multiple sclerosis registry, MSBase. We assembled a cohort of deeply phenotyped individuals of European ancestry with relapse-onset multiple sclerosis. We used unbiased genome-wide association study and machine learning approaches to assess the genetic contribution to longitudinally defined multiple sclerosis severity phenotypes in 1813 individuals. Our primary analyses did not identify any genetic variants of moderate to large effect sizes that met genome-wide significance thresholds. The strongest signal was associated with rs7289446 (β = −0.4882, P = 2.73 × 10−7), intronic to SEZ6L on chromosome 22. However, we demonstrate that clinical outcomes in relapse-onset multiple sclerosis are associated with multiple genetic loci of small effect sizes. Using a machine learning approach incorporating over 62 000 variants together with clinical and demographic variables available at multiple sclerosis disease onset, we could predict severity with an area under the receiver operator curve of 0.84 (95% CI 0.79–0.88). Our machine learning algorithm achieved positive predictive value for outcome assignation of 80% and negative predictive value of 88%. This outperformed our machine learning algorithm that contained clinical and demographic variables alone (area under the receiver operator curve 0.54, 95% CI 0.48–0.60). Secondary, sex-stratified analyses identified two genetic loci that met genome-wide significance thresholds. One in females (rs10967273; βfemale = 0.8289, P = 3.52 × 10−8), the other in males (rs698805; βmale = −1.5395, P = 4.35 × 10−8), providing some evidence for sex dimorphism in multiple sclerosis severity. Tissue enrichment and pathway analyses identified an overrepresentation of genes expressed in CNS compartments generally, and specifically in the cerebellum (P = 0.023). These involved mitochondrial function, synaptic plasticity, oligodendroglial biology, cellular senescence, calcium and G-protein receptor signalling pathways. We further identified six variants with strong evidence for regulating clinical outcomes, the strongest signal again intronic to SEZ6L (adjusted hazard ratio 0.72, P = 4.85 × 10−4). Here we report a milestone in our progress towards understanding the clinical heterogeneity of multiple sclerosis outcomes, implicating functionally distinct mechanisms to multiple sclerosis risk. Importantly, we demonstrate that machine learning using common single nucleotide variant clusters, together with clinical variables readily available at diagnosis can improve prognostic capabilities at diagnosis, and with further validation has the potential to translate to meaningful clinical practice change.
KW - disease severity
KW - genetics
KW - machine learning
KW - multiple sclerosis
KW - prognostics
UR - http://www.scopus.com/inward/record.url?scp=85161660538&partnerID=8YFLogxK
U2 - 10.1093/brain/awac449
DO - 10.1093/brain/awac449
M3 - Article
C2 - 36448302
AN - SCOPUS:85161660538
SN - 0006-8950
VL - 146
SP - 2316
EP - 2331
JO - Brain
JF - Brain
IS - 6
ER -