Research | Open | Published:
A data-driven approach links microglia to pathology and prognosis in amyotrophic lateral sclerosis
Acta Neuropathologica Communicationsvolume 5, Article number: 23 (2017)
Amyotrophic lateral sclerosis (ALS) is a devastating neurodegenerative disease that lacks a predictive and broadly applicable biomarker. Continued focus on mutation-specific upstream mechanisms has yet to predict disease progression in the clinic. Utilising cellular pathology common to the majority of ALS patients, we implemented an objective transcriptome-driven approach to develop noninvasive prognostic biomarkers for disease progression. Genes expressed in laser captured motor neurons in direct correlation (Spearman rank correlation, p < 0.01) with counts of neuropathology were developed into co-expression network modules. Screening modules using three gene sets representing rate of disease progression and upstream genetic association with ALS led to the prioritisation of a single module enriched for immune response to motor neuron degeneration. Genes in the network module are important for microglial activation and predict disease progression in genetically heterogeneous ALS cohorts: Expression of three genes in peripheral lymphocytes - LILRA2, ITGB2 and CEBPD – differentiate patients with rapid and slowly progressive disease, suggesting promise as a blood-derived biomarker. TREM2 is a member of the network module and the level of soluble TREM2 protein in cerebrospinal fluid is shown to predict survival when measured in late stage disease (Spearman rank correlation, p = 0.01). Our data-driven systems approach has, for the first time, directly linked microglia to the development of motor neuron pathology. LILRA2, ITGB2 and CEBPD represent peripherally accessible candidate biomarkers and TREM2 provides a broadly applicable therapeutic target for ALS.
Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease without effective treatment or a predictive biomarker . Progressive motor neuron loss leads to a median survival of only 32 months, with death most often the result of respiratory failure . A useful non-invasive biomarker for ALS progression should anticipate disease severity in advance, be broadly applicable independent of genetic background, and should provide a basis for therapeutic intervention. Many biomarkers in development currently are potentially limited because they are phenomenological including electrophysiological , imaging , clinical  and fluid-based  measures. Other upstream markers are specific to relatively rare genetic variants e.g. RNA foci and dipeptide-repeat proteins in C9ORF72-ALS [8, 30].
Data-driven methods employing transcriptomics have successfully identified biomarkers in other clinically and genetically heterogeneous disease states, including breast and gastric cancers, psoriasis and progressive supranuclear palsy [26, 43, 48, 54]. To achieve a similar goal we planned an approach with a discovery phase followed by a biomarker phase. In the discovery phase, we utilised a systematic, data-driven approach to discover and prioritise modules of tightly co-expressed genes relevant to ALS pathogenesis (Fig. 1a–c). In the biomarker assessment phase we tested the capability of top performing modules as a biomarker (s) when measured in accessible tissue (Fig. 1d).
In order to deconstruct central nervous system (CNS) pathophysiology, studies have concentrated on a single dysfunctional feature, an approach that may not yield sufficiently broad insights into global disease mechanisms. An advantage of a global transcriptome-based analysis is the capacity to exhaustively describe a biological system without prior information . Integrating transcriptome profiling with measurement of a disease-specific covariate reveals the contribution of individual genes to pathogenesis . Extending this concept to the level of networks of interacting genes rather than isolated genes provides further physiological insight .
The aim of our approach is to develop non-invasive, broadly applicable prognostic biomarkers for ALS disease progression. Although ALS is markedly heterogeneous both genetically and phenotypically, more than 98% of ALS patients develop p62- and TDP-43-positive neuronal cytoplasmic inclusions within degenerating motor neurons . Post-mortem studies have indicated that the frequency of neuronal TDP-43-positive cytoplasmic aggregates predicts the severity of neurodegeneration in a region-specific manner . We selected motor neuron pathology as a covariate measure of disease severity which could potentially be used to identify important, broadly applicable, transcriptome changes related to outcome.
To maximize signal from the relevant affected system, gene expression profiling was performed on laser captured motor neurons from ALS patients. Genes correlated in their levels of expression with motor neuron pathology were then developed into co-expression network modules which were filtered and prioritised based on independently curated markers of ALS biology: gene sets related to rate of progression and upstream genetic association with ALS. In the discovery phase (Fig. 1a–c) our systematic approach led to identification of two gene modules enriched with ALS biology. Functional enrichment within the top scoring network module revealed genes which encode an immune response to motor neuron pathology; the majority of these genes are expressed by microglia.
Gene expression within the CNS has been observed in peripheral tissues [18, 22] so in the biomarker assessment phase (Fig. 1d) of our analysis, we explored the possibility that our modules, generated from CNS tissue, may include genes with tissue-independent ability to predict disease severity. Components of the immune module were assessed by mRNA and protein quantification in accessible tissues such as blood and cerebrospinal fluid (CSF). We demonstrate candidate biomarkers that provide insight into potential therapeutic targets.
Materials and methods
Laser captured motor neurons
Brain and spinal cord tissue from fourteen ALS patients was obtained from the Sheffield Brain Tissue Bank (Table 1). Seven of these patients carried a hexanucleotide repeat expansion of C9ORF72 and seven patients suffered sporadic ALS with no identified pathogenic mutation. C9ORF72-ALS samples were identified by repeat-primed PCR of the C9ORF72 gene . Common mutations in C9ORF72, TARDBP, FUS, CHMP2B and SOD1 were excluded in the sporadic ALS patients. Tissue donated for research was obtained with written informed consent from the next of kin, and in accordance with the UK Human Tissue Authority guidelines on tissue donation. The work was approved by the South Yorkshire Ethics Committee.
Spinal cord sections from the limb enlargements were collected postmortem, processed according to standard protocols , and stored at −80 °C until required. Cervical spinal cord sections were prepared, between 800 and 1200 motor neurons were isolated, and RNA was extracted using methods described previously . RNA quantity and quality was assessed on the Nanodrop spectrophotometer and Agilent Bioanalyser, respectively, to ensure all samples were of comparable and sufficient quality to proceed. RNA (20–25 ng) was linearly amplified using the Affymetrix Two Cycle cDNA synthesis protocol to produce biotin-labelled copy RNA. Copy RNA (15 μg) was fragmented for 15 min and hybridized to the Human Genome U133 Plus 2.0 GeneChips, according to Affymetrix protocols. Array washing and staining was performed in the GeneChip® fluidics station 400 and arrays were scanned on the GeneChip® 3000 scanner. GeneChip® Operating Software was used to generate signal intensities for each transcript.
Lymphoblastoid cell lines
Lymphoblastoid cell lines derived from 46 Caucasian ALS patients, all of Northern European descent, were obtained from the UK Motor Neurone Disease Association DNA Bank (Table 2). C9ORF72-ALS samples were identified by repeat-primed PCR of the C9ORF72 gene . All samples were collected with written informed consent from the donor, and the work was approved by the South Yorkshire Ethics Committee.
Total RNA was extracted from ALS patient and control-derived lymphoblastoid cell lines using QIAGEN’s RNeasy® Mini Kit following the manufacturer’s recommendations. A 75 μL LCL suspension, containing approximately 5x106 cells, typically yields between 1.9 and 13.6 μg total RNA with a mean concentration of approximately 170 ng/μl as assessed the by the NanoDrop 1000 spectrophotometer (Thermo Scientific). The quality of the isolated material was analysed using the 2100 bioanalyzer with an RNA 6000 Nano LabChip® Kit (Agilent Technologies, Inc.). Linear amplification of RNA with an input of approximately 300 ng of starting material was performed using the Ambion® Whole Transcript (WT) Expression Assay (Applied Biosystems) and Affymetrix GeneChip® WT Terminal Labelling Kit. This procedure generated fragments of biotin-labelled sense-stranded copy DNA (6–10 μg) between 40 and 70 nucleotides in length that were hybridized onto Human Exon 1.0ST GeneChip® Arrays according to Affymetrix protocols. Array washing, staining and visualisation were performed as described for motor neuron derived RNA.
Cervical spinal cord anterior horn was examined from 11 ALS patients including seven C9ORF72-ALS patients and four patients with sporadic ALS (Table 1, patients 1–11). Immunohistochemistry was performed for p62 and phospho-TDP-43.
In staining for p62, slides were first deparaffinised through two changes of xylene and hydrated through decreasing concentrations of alcohol (2×100%/1x95%/1x70%). Antigen retrieval was achieved by boiling the samples in trisodium citrate at pH 6.5, and endogenous peroxidase was blocked in 3% H2O2 in methanol for 20 min. The slides were then stained using the VECTASTAIN Elite ABC Kit (Vector Laboratories, California, US) following these incubation protocol: serum 30 min RT, anti-p62 Ick ligand antibody (Cat. 610832, BD Transduction Laboratories, California, US) 1 h RT, 2° biotinylated antibody 30 min RT, ABC reagent 30 min RT, Vector DAB reagent 10 min, HCM (Harry’s haematoxylin 2 min, Scott’s tap water until blue colour, dehydration and clear through 70%/95%/2x100% ethanol/2× xylene, mount in DPX).
In staining for phospho-TDP-43, deparaffinisation, hydration and antigen retrieval were done in a pressure cooker (Antigen Access Unit, A. Menarini, Berkshire, UK) at pH 6 using the Access Citrate solution. Then, the slides were stained using the A. Menarini Intellipath Kit through the following incubation steps: Endogenous peroxidase block 5 min room temperature (RT), casein background blocker 10 min RT, anti-phospho-TDP-43 antibody (Cat. TIP-PTD-M01, Cosmo Bio Co, Tokyo, Japan) 1 h RT, universal probe 15 min RT, HRP-polymer 15 min RT, DAB chromogen 5 min RT, HCM.
Genome wide association study
ALS susceptibility genes were identified by a large genome wide association study (GWAS) which used the NeuroX chip  to genotype 3539 ALS cases and 5191 normal controls; the NeuroX chip includes genotyping of standard Illumina exome content of approximately 240,000 variants, and additionally, more than 24,000 custom content variants to improve coverage in genes associated with neurological diseases. Genes significantly associated with ALS were unchanged when the analysis was performed with the custom NeuroX chip content removed to avoid potential bias. GWA on the NeuroX collaboration was analysed using PLINK . 267607 SNPs were analysed in 10081 founders (0 non-founders identified). No SNPs failed frequency and genotyping pruning. Association with ALS was determined by Chi2-test; threshold for significance was set at an unadjusted p-value of 5E-08 .
Alzheimer’s GWA genes were identified using GWAS Central (http://www.gwascentral.org), which is a compilation of summary level findings from genetic association studies. 57 studies were identified containing the keyword ‘Alzheimer’s’. Variants associated with Alzeimer’s disease at a p-value <5E-08 and their associated genes were identified.
Gene expression data analysis
Microarray data were normalised using the Puma package which quantifies technical variability to improve the estimation of gene expression . The significance of association of transcript expression levels with continuous variables such as pathology counts and disease duration was determined by Spearman rank correlation. Differential expression between two groups was determined by Mann–Whitney U-test. In the identification of significant enrichment of gene list ‘x’ in gene list ‘y’ we utilised Fisher’s exact test to calculate the probability that the observed overlap occurred by chance.
Conversion between various gene/transcript identifiers was performed using Affymetrix Human Genome U133 Plus 2.0 Array annotation data and BioMart .
Network detection was performed using weighted gene coexpression analysis (WGCNA) . Global interaction partners of network genes were identified using co-expression and proteomics data from the GeneMANIA prediction server .
For the purpose of all analyses in lymphoblastoid cells and in CSF, patients with disease duration <2 years were defined as rapidly progressive and patients with disease duration >4 years were defined as slowly progressive.
Measurement of soluble TREM2 in CSF
CSF concentrations of sTREM2 were measured using a standard sandwich ELISA consisting of a biotinylated polyclonal goat anti-human TREM2 capture antibody (R & D Systems BAF1828); a monoclonal mouse anti-human TREM2 detection antibody (R & D Systems MAB1828); and a SULFO-TAG–labeled anti-mouse secondary antibody (Meso Scale Discovery R32AC-1). Recombinant human TREM2 protein (huTrem2-hIgG1aglyFc) was produced at Biogen in Chinese hamster ovary (CHO) cells and purified by size-exclusion chromatography to remove aggregates since aggregated proteins can lead to higher binding. Streptavidin-coated 96-well plates (Meso Scale Discovery L15SB) were blocked overnight at 4 °C in blocking buffer [0.5% bovine serum albumin (BSA) and 0.05% Tween 20 in PBS (pH 7.4)]. The plates were then incubated with the capture antibody for 1 h at room temperature. Plates were washed three times with washing buffer (0.05% Tween 20 in PBS) and incubated with the CSF samples diluted 1:4 or with a titration of recombinant human TREM2 protein (2000 ng/ml to 0.1 ng/ml) for 2 h at room temperature. Plates were washed three times with washing buffer before incubation with the detection antibody for 1 h at room temperature. After three additional washes, plates were incubated with the secondary antibody for 1 h at room temperature. All incubation steps were performed with gentle shaking. Plates were washed three times with wash buffer, then the electrochemical signal was developed by adding 2× Meso Scale Discovery Read buffer and the light emission measured using the Mesoscale Discovery SECTOR S 600.
All lumbar punctures were clinically indicated. We aimed to compare levels of soluble TREM2 in CSF from sporadic ALS patients to levels in normal controls. Previously it has been noted that levels of soluble TREM2 can be elevated in a number of inflammatory CNS diseases ; therefore our criteria for selection of control cases were: normal CSF constituents and no evidence of neuroinflammation. Diagnoses in control cases included headache with normal CSF, medically unexplained symptoms and cerebrovascular disease. Controls and patients (Table 3) were matched for age and sex; mean age of controls was 51 years (range 34–74 years), mean age of ALS patients was 58 years (range 32–83 years). Controls included 11 males and 9 females; ALS patients included 28 males and 18 females. All samples were collected with written informed consent from the donor, and the work was approved by the South Yorkshire Ethics Committee. Based on the time of sampling relative to disease onset and time of death it was possible to identify at what point in the patients disease course CSF sampling occurred. Early disease was defined as the first 25th-centile of all patients assayed and late disease was defined as the last 25-centile of all patients assayed. In order to minimise the effect of outliers statistical tests were performed using ranks instead of actual values. Correlation with clinical variables was determined by Spearman rank correlation, and differences between groups were determined by Mann–Whitney U-test.
We aimed to identify a set of genes that can predict ALS disease progression when measured in tissues that are core to disease, but also in tissues that are accessible clinically. Our systems approach has two phases of investigation: a discovery phase (Fig. 1a–c) and a subsequent biomarker assessment phase (Fig. 1d).
Identifying correlates of neuropathology
To identify genes expressed in correlation with the number of proteinaceous inclusions within motor neurons (pathology correlates), we performed targeted immunohistochemistry and gene expression profiling in ALS motor neurons (Fig. 1a). Cervical spinal cord anterior horn was examined from 11 ALS patients including seven C9ORF72-ALS patients and four patients with sporadic ALS (Table 1, patients 1–11). Total RNA was extracted from isolated motor neurons and expression of 54,675 annotated transcripts was measured by microarray analysis. In adjacent tissue from the same patients we counted the number of motor neurons per unit area containing a p62-positive cytoplasmic inclusion (Additional file 1: Figure S1). Spearman rank correlation was calculated between the expression of each transcript and the pathology counts. Eighty-three transcripts, corresponding to 59 genes, correlated with the quantity of pathology (p < 0.01) (Additional file 2: Table S1).
Motor neuron inclusions in ALS are expected to stain for TDP-43 and p62 . We confirmed coincidence of p62- and TDP-43-positive staining in cervical cord from the same cases. Presence of p62 and TDP-43 was significantly correlated, despite measurement in non-overlapping tissue sections (Spearman rank correlation, p < 0.05, Additional file 1: Figure S2).
Derivation of gene modules associated with neuropathology
To explore their functional context, each of the 83 pathology-correlated transcripts were used as a seed to identify transcripts with similar expression (top 1% of transcripts by Pearson’s correlation, Fig. 1b). Transcripts were combined into a single network, which was divided into modules of highly correlated genes by weighted gene co-expression analysis (WGCNA) [25, 55]. WGCNA is an established method for module detection within gene co-expression networks and has been previously applied in biomarker development . WGCNA analysis revealed 82 network modules of between 35 and 515 transcripts (Additional file 1: Figure S3, Additional file 2: Table S1). Forty-five network modules contained one or more of the 83 pathology-correlate transcripts. The remaining modules were discarded.
Systematic prioritisation of gene modules based on enrichment with ALS biology
To identify which of the 45 gene modules have potential as biomarkers we needed an independent and systematic test of their relevance to ALS biology. For this purpose three assessment gene sets were curated to represent rate of progression and upstream genetic association with ALS (Fig. 1c). A motor neuron gene set was generated from laser captured motor neurons (Table 1, patients 1–14); motor neurons were obtained from a set of patients which overlapped but was distinct from the data used to derive the 83 pathology-correlated transcripts. It contained 1705 transcripts significantly correlated with disease duration (p < 0.05) (Additional file 2: Table S2). A lymphoblastoid cell gene set was generated from peripherally accessible blood-derived lymphoblastoid cells (Table 2 patients 1–26). It contained 4070 transcripts differentially expressed between patients with rapidly and slowly progressive disease (p < 0.05). (Additional file 2: Table S3).
The third assessment set was a genome wide association gene set which consisted of 62 genes containing variants (unadjusted asymptotic association test, p < 5E-08) associated with ALS in a genome wide association study (GWAS) of 3539 ALS cases and 5191 normal controls (Additional file 2: Table S4). Genetic variants associated with ALS are by definition upstream. Modules enriched with genetic determinants are more likely to be predictive as genetic determinants are by definition upstream of a disease that occurs in adulthood.
Three of the 45 modules were enriched with all three assessment gene sets (Fig. 2, modules 1, 25 and 27). Module 25 was the most highly enriched with the motor neuron gene set (p = 1.92E-20) and module 27 was the most highly enriched with the lymphoblastoid gene set (p = 7.82E-21). Both module 25 and 27 were significantly enriched with ALS GWA genes (p value < 0.05). Module 25 and 27 were selected for further characterisation.
To determine whether modules 25 and 27 captured aspects of disease pathogenesis or simply show motor neuron-specific gene expression, we constructed a negative control from an artificial module consisting of genes which are expressed specifically in non-diseased motor neurons  (Additional file 2: Table S1). The negative control module was not enriched with the assessment gene set derived from motor neurons or with ALS GWA genes; there was limited enrichment with the lymphoblastoid gene set (p = 0.001). Modules 25 and 27 showed significantly better enrichment with ALS biology gene sets than the negative control module.
Functional characterisation of modules
As modules 25 and 27 showed significant enrichment with ALS biology gene sets we sought to determine the function of genes in these modules (Fig. 1c). Module 25 enriched for the Gene Ontology (GO) biological process categories ‘synaptic transmission, cholinergic’ and ‘response to oxygen stimulus’ (g: Profiler, corrected p-value = 0.04) . Module 27 enriched for the GO category ‘immune system process’ (corrected p-value = 7.32E-07). Module 27 will henceforth be referred to as the immune module.
Enrichment within module 27 of immune-associated genes suggests that glial cells proximal to diseased motor neurons may have been laser captured alongside extracted motor neurons. To explore which glial cells might have been isolated we examined cell-specific expression of module 27 genes using a reference database of transcriptome data from isolated human brain cell lines (http://web.stanford.edu/group/barres_lab/brainseqMariko/brainseq2.html). Genes within the immune module were classified as expressed in one or more of microglia/macrophage cells, astrocytes and oligodendrocytes (Additional file 2: Table S5). 61% of the genes in the immune module are known to be expressed in human microglia/macrophage cells as compared to 30% in mature astrocytes and 17% in oligodendrocytes. The majority of immune module genes are expressed in microglia/macrophage cells rather than alternative glial subtypes.
Development of immune module (module 27) into a peripheral tissue biomarker
A clinically translatable biomarker needs to be measurable in accessible tissue. Markers of inflammation associated with neurodegeneration have been observed in blood  and CSF . Module 27 (the immune module) was highly enriched with the assessment set containing genes associated with disease progression in lymphoblastoid cells (p = 7.82E-21). We chose to focus on the immune module for biomarker development.
To reduce neuron-specific signal and improve likelihood of detecting genes expressed in the immune module in peripheral tissues, we added tissue-independent globally co-expressed genes and protein-protein interacting partners  using a database of broadly co-expressed genes with functional association data (GeneMANIA)  (Fig. 1d). The immune module was expanded from 65 to 77 genes (Fig. 3). We tested the expanded module against the assessment gene sets representing rate of ALS disease progression and showed an improvement in biomarker performance. The module showed improved enrichment with gene sets related to rate of disease progression in motor neurons gene (p = 6.14E-03 from 4.56E-02), and in lymphoblastoid cells (p = 1.94E-32 from 7.82E-21).
Assessment of immune module as a potential biomarker in blood
To provide evidence in support of the immune module as a potential biomarker, we first explored its predictive capabilities in lymphoblastoid cells derived from the blood of C9ORF72- and sporadic ALS patients with rapid and slowly progressive disease (Fig. 1d, Table 2). C9ORF72-ALS patient samples were used at a previous stage to prioritise the immune module but the sporadic ALS patients comprise an entirely independent dataset. By testing for biomarker performance of the immune module in both datasets separately we aimed to reduce the likelihood of a false positive result.
First we evaluated whether gene expression in the immune module could predict ALS severity as indicated by the time between onset of symptoms and death. Age of onset and sex have been independently linked to prognosis in ALS . Clinical interventions such as artificial respiratory support have also been shown to affect survival but this data was not available. We fitted a Cox proportional hazards model including age of symptom onset, sex and disease duration (to nearest half-year, Additional file 1: Figure S4) together with the top 15 principal components of gene expression in the immune module. In both C9ORF72 and sporadic ALS, the model was significantly predictive of disease severity (Chi2; C9ORF72-ALS p = 0.01; sporadic ALS p = 0.004). To further test the significance of this result we performed an identical analysis using the negative control module representing genes specifically expressed in non-diseased motor neurons. The top 15 principal components of gene expression in the control module were not significantly predictive in either dataset (Chi2, p > 0.1).
Next, to determine if the module could be useful to support personalised treatment based on classification, we asked whether gene expression in the immune module could effectively classify patients with rapid versus slowly progressing disease. Binomial logistic regression on expression of individual genes within the immune module identified those genes which differentiated lymphoblastoid cells from patients with rapid and slowly progressive disease compared to the null model. Fifteen of the immune module genes differentiated rapid and slowly progressive C9ORF72-ALS cases; and in sporadic ALS, 20 genes differentiated rapid and slowly progressive cases (Additional file 2: Table S6). LILRA2, ITGB2 and CEBPD (Fig. 3) were predictive in both C9ORF72-ALS and sporadic ALS. Fitting binomial logistic regression with leave-one-out cross validation confirmed that a model combining expression of LILRA2, ITGB2 and CEBPD was able to correctly classify patients by disease severity more often than would be expected by chance (85% of C9ORF72 and 60% of sporadic ALS classified correctly, Additional file 1: Figure S4). Interestingly LILRA2, ITGB2 and CEBPD are expressed by microglia/macrophage cells (Additional file 2: Table S5).
Assessment of immune module as a potential biomarker in CSF
CSF is frequently used to observe CNS-inflammation . We wished to determine if members of the immune module may have potential as a biomarker in CSF. CSF is relatively acellular and therefore suited to a protein-level rather than gene expression quantification. It was not technically feasible to assess all members of the immune module. TREM2, a member of the immune module (Fig. 3), had an available assay and known association with neurodegeneration [20, 34, 36, 47]. We chose to evaluate soluble TREM2 in CSF as a potential biomarker for ALS (Fig. 1d). Concentrations of soluble TREM2, which is cleaved from the surface of microglia , have been measured by ELISA in CSF [24, 34]. Genes thought to determine levels of soluble TREM2 in CSF identified by genome-wide complex trait analysis  (Additional file 2: Table S7), are enriched in the immune module (Fisher’s exact test, p = 0.04).
Levels of soluble TREM2 were measured in CSF from sporadic ALS patients with varying disease severity (n = 46) and controls with normal CSF constituents (n = 20) (Table 3). The effectiveness of TREM2 as a biomarker was investigated in two ways; first, we examined whether levels of soluble TREM2 are altered in ALS in comparison to healthy controls, and second, we tested whether soluble TREM2 can classify rapid and slowly progressive ALS. Levels of soluble TREM2 were significantly higher in CSF from ALS patients compared to controls (mean of 18 ng/ml compared to mean of 7 ng/ml, Mann–Whitney p = 0.04, Fig. 4a). Levels of measured soluble TREM2 in controls are comparable to other studies [36, 47].
TREM2 has been implicated in stimulation of microglia to clear Alzheimer’s-associated protein aggregates . We tested for enrichment of Alzheimer’s disease GWA genes (Additional file 2: Table S8) within the immune module and found that it is highly enriched (Fisher’s exact test, p = 1.83E-07). From this we postulate that the immune module captures a molecular response to neuropathology not just in ALS, but in neurodegeneration more broadly.
In Alzheimer’s disease levels of soluble TREM2 are higher in early phase disease [46, 47]. The same is true in ALS: mean soluble TREM2 levels are three-times higher in early disease compared to late stage disease (mean soluble TREM2 in early disease = 36 ng/ml, mean soluble TREM2 in late disease = 13 ng/ml, Fig. 4b). Strikingly, in late stage disease levels of soluble TREM2 show a significant positive correlation with disease duration (Spearman rank correlation, p = 0.01, Fig. 4c). In early disease there is not a significant correlation. Early elevation of TREM2 expression may reflect an initial immune response to deposition of pathological aggregates which declines over time; higher levels of TREM2 in late disease may reflect a sustained neuroprotective microglial response (Fig. 4d).
Our analysis consisted of a data-driven systematic discovery phase leading to discovery of gene modules which were further evaluated in a biomarker assessment phase. In the discovery phase (Fig. 1a–c), transcriptome-wide gene expression changes in proportion to the development of cytoplasmic proteinaceous inclusions in ALS motor neurons allowed us to discover molecular determinants of disease severity. Gene expression and pathology counts were carried out in the same cell population to avoid confounding by variation between populations. The extent of pathology varies between neuronal populations even within individual patients . Transcripts found to be expressed in proportion to the development of neuropathology were utilised to produce 45 modules of co-expressed genes. In a systematic filtering process these modules were then prioritised by demonstration of enrichment with independent measures of ALS biology. We discovered two gene modules strikingly enriched with gene sets associated with rate of ALS progression in both motor neurons and lymphoblastoid cells, and also with ALS GWA genes.
In the biomarker assessment phase (Fig. 1d) we selected one of the top scoring modules which showed the highest enrichment with rate of progression genes in lymphoblastoid cells, and was enriched with genes associated with immune function. The majority of genes within this module are expressed in microglia as opposed to other glial subtypes. Microglia are crucial for clearance of protein aggregates [16, 51] which is biologically consistent with our focus on motor neuron pathology. Many genes within the immune module have not been previously implicated in ALS, however others have highlighted the role of neuroinflammation and microglial activation in disease progression [10, 44, 45] making this module a good candidate for further investigation. Given that CNS immune function can be observed peripherally [18, 31], we tested the potential of this module to be a prognostic biomarker in peripherally accessible tissue.
In tissue derived from patient blood, we demonstrated that expression of the immune module as a whole was significantly associated with ALS disease duration. Moreover, a three-gene panel comprising LILRA2, ITGB2 and CEBPD was found to correctly classify individuals as suffering from rapid or slowly progressive disease, independent of both genetic background and clinical intervention such as respiratory support. Measurement in a relatively small number of patients relying on microarray technology is a limitation of these data but a larger biomarker validation study is beyond the scope of this study.
CSF is also peripherally accessible. TREM2 is a member of the immune module which has been previously linked to both ALS pathogenesis  and microglial activation . We investigated the potential for soluble TREM2 in CSF to predict disease course in ALS patients with mixed genetic background. Soluble TREM2 cleaved from the surface of microglia has been proposed as a biomarker in other neurological diseases including Alzheimer’s disease and multiple sclerosis [20, 34, 36, 47]. We show that soluble TREM2 levels are significantly elevated in ALS compared to controls. Elevation is most marked in early disease, as has been observed in Alzheimer’s disease [46, 47]. Importantly, in patients where CSF was acquired in late stage disease, higher concentrations of soluble TREM2 are strongly associated with slower disease progression. Marked early elevation of TREM2 expression may reflect an initial immune response to deposition of pathological aggregates which declines over time. It is hypothesised that patients with higher levels of TREM2 in late disease have mounted a sustained neuroprotective microglial response (Fig. 4d).
Loss-of-function (LOF) mutations in TREM2, which have been linked to risk of Alzheimer’s disease [7, 23], and ALS , reduce phagocytosis of aggregated protein by microglia . Reduced phagocytosis may be toxic to stressed neurons and indeed TREM2 activity has been positively associated with a neuroprotective microglial phenotype . Modulating microglial activity through TREM2 has been proposed as a therapeutic target in Alzheimer’s disease . Our data suggests that this therapeutic strategy may also be applicable in ALS. In addition to TREM2, it is probable that our immune module contains other determinants of neuropathology relevant to neurodegeneration more broadly: consistent with this the immune module is enriched with GWA genes for both ALS and Alzheimer’s disease.
The role of microglia in neurodegeneration is controversial. There is evidence for microglia mediated neurotoxicity and neuroprotection. For example, [11C](R)-PK11195 positron emission tomography assay of microglial activation in motor cortex is positively correlated with burden of upper motor neuron degeneration  but compromised microglial function through LOF mutations in TREM2 increases the risk of ALS . To explain this controversy microglia are thought to be capable of multiple phenotypes which are variably neuroprotective and neurotoxic . Our immune module is derived by association with motor neuron pathology and predicts ALS prognosis. The prevalence of microglial-expressed genes within this module supports the possibility that there is a direct link between microglial function and motor neuron death. The positive correlation we have identified between soluble TREM2 concentration in CSF and a more benign ALS phenotype supports the possibility of neuroprotection mediated by microglial phagocytosis.
We have performed a scalable systematic, objective discovery of potential predictive biomarkers and a potential therapeutic target. Pathology-correlated gene expression in motor neurons has, for the first time in a data-driven manner, identified microglial function as an important determinant of ALS pathogenesis across a broad spectrum of genetically heterogeneous patients who all display TDP-43/p62 proteinopathy. Microglia are implicated in neurodegenerative disease [10, 45] and are thought to be responsible for clearance of protein aggregates, clearly linking them to development of neuropathology [16, 51]. We propose that phagocytosis of protein aggregates by microglia is likely to be therapeutic and enhanced by TREM2 signalling, making phagocytosis of protein aggregates by microglia an important focus for future translational research in ALS and other neurodegenerative diseases.
Amyotrophic lateral sclerosis
Chinese hamster ovary cells
Central nervous system
Genome wide association study
Beers DR, Henkel JS, Zhao W, Wang J, Huang A, Wen S et al (2011) Endogenous regulatory T lymphocytes ameliorate amyotrophic lateral sclerosis in mice and correlate with disease progression in patients with amyotrophic lateral sclerosis. Brain 134:1293–314
Brettschneider J, Arai K, Del Tredici K, Toledo JB, Robinson JL, Lee EB et al (2014) TDP-43 pathology and neuronal loss in amyotrophic lateral sclerosis spinal cord. Acta Neuropathol 128:423–37
Brettschneider J, Del Tredici K, Toledo JB, Robinson JL, Irwin DJ, Grossman M et al (2013) Stages of pTDP-43 pathology in amyotrophic lateral sclerosis. Ann Neurol 74:20–38
Bush WS, Moore JH (2012) Chapter 11: Genome-wide association studies. PLoS Comput Biol 8:e1002822
Cady J, Koval ED, Benitez BA, Zaidman C, Jockel-Balsarotti J, Allred P et al (2014) TREM2 variant p.R47H as a risk factor for sporadic amyotrophic lateral sclerosis. JAMA Neurol 71:449–53
Cantoni C, Bollman B, Licastro D, Xie M, Mikesell R, Schmidt R et al (2015) TREM2 regulates microglial cell activation in response to demyelination in vivo. Acta Neuropathol 129:429–47
Colonna M, Wang Y (2016) TREM2 variants: new keys to decipher Alzheimer disease pathogenesis. Nat Rev Neurosci 17:201–7
Cooper-Knock J, Kirby J, Highley R, Shaw PJ (2015) The spectrum of C9orf72-mediated neurodegeneration and amyotrophic lateral sclerosis. Neurotherapeutics 12:326–39
DeJesus-Hernandez M, Mackenzie IR, Boeve BF, Boxer AL, Baker M, Rutherford NJ et al (2011) Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron 72:245–56
DiSabato DJ, Quan N, Godbout JP (2016) Neuroinflammation: the devil is in the details. J Neurochem 139(Suppl 2):136–153
Dobrin R, Zhu J, Molony C, Argman C, Parrish ML, Carlson S et al (2009) Multi-tissue coexpression networks reveal unexpected subnetworks associated with disease. Genome Biol 10:R55
Doyle JP, Dougherty JD, Heiman M, Schmidt EF, Stevens TR, Ma G et al (2008) Application of a translational profiling approach for the comparative analysis of CNS cell types. Cell 135:749–62
Durinck S, Spellman PT, Birney E, Huber W (2009) Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc 4:1184–91
Fathi D, Mohammadi B, Dengler R, Boselt S, Petri S, Kollewe K (2016) Lower motor neuron involvement in ALS assessed by motor unit number index (MUNIX): Long-term changes and reproducibility. Clin Neurophysiol 127:1984–8
Ferraiuolo L, Heath PR, Holden H, Kasher P, Kirby J, Shaw PJ (2007) Microarray analysis of the cellular pathways involved in the adaptation to and progression of motor neuron injury in the SOD1 G93A mouse model of familial ALS. J Neurosci 27:9201–19
Fu R, Shen Q, Xu P, Luo JJ, Tang Y (2014) Phagocytosis of microglia in the central nervous system diseases. Mol Neurobiol 49:1422–34
Geschwind DH, Konopka G (2009) Neuroscience in the era of functional genomics and systems biology. Nature 461:908–15
Gladkevich A, Kauffman HF, Korf J (2004) Lymphocytes as a neural probe: potential for studying psychiatric disorders. Prog Neuropsychopharmacol Biol Psychiatry 28:559–76
Grolez G, Moreau C, Danel-Brunaud V, Delmaire C, Lopes R, Pradat PF et al (2016) The value of magnetic resonance imaging as a biomarker for amyotrophic lateral sclerosis: a systematic review. BMC Neurol 16:155
Heslegrave A, Heywood W, Paterson R, Magdalinou N, Svensson J, Johansson P et al (2016) Increased cerebrospinal fluid soluble TREM2 concentration in Alzheimer’s disease. Mol Neurodegener 11:3
Ince PG, McArthur FK, Bjertness E, Torvik A, Candy JM, Edwardson JA (1995) Neuropathological diagnoses in elderly patients in Oslo: Alzheimer’s disease, Lewy body disease, vascular lesions. Dementia 6:162–8
Jaeger PA, Lucin KM, Britschgi M, Vardarajan B, Huang RP, Kirby ED et al (2016) Network-driven plasma proteomics expose molecular changes in the Alzheimer’s brain. Mol Neurodegener 11:31
Jonsson T, Stefansson H, Steinberg S, Jonsdottir I, Jonsson PV, Snaedal J et al (2013) Variant of TREM2 associated with the risk of Alzheimer’s disease. N Engl J Med 368:107–16
Kleinberger G, Yamanishi Y, Suarez-Calvet M, Czirr E, Lohmann E, Cuyvers E et al (2014) TREM2 mutations implicated in neurodegeneration impair cell surface transport and phagocytosis. Sci Transl Med 6:243ra86
Langfelder P, Horvath S (2008) WGCNA: an R package for weighted correlation network analysis. BMC Bioinf 9:559
Liu X, Chang X (2016) Identifying module biomarkers from gastric cancer by differential correlation network. Onco Targets Ther 9:5701–5711
Liu X, Gao Z, Zhang L, Rattray M (2013) puma 3.0: improved uncertainty propagation methods for gene and transcript expression analysis. BMC Bioinf 14:39
Lombardo MV, Lai MC, Auyeung B, Holt RJ, Allison C, Smith P et al (2016) Unsupervised data-driven stratification of mentalizing heterogeneity in autism. Sci Rep 6:35333
Lu CH, Macdonald-Wallis C, Gray E, Pearce N, Petzold A, Norgren N et al (2015) Neurofilament light chain: A prognostic biomarker in amyotrophic lateral sclerosis. Neurology 84:2247–57
Mackenzie IR, Frick P, Neumann M (2014) The neuropathology associated with repeat expansions in the C9ORF72 gene. Acta Neuropathol 127:347–57
Melah KE, Lu SY, Hoscheidt SM, Alexander AL, Adluru N, Destiche DJ et al (2016) Cerebrospinal fluid markers of Alzheimer’s disease pathology and microglial activation are associated with altered white matter microstructure in asymptomatic adults at risk for Alzheimer’s disease. J Alzheimers Dis 50:873–86
Nalls MA, Bras J, Hernandez DG, Keller MF, Majounie E, Renton AE et al (2015) NeuroX, a fast and efficient genotyping platform for investigation of neurodegenerative diseases. Neurobiol Aging 36:1605.e7–12
Neumann M, Sampathu DM, Kwong LK, Truax AC, Micsenyi MC, Chou TT et al (2006) Ubiquitinated TDP-43 in frontotemporal lobar degeneration and amyotrophic lateral sclerosis. Science 314:130–3
Piccio L, Buonsanti C, Cella M, Tassi I, Schmidt RE, Fenoglio C et al (2008) Identification of soluble TREM-2 in the cerebrospinal fluid and its association with multiple sclerosis and CNS inflammation. Brain 131:3081–91
Piccio L, Cantoni C, Bollman B, Cignarella F, Mikesell R (2016) TREM2 regulates microglia activation in response to CNS demyelination. Mult Scler J 22:54–54
Piccio L, Deming Y, Del-Aguila JL, Ghezzi L, Holtzman DM, Fagan AM et al (2016) Cerebrospinal fluid soluble TREM2 is higher in Alzheimer disease and associated with mutation status. Acta Neuropathol 131:925–33
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–75
Qureshi MM, Hayden D, Urbinelli L, Ferrante K, Newhall K, Myers D et al (2006) Analysis of factors that modify susceptibility and rate of progression in amyotrophic lateral sclerosis (ALS). Amyotroph Lateral Scler 7:173–82
Raha AA, Henderson JW, Stott SR, Vuono R, Foscarin S, Friedland RP et al (2016) Neuroprotective Effect of TREM-2 in Aging and Alzheimer’s Disease Model. J Alzheimers Dis 55:199–217
Reimand J, Arak T, Adler P, Kolberg L, Reisberg S, Peterson H et al (2016) g:Profiler-a web server for functional interpretation of gene lists (2016 update). Nucleic Acids Res 44:W83
Rutkove SB (2015) Clinical measures of disease progression in amyotrophic lateral sclerosis. Neurotherapeutics 12:384–93
Salameh JS, Brown RH Jr, Berry JD (2015) Amyotrophic lateral sclerosis: review. Semin Neurol 35:469–76
Santiago JA, Potashkin JA (2014) A network approach to diagnostic biomarkers in progressive supranuclear palsy. Mov Disord 29:550–5
Saris CG, Horvath S, van Vught PW, van Es MA, Blauw HM, Fuller TF et al (2009) Weighted gene co-expression network analysis of the peripheral blood from Amyotrophic Lateral Sclerosis patients. BMC Genomics 10:405
Sochocka M, BS Diniz and J Leszek (2016) Inflammatory Response in the CNS: Friend or Foe? Mol Neurobiol
Suarez-Calvet M, Araque Caballero MA, Kleinberger G, Bateman RJ, Fagan AM, Morris JC et al (2016) Early changes in CSF sTREM2 in dominantly inherited Alzheimer’s disease occur after amyloid deposition and neuronal injury. Sci Transl Med 8:369ra178
Suarez-Calvet M, Kleinberger G, Araque Caballero MA, Brendel M, Rominger A, Alcolea D et al (2016) sTREM2 cerebrospinal fluid levels are a potential biomarker for microglia activity in early-stage Alzheimer’s disease and associate with neuronal injury markers. EMBO Mol Med 8:466–76
Sundarrajan S, Arumugam M (2016) Weighted gene co-expression based biomarker discovery for psoriasis detection. Gene 593:225–34
Turner MR, Cagnin A, Turkheimer FE, Miller CC, Shaw CE, Brooks DJ et al (2004) Evidence of widespread cerebral microglial activation in amyotrophic lateral sclerosis: an [11C](R)-PK11195 positron emission tomography study. Neurobiol Dis 15:601–9
Turner MR, Gray E (2016) Are neurofilaments heading for the ALS clinic? J Neurol Neurosurg Psychiatry 87:3–4
von Bernhardi R, Eugenin-von Bernhardi L, Eugenin J (2015) Microglial cell dysregulation in brain aging and neurodegeneration. Front Aging Neurosci 7:124
Warde-Farley D, Donaldson SL, Comes O, Zuberi K, Badrawi R, Chao P et al (2010) The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res 38:W214–20
Wes PD, Sayed FA, Bard F, Gan L (2016) Targeting microglia for the treatment of Alzheimer’s Disease. Glia 64:1710–32
Yang R, Daigle BJ Jr, Petzold LR, Doyle FJ 3rd (2012) Core module biomarker identification with network exploration for breast cancer metastasis. BMC Bioinf 13:12
Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4:Article17
Samples used in this research were in part obtained from the UK MND DNA Bank for MND Research, funded by the MND Association and the Wellcome Trust. We are grateful to all of the patients with ALS and their family members who donated biosamples for research. Special thanks to Lez Kobzik, Harvard School of Public Health for his insightful help in reviewing and improving the manuscript.
This work was supported in part by the European Community’s Seventh Framework Programme [FP7/2007-2013] under the EuroMOTOR project [grant agreement no 259867 to JK and PJS]. PJS is also supported as a National Institute for Health Research Senior Investigator and by the Medical Research Council. JCK is funded by a National Institutes for Health Research (NIHR) Clinical Lectureship in Neurology. ALP is funded by a Pathological Society of Great Britain PhD studentship award and a scholarship for postgraduate studies awarded by ‘la Caixa’ Foundation (Spain). JK has been funded by a Sheffield Hospitals Charitable Trust [grant no 131425]. This work was supported in part by the Intramural Research Programs of the NIH, National Institute on Aging [Z01-AG000949-02 to BJT]. BJT was also supported by the Agency of Toxic Substances and Disease Registry, Centre for Disease Control.
Availability of data and materials
The gene expression CEL files are available at Gene Expression Omnibus (www.ncbi.nlm.nih.gov/geo/).
Conceived the concept and designed the experiments: WH, JCK, JRH, AP, JK, GA, WW, CG and PJS. Performed the experiments: JCK, JRH, AP, JJB, MW, PRH, KD, KO and BT. Analyzed the data: JCK, KO, GA, WW and WH. Contributed reagents/materials/analysis tools: BT, KO, JRH, JK, PRH and PJS. Wrote the paper: JCK, CG, GA, KO, BT, JK, PJS and WH. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
All samples were collected with written informed consent from the donor. Tissue donated for research was obtained with written informed consent from the next of kin, and in accordance with the UK Human Tissue Authority guidelines on tissue donation. The work was approved by the South Yorkshire Ethics Committee.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.