- Open Access
Identification of a neuronal transcription factor network involved in medulloblastoma development
Acta Neuropathologica Communications volume 1, Article number: 35 (2013)
Medulloblastomas, the most frequent malignant brain tumours affecting children, comprise at least 4 distinct clinicogenetic subgroups. Aberrant sonic hedgehog (SHH) signalling is observed in approximately 25% of tumours and defines one subgroup. Although alterations in SHH pathway genes (e.g. PTCH1, SUFU) are observed in many of these tumours, high throughput genomic analyses have identified few other recurring mutations. Here, we have mutagenised the Ptch+/- murine tumour model using the Sleeping Beauty transposon system to identify additional genes and pathways involved in SHH subgroup medulloblastoma development.
Mutagenesis significantly increased medulloblastoma frequency and identified 17 candidate cancer genes, including orthologs of genes somatically mutated (PTEN, CREBBP) or associated with poor outcome (PTEN, MYT1L) in the human disease. Strikingly, these candidate genes were enriched for transcription factors (p= 2x10-5), the majority of which (6/7; Crebbp, Myt1L, Nfia, Nfib, Tead1 and Tgif2) were linked within a single regulatory network enriched for genes associated with a differentiated neuronal phenotype. Furthermore, activity of this network varied significantly between the human subgroups, was associated with metastatic disease, and predicted poor survival specifically within the SHH subgroup of tumours. Igf2, previously implicated in medulloblastoma, was the most differentially expressed gene in murine tumours with network perturbation, and network activity in both mouse and human tumours was characterised by enrichment for multiple gene-sets indicating increased cell proliferation, IGF signalling, MYC target upregulation, and decreased neuronal differentiation.
Collectively, our data support a model of medulloblastoma development in SB-mutagenised Ptch+/- mice which involves disruption of a novel transcription factor network leading to Igf2 upregulation, proliferation of GNPs, and tumour formation. Moreover, our results identify rational therapeutic targets for SHH subgroup tumours, alongside prognostic biomarkers for the identification of poor-risk SHH patients.
Medulloblastoma (MB) is the most common malignant brain tumour of childhood, accounting for around 10% of paediatric cancer deaths. Despite recent therapeutic advances, up to 40% of patients still die from their disease and cure is often associated with disabling therapy-related effects in later life. New therapeutic approaches based on improved biological understanding of the disease will be essential to improve outcomes, through strategies such as the delivery of risk-adapted therapies guided by molecular prognostic biomarkers, and the stratified use of molecularly-targeted agents (reviewed in ).
Four major molecular subtypes of medulloblastoma with distinct biological, clinical and pathological features are now recognised, and this subgrouping is beginning to have clinical impact [1–3]. WNT subgroup tumours (~15% of total) are characterised by activation of the wnt/wingless pathway through CTNNB1 mutations, and appear to originate from progenitor cells derived from the dorsal brain stem . Patients with WNT-associated tumours have a favourable prognosis and will receive reduced therapies in forthcoming international clinical trials [1, 5, 6]. The SHH subgroup (~25% of total) is defined by activation of the sonic hedgehog signalling pathway, and mutations in SHH pathway genes (e.g. PTCH1, SUFU) arise in a significant subset [2, 3]. All evidence suggests that these tumours originate from cerebellar granule neuron precursors (cGNPs) within the external granular layer of the developing cerebellum (reviewed in ) or cochlear nuclei of the brainstem . SHH tumours are associated with an intermediate prognosis. Early clinical trials of SHH pathway inhibitors are underway, although acquired resistance has been reported  and tumours with downstream pathway mutations (e.g. GLI2, SUFU) are predicted to be insensitive to their action . Group 3 and 4 tumours are more heterogeneous and show overlapping molecular features such as frequent chromosome 17 defects. However, Group 3 tumours (~25% of total) have been associated with high-risk features, such as MYC gene amplification and large-cell/anaplastic (LCA) pathology, and a poor prognosis .
Recent genome-wide studies have further underlined the complexity of MB; heterogeneous mutations targeting processes including histone methylation and chromatin remodelling have been discovered, but these typically describe limited subsets of tumours and few additional recurrent mutations targeting specific genes and/or pathways have been identified [11–15]. Critical genes therefore remain to be uncovered in all MB subgroups and there is a growing need to identify low-frequency alterations which drive disease progression, distinguish these from passenger mutations, and determine their mechanisms of action and clinical significance. Primary tumour data alone may not be sufficient to achieve these goals.
A number of murine MB models have been created allowing comparative analyses (reviewed in ), including models which recapitulate WNT  and Group 3 [17, 18] tumours. However, the most widely studied mouse model is a knockout of the Shh transmembrane receptor, Ptch , which mimics SHH subgroup tumours. Ptch+/- heterozygotes develop MBs at a frequency which is significantly influenced by genetic background . Sleeping Beauty ( SB) murine mutagenesis [21, 22], coupled to statistical analysis of insertion site distribution [23, 24], has emerged as a powerful method to identify genes involved in a wide variety of human cancers [24–29]. Recently the SB11 transposase  driven by the Math1 promoter has been used to mutagenise developing neuronal tissues in both the Ptch+/- and p53 loss-of-function models of MB . This identified a large number of candidate genes potentially involved in MB progression, and demonstrated that the genetic events observed in metastases show limited overlap with those in matched primary tumours, supporting a bicompartmental genetic model of metastatic disease [30, 31].
Here, we report the application of whole-body SB mutagenesis  to the Ptch+/- tumour model, and the identification of 17 genes associated with enhanced medulloblastoma formation. We show that these genes are enriched for neuronal transcription factors defining a novel gene network which, when mutagenised by SB, is associated with increased cell proliferation and reduced neuronal differentiation. Significantly, increased expression of Igf2, a gene known to be essential for tumour formation in the Ptch+/- model, is associated with insertional mutations within this network. Moreover, we show that in human disease, network activity predicts poor survival specifically within the SHH tumour subgroup. Together, these findings provide important novel insights into the molecular mechanisms of medulloblastoma pathogenesis, and identify exploitable therapeutic targets and prognostic biomarkers for development towards improved therapy.
The incidence of MB but not RMS is enhanced in mutagenised Ptch+/- mice
A total of 243 mutagenised Ptch+/- animals (Ptch+/-; SB11+/-;T2Onc+/-) and 195 control littermates were aged for up to 15 months and monitored for tumour development. Mortality in mutagenised Ptch+/- animals was approximately 90% after 1 year, significantly higher than in the predisposition only (Ptch+/-;T2Onc+/-) or transposition only (T2Onc+/-;SB11+/-) control genotypes (Figure 1a and Additional file 1: Table S1). Approximately ~28% of mutagenised Ptch+/- animals aged for more than 6 months succumbed to haematological neoplasms, which usually presented as large thoracic tumours and/or hepatosplenomegaly. A low frequency of parenchymal brain lesions consistent with glial tumours (2%) was also observed. These malignancies occurred at comparable frequencies within the transposition controls, have previously been reported using the same transposase/transposon combination [26, 29], and are not analysed further here.
Despite this mutagen-specific burden, significant mortality within mutagenised Ptch+/- animals was related to the Ptch genotype. Large exophytic and/or invasive MBs (Figure 1b and c) developed in ~23% of mutagenised Ptch+/- animals aged for over 6 months, with no macroscopic meningeal masses being observed (typical of the Ptch model ). Tumours with indistinguishable pathology were observed in the predisposition controls but at a much lower frequency (6%), with survival analysis providing clear evidence that SB mutagenesis enhanced the predisposition of Ptch+/- mice to MB (p<0.0001, Figure 1d). Slow growing and locally invasive rhabdomyosarcomas (RMSs), also typical of the Ptch+/- model , developed in ~22% of experimental animals aged over 6 months (Figure 1e and f). However, mutagenesis did not significantly alter RMS related mortality relative to predisposition controls (Figure 1g, p=0.31). Finally, multiple liver tumours (Figure 1h-j), morphologically similar to those generated using a conditional SB11 screen for hepatocellular carcinoma , were observed in 19 mutagenised Ptch+/- mice but not in predisposition controls. Although not previously reported in whole body SB mutagenesis screens, liver tumours were also observed at low frequency in transposition only controls (Additional file 1: Table S1), suggesting that they can be induced by SB mutagenesis alone.
SB insertions target neuronal transcription factors
To identify genes responsible for the impact of mutagenesis upon MB incidence, transposon insertion sites within MBs and control cerebellar tissues were recovered using Splinkerette PCR, sequenced, and their distribution analysed using Gaussian kernel convolution (GKC ) to identify common insertion sites (CISs, see Methods). A total of 17 genes were identified within 20 CISs recovered [median p-value = 0.008 (Table 1)]. The majority were also recovered using Monte Carlo simulation analysis , and these are highlighted in bold in Table 1. Three of the CISs target known tumourigenic genes (Crebbp, Nfib and Pten), mutations within orthologs of two (Crebbp and Pten) are somatically mutated at low frequency in human MB [11, 33], and six (Crebbp, Nfia, Nfib, Pten, Sfi1, and Tead1) have recently been identified as MB CISs in a tissue-specific mutagenesis screen . Strikingly, ontological analysis established that six of the 17 CIS genes have transcription factor activity (Nfia, Tead1, Tgif2, Nfib, Myt1l and L3mbtl4), a highly significant excess relative to expectation (FDR corrected p-value = 2×10-5). Furthermore, seven genes (Tgif2, Pten, Nfia, Nfib, Myt1l, Slit3 and Fgf13) are implicated in neuronal biological processes. In contrast, only three genic CISs were identified in control tissue, all with modest p-values (0.01-0.05, Table 1). All of these observations are consistent with insertional mutations at CISs contributing to increased penetrance of the tumour phenotype. As all SB insertions within Tgif2 mapped to a single ~4 kb intron, these were analysed in detail using both genomic DNA and cDNA templates. This both validated our sequence data and confirmed the inferred upregulation of this gene by SB insertion (Additional file 2: Figure S1).
Association of CIS genes with survival and focal copy number alterations
To investigate the association between CIS gene expression and survival, log-rank (Mantel-Cox) tests were performed on median split microarray expression data from human tumours . Reduced expression of two genes, PTEN (a tumour suppressor previously implicated in MB) and MYT1L, was associated with poor outcome (Figure 2). This is consistent with the inferred mode of action of the SB insertions that target these genes (Table 1). Furthermore, the association of MYT1L expression with survival remained significant within a Cox-regression model incorporating high-risk clinical features using the data from Cho, Tsherniak et al , even after exclusion of the good prognosis WNT subgroup (p=0.011, see Additional file 3: Table S2).
To investigate the relationship between CIS genes and regions of chromosomal loss or gain defined within primary tumours [35, 36], a GISTIC analysis was also performed : PTEN maps within a well-established region of common chromosomal loss on 10q associated with SHH tumours , and both NFIB and TMEM45B were found to be present within peak regions of localised copy number gain (Additional file 4: Figure S2). While this is consistent with the mode of action inferred for PTEN and NFIB, the mode of action of TMEM45B remains unclear from insert data alone (Table 1).
CIS genes are differentially expressed in MB clinico- genetic subgroups
To establish whether the CIS genes are relevant specifically to the SHH subgroup of tumours, we used published data sets [38, 39] to compare expression of human orthologs in SHH subgroup tumours with expression in all other subgroups. Of the 17 CIS genes, 9 show significant differential expression when SHH subgroup tumours are compared to all others (including MYT1L and PTEN discussed above), and 15 show differential expression in one or more clinicogenetic subgroups (Table 2). However, only two (ITGBL1 and L3MBTL4) show clear differential expression in the SHH subgroup alone. Some show marked differential expression in a single non-SHH subgroup, such as NFIA (Group 3 tumours) and FGF13 (Group 4 tumours), or in more than one subgroup (e.g. TGIF2 and MYT1L), suggesting that these genes may be relevant to MB in general. We also investigated expression with respect to the presence/absence of metastatic disease, and 7 genes show a significant association, most notably genes with extreme expression values in Group 3 and 4 tumours where metastatic disease is common (Table 2).
For 13 genes, a comparison of expression in human tumours and normal cerebellum was also possible (using data from Cho, Tsherniak et al. ) and a total of 10 genes show significantly different expression between cerebellum and either SHH subgroup tumours alone or all tumours (Additional file 5: Table S3), consistent with dysregulation of expression during tumorigenesis. Furthermore, the direction of expression change observed is generally consistent with the predicted mode of action of each CIS. For instance, expression levels of FGF13, NFIB, TEAD1 and TGIF2 are all increased versus normal cerebellum whilst expression of MYT1L, SFI1, and SLIT3 is appropriately reduced (Table S2) in line with the inferred mechanism of action (Table 1).
CIS genes define a neuronal transcription factor network in human MBs
The significant enrichment for transcription factor (TF) activity within the MB CIS genes raised the possibility that they could be present within co-ordinated signalling or developmental pathways. ARACNE [40, 41] is a method which uses gene-gene co-regulation measures, and elimination of indirect relationships, to infer TF-target interactions within expression data. It has successfully been used to identify novel oncogenes in expression datasets from glioma  and acute lymphoblastic leukaemia . We used ARACNE to infer regulatory networks within publicly available MB gene expression data ([38, 39] see Methods). Strikingly, seven CIS genes, including four of the five CISs with the highest GKC p-values, were linked within a single network either directly or via nearest neighbours (Figure 3). This cluster of CIS nodes is highly significant (p=0.006 using 1000 randomly re-sampled networks) and consists of 6 genes with transcription factor/cofactor activity (CREBBP, MYT1L, NFIA, NFIB, TEAD1 and TGIF2) and one neuronal growth factor (FGF13). Gene ontology analysis (see Methods) established that the extended network is enriched both for transcription factors/regulators (p=0.0026/0.0035, Additional file 6: Table S4, yellow in Figure 3) and for genes with ontologies relating to cellular components of differentiated neurons (p=0.019-0.0042, Additional file 6: Table S4, green in Figure 3). This suggests that the network consists primarily of neuronal transcription factors and their targets.
To investigate the relative activity of genes within these enriched ontologies in human tumours, expression heatmaps of CIS genes, transcription factors, and neuronal genes within the network were generated (Figure 4a). There are clear expression differences between clinicogenetic subgroups, consistent both with the ANOVA analysis of CIS gene expression across subgroups (Table 2) and with the presence of genes previously shown to be highly expressed in Group 3 and 4 tumours (e.g. NEUROD2, GABBR2 ). However, most striking are the neuronal genes which include neurotransmitter receptors and synaptic scaffold/matrix proteins, the vast majority of which show low expression in the SHH and WNT tumours.
Four of the seven networked CIS genes have recently been identified as MB CISs in a tissue-specific SB mutagenesis screen of primary tumours generated using the Ptch model (Crebbp, Nfia, Nfib and Tead1 ), and orthologs of a further three genes within our network (Dip2c, Edil3 and Erbb4) were identified as MB CISs in the same screen. Strikingly, with the exception of Tead1, both alleles of all of these genes were targeted by inserts in primary tumours in this screen , indicative of key tumour promoting events. We therefore analysed the distribution of all 17 biallelic events identified in this tissue-specific study and found them to be significantly enriched within our network (6/90, 7%) compared to outside our network (11/5823, 0.19%; Fisher’s Exact Test p <0.00001). This provides evidence that the TF network identified here was also targeted in an independent SB screen.
Network activity correlates with advanced disease and survival in SHH tumours
To assess network activity within human tumour datasets further, we generated a single “metagene” metric to summarise the expression of CIS network genes. The expression of each gene was signed according to direction of correlation with other CIS genes, such that a single score reflected the unified action of all genes (see Additional file 7). As a result, the expression of genes where CISs are inferred to cause loss of function was positively correlated with metagene score (e.g. MYT1L), whereas the expression of genes where CISs are inferred to cause gain of function (e.g. TGIF2, Additional file 2: Figure S1) was negatively correlated. Metagene activity was then investigated with respect to MB subgroups and clinical features. As expected from the expression heatmaps (Figure 4a), the activity of the network metagene differs significantly between MB clinicogenetic subgroups (F = 62.8 p<0.0001 Figure 4b), with the highest network activity being observed in Group 4 tumours. Metagene activity is also higher in tumours presenting with metastatic disease when compared to those that do not (bootstrapped t = 2.388; p<0.013 Figure 4c). This is likely to reflect the high metagene expression in Group 4 tumours where metastases are frequently observed. A subgroup specific analysis of available expression data which has associated survival information , however, showed that network activity correlates significantly with survival in SHH subgroup tumours (log-rank 8.03; p <0.005, Figure 4d), but not in other subgroups (data not shown).
Microarray expression analysis of mutagenised tumours identifies Igf2 as a key network associated gene
To identify specific genes whose expression might be altered by network mutations, we generated murine expression data using Illumina bead arrays (see Methods) from 30 SB induced tumours, 6 non-mutagenised tumours from Ptch+/- animals, and 6 normal cerebella. We first validated the mutagenised murine model in terms of gene expression as follows: Four metagenes were generated from human tumour expression data using NMF (see Methods) to define the four MB biological subgroups. These metagenes were then projected across the mouse tumour data using all available orthologous probes (Figure 5a), and the subgroup identity of the mouse tumours was then tested with a Support Vector Machine (SVM) using the metagene scores for the human data as the training set and the mouse tumours as the test set. All human tumours were trained correctly with zero errors. The mouse SB tumours were also correctly predicted to be SHH tumours in 29/30 (96%) of cases (Figure 5b) and 6/6 (100%) of non-transposon PTCH MB controls, establishing that gene expression in SB induced mouse tumours is similar to expression in human SHH tumours.
We then looked for genes differentially expressed between tumours with hits in CIS network genes and tumours with no hits in these genes. This identified the Insulin-like growth factor 2 (Igf2) as the most differentially expressed gene with a mean fold change of 3.58 (p=0.002, Additional file 8: Table S5). To validate the association between network insertions and Igf2 expression, we quantified Igf2 expression in all of our SB-induced MB tumours for which RNA was available using Real Time PCR (See Methods). This confirmed that Igf2 was expressed at a significantly higher level in tumours with one or more insertions in a network CIS gene, than in tumours with no insertion in a network CIS gene (p<0.0001, Figure 5c). Furthermore, we also established that tumours with hits in Nfia, the CIS gene most frequently affected by SB insertion, expressed Igf2 at higher levels than network tumours with no insert in Nfia (Figure 5c).
CIS related Network activity is associated with proliferation and reduced differentiation
Finally, to gain insight into biological processes which may be affected by the network, we also performed a Gene Set Enrichment Analysis (GSEA) which uses gene ranking to test for enrichment of predefined genesets . This was performed both in mouse tumours (ranking genes by fold change of expression in SB-induced tumours with network hits compared to Ptch+/- MBs not exposed to mutagenesis) and in primary human MBs (ranking by association with metagene score).
GSEA revealed a broad picture of increased cell proliferation and reduced differentiation associated with network hits in mice and low network metagene activity in human tumours (the concordant gene expression pattern), as demonstrated by significant enrichment of multiple genesets (see Additional file 9: Table S6). For instance, genesets indicative of neuronal differentiation, such as “Cahoy_Neuronal” and “Lein_Neuron_Markers”, are significantly enriched in human MBs with high metagene expression and in MBs from murine PTCH controls with no transposition (e.g. Figure 5d). A similar effect is seen for genesets describing genes containing CREB and cAMP responsive elements (e.g. Additional file 10: Figure S3A). In contrast, genesets denoting proliferation and elevated cell growth are significantly enriched in human MB with low network metagene activity and mouse PTCH MBs with CIS network hits. These include genesets linked to mitosis and cell cycle, MYC targets (Additional file 10: Figure S3B), and ribosome biogenesis (Figure 5e).
Consistent with increased Igf2 expression in MBs from mice with network hits, enrichment of IGF-related genesets is also observed in human and mouse tumours; e.g. enrichment of Pacher_Targets_of_IGF1 _and_IGF2 _up in mouse tumours with network hits (Additional file 10: Figure S3C), and of Boudoukha_Bound_by_IGF2BP2 in human tumours with a low Metagene score (Additional file 10: Figure S3D). In addition, the genesets Lee_Targets_of PTCH1 _and_SUFU “ _up” and “_dn”, indicative of SHH dependent murine tumorigenesis , show consistent differential enrichment; upregulated targets are enriched in mouse network tumours and human tumours with a low Metagene score (Additional file 10: Figure S3E), and downregulated targets are enriched in mouse tumours with no mutagenesis and human tumours with a high Metagene score (Additional file 10: Figure S3F). Collectively, the broad concordance of human and murine datasets indicates conservation of CIS mutational function across species, and substantiates the concerted action of CIS network genes as a tumourigenic process promoting cell proliferation and inhibiting neuronal differentiation.
We have shown that whole body SB mutagenesis of Ptch+/- mice significantly enhances MB frequency without affecting latency, and does not induce these tumours in wild type mice. The majority of candidate genes identified have either been implicated in neuronal development, differentiation and/or migration, have been linked to SHH signalling, or have been shown to be mutated in SHH subgroup tumours in humans. Furthermore, we found that one gene identified from our screen, MYT1L, is of prognostic value within a multivariate analysis of human MB survival data. These genes, therefore, warrant individual assessment as potential therapeutic targets and/or biomarkers for the improved stratification and treatment of medulloblastoma.
Notably, 7 of the genes (6 transcription factors/cofactors and 1 growth factor) show significant associations with each other within a novel MB expression network, implying a previously unidentified functional relationship which does not map to established canonical pathways. This network is enriched for transcriptional regulators and genes with neuronal ontologies, and links genes with roles in stem cell/neuron migration, neurite growth and neuronal cell cycle progression, to genes which encode structural and functional elements of mature neurons (See Additional file 11: Table S7 for known gene functions). This suggests that the network comprises transcription factors involved in the proliferation and differentiation of cGNPs, the cell of origin for SHH MB , and their targets. Interestingly, lineage commitment to cGNP identity is a pre-requisite for SHH tumour development [46, 47].
The relationships identified here between murine network activity, metagene activity in human tumour subtypes, and individual CIS gene expression, are summarised in Figure 6. This highlights the variable network activity within SHH tumours and Group 3 tumours, and identifies CIS genes with similar, or wholly divergent, expression patterns relative to the metagene. Of these, TGIF2 and MYT1L are of particular interest as they influence neuronal SHH expression and development, respectively: A conditional Tgif1/Tgif2 double knock-out has recently been shown to reduce Shh expression in the developing brain and to recapitulate holoprosencephaly [a human condition caused by SHH and TGIF1 mutations (OMIM# 142945 and 142946)], while the transcriptional repressor MYT1L can contribute to the re-programming of human fibroblasts into neurons [48, 49]. As neither gene has been implicated in MB development to date, both are prime targets for further investigation.
Importantly, network activity has clinical relevance, as high activity is associated with advanced disease in all tumours and low activity is associated with poor survival specifically in SHH subgroup tumours (Figure 6). These associations appear incongruous, but the former is likely due to the high incidence of metastases in Group 4 tumours where network activity is uniformly high. In contrast, the SHH subgroup-specific association with outcome may reflect clinically important variation in the developmental status of individual tumours, and highlights the potential utility of network activity as a prognostic biomarker for the prediction of outcome within the SHH subgroup.
The GSEA analysis in mouse and human tumours demonstrates very clearly a role for this network in inhibiting neuronal differentiation and promoting cell proliferation. Consistent with this, several common functional pathways were identified in both species of potential relevance to disease. Of these, the MYC and IGF-dependent signalling pathways are of particular interest, the latter having recently been highlighted in an independent SB screen . Furthermore, our analysis of gene expression in SB mutagenized mouse tumours identified Igf2 upregulation as a key output of SB-induced network perturbation. Igf2 is already known to be required for MB development in the Ptch+/- model as no tumours are observed in Igf2 null; Ptch+/- mice , over-expression of Igf2 in Ptch+/- mice increases the frequency of MBs generated by Shh transfection of cerebellar neural progenitors , and at the cellular level Igf2 acts synergistically with Shh to increases murine cGNP cell proliferation 10 fold .
The results presented here suggest that network mutations converge to inhibit differentiation and upregulate Igf2. This extends the existing model of MB formation in SB mutagenised Ptch+/- mice by identifying genes underpinning the upregulation of Igf2 which leads to the persistence of Ptch+/--induced cerebellar proliferative lesions and progression to MB . Consistent with this model, several network CIS genes, or genes which they bind/modulate, have already been implicated in Igf2 expression or activity, including Tead1 , Nfia and Nfib [55, 56], and Crebbp . There is an unmet clinical need for the development of SHH pathway-independent targeted therapies for SHH subgroup tumours, particularly in view of the predicted acquired or intrinsic resistance to current SMO inhibitors [9, 10]. The implication of insulin-dependent signalling in human and mouse SHH tumours strongly support its development as a therapeutic target for SHH subgroup tumours.
The application of SB mutagenesis to additional murine MB models [17, 18] could identify genes relevant to other tumour subgroups. However, our results contrast sharply with a recent Math1 driven tissue-restricted screen of a more penetrant Ptch+/- model  where tumour latency was reduced from 8 to 2.5 months, a high frequency of metastases (80%) was observed, and divergent primary and metastatic insertional mutation signatures were defined. Notably, we did not observe metastases following whole-body mutagenesis in this study. These two models are, therefore, not directly comparable and suggest that penetrance of the tumour predisposition, and the power of the mutagen, are likely to determine the nature of genes identified in future screens.
Finally, this is the first time to our knowledge that mutagenesis data from a murine cancer model have been integrated with human expression networks to explore biological mechanisms of tumourigenesis. The identification of novel and biologically relevant candidate genes linked within a single expression network, the activity of which correlates with disease state and survival within the subgroup of tumours being modelled, illustrates the utility of this cross-species approach. Clarification of the interactions between network genes identified here, their roles in the pathways highlighted by our GSEA analysis, and establishment of their therapeutic relevance will, however, require extensive functional analyses of multiple genes both individually and in concert.
Here, we have used SB mutagenesis to define a novel neuronal transcription factor network involved in medulloblastoma formation within the Ptch+/- model, and provide evidence that disruption of this network upregulates Igf2, critical for proliferation of GNPs and tumour formation. Moreover, we have identified rational therapeutic targets for SHH subgroup tumours, alongside prognostic biomarkers for the identification of poor-risk SHH patients, supporting the further development of these findings as a basis for improved and individualised therapy. Our results also suggest that the integration of mutagenesis data and expression network analysis may help to unravel key events in other cancers which disrupt complex developmental programmes, for which murine models are available.
The following strains were used: B6 Ptch1tm1Mps/J mice ; CBA wild type (Charles River laboratories, Margate, UK); T2Onc line 76  and SB11 Rosa26 . All animal work adhered to UK Home Office guidelines and was performed under Project Licence PLL/60/3621. Animal numbers, together with tumour incidence in each genotype, are given in Additional file 1: Table S1.
Sample processing, insertion site mapping and CIS identification
Tumours and other abnormal tissue identified upon post-mortem examination were collected for histological examination and DNA/RNA isolation. Insertion sites were identified using splinkerette PCR  coupled to GS-FLX amplicon sequencing  and sequence reads were mapped to the mouse genome (NCBI37/mm9) as described previously . Common insertion sites (CISs) were identified using Gaussian kernel convolution (GKC) . The raw p-value of each CIS peak was corrected for the total number of CIS peaks on the chromosome to which it maps, with a cut off of p<0.05. Monte Carlo simulation methods  were also used for comparative purposes.
Illumina expression analysis and Real Time PCR
ng of each RNA was amplified and biotin labelled using the Illumina TotalPrep RNA amplification kit (Applied Biosystems, Foster City, CA. USA). cRNA size distribution was assessed using an Agilent Bioanalyser. Approximately 750 ng of each cRNA was hybridised to the Illumina Mouse8 Reference Array (Illumina, Essex, UK) according to the manufacturer’s recommended protocols by the Wellcome Trust Clinical Research Facility (Edinburgh, UK). Real Time PCR was performed using the 5’ nuclease assay on the ABI PRISM™ 7700 Sequence detector (Perkin-Elmer, Applied Biosystems, Foster City, CA, USA). Oligonucleotides were designed using Primer Express software (v3.0, PE Biosystems, Foster City, CA, USA), and were designed to span an intron to avoid amplification from genomic DNA. Mean Ct values were normalised against the average expression of the endogenous control gene β-Actin. Relative gene expression was calculated by the 2-ΔΔCt method  using the control cerebellar RNA which showed the highest expression level as measured by the Illumina microarray data. For primer sequences see Additional file 7.
Expression profiles comprising 119 Affymetrix HGU133p2 arrays were taken from published studies [38, 39]. CEL files were processed using the Bioconductor RMA package . The ARACNE network was constructed using the aracne2 standalone software package according to authors instructions . Ontology analyses were performed using the Bingo 2.44 Cytoscape plug in , with Benjamini and Hochberg FDR-corrected hypergeometric tests and the whole annotation set as background. The topological arrangement of 7 network genes was significance tested by calculating the mean shortest path distance to the nearest connecting CIS gene and permuting 10,000 times with 7 randomly selected genes in order to create a null distribution.
GISTIC analyses  were performed using the module provided in Genepattern , with Affymetrix SNP Chip .CEL files [35, 36] being processed using the Aroma package  and segmented using the CBS algorithm . GSEA  was performed using the standalone package (http://www.broadinstitute.org/gsea/) and genesets were taken from the MsigDB library . NMF (Non-Negative Matrix Factorisation) was performed using a script adapted from . All other statistical tests were performed using R . For further details of procedures and analyses, see Additional file 7.
Pizer BL, Clifford SC: The potential impact of tumour biology on improved clinical practice for medulloblastoma: progress towards biologically driven clinical trials. Br J Neurosurg 2009,23(4):364–375. 10.1080/02688690903121807
Taylor MD, Northcott PA, Korshunov A, Remke M, Cho YJ, Clifford SC, Eberhart CG, Parsons DW, Rutkowski S, Gajjar A, Ellison DW, et al.: Molecular subgroups of medulloblastoma: the current consensus. Acta Neuropathol 2012,123(4):465–472. 10.1007/s00401-011-0922-z
Schwalbe EC, Williamson D, Lindsey JC, Hamilton D, Ryan SL, Megahed H, Garami M, Hauser P, Dembowska-Baginska B, Perek D, Northcott PA, et al.: DNA methylation profiling of medulloblastoma allows robust subclassification and improved outcome prediction using formalin-fixed biopsies. Acta Neuropathol 2013,125(3):359–371. 10.1007/s00401-012-1077-2
Gibson P, Tong Y, Robinson G, Thompson MC, Currle DS, Eden C, Kranenburg TA, Hogg T, Poppleton H, Martin J, Finkelstein D, et al.: Subtypes of medulloblastoma have distinct developmental origins. Nature 2010,468(7327):1095–1099. 10.1038/nature09587
Ellison DW, Onilude OE, Lindsey JC, Lusher ME, Weston CL, Taylor RE, Pearson AD, Clifford SC: Beta-catenin status predicts a favorable outcome in childhood medulloblastoma: the United Kingdom children’s cancer study group brain tumour committee. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 2005,23(31):7951–7957. 10.1146/annurev.pathmechdis.3.121806.151518
Ellison DW, Kocak M, Dalton J, Megahed H, Lusher ME, Ryan SL, Zhao W, Nicholson SL, Taylor RE, Bailey S, Clifford SC: Definition of disease-risk stratification groups in childhood medulloblastoma using combined clinical, pathologic, and molecular variables. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 2011,29(11):1400–1407. 10.1007/s00401-012-0961-0
Gilbertson RJ, Ellison DW: The origins of medulloblastoma subtypes. Annu Rev Pathol 2008, 3: 341–365. 10.1126/science.1179386
Grammel D, Warmuth-Metz M, von Bueren AO, Kool M, Pietsch T, Kretzschmar HA, Rowitch DH, Rutkowski S, Pfister SM, Schuller U: Sonic hedgehog-associated medulloblastoma arising from the cochlear nuclei of the brainstem. Acta Neuropathol 2012,123(4):601–614. 10.1007/s00401-011-0918-8
Yauch RL, Dijkgraaf GJ, Alicke B, Januario T, Ahn CP, Holcomb T, Pujara K, Stinson J, Callahan CA, Tang T, Bazan JF, et al.: Smoothened mutation confers resistance to a hedgehog pathway inhibitor in medulloblastoma. Science 2009,326(5952):572–574. 10.1126/science.1198056
Korshunov A, Remke M, Kool M, Hielscher T, Northcott PA, Williamson D, Pfaff E, Witt H, Jones DT, Ryzhova M, Cho YJ, et al.: Biological and clinical heterogeneity of mycn-amplified medulloblastoma. Acta Neuropathol 2012,123(4):515–527. 10.1038/nature11284
Parsons DW, Li M, Zhang X, Jones S, Leary RJ, Lin JC, Boca SM, Carter H, Samayoa J, Bettegowda C, Gallia GL, et al.: The genetic landscape of the childhood cancer medulloblastoma. Science 2011,331(6016):435–439. 10.1038/nature11327
Jones DT, Jager N, Kool M, Zichner T, Hutter B, Sultan M, Cho YJ, Pugh TJ, Hovestadt V, Stutz AM, Rausch T, et al.: Dissecting the genomic complexity underlying medulloblastoma. Nature 2012,488(7409):100–105. 10.1038/nature11329
Northcott PA, Shih DJ, Peacock J, Garzia L, Morrissy AS, Zichner T, Stutz AM, Korshunov A, Reimand J, Schumacher SE, Beroukhim R, et al.: Subgroup-specific structural variation across 1,000 medulloblastoma genomes. Nature 2012,488(7409):49–56. 10.1038/nature11213
Pugh TJ, Weeraratne SD, Archer TC, Pomeranz Krummel DA, Auclair D, Bochicchio J, Carneiro MO, Carter SL, Cibulskis K, Erlich RL, Greulich H, et al.: Medulloblastoma exome sequencing uncovers subtype-specific somatic mutations. Nature 2012,488(7409):106–110.
Robinson G, Parker M, Kranenburg TA, Lu C, Chen X, Ding L, Phoenix TN, Hedlund E, Wei L, Zhu X, Chalhoub N, et al.: Novel mutations target distinct subgroups of medulloblastoma. Nature 2012,488(7409):43–48. 10.1016/j.ccr.2011.12.021
Wu X, Northcott PA, Croul S, Taylor MD: Mouse models of medulloblastoma. Chin J Cancer 2011,30(7):442–449. 10.1016/j.ccr.2011.12.023
Pei Y, Moore CE, Wang J, Tewari AK, Eroshkin A, Cho YJ, Witt H, Korshunov A, Read TA, Sun JL, Schmitt EM, et al.: An animal model of myc-driven medulloblastoma. Cancer Cell 2012,21(2):155–167. 10.1126/science.277.5329.1109
Kawauchi D, Robinson G, Uziel T, Gibson P, Rehg J, Gao C, Finkelstein D, Qu C, Pounds S, Ellison DW, Gilbertson RJ, et al.: A mouse model of the most aggressive subgroup of human medulloblastoma. Cancer Cell 2012,21(2):168–180. 10.1016/j.canlet.2005.03.047
Goodrich LV, Milenkovic L, Higgins KM, Scott MP: Altered neural cell fates and medulloblastoma in mouse patched mutants. Science 1997,277(5329):1109–1113. 10.1038/nature03681
Pazzaglia S: Ptc1 heterozygous knockout mice as a model of multi-organ tumorigenesis. Cancer Lett 2006,234(2):124–134. 10.1038/nature03691
Collier LS, Carlson CM, Ravimohan S, Dupuy AJ, Largaespada DA: Cancer gene discovery in solid tumours using transposon-based somatic mutagenesis in the mouse. Nature 2005,436(7048):272–276. 10.1371/journal.pcbi.0020166
Dupuy AJ, Akagi K, Largaespada DA, Copeland NG, Jenkins NA: Mammalian mutagenesis using a highly mobile somatic sleeping beauty transposon system. Nature 2005,436(7048):221–226. 10.1126/science.1163040
de Ridder J, Uren A, Kool J, Reinders M, Wessels L: Detecting statistically significant common insertion sites in retroviral insertional mutagenesis screens. PLoS Comput Biol 2006,2(12):e166. 10.1182/blood-2011-02-338848
Starr TK, Allaei R, Silverstein KA, Staggs RA, Sarver AL, Bergemann TL, Gupta M, O’Sullivan MG, Matise I, Dupuy AJ, Collier LS, et al.: A transposon-based genetic screen in mice identifies genes altered in colorectal cancer. Science 2009,323(5922):1747–1750. 10.1158/0008-5472.CAN-09-1760
van der Weyden L, Giotopoulos G, Rust AG, Matheson LS, van Delft FW, Kong J, Corcoran AE, Greaves MF, Mullighan CG, Huntly BJ, Adams DJ: Modeling the evolution of etv6-runx1-induced b-cell precursor acute lymphoblastic leukemia in mice. Blood 2011,118(4):1041–1051. 10.1038/nbt.1526
Collier LS, Adams DJ, Hackett CS, Bendzick LE, Akagi K, Davies MN, Diers MD, Rodriguez FJ, Bender AM, Tieu C, Matise I, et al.: Whole-body sleeping beauty mutagenesis can cause penetrant leukemia/lymphoma and rare high-grade glioma without associated embryonic lethality. Cancer Res 2009,69(21):8429–8437. 10.1073/pnas.1018012108
Keng VW, Villanueva A, Chiang DY, Dupuy AJ, Ryan BJ, Matise I, Silverstein KA, Sarver A, Starr TK, Akagi K, Tessarollo L, et al.: A conditional transposon-based insertional mutagenesis screen for genes associated with mouse hepatocellular carcinoma. Nat Biotechnol 2009,27(3):264–274. 10.1158/0008-5472.CAN-09-4674
Starr TK, Scott PM, Marsh BM, Zhao L, Than BL, O’Sullivan MG, Sarver AL, Dupuy AJ, Largaespada DA, Cormier RT: A sleeping beauty transposon-mediated screen identifies murine susceptibility genes for adenomatous polyposis coli (apc)-dependent intestinal tumorigenesis. Proc Natl Acad Sci USA 2011,108(14):5765–5770. 10.1038/nature10825
Bender AM, Collier LS, Rodriguez FJ, Tieu C, Larson JD, Halder C, Mahlum E, Kollmeyer TM, Akagi K, Sarkar G, Largaespada DA, et al.: Sleeping beauty-mediated somatic mutagenesis implicates csf1 in the formation of high-grade astrocytomas. Cancer Res 2010,70(9):3557–3565. 10.1038/nature10949
Wu X, Northcott PA, Dubuc A, Dupuy AJ, Shih DJ, Witt H, Croul S, Bouffet E, Fults DW, Eberhart CG, Garzia L, et al.: Clonal selection drives genetic divergence of metastatic medulloblastoma. Nature 2012,482(7386):529–533. 10.1038/nm0598-619
Clifford SC: Cancer genetics: evolution after tumour spread. Nature 2012,482(7386):481–482.
Hahn H, Wojnowski L, Zimmer AM, Hall J, Miller G, Zimmer A: Rhabdomyosarcomas and radiation hypersensitivity in a mouse model of gorlin syndrome. Nat Med 1998,4(5):619–622. 10.1073/pnas.0710052104
Robinson G, Parker M, Kranenburg TA, Lu C, Chen X, Ding L, Phoenix TN, Hedlund E, Wei L, Zhu X, Chalhoub N, et al.: Novel mutations target distinct subgroups of medulloblastoma. Nature 2012.
Mattison J, van der Weyden L, Hubbard T, Adams DJ: Cancer gene discovery in mouse and man. Biochim Biophys Acta 2009,1796(2):140–161. 10.1371/journal.pone.0003088
Cho YJ, Tsherniak A, Tamayo P, Santagata S, Ligon A, Greulich H, Berhoukim R, Amani V, Goumnerova L, Eberhart CG, Lau CC, et al.: Integrative genomic analysis of medulloblastoma identifies a molecular subgroup that drives poor clinical outcome. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 2011,29(11):1424–1430. 10.1002/path.2514
Northcott PA, Korshunov A, Witt H, Hielscher T, Eberhart CG, Mack S, Bouffet E, Clifford SC, Hawkins CE, French P, Rutka JT, et al.: Medulloblastoma comprises four distinct molecular variants. Journal of clinical oncology: official journal of the American Society of Clinical Oncology 2011,29(11):1408–1414. 10.1038/ng1532
Beroukhim R, Getz G, Nghiemphu L, Barretina J, Hsueh T, Linhart D, Vivanco I, Lee JC, Huang JH, Alexander S, Du J, et al.: Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma. Proc Natl Acad Sci USA 2007,104(50):20007–20012. 10.1038/nature08712
Kool M, Koster J, Bunt J, Hasselt NE, Lakeman A, van Sluis P, Troost D, Meeteren NS, Caron HN, Cloos J, Mrsic A, et al.: Integrated genomics identifies five medulloblastoma subtypes with distinct genetic profiles, pathway signatures and clinicopathological features. PLoS One 2008,3(8):e3088. 10.1038/nm.2610
Fattet S, Haberler C, Legoix P, Varlet P, Lellouch-Tubiana A, Lair S, Manie E, Raquin MA, Bours D, Carpentier S, Barillot E, et al.: Beta-catenin status in paediatric medulloblastomas: correlation of immunohistochemical expression with mutational status, genetic profiles, and clinical characteristics. J Pathol 2009,218(1):86–94. 10.1073/pnas.0506580102
Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Dalla Favera R, Califano A: Aracne: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinforma 2006., 7: Suppl 1(S7
Basso K, Margolin AA, Stolovitzky G, Klein U, Dalla-Favera R, Califano A: Reverse engineering of regulatory networks in human b cells. Nat Genet 2005,37(4):382–390. 10.1038/sj.onc.1210467
Carro MS, Lim WK, Alvarez MJ, Bollo RJ, Zhao X, Snyder EY, Sulman EP, Anne SL, Doetsch F, Colman H, Lasorella A, et al.: The transcriptional network for mesenchymal transformation of brain tumours. Nature 2010,463(7279):318–325. 10.1016/j.ccr.2008.07.003
Della Gatta G, Palomero T, Perez-Garcia A, Ambesi-Impiombato A, Bansal M, Carpenter ZW, De Keersmaecker K, Sole X, Xu L, Paietta E, Racevskis J, et al.: Reverse engineering of tlx oncogenic transcriptional networks identifies runx1 as tumor suppressor in t-all. Nat Med 2012,18(3):436–440. 10.1016/j.ccr.2008.07.005
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 2005,102(43):15545–15550.
Lee Y, Kawagoe R, Sasai K, Li Y, Russell HR, Curran T, McKinnon PJ: Loss of suppressor-of-fused function promotes tumorigenesis. Oncogene 2007,26(44):6442–6447. 10.1016/j.stem.2011.07.002
Yang ZJ, Ellis T, Markant SL, Read TA, Kessler JD, Bourboulas M, Schuller U, Machold R, Fishell G, Rowitch DH, Wainwright BJ, et al.: Medulloblastoma can be initiated by deletion of patched in lineage-restricted progenitors or stem cells. Cancer Cell 2008,14(2):135–145.
Schuller U, Heine VM, Mao J, Kho AT, Dillon AK, Han YG, Huillard E, Sun T, Ligon AH, Qian Y, Ma Q, et al.: Acquisition of granule neuron precursor identity is a critical determinant of progenitor cell competence to form shh-induced medulloblastoma. Cancer Cell 2008,14(2):123–134. 10.1038/sj.onc.1207818
Pang ZP, Yang N, Vierbuchen T, Ostermeier A, Fuentes DR, Yang TQ, Citri A, Sebastiano V, Marro S, Sudhof TC, Wernig M: Induction of human neuronal cells by defined transcription factors. Nature 2011,476(7359):220–223. 10.1016/S0002-9440(10)62335-8
Ambasudhan R, Talantova M, Coleman R, Yuan X, Zhu S, Lipton SA, Ding S: Direct reprogramming of adult human fibroblasts to functional neurons under defined conditions. Cell Stem Cell 2011,9(2):113–118. 10.1158/0008-5472.CAN-08-2135
Hahn H, Wojnowski L, Specht K, Kappler R, Calzada-Wack J, Potter D, Zimmer A, Muller U, Samson E, Quintanilla-Martinez L: Patched target igf2 is indispensable for the formation of medulloblastoma and rhabdomyosarcoma. J Biol Chem 2000,275(37):28341–28344. 10.1038/onc.2011.379
Rao G, Pedone CA, Del Valle L, Reiss K, Holland EC, Fults DW: Sonic hedgehog and insulin-like growth factor signaling synergize to induce medulloblastoma formation from nestin-expressing neural progenitors in mice. Oncogene 2004,23(36):6156–6162. 10.1016/j.bbagrm.2008.08.013
Hartmann W, Koch A, Brune H, Waha A, Schuller U, Dani I, Denkhaus D, Langmann W, Bode U, Wiestler OD, Schilling K, et al.: Insulin-like growth factor ii is involved in the proliferation control of medulloblastoma and its cerebellar precursor cells. Am J Pathol 2005,166(4):1153–1162. 10.1159/000274458
Corcoran RB, Bachar Raveh T, Barakat MT, Lee EY, Scott MP: Insulin-like growth factor 2 is required for progression to advanced medulloblastoma in patched1 heterozygous mice. Cancer Res 2008,68(21):8788–8795.
Fernandez LA, Squatrito M, Northcott P, Awan A, Holland EC, Taylor MD, Nahle Z, Kenney AM: Oncogenic yap promotes radioresistance and genomic instability in medulloblastoma through igf2-mediated akt activation. Oncogene 2012,31(15):1923–1937. 10.1158/0008-5472.CAN-05-3085
Perez-Casellas LA, Wang X, Howard KD, Rehage MW, Strong DD, Linkhart TA: Nuclear factor i transcription factors regulate igf binding protein 5 gene transcription in human osteoblasts. Biochim Biophys Acta 2009,1789(2):78–87. 10.1038/nprot.2009.64
Fernandez C, Tatard VM, Bertrand N, Dahmane N: Differential modulation of sonic-hedgehog-induced cerebellar granule cell precursor proliferation by the igf signaling network. Dev Neurosci 2010,32(1):59–70. 10.1038/ng.990
Svaren J, Ehrig T, Abdulkadir SA, Ehrengruber MU, Watson MA, Milbrandt J: Egr1 target genes in prostate carcinoma cells identified by microarray analysis. J Biol Chem 2000,275(49):38524–38531. 10.1006/meth.2001.1262
Collier LS, Largaespada DA: Hopping around the tumor genome: transposons for cancer gene discovery. Cancer Res 2005,65(21):9607–9610. 10.1093/nar/gng015
Uren AG, Mikkers H, Kool J, van der Weyden L, Lund AH, Wilson CH, Rance R, Jonkers J, van Lohuizen M, Berns A, Adams DJ: A high-throughput splinkerette-pcr method for the isolation and sequencing of retroviral insertion sites. Nat Protoc 2009,4(5):789–798. 10.1093/bioinformatics/bti551
March HN, Rust AG, Wright NA, ten Hoeve J, de Ridder J, Eldridge M, van der Weyden L, Berns A, Gadiot J, Uren A, Kemp R, et al.: Insertional mutagenesis identifies multiple networks of cooperating genes driving intestinal tumorigenesis. Nat Genet 2011,43(12):1202–1209. 10.1038/ng0506-500
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative pcr and the 2(-delta delta c(t)) method. Methods 2001,25(4):402–408. 10.1093/bioinformatics/btp371
Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP: Summaries of affymetrix genechip probe level data. Nucleic Acids Res 2003,31(4):e15. 10.1093/bioinformatics/btl646
Maere S, Heymans K, Kuiper M: Bingo: A cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 2005,21(16):3448–3449. 10.1093/bioinformatics/btr260
Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP: Genepattern 2.0. Nat Genet 2006,38(5):500–501. 10.1073/pnas.0701068104
Bengtsson H, Wirapati P, Speed TP: A single-array preprocessing method for estimating full-resolution raw copy numbers from all affymetrix genotyping arrays including genomewidesnp 5 & 6. Bioinformatics 2009,25(17):2149–2156.
Venkatraman ES, Olshen AB: A faster circular binary segmentation algorithm for the analysis of array cgh data. Bioinformatics 2007,23(6):657–663.
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP: Molecular signatures database (msigdb) 3.0. Bioinformatics 2011,27(12):1739–1740.
Tamayo P, Scanfeld D, Ebert BL, Gillette MA, Roberts CW, Mesirov JP: Metagene projection for cross-platform, cross-species characterization of global transcriptional states. Proc Natl Acad Sci USA 2007,104(14):5959–5964.
R_Development_Core_Team: R: A language and environment for statistical computing. In R Foundation for Statistical Computing. Vienna, Austria; 2008. ISBN 3–900051–07–0, URL http://www.R-project.org
The financial support of Cancer Research UK (grants C7679/A7311 and C8464/A13457) is gratefully acknowledged. DJA is supported by Cancer Research UK and the Wellcome Trust (grant 0980051). LC is supported by Cancer Research UK (grant C34648/A12054), Samantha Dickson Trust (grant SDR004X) and Christopher’s Smile (grant CXC002H). H.H.A-B. was supported by a studentship from the Government of Kuwait. H.A-A. was supported by a studentship from the Government of Saudia Arabia.
The authors declared that they have no competing interests.
MSJ, SCC, CPFR, and HP conceived the research. MŁ and MSJ performed all animal husbandry. MŁ, HAl-A, HHAl-B, HS and EM performed all molecular biological analysis of tumours apart from the GS-FLX sequencing which was performed by JC. CMB performed pathological analysis and classification of all tumours, ADB performed additional pathological review and classification of liver tumours. MS-K performed all murine survival analyses, LC provided MYCN model tumour DNAs, AGR and DJA performed all statistical analysis of transposon inserts, and DW performed all downstream bioinformatic analyses. MSJ, SCC and DW interpreted the data and wrote the manuscript with input from all authors. All authors approved the final version of the manuscript.
Electronic supplementary material
Additional file 1: Table S1: A. Tumours in experimental and control cohorts. Median age of onset and range of onset is given for common tumour types. Genotypes: Experimental – Ptch+/-;SB11+/-;T2Onc+/-. Predisposition - Ptch+/-;T2Onc+/-. Transposition - SB11+/-;T2Onc+/-. 1Parenchymal brain lesions were consistent with gliomas and included one with pseudopalisading necrosis consistent with glioblastoma multiforme. 2Morphological and immunohistochemical analysis established that ~50% of haematological malignancies were precursor T-cell lymphoblastic lymphomas/leukaemias, ~5% were confirmed myeloid neoplasms, and the remainder were poorly differentiated haematological neoplasms, mostly of probable myeloid lineage. 3RMSs developed most frequently on the hindquarters and lower limbs. 4Adenomas showed mild dysplasia with frequent intralesional steatosis, although sufficient atypia and eosinophilic cytoplasmic inclusions were observed in 2 nodules to warrant classification as hepatocellular carcinoma. All but 2 were identified in animals where other malignancies were also present (4 RMS, 10 Haematological, 1 with RMS+Haematological, and 2 unclassified). All animals with liver adenomas were male, a highly significant bias (p<0.0001). 5Hydrocephalus has previously been reported in this model  B. Transposon Insertions by sample group. N – number of samples analysed. All inserts – total number of inserts after initial mapping and filtering. SSIs removed – Number of Same Site Inserts (present in same dinucleotide in different samples) removed. STIs removed – Number of Same Tumour Inserts (multiple inserts present in same CIS from same sample) removed. 1One tumour did not yield high quality DNA and could not be used for CIS identification. For details of filtering, see Additional file 7. (PDF 13 KB)
Additional file 2: Figure S1: MB T2Onc insertions in Tgif2. A. Genomic organisation of Tgif2 is shown with orientation and location of T2Onc insertions indicated, together with PCR amplicons from tumour genomic DNA using primers specific for T2Onc and Exon 2. An amplicon of the expected size is obtained from each tumour, confirming that all inserts defined by the GS-FLX sequencing are present. B. Identification of chimeric T2Onc/Tgif2 transcripts. The schematic shows the relative position of primers within T2Onc and Exons 1 and 2 of Tgif2, together with PCR amplicons obtained from tumour cDNA templates. Top panel – primers E1 and E2. Tumours with inserts show reduced intensity of the expected E1-E2 transcript. Middle panel – primers SD to E2. Chimeric T2Onc/Tgif2 transcripts are observed only in tumours with inserts. Bottom panel – Gapdh loading control. Approximate size of expected amplicons is shown in all cases. M – 100 bp ladder, C – no DNA controls. For primer sequences, see Additional file 7 and Keng et al. . (PDF 100 KB)
Additional file 3: Table S2: Multivariate analysis of human survival data. Models used the data from Cho Tsherniak et al.  and incorporated as variables their “high-risk clinical” factor (which incorporates multiple risk factors including the presence of metastases), and the presence of LCA pathology. WNT subgroup tumours were removed as they are known to have a distinct outcome. For MYT1L, low expression was retained as a significant multivariate factor (Relative Risk=2.8 p=0.019) alongside LCA (Relative Risk = 2.983 p=0.013). “Clinical Risk Group” was not significant in combination with Low MYT1L expression which was preferentially retained in a forward stepwise model. Likewise, for PTEN, low expression was significantly associated with increased risk of death (Relative Risk = 2.6 p = 0.031) independent of LCA pathology (Relative Risk = 3.6 p = 0.003). All models have 1 degree of freedom. (PDF 303 KB)
Additional file 4: Figure S2: Combined data from Cho, Tsherniak et al.  and Northcott, Korshunov et al.  were analysed. A. Significant overrepresentation of chromosome 10 loss at the PTEN locus and B. significant overrepresentation of chromosome 9 and Chromosome 11 gain at the NFIB and TMEM45B loci respectively. Dotted lines indicate position of centromeres. Upper values in each panel are G-scores, lower values are q-values. C. SNP array data showing localised gain of TMEM45B. Approximate cytogenetic positions are given above the schematic, with Megabase position on chromosome 10 given below it. The region of gain, although small, includes additional genes with known roles in cancer (ETS1, FLI1) and neuronal development (PRDM10). (PDF 95 KB)
Additional file 5: Table S3: Gene Expression in tumours versus normal cerebellum. Comparisons are shown between all tumours and normal cerebellum (left) and between SHH subgroup tumours only and normal cerebellum (right), using data from Cho, Tsherniak et al. . Genes and associated Affymetrix probe IDs are shown. AveExpr - mean log2 expression levels, t – t test statistic, P.Value – Raw p-value, adj.P.Val – p-value adjusted for multiple tests. (PDF 293 KB)
Additional file 6: Table S4: Cellular Component and Molecular Function Ontologies over-represented in Network. P-value correction was achieved using a Benjamini and Hochberg False Discovery rate of 0.05. Gene frequencies in the network list and genome are shown. Ontologies marked with an asterisk are not significantly enriched when CIS genes are excluded. (XLSX 15 KB)
Additional file 8: Table S5: Expression differences in Network v Non-Network tumours. Genes showing the highest average fold change between tumours with 1 or more insert in a Network CIS and tumours with no network insertions are shown, together with Illumina Probe IDs. All p-values are Student t-tests with 28 degrees of freedom. (PDF 292 KB)
Additional file 10: Figure S3: GSEA plots for human tumours ranked by metagene score (left panels) and mouse tumours ranked by fold change of expression in tumours with network hits compared to predisposition control MBs not exposed to mutagenesis (right panels). ES-Enrichment Score, RM-Ranked list Metric. A. Human NES=1.37 p=0.003; Mouse NES=-1.62 p=0.005. B. Human NES = -2.33, p<0.001; Mouse NES = 1.80, p<0.001. C. Human NES = 1.74, p=0.002; Mouse NES = -1.41, p=0.067. D. Human NES = 1.36, p=0.07; Mouse NES = -1.68, p<0.001. E. Human NES = -2.29, p<0.001; Mouse NES = 1.77, p = 0.002. F. Human NES = 2.19, p<0.001; Mouse NES= - 1.68, p=0.002. For details, see Additional file 9: Table S6. (TIFF 3 MB)
Additional file 11: Table S7: CIS and Network Genes with functions of potential relevance to MB. Genes implicated in neuronal structure, neuronal development, cell cycle, apoptosis or cancer are listed together with relevant citations. A. CIS genes. Network CISs are shown in bold. B. Other transcriptional regulators and neuronal genes in network (from Figure 3 and Additional file 6: Table S4). (PDF 41 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.