Skip to main content

The carboxyl termini of RAN translated GGGGCC nucleotide repeat expansions modulate toxicity in models of ALS/FTD


An intronic hexanucleotide repeat expansion in C9ORF72 causes familial and sporadic amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). This repeat is thought to elicit toxicity through RNA mediated protein sequestration and repeat-associated non-AUG (RAN) translation of dipeptide repeat proteins (DPRs). We generated a series of transgenic Drosophila models expressing GGGGCC (G4C2) repeats either inside of an artificial intron within a GFP reporter or within the 5′ untranslated region (UTR) of GFP placed in different downstream reading frames. Expression of 484 intronic repeats elicited minimal alterations in eye morphology, viability, longevity, or larval crawling but did trigger RNA foci formation, consistent with prior reports. In contrast, insertion of repeats into the 5′ UTR elicited differential toxicity that was dependent on the reading frame of GFP relative to the repeat. Greater toxicity correlated with a short and unstructured carboxyl terminus (C-terminus) in the glycine-arginine (GR) RAN protein reading frame. This change in C-terminal sequence triggered nuclear accumulation of all three RAN DPRs. A similar differential toxicity and dependence on the GR C-terminus was observed when repeats were expressed in rodent neurons. The presence of the native C-termini across all three reading frames was partly protective. Taken together, these findings suggest that C-terminal sequences outside of the repeat region may alter the behavior and toxicity of dipeptide repeat proteins derived from GGGGCC repeats.


Nucleotide repeat expansions are a common cause of neurodegenerative disease. Transcribed repetitive sequences can elicit toxicity through at least two possible mechanisms. First, repeats as RNA can bind to and sequester specific proteins and preclude them from performing their normal functions [1]. Alternatively, repeats can be translated into proteins that accumulate and elicit toxicity in model systems [2]. Recently, it was recognized that translation of these repetitive sequences can occur in the absence of an AUG start codon through a process known as repeat-associated non-AUG (RAN) translation [3, 4]. As such, repeat expansions located outside of protein coding open reading frames can also produce toxic proteins.

In 2011, two groups simultaneously identified GGGGCC repeat expansions as a cause of amyotrophic lateral sclerosis and frontotemporal dementia (C9ALS/FTD) [5, 6]. This mutation is common, explaining upwards of 40% of all familial cases of ALS and FTD and 10% of all sporadic cases of these conditions. Most people have only a few repeats, with a cutoff for normal defined as less than 22–28 [7]. Pathologic expansions are usually hundreds to thousands of repeats. The repeat resides in the first intron of C9ORF72, a gene that is highly expressed in the brain but whose function remains unclear, though evidences indicate the encoded protein may act as guanine exchange factors for activating Rab proteins [8,9,10]. It is bi-directionally transcribed, such that both G4C2 and C4G2 repeat RNAs are produced, and RNA foci from both transcripts are observed in patient-derived cells and tissues [11,12,13]. Moreover, products of RAN translation are produced from both of these transcripts, leading to 6 different dipeptide repeat (DPR) containing proteins [11, 12, 14, 15]. Repeat expansions also impact C9ORF72 transcription, isoform choice, and splicing [5, 14, 16,17,18].

A series of model systems have provided insights into C9ALS/FTD pathogenesis. Drosophila models demonstrate repeat-associated toxicity when the repeat is placed within the 5′ untranslated region (UTR; hereafter referred to as 5′ leader) of a transgene such as GFP [19]. In contrast, long repeats interrupted by stop codons elicit little toxicity [20, 21]. Flies expressing dipeptide repeat proteins via AUG initiated translation and independent of G4C2 repeat sequences are also toxic when expressed in flies and cells in some, but not all reading frames. Specifically, toxicity appears greatest with expression of glycine-arginine and proline-arginine repeat proteins in Drosophila, with evidence for a role of glycine-alanine proteins in mammalian neurons and model systems [20,21,22,23,24,25]. Mouse models using interventricular adenoviral delivery of 66 or 149 G4C2 repeats in isolation exhibit RNA foci, RAN translation, neurodegeneration, and motor phenotypes [26]. Taken together, these studies do not preclude a role for the repeat RNA in disease toxicity, but suggest that RAN translation directly contributes to neurodegeneration in C9ALS/FTD.

Less attention has thus far been placed on the native sequence context of the repeat. The initial screening of expansion of G4C2 repeats near ALS loci found that though such repeats are quite common in human genome, the expansion of G4C2 repeats were only detectable in C9ORF72 gene but not in other ALS loci genes [27], indicating that the context of the expanded repeats at C9ORF72 gene are quite unique for expanded repeat toxicity. The sequence context may well be critical to determining the final location of the repeat RNA, its interactions with translational machinery, and the RNA binding proteins with which it interfaces. Tran et al. addressed this issue in Drosophila by providing evidence that (a) repeats located within introns are less toxic than repeats placed in 5’m7G capped and polyadenylated transcripts, and (b) this enhanced toxicity was associated with increased DPR production [21, 28]. However, sequence context may also alter the final translated peptides produced by RAN translation and this could influence their toxicity. For example, in Huntington disease, expression of a huntingtin (HTT) isoform containing exon 1 alone with the CAG repeat produces a highly toxic and aggregate-prone factor that accumulates in patient brains [29, 30]. More recently, Sellier et al. demonstrated a role for the carboxyl terminus (C-terminus) of FMRpolyG, a RAN translated protein derived from a CGG repeat expansion in the Fragile X gene, as critical for repeat toxicity in models of Fragile X-associated Tremor Ataxia Syndrome (FXTAS) [31]. We therefore sought to explore whether the repeat location within a transcript and the sequences 3′ to the repeat that might influence the final RAN translated products, therefore influence the GGGGCC repeat toxicity in disease models.

Here we show that expression of up to 484 interrupted G4C2 repeats within a Drosophila intron exhibit minimal toxicity, largely consistent with prior studies [20, 21, 28]. In contrast, relatively short (28) repeats placed in the 5′ UTR of GFP elicited marked toxicity in Drosophila in some, but not all, reading frames relative to a downstream reporter. This differential toxicity correlated with the size and content of the C-terminus of the GR reading frame: with larger C-termini or GFP fusions leading to low toxicity. The relative toxicity observed in these Drosophila models correlates with nuclear accumulation of DPR-containing proteins and is recapitulated in rodent neurons. The inclusion of the native C-termini provides some neuroprotective effects. Taken together, these findings suggest that repeat context has a significant influence on its toxicity and identifies a new role for the C-termini in modulating DPR toxicity across model systems.

Materials and methods

Construction of mammalian transfection vectors

Constructs for mammalian expression experiments were generated by digesting GFP (G4C2)71 pcDNA3.2 (a gift from Christopher Shaw) with XbaI and inserting the resulting (G4C2)71 containing fragment into pcDNA3.1 containing GFP. One to 2 nucleotides were inserted upstream of GFP to result in each reading frame. Using NheI and PmeI restriction enzymes, the (G4C2)71 GFP fragment was inserted into pGW for neuronal expression. To generate plasmids with or without the native C-termini, a small fragment was inserted in place of GFP using AscI and MfeI in pGW. The ATG-V5 fragments were inserted upstream of the repeat using Acc65I and NotI in pGW.

Drosophila stocks and genetics

All flies were maintained with standard food and culture conditions at 25 °C, while all crosses were made and maintained at 29 °C unless otherwise stated. Fly lines acquired from the Bloomington Stock Center were: GMR-GAL4 (#8605), and UAS-GFP (multiple lines). Other lines used were the ubiquitous driver Actin5C-GAL4/CyO (a gift from Zhe Han), the motor neuron specific driver OK6-GAL4 (a gift from Cathy Collins), and a RU-486 inducible Geneswitch Tubulin-GAL4 (Tub5) driver line (gift from Scott Pletcher).

The sequences for intronic and 5′ leader fly lines are detailed in Supplemental Table S1. Briefly, for intronic repeat lines, a mini intron from the fly Prospero gene [32] was inserted into the middle of GFP, synthesized commercially (Genewiz, NJ), and cloned into the EcoRI-XbaI site of pUAST. An intronic fragment with either 3 or 28 G4C2 repeats along with 40 nt of upstream and 260 nt of downstream intronic sequence from human C9ORF72 was PCR amplified according to [33] from genomic DNA and inserted into the intron. The 21 repeat tiling fragment was generated using primer NotI-C9 AnchorR+PspOMI tiling (Supplemental Table S2). The 28 repeat insert was generated using primer NotI-C9R + XhoI-C9F directly from genomic DNA of an ALS patient with 28 repeats. Repeat blocks of (G4C2)21 bracketed by NotI and PspOMI restriction sites were then serially inserted next to this repeat, producing concatamerized intermediate 49, 70, 91, and 121 repeat containing constructs (Table S1). This element was then further concatamerized to generate lines with 242 (GFP-iC242) or 484 (GFP-iC484) interrupted repeats.

5′ leader repeat lines were generated by cloning the same 28 repeat sequence described above with approximately 30 nt on either side of intronic sequence into the 5’UTR of pEGFP-N1 (Takara Bio USA, CA) in the 0+ reading frame. The repeat and GFP were then sub cloned into the NotI site in pUAST. To induce frameshifts, annealing primers were inserted into the AgeI site between the repeat and the AUG of GFP (Supplemental Table S2). pUAST vectors carrying the respective inserts were used to generate transgenic lines by standard p-element insertion (BestGene, CA).

For the fly lines with or without native C-termini, the sequence flanking the (G4C2)69 repeats in constructs for neuron transfection were digested by KpnI and XbaI, and the inserts were cloned to the vector pUAST-AttB (Drosophila Genomics Resource Center, IN) digested with the same set of enzymes. The resulting pUAST-AttB vectors were then used for site-specific transgenesis using PhiC31 integrase technique to locus AttP40 [34] (BestGene, CA). All constructs were verified by Sanger sequencing and agarose gel sizing (Supplemental Figure S1A).

Eye phenotype imaging and quantification

Representative fly eye images were taken by Leica M125 stereomicroscope and photographed with a Leica DFC425 digital camera as previously described [35]. Eye morphology of 1-2d post eclosion flies was quantitatively scored as previously described [36]. Briefly, we used the following criteria: supernumerary inter-ommatidial bristles, abnormal bristle orientation, ommatidium fusion, ommatidium pitting, disorganization of ommatidial array, and retinal collapse. The presence of each feature was given 1 point. An additional 2 points were given if more than 5% of the eyes were affected or 4 points if more than 50% of eyes were affected. Higher scores mean the eyes were more degenerated. Over 100 flies were scored per genotype in at minimum of three independent crosses. Scores were calculated and are presented as mean ± SEM.

Fly viability and eclosion rates

Analysis of eclosion rates were performed as described [37]. Each transgenic line was crossed to the ubiquitous driver Actin5C-GAL4 (act5C-GAL4) balanced over a marker chromosome (CyO), on standard food at 29 °C. If the transgene elicited no toxicity, then 50% of progeny should have the CyO marker and 50% express the transgene. Over 100 flies of each genotype were scored over multiple crosses. The relative percent progeny carrying the transgene were expressed as a percent of total eclosed flies. Analysis of viability post eclosion was performed as described [38].

Larval crawling assay

Transgenic lines were crossed to the motor neuron specific driver line OK6-GAL4 at 29 °C and the 3rd instar larvae with desired genotypes were collected. The crawling assay was performed with slight modification from [39]: 3rd instar larvae were collected by adding 20% sucrose on the food, briefly washed with MilliQ water, and then maintained in a water droplet until being analyzed. The larvae were accommodated at room temperature on 2% agarose gel in 10 cm petri dish for 1 min, and then the crawling distance in the next 1 min was recorded and processed by ImageJ. At least 50 larvae from the appropriate genotype were recorded and pooled as the same genotype.

Drosophila lifespan assay

The UAS transgenic lines were crossed to Tub5-GAL4 (Tub5) Geneswitch driver flies on standard food absent of RU-486 at 29 °C. Adult offspring of the desired genotypes were collected 2-3d after eclosion and transferred to standard fly food containing 200 μM RU-486 without yeast granules. The flies were transferred to fresh food with drug every 3-4d. Each genotype started with at least 4 vials of 25 flies/vial (2 vials of males and 2 vials of females) and the survival was determined daily or every other day for 50d or until all flies were dead. For each genotype, at least 2 individual lines were examined.

Generation of specific dipeptide antibodies

The rabbit polyclonal antibodies were generated by Abclonal (Cambridge, MA). Synthetic peptides corresponding to the repeat region containing 6 repeats were generated. All antibodies were affinity purified from anti-sera prior to use. Characterization of GA and GR specific antibodies was performed as previously described [40]. Characterization of the GP specific antibody can be seen in Supplemental Figure S2.

Western blotting

Protein quantification was performed as previously described and imaged on standard film [41, 42]. Briefly, fly head or cell lysates were lysed in RIPA buffer with protease inhibitors (Sigma Aldrich, MO) and passed through a 27-gauge needle to shear DNA. Equal amounts of protein were run on a 12% SDS polyacrylamide gel. After transfer to PVDF membrane, blots were incubated with the following antibodies: monoclonal mouse anti-GFP (Sigma Aldrich, MO; clone 7.1 and 13.1, 1:1000), mouse anti-Tubulin (DSHB, IA; clone E7, 1:1000), mouse anti-V5 (Thermo Fisher, CA; 1:1000), rabbit anti-GA (Abclonal, MA 1:100), rabbit anti-GP (Abclonal, MA 1:5000), rabbit anti-GR (Abclonal, MA; 1:5000) or mouse anti-β-Actin (Sigma Aldrich, MO; 1:5000). At least three independent experiments were performed and scanned films were processed and quantified using ImageJ software.

In situ hybridization

1-3d post eclosion fly heads from each genotype crossed to GMR-GAL4 driver were isolated, immediately frozen in OCT media, and then cryosectioned to 10 μm. Transverse sections were then fixed with 4% paraformaldehyde in 1x PBS for 15 min, washed 3x in 1x PBS, and permeabilized with 2% acetone for 5 min. In situ hybridization was performed as follows: sections were pre-hybridized with 50% formamide in 2x SSC for 30 min at room temperature and then hybridized with 500 μL hybridization solution in a light proof box for overnight at 37 °C. The hybridization solution was as follows: 0.6 ng/mL Cy5 labeled 2′-O-Me-(CCCCGG)5 RNA probe (IDT DNA Technologies, IA), 0.02% BSA, 132 mg/mL yeast RNA (Sigma Aldrich, MO), 1 μL RNAse inhibitor (Sigma Aldrich, MO), 50% formamide in 2x SSC. After hybridization, slides were washed with 3 × 30 min with 50% formamide in 0.5x SSC at 56 °C, followed by 3x10min washes with 0.5x SSC wash at room temperature. After washing, samples were dried for 5 min, incubated with 100 μL Prolong Gold with DAPI (Thermo Fisher, MA) for 1 h, and examined on an Olympus FV1000 confocal microscope with identical laser settings for each slide. Images were overlaid and quantified using ImageJ software.

Fly based immunofluorescence

1-3d post eclosion fly heads from each genotype crossed to GMR-GAL4 were isolated, immediately frozen in OCT media, and then cryosectioned to 10 μm. Transverse sections were then fixed with 4% paraformaldehyde in 1x PBS for 15 min, washed 3x5min with 1x PBS, and permeabilized with 0.1% triton X-100 in 1x PBS for 5 min. Slides were pre-incubated with 5% normal goat serum in 1XPBS for 1 h at 4 °C, then incubated with antibodies of GA (1:100), GP (1:500), or GR (1:500) diluted in 5% normal goat serum in 1XPBS at 4 °C overnight. Slides were washed 2 × 5 min with 1x PBS. Slides were then incubated with Alexa fluo-568 goat anti-rabbit secondary antibody (Abcam, Cambridge, MA) diluted in 5% normal goat serum in 1x PBS for 30 min at room temperature in a dark box. After 3 × 5 min washes, slides were dried 5 min and then incubated with 100 μL prolong gold with DAPI (Thermal Fisher, MA) for 1 h and then were sealed with nail polish. Slides were examined with Olympus FV1000 confocal microscope with identical laser setting for each slide. Images were overlaid in ImageJ software, and 1 random-selected region of interest (ROI) in retina area of one fly head was analyzed and at least 10 fly heads were analyzed for each genotype.

Cell based immunocytochemistry

COS-7 cells were maintained 37 °C in 5% CO2 incubators. Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 10% fetal bovine serum and 1% Pen-Strep was used as culture media. For immunofluorescent detection of GFP, cells were cultured on 4 well chamber slides. Cells were transfected using Lipofectamine LTX with Plus Reagent (Thermo Fisher, MA) using manufacturer’s protocol. 48 h after transfection, cells were fixed with 4% paraformaldehyde for 15 min, washed with 1x PBS, and permeabilized 0.1% triton X-100. Cells were then blocked with 5% normal goat serum in 1x PBS containing 0.1% triton X-100 for 1 h and incubated with GFP (1:1000) plus GA (1:100), GP (1:500), or GR (1:500) antibodies overnight at 4 °C. Slides were washed with 1x PBS and incubated with AlexaFluor 488 labeled goat anti-mouse (Thermo Fisher, MA; 1:500) and Alexa Fluor 555 labeled goat anti-rabbit (Thermo Fisher, MA; 1:500) antibodies and visualized with an Olympus epifluorescence microscope with Slidebook 5.5 software with identical fluorescent settings for each slide.

RNA isolation and qRT-PCR

Total RNA was extracted from 15 fly heads of each genotype using Trizol (ThermoFisher, MA) and quantified by Nano-Drop (Thermo Fisher, MA). 1 μg of total RNA was then reverse-transcribed to cDNA by iScript cDNA synthesis kit (Bio-Rad, CA). The primers used are detailed in Supplemental Table S2. PCR analysis was performed using iQ SYBR Green Supermix on a MyiQ Single Color qPCR system (BioRad, CA). All runs included a standard dilution curve representing 2x to 0.02x of the RNA concentration utilized for all primer sets to ensure linearity. Equivalent efficiency of individual primer sets was confirmed prior to data analysis. The level of GFP was normalized to RPL32 mRNA for each sample run and expressed as a ratio of levels to GFP-iC3 lines (fold control expression) unless otherwise stated. All samples were run in triplicate in each qPCR run and all data represent at least three independent experiments.

Primary neuron cultures & transfection

All mammalian rodent work was approved by the Institutional Animal Care and Use Committee (IACUC) at the University of Michigan. Primary mixed cortical neurons were dissected from embryonic day 19–20 Long-Evans rat pups and cultured at 6.0 × 105 cells/mL in 96 well cell culture plates as previously described [43,44,45]. Brains of a single litter were combined to maximize cell counts, resulting in neurons from a mixed population of male and female pups. Neurons were cultured in NEUMO photostable medium containing SOS supplement (Cell Guidance Systems, MO) at 37 °C in 5% CO2. After 4d in culture, neurons were transfected with 0.2 μg DNA (total) and 0.5 μL Lipofectamine 2000 (Thermo Fisher, CA) per well in a 96 well plate, as described previously [45]. Neurons were incubated with the DNA/Lipofectamine 2000 mix for 20 min at 37 °C before rinsing. The remainder of the protocol followed the manufacturer’s protocol.

Longitudinal fluorescence microscopy

Automated longitudinal fluorescence microscopy began 24 h post-transfection for 10d, as previously described [43,44,45]. Images were acquired using an inverted microscope (Nikon Instruments, NY) with a 20x objective lens, a PerfectFocus system, a Lambda XL Xenon lamp (Sutter Instruments, CA) with 5 mm liquid light guide (Sutter Instruments, CA), and either an Andor iXon3 897 EMCCD camera or Andor Zyla4.2 (+) sCMOS camera. All stage, shutter, and filter wheel movements were done using a custom code written in μManager, ImageJ [45].

Statistical analysis

For most fly comparisons, scores were analyzed using a non-parametric Kruskal-Wallis ANOVA with Dunn’s correction for multiple comparisons. A Log-rank (Mantel-Cox) Chi square test was performed for Drosophila survival rates. Error bars represent the standard error of the mean except for proportion numbers, where the error bars represent the 95% confidence interval.


Generation of an intronic repeat model of C9ALS/FTD in Drosophila

To create an intronic repeat model of C9ALS/FTD, we inserted the first intron of the fly Prospero gene [32] into the sequence of GFP in a fashion that retained the canonical splice site donor and acceptor sequences (Fig. 1a). We then inserted either three G4C2 repeats or concatamers of 28 and 21 G4C2 repeat units with short interruptions into the intron, up to a maximal repeat length of 484 repeats (Supplemental Figure S1, Supplemental Table S1). After initial studies demonstrated no significant phenotype associated with 28 or 49 intronic repeats (not shown), we focused our analysis on lines expressing 3 (iC3), 242 (iC242), or 484 (iC484) intronic repeats.

Fig. 1

Intronic G4C2 repeats influence splicing and elicit RNA foci in Drosophila.a Schematic of constructs used to generate transgenic fly lines. The Prospero fly intron 1 (gray lines) was introduced into UAS-GFP (blue lines). Either three G4C2 repeats or serial (G4C2)21–28 repeat units separated by 14 nt interruptions were serially inserted into the intron. b Locations of the primer pairs used for measuring unspliced, spliced, and total mRNA are show in the schematic. Quantification of the expression of the mature (left) and unspliced (right) GFP product in the indicated fly lines (right), n = 9. c Western blot of lysates from heads of G4C2 intronic repeat flies (left) with quantification of GFP normalized to beta-tubulin (right), n = 5. d Quantification of RNA foci in the retina of the indicated intronic fly, n = 10. e Representative eye phenotypes from flies of the indicated genotypes crossed to GMR-GAL4 to drive expression in developing ommatidia (left) and quantification of eye phenotype scores (right). f Quantification of the number of progeny carrying the transgene after flies of the indicated genotypes were crossed to a ubiquitous driver (act5c-GAL4). g Quantification of the distance crawled by 3rd instar larvae from the crosses of the indicated fly genotypes to a larval motor neuron specific driver (OK6-GAL4). h Flies carrying the indicated transgenes were crossed to a Tubulin Geneswitch driver (Tub5) to allow ubiquitous expression post eclosion. Adult male flies were placed on food containing RU-486 to activate gene expression and their viability was tracked over 50 days. Graphs represent means ± SEM. * p < 0.05; ** p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons. The number of flies for each genotype for E, F, G and H was > 100

The G4C2 repeat as RNA can form a strong G-quadruplex secondary structure that might interfere with RNA metabolism and splicing [18, 46, 47]. To determine if the repeat taken out of its native context is capable of eliciting alterations in mRNA splicing, we measured the total, unspliced, and spliced mRNA from fly lines containing 3, 242, or 484 intronic repeats (Fig. 1b). There was no significant difference in the abundance of spliced GFP RNA across repeat sizes (Fig. 1b). Consistent with this, the amount of GFP protein expressed in the intronic repeat flies was comparable to that seen in flies expressing GFP alone and did not decline with larger repeat sizes (Fig. 1c). In contrast, there was a significant repeat-length dependent increase in unspliced pre-mRNA containing the G4C2 repeat when normalized to RPL32 (Fig. 1b). Additionally, RNA foci were observed in flies expressing 484 G4C2 repeats but not 3 repeats (Fig. 1d and Supplemental Figure S3).

We utilized the UAS-GAL4 system to target expression of the intronic repeats to different tissues. Expression of GFP-iC3, GFP-iC242, or GFP-iC484 in the eye using a GMR-GAL4 driver elicited modest toxicity in GFP-iC242 eye phenotypes compared to a control line expressing GFP alone using a standardized toxicity scale [35] (Fig. 1e). No significant eye phenotypes were observed in GFP-iC484 expressing lines. Similarly, activating ubiquitous expression during development using an Actin5C-GAL4 driver elicited no differences in eclosion rates between expanded intron lines and the control GFP line (Fig. 1f). We also selectively activated expression of the intronic repeat transgenes in motor neurons using an OK6-GAL4 driver line and evaluated larval crawling as a motor phenotype. We observed no difference between short and long intronic repeats compared to flies expressing GFP alone (Fig. 1g). To evaluate whether a reduction in lifespan might be elicited in adulthood, we activated transgene expression in adult male flies after eclosion using a Geneswitch Tubulin-GAL4 driver by addition of RU-486 to the fly food. We observed no significant impact of intronic repeat expression on viability (Fig. 1h).

The role of repeat location in GGGGCC toxicity

To determine if the repeat location within a transcript might impact its toxicity, we inserted the same 28 G4C2 repeat cassette used to generate the intronic repeats into the 5′ leader of GFP rather than an intron (Fig. 2a). No start codons were present between the transcription start site and the repeat, although the ATG of GFP was retained 3′ to the repeat. We introduced small frameshift mutations between the repeat and the open reading frame of GFP so that it acted as a reporter for RAN translation in the 0+ (GA), 1+ (GP), or 2+ (GR) reading frames, respectively.

Fig. 2

Drosophila 5’UTR G4C2 repeat models. a Schematic of constructs used to generate transgenic fly lines. (G4C2)28 repeats were inserted into the 5’UTR of GFP without an upstream start codon in all three reading frames relative to the GFP reading frame. b Western blot (left) of lysates from heads of G4C2 exonic repeat flies normalized to β-tubulin (right), n = 6. c Quantification of RNA foci in the retina of the indicated fly lines, n = 10. Graphs represent means ± SEM. * p < 0.05; ** p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons

When the G4C2 repeat was expressed in an intronic context, levels of GFP protein were similar between short repeats and expanded repeats. However, placing the repeat in the 5′ leader of GFP dramatically affected overall GFP expression. A higher molecular weight GFP fusion protein product was observed by western blot in the 0+ (GA) frame, but undetectable in the GP (1+) or GR (2+) reading frames (Fig. 2b), though the impact of such RAN translation DPRs in GFP protein solubility was not fully explored. Expression of all three constructs led to detectable RNA foci (Fig. 2c and Supplemental Figure S3). The presence and abundance of RNA foci did not directly predict toxicity in these flies, similar to prior observations [28].

When expressed in fly ommatidia, 5′ leader repeat fly lines elicited greater toxicity than was observed in the intronic repeat fly lines across a range of measures, consistent with a prior publication [28]. However, surprisingly, there were significant differences in the toxicity observed that was dependent on the reading frame in which the GFP reporter was placed. Specifically, 1+ (G4C2)28 GFP fly lines elicited a marked degeneration of the eye (Fig. 3a) and to a lesser extent in 2+ (G4C2)28 GFP flies. Developmental toxicity, as assessed by eclosion rates after ubiquitous expression, and a severe larval motor phenotype observed with isolated motor neuron expression, largely mirrored toxicity findings observed in the eye (Fig. 3b-c). Moreover, rapid declines in viability after transgene activation in adulthood were observed in all 5′ leader flies, with the most robust toxicity in 1+ (G4C2)28 GFP fly lines (Fig. 3d).

Fig. 3

5’UTR G4C2 repeats with different carboxyl termini elicit differential toxicity in Drosophilaa Representative eye phenotypes from flies of the indicated genotypes crossed to GMR-GAL4 to drive expression in developing ommatidia (right) and quantification of eye phenotype scores (right). b The number of progeny carrying the transgene was determined after flies of the indicated genotypes were crossed to a ubiquitous driver (act5c-GAL4). c Quantification of the distance crawled by 3rd instar larvae from the crosses of the indicated fly genotypes to a larval motor neuron specific driver (OK6-GAL4). d Flies carrying the indicated transgenes were crossed to a Tubulin Geneswitch driver (Tub5) to allow ubiquitous expression post eclosion. Adult male flies were placed on food containing RU-486 to activate gene expression and their viability was tracked over 50 days. Graphs represent means ± SEM. ** p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons. Log-rank (Mantel-Cox) test for survival, * p < 0.05, ** p < 0.01. The number of flies for each genotype was > 100

In an attempt to explain these differences in phenotype, we compared transgene mRNA expression across lines and observed no correlation between toxicity and abundance of the transgene expression (Supplemental Figure S1B). We next analyzed three independent lines for each transgene to control for effects of transgene insertion sites. Phenotypes for different lines of the same transgene were very similar despite significant differences across transgenes (Supplemental Figure S1C). Of note, the DNA sequence 5′ to the repeat in all three transgene constructs were identical, precluding contributions from an upstream initiation event in only one construct as a driving force in the observed phenotypic differences (Supplemental Table S1).

Accumulation and localization of DPR protein correlates with toxicity

To determine what factors might be triggering the differential toxicity between these Drosophila lines, we used a series of new and recently developed [40] antibodies against the three sense strand DPRs: GA, GR, and GP. These polyclonal antibodies recognize epitope tagged versions of the indicated repeat proteins both by western blot and by immunocytochemistry in transfected cells (Supplemental Figure S2A-B). These antibodies also stain perinuclear inclusions in cerebellar slices from ALS patients with C9ORF72 repeat expansions but not in healthy controls (Supplemental Figure S2C).

We utilized these antibodies to determine the relative abundance and distribution of each DPR in all of the transgenic Drosophila lines described. We reasoned that the epitope tags provide information on a single reading frame, but that RAN translation occurs in all 3 reading frames and thus the relative abundance of each product might be informative for their roles in toxicity. The expected RAN translation products from the intronic constructs would produce 3 possible polypeptides with mixed all three DPRs in one frame, and pure poly (GP)28 in the second frame, and mixed poly (GR) and poly (GA) DPRs in the third frame (Table S1), while the expected RAN translation products from all three 5′ leader constructs would just produce 3 pure individual DPRs in each respective reading frames (Table 1). Using transverse ommatidial sections derived from Drosophila expressing each transgene (UAS-GFP, GFP-iC3, GFP-iC484, and 0+, 1+, or 2+ (G4C2)28 GFP lines) crossed to GMR-GAL4, we performed immunofluorescence analysis against each of the three DPRs using these new antibodies. No staining was observed in lines expressing GFP in isolation or in the driver line alone. However, staining was consistently present with both GA and GR antibodies in the intronic fly models (Fig. 4). In contrast, GP staining was not reliably observed from the intronic fly lines. In 5′ leader lines there were marked differences in the staining for both GR and GP between lines, with the greatest staining present in the 1+ (G4C2)28 GFP lines which also exhibited the greatest toxicity (Fig. 4c-dD). In addition, this same line exhibited a marked increase in both total nuclear staining and in the nuclear-cytoplasmic ratio for all three DPR proteins (Fig. 4d), suggesting a correlation between both abundance and cellular localization of these proteins and the observed toxicity in Drosophila [20, 25, 28, 48].

Table 1 RAN Translation Peptides in G4C2 repeat transgenic flies and plasmids
Fig. 4

G4C2 repeats toxicity is associated with nuclear accumulation of RAN translation peptides. a Representative images of transverse retinal sections from Drosophila of the indicated genotypes probed with antibody against GA dipeptide repeats. b Quantification of total GA repeat staining by immunofluorescence (left) and the percentile of cells with significant nuclear accumulation of GA DPR staining for the indicated genotypes (right). c Representative Immunofluorescence images of Drosophila retinal sections probed with antibodies against the GP dipeptide repeats. d Quantification of total GP repeat staining by immunofluorescence (left) and the percentile of cells with significant nuclear accumulation of GP DPR staining for the indicated genotypes (right). E) Representative Immunofluorescence image of transverse retinal sections from Drosophila of the indicated genotypes probed with antibody against GR dipeptide repeats. F) Quantification of total GR repeat staining by immunofluorescence (left) and the percentile of cells with significant nuclear accumulation of GR DPR staining for the indicated genotypes. Graphs represent means ± SEM. * p < 0.05; ** p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons. The number of flies analyzed for each respective genotype in B, D and F was 10

Prior studies have demonstrated that C-terminal sequences and epitope tags can influence the toxicity associated with translated repeat expansions such as polyglutamine proteins [30]. More recently, the C-terminus of the RAN translated protein FMRpolyG generated from GGC repeats in FXTAS was found to influence the relative toxicity of this protein through aberrant protein-protein interactions [31]. We therefore analyzed the C-termini of the transgenes used to generate the different exonic flies (Table 1). In the native C9ORF72 gene, each DPR reading frame has 30–55 amino acids after the repeat prior to the presence of a stop codon. Such C-termini were detectable in ALS patients carrying the expanded G4C2 repeats according to a previous study [13]. In our exonic flies, 2 reading frames have C-termini over 200 amino acids, one of which is the fused GFP, while the other is a long polypeptide with no known homology with human proteins, whereas the third reading frame is much shorter (15–21 amino acids).

Based on these observations, we hypothesized that the differential C-termini alone or combined with the DPRs could alter the stability and or distribution of such DPRs and subsequently alter the toxicity. To explore this idea further, we utilized longitudinal microscopy [31, 43,44,45] to track survival of individual neurons expressing sequences similar to those used in the fly experiments (Fig. 5a). Primary rodent cortical neurons were transfected with plasmids encoding mApple, to visualize neuronal soma, and each of the 5′ leader G4C2 constructs. These cells were imaged at 24 h intervals over a 10d period, and the time of death for individual neurons determined programmatically using a set of sensitive criteria (rounding or blebbing of the cell body or neurites, loss of fluorescence) validated in previous studies [49, 50]. Relative survival in each condition was compared by Cox proportional hazards analysis. As we observed in Drosophila, expression of an expanded G4C2 repeat in the 5′ UTR of GFP was toxic in neurons. The relative toxicity of each construct was again dictated by the reading frame of the GFP tag, with toxicity being greatest in those with 1+ (G4C2)71GFP or 2+ (G4C2)71 GFP, mirroring what was observed in Drosophila (Fig. 5a).

Fig. 5

Native carboxyl terminal sequences reduce GGGGCC repeat toxicity in rodent neurons and in flies. a Top: Schematic of the constructs transfected into neurons, with GFP placed in different reading frames relative to the GGGGCC repeat. Bottom: Neuronal survival of neurons expressing these constructs is graphed as the cumulative risk of death (Y axis, with higher values reflecting increased death) over time (X-axis, measured in hours). b Top: Schematic of the constructs transfected into neurons with shorter or expanded G4C2 repeats and with or without the native Cterminus. Bottom: Cumulative risk of death of neurons. c Top: Schematic of the constructs used to generate transgenic fly lines. Bottom: Representative eye phenotypes from flies of the indicated genotypes crossed to GMR-GAL4 to drive expression in developing ommatidia (left) and quantification of eye phenotype scores (right). Graphs represent means ± SEM. For Panels A and B, n = number of neurons. **p < 0.01 difference in survival measured by Cox proportional hazard analysis. Each graph represents at least 3 independent experiments. For panel C, ** p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons compared to control group. † p < 0.01 by Kruskal-Wallis after Dunn’s correction for multiple comparisons compared to (G4C2)69 CT group. The number of flies analyzed in panel C was > 100

To evaluate the potential role of different C-terminal sequences on toxicity, we generated a new series of expression vectors that take into account the length and sequence of the C-termini. One set of vectors have the native C-termini (CT) for all three reading frames. Another set of vectors was generated with shortened C-terminal regions in all three reading frames (ΔCT) (Fig. 5b). The G4C2 repeat with the native C-terminal sequence showed increased toxicity with an increasing repeat size (Fig. 5b, solid green vs solid blue line). However, when the native C-termini were removed and replaced with short C-termini, toxicity was significantly increased (Fig. 5b, dashed lines vs solid lines). These (G4C2)69 repeats with the native C-termini (CT) or with short C-termini (ΔCT) were also introduced to the same locus (AttP40) in Drosophila using pUAST-AttB vectors to achieve equal expression levels. Consistent to the partially protective effects found in the culture neurons, the inclusion of the native C-termini partially suppressed the repeat-associated toxicity when these constructs were expressed in the fly eyes (Fig. 5c).

In an attempt to determine which C-terminal reading frame was most critical for toxicity, we introduced a series of AUG driven V5 tags upstream of the repeat in different reading frames. Enhancing production of GA or GR from these repeats significantly boosted the toxicity of these constructs to a greater degree than placement of an AUG codon in the GP reading frame. However, replacing the native C-terminal sequences with stop codons only enhanced repeat toxicity when the AUG codon was placed in the GR reading frame (Fig. 6c-e). Taken together, these data suggest that the native C-termini may mitigate toxicity arising from RAN translation, and in particular toxicity related to GR DPR proteins.

Fig. 6

Native carboxyl terminal sequences decrease GR dipeptide repeat licited toxicity in rodent neurons. a Top: Schematic of constructs generated to evaluate impact of including an AUG codon in each individual reading frame of the repeat while maintaining the native C-terminal sequence across all 3 reading frames. Bottom: Survival of neurons expressing the indicated constructs as measured by cumulative risk of death. b Schematic of constructs used in panels c-e. c Cumulative risk of death of neurons transfected with constructs containing an AUG codon in the GA reading frame with or without the native C-terminus. d) Cumulative risk of death of neurons transfected with constructs containing an AUG codon in the GP reading frame with or without the native C-terminus. e Cumulative risk of death of neurons transfected with constructs containing an AUG codon in the GR reading frame with or without the native C-terminus. *p < 0.05 and ** p < 0.01 indicate a difference in survival as determined by Cox proportional hazard analysis. The number of neurons for each analysis were as indicated in the respective lines. Each graph represents at least 3 independent experiments


How hexanucleotide repeat expansions in C9ORF72 elicit neurodegeneration is a topic of intense research [51, 52]. Work by multiple groups has established the presence of both nuclear RNA foci and RAN translation derived DPR proteins in tissues and cells from affected patients. Initial studies largely relied on expression of the repeat out of its normal sequence context or production of DPR proteins via AUG-initiated translation. These studies importantly demonstrate that even relatively short RNA repeats and small DPRs can be toxic [19, 20, 22,23,24,25,26, 53,54,55]. The position of the GGGGCC repeat within an mRNA is a major factor in determining its relative toxicity in Drosophila, with repeats residing in an intron exhibiting minimal toxicity while even short repeats in the 5′ leader elicit significant toxicity [28]. In this work we observe that sequences 3′ to the repeat also modulate repeat toxicity by altering the C-termini of DPRs generated via RAN translation. These findings suggest a specific role for C-terminal sequences in modulating repeat toxicity in C9ALS/FTD while making it clear that repeat sequence context needs to be carefully considered in future studies of disease pathogenesis.

Sequences located in coding regions or within the 5′ or 3′ UTR are typically processed to capped and polyadenylated mRNAs and rapidly trafficked out of the nucleus, where they interface with a series of RNA binding proteins and the translational machinery. In contrast, intronic sequences are typically spliced out during transcription, formed into RNA lariats and targeted for debranching and degradation in the nucleus. As such, one potential explanation for the observed differences in toxicity between intronic and 5′ leader sequence repeats is the distribution of the repeat containing transcripts within the cell and their potential for translation. In vitro and cell based studies using reporter systems suggest that while GGGGCC repeats can be translated from intronic or bi-cistronic transcripts, their production is most robust when present in capped and poly-adenylated mRNAs [56,57,58,59,60]. Consistent with this prediction, the abundance of specific RAN translation products from relatively short 5′ leader repeats were much greater than that observed from larger intronic repeats (Figs. 1, 2, 3 and 4). These differences were also consistent with past studies of DPRs production in in vitro, cellular and Drosophila models of GGGGCC repeats [21, 28, 57]. Moreover, the relative abundance and distribution of these DPR proteins was predictive of toxicity, while the total RNA abundance and RNA foci counts were similar across repeat containing lines, consistent with prior studies [21, 28]. While studies in mammalian or human model systems at endogenous expression levels are needed to fully arbitrate the relative roles of RNA and protein in GGGGCC repeat toxicity, our work is consistent with the emerging consensus supporting a significant role of RAN translation in C9ALS/FTD pathogenesis.

In contrast, our finding that the frame in which the C-terminal epitope tag resides was strongly predictive of repeat toxicity is surprising. This finding was quite robust, as we observed this same relationship across multiple different Drosophila lines with distinct insertion sites (Supplemental Figure S1) and also when these same constructs were expressed in rodent neurons (Fig. 6). The “GFP frame” where the toxicity was greatest would create a fusion with a GP dipeptide repeat protein—the least toxic DPR based on multiple studies using AUG driven expression. Thus, it could be that “out of frame” untagged products are largely driving toxicity while the GFP tag is actually suppressing toxicity derived from certain DPRs. In this context, it is intriguing that the small sequence changes we introduced below the repeat to shift the reading frame resulted in a short disordered 20 amino acid C-terminal tail in the GR reading frame and a long polypeptide with no known homology along the GA DPR reading frame from the 1+ (G4C2)28 GFP flies with the most severe phenotypes. Our data suggests that these C-terminal changes correlated with both more aggregates of GR DPR in 1+ (G4C2)28 GFP flies and greater nuclear localization of all three DPRs. Moreover, modulating C-terminal sequences in transcripts expressed in rodent neurons had the greatest effect on AUG-driven GR associated toxicity (Fig. 6). However, poly (GA) DPRs can interact with poly (GR) DPRs to contribute the repeat toxicity [25] and are themselves toxic in isolation in mammalian neurons [22, 23, 61] - a finding that is consistent with our own data (Fig. 6a). Thus, while our data is most consistent with a primary role of GR in modulating this C-terminal toxicity, it does not preclude important roles for GA or GP in this process. Future studies will be needed to determine how C-terminal changes to these DPRs in isolation might impact their interactions and toxicity.

Isolated expression of GR DPRs in the absence of the RNA repeat exhibit marked toxicity in Drosophila, rodent and cell based model systems [20, 25, 53, 62, 63]. As such, changes in its relative solubility, stability or distribution may well influence the toxicity elicited by G4C2 repeats. In contrast, the relative lack of toxicity associated with the 0+ (G4C2)28 GFP transgene may also be partially explained by a stabilizing effect of the GFP tag on the GA DPR. Isolated expression of GA DPRs is toxic in mammalian neurons and, to a lesser extent, in Drosophila [20, 22, 23, 61]. However, this GA-GFP fusion product remains largely cytoplasmic and soluble in our assays, which may impede GR mediated toxicity in Drosophila [25]. Regardless, our results suggest that the C-terminal region of RAN derived proteins influences their relative toxicity and cellular distribution: a finding with significant implications for design of future model systems at both this and other disease causing repeat expansions.

In rodent neurons, we observed a protective effect when the native C-termini are retained in all three reading frames. Based on studies where we serially enhanced expression of each DPR reading frame by inserting an AUG codon in a strong Kozak context above the GGGGCC repeat element, this protective effect seems to be largely driven by the GR reading frame, as elimination of the C-terminus in GR driven constructs enhanced toxicity while doing so in GA or GP driven constructs had no significant or even a mildly protective effect. Given the propensity for GR DPRs to phase separate [64,65,66], this native C-terminus may influence the behavior of these peptides when generated in patients. Future studies will be needed to determine if and how such surrounding sequences, especially the downstream sequences influence the expression and localization of these DPRs.

It is noteworthy that in rodent neurons with the protective effects of the native C-termini on the expanded GGGGCC repeat toxicity with AUG were not as prominent as that on the repeats without AUG (Figs. 5b vs 6). Several potential mechanisms might explain this discrepancy: First, since the DPR production is tightly tied to toxicity in C9orf72 overexpression models, it may be that the toxic effects of the ATG-driven DPRs overwhelm the subtler contribution from the C-termini. Second, because the ATG-driven constructs also contain a V5 tag, they have a larger N-terminal extension on the DPR that might also influence the impact of C-terminal sequences. Third, RAN translation may be more permissive of initiation in multiple frames, allowing for production of all 3 DPR products within a given neuron rather than one major reading frame, and in this context the C-termini may have larger impacts. A full array of characterizations of these DPRs and their interactions with/without the native C-termini will be needed to dissect these mechanisms in future studies.

In summary, our findings demonstrate that the position of the repeat within a transcript as well as its immediate surrounding sequences can have a major impact on its relative toxicity, and on the distribution and abundance of different RAN translated proteins. Future work will be needed to determine the selective effects that these sequence modifications have on the biophysical properties of RAN translated proteins and their exact roles in disease pathogenesis in ALS and FTD. When coupled with recent findings in FXTAS [31] and Spinocerebellar Ataxia type 36 [67, 68], these findings suggest that similar analyses in other repeat disorders are likely warranted.

Availability of data and materials

Not applicable.



Amyotrophic lateral sclerosis


Frontotemporal dementia

RAN translation:

Repeat-associated non-AUG initiated translation


Dipeptide repeats


Untranslated region


Fragile X-associated tremor/ataxia syndrome








Carboxyl terminal


  1. 1.

    Todd PK, Paulson HL (2010) RNA-mediated neurodegeneration in repeat expansion disorders. Ann Neurol 67:291–300

    CAS  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Zoghbi HY, Orr HT (2000) Glutamine repeats and Neurodegeneration. Annu Rev Neurosci 23:217–247

    CAS  PubMed  Google Scholar 

  3. 3.

    Cleary JD, Ranum LPW (2014) Repeat associated non-ATG (RAN) translation: new starts in microsatellite expansion disorders. Curr Opin Genet Dev 26:6–15

    CAS  PubMed  Google Scholar 

  4. 4.

    Zu T, Gibbens B, Doty NS, Gomes-Pereira M, Huguet A, Stone MD et al (2011) Non-ATG-initiated translation directed by microsatellite expansions. Proc Natl Acad Sci U S A 108:260–265

    CAS  PubMed  Google Scholar 

  5. 5.

    DeJesus-Hernandez M, Mackenzie IR, Boeve BF, Boxer AL, Baker M, Rutherford NJ et al (2011) Expanded GGGGCC Hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron. 72:245–256

    CAS  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Renton AE, Majounie E, Waite A, Simón-Sánchez J, Rollinson S, Gibbs JR et al (2011) A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD. Neuron. 72:257–268

    CAS  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Woollacott IOC, Mead S (2014) The C9ORF72 expansion mutation: gene structure, phenotypic and diagnostic issues. Acta Neuropathol 127:319–332

    CAS  PubMed  Google Scholar 

  8. 8.

    Farg MA, Sundaramoorthy V, Sultana JM, Yang S, Atkinson RAK, Levina V et al (2014) C9ORF72, implicated in amytrophic lateral sclerosis and frontotemporal dementia, regulates endosomal trafficking. Hum Mol Genet 23:3579–3595

    CAS  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Levine TP, Daniels RD, Gatta AT, Wong LH, Hayes MJ (2013) The product of C9orf72, a gene strongly implicated in neurodegeneration, is structurally related to DENN Rab-GEFs. Bioinformatics. 29:499–503

    CAS  PubMed  PubMed Central  Google Scholar 

  10. 10.

    Zhang D, Iyer LM, He F, Aravind L (2012) Discovery of novel DENN proteins: implications for the evolution of eukaryotic intracellular membrane structures and human disease. Front Genet 3:283

    PubMed  PubMed Central  Google Scholar 

  11. 11.

    Zu T, Liu Y, Bañez-Coronel M, Reid T, Pletnikova O, Lewis J et al (2013) RAN proteins and RNA foci from antisense transcripts in C9ORF72 ALS and frontotemporal dementia. Proc Natl Acad Sci U S A 110:E4968–E4977

    CAS  PubMed  PubMed Central  Google Scholar 

  12. 12.

    Gendron TF, Bieniek KF, Zhang YJ, Jansen-West K, Ash PEA, Caulfield T et al (2013) Antisense transcripts of the expanded C9ORF72 hexanucleotide repeat form nuclear RNA foci and undergo repeat-associated non-ATG translation in c9FTD/ALS. Acta Neuropathol 126:829–844

    CAS  PubMed  PubMed Central  Google Scholar 

  13. 13.

    Mori K, Arzberger T, Grässer FA, Gijselinck I, May S, Rentzsch K et al (2013) Bidirectional transcripts of the expanded C9orf72 hexanucleotide repeat are translated into aggregating dipeptide repeat proteins. Acta Neuropathol 126:881–893

    CAS  PubMed  Google Scholar 

  14. 14.

    Mori K, Weng SM, Arzberger T, May S, Rentzsch K, Kremmer E et al (2013) The C9orf72 GGGGCC repeat is translated into aggregating dipeptide-repeat proteins in FTLD/ALS. Science 339:1335–1338

    CAS  PubMed  Google Scholar 

  15. 15.

    Ash PEA, Bieniek KF, Gendron TF, Caulfield T, Lin W-L, Dejesus-Hernandez M et al (2013) Unconventional translation of C9ORF72 GGGGCC expansion generates insoluble polypeptides specific to c9FTD/ALS. Neuron. 77:639–646

    CAS  PubMed  PubMed Central  Google Scholar 

  16. 16.

    Gijselinck I, Van Langenhove T, van der Zee J, Sleegers K, Philtjens S, Kleinberger G et al (2012) A C9orf72 promoter repeat expansion in a Flanders-Belgian cohort with disorders of the frontotemporal lobar degeneration-amyotrophic lateral sclerosis spectrum: a gene identification study. Lancet Neurol 11:54–65

    CAS  PubMed  Google Scholar 

  17. 17.

    Liu EY, Russ J, Wu K, Neal D, Suh E, McNally AG et al (2014) C9orf72 hypermethylation protects against repeat expansion-associated pathology in ALS/FTD. Acta Neuropathol 128:525–541

    CAS  PubMed  PubMed Central  Google Scholar 

  18. 18.

    Haeusler AR, Donnelly CJ, Periz G, Simko EAJ, Shaw PG, Kim MS et al (2014) C9orf72 nucleotide repeat structures initiate molecular cascades of disease. Nature. 507:195–200

    CAS  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Xu Z, Poidevin M, Li X, Li Y, Shu L, Nelson DL et al (2013) Expanded GGGGCC repeat RNA associated with amyotrophic lateral sclerosis and frontotemporal dementia causes neurodegeneration. Proc Natl Acad Sci U S A 110:7778–7783

    CAS  PubMed  PubMed Central  Google Scholar 

  20. 20.

    Mizielinska S, Grönke S, Niccoli T, Ridler CE, Clayton EL, Devoy A et al (2014) C9orf72 repeat expansions cause neurodegeneration in Drosophila through arginine-rich proteins. Science. 345:1192–1194

    CAS  PubMed  PubMed Central  Google Scholar 

  21. 21.

    Moens TG, Mizielinska S, Niccoli T, Mitchell JS, Thoeng A, Ridler CE et al (2018) Sense and antisense RNA are not toxic in Drosophila models of C9orf72-associated ALS/FTD. Acta Neuropathol 135:445–457

    CAS  PubMed  PubMed Central  Google Scholar 

  22. 22.

    May S, Hornburg D, Schludi MH, Arzberger T, Rentzsch K, Schwenk BM et al (2014) C9orf72 FTLD/ALS-associated Gly-Ala dipeptide repeat proteins cause neuronal toxicity and Unc119 sequestration. Acta Neuropathol 128:485–503

    CAS  PubMed  PubMed Central  Google Scholar 

  23. 23.

    Zhang YJ, Jansen-West K, Xu YF, Gendron TF, Bieniek KF, Lin WL et al (2014) Aggregation-prone c9FTD/ALS poly (GA) RAN-translated proteins cause neurotoxicity by inducing ER stress. Acta Neuropathol 128:505–524

    CAS  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Wen X, Tan W, Westergard T, Krishnamurthy K, Markandaiah SS, Shi Y et al (2014) Antisense proline-arginine RAN dipeptides linked to C9ORF72-ALS/FTD form toxic nuclear aggregates that initiate invitro and invivo neuronal death. Neuron. 84:1213–1225

    CAS  PubMed  PubMed Central  Google Scholar 

  25. 25.

    Yang D, Abdallah A, Li Z, Lu Y, Almeida S, Gao FB (2015) FTD/ALS-associated poly (GR) protein impairs the notch pathway and is recruited by poly (GA) into cytoplasmic inclusions. Acta Neuropathol 130:525–535

    CAS  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Chew J, Gendron TF, Prudencio M, Sasaguri H, Zhang YJ, Castanedes-Casey M et al (2015) C9ORF72 repeat expansions in mice cause TDP-43 pathology, neuronal loss, and behavioral deficits. Science 348:1151–1154

    CAS  PubMed  PubMed Central  Google Scholar 

  27. 27.

    He F, Jones JM, Figueroa-Romero C, Zhang D, Feldman EL, Goutman SA et al (2016) Screening for novel hexanucleotide repeat expansions at ALS-and FTD-associated loci. Neurol Genet 2:e71

    PubMed  PubMed Central  Google Scholar 

  28. 28.

    Tran H, Almeida S, Moore J, Gendron TF, Chalasani UD, Lu Y et al (2015) Differential toxicity of nuclear RNA foci versus dipeptide repeat proteins in a Drosophila model of C9ORF72 FTD/ALS. Neuron. 87:1207–1214

    CAS  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Ross CA, Tabrizi SJ (2011) Huntington’s disease: from molecular pathogenesis to clinical treatment. Lancet Neurol 10:83–98

    CAS  PubMed  Google Scholar 

  30. 30.

    Sathasivam K, Neueder A, Gipson TA, Landles C, Benjamin AC, Bondulich MK et al (2013) Aberrant splicing of HTT generates the pathogenic exon 1 protein in Huntington disease. Proc Natl Acad Sci U S A 110:2366–2370

    CAS  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Sellier C, Buijsen RAM, He F, Natla S, Jung L, Tropel P et al (2017) Translation of expanded CGG repeats into FMRpolyG is pathogenic and May contribute to fragile X tremor Ataxia syndrome. Neuron. 93:331–347

    CAS  PubMed  PubMed Central  Google Scholar 

  32. 32.

    Scamborova P, Wong A, Steitz JA (2004) An Intronic enhancer regulates splicing of the Twintron of Drosophila melanogaster prospero pre-mRNA by two different Spliceosomes. Mol Cell Biol 24:1855–1869

    CAS  PubMed  PubMed Central  Google Scholar 

  33. 33.

    Meisler MH, Grant AE, Jones JM, Lenk GM, He F, Todd PK et al (2013) C9ORF72 expansion in a family with bipolar disorder. Bipolar Disord 15:326–332

    CAS  PubMed  PubMed Central  Google Scholar 

  34. 34.

    Markstein M, Pitsouli C, Villalta C, Celniker SE, Perrimon N (2008) Exploiting position effects and the gypsy retrovirus insulator to engineer precisely expressed transgenes. Nat Genet 40:476–483

    CAS  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Todd PK, Oh SY, Krans A, He F, Sellier C, Frazer M et al (2013) CGG repeat-associated translation mediates neurodegeneration in fragile X tremor ataxia syndrome. Neuron. 78:440–455

    CAS  PubMed  Google Scholar 

  36. 36.

    Pandey UB, Nie Z, Batlevi Y, McCray BA, Ritson GP, Nedelsky NB et al (2007) HDAC6 rescues neurodegeneration and provides an essential link between autophagy and the UPS. Nature. 447:859–863

    CAS  PubMed  Google Scholar 

  37. 37.

    Lanson NA, Maltare A, King H, Smith R, Kim JH, Taylor JP et al (2011) A Drosophila model of FUS-related neurodegeneration reveals genetic interaction between FUS and TDP-43. Hum Mol Genet 20:2510–2523

    CAS  PubMed  PubMed Central  Google Scholar 

  38. 38.

    Todd PK, Oh SY, Krans A, Pandey UB, Di Prospero NA, Min KT et al (2010) Histone deacetylases suppress cgg repeat-induced neurodegeneration via transcriptional silencing in models of fragile X tremor Ataxia syndrome. PLoS Genet 6:e1001240

    PubMed  PubMed Central  Google Scholar 

  39. 39.

    Nichols CD, Becnel J, Pandey UB (2012) Methods to assay Drosophila behavior. J Vis Exp 61:3795

    Google Scholar 

  40. 40.

    Flores BN, Dulchavsky ME, Krans A, Sawaya MR, Paulson HL, Todd PK et al (2016) Distinct c9orf72-associated dipeptide repeat structures correlate with neuronal toxicity. PLoS One 11:e0165084.

  41. 41.

    He F, Krans A, Freibaum BD, Paul Taylor J, Todd PK (2014) TDP-43 suppresses CGG repeat-induced neurotoxicity through interactions with HnRNP A2/B1. Hum Mol Genet 23:5036–5051

    CAS  PubMed  PubMed Central  Google Scholar 

  42. 42.

    Iliff AJ, Renoux AJ, Krans A, Usdin KU, Sutton MA, Todd PK (2013) Impaired activity-dependent FMRP translation and enhanced mGluR-dependent LTD in fragile X premutation mice. Hum Mol Genet 22:1180–1192

    CAS  PubMed  Google Scholar 

  43. 43.

    Flores BN, Li X, Malik AM, Martinez J, Beg AA, Barmada SJ (2019) An Intramolecular salt bridge linking TDP43 RNA binding, protein stability, and TDP43-dependent Neurodegeneration. Cell Rep 27:1133–1150

    CAS  PubMed  PubMed Central  Google Scholar 

  44. 44.

    Malik AM, Miguez RA, Li X, Ho YS, Feldman EL, Barmada SJ (2018) Matrin 3-dependent neurotoxicity is modified by nucleic acid binding and nucleocytoplasmic localization. Elife. 7:e35977

    PubMed  PubMed Central  Google Scholar 

  45. 45.

    Weskamp K, Safren N, Miguez R, Barmada S (2019) Monitoring neuronal survival via longitudinal fluorescence microscopy. J Vis Exp 143:10.3791/59036

    Google Scholar 

  46. 46.

    Conlon EG, Lu L, Sharma A, Yamazaki T, Tang T, Shneider NA et al (2016) The C9ORF72 GGGGCC expansion forms RNA G-quadruplex inclusions and sequesters hnRNP H to disrupt splicing in ALS brains. eLife 5:e17820.

  47. 47.

    Reddy K, Zamiri B, Stanley SYR, Macgregor RB, Pearson CE (2013) The disease-associated r (GGGGCC) n repeat from the C9orf72 gene forms tract length-dependent uni- and multimolecular RNA G-quadruplex structures. J Biol Chem 288:9860–9866

    CAS  PubMed  PubMed Central  Google Scholar 

  48. 48.

    Schludi MH, May S, Grässer FA, Rentzsch K, Kremmer E, Küpper C et al (2015) Distribution of dipeptide repeat proteins in cellular models and C9orf72 mutation cases suggests link to transcriptional silencing. Acta Neuropathol 130:537–555

    CAS  PubMed  PubMed Central  Google Scholar 

  49. 49.

    Arrasate M, Finkbeiner S (2005) Automated microscope system for determining factors that predict neuronal fate. Proc Natl Acad Sci U S A 102:3840–3845

    CAS  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Arrasate M, Mitra S, Schweitzer ES, Segal MR, Finkbeiner S (2004) Inclusion body formation reduces levels of mutant huntingtin and the risk of neuronal death. Nature. 431:805–810

    CAS  PubMed  Google Scholar 

  51. 51.

    Balendra R, Isaacs AM (2018) C9orf72-mediated ALS and FTD: multiple pathways to disease. Nat Rev Neurol 14:544–558

    CAS  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Gitler AD, Tsuiji H (1647) There has been an awakening: emerging mechanisms of C9orf72 mutations in FTD/ALS. Brain Res 2016:19–29

    Google Scholar 

  53. 53.

    Kwon I, Xiang S, Kato M, Wu L, Theodoropoulos P, Wang T et al (2014) Poly-dipeptides encoded by the C9orf72 repeats bind nucleoli, impede RNA biogenesis, and kill cells. Science 345:1139–1145

    CAS  PubMed  PubMed Central  Google Scholar 

  54. 54.

    Tao Z, Wang H, Xia Q, Li K, Li K, Jiang X et al (2015) Nucleolar stress and impaired stress granule formation contribute to C9orf72 RAN translation-induced cytotoxicity. Hum Mol Genet 24:2426–2441

    CAS  PubMed  Google Scholar 

  55. 55.

    Yamakawa M, Ito D, Honda T, Kubo KI, Noda M, Nakajima K et al (2015) Characterization of the dipeptide repeat protein in the molecular pathogenesis of c9FTD/ALS. Hum Mol Genet 24:1630–1645

    CAS  PubMed  Google Scholar 

  56. 56.

    Green KM, Glineburg MR, Kearse MG, Flores BN, Linsalata AE, Fedak SJ et al (2017) RAN translation at C9orf72-associated repeat expansions is selectively enhanced by the integrated stress response. Nat Commun 8:2005

    PubMed  PubMed Central  Google Scholar 

  57. 57.

    Cheng W, Wang S, Mestre AA, Fu C, Makarem A, Xian F et al (2018) C9ORF72 GGGGCC repeat-associated non-AUG translation is upregulated by stress through eIF2α phosphorylation. Nat Commun 9:51

    PubMed  PubMed Central  Google Scholar 

  58. 58.

    Sonobe Y, Ghadge G, Masaki K, Sendoel A, Fuchs E, Roos RP (2018) Translation of dipeptide repeat proteins from the C9ORF72 expanded repeat is associated with cellular stress. Neurobiol Dis 116:155–165

    CAS  PubMed  PubMed Central  Google Scholar 

  59. 59.

    Tabet R, Schaeffer L, Freyermuth F, Jambeau M, Workman M, Lee CZ et al (2018) CUG initiation and frameshifting enable production of dipeptide repeat proteins from ALS/FTD C9ORF72 transcripts. Nat Commun 9:152

    PubMed  PubMed Central  Google Scholar 

  60. 60.

    Westergard T, McAvoy K, Russell K, Wen X, Pang Y, Morris B et al (2019) Repeat-associated non- AUG translation in C9orf72- ALS / FTD is driven by neuronal excitation and stress. EMBO Mol Med 11:e9423

    PubMed  PubMed Central  Google Scholar 

  61. 61.

    Lee YB, Baskaran P, Gomez-Deza J, Chen HJ, Nishimura AL, Smith BN et al (2017) C9orf72 poly GA RAN-translated protein plays a key role in amyotrophic lateral sclerosis via aggregation and toxicity. Hum Mol Genet 26:4765-77.

  62. 62.

    Lopez-Gonzalez R, Lu Y, Gendron TF, Karydas A, Tran H, Yang D et al (2016) Poly (GR) in C9ORF72-related ALS/FTD compromises mitochondrial function and increases oxidative stress and DNA damage in iPSC-derived motor neurons. Neuron. 92:383–391

    CAS  PubMed  PubMed Central  Google Scholar 

  63. 63.

    Zhang YJ, Gendron TF, Ebbert MTW, O’Raw AD, Yue M, Jansen-West K et al (2018) Poly (GR) impairs protein translation and stress granule dynamics in C9orf72-associated frontotemporal dementia and amyotrophic lateral sclerosis. Nat Med 24:1136–1142

    CAS  PubMed  PubMed Central  Google Scholar 

  64. 64.

    Gittings LM, Boeynaems S, Lightwood D, Clargo A, Topia S, Nakayama L et al (2020) Symmetric dimethylation of poly-GR correlates with disease duration in C9orf72 FTLD and ALS and reduces poly-GR phase separation and toxicity. Acta Neuropathol 139:407–410

    CAS  PubMed  Google Scholar 

  65. 65.

    Boeynaems S, Bogaert E, Kovacs D, Konijnenberg A, Timmerman E, Volkov A et al (2017) Phase separation of C9orf72 dipeptide repeats perturbs stress granule dynamics. Mol Cell 65:1044–1055

    CAS  PubMed  PubMed Central  Google Scholar 

  66. 66.

    Lee KH, Zhang P, Kim HJ, Mitrea DM, Sarkar M, Freibaum BD et al (2016) C9orf72 dipeptide repeats impair the assembly, dynamics, and function of membrane-less organelles. Cell. 167:774–788

    CAS  PubMed  PubMed Central  Google Scholar 

  67. 67.

    Todd TW, McEachin ZT, Chew J, Burch AR, Jansen-West K, Tong J et al (2020) Hexanucleotide Repeat Expansions in c9FTD/ALS and SCA36 Confer Selective Patterns of Neurodegeneration In Vivo. Cell Rep 31:107616.

  68. 68.

    McEachin ZT, Gendron TF, Raj N, García-Murias M, Banerjee A, Purcell RH et al (2020) Chimeric Peptide Species Contribute to Divergent Dipeptide Repeat Pathology in c9ALS/FTD and SCA36. Neuron 107:292-305.e6.

Download references


We thank Scott Pletcher, Zhe Han, and Cathy Collins for sharing fly lines. We thank Michael Sutton and the MBNI at Michigan for use of their confocal microscope. Reagents generated in the Drosophila Genome research center were supported by NIH Grant 2P40OD010949.


This project was funded by NIH R01NS086810, R01NS099280, the Department of Veterans Affairs BLRDs 1I21BX001841 and I01BX003231, and a pilot grant from ALS Association to PKT. Funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information




FH and PKT designed the study. FH, MF, AK, BF, SN, SN, and OA conducted the experiments. FH, AK, and PKT wrote the manuscript. All authors edited the manuscript. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Fang He or Peter K. Todd.

Ethics declarations

Ethics approval and consent to participate

All mammalian rodent work was approved by the Institutional Animal Care and Use Committee (IACUC) at the University of Michigan.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Supplemental Table S1.

Sequences for G4C2 Repeat-containing pUAST Vectors. The detailed DNA sequences for generating GFP-tagged G4C2 repeat–containing vectors.

Additional file 2: Supplemental Table S2.

Primers used for cloning, in situ, and qPCR. The detailed primer sequences used in the various experimental settings.

Additional file 3: Supplemental Figure S1.

Repeat size, RNA expression, and eye phenotypes for intronic and exonic G4C2 repeat flies. The characterization of multiple lines of transgenic flies

Additional file 4: Supplemental Figure S2.

Characterization of dipeptide repeat antibodies. The antibodies targeting the dipeptide repeats effectively detected the respective antigen in transfected cells and in patient cells.

Additional file 5 Supplemental Figure S3.

Characterization of G4C2 repeat RNA foci. The Cy5 labeled 2′-O-Me-(CCCCGG)5 RNA probe detected the RNA foci in long repeat-containing intronic and 5′-leader repeat flies.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

He, F., Flores, B.N., Krans, A. et al. The carboxyl termini of RAN translated GGGGCC nucleotide repeat expansions modulate toxicity in models of ALS/FTD. acta neuropathol commun 8, 122 (2020).

Download citation


  • C9ORF72
  • Amyotrophic lateral sclerosis
  • Dipeptide repeat proteins (DPRs)
  • Repeat-associated non-AUG translation (RAN)
  • Carboxyl terminus
  • Drosophila