Abstract
RNA ligases are present across all forms of life. While enzymatic RNA ligation between 5â²-PO4 and 3â²-OH termini is prevalent in viruses, fungi, and plants, such RNA ligases are yet to be identified in vertebrates. Here, using a nucleotide-based chemical probe targeting human AMPylated proteome, we have enriched and identified the hitherto uncharacterised human protein chromosome 12 open reading frame 29 (C12orf29) as a human enzyme promoting RNA ligation between 5â²-PO4 and 3â²-OH termini. C12orf29 catalyses ATP-dependent RNA ligation via a three-step mechanism, involving tandem auto- and RNA AMPylation. Knock-out of C12ORF29 gene impedes the cellular resilience to oxidative stress featuring concurrent RNA degradation, which suggests a role of C12orf29 in maintaining RNA integrity. These data provide the groundwork for establishing a human RNA repair pathway.
Similar content being viewed by others
Introduction
RNA ligases play vital roles in sealing RNA strands during intron-containing tRNA splicing1, tRNA repair2, mRNA splicing in the unfolded-protein response (UPR)3, RNA recombination4, as well as biogenesis of circular RNAs5. Across all life forms, proteinaceous RNA ligases are present with distinct catalytic mechanisms1. Like DNA ligases, RNA ligases are known that join 5â²-PO4 and 3â²-OH termini of RNA via a classic three-step mechanism1 (Fig. 1a). First, the RNA ligase undergoes adenosine 5â²-triphosphate (ATP)-dependent auto-AMPylation (also known as auto-adenylylation) at the catalytic lysine residue. The AMP is subsequently transferred from the ligase-(lysyl-N)-AMP to the 5â²-PO4 end of RNA (pRNA) to yield the RNA-adenylate intermediate (AppRNA). Finally, the two RNA ends are ligated by a phosphodiester bond upon nucleophilic attack of the 3â²-OH to the AppRNA, liberating the AMP. These enzymes, termed 5â²â3â² RNA ligases1, have been heavily exploited as molecular biology tools in RNA editing and sequencing6,7. Although the existence of a 5â²â3â² RNA ligase in HeLa cells has been suggested earlier8, such an enzyme remains to be identified in vertebrates. So far, only one proteinaceous RNA ligase has been identified in human, which is a guanidine 5â²-triphosphate (GTP)-dependent 3â²-5â² RNA ligase that joins 2â²,3â²-cyclic PO4 (cPO4) and 5â²-OH termini of RNA as a subunit of a human tRNA splicing ligase complex9.
a Schematic display of the three-step mechanism of RNA ligation by a 5â²â3â² RNA ligase. In step 1, the ligase is auto-AMPylated on the catalytic lysine using ATP as the co-substrate. In step 2, the AMP is transferred from the catalytic lysine to the 5â²-PO4 end of RNA (pRNA), giving the RNA-adenylate intermediate (AppRNA). In step 3, the ligated RNA is obtained upon the attack of 3â²-OH to the AppRNA in the presence of the ligase along with the liberation of AMP. PPi, pyrophosphate. b Structures of Ap3A analogues employed in this study. c Schematic display of the workflow for the identification of C12orf29. Cell lysates are incubated with C2-eAp3A or controls. AMPylated proteins are expected to bear ethynyl functionalities that enable selective modification with an affinity tag desthiobiotin (DB) via CuAAC. Labelled proteins are enriched and identified by ABPP, and further verified by immunoblotting. d Structure of the azide-bearing desthiobiotin as affinity tag. e Affinity enrichment of C12orf29 from two cell lysates verified by immunoblotting (representative images of nâ=â3). Source data are provided as a Source Data file.
We are interested in nucleotides and their involvement in processes of post-translational modification (PTM)10,11. AMPylation is a PTM in which an AMP molecule is covalently bound to a side chain or the C-terminus of a protein12,13, which is often catalysed by an enzyme using ATP as a co-substrate. Over the past decade, AMPylation has been found to participate in versatile biological processes, such as pathogen infection14,15,16,17, UPR18,19, and cellular redox homoeostasis20. Several ATP derivatives have been designed for profiling AMPylation20,21,22. In recent studies masked AMP analogues were used as probes to investigate AMPylation23,24. After internalisation they were processed to ATP and used by the cellular machinery for protein AMPylation.
Here, to further dissect the human AMPylation proteome, we have synthesised an alkyne-modified chemical probe based on diadenosine triphosphate (Ap3A, Fig. 1b)âa naturally occurring nucleotide whose concentration increases as a result of cellular stress and with higher stability in cell lysates than ATP25. Using this chemical probe, we have conducted activity-based proteomic profiling (ABPP) (Fig. 1c, d) and enriched chromosome 12 open reading frame 29 (C12orf29), a hitherto uncharacterised protein, from human cell lysates (Fig. 1e). C12orf29 is a 37âkDa human protein consisting of 325 amino acids. Sequence analysis shows that it is highly conserved among the chordate superphylum26 with C12orf29 orthologs being present in vertebrates. In contrast, we could not identify C12orf29 orthologs in any other eukaryotes, except for a few species including gastropods, bivalves, and cephalopods within the phylum Mollusca26 (Supplementary Fig. 1). Our subsequent investigation identified C12orf29 as a human 5â²â3â² RNA ligase that operates via a tandem auto- and RNA-AMPylation mechanism. Preliminary cellular studies also demonstrate its involvement in maintaining RNA integrity under reactive oxygen species (ROS)-induced cellular stress conditions. Overall, our work presents a chemistry-led functional identification of a previously unknown protein as a human 5â²â3â² RNA ligase, which suggests a latent RNA repair machinery.
Results
Discovery of C12orf29 by chemical proteomics dedicated to identifying AMPylated proteins
To study the human AMPylated proteome, we designed and synthesised a C2,C2â²-ethynyl-modified Ap3A probe (C2-eAp3A) (Fig. 1b and see details in Supplementary Methods). Once incubated with whole-cell lysates, this chemical probe was expected to be used as a co-substrate in place of ATP during the AMPylation of cellular proteins. Consequently, a target protein would be covalently modified with a C2-alkynyl AMP group, which would allow the attachment of an affinity tag, desthiobiotin (DB) (Fig. 1bâd), via Cu(I)-catalysed azide-alkyne cycloaddition (CuAAC) for downstream ABPP. Among the proteins identified from both human non-small cell lung carcinoma cells (H1299) and human embryonic kidney cells (HEK293T), one caught our attention since it had not yet been characterised: C12orf29 (see details in Supplementary Fig. 2 and Supplementary Data 1, 2 and 3). Its enrichment from H1299 and HEK293T was further verified by immunoblotting of the elution fractions using an anti-C12orf29 antibody (Fig. 1e).
To confirm the AMPylation of C12orf29, we prepared the recombinant wild-type (WT) C12orf29. Surprisingly, liquid chromatography-mass spectrometry (LC-MS) analysis revealed that the protein purified from Escherichia coli (E. coli) had a mass of 38,101âDaâ329âDa higher than the calculated mass of 37,771 (found 37,772) Da for C12orf29WTâin consistence with an AMPylated form (Fig. 2a). Furthermore, the AMPylation of C12orf29 was validated by immunoblotting using a monoclonal antibody against AMPylation (Fig. 2a)27. Meanwhile, we also developed an expression and purification protocol that allowed the preparation of recombinant C12orf29 without AMPylation (Fig. 2a and see details in âMethodsâ section). The AMPylated and the non-modified form of C12orf29 showed different migration behaviours in an SDS-PAGE gel (Fig. 2a). With C12orf29WT and C12orf29WT-AMP in hand, we further proved that C12orf29 was auto-AMPylated using ATP as a co-substrate in the presence of Mg2+ as the most proficient cofactor (Fig. 2b).
a (Top) immunoblotting of C12orf29WT-AMP and C12orf29K57A and LC-MS analysis of C12orf29WT-AMP. C12orf29K57A is deficient in auto-AMPylation activity (representative images of n = 3). Mass spectra indicating a 329 Da increase in mass upon auto-AMPylation of C12orf29WT. C12orf29WT-AMP: calc. 38,100 Da, found 38,101 Da. C12orf29WT: calc. 37,771 Da, found 37,772 Da. (Bottom) preparation of C12orf29WT and C12orf29WT-AMP with different buffer systems (representative images of n = 3). b Divalent metal ion dependency of auto-AMPylation activity of C12orf29WT. c Superimposition of the structure of C12orf29 predicted by AlphaFold (green)31,51 on the structure of NgrRnl-AMP (PDB ID: 5COT, grey). OB, oligonucleotide-binding. NT, nucleotidyltransferase. Structures were superposed in Coot52 using structural equivalent residues identified by the DALI webserver32. d Enlarged view of the catalytic site of NgrRnl-AMP (PDB ID: 5COT, grey) and predicted C12orf29 (green). In the NgrRnl-AMP structure, AMP is covalently attached to the side chain of K170. D172, E227, and E312 bind Mn2+ via water-mediated contacts and K326 contacts the phosphate moiety of AMP. Corresponding residues in the putatively catalytic site of C12orf29 are indicated. Mn2+ and water molecules are depicted as yellow and red spheres, respectively. Atomic contacts are depicted as dashed lines. Source data are provided as a Source Data file.
C12orf29 is an RNA ligase operating via sequential auto- and RNA AMPylation
In parallel, we performed structure prediction of C12orf29 using bioinformatic tools. A putative structure of a truncated form of C12orf29 was obtained by Phyre228 based on the identity of 22 out of 71 residues (195-265) with those of Naegleria gruberi RNA ligase (NgrRnl)29,30. NgrRnl is a 5â²â3â² RNA ligase and as such operates via the classic three-step mechanism (Fig. 1a)29,30. The structural similarity between C12orf29 and NgrRnl motivated us to investigate whether C12orf29 also possessed RNA ligase activity. Recently, AlphaFold31 structure predictions for almost all proteins of the human genome were made available at https://alphafold.ebi.ac.uk31. We submitted the AlphaFold structure prediction for C12orf29, which was expected to be of high quality (average pLDDT = 91.4), to the Dali server32, and compared it exhaustively to a representative subset of the Protein Data Bank. The results of structural superposition by Dali are highly significant and display RNA ligases as the most similar structures (top hit, Z-score = 7.4 for NgrRnl), confirming our initial sequence-based suggestion for the functional assignment of C12orf29. In turn, the structure of C12orf29 predicted by AlphaFold was superimposed on NgrRnl (Fig. 2c, d and Supplementary Fig. 3). While an N-terminal oligonucleotide-binding (OB) domain as in NgrRnl is not observed for C12orf29, several secondary structure patterns in the predicted C12orf29 structure precisely overlap with the C-terminal nucleotidyltransferase (NT) domain of NgrRnl (Fig. 2c, d). Of note, C12orf29 contains a lysine at position 57 within the conserved sequence motif KX(D/H/N)G that defines the superfamily of nucleotidyltransferases (Supplementary Fig. 3)33. Indeed, mutation of the putatively critical lysine at position 57 to alanine (K57A) abolishes auto-AMPylation of the recombinant C12orf29 purified from E. coli (Fig. 2a).
Encouraged by the auto-AMPylation activity of C12orf29 in the presence of ATP and either Mg2+ or Mn2+ (Fig. 2b), we next investigated whether C12orf29 had 5â²â3â² RNA ligase activity. We first assessed the nick-sealing activity of C12orf29 with a collection of double-stranded substrates consisting of a DNA oligonucleotide splint annealed to two DNA or RNA oligonucleotides that were separated by a nick to be ligated. Although the nick-sealing activity was reported for NgrRnl29, it was not observed with C12orf29 (Supplementary Fig. 4). In turn, we focused on single-stranded RNA (ssRNA) substrates and tested C12orf29 with constructs folded into an âopen hairpinâ (Fig. 3a). The 5â²-terminus of one strand (17 nt) within the open loop was labelled with 32P-phosphate while the other strand (10 nt) was unlabelled (Fig. 3a and Supplementary Table 1). Successful ligation would lead to a 27-nt oligonucleotide with significantly altered migration within a denaturing PAGE gel. Indeed, we have found that C12orf29 is proficient in ligating the ssRNA overhangs to give a product that migrates at the expected length (Fig. 3a). Moreover, the formation of self-cyclised RNA as well as AppRNA was also observed. The enzyme exhibits profound selectivity for RNA:RNA constructs are efficiently ligated within short time while the respective DNA constructs are not detectably converted under the very same conditions (Fig. 3a). We also investigated whether nucleotides other than ATP represented proficient co-substrates and found that other adenosine nucleotides such as diadenosine tri- and tetraphosphate (Ap3A, Ap4A), and to a lesser extent even dATP were used as co-substrates, while NAD+ was not used for promoting RNA ligation (Fig. 3b). Among the other nucleotides investigated, only GTP enables RNA ligation (Fig. 3b). Additional kinetic investigations, however, have revealed that the apparent catalytic efficiency (kcat/KM) of C12orf29 is about 29-fold higher with ATP than with GTP. This difference is mainly due to differences in KM while the kcat is similar for both nucleotides (Fig. 3c, Supplementary Fig. 5, and Supplementary Notes).
RNA/DNA oligonucleotides are schematically depicted in blue/green, respectively. The 32P-labelled 5â²-ends are depicted in red. The radioactive oligonucleotides were resolved by denaturing PAGE and analysed by phosphorimaging. a (Top) Schematic display of two nucleic acid substrates used. (Bottom) PAGE analysis of reaction products after incubation with C12orf29 for various time points as indicated. Ligation was observed for 5â²-phosphorylated RNA (representative images of nâ=â3). b Reaction of C12orf29 with indicated RNA constructs using various nucleotides (representative images of nâ=â3). Besides ATP, C12orf29 is able to efficiently catalyse ligation by processing GTP, dATP, Ap3A, and Ap4A. Nuc., nucleotide. c (Left) C12orf29 promoted ligation reaction at various concentrations of ATP and GTP (representative images of nâ=â3). (Right) Michaelis-Menten fits to initial rates providing KM, kcat and kcat/KM for ATP and GTP. Plotted data represent the mean value ± SD for three biological replicates. d (Left) Proposed ligation scheme starting from 2â²,3â²-cyclic phosphorylated RNA by the sequential action of ANGEL2-ÎN and C12orf29WT. (Right) PAGE analysis of the depicted reaction (representative images of nâ=â3). All oligonucleotide sequences are provided in Supplementary Table 1. Source data are provided as a Source Data file.
Next, we investigated if RNA substrates with a 3â²-terminus bearing a 2â²,3â²-cyclic phosphate (cPO4) or a 2â²-phosphate end could be ligated by C12orf29. Neither construct was processed by the protein (Supplementary Fig. 6). However, when the RNA substrate with a 2â²,3â²-cPO4 at the 3â²-terminus was incubated in the presence of N-terminus-truncated ANGEL2 (ANGEL2-ÎN)âan enzyme recently reported to cleave 2â²,3â²-cPO4 from the 3â²-terminus to furnish a non-phosphorylated 3â²-terminus34âwe observed C12orf29-dependent ligation to occur (Fig. 3d). Altogether, these results show that non-phosphorylated 3â-RNA termini are essential for the C12orf29-mediated RNA ligation.
In order to investigate the structural requirements for C12orf29-catalysed RNA ligation within the single-stranded regions, we designed and studied single RNA oligonucleotides modified with 32P-phosphate at the 5â² ends, which folded into dumbbell structures bearing two single-stranded regions at the 5â² and 3â² ends (Fig. 4a, b). By varying the nucleotide composition at the ligation site, we have found that RNA constructs bearing purines at the ligation site are most efficiently processed (Fig. 4a). Moreover, RNA with longer 5â²-overhangs are also ligated more efficiently (Fig. 4b).
RNA oligonucleotides that were used for intramolecular ligation resulting in cyclisation are schematically depicted. The 32P-labelled 5â²-ends were depicted in red. The radioactive oligonucleotides were resolved by denaturing PAGE and analysed by phosphorimaging. a, Investigation of the impact of the nucleobase composition at the 5â²- and 3â²- termini at the ligation site on the ligation efficiency of C12orf29 (representative images of nâ=â3). The depicted constructs were incubated under the same conditions for various time points. Most efficient ligation was observed when purines were at the ligation site. b Investigation of the impact of the length of the 5â²-overhang on the ligation efficiency of C12orf29 (representative images of nâ=â3). c Impact of single site mutations on RNA ligase efficiency of C12orf29. Graph bars represent mean ligation efficiencies ± SEM and hollow circles represent individual data points for nâ=â3 biological replicates. All oligonucleotide sequences are provided in Supplementary Table 1. Source data are provided as a Source Data file.
Previous studies have identified several key residues at the catalytic centre of NgrRnl, which are responsible for substrate and cofactor coordination that underpinned the NgrRnl-mediated RNA ligation29,30. As shown in Fig. 2e and Supplementary Figs. 1 and 3, these residues are conserved in C12orf29 (i.e., K57, D59, E195, E250, K263), two of which are identified to be mutated in cancer cells (e.g., D59N35, K263N36). In addition, mutations outside the catalytic site (e.g., R77L, E123D) are also found in patients suffering from glioblastoma37 and chronic lymphocytic leukaemia38, respectively. To examine the importance of these residues for RNA ligation, a series of C12orf29 variants bearing mutations at the respective positions were prepared and tested. Most of the point mutations are detrimental to RNA ligation (i.e., no detectable RNA ligation activity, Fig. 4c), whereas E123D and E123Q are fully active and R77L retained ~12% activity.
Knockout (KO) of C12orf29 impedes cellular resilience to reactive oxygen species
To gain insights into the function of C12orf29 in a cellular context, we generated human embryonic kidney (HEK293) cells, in which the C12ORF29 gene (Supplementary Fig. 7) was knocked-out by CRISPR/Cas, and compared the properties of these cells with those of parental HEK293 cells expressing the WT enzyme. When cells were grown under physiological conditions, WT and KO cells showed similar phenotypes with respect to cell growth, adherence, and appearance (Fig. 5a). However, when cells were treated with up to 40 μM menadioneâa molecule known to generate ROS-based cellular stress39,40âsignificant differences were observed in the viability levels of WT and KO cells (Fig. 5a). In comparison to WT cells, KO cells started to round up and detach at lower concentrations of menadione. Indeed, KO cells are more vulnerable to the treatment of menadione (Fig. 5b, c). For example, after a 3-hour treatment with 40 μM menadione, >85% WT cells were found to remain viable, whereas only <10% KO cells were alive. We also measured the level of ROS induced by 40 μM menadione over time by bioluminescence in both cell lines. Although a similar ROS level was found for KO cells after a 90-min treatment and WT cells after a 120-min treatment, the cell viability levels of the two cell lines differed by >50% (viability >85% for WT and <30% for KO) (Fig. 5c).
a Light microscopy of HEK293 WT and C12ORF29-KO cells. The cells were treated with 40âμM menadione for 3âh or only with the carrier as control (0âμM menadione). Scale bars are 100 μm. b Cell viability of HEK293 WT and C12ORF29-KO cells after 3âh treatment with different menadione concentrations. Hollow circles represent individual data points for nâ=â9 biological replicates. Error bars represent the ± SEM. Significance was calculated by two-way ANOVA with Sidakâs multiple comparisons test: nsPâ>â0.05; *Pââ¤â0.05; ****Pââ¤â0.0001. c Cell viability and corresponding H2O2 concentrations in HEK293 WT and C12ORF29-KO cells at different time points after treatment with 40âμM menadione. Hollow circles represent individual data points for nâ=â3 biological replicates. Error bars represent the ± SEM. Significance was calculated by two-way ANOVA with Sidakâs multiple comparisons test: nsPâ>â0.05; ****Pââ¤â0.0001. RLU, relative light units. Source data are provided as a Source Data file.
Next, we isolated and analysed the total RNA from the two cell lines treated with various concentrations of menadione (Fig. 6). Very prominent in this analysis are the signals of 28S and 18S rRNA. We found that the RNA degradation level depended on the concentration of the menadione applied. While the signal for 18S rRNA remained almost constant even at higher menadione concentrations, 28S rRNA appeared to be more susceptible to ROS damage. More importantly, in KO cells, 28S rRNA was found to start degrading upon treatment of 40 μM menadione. In contrast, 100 μM menadione treatment is required to induce significant 28S rRNA degradation in WT cells (Fig. 6 and Supplementary Table 2). Since the intracellular ROS level triggered by menadione treatment was similar in both cell lines (Fig. 5c), we attributed the more pronounced RNA decay in KO cells to the lack of C12orf29. This indicated that C12orf29 was important in maintaining RNA integrity under stress conditions.
(Top) Cells (left: HEK293 WT cells; right: C12ORF29-KO HEK293 cells) were treated with different menadione concentrations for 3âh. In turn, total RNA was isolated and subsequently analysed by TapeStation (version 4.1.1). The ratio of 28S and 18S rRNA intensities were listed below the electrophoregram (a.u., arbitrary units). (Bottom) Electropherograms of the TapeStation analysis of total RNA from cell extracts of HEK293 WT (in red) and C12ORF29-KO (in yellow) cells. Source data are provided as a Source Data file.
Discussion
Here, we have devised an Ap3A-based chemical probe to interrogate the human AMPylated proteome. As a structural analogue of ATP, Ap3A has shown superior chemical and enzymatic stability under reaction conditions25, rendering Ap3A-based probes promising alternatives to enrich AMPylated proteins from cell lysates without the need of external AMPylators20,21,22. The design of chemical probes based on dinucleoside polyphosphates can be potentially adopted by future research into protein NMPylation (e.g., UMPylation, GMPylation). Given the symmetrical design of the Ap3A probe, it might be incorporated into cellular proteins upon both phosphorylation or AMPylation. We therefore note that care should be taken when interpreting the proteomic data to recognise potentially phosphorylated populations.
Using the Ap3A-based chemical probe, we have discovered C12orf29 as a 5â²â3â² RNA ligase in humans, which bears auto-AMPylation activity. We have shown in detail that C12orf29 ligates 5â²-PO4 and 3â²-OH RNA termini within single-stranded regions via a three-step catalytic mechanism involving successive auto- and RNA-AMPylation. RNA lesions are generated by a transesterification reaction initiated by a 2â²-OH and an adjacent 3â²-5â²-phosphodiester linkage to result in 2â²,3â²-cPO4 and 5â²-OH termini. So far, only a GTP-dependent 3â²-5â² RNA ligase, RtcB (or HSPC117)9, has been reported in human, which ligates such RNA termini via successive auto- and RNA-GMPylation41,42. Alternative multi-step repair pathways are seen across viruses, fungi, and plants, which involve a phosphatase and a kinase to convert 2â²,3â²-cPO4 and 5â²-OH termini to 3â²-OH and 5â²-PO4 ends respectively for 5â²â3â² RNA ligation1,33,43. Our discovery of a human 5â²â3â² RNA ligase suggests a latent âhealing and sealingâ RNA repair mechanism in human.
Several nucleic acid ligases are known to be involved in repair of nucleic acids that are damaged e.g., by ROS44. Therefore, we investigated C12ORF29-KO cells in their response to ROS. In preliminary cellular studies, we have shown that C12ORF29-KO cells exhibit poor resilience to ROS with concurrent RNA (esp. rRNA) decay, which indicates the role of C12orf29 in maintaining RNA integrity as a cellular adaptive behaviour under oxidative stress conditions. rRNA accounts for >85% of the total RNA in most organisms as a core component of ribosomes45, of which the synthesis is a major energy-consuming event in cells46. We reason that it is therefore energy- and time-saving for cells to rapidly restore functional rRNA by RNA repair, where C12orf29 may participate as an RNA ligase, rather than by de novo synthesis.
While C12orf29 does not appear to be necessary for cell viability under normal growth conditions, it is essential for maintaining RNA integrity and cell viability upon ROS-induced stress. In contrast, the RNA ligase RtcB and its homologues are reported to play vital roles in intron-containing tRNA maturation9 and X-box binding protein 1 (XBP1) mRNA splicing during UPR3. Furthermore, it has been suggested that the E. coli RtcB can repair nuclease- or antibiotic-induced rRNA damage47,48. The recently identified 2â²,3â²-cylic phosphatase ANGEL2 can hydrolyse the 2â²,3â²-cPO4 at 3â²-termini to 2â²-OH-3â²-OH, thereby antagonising RtcB-mediated tRNA and XBP1 mRNA splicing34. This antagonistic effect may suggest another layer of regulation of RNA processing events. Along with the 5â-OH RNA kinase HsClp149, RNA termini are processed to 2â²-OH-3â²-OH and 5â²-PO4, which can be recognised by C12orf29 for ligation. Therefore, we speculate that C12orf29 may be involved in an RNA ligation machinery in concert with other RNA processing enzymes when the RNA termini and activities are compromised in the RtcB-mediated RNA ligation system. Further work to decipher the endogenous interactors and RNA substrates of C12orf29 are required to fully elucidate the C12orf29-mediated RNA ligation pathway.
Collectively, we have demonstrated a chemistry-led discovery of the human 5â²â3â² RNA ligase C12orf29. Therefore, we propose to name the protein after Homo sapiens RNA ligase (HsRnl).
Methods
Cell culture and lysate preparation
HEK293T (ATCC) or H1299 (ATCC) cells were cultured in DMEM (GibcoTM) supplemented with 10% (v/v) foetal bovine serum at 37â°C, 5% CO2. Cells were harvested by centrifugation for 10âmin at 500 à g at 4â°C, and washed with ice-cold 1à PBS thrice. The supernatant was discard. Cell pellets were frozen in liquid nitrogen and stored at â80â°C.
Cell pellets were resuspended in lysis buffer (1à PBS pH 7.4, 1âmM EDTA, and 1à cOmpleteâ¢, EDTA-free protease inhibitor cocktail (Roche)) on ice and lysed by sonication. The lysates were cleared by centrifugation (30âmin, 21,885 à g, 4â°C). The protein concentration of the supernatant was determined by bicinchoninic acid assay.
Chemical proteomics towards the identification of C12orf29
The synthesis of the probes is detailed in the Supplementary Methods. The NMR data were processed by MestReNova (version 14.1.2-25024). In the chemical proteomics assay, 200âμM Ap3A, C2-eAp3A, or MilliQ® H2O were incubated with 2.0âmg/mL H1299 or HEK293T cell lysates in 1à AMPylation buffer (20âmM HEPES pH 7.4, 100âmM NaCl, 5âmM MgCl2, and 1âmM DTT) at 37â°C for 1âh in a total volume of 450âμL. The reaction was stopped by adding 1.8âmL pre-cold MeOH. The resulting mixture stood at â20â°C for 2âh to precipitate. Protein pellets were obtained after centrifugation at 14,000 à g for 10âmin at 4â°C, which were dried for 5âmin and reconstituted in 450âμL 1à resuspension buffer (50âmM triethanolamine pH 7.4, 150âmM NaCl, and 4% SDS). A master mix was prepared freshly with 0.5âmM N3-(Arg)PEG3-DB, 2.5âmM CuSO4, 0.25âmM TBTA, and 2.5âmM TCEP in 0.4à AMPylation buffer. Three hunderd microliters of the master mix was added to the pre-cold reaction mixture to yield 0.2âmM N3-(Arg)PEG3-DB, 1.0âmM CuSO4, 0.1âmM TBTA, and 1.0âmM TCEP in a total volume of 750âμL. The CuAAC was conducted at 25â°C for 1âh, which was quenched by adding 3âmL pre-cold acetone. The resulting mixture stood at â20â°C overnight to precipitate. Protein pellets were obtained after centrifugation at 14,000 à g for 10âmin at 4â°C, which were washed with 300âμL cold MeOH trice and dried for 5âmin. The pellets were reconstituted in 200âμL 1à PBS pH 7.4 supplemented with 4% SDS, followed by addition of 800âμL 1à PBS pH 7.4 and centrifugation at 12,000 à g for 5âmin at room temperature to remove any undissolved residue. The supernatant was incubated with high capacity streptavidin agarose beads in a bed volume of 25âμL at 25â°C for 15âmin with an end-over-end rotator. The beads were pelleted by centrifugation at 150 à g for 2âmin at room temperature, which were washed successively with 1à PBS pH 7.4 supplemented with 1% SDS (3 à 100âμL), washing buffer (8 à 100âμL, 1à PBS pH 7.4, 150âmM NaCl, 4âM urea, and 1% SDS), and 50âmM NH4HCO3 pH 7.8 (5 à 100âμL). The beads were treated with 0.8âmM biotin in 50âmM NH4HCO3 pH 7.8 supplemented with 0.1% RapiGest SF (3 à 50âμL) and incubated at 37â°C for 10âmin with shaking at 600ârpm to elute the AMPylated proteins. The elution fractions were kept on ice for downstream in-solution digestion (see below). Alternatively, the elution fractions were added 6x loading buffer and heated to 95â°C for denaturation and concentration. The resulting mixture was resolved by SDS-PAGE and subjected to immunoblotting. Mouse anti-C12orf29 antibody (Santa Cruz Biotechnology, sc-390730, 1:1000) and goat anti-mouse HRP-conjugated antibody were (Jackson ImmunoResearch, 115-035-062, 1:30,000) were used as primary and secondary antibody, respectively.
Prior to MS analysis, the elution fractions were treated with 5âmM DTT at 60â°C for 1âh. After cooling down, the mixture was treated with 50âmM 2-chloroacetamide at room temperature for 1âh in dark. In turn, the resulting mixture was incubated with 3âμg trypsin at 37â°C overnight. The digested protein mixture was added 3âμL TFA and incubated at 37â°C for 45âmin to hydrolyse RapiGest SF. The resulting mixture was cleared by centrifugation at 12,000 à g for 30âmin at room temperature. The supernatant was transferred carefully and lyophilised overnight. Finally, the tryptic peptides were dissolved in 60âμL 0.1% TFA supplemented with 10âμL 10% TFA and desalted using U-C18 ZipTips. Tryptic peptides were separated on an EASY-nLC 1200 system (Thermo Scientific) at a flow rate of 300ânl/min using a 39âmin gradient from 2.5% MeCN/0.1% formic acid to 32% MeCN/0.1% formic acid, 1âmin to 75% MeCN/0.1% formic acid, followed by a 5âmin washing step at 75% MeCN/0.1% formic acid. Mass spectra were recorded on a QExactive HF mass spectrometer (Thermo Scientific) operated in data dependent Top20 mode with dynamic exclusion set to 5âs. Full scan MS spectra were acquired at a resolution of 120,000 (at m/z 200) and scan range 350â1600âm/z with an automatic gain control target value of 3e6 and a maximum injection time of 60âms. Most intense precursors with charge states of 2â6 reaching a minimum automatic gain control target value of 1e3 were selected for MS/MS experiments. Normalised collision energy was set to 28. MS/MS spectra were collected at a resolution of 30,000 (at m/z 200), an automatic gain control target value of 1e5 and 100âms maximum injection time. Each of the biological triplicates was measured as technical duplicates resulting in six measurements per condition (Ap3A, C2-eAp3A and bead control).
Raw files from LCâMS/MS measurements were analysed using MaxQuant (version 1.6.1.0) with match between runs and label-free quantification (LFQ) (minimum ratio count 1) enabled. The minimal peptide length was set to 5. For protein identification, the human reference proteome downloaded from the UniProt database (download date: 2018-02-22) and the integrated database of common contaminants were used. Data of different cell types were quantified in separate analyses. Further data processing was performed using Perseus software (version 1.6.15.0). Identified proteins were filtered for reverse hits, common contaminants and proteins that were only identified by site. LFQ intensities were log2 transformed, filtered to be detected in at least 4 out of 6 replicate measurements of at least one condition and missing values were imputed from a normal distribution (width 0.3 and shift 1.8), based on the assumption that these proteins were below the detection limit. LFQ intensities of technical replicates were averaged and significantly enriched proteins were identified by pairwise comparisons. For each cell type (HEK293T and H1299) two Studentâs t-tests (S0â=â0.1 and FDRâ=â0.001) were performed, separately comparing C2-eAp3A against Ap3A and against the bead control. Only proteins significantly enriched and showing a t-test difference >1 in both comparisons were considered to be C2-eAp3A-enriched for the cell type.
Chemical proteomics mass spectrometry data have been deposited to the ProteomeXchange Consortium via the PRIDE50Â partner repository with the dataset identifier PXD038132.
Expression and purification of recombinant proteins
As for expression and purification of AMPylated C12orf29WT and its variants, plasmid constructs pET15b-C12orf29WT or plasmids containing inserts of C12orf29 variants were transformed in E. coli BL21 (DE3) competent cells, which were cultured in 50âmL LB medium containing 100âμg/mL carbenicillin at 37â°C, 180ârpm overnight. In turn, a defined volume of cell suspension was transferred to 1âL LB medium containing 100âμg/mL carbenicillin to reach OD600â=â0.1, followed by the incubation at 37 °C at 180ârpm until OD600â=â0.7. The mixture was cooled down on ice for 30âmin and then incubated with 1.0âmM IPTG at 18 °C for 18âh at 180ârpm. Cells were harvested by centrifugation at 8000 à g for 30âmin at 4 °C. The pellet was resuspended in 30âmL cold lysis buffer (50âmM Tris-HCl pH 8.0, 150âmM NaCl, 1.0âmM DTT, 0.1% (v/v) Triton X-100, 20âmM imidazole, 1âμg/mL aprotinin, 1âμg/mL leupeptin, and 1âmg/mL Pefabloc® SC) and lysed by sonication on ice. The lysates were centrifuged at 40,000 à g for 30âmin at 4 °C and filtered through 0.45 μm syringe filter. The N-terminal His6-tagged AMPylated C12orf29 was purified using a 5âmL HisTrapTM FF crude column (Buffer A: 50âmM Tris-HCl pH 8.0, 150âmM NaCl, 1.0âmM DTT, and 20âmM imidazole; Buffer B: 50âmM Tris-HCl pH 8.0, 150âmM NaCl, 1.0âmM DTT, and 500âmM imidazole). Fractions containing AMPylated His6-C12orf29 were pooled and dialysed against a buffer containing 50âmM Tris-HCl pH 8.0, 100âmM NaCl, 5 U/mg thrombin, and 1âmM DTT at 4 °C overnight. The resulting solution was further purified by anion IEX on a 5âmL HiTrapTM Q HP column (Buffer A: 50âmM Tris-HCl pH 8.0, 100âmM NaCl, and 1.0âmM DTT; Buffer B: 50âmM Tris-HCl, pH 8.0, 1000âmM NaCl, and 1.0âmM DTT). Pure fractions were pooled, concentrated, and stored at â20 °C in a storage buffer containing 25âmM Tris-HCl pH 8.0, 100âmM NaCl, 1âmM DTT, and 50% (v/v) glycerol.
As for expression and purification of deAMPylated C12orf29WT and its variants, plasmid constructs pET15b-C12orf29WT or plasmids containing inserts of C12orf29 variants were transformed in E. coli BL21 (DE3) competent cells, which were cultured in 50âmL LB medium containing 100âμg/mL carbenicillin at 37 °C, 180ârpm overnight. In turn, a defined volume of cell suspension was transferred to 1âL LB medium containing 100âμg/mL carbenicillin to reach OD600â=â0.1, followed by the incubation at 37 °C at 180ârpm until OD600â=â0.7. The mixture was cooled down on ice for 30âmin and then incubated with 1.0âmM IPTG at 18 °C for 18âh at 180ârpm. Cells were harvested by centrifugation at 8000 à g for 30âmin at 4 °C. The pellet was resuspended in 30âmL cold lysis buffer (50âmM KH2PO4 pH 8.0, 10âmM Na4P2O7, 150âmM NaCl, 1.0âmM DTT, 0.1% (v/v) Triton X-100, 20âmM imidazole, 1âμg/mL aprotinin, 1âμg/mL leupeptin, and 1âmg/mL Pefabloc® SC) and lysed by sonication on ice. The lysates were centrifuged at 40,000 à g for 30âmin at 4 °C and filtered through 0.45 μm syringe filter. The N-terminal His6-tagged deAMPylated C12orf29 was purified using a 5âmL HisTrapTM FF crude column (Buffer A: 50âmM KH2PO4 pH 8.0, 10âmM Na4P2O7, 150âmM NaCl, 1.0âmM DTT, and 20âmM imidazole; Buffer B: 50âmM KH2PO4 pH 8.0, 10âmM Na4P2O7, 150âmM NaCl, 1.0âmM DTT, and 500âmM imidazole). Fractions containing His6-C12orf29 were pooled and dialysed against a buffer containing 50âmM Tris-HCl pH 8.0, 100âmM NaCl, 5 U/mg thrombin, and 1âmM DTT at 4 °C overnight. The resulting solution was further purified by anion IEX on a 5âmL HiTrapTM Q HP column (Buffer A: 50âmM Tris-HCl pH 8.0, 100âmM NaCl, and 1.0âmM DTT; Buffer B: 50âmM Tris-HCl, pH 8.0, 1000âmM NaCl, and 1.0âmM DTT). Pure fractions were pooled, concentrated, and stored at â20 °C in a storage buffer containing 25âmM Tris-HCl pH 8.0, 100âmM NaCl, 1âmM DTT, and 50% (v/v) glycerol.
As for expression and purification of ANGEL2-ÎN, plasmid constructs pET15b-ANGEL2-ÎN were transformed in E. coli BL21 (DE3) cells, which were cultured in 50âmL LB medium containing 100âμg/mL carbenicillin at 37 °C, 180ârpm overnight. In turn, a defined volume of cell suspension was transferred to 1âL LB medium containing 100âμg/mL carbenicillin to reach OD600â=â0.1, followed by the incubation at 37 °C at 180ârpm until OD600â=â0.7. The expression was induced by the addition of 1.0âmM IPTG after cooling down the mixture on ice for 30âmin. The mixture was then incubated at 18 °C for 18âh at 180ârpm before the cells were harvested by centrifugation at 8,000 à g for 30âmin at 4 °C. The pellet was resuspended in 30âmL cold lysis buffer (50âmM Tris-HCl pH 8.0, 100âmM KCl, 1âmg/mL lysozyme, 0.1% (v/v) Triton X-100, 1âmM DTT, 1âμg/mL aprotinin, 1âμg/mL leupeptin, and 1âmg/mL Pefabloc® SC) on ice for 45âmin and sonicated. The lysate was centrifuged at 40,000 à g for 30âmin at 4 °C and filtered through 0.45 μm syringe filter. The His6-tagged ANGEL2-ÎN was purified using a 5âmL HisTrapTM FF crude column (Buffer A: 50âmM Tris-HCl pH 8.0, 100âmM KCl, 1.0âmM DTT, and 20âmM imidazole; Buffer B: 50âmM TrisâHCl pH 8.0, 100âmM KCl, 1.0âmM DTT, and 500âmM imidazole). Fractions containing His6-tagged ANGEL2-ÎN were pooled and dialysed against a buffer containing 50âmM Tris-HCl pH 8.0, 100âmM KCl, 5 U/mg thrombin, and 1âmM DTT at 4 °C overnight. The resulting solution was further purified by anion IEX on a 5âmL HiTrapTM Q HP column (Buffer A: 50âmM Tris-HCl pH 8.0, 100âmM KCl, and 1.0âmM DTT; Buffer B: 50âmM Tris-HCl, pH 8.0, 1000âmM KCl, and 1.0âmM DTT). Pure fractions were pooled, concentrated, and stored at â20 °C in a storage buffer containing 25âmM Tris-HCl pH 8.0, 50âmM KCl, 1âmM DTT, and 50% (v/v) glycerol.
Identification of the intact protein mass
All samples were purified by UHPLC on a Dionex UltiMate3000 (Thermo Fisher Scientific, Germany) using an analytical Zorbax 300SB-C8 column (150âmm à 2.1âmm) with 3.5âµm silica as a stationary phase (Agilent, USA). Prior to purification, all samples were acidified with 10% TFA. Gradient elution (3âmin at 0% Buffer B; in 19âmin to 80% Buffer B; then in 8âmin to 100% Buffer B) with Buffer A (0.02% TFA in water) and Buffer B (0.02% TFA in MeCN/water (80:20, v/v)) was performed at a flow rate of 300âµL/min. The signals were monitored by UV absorbance at 220ânm.
Intact proteins were then analysed by direct infusion on an amazon speed ETD mass spectrometer (Bruker Daltonics) with a flow rate of 4âµL/min. The mass spectrometric data were acquired for about 10âminutes and the final mass spectrum was averaged over the whole acquisition time. Mass spectrometric data were evaluated and deconvoluted using the Compass Data Analysis Version 4.4 (Bruker Daltonics) software.
General procedure of C12orf29 auto-AMPylation assays
Unless otherwise noted, the C12orf29 auto-AMPylation assays were performed as follow. 1.0âμM C12orf29WT-AMP, C12orf29WT or its variants were incubated with 200âμM ATP or a mixture of ATP:α-32P-ATP (185 TBq/mmol, Hartmann Analytic, FP-307) in 9:1 ratio in 1à auto-AMPylation buffer (50âmM Tris-HCl pH 8.5, 5âmM MgCl2, and 1âmM DTT) at 37 °C for 30âmin in a total volume of 18âμL. The reaction was stopped by transferring 15âμL reaction mixture to a pre-cold PCR tube containing 0.43âμL 0.5âM EDTA, 0.36âμL 2âmg/mL BSA, and 3.16âμL 6x loading buffer (50âmM TrisâHCl pH 6.8, 10% (v/v) glycerol, 2% (w/v) SDS, and 1% (v/v) β-mercaptoethanol). The resulting mixture was heated at 95 °C for 5âmin. Samples were resolved by SDS-PAGE and analysed by Coomassie staining, autoradiographic imaging, or immunoblotting. Mouse anti-AMPylation antibody27 (1:1000) and goat anti-mouse HRP-conjugated antibody (Jackson ImmunoResearch, 115-035-062, 1:30,000) were used as primary and secondary antibody, respectively.
Preparation of 5â² 32P-labelling of oligonucleotides
Oligonucleotides (1.0âμM) were incubated with 15 units of T4 PNK (New England BioLabs, M0202S) and 200âμM 0.555 MBq γ-32P-ATP (185 TBq/mmol, Hartmann Analytic, SRP-401) in 1à T4 PNK reaction buffer at 37â°C for 1âh in a total volume of 15âμL. The reaction was stopped by heating to 95 °C for 2âmin. The excess amount of γ-32P-ATP was removed by gel filtration using SephadexTM G-10 resin to give 5â² 32P-lablled oligonucleotides in a concentration of 1.0âμM. When labelling RNA oligos bearing 2â²,3â²-cPO4 or 2â²-PO4-3â²-OH on the 3â²-ends, T4 PNK 3â phosphatase minus (New England BioLabs, M0236S) was used.
General procedures of RNA ligation assays
As for general procedure of RNA ligation with C12orf29, unless otherwise noted, the RNA ligation with C12orf29 were performed as follow. 0.1âμM 5â² 32P-labelled oligonucleotide substrates were incubated with 1âμM C12orf29WT or its variants and 200âμM ATP in 1à RNA ligation buffer (50âmM Tris-HOAc pH 7.0, 5âmM MgCl2, and 1âmM DTT) at 37 °C 1âh in a total volume of 10âμL. The reaction was quenched by adding 10âμL stopping solution (80% (v/v) formamide, 20âmM EDTA, 0.025% (w/v) bromophenol blue, and 0.025% (w/v) xylene cyanol) and heating at 95 °C for 2âmin. One microliter of the resulting mixture was further diluted to give 0.005âμM 5â² 32P-labelled oligonucleotides. Samples were resolved by urea-PAGE and analysed by autoradiographic imaging.
In the time course study, the reaction was performed as stated above with C12orf29WT-AMP and 1 U/µL recombinant RNasin® Ribonuclease Inhibitor (Promega) in a total volume of 20âμL. Aliquots (1.5âμL) were taken at 0, 3, 10, 20, 30, and 60âmin and quenched by adding 28.5âμL stopping solution (50âmM EDTA pH 8.0). The resulting mixture was heated at 95 °C for 2âmin. Samples were resolved by urea-PAGE and analysed by autoradiographic imaging.
RNA ligation in the presence of ANGEL2-ÎN with RNA substrates bearing 2â²,3â²-cPO4 on the 3â² ends
5.0âµM of the 5â²-OH RNA oligo3 substrate was incubated with 10 units of T4 PNK and 1.0âmM ATP in 1x T4 PNK reaction buffer at 37â°C for 1âh in a total volume of 50âµl. The reaction was stopped by heating to 95â°C for 2âmin. The excess of ATP was removed by gel filtration to yield the respective 5â² phosphorylated RNA oligo3 in a concentration of 5.0âµM. 0.5âµM of the 32P-labelled RNA oligo4 was mixed with 0.6âµM of the non-radioactively 5â²-phosphorylated RNA oligo3 in MilliQ® H2O. The RNA strands were annealed using the annealing programme described in general procedure.
0.1âµM of the annealed 5â² 32P-labelled RNA oligo4 and with 5â² non-radioactively phosphorylated RNA oligo3 complexes were incubated with different concentrations of C12orf29, 1âµM ANGEL2-ÎN, and 2.0âmM ATP in 1à RNA ligation buffer at 37â°C for 2âh in a total volume of 10âµL. The reaction was quenched by adding 95âµL stopping solution to 5âµL of the sample and heating to 95â°C for 2âmin. The samples were then resolved by urea-PAGE (12%) and analysed by autoradiographic imaging.
Light-microscopy of menadione-treated HEK293 cells
1.2 à 106 cells were seeded in 4âmL DMEM GlutaMAXTM medium (GibcoTM, Thermo Fisher) supplemented with 10% (v/v) FCS on 6âcm cell culture dishes (Sarstedt). After 48âh, the cells were treated directly with either 4âµL EtOH (control, no menadione) or with 4âµL of 40âmM menadione in EtOH (final concentration = 40âµM menadione). After 3âh, pictures were taken with a light microscope using a 5à objective.
Cell viability assay of HEK293 cells treated with different concentrations of menadione
The CellTiter-Glo® Luminescent Cell Viability Assay (Promega) was used according to the manufacturerâs instruction. 4.0 à 104 cells per well were seeded one day before the menadione treatment in 90âµL DMEM GlutaMAXTM medium with 10% (v/v) FCS in a 96-well plate (Sarstedt). The plate was incubated at 37â°C, 5% CO2, and 100% humidity for 24âh.
1000à menadione stock solutions were prepared in ethanol and frozen in aliquots. On the day of the treatment, aliquots were thawed and diluted 1/100 in MilliQ® H2O. The cells were treated with 10âµL of different concentrations of menadione in MilliQ® H2O. The plate was incubated at 37â°C, 5% CO2, and 100% humidity. After 3âh, the cells were equilibrated to RT (15âmin) and 100âµL of CellTiter-Glo® reagent was added per well and mixed thoroughly. The plate was incubated for 10âmin on a shaker. 100 μL of the solution was transferred to a black 96-well plate and luminescence read-out was performed with a plate reader (PerkinElmer Victor3TM Multilabel Counter 1420).
The luminescence values of the treated cells were compared to the value of the cells of the control treatment, which were treated with 0âµM menadione (equals 100% viability), to give the calculated cell viability.
RNA integrity analysis of HEK293 WT and KO cells treated with different concentrations of menadione
5.0 à 105 cells were seeded in 2âmL DMEM GlutaMAXTM medium with 10% (v/v) FCS in a 6-well plate (Sarstedt) 48âh prior to the experiment. 1000à menadione stock solutions were prepared in ethanol and frozen in aliquots. Two microliters of the respective menadione stock was given to the cells resulting in the final desired menadione concentration for the treatment. After 3âh the cells were scraped down in the present medium and centrifuged (500 à g, 5âmin, 4â°C). The cell pellet was washed with 1âmL ice-cold PBS and centrifuged again (500 à g, 5âmin, 4â°C). The RNA was then extracted from the pellet using Quick-RNA Miniprep Kit (Zymo Research) with the provided in-column DNase I digest. The resulting RNA was analysed using an Agilent 4150 TapeStation system.
Cell viability assay in combination with ROS assay
The ROS-GloTM H2O2 Assay (Promega) was used according to the manufacturerâs instruction. 4.0 à 104 cells per well were seeded one day before the menadione treatment in 70âµL DMEM GlutaMAXTM medium with 10% (v/v) FCS in a 96-well plate (Sarstedt). The plate was incubated at 37â°C, 5% CO2, and 100% humidity for 24âh. 3âh before the final readout, 20âµL of the provided H2O2 substrate in H2O2 substrate dilution buffer was directly added to the cells in their growing medium (final concentration was 25âµM). Afterwards, 10âµL of menadione stock solution in EtOH was added to give the desired end concentration in a final volume of 100âµL. The cells were incubated for the desired stress time at 37â°C. The cells were equilibrated at RT for 15âmin.
(A) For detection of ROS, 50âµL of the present medium of each well was transferred in a new, black 96-well plate and 50âµL of the provided ROS detection solution was added. After 20âmin incubation at RT, the luminescence was measured using a plate reader (PerkinElmer Victor3TM Multilabel Counter 1420).
(B) For the cell viability assay, 50âµL of CellTiter-Glo® reagent was added to the remaining 50âµL medium in the 96-well plate and mixed thoroughly. The plate was incubated for 10âmin on a shaker. 100âµL of the solution was transferred to a black 96-well plate and the luminescence was measured using a plate reader (PerkinElmer Victor3TM Multilabel Counter 1420).
Data availability
The data that support this study are available from the corresponding authors upon request. The study made use of the following publicly available data set: PDB entry: [https://doi.org/10.2210/pdb5COT/pdb] The mass spectrometry proteomics data generated in this study have been deposited in the ProteomeXchange Consortium via the PRIDE partner repository50 under accession code PXD038132. Figures with associated raw data: Figs. 3c, 4c, 5b, c, 6 and Supplementary Fig. 5. Specific data P values are also included within the Source Data file. Source data are provided with this paper.
References
Popow, J., Schleiffer, A. & Martinez, J. Diversity and roles of (t)RNA ligases. Cell. Mol. Life Sci. 69, 2657â2670 (2012).
Amitsur, A., Levitz, R. & Kaufmann, G. Bacteriophage T4 anticodon nuclease, polynucleotide kinase and RNA ligase reprocess the host lysine tRNA. EMBO J. 6, 2499â2503 (1987).
Kosmaczewski, S. G. et al. The RtcB RNA ligase is an essential component of the metazoan unfolded protein response. EMBO Rep. 15, 1278â1285 (2014).
Moldovan, J. B., Wang, Y., Shuman, S., Mills, R. E. & Moran, J. V. RNA ligation precedes the retrotransposition of U6/LINE-1 chimeric RNA. Biol. Sci. 116, 20612â20622 (2019).
Schmidt, C. A., Giusto, J. D., Bao, A., Hopper, A. K. & Matera, A. G. Molecular determinants of metazoan tricRNA biogenesis. Nucleic Acids Res. 47, 6452â6465 (2019).
Paredes, E., Evans, M. & Das, S. R. RNA labeling, conjugation and ligation. Methods 54, 251â259 (2011).
Stark, R., Grzelak, M. & Hadfield, J. RNA sequencing: the teenage years. Nat. Rev. Genet. 20, 631â656 (2019).
Zillmann, M., Gorovsky, M. A. & Phizicky, E. M. Conserved mechanism of tRNA splicing in eukaryotes. Mol. Cell. Biol. 11, 5410â5416 (1991).
Popow, J. et al. HSPC117 is the essential subunit of a human tRNA splicing ligase complex. Science 331, 760â764 (2011).
Götz, K. H. et al. Formation of the alarmones diadenosine triphosphate and tetraphosphate by ubiquitin- and ubiquitin-like-activating enzymes. Cell Chem. Biol. 26, P1535â1543.E5 (2019).
Lehner, M. et al. Profiling ADP-ribosylome living cells. Angew. Chem. Int. Ed. 134, e202200977 (2022).
Casey, A. K. & Orth, K. Enzymes involved in AMPylation and deAMPylation. Chem. Rev. 118, 1199â1215 (2018).
Gulen, B. & Itzen, A. Revisiting AMPylation through the lens of Fic enzymes. Trends Microbiol. 30, 350â363 (2022).
Yarbrough, M. L. et al. AMPylation of Rho GTPases by Vibrio VopS disrupts effector binding and downstream signaling. Science 323, 269â272 (2009).
Müller, P. M. et al. The Legionella effector protein DrrA AMPylates the membrane traffic regulator Rab1b. Science 329, 946â949 (2010).
Neunuebel, M. R. et al. De-AMPylation of the small GTPase Rab1 by the pathogen Legionella pneumophila. Science 333, 453â456 (2011).
Tan, Y. & Luo, Z.-Q. Legionella pneumophila SidD is a deAMPylase that modifies Rab1. Nature 475, 506â509 (2011).
Ham, H. et al. Unfolded protein response-regulated Drosophila Fic (dFic) protein reversibly AMPylates BiP chaperone during endoplasmic reticulum homeostasis. J. Biol. Chem. 289, 36059â36069 (2014).
Sanyal, A. et al. A novel link between Fic (filamentation induced by cAMP)-mediated Adenylylation/AMPylation and the unfolded protein response. J. Biol. Chem. 290, 8482â8499 (2015).
Sreelatha, A. et al. Protein AMPylation by an evolutionarily conserved pseudokinase. Cell 175, 809â821 (2018).
Grammel, M., Luong, P., Orth, K. & Hang, H. C. A chemical reporter for protein AMPylation. J. Am. Chem. Soc. 133, 17103â17105 (2011).
Broncel, M., Serwa, R. A., Bunney, T. D., Katan, M. & Tate, E. W. Global profiling of Huntingtin-associated protein E (HYPE)-mediated AMPylation through a chemical proteomic approach. Mol. Cell. Proteom. 15, 715â725 (2016).
Kielkowski, P. et al. FICD activity and AMPylation remodelling modulate human neurogenesis. Nat. Commun. 11, 517 (2020).
Kielkowski, P. et al. A pronucleotide probe for live-cell imaging of protein AMPylation. ChemBioChem 21, 1285â1287 (2020).
Krüger, L. et al. Chemical proteomic profiling reveals protein interactors of the alarmones diadenosine triphosphate and tetraphosphate. Nat. Commun. 12, 5808 (2021).
C12orf29 orthologs were analysed with NCBI database. 2022. https://www.ncbi.nlm.nih.gov/gene/?term=c12orf29#.
Höpfner, D. et al. Monoclonal anti-AMP antibodies are sensitive and valuable tools for detecting patterns of AMPylation. iScience 23, 101800 (2020).
Kelley, L., Mezulis, S., Yates, C., Wass, M. N. & Sternberg, M. J. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 10, 845â858 (2015).
Unciuleac, M.-C. & Shuman, S. Characterization of a novel eukaryal nick-sealing RNA ligase from Naegleria gruberi. RNA 21, 824â832 (2015).
Unciuleac, M.-C., Goldur, Y. & Shuman, S. Structure and two-metal mechanism of a eukaryal nick-sealing RNA ligase. Proc. Natl Acad. Sci. USA 112, 13868â13873 (2015).
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583â589 (2021).
Holm, L. Dali server: structural unification of protein families. Nucleic Acids Res. 50, W210âW215 (2022).
Shuman, S. & Lima, C. D. The polynucleotide ligase and RNA capping enzyme superfamily of covalent nucleotidyltransferases. Curr. Opin. Struct. Biol. 14, 757â764 (2004).
Pinto, P. H. et al. ANGEL2 is a member of the CCR4 family of deadenylases with 2â,3â-cyclic phosphatase activity. Science 369, 524â530 (2020).
Gao, Y.-B. et al. Genetic landscape of esophageal squamous cell carcinoma. Nat. Genet. 46, 1097â1102 (2014).
Cancer Genome Atlas Research Network. Integrated genomic analyses of ovarian carcinoma. Nature 474, 609â615 (2011).
Frattini, V. et al. The integrated landscape of driver genomic alterations in glioblastoma. Nat. Genet. 45, 1141â1149 (2013).
Sanger Institute. COSMIC, The Catalogue of Somatic Mutations in Cancer. https://cancer.sanger.ac.uk/cosmic. Accessed 25 May 2022.
Misra, H. P. & Fridovich, I. The univalent reduction of oxygen by reduced flavins and quinones. J. Biol. Chem. 247, 188â192 (1972).
Goffart, S., Tikkanen, P., Michell, C., Wilson, T. & Pohjoismäki, J. L. O. The type and source of reactive oxygen species influences the outcome of oxidative stress in cultured cells. Cells 10, 1075 (2021).
Chakravarty, A. K., Subbotin, R., Chait, B. T. & Shuman, S. RNA ligase RtcB splices 3â-phosphate and 5â-OH ends via covalent RtcB-(histidinyl)-GMP and polynucleotide-(3â)pp(5â)G intermediates. Proc. Natl Acad. Sci. USA 109, 6072â6077 (2012).
Tanaka, N., Chakravarty, A. K., Maughan, B. & Shuman, S. Novel mechanism of RNA repair by RtcB via sequential 2â,3â-cyclic phosphodiesterase and 3â-phosphate/5â-hydroxyl ligation reactions. J. Biol. Chem. 286, P43134âP43143 (2011).
Burroughs, A. M. & Aravid, L. RNA damage in biological conflicts and the diversity of responding RNA repair systems. Nucleic Acids Res. 44, 8525â8555 (2016).
Schärer, O. D. Chemistry and Biology of DNA Repair. Angew. Chem. Int. Ed. 42, 2946â2974 (2003).
Kraus, A. J., Brink, B. G. & Siegel, T. N. Efficient and specific oligo-based depletion of rRNA. Sci. Rep. 9, 12281 (2019).
Murayama, A. et al. Epigenetic control of rDNA loci in response to intracellular energy status. Cell 133, 627â639 (2008).
Temmel, H. et al. The RNA ligase RtcB reverses MazF-induced ribosome heterogeneity in Escherichia coli. Nucleic Acids Res. 45, 4708â4721 (2016).
Engl, C., Schaefer, J., Kotta-Loizou, I. & Buck, M. Cellular and molecular phenotypes depending upon the RNA repair system RtcAB of Escherichia coli. Nucleic Acids Res. 5, gkw628 (2016).
Weitzer, S. & Martinez, J. The human RNA kinase hClp1 is active on 3â transfer RNA exons and short interfering RNAs. Nature 447, 222â226 (2007).
Perez-Riverol, Y. et al. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res. 47, D442âD450 (2019).
Tunyasuvunakool, K. et al. Highly accurate protein structure prediction for the human proteome. Nature 596, 590â596 (2021).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D60, 2126â2132 (2004).
Acknowledgements
We thank A. Marquardt and A. Sladewska-Marquardt (Proteomics Center, University of Konstanz) for assistance in mass spectrometry. Y.Y., F.M.S., L.A.S., P.S., L.B.H., M.F., and E.H. thank the Konstanz Research School Chemical Biology for support. We thank D. Höpfner and A. Itzen, UKE Hamburg, for the provision of the anti-AMPylation antibody. A.M. acknowledges the European Research Council for funding of this project (ERC AdG AMP-Alarm, 101019280). F.S. acknowledges the Deutsche Forschungsgemeinschaft for funding (STE 2517/5-1). O.P.S. acknowledges support by the Alexander von Humboldt Foundation.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
Y.Y., F.M.S., L.A.S., O.P.S. and A.M. conceived the study and experimental approach; Y.Y., F.M.S., L.A.S., O.P.S. generated C12orf29 and its mutants, performed AMPylation and RNA ligation experiments, Y.Y. and P.S. conducted the modelling and MS analysis. F.M.S. conducted cellular experiments. L.B.H. and M.F. provided expert expertise in PAGE analysis and AMPylation reactions, respectively. M.S., E.H., F.S., K.D. provided expert expertise on cellular studies, mass spectrometry and modelling, respectively. Y.Y., F.M.S., L.A.S., O.P.S. and A.M. analysed the data, A.M. wrote the manuscript with input from all authors.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Jun-Seok Lee and the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yuan, Y., Stumpf, F.M., Schlor, L.A. et al. Chemoproteomic discovery of a human RNA ligase. Nat Commun 14, 842 (2023). https://doi.org/10.1038/s41467-023-36451-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-36451-x
This article is cited by
-
Noncanonical circRNA biogenesis driven by alpha and gamma herpesviruses
The EMBO Journal (2025)
-
DDX1 is required for non-spliceosomal splicing of tRNAs but not of XBP1 mRNA
Communications Biology (2025)
-
Analysis of chimeric reads characterises the diverse targetome of AGO2-mediated regulation
Scientific Reports (2023)
-
Insights into the structure and function of the RNA ligase RtcB
Cellular and Molecular Life Sciences (2023)








