Exogenous synthetic RNA references, known as spike-ins, serve as vital calibration standards in a range of RNA assays, including gene expression microarrays, reverse transcription quantitative polymerase chain reaction (RT-qPCR), and RNA sequencing (RNA-Seq) analyses. Popular spike-ins like Spike-In RNA Variants (SIRV) and External RNA Controls Consortium (ERCC) mixes are widely used due to their reproducibility and well-established concentration levels, enabling accurate quantification and normalization of gene expression across experimental samples1,2.
However, a significant drawback of commercial spike-in mixes is their lack of the 7-Methylguanosine (m7G) cap structure, typically present at the 5’ end of natural eukaryotic RNA transcripts. This m7G cap plays crucial roles in various cellular processes, including mRNA stability, translation efficiency, and mRNA export from the nucleus. Importantly, many library preparation protocols for RNA sequencing rely on the presence of m7G cap structures to distinguish between complete and incomplete transcripts and to efficiently capture and amplify full-length cDNA molecules.
Due to the absence of m7G caps, commercial spike-ins are unsuitable for library preparation methods requiring 5’ cap enrichment steps, potentially introducing biases and inaccuracies in downstream sequencing analyses. To address this limitation, we present a protocol for modifying synthetic spike-ins by adding 5’ m7G caps, thus enabling their compatibility with library preparation procedures reliant on m7G cap recognition.
In this study, we detail a comprehensive method for introducing 5’ m7G caps to synthetic RNA spike-ins, allowing their use as external references in RNA sequencing experiments. Leveraging the pp5'N structure inherent at the 5’ end of spike-in sequences, we utilize the catalytic activity of the vaccinia capping enzyme to add 7-methylguanylate cap structures3,4 (Cap 0). Subsequently, RNA magnetic beads are employed to remove protein constituents, including the capping enzyme, from the spike-ins, ensuring their purity and compatibility with downstream library preparation protocols.
This protocol offers a valuable solution for researchers aiming to integrate synthetic spike-ins into RNA sequencing experiments while maintaining the accuracy and fidelity of gene expression measurements.