TRAID-Seq Protocol
A. Clone rNTase/enzyme of interest into MS2 cassette vector
Clone gene corresponding to rNTase/enzyme of interest using multiple cloning site in MAP72 2μ LEU2 MS2 cassette vector.
Notes: (1) Verify that the chosen restriction enzymes do not cut within the coding sequence of your gene of interest. (2) When designing primers for gene amplification, be sure to remove the stop codon of the gene of interest. A stop codon is included in the construct after the (RGS)His6 tag.
As a negative control, create a construct in which putative catalytic residues are mutated. As an example, for candidate rNTases, mutate predicted catalytic aspartic acids to alanine.
Figure 2: Annotated Sequence of MS2 Cassette (MAP72 2μ LEU2 MS2 cassette vector)
See figure in Figures section.Sequence only (vector backbone indicated with bold)
ttcacacaggaaacagctatgaccatgattacgcca agcttACAATGCATACTTTGTACGTTCAAAATACAATGCAGTAGATATATTTATGCATATTACATATAATACATATCACATAGGAAGCAACAGGCGCGTTGGACTTTTAATTTTCGAGGACCGCGAATCCTTACATCACACCCAATCCCCCACAAGTGATCCCCCACACACCATAGCTTCAAAATGTTTCTACTCCTTTTTTACTCTTCCAGATTTTCTCGGACTCCGCGCATCGCCGTACCACTTCAAAACACCCAAGCACAGCATACTAAATTTCCCCTCTTTCTTCCTCTAGGGTGTCGTTAATTACCCGTACTAAAGGTTTGGAAAAGAAAAAAGAGACCGCCTCGTTTCTTTTTCTTCGTCGAAAAAGGCAATAAAAATTTTTATCACGTTTCTTTTTCTTGAAAATTTTTTTTTTTGATTTTTTTCTCTTTCGATGACCTCCCATTGATATTTAAGTTAATAAACGGTCTTCAATTTCTCAAGTTTCAGTTTCATTTTTCTTGTTCTATTACAACTTTTTTTACTTCTTGCTCATTAGAAAGAAAGCATAGCAATCTAATCTAAGTTTTAATTACAAAACTAGTatggcttctaactttactcagttcgttctcgtcgacaatggcggaactggcgacgtgactgtcgccccaagcaacttcgctaacggggtcgctgaatggatcagctctaactcgcgttcacaggcttacaaagtaacctgtagcgttcgtcagagctctgcgcagaagcgcaaatacaccatcaaagtcgaggtgcctaaagtggcaacccagactgttggtggtgtagagcttcctgtagccgcatggcgttcgtacttaaatatggaactaaccattccaattttcgccacgaattccgactgcgagcttattgttaaggcaatgcaaggtctcctaaaagatggaaacccgattccctcggccatcgcagcaaactccggcatctacgtcGGATCCCCCGGGCGGCCGCCTTCTAGACTGCAGGGTACCCCAAAGAAGAAGAGAAAGGTTAGAGGTTCTCACCACCACCACCACCACTAACTTTGGACTTCTTCGCCAGAGGTTTGGTCAAGTCTCCAATCAAGGTTGTCGGCTTGTCTACCTTGCCAGAAATTTACGAAAAGATGGAAAAGGGTCAAATCGTTGGTAGATACGTTGTTGACACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTATTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGCTCGAGc actggccgtcgttttacaacgtcgtgactgg
Primers to sequence inserted gene of interest:
MS2cassette-seq 5': 5' CACGAATTCCGACTGCGAGCT 3'
MS2cassette-seq 3': 5' CAACGTATCTACCAACGATTTGACC 3'
B. Transform rNTase and tRNA reporter into BY4741 yeast
Inoculate YPD media with BY4741 and incubate at 30°C overnight with shaking.
Calculate the amount of cells to use for transformations.
Need 1.67 OD600 per transformation
Measure OD600 of 1/50 dilution (in YPD) from overnight culture.
Transfer the appropriate volume of culture to a new sterile tube.
Centrifuge at 4000 rpm for 5 minutes at room temperature
During centrifugation, prepare appropriate amount of One Step Buffer (50μl of One Step Buffer per transformation).
Prepare fresh each time. Due to viscosity of PEG, be sure to prepare extra volume.
See figure in Figures section.Remove supernatant and resuspend cell pellet in One Step Buffer by gently pipetting up and down.
For example, 600μL of One Step Buffer is needed for 12 transformations.
Add appropriate volume of salmon testes single stranded DNA (Sigma-Aldrich D7656-1ML) to the cell/One Step Buffer mixture (2.5 μL per transformation).
For example, add 30 μL of salmon testes single stranded DNA for 12 transformations.
Mix by gently pipetting up and down
To a 1.5ml eppie tube, add 0.5 μL of miniprep DNA (100-400 ng/μL) corresponding to tRNA reporter (MAP80A 2μ URA3 tRNASer(AGA) with MS2 stem loop in variable arm in tRNAHis cassette) and 0.5 μL of miniprep DNA (100-400 ng/μL) corresponding to the rNTase construct to test.
Add 50 μL of cells/One Step Buffer/salmon testes DNA mixture to each tube. Tap mix immediately after adding the cell mixture.
Heat shock at 45°C for 35 minutes.
Add 75 uL YPD to each tube and mix.
Plate 30 μL onto SD-Ura-Leu plates.
Incubate at 30°C for 3-5 days.
C. Grow yeast to log phase and harvest
Inoculate 5 mL of SD-Ura-Leu with a single colony resulting from the transformation above.
Grow at 30°C with shaking for approximately 24 hours. Cultures should reach saturation.
Use this culture to start 50 mL cultures of each sample in SD-Ura-Leu at OD600=0.15.
Grow to log phase (OD600=0.8-1.0).
Put the flasks on ice for about 5 minutes to chill the cultures.
Transfer culture to 50 mL conical tubes, taking note of the final volume. Multiply the volume by the OD600 reading to calculate the number of cells harvested.
Centrifuge at 3000 rpm in a table top centrifuge for 10 minutes.
Wash pellet by resuspending with 2 mL of cold ddH2O
Transfer an equal volume (~1 mL) into each of two 2 mL eppie tubes to produce 2 pellets.
Note: One pellet will be used to confirm expression of the protein of interest, and the other will be used to extract RNA.
Centrifuge at 5000 rpm in microfuge at 4°C.
Remove and discard supernatant.
Freeze pellets on dry ice.
Store at -80°C until ready to process.
D. Verify expression of rNTase/enzyme of interest by Western Blotting
ESB Buffer:
See figure in Figures section.- Resuspend cell pellet in 30 μL of ESB buffer.
- Heat to 100°C for 3 minutes. Cool slightly (enough to handle tubes).
- Carefully open the lid, and add 425-600μm glass beads (Sigma G8772) to saturate sample (about 0.1 g).
- Vortex for 2 minutes.
- Add 70 μL of ESB buffer, vortex briefly.
- Heat to 100°C for 1 minute.
- Load lysate corresponding to 1 OD600 of cells onto a 4-15% SDS-PAGE gel. Use total OD600 calculated from cultures to determine the volume to add.
- Transfer to an activated PVDF membrane for Western blotting.
- Block in 1X TBS/0.1% Tween/5% BSA (TBST/BSA) for 1 hour at room temperature. Blocking with milk (even after blot stripping) will result in a high background for the anti-enolase antibody.
- For a loading control, incubate with rabbit anti-enolase antibody (Cell Signaling #8171 Enolase-2 D20H2 rabbit).
- Wash 3x with TBST at room temperature for 10 minutes each.
- Incubate with goat anti rabbit antibody 1/15,000 dilution in TBST/BSA at room temperature for 1 h with shaking.
- Wash 3x with TBST at room temperature for 10 minutes each.
- Image blot.
- Following imaging, strip the blot and prepare to reprobe with anti-RGSHis antibody (5-Prime/Qiagen Cat No./ID: 34610).
- Block in 1X TBS 0.1% Tween (TBST)/5% milk for 1 hour at RT.
- Incubate with 1/2500 dilution of mouse anti RGS-His antibody for 1 hour at room temperature.
- Wash 3x with TBST at room temperature for 10 minutes each.
- Incubate with 1/10,000 dilution of goat anti mouse antibody for 1 hour.
- Wash 3x with TBST at room temperature for 10 minutes each.
- Image blot to confirm expression of each rNTase/enzyme of interest.
- Proceed only with samples from which rNTase/enzyme of interest is detected.
E. Extract total cellular RNA
Needed per sample:
1.25 mL ISO Buffer
2 mL Phenol/Chloroform/Isoamyl Alcohol (25:24:1 Mixture, pH 6.7). Referred to as PCA in the protocol below.
- Prepare RNA ISO Buffer.
See figure in Figures section.
a. Dissolve SDS in DEPC-treated water. Then add to remaining buffer ingredients.
b. Filter sterilize with 0.2 μm filter.
Note: RNA ISO Buffer can be made ahead and stored at room temperature. If SDS precipitates, buffer can be warmed at 65°C for 10 minutes to resuspend, but the buffer must cool to room temperature prior to using in extraction.
Resuspend cell pellet in 0.5 mL RNA ISO buffer.
Add 0.5 g of 425-600 μm glass beads (Sigma G8772) to each tube. Make sure that the glass beads used are reserved only for RNA purification in order to avoid RNase contamination.
Add 0.5 mL PCA (25:24:1) to each tube. Wrap tops of tubes in parafilm to prevent tubes from popping open during vortexing.
Vortex for 20 seconds then place on ice. Repeat this process for a total of 10 times.
Quick-spin tubes to bring volume out of the tube lid.
Transfer as much of the volume as possible into two new 1.5 mL eppie tubes. Make sure that the volume in each of the two new tubes is equal.
Add 0.375mL RNA ISO buffer to each tube.
Add 0.375mL PCA (25:24:1) to each tube.
Mix by inverting the tubes.
Centrifuge at maximum speed in a microfuge at 4°C for 10 minutes.
During the sample centrifugation step, centrifuge Phase Lock Gel Heavy (PLGH) tubes (QuantaBio/5PRIME) at maximum speed for 5 minutes at room temperature.
Note: One PLGH tube is needed per starting sample. Aqueous layer from the 2 tubes will be recombined into the PLGH tube.
Transfer aqueous phase into 1 pre-spun PLGH tube per sample (combine the 2 tubes per sample).
Add 0.75 mL PCA (25:24:1). If the entire volume of PCA doesn't fit into the tube, just fill as much as possible so that the tube can still close.
Invert tube to mix.
Centrifuge at maximum speed at room temperature for 5 minutes.
Transfer aqueous phase into two new 2 mL eppie tubes. Make sure that the volume in each of the two new tubes is equal.
Add 1 mL of 100% ethanol to each tube.
Mix gently.
Incubate on dry ice for 20 minutes or at -80°C for 1 hour to overnight.
Centrifuge at maximum speed in a microfuge at 4°C for 10 minutes.
Remove and discard supernatant.
Dissolve each pellet in 25 μL of DEPC-treated water.
Combine the 2 tubes per sample into 1 tube.
Proceed with DNase treatment
Note: The following DNase treatment protocol is optimized to digest overexpressed plasmid DNA corresponding to the tRNA reporter. This amount of DNase is needed because downstream amplification steps target a sequence in the tRNA reporter that is also in the DNA used for expression.
Prepare appropriate volume of DNase mix needed for all samples following the recipe below.
See figure in Figures section.Add 50 μL of DNase mix to each sample.
Incubate at 37°C for 4 hours.
Clean up RNA with GeneJET RNA Purification Kit (ThermoFisher).
a. Add 300 μL of Lysis Buffer to DNase reactions and mix by pipetting.
b. Add 180 μL of 100% ethanol and mix by pipetting.
c. Transfer sample to GeneJET RNA purification column.
d. Centrifuge at maximum speed for 1 minute. Transfer each column to a new 2mL collection tube.
e. Add 700 μL of Wash Buffer 1. Centrifuge at maximum speed for 1 minute.
f. Add 600 μL of Wash Buffer 2. Centrifuge at maximum for 1 minute.
g. Add 250 μL of Wash Buffer 2. Centrifuge at maximum for 2 minute.
h. Discard flowthrough and centrifuge again at maximum for 1 minute.
i. Add 50 μL of DEPC-treated water. Centrifuge at maximum speed for 1.5 minutes.
- Measure concentration.
- Store at -80°C in aliquots to avoid freeze-thawing.
F. Ligate Adapter
AppN7-RNATruSeq3’Adapter: 5 ' AppNNNNNNN TGGAATTCTCGGGTGCCAAGG ddC 3'
Important note: Adenylated adapter must be PAGE gel purified to ensure that all adapter added into the ligation reaction is adenylated. PAGE purification or HPLC purification offered by IDT is not sufficient to remove all unmodified adapter. This was confirmed by discussions with IDT Technical Support, as the resolution of their methods is not precise enough for this application, partially due to the variable size of the random heptamer.
PAGE purification of adenylated adapter
Prepare a 20% acrylamide 8M urea, 1.5mm thick sequencing-length gel using the SequaGel - UreaGel System (National Diagnostics).
Length of gel is key to ensure single nucleotide resolution.
Pre-run gel and load 8 nmol of adapter. Also load 5 pmol of unmodified adapter and 1 μL of 10bp DNA Step Ladder (Promega Corporation).
Run gel until 10 bp band is 1/3 of the way down the gel (bromophenol band in loading dye should run off the gel).
Stain with 1:20,000 dilution of SYBR Gold Nucleic Acid Gel Stain (ThermoFisher) in 1x TBE for 10 minutes.
Return gel to glass plate and wrap in plastic wrap.
UV-shadow band with short-wave UV (to use a sharpie to mark the adenylated adapter band on the plastic wrap).
Use a razor blade to trace the marked plastic wrap and cut the band out of the gel. To increase gel extraction efficiency, cut the gel band in half.
Transfer each gel piece into its own 3 mL syringe
Use the syringe plunger to crush the gel piece into a Falcon 2059 round bottom tube.
Add 1.6 mL of Gel Elution Buffer (recipe below can be made ahead of time and stored at room temperature).
See figure in Figures section.Parafilm lids of tubes and incubate overnight at 37°C with gentle shaking.
After overnight incubation, add ammonium acetate to 2M (480 μL of 5 M ammonium acetate to each tube).
Centrifuge tube at 1000 rpm for 2 minutes to pull all volume to the bottom of the tube
Add all liquid and gel to empty Biospin column (Bio-Rad) in a 2059 Falcon tube to separate liquid from gel.
Centrifuge for 5 minutes at 2500 rpm in table top centrifuge.
During centrifugation step, pre-spin four 2 mL PLGH tubes.
Transfer cleared liquid to pre-spun PLGH tubes (about 0.75 mL per tube), taking note of total volume.
Add 0.75 mL of PCA pH 6.7. Mix by inverting tubes.
Centrifuge at maximum speed at room temperature for 5 minutes.
Transfer aqueous layer to 2 new 2059 Falcon tubes.
Add 2X volume of 100% ethanol.
Incubate for 15 minutes on dry ice (or overnight at -20°C).
Centrifuge for 45 minutes at 5000 rpm in a table top centrifuge.
Remove and discard supernatant.
Wash pellets with 70% EtOH.
Centrifuge for 45 minutes at 5000 rpm in a table top centrifuge.
Remove all liquid (do not need to dry pellet, but can dry for a maximum of 10 minutes before resuspension).
Resuspend each pellet in 40 μL of TE pH 8.0.
Measure concentration and calculate as pmol/μL.
Store at -80°C in 5μL aliquots.
To check purification, run 5 pmol on 20% PAGE gel, 8M urea. Also run 5 pmol of unpurified adapter, 5 pmol of unmodified adapter and 1 μL of 10bp DNA Step Ladder (Promega Corporation). See figure below.
Stain with SYBR Gold (1/20,000 in 1x TBE) for 10 minutes and image.
See figure in Figures section.
Ligation Reaction
- Dilute each RNA sample to 100 ng/μL (will use 100 ng for ligation).
- Prepare volume of reaction mix appropriate for the number of ligation reactions.
T4 RNA Ligase 2, truncated KQ (NEB) reaction mix
See figure in Figures section.- Add 19 μL of reaction mix to each tube.
- Add 1 μL of yeast total RNA (100 ng).
- Incubate overnight at 16°C.
- Proceed with reverse transcription or store at -20°C.
G. Reverse transcribe to form cDNA
Illumina RNA RT primer (15013981): 5' GCCTTGGCACCCGAGAATTCCA 3'
Dilute Illumina RNA RT primer to 5 pmol/μL in TE buffer.
In a 0.2 mL PCR tube, combine 1 μL (5 pmol) of RNA RT primer with 10 μL of ligation reaction from above. Mix.
Incubate at 70°C for 5 minutes in a thermal cycler.
Immediately chill on ice for at least 5 minutes.
Prepare the following mix for ImProm-II Reverse Transcriptase (Promega Corporation):
See figure in Figures section.Add 9 μL of RT mix to each RNA/primer mix.
Program a thermal cycler for the following incubations.
Incubate at 25°C for 5 minutes to anneal.
Incubate at 42°C for 1 hour to extend.
Incubate at 70°C for 15 minutes to inactivate the reverse transcriptase.
Store reactions at -20°C or proceed with PCR amplification.
H. PCR amplify and add Illumina TruSeq sequences and barcode
5' primer:
5' AATGATACGGCGACCACCGAGATCTACACGTTCAGAGTTCTACAGTCCGACGATCGAGGATCACCCATGTCGCAG 3'
3' Primer = RNA PCR Primer
5' CAAGCAGAAGACGGCATACGAGAT [NNNNNN] GTGACTGGAGTTCCTTGGCACCCGAGAATTCCA 3'
[NNNNNN] = Illumina RPI sequence for barcoding. A different index primer should be used for amplification of each sample.
Expected product: >167 bp [29 (Illumina 5'), 26 (5' adapter) + 41 (tRNA) + added tail + 28 (N7 3' adapter) + 43 (barcode + 3' Illumina seq)]
Prepare 4 reactions per sample for RT reactions. Only 1 reaction per RNA ligation control is needed.
See figure in Figures section.Perform the following steps at room temperature
Prepare reaction mix for all components except for template and 3' primer.
Add barcode primer to each of 4 tubes per sample.
Add 1 μL of RT reaction to each of 4 tubes per sample. Add RNA ligation/water to 1 tube per sample.
Add 10.88 μL of reaction mix to each tube. Mix.
Run the following PCR protocol:
See figure in Figures section.Pool the 4 PCRs for each sample into 1 tube.
Run 4 μl of each sample on a 2.5% agarose gel to check products.
Store remaining sample at -20°C.
I. Gel purify tRNA reporter with tails
Prepare a 20% acrylamide 8 M urea, 1.5 mm thick sequencing-length gel using the SequaGel - UreaGel System (National Diagnostics).
Length of gel is key to ensure sufficient resolution.
Pre-run gel.
Load entire volume of PCR for each sample. Also load 50bp Step Ladder (Promega Corporation) and 1Kb Ladder (Promega Corporation). Because bands will run off the gel, both ladders are needed to verify the identity of the bands.
Run gel until bromophenol blue band in loading dye just runs off the gel.
Stain with 1/20,000 dilution of SYBR Gold Nucleic Acid Gel Stain (ThermoFisher) in 1x TBE for 10 minutes.
Image the gel using fluorescence detection methods to determine where to cut the gel for each sample.
Important Note: The goal is to cut out the tail added to the mature tRNA and a small portion of the mature tRNA. Cutting too far above the mature tRNA PCR band will result in pre-tRNA being brought through the purification, which will reduce the amount of reads generated for tailed mature tRNA. In initial tests and earlier version of computational analyses, tails were observed on the pre-tRNA, but its processing was atypical and difficult to predict sample to sample. Most consistent results were obtained when focusing analyses on the mature tRNA. See example image below for guidance on cutting out gel bands.
See figure in Figures section.Return gel to glass plate and wrap in plastic wrap.
Place glass plate and gel on a UV box and use a sharpie to mark the adenylated adapter band on the plastic wrap.
Use a razor blade trace the marked plastic wrap and cut each band out of the gel.
Transfer each gel piece into its own 3 mL syringe.
Use the syringe plunger to crush the gel piece into a Falcon 2059 round bottom tube.
Add 450 μL of TRAID-Seq Gel Elution Buffer (recipe below can be made ahead of time and stored at room temperature). As compared to the elution buffer used above, this recipe increases ammonium acetate to 1M, and it omits EDTA and SDS, as those reagents negatively affect Illumina sequencing.
See figure in Figures section.Parafilm lids of tubes and either store overnight at -20°C (can also be stored over a weekend) or proceed with elution.
Incubate at 37°C with gentle shaking for at least 4 hours.
Centrifuge tubes at 1000 rpm for 2 minutes to pull all volume to the bottom of the tube
Separate gel from buffer using empty Bio-Rad Micro BioSpin columns.
a. Put spin column in 2 mL eppie tube.
b. Pour gel/buffer mix into column.
c. Centrifuge at 4000 rpm in a standard microfuge at room temperature for 4 minutes.
d. Discard column and proceeded to next step with eluate.
During centrifugation step, pre-spin one 2 mL PLGH tube per sample (maximum speed at room temperature for 5 minutes).
Transfer eluate to pre-spun PLGH tube, taking note of total volume (~400μL).
Add equal volume of PCA pH 6.7, and mix.
Centrifuge at maximum speed at room temperature for 4 minutes.
Repeat steps 19 and 20.
Transfer aqueous layer to a new 1.5 mL eppie tube.
Add equal volume of chloroform (~350 μL), and mix.
Centrifuge at maximum speed at 4°C for 10 minutes.
Transfer supernatant to new 1.5 mL eppie tube.
Repeat steps 23-25.
Add 2.25x volume of 100% EtOH (~790 μL).
Incubate for 15 minutes on dry ice or overnight at -80°C.
Centrifuge at maximum speed at 4°C for 30 minutes.
Remove supernatant.
Wash with 500 μL of cold 70% ethanol.
Centrifuge at maximum speed at 4°C for 15 minutes.
Remove all liquid (do not need to dry pellet, but it is important to remove all ethanol before resuspension).
Resuspend with 10 μL of nuclease-free ddH2O.
Measure concentration
Note: Nanodrop can be used but results in an inaccurate estimation of concentration. Quantitation of concentration by using a fluorescent DNA binding dye, such as QuantiFluor ONE dye (Promega Corporation), is more accurate and will produce more equal representation of each sample.
Combine all samples at an equal concentration (ideally 1 ng/μL each sample).
Note: Due to the number of multiplexed samples, 1 ng/μL will likely not be achievable. Combine in such a way that each sample is at its highest possible concentration in the final mixed population.
J. Paired-end sequencing
Use Illumina paired-end sequencing of at least 2 x 50 bp (2 x 100 bp has also been used) to generate sequencing data. TRAID-Seq sample have been run on a HiSeq2000 and HiSeq2500.
The target number of reads per sample is 1 million. Depending on the diversity of the tails added to the tRNA, more than 1 million reads may result in longer computational time to remove PCR duplicates.
Important consideration: Read 1 will always have the same sequence for the first ~40 nucleotides. For high-quality sequencing, it is best to mix TRAID-Seq samples with a sample or samples that have diverse sequences. If such experimental sample are not available, PhiX can be used. TRAID-Seq samples have been successfully run in the control lane of a HiSeq2000 or HiSeq2500. It is best to discuss recommended best practices for running these samples with technical staff at your local sequencing core.
K. Data analysis
Use the PuppyTails suite of programs to analyze data.
Programs are freely available here: https://github.com/melanieapreston/PuppyTails
Depending on the number of data files and size of files, it might be most efficient to analyze data via a Linux-based computational server (rather than a single Linux-based PC) in sets of data corresponding to each sequencing instrument run.
Step-by step instructions for analysis:
1. Extract tail sequences, remove PCR duplicates, and count number of uniquely observed tails.
Copy extract_inserts.py to appropriate directory
Navigate to appropriate directory
python extract_Tails.py
Program may take several hours to run on a computational server, depending upon the dataset file sizes.
Output:
Inserts folder containing one .inserts file (Samplename.fastq.inserts) per sample with 3 data columns:
- 3' end of the tRNA reporter
- added tail sequence
- number of times observed
2. Analyze tail sequences to determine percentages of each nucleotide at each tail length and generate tail-o-grams.
Once the .inserts files are generated, the next step of analysis is to use the analyzeTails.py program:
python analyzeTails.py -o -n -p
Details on the added options:
-o will only use tails with a CCA end.
-n will normalize to dataset size (per 1 million).
-p will output values as percents.
This program can take up to an hour to run on a computational server, depending on size of .inserts files and the number of files to analyze.
Outputs:
--tables folder containing text files of the most commonly observed tail sequences
--figs folder containing tail-o-grams for each sample
--dataForGraph.txt file containing percent of each nucleotide at each observed tail length, and number of unique tails observed at each tail length
Data generated from these analyses were used to create tail-o-grams in the corresponding manuscript.
3. Compile nucleotide addition information to generate summaries of percent incorporation for each nucleotide.
Rename dataForGraph.txt files to include dataset name
Place all files into the same folder with Total_Percent_nt_Analyses-vs1d.pl Perl script and SampleKey.out file
This can be run on a Windows 7 or Windows 10 OS. Other operating systems were not tested.
Run Total_Percent_nt_Analyses-vs1d.pl
Uses dataForGraph.txt files output from PuppyTails program as input
Uses SampleKey.out file to decode filenames/barcode numbers into sample names
An example of a SampleKey.out file is shown here: https://github.com/melanieapreston/PuppyTails
Column #1 is a coded name for the sample; column #2 is the actual sample name
Outputs:
A. Generates 2 files for each sample:
Samplename.out
File that has removed repeated lines from dataForGraph.txt file
Samplename-averages.out
File that summarizes total nucleotide addition metrics:
--Sum Total unique tails
--Unweighted Average nucleotide incorporation (raw average of calculated percentages at each tail length)
--Total of each nucleotide added
--Total number of nucleotides added
--Weighted Average nucleotide incorporation sum of each nucleotide added divided total of all nucleotides added
B. Generates 2 files per set of data input, with metrics listed:
- UnweightedTailCompSummaryForAllSamples.out
--Sum total unique tails
--Unweighted Average incorporation for each nucleotide
--Average Tail Length
--Percent of reporter tRNAs tailed
- WeightedTailCompSummaryForAllSamples.out
--Sum Total unique tails
--Total number of nucleotides added
--Weighted Average incorporation for each nucleotide
--Average Tail Length
--Percent of reporter tRNAs tailed