Sequence Trimming and Assembly
- Drag sequences from bidirectional Sanger sequence output (.ab1 files containing electropherograms) into "GeneiousPro"http://www.geneious.com/web/geneious/download-geneious main window.
- Select both sequences, right click and choose "trim ends" and choose default values.
- Select trimmed sequences and click deNovo assembly option under "assembly" menu.
- Right click assembled sequence and choose "Generate Consensus Sequence", name it and describe with NCBI Taxon ID-if any.
BLASTn Sequence Similarity Search
- Select the sequence generated in step 4 and choose "sequence search" option on the top menu.
- In the BLAST options, choose BLASTn (Nucleotide BLAST) as the database, opt for "Fully annotate hits" and choose 100 as number of hits.
- Once the search is complete, drag these results to the folder in which you are working.
- In the left panel, choose 'NCBI > Nucleotide" and get additional sequences of interest. This include target taxonomic representatives and suitable outgroups.Drag all pertinent results to the folder.
Multiple Sequence Alignment (MSA)
- Select all sequences that need to be aligned and choose Ctrl+Shift+A
- Align first by Geneious alignment, with default parameters. Make sure "Automatically determine sequence's direction" is selected.
- Align once again (Ctrl_Shift+A) using MUSCLE alignment with 8 iterations.
- Once sequence is aligned, zoom to check accuracy of the alignment. Obviously un-alignable sequences should be removed and realigned. Ends of the alignment may be trimmed to match ends of query sequence. Alignment should be carefully edited by eye and introduce gaps wherever necessary.
Importing MSA in MEGA and performing analysis there
- Select the final trimmed alignment and choose Ctrl+Shift+E
- Choose FASTA and all the default options, and save it to a folder of your data
- Open this folder, right click on the fasta file and choose “Open with MEGA”
- In the alignment Explorer window of MEGA, choose “Phylogenetic analysis” in the main menu.
- Choose appropriate option. If sequence is Introns or other non-coding regions, choose no. If sequence is a CDS/Gene, choose Yes.
- In the MEGA main menu, choose “Find best DNA/Protein Models” under Models.
- Choose an appropriate options. For sequences with many gaps, “use all sites” may be appropriate. For general, good quality alignments, “Complete deletion” option is better. Perform the ModelTest.
- In the result table, choose the first model and note its BIC score to quote in paper.
- In the MEGA main menu, choose Distance > Compute Pairwise Distance
- In the options, choose appropriate. Choose the best model found in step 20. Choose same options selected for step 19. Perform the analysis.
- Result of distance matrix will be presented. Choose “export/print distances” from file menu and choose lower-left matrix with excel as output format.
- In the MEGA main menu, choose Phylogeny >Construct/Test ML Phylogeny
- Choose appropriate options. Choose the best model found in step 20. Choose same options selected for step 19. Choose 1000 bootstrap replicates. Perform the analysis.
- Save this tree in a vector format and export as .nexus for uploading to TreeBASE.
Performing Bayesian Inference Phylogeny
- Go back to Geneious and choose the same alignment.
- In the Tree option, choose MrBayes (with MrBayes add-in installed). Choose pertinent options. Choose the best model found in step 20. Choose same options selected for step 19. If the best model is not available, choose the model with lowest BIC score from available options. Perform the analysis.
- Save this tree in a vector format and export as .nexus for uploading to TreeBASE.
- Use appropriate vector image editor (Adobe Illustrator) to combine these two trees and make the final tree.