Frequently Asked Questions
Tutorial Questions
Most tools seem to have options for assembly using long and short reads, what are the pros and cons of the different tools?
Question: Most tools seem to have options for assembly using long and short reads, what are the pros and cons of the different tools?In our experience, when both long and short reads are allowed as input, the difference comes down to the order in which set is assembled first. For example, Unicycler assembles the short reads first (which can be good, because they are more accurate), and then scaffolds these into larger contigs using long reads. Other tools (or workflows) often assemble long reads first (which can also be good because these can span repeat regions), then correct this assembly with information from the more accurate short reads. There may also be other variations on long/short read assembly, and/or iterations of these types of steps (assemble, correct). My preference is to assemble long reads first, but that’s because I’m really interested in covering repeat regions. If accuracy was the aim, rather than contig length, the short-reads-first approach may be better. For even more complexity … I think some tools now allow input of “trusted contigs” - i.e. contigs assembled from other tools. Ryan Wick has a new tool called Trycyler that can take in multiple assemblies to make a consensus (bacterial genomes).
Igv
Add Mapped reads track to IGV from Galaxy
- Install IGV (if not already installed)
- Launch IGV on your computer
- Check if the reference genome is available on the IGV instance
- Expand the BAM dataset with the mapped reads in the history
- Click on the
local
indisplay with IGV
to load the reads into the IGV browserSwitch to the IGV instance
The mapped reads track should appear. Be sure that all files have the same genome ID
Add genome and annotations to IGV from Galaxy
- Upload a FASTA file with the reference genome and a GFF3 file with its annotation in the history (if not already there)
- Install IGV (if not already installed)
- Launch IGV on your computer
- Expand the FASTA dataset with the genome in the history
- Click on the
local
indisplay with IGV
to load the genome into the IGV browser- Wait until all Dataset status are
ok
Close the window
An alert
ERROR Parameter "file" is required
may appear. Ignore it.- Expand the GFF3 dataset with the annotations of the genome in the history
- Click on the
local
indisplay with IGV
to load the annotation into the IGV browserSwitch to the IGV instance
The annotation track should appear. Be careful that all files have the same genome ID
Visualisation
Using IGV with Galaxy
You can send data from your Galaxy history to IGV for viewing as follows:
- Install IGV on your computer (IGV download page)
- Start IGV
- In recent versions of IGV, you will have to enable the port:
- In IGV, go to
View > Preferences > Advanced
- Check the box
Enable Port
- In Galaxy, expand the dataset you would like to view in IGV
- Make sure you have set a reference genome/database correctly (dbkey) (instructions)
- Under
display in IGV
, click onlocal