Finding BAM dataset identifiers


Quickly learn what the identifiers are in any **BAM** dataset that is the result from mapping
  1. Run Samtools: IdxStats on the aligned data (bam dataset).
  2. The “index header” chromosome names and lengths will be listed in the output (along with read counts).
  3. Compare the chromosome identifiers to the chromosome (aka “chrom”) field in all other inputs: VCF, GTF, GFF(3), BED, Interval, etc.

Note:

  • The original mapping target may have been a built-in genome index, custom genome (transcriptome, exome, other) – the same bam data will still be summarized.
  • This method will not work for “sequence-only” bam datasets, as these usually have no header.
Still have questions?
Gitter Chat Support
Galaxy Help Forum
Want to embed this snippet (FAQ) in your GTN Tutorial?
{% snippet  faqs/galaxy/datasets_BAM_dataset_identifiers.md %}