Utah Center for Genetic Discovery
  • Home
  • People
    • Investigators
    • Research Team
    • Collaborators
  • Core Facility
  • Research
    • Projects
    • Publications
    • Sponsors
  • Software
  • News
  • Join Us

Software

This page serves as an index for the applications written and distributed by the Yandell, Marth, and Quinlan labs. Each item may include links to: documentation, code, and publications.

  • All
  • Yandell Lab
  • Marth Lab
  • Quinlan Lab
Software is listed with most recent releases first.


VARPRISM (VARiant PRIoritization SuM)
A software package that identifies genes with a statistical excess of damaging de novo mutations among individuals with a genetic disease. VARPRISM incorporates functional variant prediction information (the VAAST CASM score) to improve the statistical power of risk gene mapping and controls for local mutation rate heterogeneity. The beta version of VARPRISM is currently available for download.

  • Docs
  • Code
  • Paper


VCFAnno
annotates a VCF with any number of sorted and tabixed input BED, BAM, and VCF files in parallel. It does this by finding overlaps as it streams over the data and applying user-defined operations on the overlapping annotations.

  • Docs
  • Code
  • Paper


Taxonomer
Taxonomer is an ultrafast web-tool for comprehensive metagenomics data analysis and interactive results visualization. Taxonomer is unique in providing integrated nucleotide and protein-based classification and simultaneous host mRNA transcript profiling.

  • Docs
  • Code
  • Paper


WHAM (WHole-genome Alignment Metrics)
A structural variant (SV) caller that integrates several sources of mapping information to identify SVs. WHAM classifies SVs using a flexible and extendable machine-learning algorithm (random forest).

  • Docs
  • Code
  • Paper


Genome Query Tools (GQT)
A command line tool and a C API for storing and querying large-scale genotype data sets like those produced by 1000 Genomes, the Uk100K, and forthcoming datasets involving millions of genomes.

  • Docs
  • Code
  • Paper


SpeedSeq
An open-source genome analysis platform that accomplishes alignment, variant detection and functional annotation of a 50× human genome in 13 h on a low-cost server and alleviates a bioinformatics bottleneck that typically demands weeks of computation with extensive hands-on expert involvement.

  • Docs
  • Code
  • Paper


Pedagree
Compares familial-relationships and sexes as reported in a PED file with those inferred from a VCF.

  • Docs
  • Code


MAKER-P
A pipeline designed to make the annotation of novel plant genomes tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources.

  • Docs
  • Code
  • Paper


iobio
iobio uses immediate visual feedback to make understanding complex genomic datasets more intuitive, and analysis more interactive.

  • Docs
  • Code
  • Paper


Poretools
A flexible toolkit for exploring datasets generated by nanopore sequencing devices from MinION for the purposes of quality control and downstream analysis.

  • Docs
  • Code
  • Paper


Tangram
A C/C++ command line toolbox for structural variation(SV) detection.

  • Docs
  • Code


GKNO
A tool and pipeline management system that can be used to effectively deploy the majority of tools developed in the MarthLab as well as other third-party tools.

  • Docs
  • Code


RUFUS
A new approach to variant detection that does not rely on mapping or whole genome assembly methods.

  • Docs
  • Code


bedtools
These utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks.

  • Docs
  • Code
  • Paper


SubcloneSeeker
A computational framework for reconstructing tumor subclone structures.

  • Docs
  • Code
  • Paper


Lumpy
A probabilistic framework that we have developed to integrate multiple structural variation signals such as discordant paired-end alignments and split-read alignments.

  • Docs
  • Code
  • Paper


pVAAST (pedigree Variant Annotation, Analysis & Search Tool)
A disease-gene identification tool designed for high-throughput sequence data in pedigrees.

  • Docs
  • Code
  • Paper


PHEVOR (Phenotype Driven Variant Ontological Re-ranking tool)
Integrates phenotype, gene function, and disease information with personal genomic data for improved power to identify disease-causing alleles.

  • Docs
  • Code
  • Paper


MOSAIK
A stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome.

  • Docs
  • Code


GEMINI
A powerful framework for exploring genetic variation in the context of the wealth of existing genome annotations that are available for the human genome.

  • Docs
  • Code
  • Paper


GPAT ++ (Genotype Phenotype Association Toolkit)
The application of population genomics to non-model organisms is greatly facilitated by the low cost of next generation sequencing (NGS).

  • Docs
  • Code
  • Paper


ImagePlane
Python based image analysis software designed for the automated analysis of images of the animal S.

  • Docs
  • Code


MAKER
A portable and easily configurable genome annotation pipeline.

  • Docs
  • Code
  • Paper


VAAST 2 (Variant Annotation, Analysis & Search Tool)
Probabilistic search tool for identifying damaged genes and their disease-causing variants in personal genome sequences.

  • Docs
  • Code
  • Paper


BamTools
A Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment.

  • Docs
  • Code
  • Paper


RepeatRunner
A CGL-based program that integrates RepeatMasker with BLASTX to provide a comprehensive means of identifying repetitive elements.

  • Docs
  • Code


CGL (Comparitive Genomics Library, and pronounced as “Seagull”)
Provides an informatics infrastructure for a laboratory, department, or research institute engaged in the large-scale analysis of genomes and their annotations.

  • Docs
  • Code
  • Paper


Freebayes
A Bayesian genetic variant detector designed to find small polymorphisms.

  • Docs
  • Code


SCISSORS
A split-read aligner that maps orphaned read mates (i.e. where one end-mate is aligned with high mapping quality, but the other mate is unmapped), as well as re-maps severely clipped reads (reads mapped with many unaligned or “clipped-off” bases).

  • Docs
  • Code

VISIT the Yandell Lab Software page


VARPRISM (VARiant PRIoritization SuM)
A software package that identifies genes with a statistical excess of damaging de novo mutations among individuals with a genetic disease. VARPRISM incorporates functional variant prediction information (the VAAST CASM score) to improve the statistical power of risk gene mapping and controls for local mutation rate heterogeneity. The beta version of VARPRISM is currently available for download.

  • Docs
  • Code
  • Paper


Taxonomer
Taxonomer is an ultrafast web-tool for comprehensive metagenomics data analysis and interactive results visualization. Taxonomer is unique in providing integrated nucleotide and protein-based classification and simultaneous host mRNA transcript profiling.

  • Docs
  • Code
  • Paper


WHAM (WHole-genome Alignment Metrics)
A structural variant (SV) caller that integrates several sources of mapping information to identify SVs. WHAM classifies SVs using a flexible and extendable machine-learning algorithm (random forest).

  • Docs
  • Code
  • Paper


pVAAST (pedigree Variant Annotation, Analysis & Search Tool)
A disease-gene identification tool designed for high-throughput sequence data in pedigrees.

  • Docs
  • Code
  • Paper


PHEVOR (Phenotype Driven Variant Ontological Re-ranking tool)
Integrates phenotype, gene function, and disease information with personal genomic data for improved power to identify disease-causing alleles.

  • Docs
  • Code
  • Paper


GPAT ++ (Genotype Phenotype Association Toolkit)
The application of population genomics to non-model organisms is greatly facilitated by the low cost of next generation sequencing (NGS).

  • Docs
  • Code
  • Paper


ImagePlane
Python based image analysis software designed for the automated analysis of images of the animal S.

  • Docs
  • Code


VAAST 2 (Variant Annotation, Analysis & Search Tool)
Probabilistic search tool for identifying damaged genes and their disease-causing variants in personal genome sequences.

  • Docs
  • Code
  • Paper


MAKER-P
A pipeline designed to make the annotation of novel plant genomes tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources.

  • Docs
  • Code
  • Paper


MAKER
A portable and easily configurable genome annotation pipeline.

  • Docs
  • Code
  • Paper


RepeatRunner
A CGL-based program that integrates RepeatMasker with BLASTX to provide a comprehensive means of identifying repetitive elements.

  • Docs
  • Code


CGL (Comparitive Genomics Library, and pronounced as “Seagull”)
Provides an informatics infrastructure for a laboratory, department, or research institute engaged in the large-scale analysis of genomes and their annotations.

  • Docs
  • Code
  • Paper
VISIT the Marth Lab Software page


iobio
iobio uses immediate visual feedback to make understanding complex genomic datasets more intuitive, and analysis more interactive.

  • Docs
  • Code
  • Paper


GKNO
A tool and pipeline management system that can be used to effectively deploy the majority of tools developed in the MarthLab as well as other third-party tools.

  • Docs
  • Code


RUFUS
A new approach to variant detection that does not rely on mapping or whole genome assembly methods.

  • Docs
  • Code


SubcloneSeeker
A computational framework for reconstructing tumor subclone structures.

  • Docs
  • Code
  • Paper


Freebayes
A Bayesian genetic variant detector designed to find small polymorphisms.

  • Docs
  • Code


BamTools
A Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment.

  • Docs
  • Code
  • Paper


MOSAIK
A stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome.

  • Docs
  • Code


SCISSORS
A split-read aligner that maps orphaned read mates (i.e. where one end-mate is aligned with high mapping quality, but the other mate is unmapped), as well as re-maps severely clipped reads (reads mapped with many unaligned or “clipped-off” bases).

  • Docs
  • Code


Tangram
A C/C++ command line toolbox for structural variation(SV) detection.

  • Docs
  • Code

VISIT the Quinlan Lab Software page


VCFAnno
annotates a VCF with any number of sorted and tabixed input BED, BAM, and VCF files in parallel. It does this by finding overlaps as it streams over the data and applying user-defined operations on the overlapping annotations.

  • Docs
  • Code
  • Paper


Genome Query Tools (GQT)
A command line tool and a C API for storing and querying large-scale genotype data sets like those produced by 1000 Genomes, the Uk100K, and forthcoming datasets involving millions of genomes.

  • Docs
  • Code
  • Paper


SpeedSeq
An open-source genome analysis platform that accomplishes alignment, variant detection and functional annotation of a 50× human genome in 13 h on a low-cost server and alleviates a bioinformatics bottleneck that typically demands weeks of computation with extensive hands-on expert involvement.

  • Docs
  • Code
  • Paper


Pedagree
Compares familial-relationships and sexes as reported in a PED file with those inferred from a VCF.

  • Docs
  • Code


bedtools
These utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks.

  • Docs
  • Code
  • Paper


Poretools
A flexible toolkit for exploring datasets generated by nanopore sequencing devices from MinION for the purposes of quality control and downstream analysis.

  • Docs
  • Code
  • Paper


Lumpy
A probabilistic framework that we have developed to integrate multiple structural variation signals such as discordant paired-end alignments and split-read alignments.

  • Docs
  • Code
  • Paper


GEMINI
A powerful framework for exploring genetic variation in the context of the wealth of existing genome annotations that are available for the human genome.

  • Docs
  • Code
  • Paper

Recent Tweets

Twitter Feeds
logo

© Copyright 2025. Utah Center for Genetic Discovery. All Rights Reserved.

logo