Utah Center for Genetic Discovery
  • Home
  • People
    • Investigators
    • Research Team
    • Collaborators
  • Recharge Center
  • Research
    • Projects
    • Publications
    • Sponsors
  • Software
  • News
  • Join Us

Software

This page serves as an index for the applications written and distributed by the Yandell, Marth, and Quinlan labs. Each item may include links to: documentation, code, and publications.

  • All
  • Yandell Lab
  • Marth Lab
  • Quinlan Lab
Software is listed with most recent releases first.


VARPRISM (VARiant PRIoritization SuM)
A software package that identifies genes with a statistical excess of damaging de novo mutations among individuals with a genetic disease. VARPRISM incorporates functional variant prediction information (the VAAST CASM score) to improve the statistical power of risk gene mapping and controls for local mutation rate heterogeneity. The beta version of VARPRISM is currently available for download.

  • Docs
  • Code
  • Paper


VCFAnno
annotates a VCF with any number of sorted and tabixed input BED, BAM, and VCF files in parallel. It does this by finding overlaps as it streams over the data and applying user-defined operations on the overlapping annotations.

  • Docs
  • Code
  • Paper


Taxonomer
Taxonomer is an ultrafast web-tool for comprehensive metagenomics data analysis and interactive results visualization. Taxonomer is unique in providing integrated nucleotide and protein-based classification and simultaneous host mRNA transcript profiling.

  • Docs
  • Code
  • Paper


WHAM (WHole-genome Alignment Metrics)
A structural variant (SV) caller that integrates several sources of mapping information to identify SVs. WHAM classifies SVs using a flexible and extendable machine-learning algorithm (random forest).

  • Docs
  • Code
  • Paper


Genome Query Tools (GQT)
A command line tool and a C API for storing and querying large-scale genotype data sets like those produced by 1000 Genomes, the Uk100K, and forthcoming datasets involving millions of genomes.

  • Docs
  • Code
  • Paper


SpeedSeq
An open-source genome analysis platform that accomplishes alignment, variant detection and functional annotation of a 50× human genome in 13 h on a low-cost server and alleviates a bioinformatics bottleneck that typically demands weeks of computation with extensive hands-on expert involvement.

  • Docs
  • Code
  • Paper


Pedagree
Compares familial-relationships and sexes as reported in a PED file with those inferred from a VCF.

  • Docs
  • Code


MAKER-P
A pipeline designed to make the annotation of novel plant genomes tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources.

  • Docs
  • Code
  • Paper


iobio
iobio uses immediate visual feedback to make understanding complex genomic datasets more intuitive, and analysis more interactive.

  • Docs
  • Code
  • Paper


Poretools
A flexible toolkit for exploring datasets generated by nanopore sequencing devices from MinION for the purposes of quality control and downstream analysis.

  • Docs
  • Code
  • Paper


Tangram
A C/C++ command line toolbox for structural variation(SV) detection.

  • Docs
  • Code


GKNO
A tool and pipeline management system that can be used to effectively deploy the majority of tools developed in the MarthLab as well as other third-party tools.

  • Docs
  • Code


RUFUS
A new approach to variant detection that does not rely on mapping or whole genome assembly methods.

  • Docs
  • Code


bedtools
These utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks.

  • Docs
  • Code
  • Paper


SubcloneSeeker
A computational framework for reconstructing tumor subclone structures.

  • Docs
  • Code
  • Paper


Lumpy
A probabilistic framework that we have developed to integrate multiple structural variation signals such as discordant paired-end alignments and split-read alignments.

  • Docs
  • Code
  • Paper


pVAAST (pedigree Variant Annotation, Analysis & Search Tool)
A disease-gene identification tool designed for high-throughput sequence data in pedigrees.

  • Docs
  • Code
  • Paper


PHEVOR (Phenotype Driven Variant Ontological Re-ranking tool)
Integrates phenotype, gene function, and disease information with personal genomic data for improved power to identify disease-causing alleles.

  • Docs
  • Code
  • Paper


MOSAIK
A stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome.

  • Docs
  • Code


GEMINI
A powerful framework for exploring genetic variation in the context of the wealth of existing genome annotations that are available for the human genome.

  • Docs
  • Code
  • Paper


GPAT ++ (Genotype Phenotype Association Toolkit)
The application of population genomics to non-model organisms is greatly facilitated by the low cost of next generation sequencing (NGS).

  • Docs
  • Code
  • Paper


ImagePlane
Python based image analysis software designed for the automated analysis of images of the animal S.

  • Docs
  • Code


MAKER
A portable and easily configurable genome annotation pipeline.

  • Docs
  • Code
  • Paper


VAAST 2 (Variant Annotation, Analysis & Search Tool)
Probabilistic search tool for identifying damaged genes and their disease-causing variants in personal genome sequences.

  • Docs
  • Code
  • Paper


BamTools
A Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment.

  • Docs
  • Code
  • Paper


RepeatRunner
A CGL-based program that integrates RepeatMasker with BLASTX to provide a comprehensive means of identifying repetitive elements.

  • Docs
  • Code


CGL (Comparitive Genomics Library, and pronounced as “Seagull”)
Provides an informatics infrastructure for a laboratory, department, or research institute engaged in the large-scale analysis of genomes and their annotations.

  • Docs
  • Code
  • Paper


Freebayes
A Bayesian genetic variant detector designed to find small polymorphisms.

  • Docs
  • Code


SCISSORS
A split-read aligner that maps orphaned read mates (i.e. where one end-mate is aligned with high mapping quality, but the other mate is unmapped), as well as re-maps severely clipped reads (reads mapped with many unaligned or “clipped-off” bases).

  • Docs
  • Code

VISIT the Yandell Lab Software page


VARPRISM (VARiant PRIoritization SuM)
A software package that identifies genes with a statistical excess of damaging de novo mutations among individuals with a genetic disease. VARPRISM incorporates functional variant prediction information (the VAAST CASM score) to improve the statistical power of risk gene mapping and controls for local mutation rate heterogeneity. The beta version of VARPRISM is currently available for download.

  • Docs
  • Code
  • Paper


Taxonomer
Taxonomer is an ultrafast web-tool for comprehensive metagenomics data analysis and interactive results visualization. Taxonomer is unique in providing integrated nucleotide and protein-based classification and simultaneous host mRNA transcript profiling.

  • Docs
  • Code
  • Paper


WHAM (WHole-genome Alignment Metrics)
A structural variant (SV) caller that integrates several sources of mapping information to identify SVs. WHAM classifies SVs using a flexible and extendable machine-learning algorithm (random forest).

  • Docs
  • Code
  • Paper


pVAAST (pedigree Variant Annotation, Analysis & Search Tool)
A disease-gene identification tool designed for high-throughput sequence data in pedigrees.

  • Docs
  • Code
  • Paper


PHEVOR (Phenotype Driven Variant Ontological Re-ranking tool)
Integrates phenotype, gene function, and disease information with personal genomic data for improved power to identify disease-causing alleles.

  • Docs
  • Code
  • Paper


GPAT ++ (Genotype Phenotype Association Toolkit)
The application of population genomics to non-model organisms is greatly facilitated by the low cost of next generation sequencing (NGS).

  • Docs
  • Code
  • Paper


ImagePlane
Python based image analysis software designed for the automated analysis of images of the animal S.

  • Docs
  • Code


VAAST 2 (Variant Annotation, Analysis & Search Tool)
Probabilistic search tool for identifying damaged genes and their disease-causing variants in personal genome sequences.

  • Docs
  • Code
  • Paper


MAKER-P
A pipeline designed to make the annotation of novel plant genomes tractable for small groups with limited bioinformatics experience and resources, and faster and more transparent for large groups with more experience and resources.

  • Docs
  • Code
  • Paper


MAKER
A portable and easily configurable genome annotation pipeline.

  • Docs
  • Code
  • Paper


RepeatRunner
A CGL-based program that integrates RepeatMasker with BLASTX to provide a comprehensive means of identifying repetitive elements.

  • Docs
  • Code


CGL (Comparitive Genomics Library, and pronounced as “Seagull”)
Provides an informatics infrastructure for a laboratory, department, or research institute engaged in the large-scale analysis of genomes and their annotations.

  • Docs
  • Code
  • Paper
VISIT the Marth Lab Software page


iobio
iobio uses immediate visual feedback to make understanding complex genomic datasets more intuitive, and analysis more interactive.

  • Docs
  • Code
  • Paper


GKNO
A tool and pipeline management system that can be used to effectively deploy the majority of tools developed in the MarthLab as well as other third-party tools.

  • Docs
  • Code


RUFUS
A new approach to variant detection that does not rely on mapping or whole genome assembly methods.

  • Docs
  • Code


SubcloneSeeker
A computational framework for reconstructing tumor subclone structures.

  • Docs
  • Code
  • Paper


Freebayes
A Bayesian genetic variant detector designed to find small polymorphisms.

  • Docs
  • Code


BamTools
A Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment.

  • Docs
  • Code
  • Paper


MOSAIK
A stable, sensitive and open-source program for mapping second and third-generation sequencing reads to a reference genome.

  • Docs
  • Code


SCISSORS
A split-read aligner that maps orphaned read mates (i.e. where one end-mate is aligned with high mapping quality, but the other mate is unmapped), as well as re-maps severely clipped reads (reads mapped with many unaligned or “clipped-off” bases).

  • Docs
  • Code


Tangram
A C/C++ command line toolbox for structural variation(SV) detection.

  • Docs
  • Code

VISIT the Quinlan Lab Software page


VCFAnno
annotates a VCF with any number of sorted and tabixed input BED, BAM, and VCF files in parallel. It does this by finding overlaps as it streams over the data and applying user-defined operations on the overlapping annotations.

  • Docs
  • Code
  • Paper


Genome Query Tools (GQT)
A command line tool and a C API for storing and querying large-scale genotype data sets like those produced by 1000 Genomes, the Uk100K, and forthcoming datasets involving millions of genomes.

  • Docs
  • Code
  • Paper


SpeedSeq
An open-source genome analysis platform that accomplishes alignment, variant detection and functional annotation of a 50× human genome in 13 h on a low-cost server and alleviates a bioinformatics bottleneck that typically demands weeks of computation with extensive hands-on expert involvement.

  • Docs
  • Code
  • Paper


Pedagree
Compares familial-relationships and sexes as reported in a PED file with those inferred from a VCF.

  • Docs
  • Code


bedtools
These utilities are a swiss-army knife of tools for a wide-range of genomics analysis tasks.

  • Docs
  • Code
  • Paper


Poretools
A flexible toolkit for exploring datasets generated by nanopore sequencing devices from MinION for the purposes of quality control and downstream analysis.

  • Docs
  • Code
  • Paper


Lumpy
A probabilistic framework that we have developed to integrate multiple structural variation signals such as discordant paired-end alignments and split-read alignments.

  • Docs
  • Code
  • Paper


GEMINI
A powerful framework for exploring genetic variation in the context of the wealth of existing genome annotations that are available for the human genome.

  • Docs
  • Code
  • Paper

Recent Tweets

Twitter Feeds
Aaron Quinlan
Aaron Quinlan@aaronquinlanRT @JulieCKiefer: "The outstanding science stemming from this program is changing an understanding of HIV/AIDS, other viral diseases." U o…47 minutes ago09 August 2022
Aaron Quinlan
Aaron Quinlan@aaronquinlanWe have a paper that has been sitting at a journal for more than four months without view reviewers assigned, despi… twitter.com/i/web/status/1…1 day ago08 August 2022
brent pedersen
brent pedersen@brent_pamong many other nuggest: 4.7X improvement in diagnosis rate given a full trio. twitter.com/carolinefwrigh…2 weeks ago27 July 2022
brent pedersen
brent pedersen@brent_pRT @carolinefwright: Mega-DDD diagnostics preprint! Iterative variant detection, annotation, filtering and interpretation, plus analysis of…2 weeks ago27 July 2022
brent pedersen
brent pedersen@brent_pRT @QAGreenways: We should redesign cities for autonomous kids, not autonomous cars. pic.twitter.com/bJ2ZerRdHA1 month ago28 June 2022
brent pedersen
brent pedersen@brent_pRT @EssiLaajala: "we note that this method will have a sensitivity and specificity that will depend on each dataset and we recommend that u…2 months ago22 June 2022
brent pedersen
brent pedersen@brent_pRT @EssiLaajala: Ten years ago, @brent_p et al. developed an efficient and generalizable tool called Comb-p to adjust spatially correlated…2 months ago22 June 2022
brent pedersen
brent pedersen@brent_pRT @EssiLaajala: For anyone working on #DNAMethylation or other spatially correlated data: doi.org/10.1080/155922… On how to avoid serious…2 months ago22 June 2022
IDbyDNA
IDbyDNA@idbydnaincThis week is National Laboratory Professionals week. Thank you to all laboratory professionals, who play a vital ro… twitter.com/i/web/status/1…3 months ago28 April 2022
IDbyDNA
IDbyDNA@idbydnaincIDbyDNA announces software update to their #Explify®* platform, improving assay analytical performance and software… twitter.com/i/web/status/1…3 months ago27 April 2022
IDbyDNA
IDbyDNA@idbydnaincIn case you missed us at #ECCMID: Check out our scientific posters and poster presentations featuring the use of… twitter.com/i/web/status/1…3 months ago26 April 2022
IDbyDNA
IDbyDNA@idbydnaincDon’t miss @johnrossen at the @illumina Integrated Symposium at #ECCMID 2022: Moving Next-Generation Sequencing to… twitter.com/i/web/status/1…4 months ago22 April 2022
IDbyDNA
IDbyDNA@idbydnainc#ECCMID 2022 - Pathogen Surveillance by #mNGS for Intubated Patients: A Reflex to Culture Model (RPIP).” Don’t miss… twitter.com/i/web/status/1…4 months ago22 April 2022
IDbyDNA
IDbyDNA@idbydnainc#ECCMID 2022 - Abstract preview: “Uropathogen Detection by Precision Metagenomics in Culture-Positive, Culture-Nega… twitter.com/i/web/status/1…4 months ago21 April 2022
Alistair Ward
Alistair Ward@AlistairNWardOur gene.iobio tool from the #iobio suite that is an important part of Frameshift Genomics' Mosaic platform has jus… twitter.com/i/web/status/1…9 months ago01 November 2021
Gabor Marth
Gabor Marth@MarthGaborThe paper describing our gene.iobio interactive variant analysis web tool is finally out: rdcu.be/czqFG.… twitter.com/i/web/status/1…10 months ago13 October 2021
Alistair Ward
Alistair Ward@AlistairNWardOur paper on our genetic variant analysis tool, gene.iobio, just came out. Take a look at the paper and the tool (… twitter.com/i/web/status/1…10 months ago13 October 2021
Alistair Ward
Alistair Ward@AlistairNWardWe've added a lot of new functionality to Mosaic. See the tutorial for more details: frameshift.io/blog/using-cha…1 year ago02 August 2021
Alistair Ward
Alistair Ward@AlistairNWardA new app from the iobio team has just been released. Take a look at the blog post and give the app a try iobio.io/2020/08/25/cli…2 years ago27 August 2020
Alistair Ward
Alistair Ward@AlistairNWardRT @pkerpedjiev: Here’s a fun project I’ve been working on for a while: a snappy bam file viewer built with @higlass_io 😎 Check out the dem…2 years ago25 March 2020
Gabor Marth
Gabor Marth@MarthGaborThe much-awaited Mosaic genomic viz and analytics platform is out from Framshift! Great work, guys! twitter.com/ChaseAllnMille…3 years ago28 January 2020
UCGDGenetics
UCGDGenetics@USTARGeneticsGreat news! twitter.com/averyholton/st…3 years ago19 November 2019
UCGDGenetics
UCGDGenetics@USTARGeneticsRT @NPR: New DNA-sequencing technology has made it possible to rapidly diagnose some baffling rare diseases that make babies sick. And eve…3 years ago28 October 2019
UCGDGenetics
UCGDGenetics@USTARGeneticsRT @JulieCKiefer: "We as parents are not all equal. Some of us pass on more mutations than others and this is an important source of geneti…3 years ago28 October 2019
Gabor Marth
Gabor Marth@MarthGaborRT @strnr: Another useful tool from an excellent team: genepanel.iobio.io twitter.com/biorxiv_genomi…3 years ago07 August 2019
UCGDGenetics
UCGDGenetics@USTARGeneticsRT @aaronquinlan: Check out our latest from @tomsasani detailing mosaicism and marked inter-family variability in germline mutation accumul…3 years ago19 February 2019
UCGDGenetics
UCGDGenetics@USTARGeneticsRT @tomsasani: Our new work describing de novo mutation dynamics in 33 large, three-generation CEPH/Utah pedigrees is up on bioRxiv (https:…3 years ago19 February 2019
UCGDGenetics
UCGDGenetics@USTARGeneticsRT @JulieCKiefer: When the top minds in medicine and science work together, they can do incredible things. Watch One in a Million, an 8-min…3 years ago11 February 2019
Gabor Marth
Gabor Marth@MarthGaborA brand new iobio.io app just out! genepanel.iobio.io helps you create a list of candidate gene… twitter.com/i/web/status/1…4 years ago04 December 2018
Gabor Marth
Gabor Marth@MarthGaborRT @aaronquinlan: In our new manuscript, we used WGS and comprehensive variant detection to diagnose 14/14 patients w/ EIEE. We implicated…4 years ago15 August 2018
Gabor Marth
Gabor Marth@MarthGaborRT @StacyWKish: Doctors @UofUHealth applied high-tech tools developed at @USTARGenetics to identify mutations causing a rare childhood dise…4 years ago13 August 2018
logo

© Copyright 2022. Utah Center for Genetic Discovery. All Rights Reserved.

logo