metaseq: a Python framework for integrating high-throughput sequencing analyses

A session at SciPy 2013

  • Ryan Dale

Wednesday 26th June, 2013

5:13pm to 5:26pm (CST)

metaseq is a Python package that ties together a growing ecosystem of bioinformatics Python tools and file formats, focusing on flexibility and interactive exploration of high-throughput sequencing data (e.g., ChIP-seq, RNA-seq, and RIP-seq).

This talk will use a worked example to illustrate some practical bioinformatics applications of metaseq's features. For example, its filetype adapters provide random-access, uniform support for commonly-used formats (BAM, bigBed/bigWig, and, via tabix, any tab-delimited format). Combined with multiprocessing and a rebinning routine compiled by Cython, this allows relatively rapid population of NumPy arrays of binned signal over thousands of genes (or other features of interest).

metaseq's "mini-browser" framework connects these arrays -- or any other plot that considers genomic intervals, such as scatterplots of control vs treatment RNA-seq signal -- via callbacks to interactive creation of matplotlib figures that show the local genomic signal and gene models.
Alternatively, callbacks can upload data and display them in the UCSC genome browser for further visualization alongside the wealth of publicly available data.

About the speaker

This person is speaking at this event.
Ryan Dale

National Institute of Diabetes and Digestive and Kidney Diseases

Coverage of this session

Sign in to add slides, notes or videos to this session

Tell your friends!


Time 5:13pm5:26pm CST

Date Wed 26th June 2013

Short URL


View the schedule


See something wrong?

Report an issue with this session