Hyb

a bioinformatics pipeline for the analysis of CLASH (crosslinking, ligation and sequencing of hybrids) data

Anthony J Travis, Jonathan Moody, Aleksandra Helwak, David Tollervey, Grzegorz Kudla*

*Corresponding author for this work

Research output: Contribution to journalArticle

21 Citations (Scopus)
4 Downloads (Pure)

Abstract

Associations between proteins and RNA-RNA duplexes are important in post-transcriptional regulation of gene expression. The CLASH (Cross-linking, Ligation and Sequencing of Hybrids) technique captures RNA-RNA interactions by physically joining two RNA molecules associated with a protein complex into a single chimeric RNA molecule. These events are relatively rare and considerable effort is needed to detect a small number of chimeric sequences amongst millions of non-chimeric cDNA reads resulting from a CLASH experiment. We present the "hyb" bioinformatics pipeline, which we developed to analyse high-throughput cDNA sequencing data from CLASH experiments. Although primarily designed for use with AGO CLASH data, hyb can also be used for the detection and annotation of chimeric reads in other high-throughput sequencing datasets. We examined the sensitivity and specificity of chimera detection in a test dataset using the BLAST, BLAST+, BLAT, pBLAT and Bowtie2 read alignment programs. We obtained the most reliable results in the shortest time using a combination of preprocessing with Flexbar and subsequent read-mapping using Bowtie2. The "hyb" software is distributed under the GNU GPL (General Public License) and can be downloaded from https://github.com/gkudla/hyb. (c) 2013 The Authors. Published by Elsevier Inc. All rights reserved.

Original languageEnglish
Pages (from-to)263-273
Number of pages11
JournalMethods
Volume65
Issue number3
DOIs
Publication statusPublished - Feb 2014

Keywords

  • clash
  • RNA-RNA interactions
  • bioinformatics
  • high-throughput sequencing
  • wide identification
  • RNA interactions
  • binding-sites
  • clip
  • alignment
  • protein
  • transcripts
  • microRNAs
  • software
  • brain

Cite this

Hyb : a bioinformatics pipeline for the analysis of CLASH (crosslinking, ligation and sequencing of hybrids) data. / Travis, Anthony J; Moody, Jonathan; Helwak, Aleksandra; Tollervey, David; Kudla, Grzegorz.

In: Methods, Vol. 65, No. 3, 02.2014, p. 263-273.

Research output: Contribution to journalArticle

Travis, Anthony J ; Moody, Jonathan ; Helwak, Aleksandra ; Tollervey, David ; Kudla, Grzegorz. / Hyb : a bioinformatics pipeline for the analysis of CLASH (crosslinking, ligation and sequencing of hybrids) data. In: Methods. 2014 ; Vol. 65, No. 3. pp. 263-273.
@article{c0d9da4fac674140b336cc908bc65def,
title = "Hyb: a bioinformatics pipeline for the analysis of CLASH (crosslinking, ligation and sequencing of hybrids) data",
abstract = "Associations between proteins and RNA-RNA duplexes are important in post-transcriptional regulation of gene expression. The CLASH (Cross-linking, Ligation and Sequencing of Hybrids) technique captures RNA-RNA interactions by physically joining two RNA molecules associated with a protein complex into a single chimeric RNA molecule. These events are relatively rare and considerable effort is needed to detect a small number of chimeric sequences amongst millions of non-chimeric cDNA reads resulting from a CLASH experiment. We present the {"}hyb{"} bioinformatics pipeline, which we developed to analyse high-throughput cDNA sequencing data from CLASH experiments. Although primarily designed for use with AGO CLASH data, hyb can also be used for the detection and annotation of chimeric reads in other high-throughput sequencing datasets. We examined the sensitivity and specificity of chimera detection in a test dataset using the BLAST, BLAST+, BLAT, pBLAT and Bowtie2 read alignment programs. We obtained the most reliable results in the shortest time using a combination of preprocessing with Flexbar and subsequent read-mapping using Bowtie2. The {"}hyb{"} software is distributed under the GNU GPL (General Public License) and can be downloaded from https://github.com/gkudla/hyb. (c) 2013 The Authors. Published by Elsevier Inc. All rights reserved.",
keywords = "clash, RNA-RNA interactions, bioinformatics, high-throughput sequencing, wide identification, RNA interactions, binding-sites, clip, alignment, protein, transcripts, microRNAs, software, brain",
author = "Travis, {Anthony J} and Jonathan Moody and Aleksandra Helwak and David Tollervey and Grzegorz Kudla",
year = "2014",
month = "2",
doi = "10.1016/j.ymeth.2013.10.015",
language = "English",
volume = "65",
pages = "263--273",
journal = "Methods",
issn = "1046-2023",
publisher = "Academic Press Inc.",
number = "3",

}

TY - JOUR

T1 - Hyb

T2 - a bioinformatics pipeline for the analysis of CLASH (crosslinking, ligation and sequencing of hybrids) data

AU - Travis, Anthony J

AU - Moody, Jonathan

AU - Helwak, Aleksandra

AU - Tollervey, David

AU - Kudla, Grzegorz

PY - 2014/2

Y1 - 2014/2

N2 - Associations between proteins and RNA-RNA duplexes are important in post-transcriptional regulation of gene expression. The CLASH (Cross-linking, Ligation and Sequencing of Hybrids) technique captures RNA-RNA interactions by physically joining two RNA molecules associated with a protein complex into a single chimeric RNA molecule. These events are relatively rare and considerable effort is needed to detect a small number of chimeric sequences amongst millions of non-chimeric cDNA reads resulting from a CLASH experiment. We present the "hyb" bioinformatics pipeline, which we developed to analyse high-throughput cDNA sequencing data from CLASH experiments. Although primarily designed for use with AGO CLASH data, hyb can also be used for the detection and annotation of chimeric reads in other high-throughput sequencing datasets. We examined the sensitivity and specificity of chimera detection in a test dataset using the BLAST, BLAST+, BLAT, pBLAT and Bowtie2 read alignment programs. We obtained the most reliable results in the shortest time using a combination of preprocessing with Flexbar and subsequent read-mapping using Bowtie2. The "hyb" software is distributed under the GNU GPL (General Public License) and can be downloaded from https://github.com/gkudla/hyb. (c) 2013 The Authors. Published by Elsevier Inc. All rights reserved.

AB - Associations between proteins and RNA-RNA duplexes are important in post-transcriptional regulation of gene expression. The CLASH (Cross-linking, Ligation and Sequencing of Hybrids) technique captures RNA-RNA interactions by physically joining two RNA molecules associated with a protein complex into a single chimeric RNA molecule. These events are relatively rare and considerable effort is needed to detect a small number of chimeric sequences amongst millions of non-chimeric cDNA reads resulting from a CLASH experiment. We present the "hyb" bioinformatics pipeline, which we developed to analyse high-throughput cDNA sequencing data from CLASH experiments. Although primarily designed for use with AGO CLASH data, hyb can also be used for the detection and annotation of chimeric reads in other high-throughput sequencing datasets. We examined the sensitivity and specificity of chimera detection in a test dataset using the BLAST, BLAST+, BLAT, pBLAT and Bowtie2 read alignment programs. We obtained the most reliable results in the shortest time using a combination of preprocessing with Flexbar and subsequent read-mapping using Bowtie2. The "hyb" software is distributed under the GNU GPL (General Public License) and can be downloaded from https://github.com/gkudla/hyb. (c) 2013 The Authors. Published by Elsevier Inc. All rights reserved.

KW - clash

KW - RNA-RNA interactions

KW - bioinformatics

KW - high-throughput sequencing

KW - wide identification

KW - RNA interactions

KW - binding-sites

KW - clip

KW - alignment

KW - protein

KW - transcripts

KW - microRNAs

KW - software

KW - brain

U2 - 10.1016/j.ymeth.2013.10.015

DO - 10.1016/j.ymeth.2013.10.015

M3 - Article

VL - 65

SP - 263

EP - 273

JO - Methods

JF - Methods

SN - 1046-2023

IS - 3

ER -