Mapping affymetrix microarray probes to the rat genome via a persistent index

Susan Fairley, John McClure, Neil Hanlon, Robert W Irving, Martin W McBride, Anna F Dominiczak, Ela Hunt

Research output: Contribution to journalArticle

Abstract

A probe mapping technique using a novel implementation of a persistent q-gram index was developed. It guarantees to find all matches that meet certain definitions. These include exact matching of the central 19 bases of 25 base probes, matching the central 19 bases with at most one or three mismatches and exact matching of any 16 bases. In comparison with BLAST and BLAT, the new methods were either significantly faster or identified matches missed by the heuristics. The 16 bp method was used to map the 342,410 perfect match probes from the Affymetrix GeneChip Rat Genome 230 2.0 Array to the genome. When compared with the mapping from Ensembl, the new mapping included over seven million novel matches, providing additional evidence for researchers wishing to further investigate the sources of signals measured in microarray experiments. The results demonstrate the practicality of the index, which could support other q-gram based algorithms.
Original languageEnglish
Pages (from-to)48-65
Number of pages18
JournalInternational Journal of Knowledge Discovery in Bioinformatics
Volume1
Issue number1
DOIs
Publication statusPublished - Jan 2010

Fingerprint

genome
probe
heuristics
index
experiment
method
comparison

Cite this

Fairley, S., McClure, J., Hanlon, N., Irving, R. W., McBride, M. W., Dominiczak, A. F., & Hunt, E. (2010). Mapping affymetrix microarray probes to the rat genome via a persistent index. International Journal of Knowledge Discovery in Bioinformatics, 1(1), 48-65. https://doi.org/10.4018/978-1-4666-1785-8.ch002

Mapping affymetrix microarray probes to the rat genome via a persistent index. / Fairley, Susan; McClure, John; Hanlon, Neil; Irving, Robert W; McBride, Martin W; Dominiczak, Anna F; Hunt, Ela.

In: International Journal of Knowledge Discovery in Bioinformatics, Vol. 1, No. 1, 01.2010, p. 48-65.

Research output: Contribution to journalArticle

Fairley, Susan ; McClure, John ; Hanlon, Neil ; Irving, Robert W ; McBride, Martin W ; Dominiczak, Anna F ; Hunt, Ela. / Mapping affymetrix microarray probes to the rat genome via a persistent index. In: International Journal of Knowledge Discovery in Bioinformatics. 2010 ; Vol. 1, No. 1. pp. 48-65.
@article{b387aa00975e416599e83087617e1c12,
title = "Mapping affymetrix microarray probes to the rat genome via a persistent index",
abstract = "A probe mapping technique using a novel implementation of a persistent q-gram index was developed. It guarantees to find all matches that meet certain definitions. These include exact matching of the central 19 bases of 25 base probes, matching the central 19 bases with at most one or three mismatches and exact matching of any 16 bases. In comparison with BLAST and BLAT, the new methods were either significantly faster or identified matches missed by the heuristics. The 16 bp method was used to map the 342,410 perfect match probes from the Affymetrix GeneChip Rat Genome 230 2.0 Array to the genome. When compared with the mapping from Ensembl, the new mapping included over seven million novel matches, providing additional evidence for researchers wishing to further investigate the sources of signals measured in microarray experiments. The results demonstrate the practicality of the index, which could support other q-gram based algorithms.",
author = "Susan Fairley and John McClure and Neil Hanlon and Irving, {Robert W} and McBride, {Martin W} and Dominiczak, {Anna F} and Ela Hunt",
year = "2010",
month = "1",
doi = "10.4018/978-1-4666-1785-8.ch002",
language = "English",
volume = "1",
pages = "48--65",
journal = "International Journal of Knowledge Discovery in Bioinformatics",
issn = "1947-9115",
number = "1",

}

TY - JOUR

T1 - Mapping affymetrix microarray probes to the rat genome via a persistent index

AU - Fairley, Susan

AU - McClure, John

AU - Hanlon, Neil

AU - Irving, Robert W

AU - McBride, Martin W

AU - Dominiczak, Anna F

AU - Hunt, Ela

PY - 2010/1

Y1 - 2010/1

N2 - A probe mapping technique using a novel implementation of a persistent q-gram index was developed. It guarantees to find all matches that meet certain definitions. These include exact matching of the central 19 bases of 25 base probes, matching the central 19 bases with at most one or three mismatches and exact matching of any 16 bases. In comparison with BLAST and BLAT, the new methods were either significantly faster or identified matches missed by the heuristics. The 16 bp method was used to map the 342,410 perfect match probes from the Affymetrix GeneChip Rat Genome 230 2.0 Array to the genome. When compared with the mapping from Ensembl, the new mapping included over seven million novel matches, providing additional evidence for researchers wishing to further investigate the sources of signals measured in microarray experiments. The results demonstrate the practicality of the index, which could support other q-gram based algorithms.

AB - A probe mapping technique using a novel implementation of a persistent q-gram index was developed. It guarantees to find all matches that meet certain definitions. These include exact matching of the central 19 bases of 25 base probes, matching the central 19 bases with at most one or three mismatches and exact matching of any 16 bases. In comparison with BLAST and BLAT, the new methods were either significantly faster or identified matches missed by the heuristics. The 16 bp method was used to map the 342,410 perfect match probes from the Affymetrix GeneChip Rat Genome 230 2.0 Array to the genome. When compared with the mapping from Ensembl, the new mapping included over seven million novel matches, providing additional evidence for researchers wishing to further investigate the sources of signals measured in microarray experiments. The results demonstrate the practicality of the index, which could support other q-gram based algorithms.

U2 - 10.4018/978-1-4666-1785-8.ch002

DO - 10.4018/978-1-4666-1785-8.ch002

M3 - Article

VL - 1

SP - 48

EP - 65

JO - International Journal of Knowledge Discovery in Bioinformatics

JF - International Journal of Knowledge Discovery in Bioinformatics

SN - 1947-9115

IS - 1

ER -