Functional principal component data analysis

a new method for analysing microbial community fingerprints

Janine B Illian, James I Prosser, Kate L Baker, J Ignacio Rangel-Castro

Research output: Contribution to journalArticle

18 Citations (Scopus)

Abstract

A common approach to molecular characterisation of microbial communities in natural environments is the amplification of small subunit (SSU) rRNA genes or genes encoding enzymes essential for a particular ecosystem function. A range of 'fingerprinting' techniques are available for the analysis of amplification products of both types of gene enabling quantitative or semi-quantitative analysis of relative abundances of different community members, and facilitating analysis of communities from large numbers of samples, including replicates. Statistical models that have been applied in this context suffer from a number of unavoidable limitations, including lack of distinction between closely adjacent bands or peaks, particularly when these differ significantly in intensity or size. Current approaches to the analysis of banding structures derived from gels are typically based on standard multivariate analysis methods such as principal component analysis (PCA) which do not consider structure of DGGE gels but treat the intensity of each band as independent from the other bands, ignoring local neighbourhood structures. This paper assesses whether a new statistical analytical technique, based on functional data analysis (FDA) methods, improves the discriminatory ability of molecular fingerprinting techniques. The approach regards band intensities as a mathematical function of the location on the gel and explicitly includes neighbourhood structure in the analysis. A simulation study clearly reveals the weaknesses of the standard PCA approach as opposed to the FDA approach, which is then used to analyse experimental DGGE data.
Original languageEnglish
Pages (from-to)89-95
Number of pages7
JournalJournal of Microbiological Methods
Volume79
Issue number1
DOIs
Publication statusPublished - Oct 2009

Fingerprint

Dermatoglyphics
Principal Component Analysis
Gels
Statistical Models
rRNA Genes
Genes
Ecosystem
Multivariate Analysis
Enzymes

Keywords

  • biodiversity
  • cluster analysis
  • colony count, microbial
  • DNA fingerprinting
  • DNA, bacterial
  • electrophoresis, polyacrylamide gel
  • environmental microbiology
  • nucleic acid denaturation
  • principal component analysis

Cite this

Functional principal component data analysis : a new method for analysing microbial community fingerprints. / Illian, Janine B; Prosser, James I; Baker, Kate L; Rangel-Castro, J Ignacio.

In: Journal of Microbiological Methods, Vol. 79, No. 1, 10.2009, p. 89-95.

Research output: Contribution to journalArticle

@article{471c433a77424fde9169037060e51805,
title = "Functional principal component data analysis: a new method for analysing microbial community fingerprints",
abstract = "A common approach to molecular characterisation of microbial communities in natural environments is the amplification of small subunit (SSU) rRNA genes or genes encoding enzymes essential for a particular ecosystem function. A range of 'fingerprinting' techniques are available for the analysis of amplification products of both types of gene enabling quantitative or semi-quantitative analysis of relative abundances of different community members, and facilitating analysis of communities from large numbers of samples, including replicates. Statistical models that have been applied in this context suffer from a number of unavoidable limitations, including lack of distinction between closely adjacent bands or peaks, particularly when these differ significantly in intensity or size. Current approaches to the analysis of banding structures derived from gels are typically based on standard multivariate analysis methods such as principal component analysis (PCA) which do not consider structure of DGGE gels but treat the intensity of each band as independent from the other bands, ignoring local neighbourhood structures. This paper assesses whether a new statistical analytical technique, based on functional data analysis (FDA) methods, improves the discriminatory ability of molecular fingerprinting techniques. The approach regards band intensities as a mathematical function of the location on the gel and explicitly includes neighbourhood structure in the analysis. A simulation study clearly reveals the weaknesses of the standard PCA approach as opposed to the FDA approach, which is then used to analyse experimental DGGE data.",
keywords = "biodiversity, cluster analysis, colony count, microbial, DNA fingerprinting, DNA, bacterial, electrophoresis, polyacrylamide gel, environmental microbiology, nucleic acid denaturation, principal component analysis",
author = "Illian, {Janine B} and Prosser, {James I} and Baker, {Kate L} and Rangel-Castro, {J Ignacio}",
year = "2009",
month = "10",
doi = "10.1016/j.mimet.2009.08.010",
language = "English",
volume = "79",
pages = "89--95",
journal = "Journal of Microbiological Methods",
issn = "0167-7012",
publisher = "Elsevier",
number = "1",

}

TY - JOUR

T1 - Functional principal component data analysis

T2 - a new method for analysing microbial community fingerprints

AU - Illian, Janine B

AU - Prosser, James I

AU - Baker, Kate L

AU - Rangel-Castro, J Ignacio

PY - 2009/10

Y1 - 2009/10

N2 - A common approach to molecular characterisation of microbial communities in natural environments is the amplification of small subunit (SSU) rRNA genes or genes encoding enzymes essential for a particular ecosystem function. A range of 'fingerprinting' techniques are available for the analysis of amplification products of both types of gene enabling quantitative or semi-quantitative analysis of relative abundances of different community members, and facilitating analysis of communities from large numbers of samples, including replicates. Statistical models that have been applied in this context suffer from a number of unavoidable limitations, including lack of distinction between closely adjacent bands or peaks, particularly when these differ significantly in intensity or size. Current approaches to the analysis of banding structures derived from gels are typically based on standard multivariate analysis methods such as principal component analysis (PCA) which do not consider structure of DGGE gels but treat the intensity of each band as independent from the other bands, ignoring local neighbourhood structures. This paper assesses whether a new statistical analytical technique, based on functional data analysis (FDA) methods, improves the discriminatory ability of molecular fingerprinting techniques. The approach regards band intensities as a mathematical function of the location on the gel and explicitly includes neighbourhood structure in the analysis. A simulation study clearly reveals the weaknesses of the standard PCA approach as opposed to the FDA approach, which is then used to analyse experimental DGGE data.

AB - A common approach to molecular characterisation of microbial communities in natural environments is the amplification of small subunit (SSU) rRNA genes or genes encoding enzymes essential for a particular ecosystem function. A range of 'fingerprinting' techniques are available for the analysis of amplification products of both types of gene enabling quantitative or semi-quantitative analysis of relative abundances of different community members, and facilitating analysis of communities from large numbers of samples, including replicates. Statistical models that have been applied in this context suffer from a number of unavoidable limitations, including lack of distinction between closely adjacent bands or peaks, particularly when these differ significantly in intensity or size. Current approaches to the analysis of banding structures derived from gels are typically based on standard multivariate analysis methods such as principal component analysis (PCA) which do not consider structure of DGGE gels but treat the intensity of each band as independent from the other bands, ignoring local neighbourhood structures. This paper assesses whether a new statistical analytical technique, based on functional data analysis (FDA) methods, improves the discriminatory ability of molecular fingerprinting techniques. The approach regards band intensities as a mathematical function of the location on the gel and explicitly includes neighbourhood structure in the analysis. A simulation study clearly reveals the weaknesses of the standard PCA approach as opposed to the FDA approach, which is then used to analyse experimental DGGE data.

KW - biodiversity

KW - cluster analysis

KW - colony count, microbial

KW - DNA fingerprinting

KW - DNA, bacterial

KW - electrophoresis, polyacrylamide gel

KW - environmental microbiology

KW - nucleic acid denaturation

KW - principal component analysis

U2 - 10.1016/j.mimet.2009.08.010

DO - 10.1016/j.mimet.2009.08.010

M3 - Article

VL - 79

SP - 89

EP - 95

JO - Journal of Microbiological Methods

JF - Journal of Microbiological Methods

SN - 0167-7012

IS - 1

ER -