To be or not to be associated

power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies

Elise Vaumourin*, Gwenael Vourc'h, Sandra Telfer, Xavier Lambin, Diaeldin Salih, Ulrike Seitzer, Serge Morand, Nathalie Charbonnel, Muriel Vayssier-Taussat, Patrick Gasqui

*Corresponding author for this work

Research output: Contribution to journalArticle

11 Citations (Scopus)
3 Downloads (Pure)

Abstract

A growing number of studies are reporting simultaneous infections by parasites in many different hosts. The detection of whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall detection of association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect the overall and the detail of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e., the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations either due to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microfi and Anaplasma phagocytophilum and (2) bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. The differentiation between associations, which are due to confounding factors and parasite interactions was not possible. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T parva, T mutans, and T velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Unraveling whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up researches focusing on the mechanisms driving interactions between parasites.

Original languageEnglish
Article number62
Number of pages11
JournalFrontiers in cellular and infection microbiology
Volume4
DOIs
Publication statusPublished - 15 May 2014

Fingerprint

Parasites
Cross-Sectional Studies
Bartonella
Babesia
Anaplasma phagocytophilum
Theileria
Arvicolinae
Parasitic Diseases
Chi-Square Distribution
Genomics
Population
Rodentia
Epidemiology
Medicine
Technology
Research

Keywords

  • associations
  • interactions
  • modeling
  • parasite community
  • screening
  • GLM approach
  • network model
  • chi-square test
  • component community structure
  • central equatoria state
  • Lake District Region
  • molecular-detection
  • Southern Sudan
  • Northern Spain
  • networks
  • Babesia
  • population
  • Theileria

Cite this

To be or not to be associated : power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies. / Vaumourin, Elise; Vourc'h, Gwenael; Telfer, Sandra; Lambin, Xavier; Salih, Diaeldin; Seitzer, Ulrike; Morand, Serge; Charbonnel, Nathalie; Vayssier-Taussat, Muriel; Gasqui, Patrick.

In: Frontiers in cellular and infection microbiology, Vol. 4, 62, 15.05.2014.

Research output: Contribution to journalArticle

Vaumourin, Elise ; Vourc'h, Gwenael ; Telfer, Sandra ; Lambin, Xavier ; Salih, Diaeldin ; Seitzer, Ulrike ; Morand, Serge ; Charbonnel, Nathalie ; Vayssier-Taussat, Muriel ; Gasqui, Patrick. / To be or not to be associated : power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies. In: Frontiers in cellular and infection microbiology. 2014 ; Vol. 4.
@article{3376d27c4b544cafb59feb940b5d9a06,
title = "To be or not to be associated: power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies",
abstract = "A growing number of studies are reporting simultaneous infections by parasites in many different hosts. The detection of whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall detection of association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect the overall and the detail of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e., the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations either due to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microfi and Anaplasma phagocytophilum and (2) bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. The differentiation between associations, which are due to confounding factors and parasite interactions was not possible. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T parva, T mutans, and T velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Unraveling whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up researches focusing on the mechanisms driving interactions between parasites.",
keywords = "associations, interactions, modeling, parasite community, screening, GLM approach, network model, chi-square test, component community structure, central equatoria state, Lake District Region, molecular-detection, Southern Sudan, Northern Spain, networks, Babesia, population, Theileria",
author = "Elise Vaumourin and Gwenael Vourc'h and Sandra Telfer and Xavier Lambin and Diaeldin Salih and Ulrike Seitzer and Serge Morand and Nathalie Charbonnel and Muriel Vayssier-Taussat and Patrick Gasqui",
note = "Acknowledgments We are grateful to the « Tiques et Maladies {\`a} Tiques » working group of the « R{\'e}seau Ecologie des Interactions Durables » for discussion and support. This modeling work was supported by the Animal Health department of National Institute of Agronomic Research (http://www.inra.fr), Auvergne region (http://www.auvergnesciences.com), the Metaprogramme MEM (projet Patho-ID) of INRA and the EU grant FP7-261504 EDENext. It is cataloged by the EDENext Steering Committee as EDENext208 (http://www.edenext.eu). The contents of this publication are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission. The field vole fieldwork was supported by funding from the Natural Environment Research Council (grant GR3/13051) and the Wellcome Trust (grants 075202/Z/04/Z and 070675/Z/03/Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.",
year = "2014",
month = "5",
day = "15",
doi = "10.3389/fcimb.2014.00062",
language = "English",
volume = "4",
journal = "Frontiers in cellular and infection microbiology",
issn = "2235-2988",
publisher = "Frontiers Media S.A.",

}

TY - JOUR

T1 - To be or not to be associated

T2 - power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies

AU - Vaumourin, Elise

AU - Vourc'h, Gwenael

AU - Telfer, Sandra

AU - Lambin, Xavier

AU - Salih, Diaeldin

AU - Seitzer, Ulrike

AU - Morand, Serge

AU - Charbonnel, Nathalie

AU - Vayssier-Taussat, Muriel

AU - Gasqui, Patrick

N1 - Acknowledgments We are grateful to the « Tiques et Maladies à Tiques » working group of the « Réseau Ecologie des Interactions Durables » for discussion and support. This modeling work was supported by the Animal Health department of National Institute of Agronomic Research (http://www.inra.fr), Auvergne region (http://www.auvergnesciences.com), the Metaprogramme MEM (projet Patho-ID) of INRA and the EU grant FP7-261504 EDENext. It is cataloged by the EDENext Steering Committee as EDENext208 (http://www.edenext.eu). The contents of this publication are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission. The field vole fieldwork was supported by funding from the Natural Environment Research Council (grant GR3/13051) and the Wellcome Trust (grants 075202/Z/04/Z and 070675/Z/03/Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

PY - 2014/5/15

Y1 - 2014/5/15

N2 - A growing number of studies are reporting simultaneous infections by parasites in many different hosts. The detection of whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall detection of association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect the overall and the detail of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e., the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations either due to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microfi and Anaplasma phagocytophilum and (2) bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. The differentiation between associations, which are due to confounding factors and parasite interactions was not possible. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T parva, T mutans, and T velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Unraveling whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up researches focusing on the mechanisms driving interactions between parasites.

AB - A growing number of studies are reporting simultaneous infections by parasites in many different hosts. The detection of whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall detection of association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect the overall and the detail of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e., the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations either due to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microfi and Anaplasma phagocytophilum and (2) bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. The differentiation between associations, which are due to confounding factors and parasite interactions was not possible. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T parva, T mutans, and T velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Unraveling whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up researches focusing on the mechanisms driving interactions between parasites.

KW - associations

KW - interactions

KW - modeling

KW - parasite community

KW - screening

KW - GLM approach

KW - network model

KW - chi-square test

KW - component community structure

KW - central equatoria state

KW - Lake District Region

KW - molecular-detection

KW - Southern Sudan

KW - Northern Spain

KW - networks

KW - Babesia

KW - population

KW - Theileria

U2 - 10.3389/fcimb.2014.00062

DO - 10.3389/fcimb.2014.00062

M3 - Article

VL - 4

JO - Frontiers in cellular and infection microbiology

JF - Frontiers in cellular and infection microbiology

SN - 2235-2988

M1 - 62

ER -