Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

M. A. Petersen, M. Groenvold, J. Bjorner, N. Aaronson, T. Conroy, A. Cull, Peter Fayers, M. Hjermstad, M. A. G. Sprangers, M. Sullivan, European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group

Research output: Contribution to journal › Article

91 Citations (Scopus)

Abstract

In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages, we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale compared with the original English version. We tested for DIF using two different methods in parallel: a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original, we discuss how findings of DIF can be interpreted, and we distinguish linguistic DIF from DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.
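
As an illustration of what a contingency-table DIF test can look like, the sketch below computes a Mantel-Haenszel common odds ratio and chi-square statistic. This is an assumption made for illustration: the abstract does not say which contingency table method the authors used, and the Mantel-Haenszel procedure is swapped in here only because it is a widely used example of the family. The sketch also assumes the item has been dichotomised (the QLQ-C30 items have more than two response categories) and that respondents are stratified by a matching score such as the scale total. The function name and all counts are hypothetical.

```python
def mantel_haenszel_dif(strata):
    """Mantel-Haenszel DIF check for one dichotomised item.

    strata: list of (a, b, c, d) counts, one 2x2 table per matching-score level.
      a, b = original-language respondents who do / do not endorse the item
      c, d = translated-version respondents who do / do not endorse the item
    Returns (common odds ratio, continuity-corrected chi-square with 1 df).
    Raises ZeroDivisionError if the tables carry no usable information.
    """
    num = den = 0.0
    sum_a = sum_expected = sum_var = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n < 2:
            continue  # a stratum this small contributes nothing
        num += a * d / n
        den += b * c / n
        sum_a += a
        # Expected value and variance of a under the no-DIF hypothesis
        sum_expected += (a + b) * (a + c) / n
        sum_var += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
    odds_ratio = num / den
    corrected = max(0.0, abs(sum_a - sum_expected) - 0.5)
    chi_square = corrected ** 2 / sum_var
    return odds_ratio, chi_square


# Hypothetical counts, not data from the study: three score levels,
# 40 respondents per language version at each level.
no_dif = [(10, 30, 10, 30), (20, 20, 20, 20), (30, 10, 30, 10)]
with_dif = [(20, 20, 10, 30), (30, 10, 20, 20), (35, 5, 30, 10)]

print(mantel_haenszel_dif(no_dif))
print(mantel_haenszel_dif(with_dif))
```

An odds ratio near 1 with a small chi-square suggests that, at the same score level, respondents answer the item similarly in both language versions. A chi-square above 3.84 (the 5% critical value for one degree of freedom) would flag the item for closer inspection, which, as the abstract notes, does not by itself establish a linguistic problem.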

Original language: English
Pages (from-to): 373-385
Number of pages: 12
Journal: Quality of Life Research
Volume: 12
DOI: https://doi.org/10.1023/A:1023488915557
Publication status: Published - 2003

Keywords

  • differential item functioning
  • multi-item scales
  • translation
  • validity
  • LOGISTIC-REGRESSION
  • RESPONSE THEORY
  • QUALITY
  • BIAS

Cite this

Petersen, M. A., Groenvold, M., Bjorner, J., Aaronson, N., Conroy, T., Cull, A., ... European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group (2003). Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire. Quality of Life Research, 12, 373-385. https://doi.org/10.1023/A:1023488915557

Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire. / Petersen, M. A.; Groenvold, M.; Bjorner, J.; Aaronson, N.; Conroy, T.; Cull, A.; Fayers, Peter; Hjermstad, M.; Sprangers, M. A. G.; Sullivan, M.; European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group.

In: Quality of Life Research, Vol. 12, 2003, p. 373-385.


Petersen, MA, Groenvold, M, Bjorner, J, Aaronson, N, Conroy, T, Cull, A, Fayers, P, Hjermstad, M, Sprangers, MAG, Sullivan, M & European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group 2003, 'Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire', Quality of Life Research, vol. 12, pp. 373-385. https://doi.org/10.1023/A:1023488915557
Petersen, M. A. ; Groenvold, M. ; Bjorner, J. ; Aaronson, N. ; Conroy, T. ; Cull, A. ; Fayers, Peter ; Hjermstad, M. ; Sprangers, M. A. G. ; Sullivan, M. ; European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group. / Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire. In: Quality of Life Research. 2003 ; Vol. 12. pp. 373-385.
@article{14589bffb8154e30b5258aa5a7206cd0,
title = "Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire",
abstract = "In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.",
keywords = "differential item functioning, multi-item scales, translation, validity, LOGISTIC-REGRESSION, RESPONSE THEORY, QUALITY, BIAS",
author = "Petersen, {M. A.} and M. Groenvold and J. Bjorner and N. Aaronson and T. Conroy and A. Cull and Peter Fayers and M. Hjermstad and Sprangers, {M. A. G.} and M. Sullivan and {European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group}",
year = "2003",
doi = "10.1023/A:1023488915557",
language = "English",
volume = "12",
pages = "373--385",
journal = "Quality of Life Research",
issn = "0962-9343",
publisher = "Springer",

}

TY - JOUR
T1 - Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire
AU - Petersen, M. A.
AU - Groenvold, M.
AU - Bjorner, J.
AU - Aaronson, N.
AU - Conroy, T.
AU - Cull, A.
AU - Fayers, Peter
AU - Hjermstad, M.
AU - Sprangers, M. A. G.
AU - Sullivan, M.
AU - European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group
PY - 2003
Y1 - 2003

N2 - In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.

AB - In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.

KW - differential item functioning
KW - multi-item scales
KW - translation
KW - validity
KW - LOGISTIC-REGRESSION
KW - RESPONSE THEORY
KW - QUALITY
KW - BIAS
U2 - 10.1023/A:1023488915557
DO - 10.1023/A:1023488915557
M3 - Article
VL - 12
SP - 373
EP - 385
JO - Quality of Life Research
JF - Quality of Life Research
SN - 0962-9343
ER -