Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

M. A. Petersen; M. Groenvold; J. Bjorner; N. Aaronson; T. Conroy; A. Cull; Peter Fayers; M. Hjermstad; M. A. G. Sprangers; M. Sullivan; European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group

doi:10.1023/A:1023488915557

Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

M. A. Petersen, M. Groenvold, J. Bjorner, N. Aaronson, T. Conroy, A. Cull, Peter Fayers, M. Hjermstad, M. A. G. Sprangers, M. Sullivan, European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group

Applied Health Sciences

Research output: Contribution to journal › Article › peer-review

111 Citations (Scopus)

Abstract

In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.

Original language	English
Pages (from-to)	373-385
Number of pages	12
Journal	Quality of Life Research
Volume	12
DOIs	https://doi.org/10.1023/A:1023488915557
Publication status	Published - 2003

Keywords

differential item functioning
multi-item scales
translation
validity
LOGISTIC-REGRESSION
RESPONSE THEORY
QUALITY
BIAS

Access to Document

10.1023/A:1023488915557

Cite this

Petersen, M. A., Groenvold, M., Bjorner, J., Aaronson, N., Conroy, T., Cull, A., Fayers, P., Hjermstad, M., Sprangers, M. A. G., Sullivan, M., & European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group (2003). Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire. Quality of Life Research, 12, 373-385. https://doi.org/10.1023/A:1023488915557

Petersen, MA, Groenvold, M, Bjorner, J, Aaronson, N, Conroy, T, Cull, A, Fayers, P, Hjermstad, M, Sprangers, MAG, Sullivan, M & European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group 2003, 'Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire', Quality of Life Research, vol. 12, pp. 373-385. https://doi.org/10.1023/A:1023488915557

@article{14589bffb8154e30b5258aa5a7206cd0,

title = "Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire",

abstract = "In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.",

keywords = "differential item functioning, multi-item scales, translation, validity, LOGISTIC-REGRESSION, RESPONSE THEORY, QUALITY, BIAS",

author = "Petersen, {M. A.} and M. Groenvold and J. Bjorner and N. Aaronson and T. Conroy and A. Cull and Peter Fayers and M. Hjermstad and Sprangers, {M. A. G.} and M. Sullivan and {European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group}",

year = "2003",

doi = "10.1023/A:1023488915557",

language = "English",

volume = "12",

pages = "373--385",

journal = "Quality of Life Research",

issn = "0962-9343",

publisher = "Springer ",

}

TY - JOUR

T1 - Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

AU - Petersen, M. A.

AU - Groenvold, M.

AU - Bjorner, J.

AU - Aaronson, N.

AU - Conroy, T.

AU - Cull, A.

AU - Fayers, Peter

AU - Hjermstad, M.

AU - Sprangers, M. A. G.

AU - Sullivan, M.

AU - European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group

PY - 2003

Y1 - 2003

N2 - In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.

AB - In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.

KW - differential item functioning

KW - multi-item scales

KW - translation

KW - validity

KW - LOGISTIC-REGRESSION

KW - RESPONSE THEORY

KW - QUALITY

KW - BIAS

U2 - 10.1023/A:1023488915557

DO - 10.1023/A:1023488915557

M3 - Article

SN - 0962-9343

VL - 12

SP - 373

EP - 385

JO - Quality of Life Research

JF - Quality of Life Research

ER -

Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

Abstract

Keywords

Access to Document

Fingerprint

Cite this