Double data entry: What value, what price?

Simon Day*, Peter Fayers, Derek Harvey

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

72 Citations (Scopus)

Abstract

We challenge the notion that double data entry is either sufficient or necessary to ensure good-quality data in clinical trials. Although we do not completely reject that notion, we quantify some of the effects that poor quality data have on final study results in terms of estimation, significance testing, and power. By introducing digit errors into simulated blood pressure measurements we demonstrate that simple range checks allow us to detect (and therefore correct) the main errors that impact the final study results and conclusions. The errors that cannot easily be detected by such range checks, although possibly numerous, are shown to be of little importance in drawing the correct conclusions from the statistical analysis of data. Exploratory data analysis cannot identify all errors that a second data entry would detect, but on the other hand, not all errors that are found by exploratory data analysis are detectable by double data entry. Double data entry is concerned solely with ensuring, to a high degree of certainty, that what is recorded on the case record form is transcribed into the database. Exploratory data analysis looks beyond the case record form to challenge the plausibility of the written data. In this sense, the second entering of data has some benefit, but the use of exploratory data analysis methods, either as data entry is ongoing or at the end of data entry and as the first stage in an analysis strategy, should always be mandatory.

Original languageEnglish
Pages (from-to)15-24
Number of pages10
JournalControlled Clinical Trials
Volume19
Issue number1
DOIs
Publication statusPublished - 1 Feb 1998

Keywords

  • Data management
  • Data verification
  • Double data entry
  • Error rates
  • Good clinical practice
  • Quality
  • Single data entry

Fingerprint

Dive into the research topics of 'Double data entry: What value, what price?'. Together they form a unique fingerprint.

Cite this