Double data entry: What value, what price?

Simon Day; Peter Fayers; Derek Harvey

doi:10.1016/S0197-2456(97)00096-2

Double data entry: What value, what price?

Simon Day^*, Peter Fayers, Derek Harvey

^*Corresponding author for this work

Applied Health Sciences

Research output: Contribution to journal › Article › peer-review

72 Citations (Scopus)

Abstract

We challenge the notion that double data entry is either sufficient or necessary to ensure good-quality data in clinical trials. Although we do not completely reject that notion, we quantify some of the effects that poor quality data have on final study results in terms of estimation, significance testing, and power. By introducing digit errors into simulated blood pressure measurements we demonstrate that simple range checks allow us to detect (and therefore correct) the main errors that impact the final study results and conclusions. The errors that cannot easily be detected by such range checks, although possibly numerous, are shown to be of little importance in drawing the correct conclusions from the statistical analysis of data. Exploratory data analysis cannot identify all errors that a second data entry would detect, but on the other hand, not all errors that are found by exploratory data analysis are detectable by double data entry. Double data entry is concerned solely with ensuring, to a high degree of certainty, that what is recorded on the case record form is transcribed into the database. Exploratory data analysis looks beyond the case record form to challenge the plausibility of the written data. In this sense, the second entering of data has some benefit, but the use of exploratory data analysis methods, either as data entry is ongoing or at the end of data entry and as the first stage in an analysis strategy, should always be mandatory.

Original language	English
Pages (from-to)	15-24
Number of pages	10
Journal	Controlled Clinical Trials
Volume	19
Issue number	1
DOIs	https://doi.org/10.1016/S0197-2456(97)00096-2
Publication status	Published - 1 Feb 1998

Keywords

Data management
Data verification
Double data entry
Error rates
Good clinical practice
Quality
Single data entry

Access to Document

10.1016/S0197-2456(97)00096-2Licence: Unspecified

Cite this

@article{32f4925760104e09a9ece5847c7e57a3,

title = "Double data entry: What value, what price?",

abstract = "We challenge the notion that double data entry is either sufficient or necessary to ensure good-quality data in clinical trials. Although we do not completely reject that notion, we quantify some of the effects that poor quality data have on final study results in terms of estimation, significance testing, and power. By introducing digit errors into simulated blood pressure measurements we demonstrate that simple range checks allow us to detect (and therefore correct) the main errors that impact the final study results and conclusions. The errors that cannot easily be detected by such range checks, although possibly numerous, are shown to be of little importance in drawing the correct conclusions from the statistical analysis of data. Exploratory data analysis cannot identify all errors that a second data entry would detect, but on the other hand, not all errors that are found by exploratory data analysis are detectable by double data entry. Double data entry is concerned solely with ensuring, to a high degree of certainty, that what is recorded on the case record form is transcribed into the database. Exploratory data analysis looks beyond the case record form to challenge the plausibility of the written data. In this sense, the second entering of data has some benefit, but the use of exploratory data analysis methods, either as data entry is ongoing or at the end of data entry and as the first stage in an analysis strategy, should always be mandatory.",

keywords = "Data management, Data verification, Double data entry, Error rates, Good clinical practice, Quality, Single data entry",

author = "Simon Day and Peter Fayers and Derek Harvey",

year = "1998",

month = feb,

day = "1",

doi = "10.1016/S0197-2456(97)00096-2",

language = "English",

volume = "19",

pages = "15--24",

journal = "Controlled Clinical Trials",

issn = "0197-2456",

publisher = "Elsevier BV",

number = "1",

}

TY - JOUR

T1 - Double data entry

T2 - What value, what price?

AU - Day, Simon

AU - Fayers, Peter

AU - Harvey, Derek

PY - 1998/2/1

Y1 - 1998/2/1

N2 - We challenge the notion that double data entry is either sufficient or necessary to ensure good-quality data in clinical trials. Although we do not completely reject that notion, we quantify some of the effects that poor quality data have on final study results in terms of estimation, significance testing, and power. By introducing digit errors into simulated blood pressure measurements we demonstrate that simple range checks allow us to detect (and therefore correct) the main errors that impact the final study results and conclusions. The errors that cannot easily be detected by such range checks, although possibly numerous, are shown to be of little importance in drawing the correct conclusions from the statistical analysis of data. Exploratory data analysis cannot identify all errors that a second data entry would detect, but on the other hand, not all errors that are found by exploratory data analysis are detectable by double data entry. Double data entry is concerned solely with ensuring, to a high degree of certainty, that what is recorded on the case record form is transcribed into the database. Exploratory data analysis looks beyond the case record form to challenge the plausibility of the written data. In this sense, the second entering of data has some benefit, but the use of exploratory data analysis methods, either as data entry is ongoing or at the end of data entry and as the first stage in an analysis strategy, should always be mandatory.

AB - We challenge the notion that double data entry is either sufficient or necessary to ensure good-quality data in clinical trials. Although we do not completely reject that notion, we quantify some of the effects that poor quality data have on final study results in terms of estimation, significance testing, and power. By introducing digit errors into simulated blood pressure measurements we demonstrate that simple range checks allow us to detect (and therefore correct) the main errors that impact the final study results and conclusions. The errors that cannot easily be detected by such range checks, although possibly numerous, are shown to be of little importance in drawing the correct conclusions from the statistical analysis of data. Exploratory data analysis cannot identify all errors that a second data entry would detect, but on the other hand, not all errors that are found by exploratory data analysis are detectable by double data entry. Double data entry is concerned solely with ensuring, to a high degree of certainty, that what is recorded on the case record form is transcribed into the database. Exploratory data analysis looks beyond the case record form to challenge the plausibility of the written data. In this sense, the second entering of data has some benefit, but the use of exploratory data analysis methods, either as data entry is ongoing or at the end of data entry and as the first stage in an analysis strategy, should always be mandatory.

KW - Data management

KW - Data verification

KW - Double data entry

KW - Error rates

KW - Good clinical practice

KW - Quality

KW - Single data entry

UR - http://www.scopus.com/inward/record.url?scp=0031933157&partnerID=8YFLogxK

U2 - 10.1016/S0197-2456(97)00096-2

DO - 10.1016/S0197-2456(97)00096-2

M3 - Article

C2 - 9492966

AN - SCOPUS:0031933157

SN - 0197-2456

VL - 19

SP - 15

EP - 24

JO - Controlled Clinical Trials

JF - Controlled Clinical Trials

IS - 1

ER -

Double data entry: What value, what price?

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this