Sources of variation in performance on a shared OSCE station across four UK medical schools

Alistair Chesser*, Helen Cameron, Phillip Evans, Jennifer Cleland, Kathy Boursicot, Gary Mires

*Corresponding author for this work

Research output: Contribution to journal › Article

21 Citations (Scopus)

Abstract

Context: High-stakes undergraduate clinical assessments should be based on transparent standards comparable between different medical schools. However, simply sharing questions and pass marks may not ensure comparable standards and judgements. We hypothesised that in multicentre examinations, teaching institutions contribute to systematic variations in students' marks between different medical schools through the behaviour of their markers, standard-setters and simulated patients.

Methods: We embedded a common objective structured clinical examination (OSCE) station in four UK medical schools. All students were examined by a locally trained examiner as well as by a centrally provided examiner. Central and local examiners did not confer. Pass scores were calculated using the borderline groups method. Mean scores awarded by each examiner group were also compared. Systematic variations in scoring between schools and between local and central examiners were analysed.

Results: Pass scores varied slightly but significantly between each school, and between local and central examiners. The patterns of variation were usually systematic between local and central examiners (either consistently lower or higher). In some cases scores given by one examiner pair were significantly different from those awarded by other pairs in the same school, implying that other factors (possibly simulated patient behaviour) make a significant difference to student scoring.

Conclusions: Shared undergraduate clinical assessments should not rely on scoring systems and standard setting which fail to take into account other differences between schools. Examiner behaviour and training and other local factors are important contributors to variations in scores between schools. The OSCE scores of students from different medical schools should not be directly compared without taking such systematic variations into consideration.
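The standard-setting procedure named in the abstract, the borderline groups method, derives a station's pass score from examiners' global judgements: the cut score is the mean checklist score of the candidates whose performance the examiner rated as borderline. The sketch below illustrates that calculation only; the 0-100 checklist scale, the rating labels and the sample records are illustrative assumptions, not the study's instrument or data.

    from statistics import mean

    # Each record pairs an examiner's checklist score (0-100, assumed scale)
    # with the examiner's separate global judgement of the same performance.
    # The values below are invented for illustration.
    performances = [
        {"checklist": 72, "global": "clear pass"},
        {"checklist": 55, "global": "borderline"},
        {"checklist": 61, "global": "borderline"},
        {"checklist": 48, "global": "fail"},
        {"checklist": 58, "global": "borderline"},
        {"checklist": 80, "global": "clear pass"},
    ]

    def borderline_groups_pass_score(records):
        """Borderline groups method: cut score = mean checklist score
        of the performances given a 'borderline' global rating."""
        borderline = [r["checklist"] for r in records if r["global"] == "borderline"]
        if not borderline:
            raise ValueError("No borderline ratings; the cut score is undefined for this method.")
        return mean(borderline)

    print(borderline_groups_pass_score(performances))  # 58 for the sample records above

Because each school derives its cut score from its own examiners' borderline judgements, any systematic leniency or stringency in those judgements feeds directly into the pass mark, which is the comparability problem the study examines.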

Original language: English
Pages (from-to): 526-532
Number of pages: 7
Journal: Medical Education
Volume: 43
Issue number: 6
ISSN: 0308-0110
Publisher: Wiley-Blackwell
DOI: 10.1111/j.1365-2923.2009.03370.x
Publication status: Published - Jun 2009
Scopus record: http://www.scopus.com/inward/record.url?scp=65649130464&partnerID=8YFLogxK


ASJC Scopus subject areas

  • Education

Cite this

Chesser, A., Cameron, H., Evans, P., Cleland, J., Boursicot, K., & Mires, G. (2009). Sources of variation in performance on a shared OSCE station across four UK medical schools. Medical Education, 43(6), 526-532. https://doi.org/10.1111/j.1365-2923.2009.03370.x
