The impact of systematically repairing multiple choice questions with low discrimination on assessment reliability: An interrupted time series analysis

Desy, Janeve; Harvey, Adrian; Weeks, Sarah; Busche, Kevin D; Martin, Kerri; Paget, Michael; Naugler, Christopher; McLaughlin, Kevin

doi:https://doi.org/10.36834/cmej.77596

Messick S. Validity. 3rd ed. New York, NY: American Council on Education and Macmillan, 1989.

Kane MT. Validation. In: Brennan RL, ed. Educational measurement. 4th ed. Westport.: Praeger; 2006:17-64.

Messick S. The interplay of evidence and consequences in the validation of performance assessments. Education Researcher 1994;32:13-23. https://doi.org/10.2307/1176219

10.2307/1176219 Google Scholar

Cook DA, Brydges R, Ginsburg S, Hatala R. A contemporary approach to validity arguments: a practical guide to Kane's framework. Med Educ 2015;49(6):560-75. https://doi.org/10.1111/medu.12678

10.1111/medu.12678 Google Scholar

De Champlain AF. A primer on classical test theory and item response theory for assessments in medical education. Med Educ 2010;44(1):109-17. https://doi.org/10.1111/j.1365-2923.2009.03425.x

10.1111/j.1365-2923.2009.03425.x Google Scholar

Thorndike RL, Hagen E. Measurement and evaluation in psychology and education. New York: John Wiley and Sons Inc, 1961.

Google Scholar

Richardson MW. Notes on the rationale of item analysis. Psychometrika 1936;1:69-76. https://doi.org/10.1007/BF02287926

10.1007/BF02287926 Google Scholar

Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika 1951;16:297-334. https://doi.org/10.1007/BF02310555

10.1007/BF02310555 Google Scholar

Glass GV, Hopkins, K.D. Statistical methods in education and psychology. 3rd ed. Needham Heights, MA: Allyn and Bacon, 1995.

Google Scholar

Chiavaroli N. Negatively-worded multiple choice questions: an avoidable threat to validity. Pract Assessment Res Eval 2017;22:1-14. https://doi.org/10.1201/9780203739976-1

10.1201/9780203739976-1 Google Scholar

Schuwirth LW, van der Vleuten CP, Donkers HH. A closer look at cueing effects in multiple-choice questions. Med Educ 1996;30(1):44-9. https://doi.org/10.1111/j.1365-2923.1996.tb00716.x

10.1111/j.1365-2923.1996.tb00716.x Google Scholar

Rodriguez MC, Kettler RJ, Elliott SN. Distractor functioning in modified items for test accessibility. SAGE Open 2014;4(4). https://doi.org/10.1177/2158244014553586

10.1177/2158244014553586 Google Scholar

Office of Educational Assessment UoW. Understanding item analyses. Available from https://www.washington.edu/assessment/scanning-scoring/scoring/reports/item-analysis/

Google Scholar

McDowall D, McCleary R, Meidinger EE, Hay RA. Interrupted time series analysis. Newbury Park, CA: Sage Publications, 1980. https://doi.org/10.4135/9781412984607

10.4135/9781412984607 Google Scholar

15. Mandin H, Harasym P, Eagle C, Watanabe M. Developing a "clinical presentation" curriculum at the University of Calgary. Acad Med 1995;70(3):186-93. https://doi.org/10.1097/00001888-199503000-00008

10.1097/00001888-199503000-00008 Google Scholar

Ali SH, Carr PA, Ruit KG. Validity and reliability of scores obtained on multiple-choice questions: why functioning distractors matter. J Schol Teach Learn 2016;16:1-14. https://doi.org/10.14434/josotl.v16i1.19106

10.14434/josotl.v16i1.19106 Google Scholar

Hudson J, Fielding S, Ramsay CR. Methodology and reporting characteristics of studies using interrupted time series design in healthcare. BMC Med Res Methodol 2019;19(1):137. https://doi.org/10.1186/s12874-019-0777-x

10.1186/s12874-019-0777-x Google Scholar

Linden A. Conducting interrupted time-series analysis for single- and multiple-group comparisons. Stata J. 2015;15:480-500. https://doi.org/10.1177/1536867X1501500208

10.1177/1536867X1501500208 Google Scholar

Jiang S, Wang C, Weiss DJ. Sample size requirements for estimation of item parameters in the multidimensional graded response model. Front Psychol 2016;7:109. https://doi.org/10.3389/fpsyg.2016.00109

10.3389/fpsyg.2016.00109 Google Scholar

Strauss V. The real problem with multiple-choice tests. The Washington Post2013

Google Scholar

Abstract

Résumé

Bibliography

Résumés

Abstract

Résumé

Parties annexes

Bibliography

Outils de citation

Citer cet article

Exporter la notice de cet article