Abstract
Many studies have attempted to quantify differences in severity between raters working on the same assessments. Their results show that interindividual differences in severity are often substantial, regardless of the assessment context. However, few studies have modeled the intra-individual evolution of severity over time, and even fewer have compared, over a given period, the intra-individual range of severity to the interindividual one. This paper addresses that gap by comparing the ratios between the intra- and interindividual severity ranges of six raters who, from September 2011 to April 2014, scored the oral expression test of the Test d’évaluation du français adapté au Québec (TEFAQ). These six raters assessed the performance of 4,083 candidates, and their severity was estimated using the many-facet Rasch model. Five rater dyads were followed over five distinct periods, totaling 11 to 38 time points, with severity estimated one to four times per month. For each period, this made it possible to compute an intra-individual and an interindividual severity range; these ranges were then compared to obtain a ratio indicating whether a given rater’s severity fluctuates as much over time as it differs from a peer’s. Results show that, overall, intra-individual differences are as large as interindividual ones, with a median ratio of 0.97, despite the small number of raters included in each model. Finally, the practical implications of these results are discussed, along with the methodological and conceptual limitations of the study.
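The range-ratio computation summarized above can be sketched concretely. The Python snippet below is a minimal illustration under stated assumptions, not the study's actual procedure: it assumes severities are many-facet Rasch estimates in logits, takes the intra-individual range as the spread (max minus min) of one rater's estimates across a period's time points, pools both dyad members' estimates to form the interindividual range, and averages the two intra-individual ranges before dividing. The abstract does not specify the exact operationalization, and the data shown are invented.

```python
# Illustrative sketch (hypothetical data): one way to compute an
# intra-/interindividual severity-range ratio for a rater dyad over
# one period. Severities are assumed to be MFRM estimates in logits.

def severity_range(series):
    """Spread (max minus min) of a severity time series."""
    return max(series) - min(series)

# Hypothetical monthly severity estimates (logits) for two raters.
rater_a = [0.12, 0.35, -0.05, 0.28, 0.10]
rater_b = [-0.20, -0.02, 0.15, -0.10, 0.05]

# Intra-individual range: how much each rater drifts over the period.
intra_a = severity_range(rater_a)
intra_b = severity_range(rater_b)

# One possible interindividual range: the spread of the two raters'
# estimates pooled over the whole period.
inter = severity_range(rater_a + rater_b)

# A ratio near 1 would mean a rater fluctuates over time roughly as
# much as the two raters differ from each other.
ratio = ((intra_a + intra_b) / 2) / inter
print(f"intra A = {intra_a:.2f}, intra B = {intra_b:.2f}, "
      f"inter = {inter:.2f}, ratio = {ratio:.2f}")
```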
Keywords:
- rater severity,
- temporal drift in severity,
- French as a second language (L2),
- rater effects