Large-scale language testing uses statistical information to account for the quality of an assessment system. In this reflection article, I explain how basic statistics can be used meaningfully in the context of classroom language assessment. The paper explores a series of statistical calculations that can be used to examine test scores and assessment decisions in the language classroom. Interpretations for criterion-referenced assessment underlie these calculations. Finally, I discuss limitations and offer recommendations for teachers who want to use statistics.
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The authors of the manuscripts accepted for publication in HOW journal are required to sign a nonexclusive license agreement allowing ASOCOPI to reproduce the full text on the Internet or in any other available source. Authors retain copyright of their manuscripts with the following restrictions: first publication is granted to ASOCOPI; nonexclusive agreements with third parties can be established as long as the original publication in the HOW journal is properly acknowledged.