Is This Reliable Enough? Examining Classification Consistency and Accuracy in a Criterion-Referenced Test

Journal Title: International Journal of Assessment Tools in Education - Year 2016, Vol 3, Issue 2

Abstract

One important step for assessing the quality of a test is to examine the reliability of test score interpretation. Which aspect of reliability is the most relevant depends on what type of test it is and how the scores are to be used. For criterion-referenced tests, and in particular certification tests, where students are classified into performance categories, primary focus need not be on the size of error but on the impact of this error on classification. This impact can be described in terms of classification consistency and classification accuracy. In this article selected methods from classical test theory for estimating classification consistency and classification accuracy were applied to the theory part of the Swedish driving licence test, a high-stakes criterion-referenced test which is rarely studied in terms of reliability of classification. The results for this particular test indicated a level of classification consistency that falls slightly short of the recommended level which is why lengthening the test should be considered. More evidence should also be gathered as to whether the placement of the cut-off score is appropriate since this has implications for the validity of classifications.

Authors and Affiliations

Susanne Alger

Keywords

Related Articles

Comparing Physics Textbooks in Terms of Assessment and Evaluation Tools

Assessment and evaluation instruments provide teachers the opportunity of shaping education in the beginning, contributing to education during the process and evaluating education at the end of the process. Textbooks, on...

Prospective Teachers’ Tendencies to Utilize From the Facilities of Contemporary Educational Technology

In terms of effectiveness and efficiency, it is important to determine the views of prospective teachers related to taking advantage of the facilities of contemporary educational technology. This study which aims to iden...

Developmental Mathematics Students: Who are They and What is Their Mathematics Self-Efficacy?

The purpose of this quantitative study was to determine differences in developmental mathematics students’ self-efficacy within the demographic data from the survey. Data from a sample of 240 Intermediate Algebra student...

Development of the Teacher Candidates’ Level of being Affected from Public Personnel Selection Examination Scale

This study aimed to develop a scale to evaluate teacher candidates' level of being affected from the public personnel selection examination. The participants of the study consisted of the final year students at Pamukkale...

Assessing Metacognition: Theory and Practices

Many researchers in education emphasized students’ metacognition should be fostered for academic development and achievement. However, to support students’ metacognitive development and adequacy appropriately, their meta...

Download PDF file
  • EP ID EP174034
  • DOI 10.21449/ijate.245198
  • Views 132
  • Downloads 0

How To Cite

Susanne Alger (2016). Is This Reliable Enough? Examining Classification Consistency and Accuracy in a Criterion-Referenced Test. International Journal of Assessment Tools in Education, 3(2), 137-150. https://europub.co.uk/articles/-A-174034