Article

A comparison of two different methods for setting performance standards for a test with constructed-response items

Authors
  • Gunilla Näsström
  • Peter Nyström

Abstract

The trustworthiness of performance standards influences the credibility of criterion-referenced large-scale testing. In this paper, two standard-setting methods are evaluated and compared, when applied to a test with polytomously scored constructed-response items. A version of the Angoff method is chosen as representative of the class of test-centred standard-setting procedures and the borderline-group method represents the class of examinee-centred procedures. The evaluation is based on procedural, internal and external evidence. The results indicate that both methods provide reasonable and trustworthy approaches to standard setting, but also confirm some of the potential problems with these methods.Accessed 23,651 times on https://pareonline.net from September 15, 2008 to December 31, 2019. For downloads from January 1, 2020 forward, please click on the PlumX Metrics link to the right.

Keywords: Test Construction

How to Cite:

Näsström, G. & Nyström, P., (2008) “A comparison of two different methods for setting performance standards for a test with constructed-response items”, Practical Assessment, Research, and Evaluation 13(1): 9. doi: https://doi.org/10.7275/bhb9-8t88

Downloads:
Download PDF
View PDF

200 Views

39 Downloads