Showing all 5 results
Peer reviewed
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Peer reviewed
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. The latter is the situation in the Netherlands' system of final examinations for secondary education, which serves as the example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
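The difficulty the abstract points to, precision of a decision based on several fallible subject scores, can be illustrated by simulation. The decision rule, scores, pass mark, and per-subject standard error below are all hypothetical, not taken from the article; a minimal sketch of the idea:

```python
import random

random.seed(2)

# Hypothetical decision rule (not from the article): pass if the average
# of three subject scores reaches 6.0, each subject measured with SEM 0.4.
TRUE_SCORES = {"math": 6.1, "language": 5.8, "science": 6.3}
SEM = 0.4
PASS_MARK = 6.0

def decision_consistency(n=20_000):
    """Probability that the observed average lands on the same side of
    the pass mark as the true average."""
    true_avg_passes = sum(TRUE_SCORES.values()) / len(TRUE_SCORES) >= PASS_MARK
    same = 0
    for _ in range(n):
        observed = [random.gauss(t, SEM) for t in TRUE_SCORES.values()]
        obs_passes = sum(observed) / len(observed) >= PASS_MARK
        same += obs_passes == true_avg_passes
    return same / n

print(f"decision consistency: {decision_consistency():.1%}")
```

Because the true average here sits close to the pass mark, measurement error flips the pass/fail decision in a substantial fraction of replications, even though each subject score on its own is reasonably precise.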
Peer reviewed
Dwyer, Carol Anne – Psychological Assessment, 1996
The uses and abuses of cut scores are examined. The article demonstrates (1) that cut scores always entail judgment; (2) that cut scores inherently result in misclassification; (3) that cut scores impose an artificial dichotomy on an essentially continuous distribution of knowledge, skill, or ability; and (4) that no true cut scores exist. (SLD)
Descriptors: Classification, Cutting Scores, Educational Testing, Error of Measurement
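Dwyer's second and third points, that cut scores inherently produce misclassification when imposed on a continuous distribution, are easy to demonstrate. The cut score and standard error of measurement below are hypothetical values chosen for illustration, not taken from the article:

```python
import random

random.seed(0)

CUT = 70.0   # hypothetical passing score (illustrative only)
SEM = 5.0    # hypothetical standard error of measurement

def misclassification_rate(true_score, n=10_000):
    """Share of observed scores falling on the wrong side of the cut,
    given a continuous true score and normally distributed error."""
    wrong = 0
    for _ in range(n):
        observed = random.gauss(true_score, SEM)
        if (observed >= CUT) != (true_score >= CUT):
            wrong += 1
    return wrong / n

for true_score in (55, 65, 69, 71, 75, 85):
    rate = misclassification_rate(true_score)
    print(f"true={true_score}: misclassified {rate:.1%} of the time")
```

Examinees whose true score lies near the cut are misclassified close to half the time, while those far from it almost never are; the dichotomy is sharp, but the underlying knowledge is not.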
Breyer, F. Jay; Lewis, Charles – 1994
A single-administration classification reliability index is described that estimates the probability of consistently classifying examinees to mastery or nonmastery states as if those examinees had been tested with two alternate forms. The procedure is applicable to any test used for classification purposes, subdividing that test into two…
Descriptors: Classification, Cutting Scores, Objective Tests, Pass Fail Grading
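The split-half idea the abstract describes, classifying each examinee twice from one administration and checking agreement, can be sketched in a few lines. The article's actual index is more elaborate than this; the odd/even split and the proportional mastery cut `CUT_PROP` below are illustrative assumptions:

```python
import random

random.seed(1)

CUT_PROP = 0.6   # hypothetical proportion-correct mastery cut (illustrative)

def split_half_classification_agreement(responses):
    """responses: list of per-examinee 0/1 item-score lists (even item count).

    Split each test into odd- and even-numbered items, classify the
    examinee as master/nonmaster on each half against the proportional
    cut, and report the share of examinees classified the same way by
    both halves.
    """
    agree = 0
    for items in responses:
        half_a = items[0::2]
        half_b = items[1::2]
        master_a = sum(half_a) / len(half_a) >= CUT_PROP
        master_b = sum(half_b) / len(half_b) >= CUT_PROP
        agree += master_a == master_b
    return agree / len(responses)

# Simulated data: 200 examinees answering 40 items, abilities spread
# around the cut so some classifications are genuinely uncertain.
data = [[int(random.random() < p) for _ in range(40)]
        for p in [random.uniform(0.3, 0.9) for _ in range(200)]]
print(f"consistent classifications: {split_half_classification_agreement(data):.1%}")
```

Agreement between the two halves serves as a single-administration stand-in for agreement between two alternate forms, which is the quantity the index estimates.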
Avner, R. A. – 1970
This report compares maximum linear prediction, maximum total correct classifications for a group, and maximum probability of correct classification for an individual as objective criteria for univariate grading scales. Since the goals of valid prediction and valid classification lead to conflicting criteria, it is possible that a compromise…
Descriptors: Achievement Rating, Classification, Evaluation, Evaluation Criteria