ERIC - Search Results

Showing all 7 results

A Practical Comparison of Decision Consistency Estimates

Peer reviewed

Wolkowitz, Amanda A.; Smith, Russell – Practical Assessment, Research & Evaluation, 2024

A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…

Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
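
The definition in the abstract above can be illustrated with a minimal sketch. It assumes, hypothetically, that each examinee has observed scores on two parallel forms and that a score at or above the cut score counts as a pass. The function name, the cut score, and the data are illustrative only and are not taken from the article; in practice DC is often estimated from a single administration, which this sketch does not attempt.

```python
from typing import Sequence


def decision_consistency(
    scores_a: Sequence[float],
    scores_b: Sequence[float],
    cut_score: float,
) -> float:
    """Proportion of examinees with the same pass/fail decision on both forms.

    Assumption: a score >= cut_score is a pass on either form.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("Both forms need one score per examinee.")
    if len(scores_a) == 0:
        raise ValueError("At least one examinee is required.")
    # Count examinees whose pass/fail decision agrees across the two forms.
    same = sum(
        (a >= cut_score) == (b >= cut_score)
        for a, b in zip(scores_a, scores_b)
    )
    return same / len(scores_a)


# Hypothetical scores for five examinees on two parallel forms.
form_a = [55, 72, 68, 80, 49]
form_b = [61, 70, 63, 78, 52]
dc = decision_consistency(form_a, form_b, cut_score=65)
print(f"Decision consistency: {dc:.0%}")
```

Here only the third examinee changes classification (pass on the first form, fail on the second), so four of the five decisions agree.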

Estimating Classification Decisions for Incomplete Tests

Peer reviewed

Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021

Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…

Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics

Relative Diagnostic Profile: A Subscore Reporting Framework

Peer reviewed

Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018

Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…

Descriptors: Classification, Probability, Pass Fail Grading, Scores

Determining Item Screening Criteria Using Cost-Benefit Analysis

Peer reviewed

Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019

Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…

Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy

Using Keystroke Analytics to Improve Pass-Fail Classifiers

Peer reviewed

Casey, Kevin – Journal of Learning Analytics, 2017

Learning analytics offers insights into student behaviour and the potential to detect poor performers before they fail exams. If the activity is primarily online (for example computer programming), a wealth of low-level data can be made available that allows unprecedented accuracy in predicting which students will pass or fail. In this paper, we…

Descriptors: Keyboarding (Data Entry), Educational Research, Data Collection, Data Analysis

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making