Showing 1 to 15 of 16 results
Peer reviewed
PDF on ERIC
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees who would receive the same classification decision if they retook the same or a parallel form of the exam without memory of the first attempt.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
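To make the DC idea concrete, here is a minimal Python sketch with simulated data: two parallel forms are generated as true score plus independent error, and DC is estimated as the share of examinees who receive the same pass/fail decision on both. The sample size, score scale, error SD, and cut score are illustrative, not values from the article.

```python
import numpy as np

# Minimal simulation of the decision-consistency (DC) idea: the share of
# examinees classified the same way on two parallel forms of an exam.
# All parameters (N, score scale, error SD, cut score) are illustrative.
rng = np.random.default_rng(0)

n_examinees = 10_000
true_score = rng.normal(70, 10, n_examinees)   # hypothetical true scores
error_sd = 5.0                                 # per-form measurement error
cut = 65.0                                     # hypothetical passing score

# Two independent administrations of parallel forms
form_a = true_score + rng.normal(0, error_sd, n_examinees)
form_b = true_score + rng.normal(0, error_sd, n_examinees)

same_decision = (form_a >= cut) == (form_b >= cut)
print(f"Estimated decision consistency: {same_decision.mean():.3f}")
```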
Peer reviewed
Direct link
Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021
Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…
Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics
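One plausible reading of a model-based approach, sketched in Python: treat the ability estimate and its standard error as a normal posterior and compute the probability that true ability clears the passing standard. The numbers, and the normal approximation itself, are assumptions for illustration; the study's actual Bayesian procedure may differ.

```python
from scipy.stats import norm

# Hedged sketch: given an ability estimate and its (model-based) standard
# error, approximate the probability that the examinee's true ability lies
# at or above the passing standard. Numbers are illustrative only.
def pass_probability(theta_hat: float, se: float, theta_cut: float) -> float:
    """P(true theta >= cut) under a normal approximation to the posterior."""
    return 1.0 - norm.cdf(theta_cut, loc=theta_hat, scale=se)

# A shortened test (inaccessible material) typically widens the standard
# error, pulling the pass probability toward 0.5 near the cut score.
print(pass_probability(0.30, 0.25, 0.0))  # full-length test
print(pass_probability(0.30, 0.45, 0.0))  # test with inaccessible items
```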
Peer reviewed
Direct link
Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018
Subscore reporting under item response theory models has always been a challenge, partly because the test length of each subdomain is too short to locate individuals precisely on multiple continua. Diagnostic classification models (DCMs), which provide a pass/fail decision and an associated probability of passing each subdomain, are promising…
Descriptors: Classification, Probability, Pass Fail Grading, Scores
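A toy illustration of the DCM output described here, assuming a simple DINA-style model in which a single attribute underlies the subdomain: the posterior probability of mastery is computed from the item responses, with slip and guess rates that are purely illustrative, not values from the article.

```python
import numpy as np

# Toy DINA-style calculation for one subdomain: posterior probability of
# mastery given item responses, slip, and guess parameters.
def mastery_posterior(responses, slip, guess, prior=0.5):
    responses = np.asarray(responses, dtype=float)
    # Likelihood of the response string under mastery vs. non-mastery
    p_master = np.prod(np.where(responses == 1, 1 - slip, slip))
    p_nonmaster = np.prod(np.where(responses == 1, guess, 1 - guess))
    return prior * p_master / (prior * p_master + (1 - prior) * p_nonmaster)

# Five items on the subdomain, each with slip = .10 and guess = .20
post = mastery_posterior([1, 1, 0, 1, 1], slip=0.10, guess=0.20)
print(f"P(mastery) = {post:.3f}; decision: {'pass' if post >= 0.5 else 'fail'}")
```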
Peer reviewed
PDF on ERIC
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
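A hedged sketch of the cost-benefit logic, assuming a point-biserial screening rule (a common criterion, not necessarily the one the study examined): tighten the cutoff and track the cost (items lost) against the benefit (change in coefficient alpha) on simulated data.

```python
import numpy as np

# Cost-benefit sketch: vary an item-screening cutoff and compare items
# retained against the resulting coefficient alpha. Data are simulated.
rng = np.random.default_rng(1)
ability = rng.normal(size=2000)
difficulty = rng.uniform(-1.5, 1.5, size=40)
discrimination = rng.uniform(0.2, 1.2, size=40)
p = 1 / (1 + np.exp(-discrimination * (ability[:, None] - difficulty)))
X = rng.binomial(1, p)                     # 2000 examinees x 40 items

def alpha(X):
    """Cronbach's alpha from a 0/1 item-response matrix."""
    k = X.shape[1]
    return k / (k - 1) * (1 - X.var(axis=0, ddof=1).sum()
                          / X.sum(axis=1).var(ddof=1))

total = X.sum(axis=1)
r_pb = np.array([np.corrcoef(X[:, j], total - X[:, j])[0, 1]
                 for j in range(X.shape[1])])   # corrected item-total r

for cutoff in (0.00, 0.10, 0.20, 0.30):
    keep = r_pb >= cutoff
    print(f"cutoff {cutoff:.2f}: {keep.sum():2d} items kept, "
          f"alpha = {alpha(X[:, keep]):.3f}")
```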
Peer reviewed
PDF on ERIC
Casey, Kevin – Journal of Learning Analytics, 2017
Learning analytics offers insights into student behaviour and the potential to detect poor performers before they fail exams. When the activity is primarily online (for example, computer programming), a wealth of low-level data becomes available that allows unprecedented accuracy in predicting which students will pass or fail. In this paper, we…
Descriptors: Keyboarding (Data Entry), Educational Research, Data Collection, Data Analysis
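A sketch of the general approach with hypothetical features: fit a logistic regression predicting pass/fail from simulated low-level activity measures. The feature names (keystroke latency, active hours) and the data-generating model are placeholders, not the paper's actual predictors.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Predict pass/fail from simulated low-level activity features.
rng = np.random.default_rng(2)
n = 500
latency = rng.normal(300, 60, n)          # ms between keystrokes (assumed)
active_hours = rng.gamma(4, 3, n)         # hours of observed activity
# Simulated outcome: more activity and faster typing raise the pass odds
logit = -2.0 + 0.15 * active_hours - 0.004 * (latency - 300)
passed = rng.binomial(1, 1 / (1 + np.exp(-logit)))

X = np.column_stack([latency, active_hours])
X_tr, X_te, y_tr, y_te = train_test_split(X, passed, random_state=0)
model = LogisticRegression().fit(X_tr, y_tr)
print(f"Held-out accuracy: {model.score(X_te, y_te):.3f}")
```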
Peer reviewed
Direct link
Bramley, Tom – Educational Research, 2010
Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…
Descriptors: National Curriculum, Educational Research, Testing, Measurement
Peer reviewed
Direct link
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. The latter is the situation in the Dutch system of final examinations for secondary education, which is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
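A simplified simulation of the multi-test decision problem: each subject grade carries its own measurement error, and the pass/fail decision depends on the whole grade profile. The compensatory rule below (mean grade of at least 5.5 on a 1-10 scale) is an assumption for illustration, not the actual Dutch examination regulations.

```python
import numpy as np

# Decision accuracy when the pass/fail decision aggregates several
# error-prone subject grades. All numbers are illustrative.
rng = np.random.default_rng(3)
n_students, n_subjects = 20_000, 6
true_grades = rng.normal(6.5, 1.0, (n_students, n_subjects))
sem = 0.5                                  # per-subject error (assumed)

true_pass = true_grades.mean(axis=1) >= 5.5
observed = true_grades + rng.normal(0, sem, (n_students, n_subjects))
observed_pass = observed.mean(axis=1) >= 5.5

accuracy = (true_pass == observed_pass).mean()
print(f"Decision accuracy across the whole examination: {accuracy:.3f}")
```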
Mundy, C. Jean – Journal of Health, Physical Education and Recreation, 1974
Descriptors: Classification, Competency Based Education, Course Objectives, Evaluation
Peer reviewed
Dwyer, Carol Anne – Psychological Assessment, 1996
The uses and abuses of cut scores are examined. The article demonstrates (1) that cut scores always entail judgment; (2) that cut scores inherently result in misclassification; (3) that cut scores impose an artificial dichotomy on an essentially continuous distribution of knowledge, skill, or ability; and (4) that no true cut scores exist. (SLD)
Descriptors: Classification, Cutting Scores, Educational Testing, Error of Measurement
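Points (2) and (3) are easy to demonstrate by simulation: with a continuous ability distribution and fallible scores, misclassification is unavoidable and concentrates near the cut. All numbers below are illustrative.

```python
import numpy as np

# Dichotomizing a continuous ability distribution at a cut score: the
# examinees nearest the cut are the ones most often misclassified.
rng = np.random.default_rng(4)
true = rng.normal(0, 1, 50_000)
observed = true + rng.normal(0, 0.4, 50_000)   # score with measurement error
cut = 0.0

misclassified = (true >= cut) != (observed >= cut)
print(f"Overall misclassification rate: {misclassified.mean():.3f}")
near = np.abs(true - cut) < 0.25
print(f"...among examinees near the cut: {misclassified[near].mean():.3f}")
print(f"...among examinees far from it: {misclassified[~near].mean():.3f}")
```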
Breyer, F. Jay; Lewis, Charles – 1994
A single-administration classification reliability index is described that estimates the probability of consistently classifying examinees to mastery or nonmastery states as if those examinees had been tested with two alternate forms. The procedure is applicable to any test used for classification purposes, subdividing that test into two…
Descriptors: Classification, Cutting Scores, Objective Tests, Pass Fail Grading
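A skeleton of the single-administration idea in Python, assuming a simple odd/even split and a proportionally scaled cut score; the article's procedure is more refined, so treat this as the bare logic only.

```python
import numpy as np

# Split one administration into two half-forms, classify on each against
# a scaled cut, and take the agreement rate as a rough consistency index.
rng = np.random.default_rng(5)
ability = rng.normal(size=5000)
n_items = 40
p = 1 / (1 + np.exp(-(ability[:, None] - rng.uniform(-1, 1, n_items))))
X = rng.binomial(1, p)                       # 5000 examinees x 40 items

half_a, half_b = X[:, 0::2], X[:, 1::2]      # odd/even item split
cut_full = 26                                # hypothetical cut on 40 items
cut_half = cut_full / 2                      # scaled to the 20-item halves

pass_a = half_a.sum(axis=1) >= cut_half
pass_b = half_b.sum(axis=1) >= cut_half
print(f"Half-form classification agreement: {(pass_a == pass_b).mean():.3f}")
```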
Schulz, E. Matthew; Wang, Lin – 2001
In this study, items were drawn from a full-length test of 30 items to construct shorter tests for making accurate pass/fail classifications at a specific criterion point on the latent ability metric. A three-parameter item response theory (IRT) framework was used. The criterion point on the latent ability…
Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading
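A sketch of one way to build such a short form, assuming simulated 3PL item parameters: rank items by Fisher information at the criterion point and keep the most informative ones. The selection rule is a standard heuristic and may not match the study's exact procedure.

```python
import numpy as np

# Select the items most informative at the pass/fail criterion point
# under a three-parameter logistic (3PL) model. Parameters are simulated.
rng = np.random.default_rng(6)
n_items, theta_cut, k_short = 30, 0.0, 10
a = rng.uniform(0.8, 2.0, n_items)      # discrimination
b = rng.uniform(-2.0, 2.0, n_items)     # difficulty
c = rng.uniform(0.1, 0.25, n_items)     # pseudo-guessing

def info_3pl(theta, a, b, c):
    """Fisher information of a 3PL item at ability theta."""
    p = c + (1 - c) / (1 + np.exp(-a * (theta - b)))
    return a**2 * ((p - c) / (1 - c))**2 * (1 - p) / p

info = info_3pl(theta_cut, a, b, c)
chosen = np.argsort(info)[::-1][:k_short]
print(f"Items selected for the short form: {np.sort(chosen)}")
print(f"Short-form information at the cut: {info[chosen].sum():.2f} "
      f"(full form: {info.sum():.2f})")
```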
Peer reviewed
Spray, Judith A.; Reckase, Mark D. – Journal of Educational and Behavioral Statistics, 1996
Two procedures for classifying examinees into categories, one based on the sequential probability ratio test (SPRT) and the other on sequential Bayes methodology, were compared to determine which required fewer items for classification. Results showed that the SPRT procedure requires fewer items to achieve the same accuracy level. (SLD)
Descriptors: Ability, Bayesian Statistics, Classification, Comparative Analysis
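A minimal Wald SPRT classifier of the kind compared here, with illustrative success probabilities for masters (p1) and non-masters (p0) and nominal error rates: after each item the log-likelihood ratio is updated, and testing stops as soon as a boundary is crossed.

```python
import math

# Sequential probability ratio test for mastery classification.
# p0/p1 and the error rates are illustrative choices.
def sprt_classify(responses, p0=0.5, p1=0.75, alpha=0.05, beta=0.05):
    upper = math.log((1 - beta) / alpha)     # decide "master"
    lower = math.log(beta / (1 - alpha))     # decide "non-master"
    llr = 0.0
    for n, x in enumerate(responses, start=1):
        llr += math.log((p1 if x else 1 - p1) / (p0 if x else 1 - p0))
        if llr >= upper:
            return "master", n
        if llr <= lower:
            return "non-master", n
    return "undecided", len(responses)

decision, items_used = sprt_classify([1, 1, 0, 1, 1, 1, 1, 1, 1, 1])
print(decision, "after", items_used, "items")
```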
Avner, R. A. – 1970
This report compares maximum linear prediction, maximum total correct classifications for a group, and maximum probability of correct classification for an individual as objective criteria for univariate grading scales. Since the goals of valid prediction and valid classification lead to conflicting criteria, it is possible that a compromise…
Descriptors: Achievement Rating, Classification, Evaluation, Evaluation Criteria
DeMauro, Gerald E. – 1989
It is difficult to estimate the percentage of examinees who pass National Teacher Examinations (NTE) tests because many users of the tests require examinees to pass different combinations of tests or use different passing scores for each test. This study first develops a taxonomy of state NTE requirements and then computes passing rates…
Descriptors: Blacks, Classification, Cutting Scores, Ethnicity
Sykes, Robert C.; And Others – 1992
A part-form methodology was used to study the effect of varying degrees of multidimensionality on the consistency of pass/fail classification decisions obtained from simulated unidimensional item response theory (IRT)-based licensure examinations. A control on the degree of form multidimensionality permitted an assessment throughout the range of…
Descriptors: Classification, Comparative Testing, Computer Simulation, Decision Making
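A compact sketch of the part-form logic under assumed conditions (two correlated ability dimensions, with forms weighted toward different dimensions): pass/fail agreement between the part-forms drops as the forms diverge in what they measure. The design and numbers are illustrative, not the study's actual conditions.

```python
import numpy as np

# Simulate responses driven by two correlated ability dimensions, score
# two part-forms that load on the dimensions differently, and check
# pass/fail agreement between them.
rng = np.random.default_rng(7)
n = 10_000
rho = 0.6                                   # correlation between dimensions
theta = rng.multivariate_normal([0, 0], [[1, rho], [rho, 1]], n)

def simulate_form(theta, w, n_items=20):
    """Responses to items loading on dim 1 with weight w, dim 2 with 1-w."""
    composite = w * theta[:, 0] + (1 - w) * theta[:, 1]
    b = rng.uniform(-1, 1, n_items)
    p = 1 / (1 + np.exp(-(composite[:, None] - b)))
    return rng.binomial(1, p).sum(axis=1)

cut = 12                                    # hypothetical cut on 20 items
pass_a = simulate_form(theta, w=0.8) >= cut   # form leaning on dimension 1
pass_b = simulate_form(theta, w=0.2) >= cut   # form leaning on dimension 2
print(f"Pass/fail consistency across part-forms: {(pass_a == pass_b).mean():.3f}")
```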