ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	11

Descriptor

Probability	15
Test Items	15
Item Response Theory	5
Models	4
Statistical Analysis	4
College Entrance Examinations	3
Computation	3
Equated Scores	3
Foreign Countries	3
Maximum Likelihood Statistics	3
Adaptive Testing	2
Admission (School)	2
Difficulty Level	2
Educational Assessment	2
Educational Testing	2
Evaluation Research	2
Item Analysis	2
Law Schools	2
Scores	2
Statistical Distributions	2
Test Bias	2
Ability	1
Achievement Tests	1
Bayesian Statistics	1
Cheating	1
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	15
Reports - Evaluative	6
Reports - Research	6
Reports - Descriptive	3

Education Level

Higher Education	3
Postsecondary Education	3
Grade 8	1

Audience

Location

Belgium	1
Netherlands (Amsterdam)	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

Law School Admission Test	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Direct link

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Direct link

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

Kernel Equating Using Propensity Scores for Nonequivalent Groups

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019

When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…

Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations

The Prevalence and Implications of Slipping on Low-Stakes, Large-Scale Assessments

Peer reviewed

Direct link

Culpepper, Steven Andrew – Journal of Educational and Behavioral Statistics, 2017

In the absence of clear incentives, achievement tests may be subject to the effect of slipping where item response functions have upper asymptotes below one. Slipping reduces score precision for higher latent scores and distorts test developers' understandings of item and test information. A multidimensional four-parameter normal ogive model was…

Descriptors: Measurement, Achievement Tests, Item Response Theory, National Competency Tests

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Modeling Answer Changes on Test Items

Peer reviewed

Direct link

van der Linden, Wim J.; Jeon, Minjeong – Journal of Educational and Behavioral Statistics, 2012

The probability of test takers changing answers upon review of their initial choices is modeled. The primary purpose of the model is to check erasures on answer sheets recorded by an optical scanner for numbers and patterns that may be indicative of irregular behavior, such as teachers or school administrators changing answer sheets after their…

Descriptors: Probability, Models, Test Items, Educational Testing

Robust Estimation of Latent Ability in Item Response Models

Peer reviewed

Direct link

Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011

Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…

Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics

Beta Regression Finite Mixture Models of Polarization and Priming

Peer reviewed

Direct link

Smithson, Michael; Merkle, Edgar C.; Verkuilen, Jay – Journal of Educational and Behavioral Statistics, 2011

This paper describes the application of finite-mixture general linear models based on the beta distribution to modeling response styles, polarization, anchoring, and priming effects in probability judgments. These models, in turn, enhance our capacity for explicitly testing models and theories regarding the aforementioned phenomena. The mixture…

Descriptors: Priming, Research Methodology, Probability, Item Response Theory

Conditional Item-Exposure Control in Adaptive Testing Using Item-Ineligibility Probabilities

Peer reviewed

Direct link

van der Linden, Wim J.; Veldkamp, Bernard P. – Journal of Educational and Behavioral Statistics, 2007

Two conditional versions of the exposure-control method with item-ineligibility constraints for adaptive testing in van der Linden and Veldkamp (2004) are presented. The first version is for unconstrained item selection, the second for item selection with content constraints imposed by the shadow-test approach. In both versions, the exposure rates…

Descriptors: Law Schools, Adaptive Testing, Item Analysis, Probability

Constraining Item Exposure in Computerized Adaptive Testing with Shadow Tests

Peer reviewed

Direct link

van der Linden, Wim J.; Veldkamp, Bernard P. – Journal of Educational and Behavioral Statistics, 2004

Item-exposure control in computerized adaptive testing is implemented by imposing item-ineligibility constraints on the assembly process of the shadow tests. The method resembles Sympson and Hetter's (1985) method of item-exposure control in that the decisions to impose the constraints are probabilistic. The method does not, however, require…

Descriptors: Probability, Law Schools, Admission (School), Adaptive Testing

Calibrating Item Families and Summarizing the Results Using Family Expected Response Functions

Peer reviewed

Direct link

Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M. – Journal of Educational and Behavioral Statistics, 2003

Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…

Descriptors: Test Items, Markov Processes, Educational Testing, Probability

Identifying Impediments to Learning Probability and Statistics from an Assessment of Instructional Software.

Peer reviewed

Cohen, Steve; And Others – Journal of Educational and Behavioral Statistics, 1996

A detailed multisite evaluation of instructional software, the ConStatS package, designed to help students conceptualize introductory probability and statistics, yielded patterns of error on several assessment items. Results from 739 college students demonstrated 10 misconceptions that may be among the most difficult concepts to teach. (SLD)

Descriptors: College Students, Computer Assisted Instruction, Computer Software Evaluation, Educational Assessment

Assessing and Explaining Differential Item Functioning Using Logistic Mixed Models

Peer reviewed

Direct link

Van den Noortgate, Wim; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2005

Although differential item functioning (DIF) theory traditionally focuses on the behavior of individual items in two (or a few) specific groups, in educational measurement contexts, it is often plausible to regard the set of items as a random sample from a broader category. This article presents logistic mixed models that can be used to model…

Descriptors: Test Bias, Item Response Theory, Educational Assessment, Mathematical Models

van der Linden, Wim J.	4
Johnson, Matthew S.	2
Sinharay, Sandip	2
Veldkamp, Bernard P.	2
Andreas Kurz	1
Can Gürer	1
Clemens Draxler	1
Cohen, Steve	1
Culpepper, Steven Andrew	1
De Boeck, Paul	1
Jan Philipp Nolte	1
Jeon, Minjeong	1
Kuijpers, Renske E.	1
Longford, Nicholas T.	1
Merkle, Edgar C.	1
Molenaar, Dylan	1
Schuster, Christof	1
Smithson, Michael	1
Van den Noortgate, Wim	1
Verkuilen, Jay	1
Visser, Ingmar	1
Wallin, Gabriel	1
Wiberg, Marie	1
Williamson, David M.	1
More ▼