ERIC - Search Results

Showing 1 to 15 of 37 results

Pretest Item Calibration in Computerized Multistage Adaptive Testing

Peer reviewed

Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023

The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…

Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing

Multiple-Group Joint Modeling of Item Responses, Response Times, and Action Counts with the Conway-Maxwell-Poisson Distribution

Peer reviewed

Qiao, Xin; Jiao, Hong; He, Qiwei – Journal of Educational Measurement, 2023

Multiple group modeling is one of the methods to address the measurement noninvariance issue. Traditional studies on multiple group modeling have mainly focused on item responses. In computer-based assessments, joint modeling of response times and action counts with item responses helps estimate the latent speed and action levels in addition to…

Descriptors: Multivariate Analysis, Models, Item Response Theory, Statistical Distributions

A New Statistic for Detection of Aberrant Answer Changes

Peer reviewed

Sinharay, Sandip; Duong, Minh Q.; Wood, Scott W. – Journal of Educational Measurement, 2017

As noted by Fremer and Olson, analysis of answer changes is often used to investigate testing irregularities because the analysis is readily performed and has proven its value in practice. Researchers such as Belov, Sinharay and Johnson, van der Linden and Jeon, van der Linden and Lewis, and Wollack, Cohen, and Eckerly have suggested several…

Descriptors: Identification, Statistics, Change, Tests

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2018

Response-time models are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis of no misfit is proved…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

Dual-Objective Item Selection Criteria in Cognitive Diagnostic Computerized Adaptive Testing

Peer reviewed

Kang, Hyeon-Ah; Zhang, Susu; Chang, Hua-Hua – Journal of Educational Measurement, 2017

The development of cognitive diagnostic-computerized adaptive testing (CD-CAT) has provided a new perspective for gaining information about examinees' mastery on a set of cognitive attributes. This study proposes a new item selection method within the framework of dual-objective CD-CAT that simultaneously addresses examinees' attribute mastery…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Test Items

On Attempting to Do What Lord Said Was Impossible: Commentary on van der Linden's "Some Conceptual Issues in Observed-Score Equating"

Peer reviewed

Dorans, Neil J. – Journal of Educational Measurement, 2013

van der Linden (this issue) uses words differently than Holland and Dorans. This difference in language usage is a source of some confusion in van der Linden's critique of what he calls equipercentile equating. I address these differences in language. van der Linden maintains that there are only two requirements for score equating. I maintain…

Descriptors: Equated Scores, Language Usage, Statistical Distributions

IRT-Estimated Reliability for Tests Containing Mixed Item Formats

Peer reviewed

Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014

As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…

Descriptors: Item Response Theory, Reliability, Models, Computation

Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

Peer reviewed

Kim, Seonghoon – Journal of Educational Measurement, 2013

With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

Descriptors: Item Response Theory, Scores, Computation, Mathematics
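The recursion mentioned in the abstract above can be illustrated with a minimal sketch of the original Lord-Wingersky algorithm for dichotomous (0/1) items. It is not the generalized real-number algorithm that Kim's article presents. The 2PL response function, the item parameters, and the function names are assumptions chosen for illustration only.

```python
import math


def irt_2pl(theta, a, b):
    """Probability of a correct response under a 2PL model (an assumed model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def lord_wingersky(probs):
    """Return [P(score = x | theta) for x = 0..n] from per-item success probabilities."""
    dist = [1.0]  # before any item is added, the score is 0 with probability 1
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for x, f in enumerate(dist):
            new[x] += f * (1.0 - p)  # item answered incorrectly: score stays at x
            new[x + 1] += f * p      # item answered correctly: score moves to x + 1
        dist = new
    return dist


# Hypothetical (a, b) item parameters, evaluated at proficiency theta = 0.
items = [(1.0, -0.5), (1.2, 0.0), (0.8, 0.7)]
probs = [irt_2pl(0.0, a, b) for a, b in items]
score_dist = lord_wingersky(probs)
```

Each pass adds one item and updates the score distribution in place of enumerating all 2^n response patterns, so the cost grows quadratically rather than exponentially with test length.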

Situations Where It Is Appropriate to Use Frequency Estimation Equipercentile Equating

Peer reviewed

Guo, Hongwen; Oh, Hyeonjoo J.; Eignor, Daniel – Journal of Educational Measurement, 2013

In operational equating situations, frequency estimation equipercentile equating is considered only when the old and new groups have similar abilities. The frequency estimation assumptions are investigated in this study under various situations from both the levels of theoretical interest and practical use. It shows that frequency estimation…

Descriptors: Equated Scores, Computation, Statistical Analysis, Test Items

Local Equating Using the Rasch Model, the OPLM, and the 2PL IRT Model--or--What Is It Anyway if the Model Captures Everything There Is to Know about the Test Takers?

Peer reviewed

von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013

Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…

Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores

The Effects of Selection Strategies for Bivariate Loglinear Smoothing Models on NEAT Equating Functions

Peer reviewed

Moses, Tim; Holland, Paul W. – Journal of Educational Measurement, 2010

In this study, eight statistical strategies were evaluated for selecting the parameterizations of loglinear models for smoothing the bivariate test score distributions used in nonequivalent groups with anchor test (NEAT) equating. Four of the strategies were based on significance tests of chi-square statistics (Likelihood Ratio, Pearson,…

Descriptors: Equated Scores, Models, Statistical Distributions, Statistical Analysis

The Reliability of Difference Scores in Populations and Samples

Peer reviewed

Zimmerman, Donald W. – Journal of Educational Measurement, 2009

This study was an investigation of the relation between the reliability of difference scores, considered as a parameter characterizing a population of examinees, and the reliability estimates obtained from random samples from the population. The parameters in familiar equations for the reliability of difference scores were redefined in such a way…

Descriptors: Computer Simulation, Reliability, Population Groups, Scores

Selection Strategies for Univariate Loglinear Smoothing Models and Their Effect on Equating Function Accuracy

Peer reviewed

Moses, Tim; Holland, Paul W. – Journal of Educational Measurement, 2009

In this study, we compared 12 statistical strategies proposed for selecting loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. The major focus was on evaluating the effects of the selection strategies on equating function accuracy. Selection strategies' influence…

Descriptors: Equated Scores, Selection, Statistical Analysis, Models

Empirical Validation of DIMTEST on Nonnormal Ability Distributions.

Peer reviewed

Nandakumar, Ratna; Yu, Feng – Journal of Educational Measurement, 1996

DIMTEST is a nonparametric statistical test procedure for assessing unidimensionality of binary item response data that uses the T-statistic of W. F. Stout (1987). This study investigates the performance of the T-statistic with respect to different shapes of ability distributions and confirms its nonparametric nature. (SLD)

Descriptors: Ability, Nonparametric Statistics, Statistical Distributions, Validity

An Empirical Bayes Approach to Mantel-Haenszel DIF Analysis.

Peer reviewed

Zwick, Rebecca; Thayer, Dorothy T.; Lewis, Charles – Journal of Educational Measurement, 1999

Developed an empirical Bayes enhancement to Mantel-Haenszel (MH) analysis of differential item functioning (DIF) in which it is assumed that the MH statistics are normally distributed and that the prior distribution of underlying DIF parameters is also normal. (Author/SLD)

Descriptors: Bayesian Statistics, Item Bias, Statistical Distributions, Test Items
