ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	32

Descriptor

Measurement Techniques	98
Test Construction	21
Item Response Theory	20
Test Items	19
Models	18
Psychometrics	18
Scores	17
Test Reliability	15
Simulation	12
Testing Problems	12
Correlation	11
Test Validity	11
Testing	11
Comparative Analysis	10
Rating Scales	10
Statistical Analysis	10
Mathematical Models	9
Cognitive Measurement	8
Computation	8
Student Evaluation	8
Academic Achievement	7
Achievement Tests	7
Educational Assessment	7
Elementary Secondary Education	7
Error of Measurement	7
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	77
Reports - Research	38
Reports - Evaluative	19
Reports - Descriptive	10
Book/Product Reviews	5
Opinion Papers	4
Information Analyses	3
Speeches/Meeting Papers	3

Education Level

Secondary Education	3
Elementary Secondary Education	1
Middle Schools	1

Audience

Location

Australia	1
Jordan	1
New Jersey	1

Laws, Policies, & Programs

Assessments and Surveys

Metropolitan Achievement Tests	2
National Assessment of…	2
Program for International…	2
Graduate Record Examinations	1
Iowa Tests of Basic Skills	1
Peabody Picture Vocabulary…	1
SAT (College Admission Test)	1
Self Description Questionnaire	1

What Works Clearinghouse Rating

Showing 1 to 15 of 98 results Save | Export

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Psychometric Methods to Evaluate Measurement and Algorithmic Bias in Automated Scoring

Peer reviewed

Direct link

Johnson, Matthew S.; Liu, Xiang; McCaffrey, Daniel F. – Journal of Educational Measurement, 2022

With the increasing use of automated scores in operational testing settings comes the need to understand the ways in which they can yield biased and unfair results. In this paper, we provide a brief survey of some of the ways in which the predictive methods used in automated scoring can lead to biased, and thus unfair automated scores. After…

Descriptors: Psychometrics, Measurement Techniques, Bias, Automation

MSAEM Estimation for Confirmatory Multidimensional Four-Parameter Normal Ogive Models

Peer reviewed

Direct link

Jia Liu; Xiangbin Meng; Gongjun Xu; Wei Gao; Ningzhong Shi – Journal of Educational Measurement, 2024

In this paper, we develop a mixed stochastic approximation expectation-maximization (MSAEM) algorithm coupled with a Gibbs sampler to compute the marginalized maximum a posteriori estimate (MMAPE) of a confirmatory multidimensional four-parameter normal ogive (M4PNO) model. The proposed MSAEM algorithm not only has the computational advantages of…

Descriptors: Algorithms, Achievement Tests, Foreign Countries, International Assessment

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Direct link

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Calculating Conditional Reliability for Dynamic Measurement Model Capacity Estimates

Peer reviewed

Direct link

McNeish, Daniel; Dumas, Denis – Journal of Educational Measurement, 2018

Dynamic measurement modeling (DMM) is a recent framework for measuring developing constructs whose manifestation occurs after an assessment is administered (e.g., learning capacity). Empirical studies have suggested that DMM may improve consequential validity of test scores because DMM learning capacity estimates were shown to be much less related…

Descriptors: Measurement Techniques, Test Reliability, Accuracy, Computation

Comparing the Accuracy of Student Growth Measures

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020

Testing programs are often interested in using a student growth measure. This article presents analytic derivations of the accuracy of common student growth measures on both the raw scale of the test and the percentile rank scale in terms of the proportional reduction in mean squared error and the squared correlation between the estimator and…

Descriptors: Student Evaluation, Accuracy, Testing, Student Development

A Comparison of Procedures for Estimating Person Reliability Parameters in the Graded Response Model

Peer reviewed

Direct link

LaHuis, David M.; Bryant-Lees, Kinsey B.; Hakoyama, Shotaro; Barnes, Tyler; Wiemann, Andrea – Journal of Educational Measurement, 2018

Person reliability parameters (PRPs) model temporary changes in individuals' attribute level perceptions when responding to self-report items (higher levels of PRPs represent less fluctuation). PRPs could be useful in measuring careless responding and traitedness. However, it is unclear how well current procedures for estimating PRPs can recover…

Descriptors: Comparative Analysis, Reliability, Error of Measurement, Measurement Techniques

Standard Errors of IRT Parameter Scale Transformation Coefficients: Comparison of Bootstrap Method, Delta Method, and Multiple Imputation Method

Peer reviewed

Direct link

Zhang, Zhonghua; Zhao, Mingren – Journal of Educational Measurement, 2019

The present study evaluated the multiple imputation method, a procedure that is similar to the one suggested by Li and Lissitz (2004), and compared the performance of this method with that of the bootstrap method and the delta method in obtaining the standard errors for the estimates of the parameter scale transformation coefficients in item…

Descriptors: Item Response Theory, Error Patterns, Item Analysis, Simulation

Integrating Multiple Sources of Validity Evidence for an Assessment-Based Cognitive Model

Peer reviewed

Direct link

Langenfeld, Thomas; Thomas, Jay; Zhu, Rongchun; Morris, Carrie A. – Journal of Educational Measurement, 2020

An assessment of graphic literacy was developed by articulating and subsequently validating a skills-based cognitive model intended to substantiate the plausibility of score interpretations. Model validation involved use of multiple sources of evidence derived from large-scale field testing and cognitive labs studies. Data from large-scale field…

Descriptors: Evidence, Scores, Eye Movements, Psychometrics

Estimating the Accuracy of Relative Growth Measures Using Empirical Data

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020

The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…

Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods

Statistical Assessment of Estimated Transformations in Observed-Score Equating

Peer reviewed

Direct link

Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016

Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…

Descriptors: Statistical Analysis, Equated Scores, Scores, Models

A Standardized Generalized Dimensionality Discrepancy Measure and a Standardized Model-Based Covariance for Dimensionality Assessment for Multidimensional Models

Peer reviewed

Direct link

Levy, Roy; Xu, Yuning; Yel, Nedim; Svetina, Dubravka – Journal of Educational Measurement, 2015

The standardized generalized dimensionality discrepancy measure and the standardized model-based covariance are introduced as tools to critique dimensionality assumptions in multidimensional item response models. These tools are grounded in a covariance theory perspective and associated connections between dimensionality and local independence.…

Descriptors: Item Response Theory, Models, Measurement Techniques, Correlation

Assessment of Person Fit Using Resampling-Based Approaches

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2016

De la Torre and Deng suggested a resampling-based approach for person-fit assessment (PFA). The approach involves the use of the [math equation unavailable] statistic, a corrected expected a posteriori estimate of the examinee ability, and the Monte Carlo (MC) resampling method. The Type I error rate of the approach was closer to the nominal level…

Descriptors: Sampling, Research Methodology, Error Patterns, Monte Carlo Methods

Cross-Country Heterogeneity in Students' Reporting Behavior: The Use of the Anchoring Vignette Method

Peer reviewed

Direct link

Vonkova, Hana; Zamarro, Gema; Hitt, Collin – Journal of Educational Measurement, 2018

Self-reports are an indispensable source of information in education research but they are often affected by heterogeneity in reporting behavior. Failing to correct for this heterogeneity can lead to invalid comparisons across groups. The researchers use the parametric anchoring vignette method to correct for cross-country incomparability of…

Descriptors: Vignettes, Educational Research, Achievement Tests, Foreign Countries

Structured Constructs Models Based on Change-Point Analysis

Peer reviewed

Direct link

Shin, Hyo Jeong; Wilson, Mark; Choi, In-Hee – Journal of Educational Measurement, 2017

This study proposes a structured constructs model (SCM) to examine measurement in the context of a multidimensional learning progression (LP). The LP is assumed to have features that go beyond a typical multidimentional IRT model, in that there are hypothesized to be certain cross-dimensional linkages that correspond to requirements between the…

Descriptors: Middle School Students, Student Evaluation, Measurement Techniques, Learning Processes

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Bennett, Randy Elliot	3
Livingston, Samuel A.	3
McCaffrey, Daniel F.	3
Castellano, Katherine E.	2
Embretson, Susan E.	2
Gierl, Mark J.	2
Hambleton, Ronald K.	2
Kim, Sooyeon	2
Morley, Mary	2
Penfield, Randall D.	2
Shavelson, Richard J.	2
Sinharay, Sandip	2
Sirotnik, Kenneth A.	2
Wiberg, Marie	2
Wilson, Mark	2
van der Linden, Wim J.	2
Ackerman, Terry	1
Almond, Russell G.	1
Barcikowski, Robert S.	1
Barnes, Tyler	1
Bentler, Peter M.	1
Bergan, John R.	1
Braun, Henry I.	1
Brennan, Robert L.	1
More ▼