Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024
To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…
Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making
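The core IRTree idea described above can be illustrated with a minimal sketch. This is not the authors' model; it is a generic two-node tree for a 4-point item, assuming the substantive trait drives an agree/disagree node and a separate extreme-response-style trait drives an extreme/moderate node, with hypothetical logistic node parameters `b_dir` and `b_ext`:

```python
import math

def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))

def irtree_category_probs(theta, ers, b_dir, b_ext):
    """Category probabilities for a 4-point item under a simple IRTree:
    node 1 (response direction) is driven by the substantive trait theta,
    node 2 (extremity) by an extreme-response-style trait ers.
    Categories: 1 = strongly disagree, 2 = disagree, 3 = agree, 4 = strongly agree.
    Each category probability is the product of its branch probabilities."""
    p_agree = logistic(theta - b_dir)    # direction node
    p_extreme = logistic(ers - b_ext)    # extremity node
    return {
        1: (1 - p_agree) * p_extreme,
        2: (1 - p_agree) * (1 - p_extreme),
        3: p_agree * (1 - p_extreme),
        4: p_agree * p_extreme,
    }

probs = irtree_category_probs(theta=0.5, ers=-0.2, b_dir=0.0, b_ext=0.3)
```

Because the branch probabilities multiply along each path and the paths partition the categories, the four probabilities sum to one by construction.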
Nagy, Gabriel; Ulitzsch, Esther – Educational and Psychological Measurement, 2022
Disengaged item responses pose a threat to the validity of the results provided by large-scale assessments. Several procedures for identifying disengaged responses on the basis of observed response times have been suggested, and item response theory (IRT) models for response engagement have been proposed. We outline that response time-based…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Predictor Variables, Classification
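The simplest of the response-time-based procedures the abstract alludes to is a per-item time threshold, under which responses faster than the cutoff are flagged as disengaged. A minimal sketch, assuming hypothetical function names and an illustrative 3-second rule (real applications derive item-specific thresholds from the response-time distribution):

```python
def flag_disengaged(response_times, thresholds):
    """Flag each response whose time (in seconds) falls below the
    item-specific threshold as a candidate disengaged response."""
    return [t < thresholds[i] for i, t in enumerate(response_times)]

def engagement_rate(flags):
    """Proportion of responses NOT flagged as disengaged."""
    return 1.0 - sum(flags) / len(flags)

times = [12.4, 1.1, 8.0, 0.7, 15.2]
cutoffs = [3.0] * 5            # illustrative: a flat 3-second rule
flags = flag_disengaged(times, cutoffs)
```

Flagged responses can then be filtered out or, as in IRT models for response engagement, treated as arising from a separate (e.g., guessing) process.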
Luo, Yong – Educational and Psychological Measurement, 2018
Mplus is a powerful latent variable modeling software program that has become an increasingly popular choice for fitting complex item response theory models. In this short note, we demonstrate that the two-parameter logistic testlet model can be estimated as a constrained bifactor model in Mplus with three estimators encompassing limited- and…
Descriptors: Computer Software, Models, Statistical Analysis, Computation
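The equivalence the note exploits can be sketched numerically. In the two-parameter logistic testlet model, a person-by-testlet random effect enters the 2PL kernel with the same slope as the general trait, which is exactly a bifactor model with the specific-factor loading constrained equal to the general loading. A minimal sketch of the response function (illustrative parameter values, not from the paper):

```python
import math

def p_correct_testlet(theta, gamma, a, b):
    """2PL testlet model: P(correct) = logistic(a * (theta + gamma - b)),
    where gamma is the person's random effect for the item's testlet.
    Fixing the bifactor specific loading equal to the general loading 'a'
    yields this same kernel, i.e., a constrained bifactor model."""
    return 1.0 / (1.0 + math.exp(-a * (theta + gamma - b)))

# At theta = b with no testlet effect, the probability is exactly 0.5;
# a positive testlet effect shifts it upward.
p0 = p_correct_testlet(theta=0.0, gamma=0.0, a=1.2, b=0.0)
p1 = p_correct_testlet(theta=0.0, gamma=0.5, a=1.2, b=0.0)
```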
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We compare the classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019
Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…
Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures
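The standard way to turn several plausible values into one population-level estimate is Rubin's rules for multiply imputed data: average the per-PV point estimates, and combine within- and between-PV variance. A minimal sketch with illustrative numbers (five plausible values, as is typical in large-scale surveys):

```python
def pool_plausible_values(estimates, variances):
    """Pool per-plausible-value point estimates and their sampling
    variances with Rubin's rules for multiple imputation."""
    m = len(estimates)
    qbar = sum(estimates) / m                                # pooled estimate
    ubar = sum(variances) / m                                # within-PV variance
    b = sum((q - qbar) ** 2 for q in estimates) / (m - 1)    # between-PV variance
    total_var = ubar + (1 + 1 / m) * b
    return qbar, total_var

est = [0.52, 0.48, 0.50, 0.55, 0.45]       # illustrative per-PV estimates
var = [0.010, 0.012, 0.011, 0.009, 0.013]  # illustrative sampling variances
mean, total_var = pool_plausible_values(est, var)
```

Note that this pooling targets population-level statistics; as the abstract stresses, using plausible values as point estimates of individual latent scores is a different question and may require more than five.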
Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016
Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…
Descriptors: Educational Assessment, Coding, Automation, Responses
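A baseline for automatic short-text coding of the kind the abstract describes is bag-of-words similarity to previously coded responses. This sketch is not the authors' pipeline (which used open-source NLP components); it is a minimal nearest-prototype coder with hypothetical category names, assuming each code is represented by a pool of example responses:

```python
from collections import Counter
import math

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def code_response(text, prototypes):
    """Assign the code whose pooled example responses are most similar
    (bag-of-words cosine) to the new short text response."""
    vec = Counter(text.lower().split())
    scores = {code: cosine(vec, Counter(" ".join(examples).lower().split()))
              for code, examples in prototypes.items()}
    return max(scores, key=scores.get)

prototypes = {  # hypothetical coding scheme with two example pools
    "correct": ["the graph shows a linear increase", "values rise linearly"],
    "incorrect": ["the graph goes down", "values decrease over time"],
}
```

Real systems add stemming, stop-word removal, and a trained classifier on top, but this captures the baseline the statistical components build on.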
Raykov, Tenko; Marcoulides, George A.; Lee, Chun-Lung; Chang, Chi – Educational and Psychological Measurement, 2013
This note is concerned with a latent variable modeling approach for the study of differential item functioning in a multigroup setting. A multiple-testing procedure that can be used to evaluate group differences in response probabilities on individual items is discussed. The method is readily employed when the aim is also to locate possible…
Descriptors: Test Bias, Statistical Analysis, Models, Hypothesis Testing
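The multiple-testing step described above can be illustrated with a deliberately simplified screen: compare each item's response probability across two groups with a two-proportion z test and apply a Bonferroni correction over items. This is not the authors' latent variable procedure, just a sketch of the multiple-comparison logic with illustrative data:

```python
import math

def two_prop_z(p1, n1, p2, n2):
    """Two-sided p-value of a pooled two-proportion z test."""
    p = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

def flag_dif(items, alpha=0.05):
    """Bonferroni-corrected screen over k items; each item is a tuple
    (p_focal, n_focal, p_reference, n_reference). Returns flags."""
    k = len(items)
    return [two_prop_z(*it) < alpha / k for it in items]

items = [(0.70, 400, 0.72, 400),   # similar correct-response rates
         (0.50, 400, 0.68, 400)]   # large between-group gap
flags = flag_dif(items)
```

The Bonferroni division by k controls the familywise error rate across the item-level tests, which is the concern multigroup DIF procedures must address.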
Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015
Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effects in mixed-format scales and used bi-factor item response theory (IRT) models to…
Descriptors: Item Response Theory, Test Format, Language Usage, Test Items
Wang, Wen-Chung; Shih, Ching-Lin; Sun, Guo-Wei – Educational and Psychological Measurement, 2012
The DIF-free-then-DIF (DFTD) strategy consists of two steps: (a) select a set of items that are the most likely to be DIF-free and (b) assess the other items for DIF (differential item functioning) using the designated items as anchors. The rank-based method together with the computer software IRTLRDIF can select a set of DIF-free polytomous items…
Descriptors: Test Bias, Test Items, Item Response Theory, Evaluation Methods

Klieme, Eckhard; Stumpf, Heinrich – Educational and Psychological Measurement, 1991
A FORTRAN 77 computer program is presented to perform analyses of differential item performance in psychometric tests. The program performs the Mantel-Haenszel procedure and computes additional classical indices of differential item functioning (DIF) and associated effect size measures. (Author/SLD)
Descriptors: Chi Square, Computer Software, Effect Size, Estimation (Mathematics)
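The Mantel-Haenszel procedure the program implements has a compact form: within each ability stratum, build a 2x2 group-by-correctness table, then combine strata into a common odds ratio and a chi-square statistic. A minimal Python sketch (without the continuity correction; the ETS delta transform is included as the usual DIF effect size):

```python
import math

def mantel_haenszel(strata):
    """Mantel-Haenszel common odds ratio, chi-square (no continuity
    correction), and ETS delta for DIF. Each stratum is a 2x2 table
    (a, b, c, d) = (ref correct, ref wrong, focal correct, focal wrong)."""
    num = den = 0.0
    sum_a = sum_ea = sum_va = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        num += a * d / n
        den += b * c / n
        r1, r2 = a + b, c + d          # group (row) totals
        c1, c2 = a + c, b + d          # correct/wrong (column) totals
        sum_a += a
        sum_ea += r1 * c1 / n          # expected count under no DIF
        sum_va += r1 * r2 * c1 * c2 / (n * n * (n - 1))
    or_mh = num / den
    chi2 = (sum_a - sum_ea) ** 2 / sum_va
    delta = -2.35 * math.log(or_mh)    # ETS delta metric
    return or_mh, chi2, delta

strata = [(40, 10, 30, 20), (20, 30, 10, 40)]  # illustrative two strata
or_mh, chi2, delta = mantel_haenszel(strata)
```

An odds ratio above 1 (negative delta) indicates the item favors the reference group after conditioning on ability; the chi-square is referred to one degree of freedom.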

Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests
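The classical item statistics named above are straightforward to compute; the twist with optional items is that each item's statistics must be based only on the examinees who chose it. A minimal sketch (hypothetical function name; `None` marks an item the examinee did not select), using proportion correct for difficulty and the point-biserial correlation with total score for discrimination:

```python
import math

def item_stats(scores, totals):
    """Difficulty (proportion correct) and discrimination (point-biserial
    correlation with total score) for one optional item, computed only
    over examinees who attempted it. scores: 1/0 or None if not chosen."""
    pairs = [(s, t) for s, t in zip(scores, totals) if s is not None]
    n = len(pairs)
    p = sum(s for s, _ in pairs) / n                   # difficulty
    mean_t = sum(t for _, t in pairs) / n
    sd_t = math.sqrt(sum((t - mean_t) ** 2 for _, t in pairs) / n)
    sd_s = math.sqrt(p * (1 - p))
    cov = sum((s - p) * (t - mean_t) for s, t in pairs) / n
    r_pb = cov / (sd_s * sd_t) if sd_s and sd_t else 0.0
    return p, r_pb

scores = [1, 0, None, 1, 1, 0]          # third examinee skipped the item
totals = [18, 9, 14, 20, 16, 11]        # illustrative total test scores
p, r_pb = item_stats(scores, totals)
```

Because choosers are self-selected, such statistics can be biased relative to forced-response items, which is part of what motivates the alternative procedures the note considers.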