ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	50

Descriptor

Evaluation Methods	74
Models	74
Test Items	74
Item Response Theory	25
Psychometrics	23
Test Construction	18
Measurement Techniques	17
Item Analysis	13
Simulation	12
Classification	10
Comparative Analysis	10
Computer Assisted Testing	10
Diagnostic Tests	10
Educational Assessment	10
Goodness of Fit	10
Student Evaluation	10
Foreign Countries	9
Measurement	9
Probability	9
Testing	9
Computation	8
Scores	8
Test Bias	8
Evaluation Problems	7
Factor Analysis	7
More ▼

Publication Type

Journal Articles	58
Reports - Research	35
Reports - Evaluative	17
Reports - Descriptive	11
Opinion Papers	7
Speeches/Meeting Papers	6
Dissertations/Theses -…	2
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Non-Print Media	1
Numerical/Quantitative Data	1
Reference Materials - General	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	6
Elementary Secondary Education	4
Postsecondary Education	4
Secondary Education	3
Elementary Education	2
High Schools	2
Adult Education	1
Grade 8	1
Middle Schools	1

Audience

Practitioners	2
Researchers	2

Location

Germany	3
China	1
Italy	1
Malaysia	1
Netherlands	1
Oman	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	2
National Assessment of…	2
Graduate Record Examinations	1
Hidden Figures Test	1
Medical College Admission Test	1
North Carolina End of Course…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 74 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

On the Detection of Speededness in Data Despite Selective Responding Using Factor Analysis

Peer reviewed

Direct link

Schweizer, Karl; Wang, Tengfei; Ren, Xuezhu – Journal of Experimental Education, 2022

The essay reports two studies on confirmatory factor analysis of speeded data with an effect of selective responding. This response strategy leads test takers to choose their own working order instead of completing the items along with the given order. Methods for detecting speededness despite such a deviation from the given order are proposed and…

Descriptors: Factor Analysis, Response Style (Tests), Decision Making, Test Items

Two IRT Fixed Parameter Calibration Methods for the Bifactor Model

Peer reviewed

Direct link

Kim, Kyung Yong – Journal of Educational Measurement, 2020

New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…

Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation

Modeling Mediation in the Dynamic Assessment of Listening Ability from the Cognitive Diagnostic Perspective

Peer reviewed

Direct link

Meng, Yaru; Fu, Hua – Modern Language Journal, 2023

The distinguishing feature of dynamic assessment (DA) is the dialectical integration of assessment and instruction. However, how to design the targeted instruction or mediation has been relatively underexplored. To address this gap, this study proposes the attribute-based mediation model (AMM), an English-as-a-foreign-language listening mediation…

Descriptors: Evaluation Methods, Teaching Methods, Models, English (Second Language)

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

Modeling NAEP Test-Taking Behavior Using Educational Process Analysis

Peer reviewed
PDF on ERIC

Download full text

Patel, Nirmal; Sharma, Aditya; Shah, Tirth; Lomas, Derek – Journal of Educational Data Mining, 2021

Process Analysis is an emerging approach to discover meaningful knowledge from temporal educational data. The study presented in this paper shows how we used Process Analysis methods on the National Assessment of Educational Progress (NAEP) test data for modeling and predicting student test-taking behavior. Our process-oriented data exploration…

Descriptors: Learning Analytics, National Competency Tests, Evaluation Methods, Prediction

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Focusing on Interactions between Content and Cognition: A New Perspective on Gender Differences in Mathematical Sub-Competencies

Peer reviewed

Direct link

George, Ann Cathrice; Robitzsch, Alexander – Applied Measurement in Education, 2018

This article presents a new perspective on measuring gender differences in the large-scale assessment study Trends in International Science Study (TIMSS). The suggested empirical model is directly based on the theoretical competence model of the domain mathematics and thus includes the interaction between content and cognitive sub-competencies.…

Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Direct link

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

Innovative Assessments That Support Students' STEM Learning

Direct link

Thummaphan, Phonraphee – ProQuest LLC, 2017

The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…

Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics

CAT Model with Personalized Algorithm for Evaluation of Estimated Student Knowledge

Peer reviewed

Direct link

Andjelic, Svetlana; Cekerevac, Zoran – Education and Information Technologies, 2014

This article presents the original model of the computer adaptive testing and grade formation, based on scientifically recognized theories. The base of the model is a personalized algorithm for selection of questions depending on the accuracy of the answer to the previous question. The test is divided into three basic levels of difficulty, and the…

Descriptors: Computer Assisted Testing, Educational Technology, Grades (Scholastic), Test Construction

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Direct link

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Measurement:…	7
Applied Psychological…	6
Journal of Educational…	6
Educational and Psychological…	5
Journal of Educational and…	4
Psychometrika	3
Applied Measurement in…	2
Computers & Education	2
International Journal of…	2
ProQuest LLC	2
Studies in Educational…	2
CBE - Life Sciences Education	1
College Board	1
EURASIA Journal of…	1
Education and Information…	1
Educational Measurement:…	1
Educational Research and…	1
Educational Technology	1
Educational Testing Service	1
English Language Teaching	1
Instructional Science	1
Journal of Applied Testing…	1
Journal of Computers in…	1
Journal of Early Intervention	1
Journal of Educational Data…	1
More ▼

Nandakumar, Ratna	2
Revuelta, Javier	2
Rizavi, Saba	2
Robitzsch, Alexander	2
Wang, Wen-Chung	2
Way, Walter D.	2
de la Torre, Jimmy	2
Ackerman, Terry A.	1
Al Ajmi, Ahmed Ali Saleh	1
Albano, Anthony D.	1
Ali, Holi Ibrahim Holi	1
Andjelic, Svetlana	1
Ballou, Dale	1
Bartolucci, F.	1
Bejar, Isaac I.	1
Berger, Martijn P. F.	1
Berliner, David C.	1
Bertling, Jonas P.	1
Bhaskar, R.	1
Burling, Kelly	1
Carstensen, Claus H.	1
Cekerevac, Zoran	1
Choi, Youn-Jeng	1
Clough, Peter J.	1
More ▼