Publication Date
  In 2025: 1
  Since 2024: 3
  Since 2021 (last 5 years): 9
  Since 2016 (last 10 years): 19
  Since 2006 (last 20 years): 86
Descriptor
  Models: 153
  Test Items: 153
  Item Response Theory: 67
  Test Construction: 39
  Simulation: 24
  Computer Assisted Testing: 22
  Comparative Analysis: 20
  Difficulty Level: 20
  Evaluation Methods: 17
  Item Analysis: 17
  Psychometrics: 17
Publication Type
  Reports - Evaluative: 153
  Journal Articles: 111
  Speeches/Meeting Papers: 18
  Numerical/Quantitative Data: 3
  Tests/Questionnaires: 2
  Dissertations/Theses -…: 1
  Information Analyses: 1
  Opinion Papers: 1
Audience
  Practitioners: 1
Location
  Canada: 2
  New York: 2
  Taiwan: 2
  Argentina: 1
  California: 1
  China: 1
  Colombia: 1
  Malaysia: 1
  Netherlands: 1
  United Kingdom: 1
  Vermont: 1
Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023
When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…
Descriptors: Test Construction, Automation, Models, Test Items
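As orientation for readers unfamiliar with the approach the authors build on, the sketch below shows ATA via mixed integer linear programming with a speededness constraint, using the open-source PuLP solver. Item parameters, the information values, the test length, and the time limit are all invented for illustration; this is the general van der Linden-style setup, not the paper's revised method.

    import numpy as np
    import pulp

    rng = np.random.default_rng(0)
    n_items = 60
    # illustrative two-parameter lognormal RT parameters per item:
    # beta = time intensity, alpha = dispersion (inverse scale)
    beta = rng.normal(4.0, 0.4, n_items)
    alpha = rng.uniform(1.5, 2.5, n_items)
    info = rng.uniform(0.2, 1.0, n_items)  # stand-in Fisher information at a target theta

    # expected RT for a reference examinee with speed tau = 0:
    # if ln T ~ N(beta - tau, alpha^-2), then E[T] = exp(beta - tau + 1 / (2 alpha^2))
    exp_rt = np.exp(beta + 1.0 / (2.0 * alpha ** 2))

    prob = pulp.LpProblem("speededness_controlled_ATA", pulp.LpMaximize)
    x = [pulp.LpVariable(f"x{i}", cat="Binary") for i in range(n_items)]

    prob += pulp.lpSum(info[i] * x[i] for i in range(n_items))            # maximize information
    prob += pulp.lpSum(x) == 20                                           # fixed test length
    prob += pulp.lpSum(exp_rt[i] * x[i] for i in range(n_items)) <= 1800  # expected seconds cap

    prob.solve(pulp.PULP_CBC_CMD(msg=False))
    chosen = [i for i in range(n_items) if x[i].value() == 1]
    print(len(chosen), sum(exp_rt[i] for i in chosen))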
Michelle Cheong – Journal of Computer Assisted Learning, 2025
Background: Increasingly, students are using ChatGPT to assist them in learning and even in completing their assessments, raising concerns about academic integrity and the loss of critical thinking skills. Many articles suggest that educators redesign assessments to be more 'Generative-AI-resistant' and focus on assessing students on higher order…
Descriptors: Artificial Intelligence, Performance Based Assessment, Spreadsheets, Models
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown owing to advances in technology and its flexibility. Online examinations measure students' knowledge and skills. Traditional question papers suffer from inconsistent difficulty levels, arbitrary question allocation, and poor grading. The proposed model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
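The paper's own calibration model is truncated in the snippet; as a generic illustration of the idea, classical test theory estimates an item's difficulty from the proportion of students answering it correctly, and those estimates can then drive question selection. The data, bands, and mix below are invented.

    import numpy as np

    rng = np.random.default_rng(1)
    # simulated students x items response matrix (True = correct)
    responses = rng.random((300, 40)) < rng.uniform(0.3, 0.9, 40)

    p_values = responses.mean(axis=0)          # proportion correct = classical difficulty
    bands = np.digitize(p_values, [0.4, 0.7])  # 0 = hard, 1 = medium, 2 = easy

    # assemble a paper with a fixed difficulty mix, e.g. 5 hard / 10 medium / 5 easy
    paper = []
    for band, k in [(0, 5), (1, 10), (2, 5)]:
        pool = np.flatnonzero(bands == band)
        paper.extend(rng.choice(pool, size=min(k, pool.size), replace=False))
    print(sorted(paper))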
Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022
This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, the observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF method. The performance of the PPMC DIF method was…
Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction
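A compressed sketch of the PPMC logic under a Rasch model: replicate data from posterior draws, compute a discrepancy (here a focal-minus-reference outfit difference, one of several statistics the study considers), and flag items whose posterior predictive p-value is extreme. The "posterior draws" below are stand-ins rather than output of a real MCMC sampler.

    import numpy as np

    rng = np.random.default_rng(2)
    n, k = 400, 12
    group = rng.random(n) < 0.5  # focal vs. reference membership
    theta, b = rng.normal(0, 1, n), rng.normal(0, 1, k)

    def prob(th, bb):
        return 1 / (1 + np.exp(-(th[:, None] - bb[None, :])))

    def outfit(y, th, bb):
        p = prob(th, bb)
        return ((y - p) ** 2 / (p * (1 - p))).mean(axis=0)  # per item

    obs = (rng.random((n, k)) < prob(theta, b)).astype(float)

    ppp = np.zeros(k)
    draws = 500
    for _ in range(draws):
        th_d = theta + rng.normal(0, 0.15, n)  # stand-in posterior draws
        b_d = b + rng.normal(0, 0.05, k)
        rep = (rng.random((n, k)) < prob(th_d, b_d)).astype(float)
        d_obs = outfit(obs[group], th_d[group], b_d) - outfit(obs[~group], th_d[~group], b_d)
        d_rep = outfit(rep[group], th_d[group], b_d) - outfit(rep[~group], th_d[~group], b_d)
        ppp += d_rep >= d_obs
    ppp /= draws
    print(np.round(ppp, 2))  # p-values near 0 or 1 flag items for possible DIF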
Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023
This software review discusses the capabilities of Stata for item response theory modeling. The commands needed to fit the popular one-, two-, and three-parameter logistic models are discussed first. The procedure for testing the equality of discrimination parameters in the one-parameter model is then outlined. The commands for fitting…
Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis
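For reference, the three models the review covers (fit in Stata with irt 1pl, irt 2pl, and irt 3pl) take the standard logistic forms

    P_{2PL}(X_{ij} = 1 \mid \theta_j) = \frac{\exp\{a_i(\theta_j - b_i)\}}{1 + \exp\{a_i(\theta_j - b_i)\}},
    \qquad
    P_{3PL}(X_{ij} = 1 \mid \theta_j) = c_i + (1 - c_i)\, P_{2PL}(X_{ij} = 1 \mid \theta_j),

where the 1PL is the 2PL with a common discrimination a_i = a across items; the equality test mentioned in the review is a test of that constraint.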
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2020
This note cautions that a finding of a marked pseudo-guessing parameter for an item within a three-parameter item response model could be spurious in a population with substantial unobserved heterogeneity. A numerical example is presented wherein, in each of two classes, the two-parameter logistic model is used to generate the data on a…
Descriptors: Guessing (Tests), Item Response Theory, Test Items, Models
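An illustrative simulation (all parameter values invented, not taken from the note) of the mechanism: a two-class mixture of 2PL items, each with a zero lower asymptote, can produce a marginal item characteristic curve with a visible floor that a 3PL fit would absorb into a spurious pseudo-guessing parameter.

    import numpy as np

    rng = np.random.default_rng(3)

    def p2pl(theta, a, b):
        return 1 / (1 + np.exp(-a * (theta - b)))

    n = 50_000
    in_class1 = rng.random(n) < 0.5
    theta = rng.normal(0, 1, n)
    # each class follows a 2PL with no guessing; parameters differ by class
    p = np.where(in_class1, p2pl(theta, 1.5, -0.5), p2pl(theta, 0.3, 2.0))
    y = rng.random(n) < p

    # marginal proportion correct stays well above 0 even at low theta:
    edges = np.quantile(theta, np.linspace(0, 1, 9))
    for lo, hi in zip(edges[:-1], edges[1:]):
        m = (theta >= lo) & (theta < hi)
        print(f"theta in [{lo:+.2f}, {hi:+.2f}): P(correct) = {y[m].mean():.3f}")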
Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023
Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…
Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items
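The submission's regression-based treatment is truncated in the snippet; the toy example below (simulated data, invented questionnaire names) only illustrates why item-level missingness matters when forming composites: listwise deletion discards many cases that proration (averaging whichever items are observed) would retain, at the cost of assuming the items are exchangeable.

    import numpy as np
    import pandas as pd

    rng = np.random.default_rng(4)
    items = pd.DataFrame(rng.normal(3, 1, (200, 5)),
                         columns=[f"q{i}" for i in range(1, 6)])
    items = items.mask(rng.random(items.shape) < 0.15)  # 15% item-level missingness

    prorated = items.mean(axis=1, skipna=True)  # composite from observed items only
    complete = items.dropna()
    print(f"complete cases: {len(complete)} / {len(items)}")
    print(f"prorated composites available: {prorated.notna().sum()}")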
The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article proposes three…
Descriptors: Reliability, Probability, Skill Development, Classification
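The article's specific estimators are truncated in the snippet. As general background, for a binary mastery indicator \alpha with posterior mean \hat{p}_j for examinee j, a classical-test-theory-style reliability is the ratio of true-score variance to total variance; one generic construction (not necessarily the article's) is

    \rho = \frac{\operatorname{Var}\big(E[\alpha \mid \mathbf{X}]\big)}{\operatorname{Var}(\alpha)}
         = \frac{\operatorname{Var}(\hat{p})}{\bar{p}\,(1 - \bar{p})},
    \qquad \bar{p} = E[\hat{p}],

since \alpha is Bernoulli with base rate \bar{p}.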
Kim, Kyung Yong – Journal of Educational Measurement, 2020
New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…
Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation
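A stripped-down illustration of the fixed parameter calibration idea (not the two specific methods the article details, which are truncated here): anchor item parameters stay fixed, provisional abilities are estimated from the anchors, and the new item's 2PL parameters are then fit against those abilities, which places the new item on the anchors' scale. All data and parameters are simulated.

    import numpy as np
    from scipy.optimize import minimize

    rng = np.random.default_rng(5)
    n, k = 1000, 10
    theta = rng.normal(0, 1, n)
    a_anchor = rng.uniform(0.8, 2.0, k)
    b_anchor = rng.normal(0, 1, k)

    y_anchor = (rng.random((n, k)) <
                1 / (1 + np.exp(-a_anchor * (theta[:, None] - b_anchor)))).astype(float)
    a_new, b_new = 1.2, 0.3  # true parameters of the new item, to be recovered
    y_new = (rng.random(n) < 1 / (1 + np.exp(-a_new * (theta - b_new)))).astype(float)

    # step 1: MAP ability estimate per person from the anchors (parameters fixed)
    def theta_map(y_i):
        def nll(t):
            t = t[0]
            p = np.clip(1 / (1 + np.exp(-a_anchor * (t - b_anchor))), 1e-9, 1 - 1e-9)
            return -(y_i * np.log(p) + (1 - y_i) * np.log(1 - p)).sum() + 0.5 * t ** 2
        return minimize(nll, [0.0]).x[0]

    th_hat = np.array([theta_map(y_anchor[i]) for i in range(n)])

    # step 2: fit the new item's 2PL against the fixed-scale abilities
    def item_nll(params):
        a, b = params
        p = np.clip(1 / (1 + np.exp(-a * (th_hat - b))), 1e-9, 1 - 1e-9)
        return -(y_new * np.log(p) + (1 - y_new) * np.log(1 - p)).sum()

    est = minimize(item_nll, x0=[1.0, 0.0]).x
    print(f"estimated a = {est[0]:.2f}, b = {est[1]:.2f}")  # roughly recovers (1.2, 0.3)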
Gao, Xuliang; Ma, Wenchao; Wang, Daxun; Cai, Yan; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2021
This article proposes a class of cognitive diagnosis models (CDMs) for polytomously scored items with different link functions. Many existing polytomous CDMs can be considered special cases of the proposed class of polytomous CDMs. Simulation studies were carried out to investigate the feasibility of the proposed CDMs and the performance of…
Descriptors: Cognitive Measurement, Models, Test Items, Scoring
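The exact specification is truncated in the snippet; schematically, models of this kind relate the probability of scoring in or above category k to the attribute profile through a chosen link, e.g.

    g\big(P(X_{ij} \ge k \mid \boldsymbol{\alpha}_j)\big)
      = \lambda_{i,k,0} + \boldsymbol{\lambda}_{i,k}^{\top}\, \boldsymbol{h}(\boldsymbol{\alpha}_j, \boldsymbol{q}_i),

where g may be the logit, probit, or log link, \boldsymbol{\alpha}_j is the binary attribute profile, \boldsymbol{q}_i the item's Q-matrix row, and \boldsymbol{h} a vector of attribute main effects and interactions. This is a generic template in the spirit of the proposed class, not its exact form.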
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
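For orientation, a standard SDT choice model for an m-alternative MC item takes the correct response's perceived strength to exceed that of all m - 1 distractors,

    P(\text{correct}) = \int_{-\infty}^{\infty} \phi(x - d_i)\, \Phi(x)^{m-1}\, dx,

with d_i the item's detection parameter, while an OE item might follow a 2PL IRT model, P(X = 1 \mid \theta) = \operatorname{logistic}\!\big(a_i(\theta - b_i)\big). These are the standard forms of the two model families, given for background; they are not necessarily the article's exact parameterization.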
Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020
More than three decades after their introduction, diagnostic classification models (DCMs) do not seem to have been implemented in educational systems for the purposes for which they were devised. Most DCM research is either methodological, aimed at model development and refinement, or involves retrofitting DCMs to existing nondiagnostic tests and, in the latter case, basically…
Descriptors: Classification, Models, Diagnostic Tests, Test Construction
Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021
This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…
Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items
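The mixture component is truncated in the snippet; the IRTree backbone itself can be illustrated with a 4-point Likert item, where the observed category is decomposed into sequential binary pseudo-items, e.g. a direction node followed by an extremity node:

    P(X = \text{strongly agree}) = P(\text{agree node} = 1)\; P(\text{extreme node} = 1 \mid \text{agree}),

with each node governed by its own IRT model, so that extreme response style loads on the extremity nodes. The mixture extension then allows different respondents to follow different node processes.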
Carney, Michele B.; Cavey, Laurie; Hughes, Gwyneth – Elementary School Journal, 2017
This article illustrates an argument-based approach to presenting validity evidence for assessment items intended to measure a complex construct. Our focus is developing a measure of teachers' ability to analyze and respond to students' mathematical thinking for the purpose of program evaluation. Our validity argument consists of claims addressing…
Descriptors: Mathematics Instruction, Mathematical Logic, Thinking Skills, Evidence
Russell, Michael – Journal of Applied Testing Technology, 2016
Interest in and use of technology-enhanced items have increased over the past decade. Given the additional time required to administer many technology-enhanced items and the increased expense of developing them, it is important for testing programs to consider the utility of technology-enhanced items. The Technology-Enhanced Item Utility…
Descriptors: Test Items, Computer Assisted Testing, Models, Fidelity