ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	32

Descriptor

Comparative Analysis	41
Item Response Theory	41
Probability	41
Models	15
Simulation	12
Test Items	11
Scores	9
Computer Assisted Testing	7
Monte Carlo Methods	7
Bayesian Statistics	6
Difficulty Level	6
Computation	5
Equations (Mathematics)	5
Foreign Countries	5
Statistical Analysis	5
Ability	4
Adaptive Testing	4
Classification	4
Correlation	4
Factor Analysis	4
Guessing (Tests)	4
Mathematical Models	4
Multiple Choice Tests	4
Sample Size	4
Scoring	4
More ▼

Publication Type

Journal Articles	31
Reports - Research	22
Reports - Evaluative	14
Speeches/Meeting Papers	8
Reports - Descriptive	3
Dissertations/Theses -…	2
Opinion Papers	1

Education Level

Elementary Education	5
Higher Education	4
Early Childhood Education	2
Junior High Schools	2
Secondary Education	2
Elementary Secondary Education	1
Grade 11	1
Grade 12	1
Grade 4	1
Grade 6	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Kindergarten	1
Middle Schools	1
Postsecondary Education	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Location

Australia	1
Brazil	1
Canada	1
Taiwan	1
Turkey	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Child Behavior Checklist	1
Graduate Record Examinations	1
Raven Progressive Matrices	1

What Works Clearinghouse Rating

Showing 1 to 15 of 41 results Save | Export

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Is It Worthy to Take Account of the "Guessing" in the Performance of the Raven Test? Calling for the Principle of Parsimony for Test Validation

Peer reviewed

Direct link

Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021

The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…

Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Adjacent-Categories Mokken Models for Rater-Mediated Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Educational and Psychological Measurement, 2017

Molenaar extended Mokken's original probabilistic-nonparametric scaling models for use with polytomous data. These polytomous extensions of Mokken's original scaling procedure have facilitated the use of Mokken scale analysis as an approach to exploring fundamental measurement properties across a variety of domains in which polytomous ratings are…

Descriptors: Nonparametric Statistics, Scaling, Models, Item Response Theory

The Predictiveness of PFA Is Improved by Incorporating the Learner's Correct Response Time Fluctuation

Peer reviewed
PDF on ERIC

Download full text

Chu, Wei; Pavlik, Philip I., Jr. – International Educational Data Mining Society, 2023

In adaptive learning systems, various models are employed to obtain the optimal learning schedule and review for a specific learner. Models of learning are used to estimate the learner's current recall probability by incorporating features or predictors proposed by psychological theory or empirically relevant to learners' performance. Logistic…

Descriptors: Reaction Time, Accuracy, Models, Predictor Variables

Does Matching Quality Matter in Mode Comparison Studies?

Peer reviewed

Direct link

Zeng, Ji; Yin, Ping; Shedden, Kerby A. – Educational and Psychological Measurement, 2015

This article provides a brief overview and comparison of three matching approaches in forming comparable groups for a study comparing test administration modes (i.e., computer-based tests [CBT] and paper-and-pencil tests [PPT]): (a) a propensity score matching approach proposed in this article, (b) the propensity score matching approach used by…

Descriptors: Comparative Analysis, Computer Assisted Testing, Probability, Classification

Using the Stan Program for Bayesian Item Response Theory

Peer reviewed

Direct link

Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018

Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…

Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

Tracking with (Un)certainty

Peer reviewed
PDF on ERIC

Download full text

Hofman, Abe D.; Brinkhuis, Matthieu J. S.; Bolsinova, Maria; Klaiber, Jonathan; Maris, Gunter; van der Maas, Han L. J. – Journal of Intelligence, 2020

One of the highest ambitions in educational technology is the move towards personalized learning. To this end, computerized adaptive learning (CAL) systems are developed. A popular method to track the development of student ability and item difficulty, in CAL systems, is the Elo Rating System (ERS). The ERS allows for dynamic model parameters by…

Descriptors: Teaching Methods, Computer Assisted Instruction, Difficulty Level, Individualized Instruction

IRT-Based Adaptive Hints to Scaffold Learning in Programming

Peer reviewed

Direct link

Ueno, Maomi; Miyazawa, Yoshimitsu – IEEE Transactions on Learning Technologies, 2018

Over the past few decades, many studies conducted in the field of learning science have described that scaffolding plays an important role in human learning. To scaffold a learner efficiently, a teacher should predict how much support a learner must have to complete tasks and then decide the optimal degree of assistance to support the learner's…

Descriptors: Scaffolding (Teaching Technique), Prediction, Probability, Comparative Analysis

Semiparametric Item Response Functions in the Context of Guessing

Peer reviewed

Direct link

Falk, Carl F.; Cai, Li – Journal of Educational Measurement, 2016

We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…

Descriptors: Item Response Theory, Guessing (Tests), Mathematics Tests, Simulation

An Algorithm to Improve Test Answer Copying Detection Using the Omega Statistic

Peer reviewed

Direct link

Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017

The omega (?) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…

Descriptors: Cheating, Test Items, Mathematics, Statistics

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

An Exploratory Analysis of Differential Item Functioning and Its Possible Sources in a Higher Education Admissions Context

Peer reviewed

Direct link

Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018

We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify its possible sources by item type, content, and wording. DIF was…

Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory

On the Performance Characteristics of Latent-Factor and Knowledge Tracing Models

Download full text

Klingler, Severin; Käser, Tanja; Solenthaler, Barbara; Gross, Markus – International Educational Data Mining Society, 2015

Modeling student knowledge is a fundamental task of an intelligent tutoring system. A popular approach for modeling the acquisition of knowledge is Bayesian Knowledge Tracing (BKT). Various extensions to the original BKT model have been proposed, among them two novel models that unify BKT and Item Response Theory (IRT). Latent Factor Knowledge…

Descriptors: Intelligent Tutoring Systems, Knowledge Level, Item Response Theory, Prediction

Comparing Student Performance on Paper-and-Pencil and Computer-Based-Tests

Peer reviewed
PDF on ERIC

Download full text

Hardcastle, Joseph; Herrmann-Abell, Cari F.; DeBoer, George E. – Grantee Submission, 2017

Can student performance on computer-based tests (CBT) and paper-and-pencil tests (PPT) be considered equivalent measures of student knowledge? States and school districts are grappling with this question, and although studies addressing this question are growing, additional research is needed. We report on the performance of students who took…

Descriptors: Academic Achievement, Computer Assisted Testing, Comparative Analysis, Student Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	4
Applied Measurement in…	3
International Educational…	3
Journal of Educational…	3
Educational Research and…	2
IEEE Transactions on Learning…	2
ProQuest LLC	2
Applied Psychological…	1
ETS Research Report Series	1
Early Child Development and…	1
Educational Technology &…	1
Eurasian Journal of…	1
Grantee Submission	1
Hacettepe University Journal…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Intelligence	1
Journal of Psychoeducational…	1
Journal of Research in…	1
Practical Assessment,…	1
Psychological Assessment	1
Psychological Record	1
Psychological Review	1
Psychometrika	1
More ▼

Ueno, Maomi	2
Althoff, Robert R.	1
Andrich, David	1
Atar, Burcu	1
Ayer, Lynsay A.	1
Bay, Luz	1
Beretvas, S. Natasha	1
Bergner, Yoav	1
Bolsinova, Maria	1
Bos, Wilfried	1
Bridgeman, Brent	1
Brinkhuis, Matthieu J. S.	1
Bulut, Okan	1
Cai, Li	1
Camilli, Gregory	1
Chu, Wei	1
Cogo-Moreira, Hugo	1
DeBoer, George E.	1
Droschler, Stefan	1
Eggen, Theo J. H. M.	1
Erosheva, Elena A.	1
Falk, Carl F.	1
Fluke, Rickey	1
Frick, Theodore W.	1
More ▼