ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	10

Descriptor

Foreign Countries	11
Monte Carlo Methods	11
Test Items	11
Item Response Theory	7
Bayesian Statistics	4
Markov Processes	4
Models	4
Achievement Tests	3
Adaptive Testing	3
Computer Assisted Testing	3
Computer Software	3
International Assessment	3
Simulation	3
Test Bias	3
Accuracy	2
Diagnostic Tests	2
Error of Measurement	2
High Stakes Tests	2
Information Technology	2
Internet	2
Item Analysis	2
Nonparametric Statistics	2
Sampling	2
Scores	2
Secondary School Students	2

Source

Educational and Psychological…	3
ETS Research Report Series	1
Interactive Learning…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Pedagogical…	1
Journal of Psychoeducational…	1

Publication Type

Journal Articles	10
Reports - Research	9
Reports - Evaluative	2

Education Level

Secondary Education	4
Junior High Schools	2
Middle Schools	2
Grade 8	1
High Schools	1
Higher Education	1
Postsecondary Education	1

Location

Saudi Arabia	2
Canada	1
China	1
China (Shanghai)	1
Germany	1
South Africa	1
Taiwan	1

Assessments and Surveys

Program for International…	2
Cognitive Abilities Test	1
Graduate Record Examinations	1
Trends in International…	1
Wechsler Adult Intelligence…	1

Showing all 11 results

The Feasibility of Computerized Adaptive Testing of the National Benchmark Test: A Simulation Study

Peer reviewed

Musa Adekunle Ayanwale; Mdutshekelwa Ndlovu – Journal of Pedagogical Research, 2024

The COVID-19 pandemic has had a significant impact on high-stakes testing, including the national benchmark tests in South Africa. Current linear testing formats have been criticized for their limitations, leading to a shift towards Computerized Adaptive Testing [CAT]. Assessments with CAT are more precise and take less time. Evaluation of CAT…

Descriptors: Adaptive Testing, Benchmarking, National Competency Tests, Computer Assisted Testing

Explanatory Cognitive Diagnostic Modeling Incorporating Response Times

Peer reviewed

Qiao, Xin; Jiao, Hong – Journal of Educational Measurement, 2021

This study proposes explanatory cognitive diagnostic model (CDM) jointly incorporating responses and response times (RTs) with the inclusion of item covariates related to both item responses and RTs. The joint modeling of item responses and RTs intends to provide more information for cognitive diagnosis while item covariates can be used to predict…

Descriptors: Cognitive Measurement, Models, Reaction Time, Test Items

A Bayesian Item Response Model for Examining Item Position Effects in Complex Survey Data

Peer reviewed

Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…

Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format

Evaluating a Computerized Adaptive Testing Version of a Cognitive Ability Test Using a Simulation Study

Peer reviewed

Tsaousis, Ioannis; Sideridis, Georgios D.; AlGhamdi, Hannan M. – Journal of Psychoeducational Assessment, 2021

This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using a simulation study protocol put forth by Han, K. T. (2018a). For the needs of the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Cognitive Ability

EW-KNN: Evaluating Information Technology Courses in High School with a Non-Parametric Cognitive Diagnosis Method

Peer reviewed

Wanxue Zhang; Lingling Meng; Bilan Liang – Interactive Learning Environments, 2023

With the continuous development of education, personalized learning has attracted great attention. How to evaluate students' learning effects has become increasingly important. In information technology courses, the traditional academic evaluation focuses on the student's learning outcomes, such as "scores" or "right/wrong,"…

Descriptors: Information Technology, Computer Science Education, High School Students, Scoring

A Short Note on Obtaining Point Estimates of the IRT Ability Parameter with MCMC Estimation in Mplus: How Many Plausible Values Are Needed?

Peer reviewed

Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019

Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…

Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures

It Might Not Make a Big DIF: Improved Differential Test Functioning Statistics That Account for Sampling Variability

Peer reviewed

Chalmers, R. Philip; Counsell, Alyssa; Flora, David B. – Educational and Psychological Measurement, 2016

Differential test functioning, or DTF, occurs when one or more items in a test demonstrate differential item functioning (DIF) and the aggregate of these effects are witnessed at the test level. In many applications, DTF can be more important than DIF when the overall effects of DIF at the test level can be quantified. However, optimal statistical…

Descriptors: Test Bias, Sampling, Test Items, Statistical Analysis

Gender and Minority Achievement Gaps in Science in Eighth Grade: Item Analyses of Nationally Representative Data. Research Report. ETS RR-17-36

Peer reviewed

Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017

In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…

Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8

Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

Peer reviewed

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2013

Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

The Theory about CD-CAT Based on FCA and Its Application

Peer reviewed

Shuqun, Yang; Shuliang, Ding; Zhiqiang, Yao – International Journal of Distance Education Technologies, 2009

Cognitive diagnosis (CD) plays an important role in intelligent tutoring system. Computerized adaptive testing (CAT) is adaptive, fair, and efficient, which is suitable to large-scale examination. Traditional cognitive diagnostic test needs quite large number of items, the efficient and tailored CAT could be a remedy for it, so the CAT with…

Descriptors: Monte Carlo Methods, Distance Education, Adaptive Testing, Intelligent Tutoring Systems

Reliability Estimation for Single Dichotomous Items. Research Report 94-5.

Meijer, Rob R.; And Others – 1994

Three methods for the estimation of the reliability of single dichotomous items are discussed. All methods are based on the assumptions of nondecreasing and nonintersecting item response functions and the Mokken model of double monotonicity. Based on analytical and Monte Carlo studies, it is concluded that one method is superior to the other two…

Descriptors: Estimation (Mathematics), Foreign Countries, Item Response Theory, Monte Carlo Methods

Author

AlGhamdi, Hannan M.	1
Bilan Liang	1
Chalmers, R. Philip	1
Counsell, Alyssa	1
Dimitrov, Dimiter M.	1
Fifield, Steve	1
Flora, David B.	1
Ford, Danielle	1
Glutting, Joseph	1
Huang, Hung-Yu	1
Jiao, Hong	1
Lingling Meng	1
Luo, Yong	1
Mdutshekelwa Ndlovu	1
Meijer, Rob R.	1
Musa Adekunle Ayanwale	1
Nandakumar, Ratna	1
Qian, Xiaoyu	1
Qiao, Xin	1
Robitzsch, Alexander	1
Shuliang, Ding	1
Shuqun, Yang	1
Sideridis, Georgios D.	1
Trendtel, Matthias	1