ERIC - Search Results

Showing 1 to 15 of 25 results

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

Assessing Preknowledge Cheating via Innovative Measures: A Multiple-Group Analysis of Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts

Peer reviewed

Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021

Many approaches have been proposed to jointly analyze item responses and response times to understand behavioral differences between normally and aberrantly behaved test-takers. Biometric information, such as data from eye trackers, can be used to better identify these deviant testing behaviors in addition to more conventional data types. Given…

Descriptors: Cheating, Item Response Theory, Reaction Time, Eye Movements

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Evaluating the Effectiveness of the Expectation-Maximization (EM) Algorithm for Bayesian Network Calibration

Tingir, Seyfullah – ProQuest LLC, 2019

Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…

Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability

A Mixture IRTree Model for Performance Decline and Nonignorable Missing Data

Peer reviewed

Huang, Hung-Yu – Educational and Psychological Measurement, 2020

In educational assessments and achievement tests, test developers and administrators commonly assume that test-takers attempt all test items with full effort and leave no blank responses with unplanned missing values. However, aberrant response behavior--such as performance decline, dropping out beyond a certain point, and skipping certain items…

Descriptors: Item Response Theory, Response Style (Tests), Test Items, Statistical Analysis

Detection of Differential Item Functioning Using the Lasso Approach

Peer reviewed

Magis, David; Tuerlinckx, Francis; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2015

This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": a logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…

Descriptors: Test Bias, Test Items, Regression (Statistics), Scores

Definite Integral Automatic Analysis Mechanism Research and Development Using the "Find the Area by Integration" Unit as an Example

Peer reviewed

Ting, Mu Yu – EURASIA Journal of Mathematics, Science & Technology Education, 2017

Using the capabilities of expert knowledge structures, the researcher prepared test questions on the university calculus topic of "finding the area by integration." The quiz is divided into two types of multiple choice items (one out of four and one out of many). After the calculus course was taught and tested, the results revealed that…

Descriptors: Calculus, Mathematics Instruction, College Mathematics, Multiple Choice Tests

Multidimensional Classification of Examinees Using the Mixture Random Weights Linear Logistic Test Model

Peer reviewed

Choi, In-Hee; Wilson, Mark – Educational and Psychological Measurement, 2015

An essential feature of the linear logistic test model (LLTM) is that item difficulties are explained using item design properties. By taking advantage of this explanatory aspect of the LLTM, in a mixture extension of the LLTM, the meaning of latent classes is specified by how item properties affect item difficulties within each class. To improve…

Descriptors: Classification, Test Items, Difficulty Level, Statistical Analysis

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Examining the Validity of GOLD® with 4-Year-Old Dual Language Learners

Peer reviewed

Kim, Do-Hong; Lambert, Richard G.; Durham, Sean; Burts, Diane C. – Early Education and Development, 2018

Research Findings: This study builds on prior work related to the assessment of young dual language learners (DLLs). The purposes of the study were to (a) determine whether latent subgroups of preschool DLLs would replicate those found previously and (b) examine the validity of GOLD® by Teaching Strategies with empirically derived subgroups.…

Descriptors: Preschool Education, Teaching Methods, Bilingualism, Bilingual Education

Confirming Testlet Effects

Peer reviewed

DeMars, Christine E. – Applied Psychological Measurement, 2012

A testlet is a cluster of items that share a common passage, scenario, or other context. These items might measure something in common beyond the trait measured by the test as a whole; if so, the model for the item responses should allow for this testlet trait. But modeling testlet effects that are negligible makes the model unnecessarily…

Descriptors: Test Items, Item Response Theory, Comparative Analysis, Models

Improving Mantel-Haenszel DIF Estimation through Bayesian Updating

Peer reviewed

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012

This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…

Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics

Optimal Designs for the Rasch Model

Peer reviewed

Grasshoff, Ulrike; Holling, Heinz; Schwabe, Rainer – Psychometrika, 2012

In this paper, optimal designs will be derived for estimating the ability parameters of the Rasch model when difficulty parameters are known. It is well established that a design is locally D-optimal if the ability and difficulty coincide. But locally optimal designs require that the ability parameters to be estimated are known. To attenuate this…

Descriptors: Item Response Theory, Test Items, Psychometrics, Statistical Analysis

The Performance of the Linear Logistic Test Model When the Q-Matrix Is Misspecified: A Simulation Study

MacDonald, George T. – ProQuest LLC, 2014

A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…

Descriptors: Simulation, Item Response Theory, Models, Test Items

A Semiparametric Model for Jointly Analyzing Response Times and Accuracy in Computerized Testing

Peer reviewed

Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013

The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current models for RTs mainly focus on parametric models, which have the…

Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy
