ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	11

Descriptor

Bayesian Statistics	13
Evaluation Methods	13
Test Items	13
Item Response Theory	5
Accuracy	3
Computation	3
Item Analysis	3
Maximum Likelihood Statistics	3
Models	3
Monte Carlo Methods	3
Simulation	3
Statistical Analysis	3
Achievement Tests	2
Comparative Analysis	2
Computer Assisted Testing	2
Difficulty Level	2
Goodness of Fit	2
Markov Processes	2
Program Effectiveness	2
Research Methodology	2
Responses	2
Test Validity	2
Ability	1
Adaptive Testing	1
Algorithms	1
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
Journal of Educational and…	2
Applied Measurement in…	1
Early Education and…	1
Educational Measurement:…	1
Educational Research and…	1
Grantee Submission	1

Publication Type

Journal Articles	11
Reports - Research	10
Reports - Evaluative	3

Education Level

Grade 8	1
Middle Schools	1
Preschool Education	1
Secondary Education	1

Audience

Location

North Carolina (Charlotte)

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023

Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…

Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy

Five Methods for Estimating Angoff Cut Scores with IRT

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017

This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…

Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Direct link

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Variance Difference between Maximum Likelihood Estimation Method and Expected A Posteriori Estimation Method Viewed from Number of Test Items

Peer reviewed
PDF on ERIC

Download full text

Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016

The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items

Examining the Validity of GOLD® with 4-Year-Old Dual Language Learners

Peer reviewed

Direct link

Kim, Do-Hong; Lambert, Richard G.; Durham, Sean; Burts, Diane C. – Early Education and Development, 2018

Research Findings: This study builds on prior work related to the assessment of young dual language learners (DLLs). The purposes of the study were to (a) determine whether latent subgroups of preschool DLLs would replicate those found previously and (b) examine the validity of GOLD® by Teaching Strategies with empirically derived subgroups.…

Descriptors: Preschool Education, Teaching Methods, Bilingualism, Bilingual Education

Bayesian Multidimensional IRT Models with a Hierarchical Structure

Peer reviewed

Direct link

Sheng, Yanyan; Wikle, Christopher K. – Educational and Psychological Measurement, 2008

As item response models gain increased popularity in large-scale educational and measurement testing situations, many studies have been conducted on the development and applications of unidimensional and multidimensional models. Recently, attention has been paid to IRT-based models with an overall ability dimension underlying several ability…

Descriptors: Test Items, Individual Testing, Item Response Theory, Evaluation Methods

Applying Bayesian Item Selection Approaches to Adaptive Tests Using Polytomous Items

Peer reviewed

Direct link

Penfield, Randall D. – Applied Measurement in Education, 2006

This study applied the maximum expected information (MEI) and the maximum posterior-weighted information (MPI) approaches of computer adaptive testing item selection to the case of a test using polytomous items following the partial credit model. The MEI and MPI approaches are described. A simulation study compared the efficiency of ability…

Descriptors: Bayesian Statistics, Adaptive Testing, Computer Assisted Testing, Test Items

A Multilevel Bayesian Item Response Theory Method for Scaling Socioeconomic Status in International Studies of Education

Peer reviewed

Direct link

May, Henry – Journal of Educational and Behavioral Statistics, 2006

In this article, a new method is presented and implemented for deriving a scale of socioeconomic status (SES) from international survey data using a multilevel Bayesian item response theory (IRT) model. The proposed model incorporates both international anchor items and nation-specific items and is able to (a) produce student family SES scores…

Descriptors: Item Response Theory, Bayesian Statistics, Socioeconomic Status, Scaling

A Procedure for Investigating the Unidimensionality of Achievement Tests Based on Item Parameter Estimates.

Peer reviewed

Bejar, Isaac I. – Journal of Educational Measurement, 1980

Two procedures are presented for detecting violations of the unidimensionality assumption made by latent trait models without requiring factor analysis of inter-item correlation matrices. Both procedures require that departures from unidimensionality be hypothesized beforehand. This is usually possible in achievement tests where several content…

Descriptors: Achievement Tests, Bayesian Statistics, Cluster Grouping, Content Analysis

The Respective Advantages and Disadvantages of Different Ways of Measuring the Instructional Sensitivity of Reading Comprehension Test Items.

Perkins, Kyle – 1987

In this paper four classes of procedures for measuring the instructional sensitivity of reading comprehension test items are reviewed. True experimental designs are not recommended because some of the most important reading comprehension variables do not lend themselves to experimental manipulation. "Ex post facto" factorial designs are…

Descriptors: Bayesian Statistics, Correlation, Elementary Secondary Education, Evaluation Methods

Model Diagnostics for Bayesian Networks

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2006

Bayesian networks are frequently used in educational assessments primarily for learning about students' knowledge and skills. There is a lack of works on assessing fit of Bayesian networks. This article employs the posterior predictive model checking method, a popular Bayesian model checking tool, to assess fit of simple Bayesian networks. A…

Descriptors: Models, Educational Assessment, Diagnostic Tests, Evaluation Methods

Bejar, Isaac I.	1
Burts, Diane C.	1
Chun Wang	1
Durham, Sean	1
Edwards, Julianne M.	1
Finch, Holmes	1
Jing Lu	1
Jiwei Zhang	1
Joo, Seang-Hwane	1
Kim, Do-Hong	1
Lambert, Richard G.	1
Lee, Philseok	1
Lozano, José H.	1
Mahmud, Jumailiyah	1
May, Henry	1
Naga, Dali S.	1
Penfield, Randall D.	1
Perkins, Kyle	1
Revuelta, Javier	1
Sheng, Yanyan	1
Sinharay, Sandip	1
Sutikno, Muzayanah	1
Wikle, Christopher K.	1
Wyse, Adam E.	1
Xue Wang	1
More ▼