Showing all 14 results
Peer reviewed
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates scoring methods, including number-correct scoring, IRT theta scoring, and hybrid scoring, in terms of scale-score stability over time. A simulation study examined how well five scoring methods preserve the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
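As an aside, the contrast between number-correct scoring and IRT theta scoring mentioned in this entry can be sketched briefly. This is a minimal illustration only, assuming a 2PL model and a crude grid-search MLE; the item parameters and response patterns are hypothetical, not from the study:

```python
import math

def p_correct(theta, a, b):
    """2PL item response function: P(correct | theta)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def number_correct(responses):
    """Number-correct scoring: the raw sum of 0/1 item responses."""
    return sum(responses)

def theta_mle(responses, a_params, b_params):
    """IRT theta scoring via a simple grid-search MLE on [-4, 4]."""
    grid = [x / 100.0 for x in range(-400, 401)]
    def loglik(theta):
        ll = 0.0
        for u, a, b in zip(responses, a_params, b_params):
            p = p_correct(theta, a, b)
            ll += u * math.log(p) + (1 - u) * math.log(1 - p)
        return ll
    return max(grid, key=loglik)
```

Unlike the number-correct score, the theta estimate weights responses by item parameters, so two examinees with the same raw score can receive different theta scores.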
Peer reviewed
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods they use. This study investigates the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
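The item-selection step this entry refers to is commonly implemented as maximum-information selection. The following is a sketch for the simpler unidimensional 2PL case only (MCAT designs generalize this to multiple dimensions); the item pool and parameters are hypothetical:

```python
import math

def select_next_item(theta_hat, item_pool, administered):
    """Maximum-information item selection for a unidimensional CAT:
    pick the unadministered 2PL item (a, b) with the largest Fisher
    information a^2 * p * (1 - p) at the current ability estimate."""
    best, best_info = None, -1.0
    for idx, (a, b) in enumerate(item_pool):
        if idx in administered:
            continue
        p = 1.0 / (1.0 + math.exp(-a * (theta_hat - b)))
        info = a * a * p * (1.0 - p)
        if info > best_info:
            best, best_info = idx, info
    return best
```

For equal discriminations, this reduces to choosing the item whose difficulty is closest to the current theta estimate.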
Zhang, Xue; Wang, Chun; Tao, Jian – Grantee Submission, 2018
Testing item-level fit is important in scale development to guide item revision/deletion. Many item-level fit indices have been proposed in the literature, yet none of them is directly applicable to an important family of models, namely, the higher order item response theory (HO-IRT) models. In this study, chi-square-based fit indices (i.e., Yen's…
Descriptors: Item Response Theory, Models, Test Items, Goodness of Fit
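The chi-square-based fit index named here (Yen's Q1) compares observed and model-expected proportions correct across ability groups. A minimal sketch of the statistic itself, assuming examinees have already been stratified into groups (the group sizes and proportions below are hypothetical):

```python
def yens_q1(group_sizes, observed_props, expected_props):
    """Yen's Q1 item-fit statistic: examinees are grouped (typically
    into 10 ability strata); for each group k with N_k examinees,
    O_k is the observed proportion correct and E_k the model-expected
    proportion.  Q1 = sum_k N_k * (O_k - E_k)^2 / (E_k * (1 - E_k)),
    approximately chi-square distributed when the item fits."""
    q1 = 0.0
    for n_k, o_k, e_k in zip(group_sizes, observed_props, expected_props):
        q1 += n_k * (o_k - e_k) ** 2 / (e_k * (1.0 - e_k))
    return q1
```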
Pei-Hsuan Chiu – ProQuest LLC, 2018
Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…
Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models
Peer reviewed
Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020
Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…
Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format
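The random block effect described in this entry can be illustrated in its simplest form. The BRB IRT model itself is Bayesian and estimated via Monte Carlo methods; this sketch only shows how a shared block (testlet) effect enters a 2PL-style response function, with hypothetical parameter values:

```python
import math

def p_item_in_block(theta, a, b, gamma):
    """Response probability for an item inside a block (testlet):
    a 2PL curve in which the block effect gamma, shared by every item
    in the block, shifts the effective ability.  Because gamma is
    common within a block, it induces local dependence among
    within-block responses beyond what theta explains."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b - gamma)))
```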
Peer reviewed
PDF on ERIC
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Peer reviewed
Sideridis, Georgios D.; Tsaousis, Ioannis; Alamri, Abeer A. – Educational and Psychological Measurement, 2020
The present study uses the Bayesian structural equation modeling (BSEM) methodology to establish approximate measurement invariance (A-MI), using data from a national examination in Saudi Arabia, as an alternative when strong invariance criteria are not met. Instead, we illustrate how to account for the absence of…
Descriptors: Bayesian Statistics, Structural Equation Models, Foreign Countries, Error of Measurement
Peer reviewed
Lee, Woo-yeol; Cho, Sun-Joo – Journal of Educational Measurement, 2017
Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…
Descriptors: Test Items, Item Response Theory, Item Analysis, Simulation
Kim, Weon H. – ProQuest LLC, 2017
The purpose of the present study is to apply the item response theory (IRT) and testlet response theory (TRT) models to a reading comprehension test. This study applied the TRT models and the traditional IRT model to a seventh-grade reading comprehension test (n = 8,815) with eight testlets. These three models were compared to determine the best…
Descriptors: Item Response Theory, Test Items, Correlation, Reading Tests
Fox, Jean-Paul – 2000
An item response theory (IRT) model is used as a measurement error model for the dependent variable of a multilevel model where tests or questionnaires consisting of separate items are used to perform a measurement error analysis. The advantage of using latent scores as dependent variables of a multilevel model is that it offers the possibility of…
Descriptors: Bayesian Statistics, Error of Measurement, Estimation (Mathematics), Item Response Theory
Wingersky, Marilyn S. – 1989
In a variable-length adaptive test with a stopping rule that relied on the asymptotic standard error of measurement of the examinee's estimated true score, M. S. Stocking (1987) discovered that it was sufficient to know the examinee's true score and the number of items administered to predict with some accuracy whether an examinee's true score was…
Descriptors: Adaptive Testing, Bayesian Statistics, Error of Measurement, Estimation (Mathematics)
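The stopping rule this entry discusses ends a variable-length adaptive test once the asymptotic standard error of measurement is small enough. A sketch of that logic under a 2PL model, with a hypothetical SE target and maximum length (not the values from the study):

```python
import math

def sem(theta_hat, administered_items):
    """Asymptotic standard error of measurement at theta_hat:
    1 / sqrt(test information), where each 2PL item (a, b)
    contributes Fisher information a^2 * p * (1 - p)."""
    info = 0.0
    for a, b in administered_items:
        p = 1.0 / (1.0 + math.exp(-a * (theta_hat - b)))
        info += a * a * p * (1.0 - p)
    return 1.0 / math.sqrt(info)

def should_stop(theta_hat, administered_items, se_target=0.30, max_items=30):
    """Variable-length CAT stopping rule: stop when the SEM falls
    below the target, or when the maximum test length is reached."""
    if len(administered_items) >= max_items:
        return True
    return sem(theta_hat, administered_items) <= se_target
```

Stocking's observation, as summarized above, is that test length and true score alone predict this stopping behavior fairly well.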
van der Linden, Wim J. – 1996
R. J. Owen (1975) proposed an approximate empirical Bayes procedure for item selection in adaptive testing. The procedure replaces the true posterior by a normal approximation with closed-form expressions for its first two moments. This approximation was necessary to minimize the computational complexity involved in a fully Bayesian approach, but…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computation
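Owen's procedure replaces the true posterior with a normal distribution matching its first two moments, computed in closed form. The sketch below gets those two moments numerically instead (a grid stands in for Owen's closed-form expressions), which makes the idea concrete without reproducing his formulas; the prior and item parameters are hypothetical:

```python
import math

def posterior_moments(prior_mu, prior_sd, u, a, b, n=2001, span=6.0):
    """First two moments of the posterior over theta after one 2PL
    response u (0/1), starting from a N(prior_mu, prior_sd^2) prior.
    Owen-style approximate Bayes would replace the true posterior
    with a normal having exactly these two moments."""
    lo = prior_mu - span * prior_sd
    step = 2.0 * span * prior_sd / (n - 1)
    grid = [lo + i * step for i in range(n)]
    w = []
    for th in grid:
        prior = math.exp(-0.5 * ((th - prior_mu) / prior_sd) ** 2)
        p = 1.0 / (1.0 + math.exp(-a * (th - b)))
        like = p if u == 1 else 1.0 - p
        w.append(prior * like)
    z = sum(w)
    mu = sum(th * wi for th, wi in zip(grid, w)) / z
    var = sum((th - mu) ** 2 * wi for th, wi in zip(grid, w)) / z
    return mu, math.sqrt(var)
```

A correct response pulls the posterior mean up and shrinks its spread; an incorrect response pulls it down.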
Peer reviewed
Kim, Seock-Ho; And Others – Applied Psychological Measurement, 1994
Type I error rates of F. M. Lord's chi square test for differential item functioning were investigated using Monte Carlo simulations with marginal maximum likelihood estimation and marginal Bayesian estimation algorithms. Lord's chi square did not provide useful Type I error control for the three-parameter logistic model at these sample sizes.…
Descriptors: Algorithms, Bayesian Statistics, Chi Square, Error of Measurement
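Lord's chi-square, the DIF statistic studied in this entry, is the squared Mahalanobis distance between the item-parameter estimates of two groups. A sketch for the two-parameter (a, b) case, with hypothetical estimates and covariance matrices:

```python
def lords_chi_square(params_ref, params_foc, cov_ref, cov_foc):
    """Lord's chi-square for DIF on one item: v' * inv(S) * v, where
    v is the difference between the reference- and focal-group
    (a, b) estimates and S is the sum of their 2x2 sampling
    covariance matrices (inverted here in closed form)."""
    v = [params_ref[0] - params_foc[0], params_ref[1] - params_foc[1]]
    s = [[cov_ref[i][j] + cov_foc[i][j] for j in range(2)] for i in range(2)]
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    inv = [[s[1][1] / det, -s[0][1] / det],
           [-s[1][0] / det, s[0][0] / det]]
    return sum(v[i] * inv[i][j] * v[j] for i in range(2) for j in range(2))
```

Under no DIF the statistic is approximately chi-square with 2 degrees of freedom; the entry's finding is that this nominal behavior breaks down for the three-parameter logistic model at the sample sizes studied.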
Peer reviewed
Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004
The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…
Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement
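The analytic SEs this entry describes come from Fisher information. As a simplified, related illustration (not the paper's derivation), the asymptotic SE of a 2PL difficulty estimate is the inverse square root of the item's information for b summed over examinees; the theta values below are hypothetical stand-ins for the examinee distribution:

```python
import math

def se_item_difficulty(b_hat, a, thetas):
    """Asymptotic SE of a 2PL item-difficulty estimate:
    SE(b) ~ 1 / sqrt(a^2 * sum_i p_i * (1 - p_i)), where p_i is the
    model probability of a correct response for examinee i.  More
    examinees near the item's difficulty mean more information and
    a smaller SE."""
    info = 0.0
    for th in thetas:
        p = 1.0 / (1.0 + math.exp(-a * (th - b_hat)))
        info += a * a * p * (1.0 - p)
    return 1.0 / math.sqrt(info)
```

Such a function of the ability distribution alone is how SEs can be approximated "without examinees' responses to test items," as the abstract puts it.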