ERIC - Search Results

Showing 1 to 15 of 29 results

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed

Selçuk, Eray; Demir, Ergül – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimates of Item Response Theory under maximum likelihood and Bayesian approaches across different Monte Carlo simulation conditions. For this purpose, depending on the changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Problem Solving Learning Environments and Assessment: A Knowledge Space Theory Approach

Peer reviewed

Reimann, Peter; Kickmeier-Rust, Michael; Albert, Dietrich – Computers & Education, 2013

This paper explores the relation between problem solving learning environments (PSLEs) and assessment concepts. The general framework of evidence-centered assessment design is used to describe PSLEs in terms of assessment concepts, and to identify similarities between the process of assessment design and of PSLE design. We use a recently developed…

Descriptors: Teaching Methods, Psychometrics, Problem Solving, Test Theory

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Optimal Decision Making in Neural Inhibition Models

Peer reviewed

van Ravenzwaaij, Don; van der Maas, Han L. J.; Wagenmakers, Eric-Jan – Psychological Review, 2012

In their influential "Psychological Review" article, Bogacz, Brown, Moehlis, Holmes, and Cohen (2006) discussed optimal decision making as accomplished by the drift diffusion model (DDM). The authors showed that neural inhibition models, such as the leaky competing accumulator model (LCA) and the feedforward inhibition model (FFI), can mimic the…

Descriptors: Intelligent Tutoring Systems, Inhibition, Bayesian Statistics, Decision Making

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

How Often Do Subscores Have Added Value? Results from Operational and Simulated Data

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2010

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman suggested a method based on classical test theory to determine whether subscores have added value over total scores. In this article I first provide a rich collection of results regarding when subscores were found to have added…

Descriptors: Scores, Test Theory, Simulation, Reliability

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Predictive Control of Speededness in Adaptive Testing

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 2009

An adaptive testing method is presented that controls the speededness of a test using predictions of the test takers' response times on the candidate items in the pool. Two different types of predictions are investigated: posterior predictions given the actual response times on the items already administered and posterior predictions that use the…

Descriptors: Simulation, Adaptive Testing, Vocational Aptitude, Bayesian Statistics

Effect of Simultaneous Violations of Essential Tau-Equivalence and Uncorrelated Error on Coefficient Alpha.

Peer reviewed

Komaroff, Eugene – Applied Psychological Measurement, 1997

Evaluated, through simulation, coefficient alpha under violations of two classical test theory assumptions: essential tau-equivalence and uncorrelated errors. Discusses the interactive effects of both violations with true and error scores. Provides empirical evidence of the derivation of M. Novick and C. Lewis (1967). (SLD)

Descriptors: Correlation, Reliability, Simulation, Test Theory

Coefficient Alpha and Composite Reliability with Interrelated Nonhomogeneous Items.

Peer reviewed

Raykov, Tenko – Applied Psychological Measurement, 1998

Examines the relationship between Cronbach's coefficient alpha and the reliability of a composite of a prespecified set of interrelated nonhomogeneous components through simulation. Shows that alpha can over- or underestimate scale reliability at the population level. Illustrates the bias in terms of structural parameters. (SLD)

Descriptors: Reliability, Simulation, Statistical Bias, Structural Equation Models
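
Illustrative note: the claim that alpha can over- or underestimate composite reliability at the population level can be checked with a short calculation. The sketch below is not taken from the article. It assumes a one-factor model with unit factor variance, and the loadings, error variances, and error covariance are hypothetical values chosen only for illustration. It computes population values directly from the model-implied covariance matrix rather than simulating data.

```python
def coefficient_alpha(cov):
    """Cronbach's alpha from a k x k item covariance matrix (list of lists)."""
    k = len(cov)
    item_var = sum(cov[i][i] for i in range(k))
    total_var = sum(sum(row) for row in cov)
    return k / (k - 1) * (1 - item_var / total_var)


def composite_reliability(loadings, error_cov):
    """Reliability of the unit-weighted sum score under a one-factor model
    with factor variance 1: true-score variance over total variance."""
    true_var = sum(loadings) ** 2
    error_var = sum(sum(row) for row in error_cov)
    return true_var / (true_var + error_var)


def model_covariance(loadings, error_cov):
    """Model-implied item covariance: loading products plus error covariances."""
    k = len(loadings)
    return [[loadings[i] * loadings[j] + error_cov[i][j] for j in range(k)]
            for i in range(k)]


def diagonal(values):
    """Square matrix with the given values on the diagonal and zeros elsewhere."""
    k = len(values)
    return [[values[i] if i == j else 0.0 for j in range(k)] for i in range(k)]


# Case 1 (hypothetical): unequal loadings, uncorrelated errors.
loadings_1 = [0.9, 0.7, 0.5, 0.3]
errors_1 = diagonal([1 - l ** 2 for l in loadings_1])
alpha_1 = coefficient_alpha(model_covariance(loadings_1, errors_1))
rel_1 = composite_reliability(loadings_1, errors_1)

# Case 2 (hypothetical): equal loadings, one pair of positively correlated errors.
loadings_2 = [0.6, 0.6, 0.6, 0.6]
errors_2 = diagonal([0.64, 0.64, 0.64, 0.64])
errors_2[0][1] = errors_2[1][0] = 0.3
alpha_2 = coefficient_alpha(model_covariance(loadings_2, errors_2))
rel_2 = composite_reliability(loadings_2, errors_2)

print(f"Case 1: alpha = {alpha_1:.3f}, composite reliability = {rel_1:.3f}")
print(f"Case 2: alpha = {alpha_2:.3f}, composite reliability = {rel_2:.3f}")
```

Under these assumed values, alpha falls below the composite reliability in the first case and above it in the second, which matches the direction of bias the abstract describes.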

The Reliability of Linearly Equated Tests.

Peer reviewed

Segall, Daniel O. – Psychometrika, 1994

An asymptotic expression for the reliability of a linearly equated test is developed using normal theory. Reliability is expressed as the product of test reliability before equating and an adjustment term that is a function of the sample sizes used to estimate the linear equating transformation. The approach is illustrated. (SLD)

Descriptors: Equated Scores, Error of Measurement, Estimation (Mathematics), Sample Size
