ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Simulation	14
Test Items	14
Test Theory	14
Item Analysis	8
Difficulty Level	5
Goodness of Fit	5
Statistical Analysis	5
Latent Trait Theory	4
Bayesian Statistics	3
Comparative Analysis	3
Computation	3
Item Response Theory	3
Scores	3
Scoring	3
Test Construction	3
Adaptive Testing	2
Career Development	2
Classification	2
Correlation	2
Equated Scores	2
Mathematical Models	2
Multiple Choice Tests	2
Reliability	2
Statistical Bias	2
Test Length	2
More ▼

Source

Applied Psychological…	2
International Journal of…	2
ETS Research Report Series	1
Journal of Educational…	1
Journal of Educational and…	1
ProQuest LLC	1

Author

Yen, Wendy M.	2
Bogan, Evelyn Doody	1
Cook, Linda L.	1
Curry, Allen R.	1
DeCarlo, Lawrence T.	1
Deng, Nina	1
Eray Selçuk	1
Ergül Demir	1
Hambleton, Ronald K.	1
Kogar, Hakan	1
Longford, Nicholas T.	1
Raykov, Tenko	1
Sarvela, Paul D.	1
Vale, C. David	1
Zhang, Jinming	1
van der Linden, Wim J.	1
More ▼

Publication Type

Reports - Research	9
Journal Articles	7
Reports - Evaluative	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Information Analyses	1
Reports - Descriptive	1

Education Level

Elementary Education	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	1
Comprehensive Tests of Basic…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study, determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Predictive Control of Speededness in Adaptive Testing

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2009

An adaptive testing method is presented that controls the speededness of a test using predictions of the test takers' response times on the candidate items in the pool. Two different types of predictions are investigated: posterior predictions given the actual response times on the items already administered and posterior predictions that use the…

Descriptors: Simulation, Adaptive Testing, Vocational Aptitude, Bayesian Statistics

Coefficient Alpha and Composite Reliability with Interrelated Nonhomogeneous Items.

Peer reviewed

Raykov, Tenko – Applied Psychological Measurement, 1998

Examines the relationship between Cronbach's coefficient alpha and the reliability of a composite of a prespecified set of interrelated nonhomogeneous components through simulation. Shows that alpha can over- or underestimate scale reliability at the population level. Illustrates the bias in terms of structural parameters. (SLD)

Descriptors: Reliability, Simulation, Statistical Bias, Structural Equation Models

Conditional Covariance Theory and DETECT for Polytomous Items. Research Report. ETS RR-04-50

Peer reviewed
PDF on ERIC

Download full text

Zhang, Jinming – ETS Research Report Series, 2004

This paper extends the theory of conditional covariances to polytomous items. It has been mathematically proven that under some mild conditions, commonly assumed in the analysis of response data, the conditional covariance of two items, dichotomously or polytomously scored, is positive if the two items are dimensionally homogeneous and negative…

Descriptors: Test Items, Test Theory, Correlation, National Competency Tests

Detecting Multidimensionality and Examining Its Effects on Vertical Equating with the Three-Parameter Logistic Model.

Bogan, Evelyn Doody; Yen, Wendy M. – 1983

Four multidimensional data configurations and one unidimensional data configuration were simulated for three differences in mean difficulty between two tests to be equated. Two chi-square statistics, Q1 and Q2, were examined for their ability to detect multidimensionality. Results indicated that Q1 did not discriminate between any of the…

Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Latent Trait Theory

Methods for Linking Item Parameters. Final Report.

Download full text

Vale, C. David; And Others – 1981

A simulation study to determine appropriate linking methods for adaptive testing items was designed. Three basic data sets for responses were created. These were randomly sampled, systematically sampled, and selected data sets. The evaluative criteria used were fidelity of parameter estimation, asymptotic ability estimates, root-mean-square error…

Descriptors: Adaptive Testing, Aptitude Tests, Armed Forces, Bayesian Statistics

Some Results on the Robustness of Latent Trait Models.

Download full text

Hambleton, Ronald K.; Cook, Linda L. – 1978

The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…

Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis

Invariance of Rasch Model Ability Parameter Estimates Over Different Collections of Items.

Curry, Allen R.; And Others – 1978

The efficacy of employing subsets of items from a calibrated item pool to estimate the Rasch model person parameters was investigated. Specifically, the degree of invariance of Rasch model ability-parameter estimates was examined across differing collections of simulated items. The ability-parameter estimates were obtained from a simulation of…

Descriptors: Career Development, Difficulty Level, Equated Scores, Error of Measurement

Using Simulation Results When Choosing a Latent-Trait Model.

Yen, Wendy M. – 1979

Three test-analysis models were used to analyze three types of simulated test score data plus the results of eight achievement tests. Chi-square goodness-of-fit statistics were used to evaluate the appropriateness of the models to the four kinds of data. Data were generated to simulate the responses of 1,000 students to 36 pseudo-items by…

Descriptors: Achievement Tests, Correlation, Goodness of Fit, Item Analysis

Discrimination Indices Commonly Used in Military Training Environments: Effects of Departures from Normal Distributions.

Download full text

Sarvela, Paul D. – 1986

Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests