Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 2
Since 2016 (last 10 years): 2
Since 2006 (last 20 years): 3
Descriptor
Item Analysis: 9
Simulation: 9
Test Theory: 9
Test Items: 8
Career Development: 3
Difficulty Level: 3
Goodness of Fit: 3
Latent Trait Theory: 3
Statistical Analysis: 3
Test Construction: 3
Bayesian Statistics: 2
Author
Cook, Linda L.: 1
Curry, Allen R.: 1
DeCarlo, Lawrence T.: 1
Epstein, Kenneth I.: 1
Eray Selçuk: 1
Ergül Demir: 1
Hambleton, Ronald K.: 1
Knerr, Claramae S.: 1
Longford, Nicholas T.: 1
Sarvela, Paul D.: 1
Vale, C. David: 1
Publication Type
Reports - Research: 7
Journal Articles: 3
Information Analyses: 1
Reports - Descriptive: 1
Reports - Evaluative: 1
Speeches/Meeting Papers: 1
Audience
Researchers: 1
Assessments and Surveys
Comprehensive Tests of Basic…: 1
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimates of Item Response Theory under maximum likelihood and Bayesian approaches across different Monte Carlo simulation conditions. For this purpose, depending on changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
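As a point of reference for this entry, the sketch below simulates 2PL responses for one examinee and contrasts a maximum likelihood ability estimate with a Bayesian (MAP) estimate under a standard normal prior; the item parameters, the prior, and the grid search are illustrative assumptions, not the authors' simulation design.

```python
import numpy as np

rng = np.random.default_rng(1)
a = rng.uniform(0.8, 2.0, 20)                 # item discriminations (assumed)
b = rng.normal(0.0, 1.0, 20)                  # item difficulties (assumed)
theta_true = 1.0
u = rng.binomial(1, 1 / (1 + np.exp(-a * (theta_true - b))))   # simulated responses

grid = np.linspace(-4, 4, 801)                          # candidate ability values
P = 1 / (1 + np.exp(-(np.outer(grid, a) - a * b)))      # 2PL response probabilities
loglik = (u * np.log(P) + (1 - u) * np.log(1 - P)).sum(axis=1)
theta_ml = grid[np.argmax(loglik)]                      # maximum likelihood estimate
theta_map = grid[np.argmax(loglik - 0.5 * grid ** 2)]   # MAP under a N(0, 1) prior
print(theta_true, theta_ml, theta_map)
```

With a flat prior the two estimates coincide; the stronger the prior relative to the test length, the more the MAP estimate is pulled toward its mean, which is the kind of contrast such simulation conditions are designed to expose.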
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
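For comparison with this entry, the sketch below computes the classical item statistics it mentions (proportion correct as difficulty, point-biserial as discrimination) alongside a crude SDT-style d′ from a high/low split of total scores. The paper's measures rest on a latent "true split" over who knows the item, so the observable split used here is only an illustrative stand-in, and the data are fabricated.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
responses = rng.binomial(1, 0.6, size=(500, 30))   # fabricated 0/1 response matrix
item = responses[:, 0]
total = responses.sum(axis=1)

p_value = item.mean()                              # classical difficulty (p-value)
r_pb = np.corrcoef(item, total)[0, 1]              # point-biserial discrimination

high = total >= np.median(total)                   # observable proxy for the latent split
hit = item[high].mean()                            # "hit rate" among high scorers
fa = item[~high].mean()                            # "false-alarm rate" among low scorers
d_prime = norm.ppf(hit) - norm.ppf(fa)             # SDT-style discrimination
print(p_value, r_pb, d_prime)
```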
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
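The loss-function classification procedure described in this entry is specific to the article; purely as a familiar baseline, the sketch below computes a Mantel-Haenszel common odds ratio and its ETS delta transform for one item on fabricated data.

```python
import numpy as np

rng = np.random.default_rng(2)
score = rng.integers(10, 31, 1000)                 # matching total scores
group = rng.integers(0, 2, 1000)                   # 0 = reference, 1 = focal
correct = rng.binomial(1, 0.6 - 0.1 * group)       # item responses with mild built-in DIF

num = den = 0.0
for s in np.unique(score):                         # stratify by matching score
    m = score == s
    a = np.sum((group[m] == 0) & (correct[m] == 1))    # reference, correct
    b = np.sum((group[m] == 0) & (correct[m] == 0))    # reference, incorrect
    c = np.sum((group[m] == 1) & (correct[m] == 1))    # focal, correct
    d = np.sum((group[m] == 1) & (correct[m] == 0))    # focal, incorrect
    n = a + b + c + d
    if n:
        num += a * d / n
        den += b * c / n
alpha_mh = num / den                               # common odds ratio
delta_mh = -2.35 * np.log(alpha_mh)                # ETS delta scale
print(alpha_mh, delta_mh)
```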
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion-referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Vale, C. David; And Others – 1981
A simulation study was designed to determine appropriate linking methods for adaptive testing items. Three basic response data sets were created: randomly sampled, systematically sampled, and selected. The evaluative criteria used were fidelity of parameter estimation, asymptotic ability estimates, root-mean-square error…
Descriptors: Adaptive Testing, Aptitude Tests, Armed Forces, Bayesian Statistics
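One of the evaluative criteria named above, root-mean-square error of parameter recovery, is easy to illustrate: the sketch below puts simulated item-difficulty estimates back onto the generating scale with a mean/sigma linking transform and computes the RMSE. The generating values and the noise model are assumptions for illustration, not the study's design.

```python
import numpy as np

rng = np.random.default_rng(3)
b_true = rng.normal(0, 1, 60)                            # generating difficulties
b_est = 1.2 * b_true + 0.3 + rng.normal(0, 0.15, 60)     # estimates on a shifted scale

A = b_true.std() / b_est.std()                           # mean/sigma linking slope
B = b_true.mean() - A * b_est.mean()                     # mean/sigma linking intercept
b_linked = A * b_est + B                                 # estimates placed on the true scale

rmse = np.sqrt(np.mean((b_linked - b_true) ** 2))        # fidelity of parameter recovery
print(A, B, rmse)
```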
Hambleton, Ronald K.; Cook, Linda L. – 1978
The purpose of the present research was to study systematically the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. Using computer-simulated test data, we studied the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance-level parameters, test length,…
Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis
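For readers who want the three models side by side, here is a minimal sketch of the logistic item characteristic curve, with the 1PL, 2PL, and 3PL obtained by fixing or freeing the discrimination a and the pseudo-chance level c; the parameter values are arbitrary.

```python
import numpy as np

def icc(theta, a=1.0, b=0.0, c=0.0):
    """Three-parameter logistic ICC; the 1PL and 2PL fall out as special cases.
    1.7 is the conventional scaling constant used with the normal-ogive metric."""
    return c + (1 - c) / (1 + np.exp(-1.7 * a * (theta - b)))

theta = np.linspace(-3, 3, 7)
print(icc(theta))                        # 1PL: a fixed at 1, no guessing floor
print(icc(theta, a=1.8))                 # 2PL: discrimination free
print(icc(theta, a=1.8, c=0.2))          # 3PL: adds a pseudo-chance level of 0.2
```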
Curry, Allen R.; And Others – 1978
The efficacy of employing subsets of items from a calibrated item pool to estimate Rasch model person parameters was investigated. Specifically, the degree of invariance of Rasch model ability-parameter estimates was examined across differing collections of simulated items. The ability-parameter estimates were obtained from a simulation of…
Descriptors: Career Development, Difficulty Level, Equated Scores, Error of Measurement
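The invariance question in this entry can be illustrated directly: a Newton-Raphson maximum likelihood ability estimate computed from two disjoint halves of a calibrated item pool should agree up to estimation error. The difficulties, the response pattern, and the split below are made up for illustration.

```python
import numpy as np

def rasch_theta(responses, b, iters=20):
    """ML ability estimate for a 0/1 response vector given calibrated difficulties b."""
    theta = 0.0
    for _ in range(iters):
        p = 1 / (1 + np.exp(-(theta - b)))                       # Rasch response probabilities
        theta += (responses.sum() - p.sum()) / (p * (1 - p)).sum()   # Newton-Raphson step
    return theta

rng = np.random.default_rng(4)
b = rng.normal(0, 1, 40)                                         # calibrated item pool
u = rng.binomial(1, 1 / (1 + np.exp(-(0.5 - b))))                # examinee with theta = 0.5
print(rasch_theta(u[:20], b[:20]), rasch_theta(u[20:], b[20:]))  # estimates from two subsets
```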
Yen, Wendy M. – 1979
Three test-analysis models were used to analyze three types of simulated test score data plus the results of eight achievement tests. Chi-square goodness-of-fit statistics were used to evaluate the appropriateness of the models to the four kinds of data. Data were generated to simulate the responses of 1,000 students to 36 pseudo-items by…
Descriptors: Achievement Tests, Correlation, Goodness of Fit, Item Analysis
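One common form of such a chi-square goodness-of-fit check bins examinees by ability and compares observed with model-predicted proportions correct for an item; the sketch below does this for a single well-fitting 2PL item. The binning, the model, and the data are illustrative assumptions, not the study's procedure.

```python
import numpy as np

rng = np.random.default_rng(5)
theta = rng.normal(0, 1, 1000)                     # abilities
a_i, b_i = 1.2, 0.0                                # 2PL parameters for one item
p_model = 1 / (1 + np.exp(-a_i * (theta - b_i)))
u = rng.binomial(1, p_model)                       # responses generated from the model

bins = np.quantile(theta, np.linspace(0, 1, 11))   # 10 ability groups of equal size
idx = np.clip(np.digitize(theta, bins[1:-1]), 0, 9)
chi2 = 0.0
for g in range(10):
    m = idx == g
    O, E, n = u[m].mean(), p_model[m].mean(), m.sum()
    chi2 += n * (O - E) ** 2 / (E * (1 - E))       # observed vs. predicted proportion correct
print(chi2)                                        # compare to a chi-square with roughly 8 df
```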
Sarvela, Paul D. – 1986
Four discrimination indices were compared using score distributions that were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
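Two of the more familiar discrimination indices are easy to reproduce on fabricated mastery-test data: the upper-lower (D) index based on 27% extreme groups and the point-biserial correlation. The four indices the study actually compares are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(6)
scores = rng.binomial(1, 0.8, size=(110, 20))      # 110 simulated examinees, 20 mastery items
item = scores[:, 0]
total = scores.sum(axis=1)

order = np.argsort(total)
k = len(total) * 27 // 100                         # conventional 27% upper and lower groups
D = item[order[-k:]].mean() - item[order[:k]].mean()   # upper-group minus lower-group p-value
r_pb = np.corrcoef(item, total)[0, 1]              # point-biserial with total score
print(D, r_pb)
```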