ERIC - Search Results

Descriptor

Goodness of Fit	11
Simulation	11
Test Construction	11
Test Items	5
Item Analysis	4
Item Response Theory	4
Mathematical Models	3
Career Development	2
Difficulty Level	2
Item Bias	2
Latent Trait Theory	2
Matrices	2
Maximum Likelihood Statistics	2
Models	2
Scoring	2
Test Theory	2
Testing	2
Ability	1
Ability Grouping	1
Achievement Tests	1
Adaptive Testing	1
Analysis of Variance	1
Chi Square	1
Comparative Analysis	1
Computer Assisted Testing	1
More ▼

Source

Applied Psychological…	1
Educational and Psychological…	1
Journal of Educational…	1
Journal of Outcome Measurement	1

Author

Cook, Linda L.	1
Curry, Allen R.	1
Dinero, Thomas E.	1
Dunbar, Stephen B.	1
Frisbie, David A.	1
Haertel, Edward	1
Hambleton, Ronald K.	1
Lee, Guemin	1
McKinley, Robert	1
Reckase, Mark D.	1
Rost, Jurgen	1
Rudas, Tamas	1
Stone, Gregory Ethan	1
Tang, Huixing	1
Wang, Wen-chung	1
Zwick, Rebecca	1
von Davier, Matthias	1
More ▼

Publication Type

Reports - Evaluative	7
Journal Articles	4
Reports - Research	3
Speeches/Meeting Papers	2
Reports - General	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Iowa Tests of Basic Skills	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

The Relative Appropriateness of Eight Measurement Models for Analyzing Scores from Tests Composed of Testlets.

Peer reviewed

Lee, Guemin; Dunbar, Stephen B.; Frisbie, David A. – Educational and Psychological Measurement, 2001

Conceptualized eight different types of measurement models for a test composed of testlets and studied the goodness of fit of those models to data using data from the Iowa Tests of Basic Skills and simulated data. The essentially tau-equivalent model and the congeneric model provided worse model fit than the other measurement models. (SLD)

Descriptors: Goodness of Fit, Measurement Techniques, Models, Scores

Rasch Analysis of Distractors in Multiple-choice Items.

Peer reviewed

Wang, Wen-chung – Journal of Outcome Measurement, 1998

A Rasch-type analysis is presented for multiple-choice items in which one parameter is assigned to each distractor. Results of a small simulation study show that the parameter recovery of the distractor model is very satisfactory. Analysis of a real dataset shows that some items fit the Rasch model rather than the distractor model. (SLD)

Descriptors: Distractors (Tests), Goodness of Fit, Item Response Theory, Multiple Choice Tests

A Conditional Item-Fit Index for Rasch Models.

Peer reviewed

Rost, Jurgen; von Davier, Matthias – Applied Psychological Measurement, 1994

A new item-fit index is proposed that is both a descriptive measure of deviance of single items and an index for statistical inference. This index is based on assumptions of the dichotomous and polytomous Rasch models for items with ordered categories. A simulation study is described. (SLD)

Descriptors: Equations (Mathematics), Goodness of Fit, Item Response Theory, Simulation

Estimating the Importance of Differential Item Functioning. Program Statistics Research Technical Report No. 95-3.

Download full text

Rudas, Tamas; Zwick, Rebecca – 1995

A method is proposed to assess the importance of differential item functioning (DIF) by estimating the largest possible fraction of the population in which DIF does not occur, or equivalently, the smallest possible portion of the population in which DIF may occur. The approach is based on latent class (C. C. Clogg, 1981) or mixture concepts, and…

Descriptors: Estimation (Mathematics), Goodness of Fit, Item Bias, Maximum Likelihood Statistics

Confirmatory Analysis of Test Structure Using Multidimensional Item Response Theory.

Download full text

McKinley, Robert – 1989

A confirmatory approach to assessing test structure using multidimensional item response theory (MIRT) was developed and evaluated. The approach involved adding to the exponent of the MIRT model an item structure matrix that allows the user to specify the ability dimensions measured by an item. Various combinations of item structures were fit to…

Descriptors: Ability, Chi Square, Goodness of Fit, Item Response Theory

A Computer Simulation Investigating the Applicability of the Rasch Model with Varying Item Discriminations.

Download full text

Dinero, Thomas E.; Haertel, Edward – 1976

This paper will discuss the results of a series of computer simulations comparing the Rasch logistic model to a series of models departing to various degrees from its assumption of equal discrimination power for all items. The results have implications for test construction and test scoring, indicating how closely the conventional raw score…

Descriptors: Comparative Analysis, Computer Programs, Goodness of Fit, Individual Differences

Unifactor Latent Trait Models Applied to Multifactor Tests: Results and Implications.

Peer reviewed

Reckase, Mark D. – Journal of Educational Statistics, 1979

Since all commonly used latent trait models assume a unidimensional test, the applicability of the procedure to obviously multidimensional tests is questionable. This paper presents the results of the application of latent trait, traditional, and factor analyses to a series of actual and hypothetical tests that vary in factoral complexity.…

Descriptors: Achievement Tests, Factor Analysis, Goodness of Fit, Higher Education

The Historical Development of Fit and Its Assessment in the Computer Adaptive Testing Environment.

Download full text

Stone, Gregory Ethan – 1994

The quality of fit between the data and the measurement model is fundamental to any discussion of results. Fit has been the subject of inquiry since as early as the 1920s. Most early explorations concentrated on assessing global fit or subset fits on fixed length, traditional paper and pencil tests given as a single unit. The detection of aberrant…

Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Assessment, Educational History

Some Results on the Robustness of Latent Trait Models.

Download full text

Hambleton, Ronald K.; Cook, Linda L. – 1978

The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…

Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis

A New IRT-Based Small Sample DIF Method.

Download full text

Tang, Huixing – 1994

This paper describes an item response theory (IRT) based method of differential item functioning (DIF) detection that involves neither separate calibration nor ability grouping. IRT is used to generate residual scores, scores free of the effects of person or group ability and item difficulty. Analysis of variance is then used to test the group…

Descriptors: Ability Grouping, Analysis of Variance, Goodness of Fit, Identification

Invariance of Rasch Model Ability Parameter Estimates Over Different Collections of Items.

Curry, Allen R.; And Others – 1978

The efficacy of employing subsets of items from a calibrated item pool to estimate the Rasch model person parameters was investigated. Specifically, the degree of invariance of Rasch model ability-parameter estimates was examined across differing collections of simulated items. The ability-parameter estimates were obtained from a simulation of…

Descriptors: Career Development, Difficulty Level, Equated Scores, Error of Measurement