Showing all 14 results
Bukhari, Nurliyana – ProQuest LLC, 2017
Newer educational assessments are generally considered more demanding than what students are currently prepared to face. Two types of factors may contribute to test scores: (1) factors or dimensions that are of primary interest to the construct or test domain; and (2) factors or dimensions that are irrelevant to the construct, causing…
Descriptors: Item Response Theory, Models, Psychometrics, Computer Simulation
Peer reviewed
Kamata, Akihito; Tate, Richard – Journal of Educational Measurement, 2005
The goal of this study was to develop a procedure for predicting the equating error associated with the long-term equating method of Tate (2003) for mixed-format tests. An expression for the error of an equating based on multiple links, in terms of the errors of the component links, was derived and illustrated with simulated data…
Descriptors: Computer Simulation, Item Response Theory, Test Format, Evaluation Methods
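The Kamata and Tate abstract above turns on propagating equating error across a chain of links. A minimal sketch of that idea, assuming the component link errors are independent so their variances add along the chain (the specific combination rule in the Tate long-term method may differ):

```python
import math

def chained_equating_se(link_ses):
    """Standard error of an equating built from several successive links,
    assuming independent link errors: error variances add along the chain."""
    return math.sqrt(sum(se ** 2 for se in link_ses))

# Hypothetical standard errors for three successive links
print(chained_equating_se([0.08, 0.05, 0.06]))  # ~0.111
```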
Hedges, Larry V.; Vevea, Jack L. – 2003
A computer simulation study was conducted to investigate the amount of uncertainty added to National Assessment of Educational Progress (NAEP) estimates by equating error under three different equating methods, while varying a number of factors that might affect equating accuracy. Data from past NAEP administrations were used to guide the…
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Item Response Theory
Peer reviewed
DeMars, Christine E. – Educational and Psychological Measurement, 2005
Type I error rates for PARSCALE's fit statistic were examined. Data were generated to fit the partial credit or graded response model, with test lengths of 10 or 20 items. The ability distribution was simulated to be either normal or uniform. Type I error rates were inflated for the shorter test length and, for the graded response model, also for…
Descriptors: Test Length, Item Response Theory, Psychometrics, Error of Measurement
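The DeMars study rests on generating data that fit the graded response model and then checking how often the fit statistic rejects. A minimal sketch of the data-generation half, under Samejima's graded response model with logistic category boundaries; the item parameters below are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

def grm_response(theta, a, b):
    """Draw one graded response given ability theta, discrimination a,
    and ordered category boundaries b (Samejima's graded response model)."""
    p_star = 1.0 / (1.0 + np.exp(-a * (theta - np.asarray(b))))   # P(X >= k)
    category_probs = -np.diff(np.concatenate(([1.0], p_star, [0.0])))
    return rng.choice(len(b) + 1, p=category_probs)

# 1,000 normally distributed abilities, one 5-category item
thetas = rng.normal(size=1000)
responses = [grm_response(t, a=1.2, b=[-1.5, -0.5, 0.5, 1.5]) for t in thetas]
```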
Peer reviewed
Stark, Stephen; Drasgow, Fritz – Applied Psychological Measurement, 2002
Describes item response and information functions for the Zinnes and Griggs (1974) paired comparison item response theory (IRT) model and presents procedures for estimating stimulus and person parameters. Monte Carlo simulations show that at least 400 ratings are required to obtain reasonably accurate estimates of the stimulus parameters and their…
Descriptors: Comparative Analysis, Computer Simulation, Error of Measurement, Item Response Theory
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
Peer reviewed
Zwick, Rebecca; And Others – Applied Psychological Measurement, 1994
Simulated data were used to investigate the performance of modified versions of the Mantel-Haenszel method of differential item functioning (DIF) analysis in computerized adaptive tests (CAT). Results indicate that CAT-based DIF procedures perform well and support the use of item response theory-based matching variables in DIF analysis. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Error of Measurement
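The Mantel-Haenszel procedure referenced in the Zwick abstract compares item performance across two groups after stratifying on a matching variable (in a CAT, an IRT-based ability estimate). A minimal sketch of the common odds ratio at the heart of the method, with made-up stratified 2x2 counts:

```python
import math

def mantel_haenszel_odds_ratio(tables):
    """tables: list of (A, B, C, D) counts per matching stratum, where A/B are
    reference-group correct/incorrect and C/D are focal-group correct/incorrect.
    Returns the MH common odds ratio; values far from 1 suggest DIF."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return num / den

tables = [(40, 10, 35, 15), (30, 20, 25, 25), (20, 30, 15, 35)]
alpha = mantel_haenszel_odds_ratio(tables)
print(alpha, -2.35 * math.log(alpha))   # odds ratio and ETS delta-scale D-DIF
```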
Peer reviewed
Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1993
Item parameter estimation errors in test development are highlighted. The problem is illustrated with several simulated data sets, and a conservative solution is offered for addressing the problem in item response theory test development practice. Steps that reduce the problem of capitalizing on chance in item selections are suggested. (SLD)
Descriptors: Computer Simulation, Error of Measurement, Estimation (Mathematics), Item Banks
Hambleton, Ronald K.; Jones, Russell W. – 1993
Errors in item parameter estimates have a negative impact on the accuracy of item and test information functions. The estimation errors may be random, but because items with higher discriminating power are more likely to be selected for a test, and these items are most apt to contain positive errors, the result is that item information…
Descriptors: Computer Simulation, Error of Measurement, Estimation (Mathematics), Item Banks
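Both Hambleton abstracts above concern the gap between item and test information computed from estimated rather than true item parameters. A minimal sketch under the two-parameter logistic (2PL) model, where test information is the sum of item informations a²P(1-P); the parameter values and error sizes are illustrative only:

```python
import numpy as np

def tpl_information(theta, a, b):
    """Item information for the 2PL model: I(theta) = a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a ** 2 * p * (1.0 - p)

theta = 0.0
true_a, true_b = np.array([1.0, 1.2, 0.8]), np.array([-0.5, 0.0, 0.5])
est_a, est_b = true_a + 0.15, true_b - 0.05       # positive errors on discrimination

true_info = tpl_information(theta, true_a, true_b).sum()
est_info = tpl_information(theta, est_a, est_b).sum()
print(true_info, est_info)   # information is overstated when a is inflated
```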
Peer reviewed
De Ayala, R. J.; And Others – Journal of Educational Measurement, 1990
F. M. Lord's flexilevel computerized adaptive testing (CAT) procedure was compared to an item response theory-based CAT procedure that uses Bayesian ability estimation, with various standard errors of estimate used as the criterion for terminating the test. Ability estimates from flexilevel CATs were as accurate as those from Bayesian CATs. (TJH)
Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Comparative Analysis
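A minimal sketch of the Bayesian-CAT side of the comparison: select the most informative remaining item, update an EAP ability estimate over a discrete prior grid, and stop once the posterior standard error falls below a threshold. The 2PL item bank and the stopping value here are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
grid = np.linspace(-4, 4, 81)                  # discrete ability grid
posterior = np.exp(-0.5 * grid ** 2)           # standard normal prior (unnormalized)

a = rng.uniform(0.8, 2.0, size=200)            # hypothetical 2PL item bank
b = rng.normal(size=200)
true_theta, administered = 0.5, []

while True:
    eap = np.sum(grid * posterior) / posterior.sum()            # EAP estimate
    se = np.sqrt(np.sum((grid - eap) ** 2 * posterior) / posterior.sum())
    if se < 0.3 or len(administered) == 50:                     # SE-based termination
        break
    p_all = 1 / (1 + np.exp(-a * (eap - b)))
    info = a ** 2 * p_all * (1 - p_all)                         # 2PL item information
    info[administered] = -np.inf                                # never reuse an item
    item = int(np.argmax(info))
    p_true = 1 / (1 + np.exp(-a[item] * (true_theta - b[item])))
    x = rng.random() < p_true                                   # simulate the response
    p_grid = 1 / (1 + np.exp(-a[item] * (grid - b[item])))
    posterior *= p_grid if x else 1 - p_grid                    # Bayesian update
    administered.append(item)

print(len(administered), round(eap, 2), round(se, 2))
```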
Chang, Yu-Wen; Davison, Mark L. – 1992
Standard errors and bias of unidimensional and multidimensional ability estimates were compared in a factorial simulation design with two item response theory (IRT) approaches, two levels of test correlation (0.42 and 0.63), two sample sizes (500 and 1,000), and a hierarchical test content structure. Bias and standard errors of subtest scores…
Descriptors: Comparative Testing, Computer Simulation, Correlation, Error of Measurement
Peer reviewed
De Ayala, R. J. – Educational and Psychological Measurement, 1992
Effects of dimensionality on ability estimation of an adaptive test were examined using generated data in Bayesian computerized adaptive testing (CAT) simulations. Generally, increasing interdimensional difficulty association produced a slight decrease in test length and an increase in accuracy of ability estimation as assessed by root mean square…
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Computer Simulation
Linacre, John M. – 1990
Advantages and disadvantages of standard Rasch analysis computer programs are discussed. The unconditional maximum likelihood algorithm allows all observations to participate equally in determining the measures, and calibrations can be obtained quickly from a data set. On the advantage side, standard Rasch programs can be used immediately, are…
Descriptors: Algorithms, Computer Assisted Testing, Computer Graphics, Computer Simulation
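The unconditional (joint) maximum likelihood algorithm Linacre discusses iterates between person measures and item calibrations until expected scores match observed scores. A minimal sketch of that iteration for the dichotomous Rasch model on simulated data, using simple Newton-Raphson-style updates (this is a generic illustration, not any particular program's implementation):

```python
import numpy as np

rng = np.random.default_rng(2)
true_theta = rng.normal(size=50)                      # generating person measures
true_beta = rng.normal(size=10)                       # generating item difficulties
p_true = 1 / (1 + np.exp(-(true_theta[:, None] - true_beta[None, :])))
X = (rng.random((50, 10)) < p_true).astype(float)     # simulated Rasch responses

keep = (X.sum(1) > 0) & (X.sum(1) < 10)               # no finite estimates exist
X = X[keep]                                           # for zero or perfect scores

theta = np.zeros(X.shape[0])                          # person estimates
beta = np.zeros(X.shape[1])                           # item estimates

for _ in range(100):
    p = 1 / (1 + np.exp(-(theta[:, None] - beta[None, :])))
    # Move each person measure until the expected score matches the observed score
    theta += (X.sum(1) - p.sum(1)) / (p * (1 - p)).sum(1)
    p = 1 / (1 + np.exp(-(theta[:, None] - beta[None, :])))
    beta -= (X.sum(0) - p.sum(0)) / (p * (1 - p)).sum(0)
    beta -= beta.mean()                                # anchor: mean item difficulty 0
```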
Miller, Timothy R. – 1991
Two studies were carried out to evaluate the quality of multidimensional item response theory (MIRT) model parameter estimates obtained from the computer program NOHARM. The purpose of the first study was to compute empirical estimates of the standard errors of the parameters. In addition, the parameter estimates were evaluated for bias and the…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Equations (Mathematics)
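NOHARM itself is a standalone program, but the first study's approach of computing empirical standard errors, repeatedly simulating data and re-estimating, can be sketched generically: the standard deviation of an estimate across replications serves as its empirical standard error. The estimator below is a deliberately simple stand-in (proportion incorrect as a rough difficulty index), not NOHARM's multidimensional estimation:

```python
import numpy as np

rng = np.random.default_rng(3)

def simulate_and_estimate(n_persons=1000, a=1.0, b=0.2):
    """Simulate one 2PL item and return a stand-in difficulty estimate
    (proportion incorrect), for illustration only."""
    theta = rng.normal(size=n_persons)
    p = 1 / (1 + np.exp(-a * (theta - b)))
    x = rng.random(n_persons) < p
    return 1 - x.mean()

estimates = np.array([simulate_and_estimate() for _ in range(200)])
print(estimates.mean(), estimates.std(ddof=1))   # mean and empirical standard error
```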