Showing all 12 results
Mislevy, Robert J.; Behrens, John T.; Bennett, Randy E.; Demark, Sarah F.; Frezzo, Dennis C.; Levy, Roy; Robinson, Daniel H.; Rutstein, Daisy Wise; Shute, Valerie J.; Stanley, Ken; Winters, Fielding I. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
People use external knowledge representations (EKRs) to identify, depict, transform, store, share, and archive information. Learning how to work with EKRs is central to becoming proficient in virtually every discipline. As such, EKRs play central roles in curriculum, instruction, and assessment. Five key roles of EKRs in educational assessment are…
Descriptors: Educational Assessment, Computer Networks, Test Construction, Computer Assisted Testing
Peer reviewed
Huynh, Huynh – Journal of Educational Statistics, 1986
Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)
Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas
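Huynh's result extends the classical Spearman-Brown prophecy formula to truncated data. A minimal Python sketch of the classical (untruncated) formula follows; the truncation-adjusted version derived in the article is not reproduced here, and the example values are illustrative.

    def spearman_brown(rho_1: float, k: float) -> float:
        """Classical Spearman-Brown prophecy formula.

        rho_1: reliability of a single component (or the current form).
        k: factor by which the test length is multiplied.
        Returns the predicted reliability of the lengthened composite.
        """
        return k * rho_1 / (1.0 + (k - 1.0) * rho_1)

    # Example: a test with reliability .70, doubled in length.
    print(spearman_brown(0.70, 2))  # ~0.82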
Peer reviewed
Harris, Deborah J.; Subkoviak, Michael J. – Educational and Psychological Measurement, 1986
This study examined three statistical methods for selecting items for mastery tests: (1) pretest-posttest; (2) latent trait; and (3) agreement statistics. The correlation between the latent trait method and agreement statistics, proposed here as an alternative, was substantial. Results for the pretest-posttest method confirmed its reputed…
Descriptors: Computer Simulation, Correlation, Item Analysis, Latent Trait Theory
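The abstract compares agreement statistics with pretest-posttest and latent trait methods for selecting mastery-test items. A minimal sketch of one simple agreement index, the proportion of examinees whose item outcome (right/wrong) matches their mastery classification, follows; the specific statistic studied by Harris and Subkoviak may differ, and the data are invented.

    import numpy as np

    def item_agreement(item_correct: np.ndarray, is_master: np.ndarray) -> float:
        """Proportion of examinees whose item outcome agrees with mastery status.

        item_correct: 0/1 array, 1 = answered the item correctly.
        is_master:    0/1 array, 1 = classified as a master on the criterion.
        An item supports mastery decisions well when agreement is high.
        """
        return float(np.mean(item_correct == is_master))

    # Illustrative data: 8 examinees.
    item = np.array([1, 1, 0, 1, 0, 0, 1, 0])
    master = np.array([1, 1, 0, 0, 0, 1, 1, 0])
    print(item_agreement(item, master))  # 0.75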
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
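The study equates forms through common (anchor) items under IRT. As a sketch of one standard linking step in such designs, the mean/sigma method computes slope and intercept constants from anchor-item difficulty estimates; whether Morrison and Fitzpatrick used this particular method is not stated in the abstract, and the values below are illustrative.

    import numpy as np

    def mean_sigma_constants(b_new, b_base):
        """Mean/sigma linking from common-item difficulties.

        b_new, b_base: difficulty estimates of the anchor items on the
        new-form and base-form scales. Returns (A, B) such that
        theta_base = A * theta_new + B.
        """
        b_new, b_base = np.asarray(b_new), np.asarray(b_base)
        A = b_base.std(ddof=1) / b_new.std(ddof=1)
        B = b_base.mean() - A * b_new.mean()
        return A, B

    # Ten anchor items (illustrative values).
    b_new = np.array([-1.2, -0.8, -0.3, 0.0, 0.2, 0.5, 0.9, 1.1, 1.4, 1.8])
    b_base = 1.1 * b_new + 0.15          # pretend the base scale differs
    A, B = mean_sigma_constants(b_new, b_base)
    print(A, B)                          # ~1.1, ~0.15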
Peer reviewed
Jarjoura, David; Kolen, Michael J. – Journal of Educational Statistics, 1985
An equating design in which two groups of examinees from slightly different populations are administered different test forms with a subset of common items is widely used. This paper presents standard errors for an equipercentile equating procedure under this design, along with a simulation that verifies the equations for large samples. (Author/BS)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Estimation (Mathematics)
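A minimal sketch of the core equipercentile transformation, e(x) = G^{-1}(F(x)), which maps a form-X score to the form-Y score holding the same percentile rank; the common-item weighting and the standard-error formulas that are the paper's contribution are not reproduced here.

    import numpy as np

    def equipercentile(x_scores, y_scores, x):
        """Equipercentile equivalent of score x on form X, in the form-Y metric.

        Finds the form-Y score with the same percentile rank that x has
        on form X: e(x) = G^{-1}(F(x)), interpolating between observed
        Y scores.
        """
        x_scores, y_scores = np.sort(x_scores), np.sort(y_scores)
        p = np.searchsorted(x_scores, x, side="right") / len(x_scores)
        return float(np.quantile(y_scores, p))

    rng = np.random.default_rng(0)
    form_x = rng.normal(50, 10, 2000)   # illustrative score samples
    form_y = rng.normal(55, 12, 2000)
    print(equipercentile(form_x, form_y, 60.0))  # Y score at the same rank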
van der Linden, Wim J.; Adema, Jos J. – 1988
Two optimization models for the construction of tests with a maximal value of coefficient alpha are given. Both models have a linear form and can be solved by using a branch-and-bound algorithm. The first model assumes an item bank calibrated under the Rasch model and can be used, for instance, when classical test theory has to serve as an…
Descriptors: Algorithms, Computer Simulation, Estimation (Mathematics), Foreign Countries
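Both models select items so as to maximize coefficient alpha. A minimal sketch of alpha itself, alpha = k/(k-1) * (1 - sum of item variances / variance of the total score), follows; the 0-1 programming formulations and branch-and-bound search described in the report are not reproduced, and the data are simulated.

    import numpy as np

    def coefficient_alpha(scores: np.ndarray) -> float:
        """Cronbach's coefficient alpha for an examinees-by-items score matrix."""
        k = scores.shape[1]
        item_var = scores.var(axis=0, ddof=1).sum()
        total_var = scores.sum(axis=1).var(ddof=1)
        return k / (k - 1) * (1.0 - item_var / total_var)

    rng = np.random.default_rng(1)
    ability = rng.normal(size=500)
    # Illustrative 0/1 responses to 10 items driven by a common ability.
    probs = 1 / (1 + np.exp(-(ability[:, None] - rng.normal(size=10))))
    data = (rng.random((500, 10)) < probs).astype(float)
    print(coefficient_alpha(data))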
Peer reviewed
Stout, William – Psychometrika, 1987
A procedure--based on item response theory--for testing the hypothesis of unidimensionality of the latent space is proposed. Use of the procedure is supported by an asymptotic theory and a Monte Carlo simulation study. The procedure tests for unidimensionality in test construction and/or compares two tests. (SLD)
Descriptors: College Entrance Examinations, Computer Simulation, Equations (Mathematics), Hypothesis Testing
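Stout's procedure builds on the fact that, conditional on the latent trait, a unidimensional model implies essentially zero item covariances. The sketch below is a deliberately simplified diagnostic in that spirit, conditioning on number-correct score as a crude trait proxy; it is not Stout's asymptotically justified T statistic.

    import numpy as np

    def mean_conditional_cov(resp: np.ndarray, min_group: int = 20) -> float:
        """Average off-diagonal item covariance within number-correct groups.

        Under unidimensionality this tends to be near zero (slightly
        negative, since conditioning on the total induces small negative
        dependence); clearly positive values hint at extra dimensions.
        """
        total = resp.sum(axis=1)
        vals = []
        for s in np.unique(total):
            grp = resp[total == s]
            if len(grp) < min_group:
                continue
            c = np.cov(grp, rowvar=False)
            vals.append(c[~np.eye(c.shape[0], dtype=bool)].mean())
        return float(np.mean(vals))

    rng = np.random.default_rng(7)
    theta = rng.normal(size=(3000, 1))
    b = rng.normal(size=20)
    uni = (rng.random((3000, 20)) < 1 / (1 + np.exp(-(theta - b)))).astype(float)
    print(mean_conditional_cov(uni))   # near zero for unidimensional data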
Weitzman, R. A. – 1982
The goal of this research was to predict from a recruit's responses to the Armed Services Vocational Aptitude Battery (ASVAB) items whether the recruit would pass the Armed Forces Qualification Test (AFQT). The data consisted of the responses (correct/incorrect) of 1,020 Navy recruits to 200 items of the ASVAB together with the scores of these…
Descriptors: Adults, Armed Forces, Computer Oriented Programs, Computer Simulation
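The abstract does not name the prediction model, so the following is only an assumed stand-in: a logistic regression from item scores to a pass/fail criterion, fit by plain gradient ascent, with simulated data sized to match the report (1,020 recruits, 200 items).

    import numpy as np

    rng = np.random.default_rng(2)
    n, k = 1020, 200                        # recruits, ASVAB items (per abstract)
    X = (rng.random((n, k)) < 0.6).astype(float)
    true_w = rng.normal(0, 0.1, k)
    pass_afqt = (X @ true_w + rng.normal(0, 1, n)
                 > true_w.sum() * 0.6).astype(float)

    w, b = np.zeros(k), 0.0
    for _ in range(1000):                   # gradient ascent on the log-likelihood
        p = 1 / (1 + np.exp(-(X @ w + b)))
        w += 0.02 * (X.T @ (pass_afqt - p) / n)
        b += 0.02 * (pass_afqt - p).mean()

    p = 1 / (1 + np.exp(-(X @ w + b)))
    print(((p > 0.5).astype(float) == pass_afqt).mean())  # training accuracy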
Peer reviewed
Harrison, David A. – Journal of Educational Statistics, 1986
Multidimensional item response data were created. The strength of a general factor, the number of common factors, the distribution of items loading on common factors, and the number of items in simulated tests were manipulated. LOGIST effectively recovered both item and trait parameters in nearly all of the experimental conditions. (Author/JAZ)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Correlation
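A minimal sketch of how multidimensional item response data of the kind described can be generated, using a compensatory MIRT model with a dominant general factor; the dimensionality, loadings, and sample size are illustrative, not Harrison's design values.

    import numpy as np

    rng = np.random.default_rng(3)
    n_persons, n_items, n_dims = 1000, 40, 3

    theta = rng.normal(size=(n_persons, n_dims))        # latent traits
    a = np.abs(rng.normal(0.3, 0.1, size=(n_items, n_dims)))
    a[:, 0] += 1.0                                      # strong general factor
    d = rng.normal(size=n_items)                        # intercepts

    # Compensatory MIRT: P(correct) = logistic(theta . a_j + d_j)
    logits = theta @ a.T + d
    responses = (rng.random((n_persons, n_items))
                 < 1 / (1 + np.exp(-logits))).astype(int)
    print(responses.mean(axis=0)[:5])                   # item p-values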
Ackerman, Terry A. – 1987
The purpose of this study was to investigate the effect of using multidimensional items in a computer adaptive test (CAT) setting which assumes a unidimensional item response theory (IRT) framework. Previous research has suggested that the composite of multidimensional abilities being estimated by a unidimensional IRT model is not constant…
Descriptors: Adaptive Testing, College Entrance Examinations, Computer Assisted Testing, Computer Simulation
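The study's setting is a CAT that assumes a unidimensional IRT framework. As a sketch of the unidimensional machinery involved, the step below selects the next item by maximum Fisher information under a 2PL model; the item pool and ability value are invented.

    import numpy as np

    def fisher_info_2pl(theta, a, b):
        """Fisher information of 2PL items at ability theta: a^2 * p * (1 - p)."""
        p = 1 / (1 + np.exp(-a * (theta - b)))
        return a**2 * p * (1 - p)

    def next_item(theta_hat, a, b, administered):
        """Pick the unused item with maximum information at theta_hat."""
        info = fisher_info_2pl(theta_hat, a, b)
        info[administered] = -np.inf
        return int(np.argmax(info))

    rng = np.random.default_rng(4)
    a = rng.uniform(0.8, 2.0, 50)         # discriminations (illustrative pool)
    b = rng.normal(0, 1, 50)              # difficulties
    used = np.zeros(50, dtype=bool)
    print(next_item(0.0, a, b, used))     # first item chosen at theta_hat = 0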
Vale, C. David; Gialluca, Kathleen A. – 1985
ASCAL is a microcomputer-based program for calibrating items according to the three-parameter logistic model of item response theory. It uses a modified multivariate Newton-Raphson procedure for estimating item parameters. This study evaluated that procedure using Monte Carlo simulation techniques. The current version of ASCAL was then compared to…
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Computer Simulation
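A minimal sketch of the three-parameter logistic model that ASCAL calibrates, plus a one-parameter Fisher-scoring update for difficulty b as a simplified stand-in for ASCAL's modified multivariate Newton-Raphson procedure (which updates a, b, and c jointly); the data and parameter values are simulated.

    import numpy as np

    D = 1.7  # scaling constant conventionally used with the 3PL model

    def p3pl(theta, a, b, c):
        """Three-parameter logistic item characteristic curve."""
        return c + (1 - c) / (1 + np.exp(-D * a * (theta - b)))

    def update_b(theta, u, a, b, c):
        """One Fisher-scoring step for difficulty b, with a and c held fixed."""
        pstar = 1 / (1 + np.exp(-D * a * (theta - b)))
        p = c + (1 - c) * pstar
        dp_db = -(1 - c) * D * a * pstar * (1 - pstar)
        score = np.sum((u - p) / (p * (1 - p)) * dp_db)
        info = np.sum(dp_db**2 / (p * (1 - p)))
        return b + score / info

    rng = np.random.default_rng(5)
    theta = rng.normal(size=2000)
    u = (rng.random(2000) < p3pl(theta, a=1.2, b=0.5, c=0.2)).astype(float)
    b = 0.0
    for _ in range(10):
        b = update_b(theta, u, a=1.2, b=b, c=0.2)
    print(b)  # approaches the generating value of 0.5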
Levine, Michael V.; Drasgow, Fritz – 1984
Some examinees' test-taking behavior may be so idiosyncratic that their scores are not comparable to the scores of more typical examinees. Appropriateness indices, which provide quantitative measures of response-pattern atypicality, can be viewed as statistics for testing a null hypothesis of normal test-taking behavior against an alternative…
Descriptors: Cheating, College Entrance Examinations, Computer Simulation, Estimation (Mathematics)
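One widely used appropriateness index from Levine and Drasgow's line of work is the standardized log-likelihood statistic l_z: the log likelihood of a response pattern, centered and scaled by its model-implied mean and variance. A minimal sketch under assumed 2PL probabilities follows; whether this is the exact index analyzed in this report is not stated in the abstract.

    import numpy as np

    def lz_index(u, p):
        """Standardized log-likelihood appropriateness index l_z.

        u: 0/1 response pattern; p: model-implied probabilities of a
        correct response at the examinee's ability estimate. Large
        negative l_z flags patterns unlikely under the model.
        """
        l0 = np.sum(u * np.log(p) + (1 - u) * np.log(1 - p))
        mean = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))
        var = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)
        return (l0 - mean) / np.sqrt(var)

    rng = np.random.default_rng(6)
    p = 1 / (1 + np.exp(-(0.5 - rng.normal(size=40))))   # 2PL-style probabilities
    typical = (rng.random(40) < p).astype(float)          # model-consistent pattern
    aberrant = 1 - typical                                # flipped responses
    print(lz_index(typical, p), lz_index(aberrant, p))    # ~0 vs. strongly negative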