Showing 1 to 15 of 18 results
Peer reviewed
Direct link
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
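The entry above compares coefficient alpha with omega-family and other internal-consistency indices. As an illustrative sketch only (not code from the study), coefficient alpha can be computed directly from an examinees-by-items score matrix:

```python
import numpy as np

def cronbach_alpha(scores):
    """Coefficient alpha for an examinees x items score matrix."""
    s = np.asarray(scores, dtype=float)
    k = s.shape[1]                         # number of items
    item_vars = s.var(axis=0, ddof=1)      # per-item sample variances
    total_var = s.sum(axis=1).var(ddof=1)  # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Toy data: 4 examinees x 3 dichotomous items
x = [[1, 1, 1],
     [1, 0, 1],
     [0, 0, 1],
     [0, 0, 0]]
print(cronbach_alpha(x))  # 0.75
```

Omega-family indices, by contrast, require a fitted factor model rather than raw item variances, which is why they behave differently under multidimensionality.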
Peer reviewed
Direct link
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
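For contrast with the SDT-based measures the abstract describes, the classical item-analysis quantities it references can be sketched as follows (a hedged illustration of standard CIA statistics, not the paper's SDT measures):

```python
import numpy as np

def classical_item_stats(responses):
    """Classical item difficulty (proportion correct) and discrimination
    (corrected item-total point-biserial) for a 0/1 examinees x items matrix."""
    r = np.asarray(responses, dtype=float)
    difficulty = r.mean(axis=0)            # p-value per item
    total = r.sum(axis=1)
    disc = []
    for j in range(r.shape[1]):
        rest = total - r[:, j]             # total score with item j removed
        disc.append(np.corrcoef(r[:, j], rest)[0, 1])
    return difficulty, np.array(disc)

# Toy Guttman-like data: 4 examinees x 3 items
r = [[1, 1, 1],
     [1, 1, 0],
     [1, 0, 0],
     [0, 0, 0]]
d, disc = classical_item_stats(r)
print(d)     # [0.75 0.5  0.25]
```

The SDT formulation instead models whether an examinee knows an item as a latent "true split," which is why its difficulty and discrimination measures are related to, but distinct from, these sample statistics.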
Peer reviewed
PDF on ERIC (full text)
Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017
The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…
Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing
Peer reviewed
PDF on ERIC (full text)
Kogar, Hakan – International Journal of Assessment Tools in Education, 2018
The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats, and sample sizes on latent…
Descriptors: Simulation, Context Effect, Computation, Statistical Analysis
Peer reviewed
Direct link
Reimann, Peter; Kickmeier-Rust, Michael; Albert, Dietrich – Computers & Education, 2013
This paper explores the relation between problem solving learning environments (PSLEs) and assessment concepts. The general framework of evidence-centered assessment design is used to describe PSLEs in terms of assessment concepts, and to identify similarities between the process of assessment design and of PSLE design. We use a recently developed…
Descriptors: Teaching Methods, Psychometrics, Problem Solving, Test Theory
Peer reviewed
Direct link
Sinharay, Sandip – Journal of Educational Measurement, 2010
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman suggested a method based on classical test theory to determine whether subscores have added value over total scores. In this article I first provide a rich collection of results regarding when subscores were found to have added…
Descriptors: Scores, Test Theory, Simulation, Reliability
Peer reviewed
PDF on ERIC (full text)
Zhang, Jinming – ETS Research Report Series, 2004
This paper extends the theory of conditional covariances to polytomous items. It has been mathematically proven that under some mild conditions, commonly assumed in the analysis of response data, the conditional covariance of two items, dichotomously or polytomously scored, is positive if the two items are dimensionally homogeneous and negative…
Descriptors: Test Items, Test Theory, Correlation, National Competency Tests
Bogan, Evelyn Doody; Yen, Wendy M. – 1983
Four multidimensional data configurations and one unidimensional data configuration were simulated for three differences in mean difficulty between two tests to be equated. Two chi-square statistics, Q1 and Q2, were examined for their ability to detect multidimensionality. Results indicated that Q1 did not discriminate between any of the…
Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Latent Trait Theory
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Cope, Ronald T. – 1986
Comparisons were made of three Angoff Design V linear equating methods (two forms equated to a common test, two forms predicted by a common test, or two forms used to predict a common test) and Tucker's and R. Levine's linear methods, under common item linear equating with non-equivalent populations. Forms of a professional certification test…
Descriptors: Certification, Comparative Analysis, Equated Scores, Higher Education
Becker, Betsy Jane – 1986
This paper discusses distribution theory and power computations for four common "tests of combined significance." These tests are calculated using one-sided sample probabilities or p values from independent studies (or hypothesis tests), and provide an overall significance level for the series of results. Noncentral asymptotic sampling…
Descriptors: Achievement Tests, Correlation, Effect Size, Hypothesis Testing
Weitzman, R. A. – 1982
The goal of this research was to predict from a recruit's responses to the Armed Services Vocational Aptitude Battery (ASVAB) items whether the recruit would pass the Armed Forces Qualification Test (AFQT). The data consisted of the responses (correct/incorrect) of 1,020 Navy recruits to 200 items of the ASVAB together with the scores of these…
Descriptors: Adults, Armed Forces, Computer Oriented Programs, Computer Simulation
Vale, C. David; And Others – 1981
A simulation study to determine appropriate linking methods for adaptive testing items was designed. Three basic data sets for responses were created. These were randomly sampled, systematically sampled, and selected data sets. The evaluative criteria used were fidelity of parameter estimation, asymptotic ability estimates, root-mean-square error…
Descriptors: Adaptive Testing, Aptitude Tests, Armed Forces, Bayesian Statistics
Hambleton, Ronald K.; Cook, Linda L. – 1978
The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…
Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis
Marshall, J. Laird – 1976
A summary is provided of the rationale for questioning the applicability of classical reliability measures to criterion referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; a presentation of the mean split-half coefficient of agreement, a single-administration test index…
Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making
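The entry above extends classical split-half reliability to an agreement index for dichotomous decisions. As a point of reference (an illustrative sketch of the classical split-half coefficient it builds on, not Marshall's agreement index), the standard single-administration estimate correlates two half-test scores and steps the result up with the Spearman-Brown formula:

```python
import numpy as np

def split_half_reliability(scores):
    """Classical split-half reliability: correlate odd- and even-item
    half scores, then apply the Spearman-Brown step-up formula."""
    s = np.asarray(scores, dtype=float)
    odd = s[:, 0::2].sum(axis=1)           # score on odd-numbered items
    even = s[:, 1::2].sum(axis=1)          # score on even-numbered items
    r_half = np.corrcoef(odd, even)[0, 1]  # half-test correlation
    return 2 * r_half / (1 + r_half)       # Spearman-Brown stepped-up estimate

# Toy data: 4 examinees x 4 dichotomous items
r = [[1, 1, 1, 1],
     [1, 1, 1, 0],
     [1, 0, 0, 0],
     [0, 0, 0, 0]]
print(split_half_reliability(r))  # 0.9
```

The criterion-referenced critique summarized in these entries is precisely that such correlation-based coefficients collapse when score variance is restricted, motivating decision-consistency indices instead.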