ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	12

Descriptor

Simulation	33
True Scores	33
Item Response Theory	16
Equated Scores	12
Error of Measurement	10
Correlation	7
Comparative Analysis	6
Computation	6
Test Reliability	6
Estimation (Mathematics)	5
Reliability	5
Test Items	5
Bayesian Statistics	4
Difficulty Level	4
Evaluation Methods	4
Measurement Techniques	4
Sample Size	4
Statistical Analysis	4
Test Format	4
Accuracy	3
Achievement Tests	3
Computer Programs	3
Item Analysis	3
Mathematical Models	3
Models	3
More ▼

Source

Applied Psychological…	6
Applied Measurement in…	5
Journal of Educational…	2
Journal of Educational and…	2
ProQuest LLC	2
Educational Assessment	1
Educational Sciences: Theory…	1
International Journal of…	1
Multivariate Behavioral…	1

Publication Type

Journal Articles	19
Reports - Research	16
Reports - Evaluative	13
Speeches/Meeting Papers	5
Dissertations/Theses -…	2
Reports - Descriptive	2

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 33 results Save | Export

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

Peer reviewed

Direct link

Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores

Standards-Based Grading: History Adjusted True Score

Peer reviewed

Direct link

Hooper, Jay; Cowell, Ryan – Educational Assessment, 2014

There has been much research and discussion on the principles of standards-based grading, and there is a growing consensus of best practice. Even so, the actual process of implementing standards-based grading at a school or district level can be a significant challenge. There are very practical questions that remain unclear, such as how the grades…

Descriptors: True Scores, Grading, Academic Standards, Computation

Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

Direct link

Lee, Eunjung – ProQuest LLC, 2013

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

Descriptors: Equated Scores, Tests, Comparative Analysis, Item Response Theory

Relationships of Measurement Error and Prediction Error in Observed-Score Regression

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2012

The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…

Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Investigating the Impact of Compromised Anchor Items on IRT Equating under the Nonequivalent Anchor Test Design

Peer reviewed

Direct link

Jurich, Daniel P.; DeMars, Christine E.; Goodman, Joshua T. – Applied Psychological Measurement, 2012

The prevalence of high-stakes test scores as a basis for significant decisions necessitates the dissemination of accurate and fair scores. However, the magnitude of these decisions has created an environment in which examinees may be prone to resort to cheating. To reduce the risk of cheating, multiple test forms are commonly administered. When…

Descriptors: High Stakes Tests, Scores, Prevention, Cheating

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Coping with Memory Effect and Serial Correlation when Estimating Reliability in a Longitudinal Framework

Peer reviewed

Direct link

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010

Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…

Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics

Standard Errors of Estimated Latent Variable Scores with Estimated Structural Parameters

Peer reviewed

Direct link

Hoshino, Takahiro; Shigemasu, Kazuo – Applied Psychological Measurement, 2008

The authors propose a concise formula to evaluate the standard error of the estimated latent variable score when the true values of the structural parameters are not known and must be estimated. The formula can be applied to factor scores in factor analysis or ability parameters in item response theory, without bootstrap or Markov chain Monte…

Descriptors: Monte Carlo Methods, Markov Processes, Factor Analysis, Computation

Estimation of Graded Response Model Parameters Using MULTILOG.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1997

Describes an idiosyncracy of the MULTILOG (D. Thissen, 1991) parameter estimation process discovered during a simulation study involving the graded response model. A misordering reflected in boundary function location parameter estimates resulted in a large negative contribution to the true score followed by a large positive contribution. These…

Descriptors: Estimation (Mathematics), Simulation, True Scores

Effect of Simultaneous Violations of Essential Tau-Equivalence and Uncorrelated Error on Coefficient Alpha.

Peer reviewed

Komaroff, Eugene – Applied Psychological Measurement, 1997

Evaluated coefficient alpha under violations of two classical test theory assumptions: essential tau-equivalence and uncorrelated errors through simulation. Discusses the interactive effects of both violations with true and error scores. Provides empirical evidence of the derivation of M. Novick and C. Lewis (1993). (SLD)

Descriptors: Correlation, Reliability, Simulation, Test Theory

An Empirical Bayes Approach to Subscore Augmentation: How Much Strength Can We Borrow?

Peer reviewed

Direct link

Edwards, Michael C.; Vevea, Jack L. – Journal of Educational and Behavioral Statistics, 2006

This article examines a subscore augmentation procedure. The approach uses empirical Bayes adjustments and is intended to improve the overall accuracy of measurement when information is scant. Simulations examined the impact of the method on subscale scores in a variety of realistic conditions. The authors focused on two popular scoring methods:…

Descriptors: Geometric Concepts, True Scores, Scoring, Item Response Theory

Improved Type I Error Control and Reduced Estimation Bias for DIF Detection Using SIBTEST.

Peer reviewed

Jiang, Hai; Stout, William – Journal of Educational and Behavioral Statistics, 1998

Proposes a new regression correction for the SIBTEST statistical tests (R. Shealy and W. Stout, 1993) that essentially uses a two-segment piecewise linear regression of the true on observed matching subtest scores. A simulation study illustrates the approach. (SLD)

Descriptors: Estimation (Mathematics), Item Bias, Regression (Statistics), Simulation

Factors Affecting the Sample Invariant Properties of Linear and Curvilinear Observed- and True-Score Equating Procedures.

Download full text

Stocking, Martha L.; And Others – 1988

A sequence of simulations was carried out to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker linear observed-score equating; (2) Levine equally reliable linear observed-score equating; (3) equipercentile curvilinear…

Descriptors: Equated Scores, Item Response Theory, Sample Size, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Cizek, Gregory J.	2
Eignor, Daniel R.	2
Lee, Won-Chan	2
Algina, James	1
Alonso, Ariel	1
Andrews, Benjamin James	1
Baker, Frank B.	1
Bekhuis, Tanja C. H. M.	1
Bolt, Daniel M.	1
Boughton, Keith A.	1
Brennan, Robert L.	1
Cohen, Allan S.	1
Cowell, Ryan	1
DeMars, Christine E.	1
Edwards, Michael C.	1
Gierl, Mark J.	1
Goodman, Joshua T.	1
Gotzmann, Andrea	1
Hartig, Johannes	1
Hau, Kit-Tai	1
Hicks, Marilyn M.	1
Holzel, Britta	1
Hooper, Jay	1
Hoshino, Takahiro	1
More ▼