ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	12

Descriptor

Error of Measurement	19
Statistical Analysis	19
Testing	19
Test Interpretation	5
Comparative Analysis	4
Equated Scores	4
Scoring	4
Evaluation Methods	3
Models	3
Psychometrics	3
Simulation	3
Test Items	3
Test Reliability	3
College Entrance Examinations	2
Computation	2
Foreign Countries	2
Hypothesis Testing	2
Item Response Theory	2
Middle School Students	2
Probability	2
Responses	2
Robustness (Statistics)	2
Scores	2
Statistical Bias	2
Structural Equation Models	2
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
Psychometrika	2
ACT, Inc.	1
Asia Pacific Journal of…	1
Audio-Visual Language Journal	1
International Journal of…	1
Measurement and Evaluation in…	1
Practical Assessment,…	1
Psychological Methods	1
Psychological Review	1
More ▼

Publication Type

Journal Articles	13
Reports - Research	11
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Guides - General	1
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Higher Education	2
Middle Schools	2
Postsecondary Education	2
High Schools	1
Junior High Schools	1
Secondary Education	1

Audience

Researchers

Location

Japan	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
National Longitudinal Survey…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed
PDF on ERIC

Download full text

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

Examining Measurement Invariance and Differential Item Functioning with Discrete Latent Construct Indicators: A Note on a Multiple Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018

A latent variable modeling method for studying measurement invariance when evaluating latent constructs with multiple binary or binary scored items with no guessing is outlined. The approach extends the continuous indicator procedure described by Raykov and colleagues, utilizes similarly the false discovery rate approach to multiple testing, and…

Descriptors: Models, Statistical Analysis, Error of Measurement, Test Bias

Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

Peer reviewed

Direct link

Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018

Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

Descriptors: Error of Measurement, Testing, Scores, Models

Effect of Multiple Testing Adjustment in Differential Item Functioning Detection

Peer reviewed

Direct link

Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013

In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…

Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

Choosing among Tucker or Chained Linear Equating in Two Testing Situations: Rater Comparability Scoring and Randomly Equivalent Groups with an Anchor

Peer reviewed

Direct link

Puhan, Gautam – Journal of Educational Measurement, 2012

Tucker and chained linear equatings were evaluated in two testing scenarios. In Scenario 1, referred to as rater comparability scoring and equating, the anchor-to-total correlation is often very high for the new form but moderate for the reference form. This may adversely affect the results of Tucker equating, especially if the new and reference…

Descriptors: Testing, Scoring, Equated Scores, Statistical Analysis

Robust Structural Equation Modeling with Missing Data and Auxiliary Variables

Peer reviewed

Direct link

Yuan, Ke-Hai; Zhang, Zhiyong – Psychometrika, 2012

The paper develops a two-stage robust procedure for structural equation modeling (SEM) and an R package "rsem" to facilitate the use of the procedure by applied researchers. In the first stage, M-estimates of the saturated mean vector and covariance matrix of all variables are obtained. Those corresponding to the substantive variables…

Descriptors: Structural Equation Models, Tests, Federal Aid, Psychometrics

Testing Mixture Models of Transitive Preference: Comment on Regenwetter, Dana, and Davis-Stober (2011)

Peer reviewed

Direct link

Birnbaum, Michael H. – Psychological Review, 2011

This article contrasts 2 approaches to analyzing transitivity of preference and other behavioral properties in choice data. The approach of Regenwetter, Dana, and Davis-Stober (2011) assumes that on each choice, a decision maker samples randomly from a mixture of preference orders to determine whether "A" is preferred to "B." In contrast, Birnbaum…

Descriptors: Evidence, Testing, Computation, Probability

Accumulative Equating Error after a Chain of Linear Equatings

Peer reviewed

Direct link

Guo, Hongwen – Psychometrika, 2010

After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…

Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores

Testing for Factorial Invariance of the Modified Leadership Scale for Sports: Using a Japanese Version

Peer reviewed

Direct link

Kwon, Hyungil Harry; Pyun, Do Young; Han, Siwan; Ogasawara, Etsuko – Asia Pacific Journal of Education, 2011

The objective of this study was to provide empirical evidence to support psychometric properties of a modified four-dimensional model of the Leadership Scale for Sports (LSS). The study tested invariance of all parameters (i.e., factor loadings, error variances, and factor variances-covariances) in the four-dimensional measurement model between…

Descriptors: Feedback (Response), Testing, Athletes, Factor Structure

A Generally Robust Approach for Testing Hypotheses and Setting Confidence Intervals for Effect Sizes

Peer reviewed

Direct link

Keselman, H. J.; Algina, James; Lix, Lisa M.; Wilcox, Rand R.; Deering, Kathleen N. – Psychological Methods, 2008

Standard least squares analysis of variance methods suffer from poor power under arbitrarily small departures from normality and fail to control the probability of a Type I error when standard assumptions are violated. This article describes a framework for robust estimation and testing that uses trimmed means with an approximate degrees of…

Descriptors: Intervals, Testing, Least Squares Statistics, Effect Size

The Relative Efficiency of Two Tests as a Function of Ability Level.

Download full text

Lord, Frederic M. – 1973

A new formula is developed for the relative efficiency of two tests measuring the same trait. The formula expresses relative efficiency solely in terms of the standard errors of measurement and, surprisingly, the frequency distributions of true scores. Approximate methods for estimating relative efficiency may make this function routinely…

Descriptors: Error of Measurement, Research Reports, Statistical Analysis, Test Interpretation

A Simple, Accurate Approximation of the Standard Error of Measurement.

Peer reviewed

Garvin, Alfred D. – Journal of Educational Measurement, 1976

A simple, usefully accurate approximation of the standard error of measurement is proposed for use by classroom teachers. An empirical comparison with Lord's approximation indicated that, though not as easy to calculate as Lord's, this approximation is more practical because it is useful at any point in the score distribution. (BW)

Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Formulas, Statistical Analysis

A Reassessment of Standard Error of Measurement.

Download full text

Klaas, Alan C. – 1975

Current usage and theory of standard error of measurement calls for one standard error of measurement figure to be used across all levels of scoring. The study revealed that scoring variance across scoring levels is not constant. As scoring ability increases scoring variance decreases. The assertion that low and high scoring subjects will…

Descriptors: Error of Measurement, Guessing (Tests), Scoring, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2

Algina, James	2
Lord, Frederic M.	2
Birnbaum, Michael H.	1
Cummings, Oliver W.	1
Deering, Kathleen N.	1
Dimitrov, Dimiter M.	1
Garvin, Alfred D.	1
Guo, Hongwen	1
Han, Siwan	1
Harris, Deborah J.	1
Hsiao, Yu-Yu	1
Inga Laukaityte	1
Keselman, H. J.	1
Kilmen, Sevilay	1
Kim, Jihye	1
Klaas, Alan C.	1
Kwok, Oi-Man	1
Kwon, Hyungil Harry	1
Lai, Mark H. C.	1
Li, Dongmei	1
Li, Tatyana	1
Lix, Lisa M.	1
Marcoulides, George A.	1
Marie Wiberg	1
Menold, Natalja	1
More ▼