ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	32

Descriptor

Equated Scores	24
Statistical Analysis	18
Comparative Analysis	13
Error of Measurement	13
Accuracy	12
Computation	11
Scores	10
Simulation	9
Sample Size	7
Models	6
Differences	5
Regression (Statistics)	5
Statistical Bias	5
Correlation	4
Statistical Significance	4
Test Construction	4
Tests	4
Data Analysis	3
Mathematics Tests	3
Multiple Choice Tests	3
Psychometrics	3
Reliability	3
Scaling	3
Statistical Distributions	3
Test Format	3
More ▼

Source

ETS Research Report Series	13
Journal of Educational…	8
Educational Testing Service	4
Applied Psychological…	2
Educational and Psychological…	2
Journal of Educational and…	2
Educational Measurement:…	1

Author

Moses, Tim	32
Holland, Paul	4
Kim, Sooyeon	4
Deng, Weiling	3
Holland, Paul W.	2
Liu, Jinghua	2
Zhang, Wenmin	2
Zhang, Yu-Li	2
Dorans, Neil	1
Dorans, Neil J.	1
Grant, Mary	1
Kim, YoungKoung	1
McHale, Fred	1
Miao, Jing	1
Oh, Hyeonjoo	1
Oh, Hyeonjoo J.	1
Puhan, Gautam	1
Tan, Adele	1
Wilson, Christine	1
Yang, Wen-Ling	1
Yoo, Hanwook Henry	1
More ▼

Publication Type

Journal Articles	28
Reports - Research	19
Reports - Evaluative	12
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Praxis Series	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Moses, Tim X

Showing 1 to 15 of 32 results Save | Export

Linking and Comparability across Conditions of Measurement: Established Frameworks and Proposed Updates

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2022

One result of recent changes in testing is that previously established linking frameworks may not adequately address challenges in current linking situations. Test linking through equating, concordance, vertical scaling or battery scaling may not represent linkings for the scores of tests developed to measure constructs differently for different…

Descriptors: Measures (Individuals), Educational Assessment, Test Construction, Comparative Analysis

Stabilizing Conditional Standard Errors of Measurement in Scale Score Transformations

Peer reviewed

Direct link

Moses, Tim; Kim, YoungKoung – Journal of Educational Measurement, 2017

The focus of this article is on scale score transformations that can be used to stabilize conditional standard errors of measurement (CSEMs). Three transformations for stabilizing the estimated CSEMs are reviewed, including the traditional arcsine transformation, a recently developed general variance stabilization transformation, and a new method…

Descriptors: Error of Measurement, Scores, Comparative Analysis, Item Response Theory

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Direct link

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Alternative Smoothing and Scaling Strategies for Weighted Composite Scores

Peer reviewed

Direct link

Moses, Tim – Educational and Psychological Measurement, 2014

In this study, smoothing and scaling approaches are compared for estimating subscore-to-composite scaling results involving composites computed as rounded and weighted combinations of subscores. The considered smoothing and scaling approaches included those based on raw data, on smoothing the bivariate distribution of the subscores, on smoothing…

Descriptors: Weighted Scores, Scaling, Data Analysis, Comparative Analysis

Adjoined Piecewise Linear Approximations (APLAs) for Equating: Accuracy Evaluations of a Postsmoothing Equating Method

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2013

The purpose of this study was to evaluate the use of adjoined and piecewise linear approximations (APLAs) of raw equipercentile equating functions as a postsmoothing equating method. APLAs are less familiar than other postsmoothing equating methods (i.e., cubic splines), but their use has been described in historical equating practices of…

Descriptors: Equated Scores, Accuracy, Simulation, Comparative Analysis

Smoothing and Equating Methods Applied to Different Types of Test Score Distributions and Evaluated with Respect to Multiple Equating Criteria. Research Report. ETS RR-11-20

Download full text

Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011

In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…

Descriptors: Equated Scores, Data Analysis, Scores, Methods

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Evaluating Ranking Strategies in Assessing Change when the Measures Differ across Time

Peer reviewed

Direct link

Moses, Tim; Kim, Sooyeon – Educational and Psychological Measurement, 2012

In this study, a ranking strategy was evaluated for comparing subgroups' change using identical, equated, and nonidentical measures. Four empirical data sets were evaluated, each of which contained examinees' scores on two occasions, where the two occasions' scores were obtained on a single identical measure, on two equated tests, and on two…

Descriptors: Testing, Change, Scores, Measures (Individuals)

ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim – ETS Research Report Series, 2013

The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)

The Effects of Selection Strategies for Bivariate Loglinear Smoothing Models on NEAT Equating Functions

Peer reviewed

Direct link

Moses, Tim; Holland, Paul W. – Journal of Educational Measurement, 2010

In this study, eight statistical strategies were evaluated for selecting the parameterizations of loglinear models for smoothing the bivariate test score distributions used in nonequivalent groups with anchor test (NEAT) equating. Four of the strategies were based on significance tests of chi-square statistics (Likelihood Ratio, Pearson,…

Descriptors: Equated Scores, Models, Statistical Distributions, Statistical Analysis

Relationships of Measurement Error and Prediction Error in Observed-Score Regression

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2012

The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…

Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores

Research on Standard Errors of Equating Differences. Research Report. ETS RR-10-25

Download full text

Moses, Tim; Zhang, Wenmin – Educational Testing Service, 2010

In this paper, the "standard error of equating difference" (SEED) is described in terms of originally proposed kernel equating functions (von Davier, Holland, & Thayer, 2004) and extended to incorporate traditional linear and equipercentile functions. These derivations expand on prior developments of SEEDs and standard errors of equating and…

Descriptors: Equated Scores, Simulation, Testing, Statistical Analysis

Standard Errors of Equating Differences: Prior Developments, Extensions, and Simulations

Peer reviewed

Direct link

Moses, Tim; Zhang, Wenmin – Journal of Educational and Behavioral Statistics, 2011

The purpose of this article was to extend the use of standard errors for equated score differences (SEEDs) to traditional equating functions. The SEEDs are described in terms of their original proposal for kernel equating functions and extended so that SEEDs for traditional linear and traditional equipercentile equating functions can be computed.…

Descriptors: Equated Scores, Error Patterns, Evaluation Research, Statistical Analysis

Comparison of the One- and Bi-Direction Chained Equipercentile Equating

Peer reviewed

Direct link

Oh, Hyeonjoo; Moses, Tim – Journal of Educational Measurement, 2012

This study investigated differences between two approaches to chained equipercentile (CE) equating (one- and bi-direction CE equating) in nearly equal groups and relatively unequal groups. In one-direction CE equating, the new form is linked to the anchor in one sample of examinees and the anchor is linked to the reference form in the other…

Descriptors: Equated Scores, Statistical Analysis, Comparative Analysis, Differences

Previous Page | Next Page »

Pages: 1 | 2 | 3