ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	10

Descriptor

Statistical Distributions	11
Computation	6
Equated Scores	6
Scores	5
Accuracy	4
Comparative Analysis	4
Models	4
Error of Measurement	3
Item Response Theory	3
Probability	3
Statistical Analysis	3
Bayesian Statistics	2
Difficulty Level	2
Goodness of Fit	2
Sampling	2
Scoring	2
Test Reliability	2
Ability Grouping	1
Adaptive Testing	1
Alternative Assessment	1
Aptitude Tests	1
Causal Models	1
Cluster Grouping	1
College Entrance Examinations	1
Computer Oriented Programs	1
More ▼

Source

ETS Research Report Series

Author

Haberman, Shelby J.	2
Livingston, Samuel A.	2
Moses, Tim	2
Walker, Michael E.	2
von Davier, Alina A.	2
Ali, Usama S.	1
Casabianca, Jodi	1
Chen, Haiwen H.	1
Donoghue, John R.	1
Dorans, Neil J.	1
Guo, Hongwen	1
Hess, Melinda R.	1
Kim, Sooyeon	1
McClellan, Catherine A.	1
Moses, Tim P.	1
Oh, Hyeonjoo J.	1
Yoo, Hanwook Henry	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	11

Education Level

Higher Education	2
Postsecondary Education	2
High Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Graduate Record Examinations	1
National Merit Scholarship…	1
Preliminary Scholastic…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Providing a Context for Interpreting Predictions of Job Performance. Research Report. ETS RR-18-38

Peer reviewed
PDF on ERIC

Download full text

Dorans, Neil J. – ETS Research Report Series, 2018

A distinction is made between scores as measures of a construct and predictions of a criterion or outcome variable. The interpretation attached to predictions of criteria, such as job performance or college grade point average (GPA), differs from that attached to scores that are measures of a construct, such as reading proficiency or knowledge…

Descriptors: Job Performance, Scores, Data Interpretation, Statistical Distributions

Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

Peer reviewed
PDF on ERIC

Download full text

Livingston, Samuel A.; Chen, Haiwen H. – ETS Research Report Series, 2015

Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…

Descriptors: Scores, Statistical Distributions, Research Reports, Equated Scores

Enhancing the Equating of Item Difficulty Metrics: Estimation of Reference Distribution. Research Report. ETS RR-14-07

Peer reviewed
PDF on ERIC

Download full text

Ali, Usama S.; Walker, Michael E. – ETS Research Report Series, 2014

Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, or the item response curve (IRC) method, involves the summation of conditional…

Descriptors: Difficulty Level, Test Items, Equated Scores, Causal Models

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

Peer reviewed
PDF on ERIC

Download full text

Livingston, Samuel A. – ETS Research Report Series, 2014

In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…

Descriptors: Demography, Equated Scores, Cluster Grouping, Ability Grouping

Continuous Exponential Families: An Equating Tool. Research Report. ETS RR-08-05

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Continuous exponential families may be employed to find continuous distributions with the same initial moments as the discrete distributions encountered in typical applications of classical equating. These continuous distributions provide distribution functions and quantile functions that may be employed in equating. To illustrate, an application…

Descriptors: Equated Scores, Statistical Distributions, Probability, Computation

Linking with Continuous Exponential Families: Single-Group Designs. Research Report. ETS RR-08-61

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Continuous exponential families are applied to linking forms via a single-group design. In this application, a distribution from the continuous bivariate exponential family is used that has selected moments that match those of the bivariate distribution of scores on the forms to be linked. The selected continuous bivariate distribution then yields…

Descriptors: Equated Scores, Probability, Statistical Distributions, Models

Improved Reliability Estimates for Small Samples Using Empirical Bayes Techniques. Research Report. ETS RR-09-46

Peer reviewed
PDF on ERIC

Download full text

Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009

Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…

Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics

Loglinear Smoothing: An Alternative Numerical Approach Using SAS. Research Report. ETS RR-04-27

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; von Davier, Alina A.; Casabianca, Jodi – ETS Research Report Series, 2004

The purpose of this report is to demonstrate loglinear smoothing using SAS PROC GENMOD. The results from four published examples, which include the smoothing of a) univariate distributions, b) bivariate distributions, c) distributions with teeth, and d) bivariate distributions with structural zeros, are reproduced to show the flexibility of the…

Descriptors: Statistical Analysis, Statistical Distributions, Comparative Analysis, Graphs

A SAS Macro for Loglinear Smoothing: Applications and Implications April. Research Report. ETS RR-06-05

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim P.; von Davier, Alina A. – ETS Research Report Series, 2006

The two purposes of this paper are to provide a SAS IML macro that performs loglinear smoothing and to apply this macro to loglinear smoothing problems that have not been extensively discussed in the test-equating literature. The SAS macro is demonstrated on univariate, bivariate, and trivariate smoothing problems. The univariate and bivariate…

Descriptors: Statistical Analysis, Statistical Distributions, Equated Scores, Models