ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	10

Descriptor

Prediction	12
Computation	5
Accuracy	4
Error of Measurement	4
Reliability	4
Scores	4
Scoring	4
True Scores	4
Classification	3
Correlation	3
Language Tests	3
Regression (Statistics)	3
Statistical Analysis	3
College Entrance Examinations	2
English (Second Language)	2
Equations (Mathematics)	2
Essays	2
Graduate Study	2
Measurement Techniques	2
Probability	2
Sampling	2
Second Language Learning	2
Test Theory	2
Tests	2
Validity	2
More ▼

Source

ETS Research Report Series	8
Journal of Educational and…	2
Educational Testing Service	1
Psychometrika	1

Author

Haberman, Shelby J.	12
Sinharay, Sandip	2
Puhan, Gautam	1
Qian, Jiahe	1
Sinharay, Sadip	1
Yao, Lili	1
Zhang, Mo	1

Publication Type

Journal Articles	11
Reports - Research	8
Reports - Descriptive	2
Reports - Evaluative	2

Education Level

Higher Education	3
Postsecondary Education	3
High Schools	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Praxis Series	3
Graduate Record Examinations	2
Test of English as a Foreign…	2

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Cross-Validation and "U"-Statistics. Research Report. ETS RR-19-27

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2019

Cross-validation is a common statistical procedure applied to problems that are otherwise computationally intractable. It is often employed to assess the effectiveness of prediction procedures. In this report, cross-validation is discussed in terms of "U"-statistics. This approach permits consideration of the statistical properties of…

Descriptors: Statistical Analysis, Generalization, Prediction, Computation

Measures of Agreement versus Measures of Prediction Accuracy. Research Report. ETS RR-19-20

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2019

Measures of agreement are compared to measures of prediction accuracy within a general context. Differences in appropriate use are emphasized, and approaches are examined for both numerical and nominal variables. General estimation methods are developed, and their large-sample properties are compared.

Descriptors: Measurement Techniques, Classification, Prediction, Accuracy

Application of Best Linear Prediction and Penalized Best Linear Prediction to ETS Tests. Research Report. ETS RR-20-08

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2020

Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].

Descriptors: Prediction, Scores, Tests, Testing Programs

Prediction of Writing True Scores in Automated Scoring of Essays by Best Linear Predictors and Penalized Best Linear Predictors. Research Report. ETS RR-19-13

Peer reviewed
PDF on ERIC

Download full text

Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019

Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…

Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring

How Does the Knowledge of Subgroup Membership of Examinees Affect the Prediction of True Subscores? Research Report. ETS RR-11-43

Download full text

Haberman, Shelby J.; Sinharay, Sandip – Educational Testing Service, 2011

Subscores are reported for several operational assessments. Haberman (2008) suggested a method based on classical test theory to determine if the true subscore is predicted better by the corresponding subscore or the total score. Researchers are often interested in learning how different subgroups perform on subtests. Stricker (1993) and…

Descriptors: True Scores, Test Theory, Prediction, Group Membership

The Application of the Cumulative Logistic Regression Model to Automated Essay Scoring

Peer reviewed

Direct link

Haberman, Shelby J.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

Most automated essay scoring programs use a linear regression model to predict an essay score from several essay features. This article applied a cumulative logit model instead of the linear regression model to automated essay scoring. Comparison of the performances of the linear regression model and the cumulative logit model was performed on a…

Descriptors: Scoring, Regression (Statistics), Essays, Computer Software

Linear Prediction of a True Score from a Direct Estimate and Several Derived Estimates

Peer reviewed

Direct link

Haberman, Shelby J.; Qian, Jiahe – Journal of Educational and Behavioral Statistics, 2007

Statistical prediction problems often involve both a direct estimate of a true score and covariates of this true score. Given the criterion of mean squared error, this study determines the best linear predictor of the true score given the direct estimate and the covariates. Results yield an extension of Kelley's formula for estimation of the true…

Descriptors: Prediction, Regression (Statistics), True Scores, Correlation

Subscores and Validity. Research Report. ETS RR-08-64

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…

Descriptors: Scores, Validity, Educational Testing, Correlation

Subscores for Institutions. Research Report. ETS RR-06-13

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Sinharay, Sadip; Puhan, Gautam – ETS Research Report Series, 2006

Recently, there has been an increasing level of interest in reporting subscores. This paper examines the issue of reporting subscores at an aggregate level, especially at the level of institutions that the examinees belong to. A series of statistical analyses is suggested to determine when subscores at the institutional level have any added value…

Descriptors: Scores, Statistical Analysis, Error of Measurement, Reliability

Bias in Estimation of Misclassification Rates

Peer reviewed

Direct link

Haberman, Shelby J. – Psychometrika, 2006

When a simple random sample of size n is employed to establish a classification rule for prediction of a polytomous variable by an independent variable, the best achievable rate of misclassification is higher than the corresponding best achievable rate if the conditional probability distribution is known for the predicted variable given the…

Descriptors: Bias, Computation, Sample Size, Classification

Joint and Conditional Maximum Likelihood Estimation for the Rasch Model for Binary Responses. Research Report. RR-04-20

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2004

The usefulness of joint and conditional maximum-likelihood is considered for the Rasch model under realistic testing conditions in which the number of examinees is very large and the number is items is relatively large. Conditions for consistency and asymptotic normality are explored, effects of model error are investigated, measures of prediction…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Testing

Probability Prediction and Classification. Research Report. RR-04-19

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2004

Criteria for prediction of multinomial responses are examined in terms of estimation bias. Logarithmic penalty and least squares are quite similar in behavior but quite different from maximum probability. The differences ultimately reflect deficiencies in the behavior of the criterion of maximum probability.

Descriptors: Probability, Prediction, Classification, Computation