ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	7

Descriptor

Tests	7
Scores	5
Statistical Analysis	4
Data	2
Error of Measurement	2
Goodness of Fit	2
Item Response Theory	2
Models	2
Prediction	2
Psychometrics	2
Scoring	2
Accuracy	1
College Entrance Examinations	1
Colleges	1
Computation	1
Correlation	1
Educational Assessment	1
English (Second Language)	1
Equated Scores	1
Graduate Study	1
Inferences	1
Institutions	1
Language Tests	1
Mathematical Models	1
Measurement Techniques	1
More ▼

Source

ETS Research Report Series	3
Educational Measurement:…	2
Journal of Educational and…	1
Psychometrika	1

Author

Haberman, Shelby J.	7
Sinharay, Sandip	3
Puhan, Gautam	2
Sinharay, Sadip	1

Publication Type

Journal Articles	7
Reports - Research	5
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Praxis Series	2
Graduate Record Examinations	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Application of Best Linear Prediction and Penalized Best Linear Prediction to ETS Tests. Research Report. ETS RR-20-08

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2020

Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].

Descriptors: Prediction, Scores, Tests, Testing Programs

Pseudo-Equivalent Groups and Linking

Peer reviewed

Direct link

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2015

Adjustment by minimum discriminant information provides an approach to linking test forms in the case of a nonequivalent groups design with no satisfactory common items. This approach employs background information on individual examinees in each administration so that weighted samples of examinees form pseudo-equivalent groups in the sense that…

Descriptors: Equated Scores, Statistical Analysis, Tests, Weighted Scores

How Often Is the Misfit of Item Response Theory Models Practically Significant?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2014

Standard 3.9 of the Standards for Educational and Psychological Testing ([, 1999]) demands evidence of model fit when item response theory (IRT) models are employed to data from tests. Hambleton and Han ([Hambleton, R. K., 2005]) and Sinharay ([Sinharay, S., 2005]) recommended the assessment of practical significance of misfit of IRT models, but…

Descriptors: Item Response Theory, Goodness of Fit, Models, Tests

An NCME Instructional Module on Subscores

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2011

The purpose of this ITEMS module is to provide an introduction to subscores. First, examples of subscores from an operational test are provided. Then, a review of methods that can be used to examine if subscores have adequate psychometric quality is provided. It is demonstrated, using results from operational and simulated data, that subscores…

Descriptors: Scores, Psychometrics, Tests, Data

Reporting of Subscores Using Multidimensional Item Response Theory

Peer reviewed

Direct link

Haberman, Shelby J.; Sinharay, Sandip – Psychometrika, 2010

Recently, there has been increasing interest in reporting subscores. This paper examines reporting of subscores using multidimensional item response theory (MIRT) models (e.g., Reckase in "Appl. Psychol. Meas." 21:25-36, 1997; C.R. Rao and S. Sinharay (Eds), "Handbook of Statistics, vol. 26," pp. 607-642, North-Holland, Amsterdam, 2007; Beguin &…

Descriptors: Item Response Theory, Psychometrics, Statistical Analysis, Scores

Outliers in Assessments. Research Report. ETS RR-08-41

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Outliers in assessments are often treated as a nuisance for data analysis; however, they can also assist in quality assurance. Their frequency can suggest problems with form codes, scanning accuracy, ability of examinees to enter responses as they intend, or exposure of items.

Descriptors: Educational Assessment, Quality Assurance, Scores, Regression (Statistics)

Subscores for Institutions. Research Report. ETS RR-06-13

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Sinharay, Sadip; Puhan, Gautam – ETS Research Report Series, 2006

Recently, there has been an increasing level of interest in reporting subscores. This paper examines the issue of reporting subscores at an aggregate level, especially at the level of institutions that the examinees belong to. A series of statistical analyses is suggested to determine when subscores at the institutional level have any added value…

Descriptors: Scores, Statistical Analysis, Error of Measurement, Reliability