Showing all 7 results
Peer reviewed
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Peer reviewed
Longford, Nicholas Tibor – Journal of Educational and Behavioral Statistics, 2016
We address the problem of selecting the best of a set of units based on a criterion variable, when its value is recorded for every unit subject to estimation, measurement, or another source of error. The solution is constructed in a decision-theoretical framework, incorporating the consequences (ramifications) of the various kinds of error that…
Descriptors: Decision Making, Classification, Guidelines, Undergraduate Students
Peer reviewed
PDF on ERIC
Sekercioglu, Güçlü – International Online Journal of Education and Teaching, 2018
Empirical evidence of measurement invariance across independent samples of a population implies that the factor structure of a measurement tool is equal across these samples; in other words, the tool measures the intended psychological trait within the same structure. In this case, the evidence of construct validity would be strengthened within the…
Descriptors: Factor Analysis, Error of Measurement, Factor Structure, Construct Validity
Peer reviewed
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises as to what extent reducing test length degrades decision quality due to the increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
Peer reviewed
Birnbaum, Michael H. – Psychological Review, 2011
This article contrasts 2 approaches to analyzing transitivity of preference and other behavioral properties in choice data. The approach of Regenwetter, Dana, and Davis-Stober (2011) assumes that on each choice, a decision maker samples randomly from a mixture of preference orders to determine whether "A" is preferred to "B." In contrast, Birnbaum…
Descriptors: Evidence, Testing, Computation, Probability
Kifer, Edward; Bramble, William – 1974
A latent trait model, the Rasch model, was fitted to a criterion-referenced test. Approximately 90 percent of the items fit the model. The items that fit the model were then calibrated. Based on the item calibration, individual ability estimates and the standard errors of those estimates were calculated. Using the ability estimates, it was possible,…
Descriptors: Academic Ability, Achievement Tests, Criterion Referenced Tests, Decision Making
Wilde, Elizabeth Ty; Hollister, Robinson – Institute for Research on Poverty, 2002
In this study we test the performance of some nonexperimental estimators of impacts applied to an educational intervention--reduction in class size--where achievement test scores were the outcome. We compare the nonexperimental estimates of the impacts to "true impact" estimates provided by a random-assignment design used to assess the…
Descriptors: Computation, Outcome Measures, Achievement Tests, Scores