ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Source

Educational Measurement:…

Author

Crisp, Victoria	1
Faulkner-Bond, Molly	1
Guion, Robert M.	1
Messick, Samuel	1
Reckase, Mark D.	1
Rubright, Jonathan D.	1
Walker, A. Adrienne	1
Wind, Stefanie A.	1
Wolf, Mikyung Kim	1

Publication Type

Journal Articles	7
Reports - Research	4
Opinion Papers	2
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

United Kingdom

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Impact of Both Local Item Dependencies and Cut-Point Locations on Examinee Classifications

Peer reviewed

Direct link

Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2018

Performance assessments, scenario-based tasks, and other groups of items carry a risk of violating the local item independence assumption made by unidimensional item response theory (IRT) models. Previous studies have identified negative impacts of ignoring such violations, most notably inflated reliability estimates. Still, the influence of this…

Descriptors: Performance Based Assessment, Item Response Theory, Models, Test Reliability

Validating English Language Proficiency Assessment Uses for English Learners: Academic Language Proficiency and Content Assessment Performance

Peer reviewed

Direct link

Wolf, Mikyung Kim; Faulkner-Bond, Molly – Educational Measurement: Issues and Practice, 2016

States use standards-based English language proficiency (ELP) assessments to inform relatively high-stakes decisions for English learner (EL) students. Results from these assessments are one of the primary criteria used to determine EL students' level of ELP and readiness for reclassification. The results are also used to evaluate the…

Descriptors: High Stakes Tests, Language Proficiency, Hierarchical Linear Modeling, Scores

An Investigation of Rater Cognition in the Assessment of Projects

Peer reviewed

Direct link

Crisp, Victoria – Educational Measurement: Issues and Practice, 2012

In the United Kingdom, the majority of national assessments involve human raters. The processes by which raters determine the scores to award are central to the assessment process and affect the extent to which valid inferences can be made from assessment outcomes. Thus, understanding rater cognition has become a growing area of research in the…

Descriptors: Foreign Countries, Scores, Protocol Analysis, Social Influences

Commentary on Values and Standards in Performance Assessment.

Peer reviewed

Guion, Robert M. – Educational Measurement: Issues and Practice, 1995

This commentary discusses three essential themes in performance assessment and its scoring. First, scores should mean something. Second, performance scores should permit fair and meaningful comparisons. Third, validity-reducing errors should be minimal. Increased attention to performance assessment may overcome these problems. (SLD)

Descriptors: Educational Assessment, Performance Based Assessment, Scores, Scoring

Standards of Validity and the Validity of Standards in Performance Assessment.

Peer reviewed

Messick, Samuel – Educational Measurement: Issues and Practice, 1995

Six distinguishable aspects of construct validity are discussed as they apply to performance assessment, emphasizing content, substantive, structural, generalizability, external, and consequential aspects. Taken together, these aspects provide a way to address validity questions in score interpretation and use. (SLD)

Descriptors: Construct Validity, Content Validity, Educational Assessment, Generalization

Portfolio Assessment: A Theoretical Estimate of Score Reliability.

Peer reviewed

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995

An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)

Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

Performance Based Assessment	7
Scores	7
Educational Assessment	3
Comparative Analysis	2
Models	2
Scoring	2
Standards	2
Test Reliability	2
Alternative Assessment	1
Construct Validity	1
Content Validity	1
Cost Estimates	1
Decision Making	1
Educational Legislation	1
Educational Theories	1
English (Second Language)	1
English Language Learners	1
Evaluation Methods	1
Evaluators	1
Evidence	1
Federal Legislation	1
Foreign Countries	1
Generalization	1
Goodness of Fit	1
Hierarchical Linear Modeling	1
More ▼