Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling

Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for, they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement

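The underestimation Phillips describes can be illustrated with a small sketch using the Kish design effect, DEFF = 1 + (m − 1)ρ, which inflates the variance of a mean under cluster sampling. The function names and the numbers below are illustrative assumptions, not values from the article:

```python
import math

def design_effect(cluster_size: float, icc: float) -> float:
    """Kish design effect for equal-sized clusters: DEFF = 1 + (m - 1) * rho."""
    return 1.0 + (cluster_size - 1.0) * icc

def standard_error(sd: float, n: int, deff: float = 1.0) -> float:
    """Standard error of a mean, optionally inflated by the design effect."""
    return sd * math.sqrt(deff / n)

# Hypothetical assessment: 2,000 students sampled as 80 classrooms of 25,
# with a modest intraclass correlation of 0.10.
sd, n = 15.0, 2000
deff = design_effect(cluster_size=25, icc=0.10)

srs_se = standard_error(sd, n)         # naive: treats the sample as SRS
true_se = standard_error(sd, n, deff)  # accounts for clustering

print(f"DEFF = {deff:.2f}")
print(f"naive SE = {srs_se:.3f}, clustered SE = {true_se:.3f}")
```

Even this modest intraclass correlation inflates the standard error by a factor of √3.4 ≈ 1.84, so a "significant" score gain under the naive calculation may not be significant at all.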
Sireci, Stephen G.; Hauger, Jeffrey B.; Wells, Craig S.; Shea, Christine; Zenisky, April L. – Applied Measurement in Education, 2009
The National Assessment Governing Board used a new method to set achievement level standards on the 2005 Grade 12 NAEP Math test. In this article, we summarize our independent evaluation of the process used to set these standards. The evaluation data included observations of the standard-setting meeting, observations of advisory committee meetings…
Descriptors: Advisory Committees, Mathematics Tests, Standard Setting, National Competency Tests

Bart, William M.; Williams-Morris, Ruth – Applied Measurement in Education, 1990
Refined item digraph analysis (RIDA) is a way of studying diagnostic and prescriptive testing. It permits assessment of a test item's diagnostic value by examining the extent to which the item has properties of ideal items. RIDA is illustrated with the Orange Juice Test, which assesses the proportionality concept. (TJH)
Descriptors: Diagnostic Tests, Evaluation Methods, Item Analysis, Mathematical Models

Byrne, Barbara M. – Applied Measurement in Education, 1990
Methodological procedures used in validating the theoretical structure of academic self-concept and validating associated measurement instruments are reviewed. Substantive findings from research related to modes of inquiry are summarized, and recommendations for future research are outlined. (TJH)
Descriptors: Classification, Construct Validity, Evaluation Methods, Literature Reviews

Johnson, Robert L.; Penny, Jim; Fisher, Steve; Kuhs, Therese – Applied Measurement in Education, 2003
When raters assign different scores to a performance task, a method for resolving rating differences is required to report a single score to the examinee. Recent studies indicate that decisions about examinees, such as pass/fail decisions, differ across resolution methods. Previous studies also investigated the interrater reliability of…
Descriptors: Test Reliability, Test Validity, Scores, Interrater Reliability
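The point that pass/fail decisions differ across resolution methods can be sketched concretely. The three methods below (averaging, taking the higher score, and third-rater adjudication that averages the two closest scores) are common options in the performance-assessment literature, not necessarily the ones Johnson et al. studied, and the cut score and ratings are hypothetical:

```python
def resolve_average(r1, r2):
    """Average the two original ratings."""
    return (r1 + r2) / 2

def resolve_higher(r1, r2):
    """Award the higher of the two original ratings."""
    return max(r1, r2)

def resolve_tertium(r1, r2, r3):
    """Third-rater adjudication: keep the two closest scores and average them."""
    pairs = [(abs(a - b), a, b) for a, b in [(r1, r2), (r1, r3), (r2, r3)]]
    _, a, b = min(pairs)
    return (a + b) / 2

cut = 4.0
r1, r2, r3 = 3, 5, 3  # discrepant pair plus an adjudicator's score

for name, score in [
    ("average",     resolve_average(r1, r2)),
    ("higher",      resolve_higher(r1, r2)),
    ("adjudicated", resolve_tertium(r1, r2, r3)),
]:
    print(f"{name:11s} -> {score:.1f} ({'pass' if score >= cut else 'fail'})")
```

With these ratings the averaging and higher-score methods pass the examinee while adjudication fails them, which is exactly the kind of decision disagreement the abstract describes.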

Mehrens, William A.; Popham, W. James – Applied Measurement in Education, 1992
This paper discusses how to determine whether a test was developed in a legally defensible manner, reviewing general issues, specific cases bearing on different types of test use, some evaluative dimensions, and evidence of test quality. Tests constructed and used according to existing standards will generally stand legal scrutiny. (SLD)
Descriptors: College Entrance Examinations, Compliance (Legal), Constitutional Law, Court Litigation