ERIC - Search Results

Showing all 7 results

Prescribing Structure for Validation Arguments: Elemental, Structural, and Ecological Validity

Peer reviewed

Jacobson, Erik; Svetina, Dubravka – Applied Measurement in Education, 2019

Contingent argument-based approaches to validity require a unique argument for each use, in contrast to more prescriptive approaches that identify the common kinds of validity evidence researchers should consider for every use. In this article, we evaluate our use of an approach that is both prescriptive "and" argument-based to develop a…

Descriptors: Test Validity, Test Items, Test Construction, Test Interpretation

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

Prologue: An Introduction to the Evaluation of NAEP

Peer reviewed

Lane, Suzanne; Zumbo, Bruno D.; Abedi, Jamal; Benson, Jeri; Dossey, John; Elliott, Stephen N.; Kane, Michael; Linn, Robert; Paredes-Ziker, Cindy; Rodriguez, Michael; Schraw, Gregg; Slattery, Jean; Thomas, Veronica; Willhoft, Joe – Applied Measurement in Education, 2009

Given the changing landscape of educational accountability at the local, state, and national levels, and the changes in the uses of the National Assessment of Educational Progress (NAEP), including the evolving uses of NAEP as a policy tool to interpret state assessment and accountability systems, an explicit statement of the current and potential…

Descriptors: National Competency Tests, Academic Achievement, Accountability, Test Validity

Evaluation of the National Assessment of Educational Progress: Next Steps

Peer reviewed

Noell, Jay; Ginsburg, Alan – Applied Measurement in Education, 2009

The report, "Evaluation of the National Assessment of Educational Progress", provides a number of recommendations for addressing validity concerns about NAEP. This article identifies actions that could be taken by the Congress, the National Center for Education Statistics, and the National Assessment Governing Board--which share responsibility for…

Descriptors: National Competency Tests, Federal Government, Public Agencies, Test Validity

Psychometric Issues in Testing Students with Disabilities

Peer reviewed

Geisinger, Kurt F. – Applied Measurement in Education, 1994

Federal law requires that individuals with handicapping conditions be administered assessments in ways that accommodate their disabilities without penalizing them. Validation studies are needed to evaluate the meaning of scores resulting from nonstandard test administrations. The limited number of these studies to date is reviewed. (SLD)

Descriptors: Disabilities, Educational Assessment, Elementary School Students, Elementary Secondary Education

Customized Tests and Customized Norms

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

Quality Control in the Development and Use of Performance Assessments

Peer reviewed

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991

Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)

Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques
