ERIC - Search Results

Showing 1 to 15 of 25 results

Comparison of Two Approaches to Interpretive Use Arguments

Peer reviewed

Carney, Michele; Crawford, Angela; Siebert, Carl; Osguthorpe, Rich; Thiede, Keith – Applied Measurement in Education, 2019

The "Standards for Educational and Psychological Testing" recommend an argument-based approach to validation that involves a clear statement of the intended interpretation and use of test scores, the identification of the underlying assumptions and inferences in that statement--termed the interpretation/use argument, and gathering of…

Descriptors: Inquiry, Test Interpretation, Validity, Scores

Challenges to the Cattell-Horn-Carroll Theory: Empirical, Clinical, and Policy Implications

Peer reviewed

Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019

The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…

Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests

The One and the Many: Enduring Legacies of Spearman and Thurstone on Intelligence Test Score Interpretation

Peer reviewed

Beaujean, A. Alexander; Benson, Nicholas F. – Applied Measurement in Education, 2019

Charles Spearman and L. L. Thurstone were pioneers in the field of intelligence. They not only developed methods to assess and understand intelligence, but also developed theories about its structure and function. Methodologically, their approaches were not that distinct, but their theories of intelligence were philosophically very different --…

Descriptors: Psychologists, Intelligence Tests, Scores, Theories

Integrating Validation Arguments with the Assessment Triangle: A Framework for Operationalizing and Instantiating Validation

Peer reviewed

Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019

Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…

Descriptors: Validity, Educational Assessment, Models, Screening Tests

Prescribing Structure for Validation Arguments: Elemental, Structural, and Ecological Validity

Peer reviewed

Jacobson, Erik; Svetina, Dubravka – Applied Measurement in Education, 2019

Contingent argument-based approaches to validity require a unique argument for each use, in contrast to more prescriptive approaches that identify the common kinds of validity evidence researchers should consider for every use. In this article, we evaluate our use of an approach that is both prescriptive "and" argument-based to develop a…

Descriptors: Test Validity, Test Items, Test Construction, Test Interpretation

A Multilevel Factor Analysis of Third-Party Evaluations of Noncognitive Constructs Used in Admissions Decision Making

Peer reviewed

Oliveri, Maria; McCaffrey, Daniel; Ezzo, Chelsea; Holtzman, Steven – Applied Measurement in Education, 2017

The assessment of noncognitive traits is challenging due to possible response biases, "subjectivity" and "faking." Standardized third-party evaluations where an external evaluator rates an applicant on their strengths and weaknesses on various noncognitive traits are a promising alternative. However, accurate score-based…

Descriptors: Factor Analysis, Decision Making, College Admission, Likert Scales

Evaluating Score and Decision Consistency across Claims in a Validation Argument

Peer reviewed

Schmidgall, Jonathan – Applied Measurement in Education, 2017

This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…

Descriptors: Scores, Reliability, Validity, Generalizability Theory

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

Motivation Filtering on a Multi-Institution Assessment of General College Outcomes

Peer reviewed

Steedle, Jeffrey T. – Applied Measurement in Education, 2014

Possible lack of motivation is a perpetual concern when tests have no stakes attached to performance. Specifically, the validity of test score interpretations may be compromised when examinees are unmotivated to exert their best efforts. Motivation filtering, a procedure that filters out apparently unmotivated examinees, was applied to the…

Descriptors: College Outcomes Assessment, Student Motivation, Sampling, Validity

Evidence-Centered Assessment Design as a Foundation for Achievement-Level Descriptor Development and for Standard Setting

Peer reviewed

Plake, Barbara S.; Huff, Kristen; Reshetar, Rosemary – Applied Measurement in Education, 2010

In many large-scale assessment programs, achievement level descriptors (ALDs) provide a critical role in communicating what scores on the assessment mean and in interpreting what examinees know and are able to do based on their test performance. Based on their test performance, examinees are often classified into performance categories. The…

Descriptors: Evidence, Test Construction, Measurement, Standard Setting

Prologue: An Introduction to the Evaluation of NAEP

Peer reviewed

Lane, Suzanne; Zumbo, Bruno D.; Abedi, Jamal; Benson, Jeri; Dossey, John; Elliott, Stephen N.; Kane, Michael; Linn, Robert; Paredes-Ziker, Cindy; Rodriguez, Michael; Schraw, Gregg; Slattery, Jean; Thomas, Veronica; Willhoft, Joe – Applied Measurement in Education, 2009

Given the changing landscape of educational accountability at the local, state, and national levels, and the changes in the uses of the National Assessment of Educational Progress (NAEP), including the evolving uses of NAEP as a policy tool to interpret state assessment and accountability systems, an explicit statement of the current and potential…

Descriptors: National Competency Tests, Academic Achievement, Accountability, Test Validity

Conducting a Lifecycle Audit of the National Assessment of Educational Progress

Peer reviewed

Buckendahl, Chad W.; Plake, Barbara S.; Davis, Susan L. – Applied Measurement in Education, 2009

The National Assessment of Educational Progress (NAEP) program is a series of periodic assessments administered nationally to samples of students and designed to measure different content areas. This article describes a multi-year study that focused on the breadth of the development, administration, maintenance, and renewal of the assessments in…

Descriptors: National Competency Tests, Audits (Verification), Testing Programs, Program Evaluation

Using a Taxonomy of Differential Step Functioning to Improve the Interpretation of DIF in Polytomous Items: An Illustration

Peer reviewed

Penfield, Randall D.; Alvarez, Karina; Lee, Okhee – Applied Measurement in Education, 2009

The assessment of differential item functioning (DIF) in polytomous items addresses between-group differences in measurement properties at the item level, but typically does not inform which score levels may be involved in the DIF effect. The framework of differential step functioning (DSF) addresses this issue by examining between-group…

Descriptors: Test Bias, Classification, Test Items, Criteria

Evaluation of the National Assessment of Educational Progress: Next Steps

Peer reviewed

Noell, Jay; Ginsburg, Alan – Applied Measurement in Education, 2009

The report, "Evaluation of the National Assessment of Educational Progress", provides a number of recommendations for addressing validity concerns about NAEP. This article identifies actions that could be taken by the Congress, the National Center for Education Statistics, and the National Assessment Governing Board--which share responsibility for…

Descriptors: National Competency Tests, Federal Government, Public Agencies, Test Validity

Validating Licensing and Certification Test Score Interpretations and Decisions: A Response.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 1997

This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)

Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment
