ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	16

Descriptor

Test Interpretation	75
Scores	27
Test Use	24
Testing Problems	24
Test Validity	19
Educational Assessment	17
Achievement Tests	16
Norm Referenced Tests	15
Scoring	15
Test Results	15
Elementary Secondary Education	14
Test Construction	12
Evaluation Methods	11
Standardized Tests	10
Elementary Education	9
Test Items	9
Performance Based Assessment	8
Student Evaluation	8
Test Reliability	8
Testing Programs	8
Criterion Referenced Tests	6
Evaluators	6
Interrater Reliability	6
Computer Assisted Testing	5
Higher Education	5
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	75
Reports - Evaluative	31
Opinion Papers	19
Reports - Research	14
Reports - Descriptive	10
Tests/Questionnaires	9
Guides - Non-Classroom	4
Speeches/Meeting Papers	4
Information Analyses	2
Guides - Classroom - Learner	1

Education Level

Elementary Secondary Education

Audience

Counselors	1
Practitioners	1
Teachers	1

Location

Kentucky

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Comprehensive Tests of Basic…	3
National Assessment of…	3
Iowa Tests of Basic Skills	2
SAT (College Admission Test)	2
ACT Assessment	1
Graduate Record Examinations	1
Preliminary Scholastic…	1
Sequential Tests of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 75 results Save | Export

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Direct link

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

What Are the Conditions Associated with Subscore Added Value Noninvariance? Implications for Improving Subscore Interpretation Fairness

Peer reviewed

Direct link

Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021

Subscore added value analyses assume invariance across test taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…

Descriptors: Scores, Test Length, Ability, Correlation

When Should I Use a Measure to Support Instructional Improvement at Scale? The Importance of Considering Both Intended and Actual Use in Validity Arguments

Peer reviewed

Direct link

Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021

Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…

Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation

Exploring the Impact of Rater Effects on Person Fit in Rater-Mediated Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2020

Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns,…

Descriptors: Performance Based Assessment, Evaluators, Achievement, Influences

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

Direct link

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Argumentation Surrounding Argument-Based Validation: A Systematic Review of Validation Methodology in Peer-Reviewed Articles

Peer reviewed

Direct link

Lavery, Matthew Ryan; Bostic, Jonathan D.; Kruse, Lance; Krupa, Erin E.; Carney, Michele B. – Educational Measurement: Issues and Practice, 2020

Since it was formalized by Kane, the argument-based approach to validation has been promoted as the preferred method for validating interpretations and uses of test scores. Because validation is discussed in terms of arguments, and arguments are both interactive and social, the present review systematically examines the scholarly arguments which…

Descriptors: Persuasive Discourse, Validity, Research Methodology, Peer Evaluation

Actual Interpretations and Use of Scores as Aspects of Validity

Peer reviewed

Direct link

O'Leary, Timothy M.; Hattie, John A. C.; Griffin, Patrick – Educational Measurement: Issues and Practice, 2017

Validity is the most fundamental consideration in test development. Understandably, much time, effort, and money is spent in its pursuit. Central to the modern conception of validity are the interpretations made, and uses planned, on the basis of test scores. There is, unfortunately, however, evidence that test users have difficulty understanding…

Descriptors: Test Interpretation, Scores, Test Validity, Evidence

Reliability, Dimensionality, and Internal Consistency as Defined by Cronbach: Distinct Albeit Related Concepts

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2015

This article uses definitions provided by Cronbach in his seminal paper for coefficient a to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's a. Internal consistency…

Descriptors: Reliability, Definitions, Mathematics, Test Interpretation

A Note on Assessing the Added Value of Subscores

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (Brennan, R. L., 2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (Haberman, S. J., 2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

Rapid-Guessing Behavior: Its Identification, Interpretation, and Implications

Peer reviewed

Direct link

Wise, Steven L. – Educational Measurement: Issues and Practice, 2017

The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time

The Media and Educational Testing: In Pursuit of the Truth or in Pursuit of a Good Story?

Peer reviewed

Direct link

Camara, Wayne J.; Shaw, Emily J. – Educational Measurement: Issues and Practice, 2012

The measurement community needs to better understand how to interact with the media to effectively disseminate important findings from educational testing efforts. To this end, the current paper will review media coverage of educational testing and related issues and elaborate on areas of concern and opportunities for improved communication…

Descriptors: Test Results, Educational Testing, Measurement, Information Dissemination

Comments on Neil Dorans's NCME Career Award Address: The Contestant Perspective on Taking Tests--Emanations from the Statue within

Peer reviewed

Direct link

Pommerich, Mary – Educational Measurement: Issues and Practice, 2012

Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…

Descriptors: Testing, Scores, Measurement, Test Construction

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Direct link

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Direct link

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Interpreting IQ Scores.

Peer reviewed

Hills, John R. – Educational Measurement: Issues and Practice, 1984

A true-false measure concerning the interpretation of intelligence quotients (IQ) is presented. The correct responses to the 10 items, and a brief explanation of each, are also included. The author attempts to reveal many misconceptions about the interpretation of IQ scores. (DWH)

Descriptors: Intelligence Quotient, Intelligence Tests, Objective Tests, Test Interpretation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Hills, John R.	8
Linn, Robert L.	4
Mehrens, William A.	3
Plake, Barbara S.	3
Frisbie, David A.	2
Hoover, H. D.	2
Kolen, Michael J.	2
Krupa, Erin E.	2
An, Lily Shiao	1
Armstrong, Anne-Marie	1
Bachman, Lyle F.	1
Bond, Lloyd	1
Bostic, Jonathan	1
Bostic, Jonathan D.	1
Brennan, Robert L.	1
Burket, George R.	1
Burstein, Leigh	1
Burton, Elizabeth	1
Camara, Wayne J.	1
Carney, Michele B.	1
Chinen, Starlie	1
Cizek, Gregory J.	1
Clauser, Brian E.	1
Cohen, Allan	1
More ▼