ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	8

Descriptor

Achievement Tests	24
Test Validity	16
Elementary Secondary Education	8
Testing Problems	8
Norm Referenced Tests	7
Standardized Tests	7
Test Use	7
Evaluation Methods	6
Test Construction	6
Test Interpretation	6
Educational Testing	5
National Norms	5
Validity	5
Elementary Education	4
Foreign Countries	4
International Assessment	4
Test Items	4
Test Reliability	4
Test Results	4
Computer Assisted Testing	3
Construct Validity	3
Criterion Referenced Tests	3
Item Analysis	3
School Districts	3
Scoring	3
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	24
Reports - Descriptive	7
Reports - Evaluative	7
Reports - Research	7
Opinion Papers	4
Information Analyses	1

Education Level

Secondary Education	3
Elementary Secondary Education	2
Elementary Education	1
Grade 4	1
Higher Education	1
Intermediate Grades	1

Audience

Researchers

Location

California	1
Germany	1
Idaho	1
Michigan	1
New York (New York)	1

Laws, Policies, & Programs

Every Student Succeeds Act…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Program for International…	3
Comprehensive Tests of Basic…	1
National Assessment of…	1
Progress in International…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Direct link

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

When Assessment Validation Neglects Any Strand of Validity Evidence: An Instructive Example from PISA

Peer reviewed

Direct link

Pepper, David – Educational Measurement: Issues and Practice, 2020

The Standards for Educational and Psychological Testing identify several strands of validity evidence that may be needed as support for particular interpretations and uses of assessments. Yet assessment validation often does not seem guided by these Standards, with validations lacking a particular strand even when it appears relevant to an…

Descriptors: Validity, Foreign Countries, Achievement Tests, International Assessment

Construct Equivalence of PISA Reading Comprehension Measured with Paper-Based and Computer-Based Assessments

Peer reviewed

Direct link

Kroehne, Ulf; Buerger, Sarah; Hahnel, Carolin; Goldhammer, Frank – Educational Measurement: Issues and Practice, 2019

For many years, reading comprehension in the Programme for International Student Assessment (PISA) was measured via paper-based assessment (PBA). In the 2015 cycle, computer-based assessment (CBA) was introduced, raising the question of whether central equivalence criteria required for a valid interpretation of the results are fulfilled. As an…

Descriptors: Reading Comprehension, Computer Assisted Testing, Achievement Tests, Foreign Countries

Alignment and Implications for Test Takers

Peer reviewed

Direct link

Welch, Catherine J.; Dunbar, Stephen B. – Educational Measurement: Issues and Practice, 2020

The use of assessment results to inform school accountability relies on the assumption that the test design appropriately represents the content and cognitive emphasis reflected in the state's standards. Since the passage of the Every Student Succeeds Act and the certification of accountability assessments through federal peer review practices,…

Descriptors: Accountability, Test Construction, State Standards, Content Validity

Multistage Adaptive Testing Design in International Large-Scale Assessments

Peer reviewed

Direct link

Yamamoto, Kentaro; Shin, Hyo Jeong; Khorramdel, Lale – Educational Measurement: Issues and Practice, 2018

A multistage adaptive testing (MST) design was implemented for the Programme for the International Assessment of Adult Competencies (PIAAC) starting in 2012 for about 40 countries and has been implemented for the 2018 cycle of the Programme for International Student Assessment (PISA) for more than 80 countries. Using examples from PISA and PIAAC,…

Descriptors: International Assessment, Foreign Countries, Achievement Tests, Test Validity

An Evaluative Framework for Reviewing Fairness Standards and Practices in Educational Tests

Peer reviewed

Direct link

Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019

One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…

Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests

Construct-Irrelevant Variance in High-Stakes Testing

Peer reviewed

Direct link

Haladyna, Thomas M.; Downing, Steven M. – Educational Measurement: Issues and Practice, 2004

There are many threats to validity in high-stakes achievement testing. One major threat is construct-irrelevant variance (CIV). This article defines CIV in the context of the contemporary, unitary view of validity and presents logical arguments, hypotheses, and documentation for a variety of CIV sources that commonly threaten interpretations of…

Descriptors: Student Evaluation, Evaluation Methods, High Stakes Tests, Construct Validity

A Validity Framework for Evaluating the Technical Quality of Alternate Assessments

Peer reviewed

Direct link

Marion, Scott F.; Pellegrino, James W. – Educational Measurement: Issues and Practice, 2006

This article presents findings from two projects designed to improve evaluations of technical quality of alternate assessments for students with the most significant cognitive disabilities. We argue that assessment technical documents should allow for the evaluation of the construct validity of the alternate assessments following the traditions of…

Descriptors: Construct Validity, Student Evaluation, Cognitive Processes, Achievement Tests

Consequential Aspects of the Validity of Achievement Tests: A Publisher's Point of View.

Peer reviewed

Green, Donald Ross – Educational Measurement: Issues and Practice, 1998

Asserts that publishers of achievement tests are, for the most part, not in a position to obtain on their own any decent evidence about the consequences of uses made of their tests. Reasons why this is so are discussed, and what publishers can be expected to do is outlined. (SLD)

Descriptors: Achievement Tests, Elementary Secondary Education, Test Construction, Test Use

Building Validity Evidence for Scores on a State-Wide Alternate Assessment: A Contrasting Groups, Multimethod Approach

Peer reviewed

Direct link

Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007

The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…

Descriptors: Inferences, Disabilities, Rating Scales, Eligibility

Consequential Validity: A Practitioner's Perspective.

Peer reviewed

Taleporos, Elizabeth – Educational Measurement: Issues and Practice, 1998

Two achievement testing programs in New York City that have had and are having some consequences are described, and how the school system tries to determine and deal with these consequences is explored. Cooperation and open dialog between test publishers and the school district are making it possible to deal with the real-life problems of test…

Descriptors: Achievement Tests, Educational Testing, Elementary Secondary Education, School Districts

Some Problems, Pitfalls, and Paradoxes in Educational Measurement.

Peer reviewed

Brennan, Robert L. – Educational Measurement: Issues and Practice, 2001

Discusses some problems, pitfalls, and paradoxes that challenge measurement theory and practice, especially for K-12 achievement testing. Considers a number of technical issues, especially some related to reliability. Also discusses a number of practical or political issues related to validation and accountability. (SLD)

Descriptors: Accountability, Achievement Tests, Educational Testing, Educational Theories

Using Microcomputers to Assess Achievement and Instruction.

Peer reviewed

Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984

The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)

Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing

The Multiple True-False Item Format: A Status Review.

Peer reviewed

Frisbie, David A. – Educational Measurement: Issues and Practice, 1992

Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)

Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests

Valid Normative Information from Customized Achievement Tests.

Peer reviewed

Yen, Wendy M.; And Others – Educational Measurement: Issues and Practice, 1987

This paper discusses how to maintain the integrity of national nomative information for achievement tests when the test that is administered has been customized to satisfy local needs and is not a test that has been nationally normed. Alternative procedures for item selection and calibration are examined. (Author/LMO)

Descriptors: Achievement Tests, Elementary Secondary Education, Goodness of Fit, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2

Brennan, Robert L.	1
Buerger, Sarah	1
Burger, Donald L.	1
Burger, Susan E.	1
Compton, Elizabeth	1
Downing, Steven M.	1
Dunbar, Stephen B.	1
Elliott, Stephen N.	1
Forsyth, Robert A.	1
Frisbie, David A.	1
Goldhammer, Frank	1
Gramenz, Gary W.	1
Green, Donald Ross	1
Hahnel, Carolin	1
Haladyna, Thomas M.	1
Hall, Bruce W.	1
Ji, Xuejun Ryan	1
Jolly, S. Jean	1
Jonson, Jessica L.	1
Khorramdel, Lale	1
Kroehne, Ulf	1
Linn, Robert L.	1
Marion, Scott F.	1
Mehrens, William A.	1
More ▼