ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	12

Descriptor

Test Theory	12
Testing Problems	12
Measurement Techniques	7
Educational Testing	6
Evaluation Methods	6
Foreign Countries	6
Classification	5
Definitions	5
Educational Assessment	5
Equated Scores	5
Psychometrics	5
Test Interpretation	5
Test Items	5
Test Use	5
Comparative Analysis	4
High Stakes Tests	4
Predictive Measurement	4
Construct Validity	3
Predictive Validity	3
Test Bias	3
Test Construction	3
Test Validity	3
Testing Accommodations	3
Accessibility (for Disabled)	2
Adaptive Testing	2
More ▼

Source

Measurement:…	4
Review of Research in…	2
Assessment in Education:…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Online Submission	1
Rehabilitation Research,…	1

Publication Type

Journal Articles	12
Opinion Papers	4
Reports - Descriptive	4
Reports - Evaluative	4
Information Analyses	1

Education Level

Elementary Secondary Education	5
Adult Education	1
Higher Education	1
Secondary Education	1

Audience

Location

United Kingdom	3
United States	3
United Kingdom (England)	2
Australia	1
Netherlands	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	2
Advanced Placement…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Test Affordances or Test Function? Did We Get Messick's Message Right?

Download full text

Salmani Nodoushan, Mohammad Ali – Online Submission, 2021

This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…

Descriptors: Construct Validity, Test Theory, Test Use, Affordances

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Test Equity for People Who Are Deaf or Hard-of-Hearing: Commission on Rehabilitation Counselor Certification Steps for Implementation

Peer reviewed

Direct link

Saladin, Shawn P.; Reid, Christine; Shiels, John – Rehabilitation Research, Policy, and Education, 2011

The Commission on Rehabilitation Counselor Certification (CRCC) has taken a proactive stance on perceived test inequities of the Certified Rehabilitation Counselor (CRC) exam as it relates to people who are prelingually deaf and hard of hearing. This article describes the process developed and implemented by the CRCC to help maximize test equity…

Descriptors: Test Items, Rehabilitation Counseling, Counselor Certification, Deafness

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Theory of Test Translation Error

Peer reviewed

Direct link

Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009

In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…

Descriptors: Test Items, Investigations, Semantics, Translation

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Detecting Differential Speededness in Multistage Testing

Peer reviewed

Direct link

van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007

A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…

Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Backhoff, Eduardo	1
Baird, Jo-Anne	1
Beguin, A. A.	1
Breithaupt, Krista	1
Chuah, Siang Chee	1
Contreras-Nino, Luis Angel	1
Cresswell, Mike	1
Kettler, Ryan J.	1
Longford, Nicholas T.	1
Newton, Paul E.	1
Reid, Christine	1
Saladin, Shawn P.	1
Salmani Nodoushan, Mohammad…	1
Shiels, John	1
Solano-Flores, Guillermo	1
Verstralen, H. H. F. M.	1
Wiliam, Dylan	1
Zhang, Yanwei	1
van Rijn, P. W.	1
van der Linden, Wim J.	1
von Davier, Alina A.	1
More ▼