ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	8

Descriptor

Equated Scores	10
Evaluation Methods	10
Testing Problems	10
Comparative Analysis	6
Foreign Countries	6
Measurement Techniques	6
Test Interpretation	6
Test Use	6
Classification	5
Definitions	5
Educational Assessment	5
Educational Testing	5
High Stakes Tests	5
Predictive Measurement	5
Psychometrics	5
Test Theory	5
Test Validity	4
Scaling	3
Testing Programs	3
Item Analysis	2
Item Response Theory	2
Predictive Validity	2
Test Construction	2
Weighted Scores	2
Academic Achievement	1
More ▼

Source

Measurement:…	5
Applied Measurement in…	1
Assessment in Education:…	1
Journal of Educational and…	1

Author

Arter, Judith A.	1
Baird, Jo-Anne	1
Beguin, A. A.	1
Cresswell, Mike	1
Mushkin, Selma J.	1
Newton, Paul E.	1
Phillips, Gary W.	1
Verstralen, H. H. F. M.	1
Walker, Michael E.	1
van Rijn, P. W.	1
van der Linden, Wim J.	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	8
Opinion Papers	6
Reports - Evaluative	2
Reports - Research	2
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	5
Secondary Education	1

Audience

Location

United Kingdom (England)	3
United States	3
United Kingdom	2
United Kingdom (Wales)	2
Australia	1
Netherlands	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	2
SAT (College Admission Test)	2

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Lord's Equity Theorem Revisited

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019

Lord's (1980) equity theorem claims observed-score equating to be possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself though, which can be shown to follow directly from the discrete nature of…

Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Out-of-Level Versus In-Level Testing: When Should We Recommend Each?

Arter, Judith A. – 1982

Specific recommendations are made concerning the circumstances under which the benefits of out-of-level testing outweigh the problems associated with it. Topics explored are: various methods for deciding when a set of test scores is invalid and the utility of these methods for local evaluators, the accuracy of vertical scaling, and the usefulness…

Descriptors: Equated Scores, Evaluation Methods, Local Norms, Scores

A Proposal for a "SIR" Adjusted Index of Educational Competence.

Download full text

Mushkin, Selma J. – 1973

The increasing use of educational performance or outcome measurements for a range of policy purposes points to new procedures for adjusting data for population composition. The purposes include: program formulation, budget resource allocation, grant-in-aid designs, performance incentive payments, consumer information for school selection, and…

Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Demography