ERIC - Search Results

Showing all 11 results

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Determining Validity in National Curriculum Assessments

Peer reviewed

Stobart, Gordon – Educational Research, 2009

Background: Validity is a central concern in any assessment, though this has often not been made explicit in the UK assessment context. This article applies current validity theorising, largely derived from American formulations, to national curriculum assessments in England. Purpose: The aim is to consider validity arguments in relation to the…

Descriptors: National Curriculum, Foreign Countries, Elementary Secondary Education, Educational Policy

Louis Guttman's Contributions to Classical Test Theory

Peer reviewed

Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005

This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…

Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing

On the Virtues and Vices of the Standard Error of Measurement.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984

This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)

Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing

Handbook of Criterion-Referenced Testing: Development, Evaluation, and Use.

Shaycoft, Marion F. – 1979

Focusing on the use of "paper and pencil" criterion-referenced tests in educational measurement, this handbook discusses the definitions of basic terms and historical antecedents in order to correct misconceptions. Classifications of the tests are compared with other achievement tests. The phases in developing criterion-referenced tests are presented with the…

Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods

Issues in Standard Setting: Some Comments, Some Suggestions, and Maybe Even a Few Answers.

Livingston, Samuel A. – 1983

Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…

Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives

The Mental Testing Community and Validity: A Prehistory.

Peer reviewed

von Mayrhauser, Richard T. – American Psychologist, 1992

Examines accuracy evaluation in published testing programs of the following: J. M. Cattell; C. Spearman; A. Binet; L. M. Terman; R. M. Yerkes; E. L. Thorndike; and W. D. Scott. Developing community and consensus on testing required convergence between theorists and practitioners. (SLD)

Descriptors: Cognitive Ability, Cognitive Tests, Educational History, Educational Testing