Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Stobart, Gordon – Educational Research, 2009
Background: Validity is a central concern in any assessment, though this has often not been made explicit in the UK assessment context. This article applies current validity theorising, largely derived from American formulations, to national curriculum assessments in England. Purpose: The aim is to consider validity arguments in relation to the…
Descriptors: National Curriculum, Foreign Countries, Elementary Secondary Education, Educational Policy
Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005
This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…
Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984
This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)
Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing
Shaycoft, Marion F. – 1979
This document focuses on the use of "paper and pencil" criterion-referenced tests in educational measurement. To correct misconceptions, the definitions of basic terms and historical antecedents are discussed. Classifications of the tests are compared with other achievement tests. The phases in developing criterion-referenced tests are presented with the…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods
Livingston, Samuel A. – 1983
Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…
Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives

von Mayrhauser, Richard T. – American Psychologist, 1992
Examines accuracy evaluation in published testing programs of the following: J. M. Cattell; C. Spearman; A. Binet; L. M. Terman; R. M. Yerkes; E. L. Thorndike; and W. D. Scott. Developing community and consensus on testing required convergence between theorists and practitioners. (SLD)
Descriptors: Cognitive Ability, Cognitive Tests, Educational History, Educational Testing