Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 6 |
Descriptor
Evaluation Methods | 12 |
Test Theory | 12 |
Testing Problems | 12 |
Foreign Countries | 7 |
Comparative Analysis | 6 |
Measurement Techniques | 6 |
Educational Testing | 5 |
Equated Scores | 5 |
Psychometrics | 5 |
Classification | 4 |
Definitions | 4 |
Source
Measurement:… | 4 |
History and Social Science… | 2 |
Assessment in Education:… | 1 |
Instructional Science | 1 |
Journal of Educational… | 1 |
Author
Baird, Jo-Anne | 1 |
Beguin, A. A. | 1 |
Bhaskar, R. | 1 |
Breithaupt, Krista | 1 |
Carlman, Nancy | 1 |
Chuah, Siang Chee | 1 |
Cresswell, Mike | 1 |
Dillard, Jesse F. | 1 |
Livingston, Samuel A. | 1 |
Newton, Paul E. | 1 |
Norris, Stephen P. | 1 |
Publication Type
Journal Articles | 9 |
Opinion Papers | 5 |
Guides - General | 2 |
Reports - General | 2 |
Reports - Research | 2 |
Collected Works - Proceedings | 1 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 4 |
Secondary Education | 1 |
Location
United Kingdom (England) | 3 |
Netherlands | 2 |
United Kingdom | 2 |
United Kingdom (Wales) | 2 |
United States | 2 |
Australia | 1 |
Canada | 1 |
Sweden | 1 |
United Kingdom (Northern… | 1 |
Assessments and Surveys
Advanced Placement… | 1 |
Cornell Critical Thinking Test | 1 |
SAT (College Admission Test) | 1 |
Watson Glaser Critical… | 1 |
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007
A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…
Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

Norris, Stephen P. – History and Social Science Teacher, 1986
Reviews selected research, widely-used commercial tests, and recent literature on critical thinking. States that evaluation of critical thinking processes should encompass more than simple tests. Concludes with six guidelines for evaluating critical thinking. (JDH)
Descriptors: Critical Thinking, Evaluation Methods, Logical Thinking, Secondary Education
Livingston, Samuel A. – 1983
Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…
Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives

Weddle, Perry – History and Social Science Teacher, 1986
Reports the efforts of the California Assessment Program (CAP) to interject critical thinking items into its new statewide social studies tests. Provides definitions of the critical thinking skills tested by the 8th grade social studies CAP test, and lists 100 critical thinking skills vocabulary words. (JDH)
Descriptors: Critical Thinking, Evaluation Methods, Grade 8, Junior High Schools

Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983
Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…
Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis
Carlman, Nancy – 1985
A study examined whether Canadian twelfth grade students' papers would rate differently when they were written in different modes and whether there are significant differences between global (modified holistic) scores and rhetorical effectiveness (modified primary trait) scores for the same papers. Fifty students wrote on two transactional topics…
Descriptors: Comparative Analysis, Discourse Modes, Evaluation Methods, Foreign Countries
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level