Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Source
Assessment in Education:… | 4 |
Author
Baird, Jo-Anne | 1 |
Beguin, A. A. | 1 |
Beguin, Anton | 1 |
El Masri, Yasmine H. | 1 |
Graesser, Art | 1 |
Klieme, Eckhard | 1 |
Kuger, Susanne | 1 |
Marksteiner, Tamara | 1 |
Verstralen, H. H. F. M. | 1 |
Wheadon, Christopher | 1 |
van Rijn, P. W. | 1 |
Publication Type
Journal Articles | 4 |
Reports - Research | 2 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Secondary Education | 4 |
Elementary Secondary Education | 1 |
Location
Netherlands | 1 |
United Kingdom | 1 |
Assessments and Surveys
Program for International… | 2 |
Marksteiner, Tamara; Kuger, Susanne; Klieme, Eckhard – Assessment in Education: Principles, Policy & Practice, 2019
We investigate whether Anchoring Vignettes (AV) improve intercultural comparability of non-cognitive student-directed factors (e.g., procrastination). So far, correlation analyses for anchored and non-anchored scores with a criterion have been used to demonstrate the effectiveness of AV in improving data quality. However, correlation analyses are…
Descriptors: Vignettes, Equated Scores, International Assessment, Test Reliability
El Masri, Yasmine H.; Baird, Jo-Anne; Graesser, Art – Assessment in Education: Principles, Policy & Practice, 2016
We investigate the extent to which language versions (English, French and Arabic) of the same science test are comparable in terms of item difficulty and demands. We argue that language is an inextricable part of the scientific literacy construct, be it intended or not by the examiner. This argument has considerable implications on methodologies…
Descriptors: International Assessment, Difficulty Level, Test Items, Language Variation
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Wheadon, Christopher; Beguin, Anton – Assessment in Education: Principles, Policy & Practice, 2010
Tiering is a multi-stage test design whereby teachers allocate students to a particular difficulty level (tier) of a test. This approach to the challenge of delivering assessments to students with a heterogeneous ability distribution is normal practice in UK public examinations at the age of 16. This study uses Item Response Theory number-correct…
Descriptors: Difficulty Level, Item Response Theory, Achievement Tests, Standard Setting (Scoring)