Klesch, Heather S. – ProQuest LLC, 2010
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Gray, B. Thomas – 1997
Validity is a critically important issue with far-reaching implications for testing. The history of conceptualizations of validity over the past 50 years is reviewed, and 3 important areas of controversy are examined. First, the question of whether the three traditionally recognized types of validity should be integrated as a unitary entity of…
Descriptors: Educational Testing, Evaluation Methods, Reliability, Scores
Harris, Deborah J. – 2003
Tests and assessments are generally administered to gather data to aid in decision making, whether at an individual student level or at an aggregated level. In order to incorporate assessment data in informed decision making, test users need to understand the test results. This chapter highlights the types of test scores and test score…
Descriptors: Decision Making, Educational Assessment, Educational Testing, Evaluation Methods
Schafer, William D. – 2003
Three groups of persons are involved in the testing enterprise: test producers, test users, and test takers. A wide literature is available to guide the first two groups, but only recently have measurement professionals considered the interests of test takers in any careful way. The content of this chapter is presented as a set of 26…
Descriptors: Educational Assessment, Educational Testing, Evaluation Methods, Guidelines

Popham, W. James – Educational Leadership, 2004
The importance of educational accountability and assessment literacy is recognized as a long-term challenge to the educational system. The complexities of becoming assessment literate, and of giving educators an opportunity to demonstrate effective learning, are discussed.
Descriptors: Accountability, Student Evaluation, Evaluation Methods, Educational Testing
Fortna, Richard O. – 1981
Measurement terms used in Title I evaluation are contained in this glossary. Several types of measurement techniques are identified and defined. Other measurement terms which are defined include those relating to validity, reliability, statistical analysis, test interpretation, and program effectiveness. (DWH)
Descriptors: Educational Testing, Evaluation Methods, Glossaries, Program Evaluation
Helms, Janet E. – 2003
In the United States, standardized educational tests have been used for assessment purposes in grades K through 12 almost since the inception of the testing movement in the early 1900s. Because test-based assessment can have wide-ranging positive or negative effects on K-12 students, the test user must ensure that the tests used for assessment…
Descriptors: Educational Testing, Elementary Secondary Education, Evaluation Methods, Scoring