Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Tuccitto, Daniel E.; Giacobbi, Peter R., Jr.; Leite, Walter L. – Educational and Psychological Measurement, 2010
This study tested five confirmatory factor analytic (CFA) models of the Positive Affect Negative Affect Schedule (PANAS) to provide validity evidence based on its internal structure. A sample of 223 club sport athletes indicated their emotions during the past week. Results revealed that an orthogonal two-factor CFA model, specifying error…
Descriptors: Factor Analysis, Models, Affective Measures, Validity
Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014
There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…
Descriptors: Student Surveys, National Surveys, Art Education, Design
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Noell, Jay; Ginsburg, Alan – Applied Measurement in Education, 2009
The report, "Evaluation of the National Assessment of Educational Progress", provides a number of recommendations for addressing validity concerns about NAEP. This article identifies actions that could be taken by the Congress, the National Center for Education Statistics, and the National Assessment Governing Board--which share responsibility for…
Descriptors: National Competency Tests, Federal Government, Public Agencies, Test Validity
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Johnson, D. Lamont; Shinedling, Martin M. – Psychological Reports, 1974
An investigation of three intelligence tests reveals that the Slosson shows signs of becoming a legitimate substitute for other intelligence tests, while the Columbia yielded erratic results for the mentally retarded participants in this study. (Author/KM)
Descriptors: Comparative Analysis, Intelligence Tests, Mental Retardation, Test Interpretation

Sattler, Jerome M.; And Others – Psychology in the Schools, 1978
Fabricated test protocols were used to study how effectively examiners agree in scoring ambiguous WISC-R responses. The results suggest that, even with the improved WISC-R manual, scoring remains a difficult and challenging task. (Author)
Descriptors: Comparative Analysis, Intelligence Tests, Research Projects, Scoring Formulas

Newmark, Charles S. – Psychological Reports, 1971
Descriptors: Comparative Analysis, Males, Neurosis, Personality Measures

Mathewson, Peter D. – Journal of Consulting and Clinical Psychology, 1977
Navy enlisted personnel (N=60) were administered the Recall scale of the Kahn Intelligence Test (Experimental Form; KIT) and the Digit Span subtest of the Wechsler Adult Intelligence Scale (WAIS). Scores for the KIT tasks indicate a significant transfer of data to long-term memory. (Author)
Descriptors: Comparative Analysis, Intelligence Tests, Psychological Testing, Research Projects