Blazek, Danielle R.; Siegel, Jason T. – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Tavares, Walter; Kuper, Ayelet; Kulasegaram, Kulamakan; Whitehead, Cynthia – Advances in Health Sciences Education, 2020
The array of different philosophical positions underlying contemporary views on competence, assessment strategies and justification have led to advances in assessment science. Challenges may arise when these philosophical positions are not considered in assessment design. These can include (a) a logical incompatibility leading to varied or…
Descriptors: Performance Based Assessment, Educational Testing, Test Interpretation, Test Results
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Laird, Robert D. – Developmental Psychology, 2020
Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019
The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…
Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests
Meyer, Emily M.; Reynolds, Matthew R. – Journal of Psychoeducational Assessment, 2018
The purpose of this study was to use multidimensional scaling (MDS) to investigate relations among scores from the standardization sample of the Wechsler Intelligence Scale for Children--Fifth Edition (WISC-V; Wechsler, 2014). Nonmetric two-dimensional MDS maps were selected for interpretation. The most cognitively complex subtests and indexes…
Descriptors: Children, Intelligence Tests, Scaling, Factor Analysis
Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020
The use of difference scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinuing the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Tanner, John R. – School Administrator, 2011
State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…
Descriptors: Federal Legislation, Standardized Tests, Educational Change, Decision Making
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics