Blazek, Danielle R.; Siegel, Jason T. – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Tavares, Walter; Kuper, Ayelet; Kulasegaram, Kulamakan; Whitehead, Cynthia – Advances in Health Sciences Education, 2020
The array of different philosophical positions underlying contemporary views on competence, assessment strategies and justification has led to advances in assessment science. Challenges may arise when these philosophical positions are not considered in assessment design. These can include (a) a logical incompatibility leading to varied or…
Descriptors: Performance Based Assessment, Educational Testing, Test Interpretation, Test Results
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation--to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Laird, Robert D. – Developmental Psychology, 2020
Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019
The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…
Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests
Indiana Department of Education, 2024
The Elementary and Secondary Education Act (ESEA), as amended by the Every Student Succeeds Act (ESSA), requires state education agencies to establish and implement standardized, statewide entrance and exit procedures for English learners (ELs). WIDA provides the English language proficiency placement and annual assessments administered in…
Descriptors: English Language Learners, State Standards, Language Proficiency, Language Tests
Meyer, Emily M.; Reynolds, Matthew R. – Journal of Psychoeducational Assessment, 2018
The purpose of this study was to use multidimensional scaling (MDS) to investigate relations among scores from the standardization sample of the Wechsler Intelligence Scale for Children--Fifth edition (WISC-V; Wechsler, 2014). Nonmetric two-dimensional MDS maps were selected for interpretation. The most cognitively complex subtests and indexes…
Descriptors: Children, Intelligence Tests, Scaling, Factor Analysis
Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020
The use of difference scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinuing the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Skinner, Rebecca R. – Congressional Research Service, 2018
Assessing the achievement of students in elementary and secondary schools and the nation's educational progress is fundamental to informing education policy approaches. Congressional interest in this area includes and extends beyond the annual assessments administered by states to comply with the educational accountability requirements of Title…
Descriptors: National Competency Tests, Achievement Tests, Mathematics Achievement, Mathematics Tests
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Feuer, Michael J. – Educational Testing Service, 2011
Few arguments about education are as effective at galvanizing public attention and motivating political action as those that compare the performance of students with their counterparts in other countries and that connect academic achievement to economic performance. Because data from international large-scale assessments (ILSA) have a powerful…
Descriptors: International Assessment, Test Interpretation, Testing Problems, Comparative Testing
Tanner, John R. – School Administrator, 2011
State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…
Descriptors: Federal Legislation, Standardized Tests, Educational Change, Decision Making
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis