ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	18

Descriptor

Test Interpretation	150
Testing Problems	150
Test Validity	43
Scores	32
Elementary Secondary Education	30
Test Use	30
Standardized Tests	27
Educational Testing	24
Achievement Tests	22
Test Reliability	21
Evaluation Methods	19
Scoring	19
Elementary Education	18
Test Construction	18
Intelligence Tests	17
Test Results	15
Educational Assessment	14
Psychometrics	14
Test Norms	14
Test Theory	14
Testing	14
Criterion Referenced Tests	13
Foreign Countries	13
Higher Education	13
Measurement Techniques	13

Publication Type

Journal Articles	150
Reports - Research	45
Opinion Papers	40
Reports - Evaluative	27
Information Analyses	25
Reports - Descriptive	15
Guides - Non-Classroom	9
Reports - General	4
Tests/Questionnaires	4
Speeches/Meeting Papers	2
Book/Product Reviews	1

Education Level

Elementary Secondary Education	9
Higher Education	1
Junior High Schools	1

Audience

Practitioners	8
Researchers	3
Parents	1

Location

United Kingdom	4
United States	4
Canada	3
United Kingdom (England)	3
United Kingdom (Wales)	2
Australia	1
Israel	1
Japan	1
Netherlands	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Education for All Handicapped…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 150 results

Preventing Satisficing: A Narrative Review

Peer reviewed

Blazek, Danielle R.; Siegel, Jason T. – International Journal of Social Research Methodology, 2024

Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…

Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation

Which Assessment Is Harder? Some Limits of Statistical Linking

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

The Compatibility Principle: On Philosophies in the Assessment of Clinical Competence

Peer reviewed

Tavares, Walter; Kuper, Ayelet; Kulasegaram, Kulamakan; Whitehead, Cynthia – Advances in Health Sciences Education, 2020

The array of different philosophical positions underlying contemporary views on competence, assessment strategies and justification has led to advances in assessment science. Challenges may arise when these philosophical positions are not considered in assessment design. These can include (a) a logical incompatibility leading to varied or…

Descriptors: Performance Based Assessment, Educational Testing, Test Interpretation, Test Results

Digital-First Assessments: A Security Framework

Peer reviewed

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation--to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

Analytical Challenges of Testing Hypotheses of Agreement and Discrepancy: Comment on Campione-Barr, Lindell, and Giron (2020)

Peer reviewed

Laird, Robert D. – Developmental Psychology, 2020

Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…

Descriptors: Educational Research, Hypothesis Testing, Differences, Scores

Challenges to the Cattell-Horn-Carroll Theory: Empirical, Clinical, and Policy Implications

Peer reviewed

Canivez, Gary L.; Youngstrom, Eric A. – Applied Measurement in Education, 2019

The Cattell-Horn-Carroll (CHC) taxonomy of cognitive abilities married John Horn and Raymond Cattell's Extended Gf-Gc theory with John Carroll's Three-Stratum Theory. While there are some similarities in arrangements or classifications of tasks (observed variables) within similar broad or narrow dimensions, other salient theoretical features and…

Descriptors: Taxonomy, Cognitive Ability, Intelligence, Cognitive Tests

Scores in Space: Multidimensional Scaling of the WISC-V

Peer reviewed

Meyer, Emily M.; Reynolds, Matthew R. – Journal of Psychoeducational Assessment, 2018

The purpose of this study was to use multidimensional scaling (MDS) to investigate relations among scores from the standardization sample of the Wechsler Intelligence Scale for Children--Fifth edition (WISC-V; Wechsler, 2014). Nonmetric two-dimensional MDS maps were selected for interpretation. The most cognitively complex subtests and indexes…

Descriptors: Children, Intelligence Tests, Scaling, Factor Analysis

More Data, More Problems: Analytical Complications of Studying Differential Family Experiences over Time: Reply to Laird (2020)

Peer reviewed

Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020

The use of difference scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinuing the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…

Descriptors: Educational Research, Hypothesis Testing, Differences, Scores

The Leading Group Effect: Illusionary Declines in Scholastic Standard Scores of Mid-Range Japanese Junior High School Pupils

Peer reviewed

Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012

Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…

Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems

Worldwide Test Reviewing at the Beginning of the Twenty-First Century

Peer reviewed

Geisinger, Kurt F. – International Journal of Testing, 2012

This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…

Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach

Missing the Mark: What Test Scores Really Tell Us

Tanner, John R. – School Administrator, 2011

Scores from state tests administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…

Descriptors: Federal Legislation, Standardized Tests, Educational Change, Decision Making

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

Source

Educational Measurement:…	24
Journal of Educational…	13
Educational and Psychological…	8
American Psychologist	5
Measurement:…	5
School Psychology Review	4
Applied Psychological…	3
Journal of Clinical Psychology	3
Journal of Consulting and…	3
New Directions for Testing…	3
School Psychology Digest	3
Applied Measurement in…	2
Developmental Psychology	2
Diagnostique	2
International Journal of…	2
Journal of Counseling…	2
Journal of Educational…	2
Journal of Educational…	2
NCME Measurement in Education	2
Perceptual and Motor Skills	2
Psychological Assessment	2
Advances in Health Sciences…	1
American Annals of the Deaf	1
American Educator: The…	1
American Journal of Family…	1

Author

Geisinger, Kurt F.	3
Hoover, H. D.	3
Hills, John R.	2
Jaeger, Richard M.	2
Koretz, Daniel	2
Lenke, Joanne M.	2
Linn, Robert L.	2
Mehrens, William A.	2
Plake, Barbara S.	2
Shepard, Lorrie A.	2
von Davier, Alina A.	2
Anderson, Kent E.	1
Atkinson, Leslie	1
Attali, Yigal	1
Baig, Basim	1
Baird, Jo-Anne	1
Beal, Judy	1
Beck, Michael D.	1
Ben-David, Amith	1
Benton, Tom	1
Bloom, Allan S.	1
Bond, Lloyd	1
Bower, Ruth	1
Bracey, Gerald W.	1

Assessments and Surveys

SAT (College Admission Test)	6
Comprehensive Tests of Basic…	5
Wechsler Intelligence Scale…	5
Iowa Tests of Basic Skills	3
Advanced Placement…	2
General Aptitude Test Battery	2
Hopelessness Scale	2
Kaufman Assessment Battery…	2
National Assessment of…	2
Stanford Binet Intelligence…	2
Strong Campbell Interest…	2
System of Multicultural…	2
Vineland Adaptive Behavior…	2
Woodcock Johnson Psycho…	2
Woodcock Johnson Tests of…	2
Bayley Scales of Infant…	1
California Achievement Tests	1
Child Abuse Potential…	1
Developmental Indicators for…	1
Family Adaptability Cohesion…	1
Manifest Anxiety Scale	1
Marlowe Crowne Social…	1
Minnesota Multiphasic…	1
Minnesota Teacher Attitude…	1
Peabody Picture Vocabulary…	1