ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	12

Descriptor

Test Theory	93
Testing Problems	93
Test Construction	33
Test Reliability	25
Test Validity	25
Test Interpretation	24
Test Items	24
Measurement Techniques	21
Test Use	21
Educational Testing	18
Criterion Referenced Tests	16
Achievement Tests	13
Comparative Analysis	13
Mathematical Models	13
Psychometrics	13
Statistical Analysis	13
Elementary Secondary Education	12
Equated Scores	12
Evaluation Methods	12
Latent Trait Theory	12
Test Bias	12
Foreign Countries	11
Item Analysis	11
Reading Tests	11
Scores	11

Education Level

Elementary Secondary Education	5
Adult Education	1
Higher Education	1
Secondary Education	1

Audience

Researchers	10
Practitioners	5
Teachers	2
Counselors	1
Students	1

Location

United Kingdom	4
United Kingdom (England)	3
United States	3
Canada	2
Netherlands	2
United Kingdom (Wales)	2
Australia	1
Israel	1
Sweden	1
Texas	1
United Kingdom (Northern…	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	4
Advanced Placement…	1
California Achievement Tests	1
Childrens Depression Inventory	1
Comprehensive Tests of Basic…	1
Cornell Critical Thinking Test	1
Developmental Indicators for…	1
Expressive One Word Picture…	1
General Aptitude Test Battery	1
Graduate Management Admission…	1
Kaufman Assessment Battery…	1
Nelson Denny Reading Tests	1
Peabody Picture Vocabulary…	1
Stanford Achievement Tests	1
Watson Glaser Critical…	1

Showing 1 to 15 of 93 results

Test Affordances or Test Function? Did We Get Messick's Message Right?

Salmani Nodoushan, Mohammad Ali – Online Submission, 2021

This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…

Descriptors: Construct Validity, Test Theory, Test Use, Affordances

Screening Test Items for Differential Item Functioning

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Test Equity for People Who Are Deaf or Hard-of-Hearing: Commission on Rehabilitation Counselor Certification Steps for Implementation

Peer reviewed

Saladin, Shawn P.; Reid, Christine; Shiels, John – Rehabilitation Research, Policy, and Education, 2011

The Commission on Rehabilitation Counselor Certification (CRCC) has taken a proactive stance on perceived test inequities of the Certified Rehabilitation Counselor (CRC) exam as it relates to people who are prelingually deaf and hard of hearing. This article describes the process developed and implemented by the CRCC to help maximize test equity…

Descriptors: Test Items, Rehabilitation Counseling, Counselor Certification, Deafness

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Theory of Test Translation Error

Peer reviewed

Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009

In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…

Descriptors: Test Items, Investigations, Semantics, Translation

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Thorndike's and Wood's Principles of Educational Measurement: A View from the 1980's.

Engelhard, George, Jr. – 1988

The purpose of this essay is to describe the principles of educational measurement proposed by B. Wood during the 1920s in his dissertation, written under the direction of E. L. Thorndike, and later published as "Measurement in Higher Education" (1923). These principles were selected because they illustrate one of the earliest and most complete…

Descriptors: Educational History, Educational Testing, Test Theory, Testing Problems

Detecting Differential Speededness in Multistage Testing

Peer reviewed

van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007

A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…

Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

Basic Concepts in Classical Test Theory: Tests Aren't Reliable, the Nature of Alpha, and Reliability Generalization as a Meta-analytic Method.

Helms, LuAnn Sherbeck – 1999

This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…

Descriptors: Effect Size, Meta Analysis, Reliability, Scores

On the Direct Measurement of Face Validity: A Comment on Nevo.

Peer reviewed

Secolsky, Charles – Journal of Educational Measurement, 1987

For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)

Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items

The Reliability of a Profile.

Peer reviewed

Yarnold, Paul R. – Educational and Psychological Measurement, 1984

Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)

Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability

Source

Journal of Educational…	5
Measurement:…	4
Educational Measurement:…	2
Educational and Psychological…	2
History and Social Science…	2
International Journal of…	2
Journal of Experimental…	2
Review of Research in…	2
Alberta Journal of…	1
American Educator: The…	1
American Psychologist	1
Applied Psychological…	1
Assessment in Education:…	1
Canadian Journal for…	1
Educational Studies	1
Evaluation in Education:…	1
Executive Review	1
Instructional Science	1
Intelligence	1
International Journal of…	1
Journal of Cross-Cultural…	1
Journal of Educational and…	1
Journal of Reading	1
Journal of Research in…	1
Online Submission	1

Author

Bormuth, John R.	2
Livingston, Samuel A.	2
Norris, Stephen P.	2
Powell, J. C.	2
Weiss, David J.	2
Wilcox, Rand R.	2
Zimmerman, Donald W.	2
van der Linden, Wim J.	2
Airaisian, Peter W.	1
Alliger, George M.	1
Altepeter, Tom	1
Andrich, David	1
Angoff, William H.	1
Armour-Thomas, Eleanor	1
Aronson, Edith	1
Backhoff, Eduardo	1
Baird, Jo-Anne	1
Beal, Judy	1
Beard, John D., Ed.	1
Beguin, A. A.	1
Bhaskar, R.	1
Biggs, John	1
Breithaupt, Krista	1
Brittain, Clay V.	1

Publication Type

Journal Articles	42
Reports - Research	41
Speeches/Meeting Papers	21
Opinion Papers	15
Reports - Evaluative	13
Reports - Descriptive	10
Information Analyses	7
Books	3
Collected Works - Serials	3
Guides - Non-Classroom	3
Collected Works - General	2
Collected Works - Proceedings	2
Guides - General	2
Reports - General	2
Book/Product Reviews	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Numerical/Quantitative Data	1
Reference Materials -…	1