ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	20

Descriptor

Educational Assessment	100
Evaluation Methods	100
Testing Problems	100
Elementary Secondary Education	47
Student Evaluation	45
Educational Testing	29
Test Use	28
Psychometrics	22
Test Construction	21
Test Validity	21
Accountability	20
Measurement Techniques	20
Performance Based Assessment	19
Testing Programs	17
Standardized Tests	16
State Programs	16
Disabilities	15
Evaluation Problems	15
Achievement Tests	14
Testing Accommodations	14
Evaluation Research	13
Academic Standards	12
Measurement	12
Program Evaluation	12
Academic Achievement	11
More ▼

Education Level

Elementary Secondary Education	16
Elementary Education	2
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners	7
Administrators	1
Counselors	1
Policymakers	1
Researchers	1
Teachers	1

Location

United Kingdom (England)	4
United States	4
United Kingdom	3
United Kingdom (Wales)	2
Australia	1
California	1
Canada	1
Florida	1
Nevada	1
New Jersey	1
New York	1
South Africa	1
More ▼

Laws, Policies, & Programs

Education Consolidation…	2
Elementary and Secondary…	1
Hawkins Stafford Act 1988	1
Individuals with Disabilities…	1

Assessments and Surveys

National Assessment of…	5
Advanced Placement…	4
SAT (College Admission Test)	2

What Works Clearinghouse Rating

Showing 1 to 15 of 100 results Save | Export

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Impact of Missing Data on the Detection of Differential Item Functioning: The Case of Mantel-Haenszel and Logistic Regression Analysis

Peer reviewed

Direct link

Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009

This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…

Descriptors: Test Bias, Simulation, Interaction, Effect Size

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Assessment Themes and Issues.

Peer reviewed

Rose, Amy D.; Leahy, Meredyth A. – New Directions for Adult and Continuing Education, 1997

Summarizes themes from the articles in this issue: distinction between evaluation and assessment, innovation, bias, integration into program delivery, and transfer. (SK)

Descriptors: Adult Education, Adult Learning, Educational Assessment, Evaluation Methods

Tests of Learning Disabilities (Assessment).

Peer reviewed

Shanahan, Timothy – Reading Teacher, 1989

Reviews two tests which highlight important trends in the assessment of learning disabilities, and presents potentially valuable, practical developments in assessment. Identifies the recurring problem of the lack of a theory, construct, or clear definition to guide the assessment of learning disabilities. (MG)

Descriptors: Diagnostic Tests, Educational Assessment, Elementary Education, Evaluation Methods

Assessment in Early Childhood Education.

Download full text

Haney, Walt; Gelberg, Wendy – 1980

The goal of this booklet is to describe some of the special challenges posed by early childhood assessment in general, and particularly as they apply to Title I program evaluation. The booklet has four purposes: (1) to describe special issues in early childhood assessment; (2) to describe briefly alternative approaches to early childhood…

Descriptors: Early Childhood Education, Educational Assessment, Evaluation Methods, Measurement Techniques

Psychological Testing: Misdiagnosis and Half-Diagnosis

Peer reviewed

Hutson, Barbara A. – Psychology in the Schools, 1974

Delineates a model of the complete diagnosis process which is motivated by concern that every child achieve the potential he possesses. Discusses the usual functions of diagnosis, the nature of psychological-educational diagnosis, the effects of inaccurate or inadequate diagnosis, and implication for a more adequate process. (Author/PC)

Descriptors: Educational Assessment, Evaluation Methods, Models, Psychoeducational Methods

Randomised Items in Computer-Based Tests: Russian Roulette in Assessment?

Peer reviewed

Direct link

Marks, Anthony M.; Cronje, Johannes C. – Educational Technology & Society, 2008

Computer-based assessments are becoming more commonplace, perhaps as a necessity for faculty to cope with large class sizes. These tests often occur in large computer testing venues in which test security may be compromised. In an attempt to limit the likelihood of cheating in such venues, randomised presentation of items is automatically…

Descriptors: Educational Assessment, Educational Testing, Research Needs, Test Items

Test and Item Bias: What They Are, What They Aren't, and How To Detect Them.

Download full text

Ellis, Barbara B.; Raju, Nambury S. – 2003

This chapter briefly describes some of the methods that test developers and psychometricians have devised to identify item and test bias and some of the challenges they still face. Although it may not be reasonable for classroom teachers to use these methods on a day-to-day basis in constructing tests, the authors argue that it is important for…

Descriptors: Academic Achievement, Educational Assessment, Educational Testing, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Measurement:…	14
Journal of Educational…	3
College English	2
Educational Researcher	2
Applied Measurement in…	1
Assessment for Effective…	1
Educational Measurement:…	1
Educational Technology &…	1
Educational and Psychological…	1
English Education	1
Florida Educational Research…	1
Highway One	1
Journal of Career Education	1
NCEO Policy Directions	1
New Directions for Adult and…	1
Psychology in the Schools	1
Reading & Writing Quarterly	1
Reading Teacher	1
Research Connections in…	1
Studies in Educational…	1
Understanding Our Gifted	1
Urban Review	1
More ▼

Thurlow, Martha	11
Liu, Kristin	4
Spicuzza, Richard	4
Erickson, Ronald	3
Almond, Patricia	2
Bielinski, John	2
Minnema, Jane	2
Scott, Dorene	2
Tindal, Gerald	2
White, Edward M.	2
Alonzo, Alicia C.	1
Aschbacher, Pamela R.	1
Baird, Jo-Anne	1
Baldwin, Su G.	1
Bartek, Mary M.	1
Beard, Joseph W.	1
Bell-Mick, Lori	1
Bossone, Richard M., Ed.	1
Boys, Chris	1
Clauser, Brian E.	1
Cline, Jerome	1
Cousins, J. Bradley	1
Crehan, Kevin	1
Cresswell, Mike	1
More ▼

Journal Articles	34
Opinion Papers	32
Reports - Evaluative	20
Reports - Research	17
Speeches/Meeting Papers	16
Information Analyses	11
Guides - Non-Classroom	9
Reports - Descriptive	6
Collected Works - Proceedings	5
Collected Works - Serials	3
Books	2
Legal/Legislative/Regulatory…	2
Reference Materials -…	2
Book/Product Reviews	1
Collected Works - General	1
ERIC Publications	1
Guides - Classroom - Teacher	1
Guides - General	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼