ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	15

Descriptor

Educational Assessment	89
Educational Testing	89
Testing Problems	89
Elementary Secondary Education	44
Testing Programs	33
Evaluation Methods	29
Test Construction	27
Student Evaluation	24
Achievement Tests	20
Standardized Tests	20
National Surveys	19
Test Use	19
Accountability	17
Test Validity	17
Test Interpretation	16
Test Bias	15
Measurement Objectives	14
Psychometrics	13
Measurement Techniques	12
Foreign Countries	11
Academic Achievement	10
Program Evaluation	10
Academic Standards	9
Criterion Referenced Tests	9
Disabilities	9
More ▼

Publication Type

Opinion Papers	37
Journal Articles	21
Reports - Research	13
Information Analyses	12
Reports - Descriptive	12
Collected Works - Proceedings	11
Speeches/Meeting Papers	8
Reports - Evaluative	7
Books	5
Collected Works - Serials	4
Guides - Non-Classroom	4
Collected Works - General	2
ERIC Publications	2
Guides - General	2
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼

Education Level

Elementary Secondary Education	11
Higher Education	2
Postsecondary Education	2
Secondary Education	1

Audience

Practitioners	7
Researchers	2
Teachers	1

Location

United Kingdom	4
United Kingdom (England)	4
United States	4
United Kingdom (Wales)	3
Florida	2
New Jersey	2
Australia	1
California	1
Colorado (Denver)	1
Connecticut	1
Georgia	1
Lesotho	1
Missouri	1
Netherlands	1
Oregon	1
Oregon (Portland)	1
South Africa	1
West Germany	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	3
Americans with Disabilities…	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Job Training Partnership Act…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	18
Advanced Placement…	3
SAT (College Admission Test)	3
International Association for…	1
Kaufman Test of Educational…	1
Wechsler Individual…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 89 results Save | Export

Investigating Effect of Ignoring Hierarchical Data Structures on Accuracy of Vertical Scaling Using Mixed-Effects Rasch Model

Download full text

Wang, Shudong; Jiao, Hong; Jin, Ying; Thum, Yeow Meng – Online Submission, 2010

The vertical scales of large-scale achievement tests created by using item response theory (IRT) models are mostly based on cluster (or correlated) educational data in which students usually are clustered in certain groups or settings (classrooms or schools). While such application directly violated assumption of independent sample of person in…

Descriptors: Scaling, Achievement Tests, Data Analysis, Item Response Theory

On the Design of Online Synchronous Assessments in a Synchronous Cyber Classroom

Peer reviewed

Direct link

Chao, K.-J.; Hung, I.-C.; Chen, N.-S. – Journal of Computer Assisted Learning, 2012

Online learning has been rapidly developing in the last decade. However, there is very little literature available about the actual adoption of online synchronous assessment approaches and any guidelines for effective assessment design and implementation. This paper aims at designing and evaluating the possibility of applying online synchronous…

Descriptors: Electronic Learning, Student Evaluation, Online Courses, Computer Software

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

A Review of Academic Achievement Tests: Recommendations for Age Appropriate Administration

Direct link

Kozloff, Allison Burstein – ProQuest LLC, 2009

Comprehensive academic achievement tests are routinely used by school psychologists in psycho-educational assessment batteries to identify learning disabled students. A variety of assessment measures are used across age groups to determine if a discrepancy exists between academic achievement and intellectual functioning; however, among the most…

Descriptors: Intelligence, Educational Assessment, Academic Achievement, Achievement Tests

Developments in Multidimensional Item Response Theory.

Peer reviewed

Ackerman, Terry – Applied Psychological Measurement, 1996

This special issue is devoted to current developments in multidimensional item response theory (MIRT). The six papers included in this issue (and two for the March issue) present applications of MIRT to practical testing problems and demonstrate innovative MIRT techniques for assessment. (SLD)

Descriptors: Data Analysis, Educational Assessment, Educational Testing, Item Response Theory

Questions You Should Ask About Your Testing Program

Peer reviewed

Damon, J. Parker – National Elementary Principal, 1976

Presents some specific questions that educators ought to raise about standardized tests and some alternatives to standardized tests. (Author/IRT)

Descriptors: Data Collection, Educational Assessment, Educational Testing, Elementary Education

Randomised Items in Computer-Based Tests: Russian Roulette in Assessment?

Peer reviewed

Direct link

Marks, Anthony M.; Cronje, Johannes C. – Educational Technology & Society, 2008

Computer-based assessments are becoming more commonplace, perhaps as a necessity for faculty to cope with large class sizes. These tests often occur in large computer testing venues in which test security may be compromised. In an attempt to limit the likelihood of cheating in such venues, randomised presentation of items is automatically…

Descriptors: Educational Assessment, Educational Testing, Research Needs, Test Items

Strategies for Improving the Process of Educational Assessment. ERIC/AE Digest.

Download full text

Matter, M. Kevin – 1999

This digest presents seven strategies that an assessment director may use to improve test administration practices. These strategies highlight clear communication, the responsibility of the Building Test Coordinator, and rewarding and reinforcing quality. The strategies are: (1) focusing on communication; (2) designating a building test…

Descriptors: Communication (Thought Transfer), Coordination, Educational Assessment, Educational Planning

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Measurement:…	7
Journal of Educational…	3
National Elementary Principal	2
Studies in Educational…	2
ASAP Notes	1
Applied Psychological…	1
Educational Measurement:…	1
Educational Researcher	1
Educational Technology &…	1
Florida Educational Research…	1
International Journal of…	1
Journal of Computer Assisted…	1
New Directions for Testing…	1
New Schools, New Communities	1
Online Submission	1
Popular Computing	1
ProQuest LLC	1
Research Connections in…	1
Review of Research in…	1
School Guidance Worker	1
More ▼

Thurlow, Martha	7
Bielinski, John	2
Bossone, Richard M., Ed.	2
Goldstein, Harvey	2
Haney, Walt	2
Minnema, Jane	2
Scott, Dorene	2
Ackerman, Terry	1
Airaisian, Peter W.	1
Almond, Patricia	1
Bailis, Lawrence Neil	1
Baird, Jo-Anne	1
Baldwin, Su G.	1
Beard, Joseph W.	1
Billings, Bradley B.	1
Boys, Chris	1
Butler, Erik Payne	1
Carlson, Ken	1
Cassie, J. R. Bruce	1
Chandler, John W.	1
Chao, K.-J.	1
Chen, N.-S.	1
Clauser, Brian E.	1
Coffman, William E.	1
More ▼