ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	9

Descriptor

Evaluation Methods	14
Psychometrics	14
Testing Programs	14
Educational Assessment	7
Student Evaluation	7
Test Items	5
Educational Testing	4
Elementary Secondary Education	4
Item Response Theory	4
Standardized Tests	4
Test Validity	4
Comparative Analysis	3
Computer Assisted Testing	3
Evaluation Problems	3
Evaluation Research	3
Mathematics Tests	3
Measurement Techniques	3
Program Effectiveness	3
Scoring	3
Testing Accommodations	3
Testing Problems	3
Academic Achievement	2
Academic Standards	2
Achievement Tests	2
Adaptive Testing	2
More ▼

Source

Journal of Applied Testing…	3
Anatomical Sciences Education	1
Applied Measurement in…	1
Educational and Psychological…	1
International Journal of…	1
OECD Publishing	1
Regional Educational…	1

Publication Type

Journal Articles	7
Reports - Evaluative	5
Reports - Research	5
Reports - Descriptive	4
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	4
Adult Education	1
Grade 10	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Higher Education	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Researchers

Location

Connecticut	1
Dominica	1
Grenada	1
Massachusetts	1
Saint Lucia	1
Saint Vincent and the…	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Program for International…	2
California Achievement Tests	1
Progress in International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

A Review of International Large-Scale Assessments in Education: Assessing Component Skills and Collecting Contextual Data. PISA for Development

Direct link

Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015

This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…

Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Differential Item Functioning Analysis Using Rasch Item Information Functions

Peer reviewed

Direct link

Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009

Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment

A Proposed Framework of Test Administration Methods

Peer reviewed

Direct link

Thompson, Nathan A. – Journal of Applied Testing Technology, 2008

The widespread application of personal computers to educational and psychological testing has substantially increased the number of test administration methodologies available to testing programs. Many of these mediums are referred to by their acronyms, such as CAT, CBT, CCT, and LOFT. The similarities between the acronyms and the methods…

Descriptors: Testing Programs, Psychological Testing, Classification, Educational Testing

Construct Equivalence across Grades in a Vertical Scale for a K-12 Large-Scale Reading Assessment

Peer reviewed

Direct link

Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009

In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…

Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics

Matching the Judgmental Task with Standard Setting Panelist Expertise: The Item-Descriptor (ID) Matching Method

Peer reviewed

Direct link

Ferrara, Steve; Perie, Marianne; Johnson, Eugene – Journal of Applied Testing Technology, 2008

Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…

Descriptors: Test Results, Test Content, Testing Programs, Educational Testing

The Predictive Validity of Selected Benchmark Assessments Used in the Mid-Atlantic Region. Issues & Answers. REL 2007-No. 017

Peer reviewed
PDF on ERIC

Download full text

Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007

This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…

Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness

Computer-Based Signing Accommodations: Comparing a Recorded Human with an Avatar

Peer reviewed

Direct link

Russell, Michael; Kavanaugh, Maureen; Masters, Jessica; Higgins, Jennifer; Hoffmann, Thomas – Journal of Applied Testing Technology, 2009

Many students who are deaf or hard-of-hearing are eligible for a signing accommodation for state and other standardized tests. The signing accommodation, however, presents several challenges for testing programs that attempt to administer tests under standardized conditions. One potential solution for many of these challenges is the use of…

Descriptors: Testing Programs, Student Attitudes, Standardized Tests, Academic Achievement

OCOD-CTTP Test Evaluation Report.

Download full text

Shorey, Leonard – 1991

Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…

Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries

Download full text

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Strategies for Statewide Student Assessment. Policy Briefs, Number 17.

Download full text

Moody, David – 1991

Traditional standardized tests of basic skills are no longer considered meaningful by many leading authorities in educational measurement. Alternative approaches are not yet fully developed, although many efforts are being made. This paper explores the issues surrounding student assessment in the context of existing and evolving state practices,…

Descriptors: Achievement Tests, Alternative Assessment, Basic Skills, Educational Assessment

Test and Measurement Expert Opinions: A Dialogue about Testing Students with Disabilities Out of Level in Large-Scale Assessments. Out-of-Level Testing Report.

Download full text

Minnema, Jane; Thurlow, Martha; Bielinski, John – 2002

Two focus groups of test and measurement experts were held to explore the use of out-of-level testing for students with disabilities. The participants (n=17) included state and federal level assessment personnel, test company employees, and university professors. A content analysis of the narrative results indicated that there was no clear…

Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Disabilities

Scale Score Comparability across Two Levels of a Norm-Referenced Math Computation Test for Students with Learning Disabilities. Out-of-Level Testing Report.

Download full text

Bielinski, John; Thurlow, Martha; Minnema, Jane; Scott, Jim – 2002

In this study, special education teachers identified students with learning disabilities who were working on math skills usually taught two grades below the grade in which the student was enrolled. Each student (n=33) took two levels of the MAT/7 math computation test, an on-grade test, and an out-of-level test intended for students two grades…

Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Educational Assessment

Bielinski, John	2
Minnema, Jane	2
Thurlow, Martha	2
Brown, Richard S.	1
Cook, Linda L.	1
Coughlin, Ed	1
Cresswell, John	1
Ferrara, Steve	1
Gilliland, Kurt O.	1
Higgins, Jennifer	1
Hoffmann, Thomas	1
Jiao, Hong	1
Johnson, Eugene	1
Kavanaugh, Maureen	1
Kernick, Edward T.	1
Mapuranga, Raymond	1
Masters, Jessica	1
Moody, David	1
Perie, Marianne	1
Petersen, Nancy S.	1
Puhan, Gautam	1
Royal, Kenneth D.	1
Russell, Michael	1
Schwantner, Ursula	1
Scott, Jim	1
More ▼