ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Psychometrics	6
Testing	6
Testing Programs	6
Item Response Theory	3
Scoring	3
Evaluation Methods	2
High Stakes Tests	2
Test Items	2
Anatomy	1
Aptitude Tests	1
Cheating	1
College Entrance Examinations	1
Comparative Analysis	1
Cutting Scores	1
Data Analysis	1
Developing Nations	1
Difficulty Level	1
Disabilities	1
Educational Finance	1
Equated Scores	1
Error Patterns	1
Error of Measurement	1
Evaluation Research	1
Foreign Countries	1
Graphs	1
More ▼

Source

Applied Measurement in…	2
Anatomical Sciences Education	1
Education Policy Analysis…	1
Psychometrika	1

Author

Bennett, Randy Elliot	1
Chakwera, Elias	1
Gilliland, Kurt O.	1
Guo, Hongwen	1
Kernick, Edward T.	1
Khembo, Dafter	1
Meyers, Jason L.	1
Miller, G. Edward	1
Puhan, Gautam	1
Royal, Kenneth D.	1
Sireci, Stephen G.	1
Way, Walter D.	1
More ▼

Publication Type

Journal Articles	5
Reports - Evaluative	3
Reports - Research	2
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Malawi	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

Accumulative Equating Error after a Chain of Linear Equatings

Peer reviewed

Direct link

Guo, Hongwen – Psychometrika, 2010

After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…

Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Direct link

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

High-Stakes Testing in the Warm Heart of Africa: The Challenges and Successes of the Malawi National Examinations Board

Peer reviewed
PDF on ERIC

Download full text

Chakwera, Elias; Khembo, Dafter; Sireci, Stephen G. – Education Policy Analysis Archives, 2004

In the United States, tests are held to high standards of quality. In developing countries such as Malawi, psychometricians must deal with these same high standards as well as several additional pressures such as widespread cheating, test administration difficulties due to challenging landscapes and poor resources, difficulties in reliably scoring…

Descriptors: Testing Programs, Testing, High Stakes Tests, Measurement

The Psychometric Characteristics of the SAT for Nine Handicapped Groups. Studies of Admissions Testing and Handicapped People. Report No. 3.

Download full text

Bennett, Randy Elliot; And Others – 1985

This study examined the psychometric characteristics of the Scholastic Aptitude Test (SAT) administered under special conditions for nine handicapped groups. Four psychometric characteristics were studied: level of test performance, test reliability, speededness, and extent of unexpected differential item performance. Psychometric comparisons were…

Descriptors: Aptitude Tests, College Entrance Examinations, Disabilities, Hearing Impairments