ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	23

Descriptor

Comparative Analysis	28
Equated Scores	28
Evaluation Methods	28
Item Response Theory	12
Measurement Techniques	10
Educational Testing	9
Test Interpretation	9
Educational Assessment	8
Foreign Countries	8
Psychometrics	8
Test Items	8
High Stakes Tests	6
Test Use	6
Testing Problems	6
Correlation	5
Definitions	5
Predictive Measurement	5
Scaling	5
Test Theory	5
Classification	4
Evaluation Criteria	4
Item Analysis	4
Program Effectiveness	4
Achievement Tests	3
College Entrance Examinations	3
More ▼

Source

Measurement:…	5
Applied Psychological…	4
ProQuest LLC	4
ETS Research Report Series	2
ACT, Inc.	1
Applied Measurement in…	1
Australian Educational…	1
College Board	1
Educational Research and…	1
Journal of Educational…	1
Ministerial Council on…	1
Research Papers in Education	1
Stanford Center for Education…	1
More ▼

Publication Type

Journal Articles	16
Reports - Research	12
Opinion Papers	5
Reports - Evaluative	5
Dissertations/Theses -…	4
Numerical/Quantitative Data	3
Speeches/Meeting Papers	2
Reports - Descriptive	1

Education Level

Elementary Secondary Education	8
Higher Education	4
Postsecondary Education	3
High Schools	2
Elementary Education	1
Grade 4	1
Grade 6	1
Secondary Education	1

Audience

Policymakers	1
Researchers	1

Location

Australia	3
United Kingdom	3
United Kingdom (England)	3
United States	3
United Kingdom (Wales)	2
Missouri (Saint Louis)	1

Laws, Policies, & Programs

Education Consolidation…	1
Elementary and Secondary…	1
Hawkins Stafford Act 1988	1

Assessments and Surveys

SAT (College Admission Test)	4
Advanced Placement…	2
ACT Assessment	1
Iowa Tests of Educational…	1
National Merit Scholarship…	1
Praxis Series	1
Preliminary Scholastic…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

Equating a Large-Scale Writing Assessment Using Pairwise Comparisons of Performances

Peer reviewed

Direct link

Humphry, Stephen M.; McGrane, Joshua A. – Australian Educational Researcher, 2015

This paper presents a method for equating writing assessments using pairwise comparisons which does not depend upon conventional common-person or common-item equating designs. Pairwise comparisons have been successfully applied in the assessment of open-ended tasks in English and other areas such as visual art and philosophy. In this paper,…

Descriptors: Writing Evaluation, Evaluation Methods, Comparative Analysis, Writing Tests

Evaluating Equity at the Local Level Using Bootstrap Tests. Research Report 2016-4

Download full text

Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016

Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…

Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference

Assessing the Impact of Characteristics of the Test, Common-Items, and Examinees on the Preservation of Equity Properties in Mixed-Format Test Equating

Direct link

Wolf, Raffaela – ProQuest LLC, 2013

Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…

Descriptors: Testing, Item Response Theory, Equated Scores, Test Items

Observed Score and True Score Equating Procedures for Multidimensional Item Response Theory

Peer reviewed

Direct link

Brossman, Bradley G.; Lee, Won-Chan – Applied Psychological Measurement, 2013

The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the multidimensional item response theory (MIRT) framework. Three equating procedures--two observed score procedures and one true score procedure--were created and described in detail. One observed score procedure was…

Descriptors: Equated Scores, True Scores, Item Response Theory, Mathematics Tests

Linking U.S. School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017

There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…

Descriptors: School Districts, Scores, Statistical Distributions, Database Design

Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

Peer reviewed

Direct link

He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei – Applied Psychological Measurement, 2013

Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

Descriptors: Regression (Statistics), Item Response Theory, Test Items, Equated Scores

Robust Scale Transformation Methods in IRT True Score Equating under Common-Item Nonequivalent Groups Design

Direct link

He, Yong – ProQuest LLC, 2013

Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…

Descriptors: Test Items, Regression (Statistics), Simulation, Comparative Analysis

Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis

Comparison of Kernel Equating and Item Response Theory Equating Methods

Direct link

Meng, Yu – ProQuest LLC, 2012

The kernel method of test equating is a unified approach to test equating with some advantages over traditional equating methods. Therefore, it is important to evaluate in a comprehensive way the usefulness and appropriateness of the Kernel equating (KE) method, as well as its advantages and disadvantages compared with several popular item…

Descriptors: Equated Scores, Evaluation Methods, Item Response Theory, Comparative Analysis

Two Approaches for Using Multiple Anchors in NEAT Equating: A Description and Demonstration

Peer reviewed

Direct link

Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Applied Psychological Measurement, 2011

Nonequivalent groups with anchor test (NEAT) equating functions that use a single anchor can have accuracy problems when the groups are extremely different and/or when the anchor weakly correlates with the tests being equated. Proposals have been made to address these issues by incorporating more than one anchor into NEAT equating functions. These…

Descriptors: Equated Scores, Tests, Comparative Analysis, Correlation

Impact of Matched Samples Equating Methods on Equating Accuracy and the Adequacy of Equating Assumptions

Direct link

Powers, Sonya Jean – ProQuest LLC, 2010

When test forms are administered to examinee groups that differ in proficiency, equating procedures are used to disentangle group differences from form differences. This dissertation investigates the extent to which equating results are population invariant, the impact of group differences on equating results, the impact of group differences on…

Descriptors: Evidence, Advanced Placement, Effect Size, True Scores

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Download full text

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Evaluating the Rank-Ordering Method for Standard Maintaining

Peer reviewed

Direct link

Bramley, Tom; Gill, Tim – Research Papers in Education, 2010

The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…

Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2

Chen, Hanwei	2
Cui, Zhongmin	2
He, Yong	2
Lee, Won-Chan	2
Liu, Jinghua	2
Anderson, Judith I.	1
Baird, Jo-Anne	1
Ban, Jae-Chun	1
Bramley, Tom	1
Brossman, Bradley G.	1
Carey, Jill	1
Cresswell, Mike	1
Curley, Edward	1
DeCarlo, Lawrence T.	1
Deng, Weiling	1
Donovan, Jenny	1
Fang, Yu	1
Fraillon, Julian	1
Gao, Xiaohong	1
Gill, Tim	1
Hills, John R.	1
Ho, Andrew D.	1
House, Gary D.	1
Hu, Huiqin	1
More ▼