Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 3
Since 2016 (last 10 years): 4
Since 2006 (last 20 years): 12
Descriptor
Equated Scores: 21
Error of Measurement: 21
Test Format: 21
Item Response Theory: 10
Comparative Analysis: 7
Simulation: 6
Test Items: 6
Test Construction: 5
College Entrance Examinations: 4
Correlation: 4
Test Theory: 4
Source
ETS Research Report Series: 3
Applied Psychological…: 2
Measurement:…: 2
ProQuest LLC: 2
Applied Measurement in…: 1
International Journal of…: 1
Practical Assessment,…: 1
Psychological Assessment: 1
Author
Hanson, Bradley A.: 2
Sykes, Robert C.: 2
van der Linden, Wim J.: 2
Andrews, Benjamin James: 1
Angoff, William H.: 1
Bielinski, John: 1
Cao, Yi: 1
Christensen, Bruce K.: 1
Cui, Zhongmin: 1
Dogan, Nuri: 1
Dorans, Neil J.: 1
Publication Type
Journal Articles: 11
Reports - Research: 9
Reports - Evaluative: 7
Speeches/Meeting Papers: 5
Dissertations/Theses -…: 2
Reports - Descriptive: 2
Information Analyses: 1
Education Level
Higher Education: 2
Postsecondary Education: 2
Assessments and Surveys
SAT (College Admission Test): 3
ACT Assessment: 1
Advanced Placement…: 1
Praxis Series: 1
Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful consideration is necessary when choosing an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format
Tao, Wei; Cao, Yi – Applied Measurement in Education, 2016
Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…
Descriptors: Item Response Theory, Equated Scores, Test Format, Models
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
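As a rough illustration of the scale linking that underlies IRT-based equating of this kind, the following sketch applies mean/sigma linking to anchor-item difficulty estimates. The b-values are invented, and the study itself may use other linking methods (e.g., Stocking-Lord); this is a minimal sketch, not the report's procedure.

```python
import numpy as np

def mean_sigma_linking(b_new, b_old):
    """Mean/sigma linking: coefficients (A, B) that place the new form's
    theta scale onto the old form's scale via theta* = A*theta + B,
    estimated from anchor-item difficulty (b) parameters."""
    b_new, b_old = np.asarray(b_new), np.asarray(b_old)
    A = np.std(b_old, ddof=1) / np.std(b_new, ddof=1)
    B = np.mean(b_old) - A * np.mean(b_new)
    return A, B

# hypothetical anchor-item difficulties from two separate calibrations
b_new = [-1.2, -0.4, 0.1, 0.8, 1.5]
b_old = [-1.0, -0.2, 0.3, 1.0, 1.7]
A, B = mean_sigma_linking(b_new, b_old)
print(A, B)  # item parameters then transform as b* = A*b + B, a* = a/A
```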
Wang, Wei – ProQuest LLC, 2013
Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Descriptors: Equated Scores, Test Format, Test Items, Test Length
Andrews, Benjamin James – ProQuest LLC, 2011
The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…
Descriptors: Test Format, Advanced Placement, Simulation, True Scores
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
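The traditional equipercentile method this entry discusses can be sketched in a few lines: map each form-X score to the form-Y score holding the same percentile rank. The frequency tables below are invented, and operational equating would add smoothing and the synthetic-population weighting the article debates.

```python
import numpy as np

def percentile_ranks(freqs):
    """Percentile rank at each integer score: percent of examinees below
    the score plus half of those exactly at the score."""
    p = np.asarray(freqs, dtype=float)
    p /= p.sum()
    below = np.concatenate(([0.0], np.cumsum(p)[:-1]))
    return 100 * (below + p / 2)

def equipercentile_equate(freq_x, freq_y):
    """Map each form-X integer score to the form-Y score with the same
    percentile rank, interpolating linearly over Y's rank function."""
    pr_x = percentile_ranks(freq_x)
    pr_y = percentile_ranks(freq_y)
    return np.interp(pr_x, pr_y, np.arange(len(freq_y)))

# invented frequency tables for two 7-point forms (random groups design)
print(equipercentile_equate([1, 4, 10, 15, 10, 4, 1],
                            [2, 6, 12, 13, 8, 3, 1]))
```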
Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008
The single group equating with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small-volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…
Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size
Girard, Todd A.; Christensen, Bruce K. – Psychological Assessment, 2008
The correlation between a short-form (SF) test and its full-scale (FS) counterpart is a mainstay in the evaluation of SF validity. However, in correcting for overlapping error variance in this measure, investigators have overattenuated the validity coefficient through an intuitive misapplication of P. Levy's (1967) formula. The authors of the…
Descriptors: Error of Measurement, Computation, Psychiatric Services, Correlation
Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007
This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…
Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods
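The core step of kernel equating, Gaussian continuization of the discrete score distributions followed by equipercentile matching through the continuized CDFs, can be sketched as follows. This is a simplification (full KE rescales the kernel to preserve the score variance and chooses bandwidths from the data), and all score distributions here are invented, not SAT data.

```python
from math import erf, sqrt

def phi(z):
    """Standard normal CDF."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def kernel_cdf(scores, probs, x, h=0.6):
    """Gaussian-kernel continuization of a discrete score distribution
    (simplified: full KE also rescales to preserve the score variance)."""
    return sum(p * phi((x - s) / h) for s, p in zip(scores, probs))

def ke_equate(scores_x, probs_x, scores_y, probs_y, x, h=0.6):
    """Equipercentile equating through the continuized CDFs: find y with
    F_Y(y) = F_X(x) by bisection."""
    target = kernel_cdf(scores_x, probs_x, x, h)
    lo, hi = min(scores_y) - 4 * h, max(scores_y) + 4 * h
    for _ in range(60):
        mid = (lo + hi) / 2
        if kernel_cdf(scores_y, probs_y, mid, h) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

scores = [0, 1, 2, 3, 4]
probs_x = [0.10, 0.20, 0.40, 0.20, 0.10]
probs_y = [0.15, 0.25, 0.35, 0.17, 0.08]
print(ke_equate(scores, probs_x, scores, probs_y, 2.0))
```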
Dorans, Neil J.; Lawrence, Ida M. – 1988
A procedure for checking the score equivalence of nearly identical editions of a test is described. The procedure employs the standard error of equating (SEE) and utilizes graphical representation of score conversion deviation from the identity function in standard error units. Two illustrations of the procedure involving Scholastic Aptitude Test…
Descriptors: Equated Scores, Error of Measurement, Test Construction, Test Format
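The check described in this entry can be imitated numerically: express each conversion's deviation from the identity function in SEE units and flag large values. The numbers below are toy values, not the SAT data used in the report.

```python
def identity_deviations(conversion, see):
    """z_i = (equated_i - raw_i) / SEE_i for integer raw scores 0, 1, ..."""
    return [(eq - raw) / s for raw, (eq, s) in enumerate(zip(conversion, see))]

conversion = [0.2, 0.9, 2.1, 3.6, 4.0]       # equated scores for raw 0..4
see        = [0.30, 0.25, 0.20, 0.25, 0.35]  # SEE at each raw score
for raw, z in enumerate(identity_deviations(conversion, see)):
    print(f"raw={raw} z={z:+.2f} {'CHECK' if abs(z) > 2 else 'ok'}")
```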

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993
The delta method was used to derive standard errors (SEs) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SEs derived without the normality assumption and bootstrap SEs were very close. The situation with skewed score distributions is also discussed. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling
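The bootstrap comparison mentioned in the abstract can be sketched for simple mean-sigma linear equating under a random groups design. This is a simplification of the Levine methods studied, which operate on anchor-test designs, and all score data below are simulated.

```python
import numpy as np

rng = np.random.default_rng(0)

def linear_equate(x_scores, y_scores, x):
    """Linear equating for random groups: match means and SDs,
    eq(x) = mu_Y + (sd_Y / sd_X) * (x - mu_X)."""
    A = np.std(y_scores, ddof=1) / np.std(x_scores, ddof=1)
    return np.mean(y_scores) + A * (x - np.mean(x_scores))

def bootstrap_see(x_scores, y_scores, x, n_boot=1000):
    """Bootstrap SE of the equated score at raw score x: resample each
    group with replacement, re-equate, take the SD across replicates."""
    reps = [linear_equate(rng.choice(x_scores, size=len(x_scores)),
                          rng.choice(y_scores, size=len(y_scores)), x)
            for _ in range(n_boot)]
    return np.std(reps, ddof=1)

# simulated score data for two randomly equivalent groups
x_scores = rng.normal(30, 6, size=500).round()
y_scores = rng.normal(28, 6, size=500).round()
print(linear_equate(x_scores, y_scores, 30.0),
      bootstrap_see(x_scores, y_scores, 30.0))
```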

Ito, Kyoko; Sykes, Robert C. – 1996
Equating multiple test forms is frequently desired. When multiple forms are linked in a chain of equating, error tends to build up in the process. This paper compares three procedures for equating multiple forms in a common-form design where each school administered, in a spiraled fashion, only a subset of multiple forms. Data used were from a…
Descriptors: Comparative Analysis, Equated Scores, Error of Measurement, Grade 11
Li, Yuan H.; Griffith, William D.; Tam, Hak P. – 1997
This study explores the relative merits of a potentially useful item response theory (IRT) linking design: using a single set of anchor items with fixed common item parameters (FCIP) during the calibration process. An empirical study was conducted to investigate the appropriateness of this linking design using 6 groups of students taking 6 forms…
Descriptors: Ability, Difficulty Level, Equated Scores, Error of Measurement