ERIC - Search Results

Showing 1 to 15 of 16 results

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Multiple Linking in Equating and Random Scale Drift. Research Report. ETS RR-11-46

Peer reviewed

Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011

Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…

Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

Accumulative Equating Error after a Chain of Linear Equatings

Peer reviewed

Guo, Hongwen – Psychometrika, 2010

After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…

Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

The Behavior Of Linking Items In Test Equating. CSE Report 630

Haertel, Edward H. – US Department of Education, 2004

Large-scale testing programs often require multiple forms to maintain test security over time or to enable the measurement of change without repeating the identical questions. The comparability of scores across forms is consequential: Students are admitted to colleges based on their test scores, and the meaning of a given scale score one year …

Descriptors: Measurement, Testing Programs, Equated Scores, Test Use

An Alternative to Equating with Small Samples in the Non-Equivalent Groups Anchor Test Design. Research Report. ETS RR-06-27

Peer reviewed

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006

This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…

Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias

Use versus Nonuse of Repeater Examinees in Common Item Linear Equating with Nonrandom Groups.

Cope, Ronald T. – 1985

This study considers the use of repeaters when test equating. The subjects consist of five groups of applicants to a professional certification program. Each group comprises first time examinees and repeaters. The procedures include a common item linear equating with nonrandom groups, use of equating chains, and the use of total examinee group…

Descriptors: Certification, Equated Scores, Measurement Techniques, Postsecondary Education

Comparability of Test Scores for the Same Individual: Implications for Vertical Equating.

Forster, Fred; Karr, Chad – 1988

A method for equating test scores between two standardized achievement testing programs was developed. The first test was the Survey of Basic Skills (SBS) published by Science Research Associates. The second was the Tests of Individual Performance (TIP) of the Portland Public Schools in Oregon. Scores reported in Rasch units (RIT) from the TIP…

Descriptors: Achievement Tests, Elementary Education, Elementary School Students, Equated Scores

The Determination of Empirical Standard Errors of Equating the Scores on SAT-Verbal and SAT-Mathematical.

Angoff, William H. – 1991

An attempt was made to evaluate the standard error of equating (at the mean of the scores) in an ongoing testing program. The interest in estimating the empirical standard error of equating is occasioned by some discomfort with the error normally reported for test scores. Data used for this evaluation came from the Admissions Testing Program of…

Descriptors: College Entrance Examinations, Equated Scores, Error of Measurement, High School Students

Using Performance Standards to Link Statewide Achievement Results to NAEP.

Peer reviewed

Waltman, Kristie K. – Journal of Educational Measurement, 1997

A socially moderated link was established between statewide achievement results and the National Assessment of Educational Progress (NAEP) by using the same achievement level descriptions in an Iowa Test of Basic Skills standard-setting and an NAEP standard-setting study. A statistically moderated link was established through an equipercentile…

Descriptors: Academic Achievement, Achievement Tests, Equated Scores, National Surveys

Cautionary Observations on Reliability and Equating of Forms in High Stakes Performance Assessment: The Problem of Granularity.

Cope, Ronald T. – 1995

This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…

Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests

The Use of Rasch Based Scale in a Criterion Referenced Evaluation for the State Student Compensatory Education Program in Dade County, Florida Public Schools.

Baker, Jean; Wongbundhit, Yuwadee – 1984

This study was designed to illustrate the use of the Rasch model procedure to equate the Dade County Compensatory Education Skills Test (DCCEST) to the State Student Assessment Test (SSAT) for both mathematics skills and communications skills. The SSAT was a test developed by the Florida State Department of Education to measure students' level of…

Descriptors: Academic Achievement, Basic Skills, Compensatory Education, Criterion Referenced Tests

Practical Questions about Item Response Models in Large-Scale Assessment Programs.

Legg, Sue M.; Algina, James – 1986

This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…

Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
