ERIC - Search Results

Showing all 13 results

Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

Peer reviewed

Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016

The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012

Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…

Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory

Comparability of Scores from Performance Assessments.

Peer reviewed

Green, Bert F. – Educational Measurement: Issues and Practice, 1995

If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores

Evaluating Cross-Lingual Equating.

Rapp, Joel; Allalouf, Avi – 2002

This study examined the cross-lingual equating process adopted by a large scale testing system in which target language (TL) forms are equated to the source language (SL) forms using a set of translated items. The focus was on evaluating the degree of error inherent in the routine cross-lingual equating of the Verbal Reasoning subtest of the…

Descriptors: College Applicants, College Entrance Examinations, Equated Scores, High Stakes Tests

Using Multiple DIF Statistics with the Same Items Appearing in Different Test Forms.

Kubiak, Anna T.; Cowell, William R. – 1990

A procedure used to average several Mantel-Haenszel delta difference values for an item is described and evaluated. The differential item functioning (DIF) procedure used by the Educational Testing Service (ETS) is based on the Mantel-Haenszel statistical technique for studying matched groups. It is standard procedure at ETS to analyze test items…

Descriptors: Difficulty Level, Elementary Secondary Education, Equated Scores, Item Bias

The Determination of Empirical Standard Errors of Equating the Scores on SAT-Verbal and SAT-Mathematical.

Angoff, William H. – 1991

An attempt was made to evaluate the standard error of equating (at the mean of the scores) in an ongoing testing program. The interest in estimating the empirical standard error of equating is occasioned by some discomfort with the error normally reported for test scores. Data used for this evaluation came from the Admissions Testing Program of…

Descriptors: College Entrance Examinations, Equated Scores, Error of Measurement, High School Students

Research Leading to the Revision of the Format of the Graduate Record Examinations Aptitude Test in October 1981.

Wild, Cheryl L.; And Others – 1982

The research leading to the decisions to revise the Graduate Record Examination Aptitude Test (GRE) (beginning in October 1981) is reviewed. The issues discussed include the format of the test (the timing of each section and the number of sections, the content of the sections--especially the analytical section), the scoring procedure for the GRE,…

Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Graduate Study

Effects of Passage and Item Scrambling on Equating Relationships.

Peer reviewed

Harris, Deborah J. – Applied Psychological Measurement, 1991

Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)

Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores

Effects of Item Order and Context on Estimation of NAEP Reading Proficiency.

Peer reviewed

Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991

Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…

Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)

Scoring Issues in Selected Statewide Assessment Programs Using Non-Multiple-Choice Formats.

Kahl, Stuart R. – 1995

Although few question the positive impacts alternative forms of assessment can have on instruction, concerns about the psychometric quality of data obtained from such assessments are taking their toll. Scoring issues are at the heart of many of these concerns. This paper addresses the causes of these concerns: misinformation about psychometric…

Descriptors: Alternative Assessment, Educational Assessment, Equated Scores, Performance Based Assessment

Some Issues in Free Response Testing.

Pollack, Judith M. – 1990

This paper summarizes an investigation of applications and issues in free response (FR) testing during 1989. It draws on ideas from the results of the National Educational Longitudinal Study 1988 (NELS:88) field test, a seminar series at the Educational Testing Service (ETS), working papers prepared for several FR testing applications, and…

Descriptors: Comparative Analysis, Costs, Educational Assessment, Elementary Secondary Education

Practical Questions about Item Response Models in Large-Scale Assessment Programs.

Legg, Sue M.; Algina, James – 1986

This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…

Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
