ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	6

Descriptor

Comparative Analysis	16
Equated Scores	16
Testing Programs	16
Educational Assessment	8
Elementary Secondary Education	4
Item Response Theory	4
State Programs	4
Statistical Analysis	4
Test Results	4
Academic Achievement	3
Achievement Tests	3
Correlation	3
Error of Measurement	3
Mathematics Tests	3
Sample Size	3
Simulation	3
Standardized Tests	3
Test Construction	3
Test Format	3
Test Items	3
Test Reliability	3
Demography	2
Evaluation Methods	2
Language Tests	2
National Surveys	2
More ▼

Source

ETS Research Report Series	3
ACT, Inc.	1
Applied Measurement in…	1
Educational Measurement:…	1
Journal of Experimental…	1
Language Testing	1
ProQuest LLC	1

Publication Type

Reports - Research	8
Journal Articles	7
Reports - Evaluative	7
Speeches/Meeting Papers	4
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

California	1
Delaware	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Equating in Small-Scale Language Testing Programs

Peer reviewed

Direct link

LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan – Language Testing, 2017

Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…

Descriptors: Language Tests, Equated Scores, Testing Programs, Comparative Analysis

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A. – ETS Research Report Series, 2012

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…

Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling

Relationship between Air Traffic Selection and Training (AT-SAT)) Battery Test Scores and Composite Scores in the Initial en Route Air Traffic Control Qualification Training Course at the Federal Aviation Administration (FAA) Academy

Direct link

Kelley, Ronald Scott – ProQuest LLC, 2012

Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…

Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Download full text

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

Standard Errors of the Tucker Method for Linear Equating under the Common Item Nonrandom Groups Design. ACT Technical Bullegin Number 44.

Download full text

Kolen, Michael J. – 1984

Large sample standard errors for the Tucker method of linear equating under the common item nonrandom groups design are derived under normality assumptions as well as under less restrictive assumptions. Standard errors of Tucker equating are estimated using the bootstrap method described by Efron. The results from different methods are compared…

Descriptors: Certification, Comparative Analysis, Equated Scores, Error of Measurement

An Alternative to Equating with Small Samples in the Non-Equivalent Groups Anchor Test Design. Research Report. ETS RR-06-27

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006

This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…

Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias

Comparability of Scores from Performance Assessments.

Peer reviewed

Green, Bert F. – Educational Measurement: Issues and Practice, 1995

If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Technical Issues in Linking Assessments across Languages.

Download full text

Sireci, Stephen G. – 1996

Test developers continue to struggle with the technical and logistical problems inherent in assessing achievement across different languages. Many testing programs offer separate language versions of a test to evaluate the achievement of examinees in different language groups. However, comparison of individuals who took different language versions…

Descriptors: Achievement Tests, Bilingual Education, Comparative Analysis, Educational Assessment

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Download full text

Kiplinger, Vonda L.; Linn, Robert L. – 1994

Recently, several states have expressed interest in linking their statewide assessments to the National Assessment of Educational Progress (NAEP) in the hope that, through equating, they can be compared to national results. This study considers the degree to which existing statewide assessments may be linked to NAEP, without violating the basic…

Descriptors: Comparative Analysis, Educational Assessment, Elementary Secondary Education, Equated Scores

A Comparison of the Results from Two Equating Designs for Performance-Based Student Assessments.

Download full text

Baghi, Heibatollah; And Others – 1995

Issues related to linking tests with constructed response items were explored, specifically by comparing single-group and anchor-test designs to link raw scores from alternate forms of performance-based student assessments in the context of Delaware's assessment program using performance-based assessment. This study explored use of the two test…

Descriptors: Comparative Analysis, Constructed Response, Correlation, Educational Assessment

Some Issues in Free Response Testing.

Pollack, Judith M. – 1990

This paper summarizes an investigation of applications and issues in free response (FR) testing during 1989. It draws on ideas from the results of the National Educational Longitudinal Study 1988 (NELS:88) field test, a seminar series at the Educational Testing Service (ETS), working papers prepared for several FR testing applications, and…

Descriptors: Comparative Analysis, Costs, Educational Assessment, Elementary Secondary Education

A Proposal for a "SIR" Adjusted Index of Educational Competence.

Download full text

Mushkin, Selma J. – 1973

The increasing use of educational performance or outcome measurements for a range of policy purposes points to new procedures for adjusting data for population composition. The purposes include: program formulation, budget resource allocation, grant-in-aid designs, performance incentive payments, consumer information for school selection, and…

Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Demography

Measuring Differences among Non-Randomized Groups: an Epidemiological Model for Identifying Successful School Programs.

Peer reviewed

Marascuilo, Leonard A. – Journal of Experimental Education, 1979

The utility of the biomedical model of adjusted statistics is demonstrated. The model is recommended for use by educational researchers to randomize subjects for a more accurate estimate of school programs' success or failure when compared across classrooms or other units. (Author/MH)

Descriptors: Academic Achievement, Analysis of Variance, Comparative Analysis, Criterion Referenced Tests

Previous Page | Next Page »

Pages: 1 | 2

Kiplinger, Vonda L.	2
Linn, Robert L.	2
von Davier, Alina A.	2
Baghi, Heibatollah	1
Chen, Hanwei	1
Cui, Zhongmin	1
Gao, Xiaohong	1
Green, Bert F.	1
Gutierrez Arvizu, Maria Nelly	1
Haberman, Shelby	1
Isbell, Daniel	1
Jamieson, Joan	1
Kelley, Ronald Scott	1
Kim, Sooyeon	1
Kolen, Michael J.	1
LaFlair, Geoffrey T.	1
Lee, Yi-Hsuan	1
Marascuilo, Leonard A.	1
May, L. D. Nicolas	1
Mushkin, Selma J.	1
Pollack, Judith M.	1
Qian, Jiahe	1
Sireci, Stephen G.	1
Wang, Lin	1
More ▼