ERIC - Search Results

Showing 1 to 15 of 16 results

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Multiple Linking in Equating and Random Scale Drift. Research Report. ETS RR-11-46

Peer reviewed

Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011

Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…

Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

Limits on the Accuracy of Linking. Research Report. ETS RR-10-22

Haberman, Shelby J. – Educational Testing Service, 2010

Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable linking accuracy. To illustrate results, a variety of…

Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy

Relationship between Air Traffic Selection and Training (AT-SAT) Battery Test Scores and Composite Scores in the Initial en Route Air Traffic Control Qualification Training Course at the Federal Aviation Administration (FAA) Academy

Kelley, Ronald Scott – ProQuest LLC, 2012

Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…

Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012

Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…

Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory

An Alternative to Equating with Small Samples in the Non-Equivalent Groups Anchor Test Design. Research Report. ETS RR-06-27

Peer reviewed

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006

This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…

Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias

Evaluating Cross-Lingual Equating.

Rapp, Joel; Allalouf, Avi – 2002

This study examined the cross-lingual equating process adopted by a large scale testing system in which target language (TL) forms are equated to the source language (SL) forms using a set of translated items. The focus was on evaluating the degree of error inherent in the routine cross-lingual equating of the Verbal Reasoning subtest of the…

Descriptors: College Applicants, College Entrance Examinations, Equated Scores, High Stakes Tests

Using Multiple DIF Statistics with the Same Items Appearing in Different Test Forms.

Kubiak, Anna T.; Cowell, William R. – 1990

A procedure used to average several Mantel-Haenszel delta difference values for an item is described and evaluated. The differential item functioning (DIF) procedure used by the Educational Testing Service (ETS) is based on the Mantel-Haenszel statistical technique for studying matched groups. It is standard procedure at ETS to analyze test items…

Descriptors: Difficulty Level, Elementary Secondary Education, Equated Scores, Item Bias

New Jersey Statewide Testing System: High School Proficiency Test, 1985-86. Technical Report.

New Jersey State Dept. of Education, Trenton. – 1986

This Technical Report provides descriptions and summary data that assist measurement specialists in assessing the procedures used in developing New Jersey's High School Proficiency Test (HSPT), the technical qualities of the tests, and the statewide results obtained from its use. The data summarized in this report were collected during the various…

Descriptors: Equated Scores, Graduation Requirements, High Schools, Item Analysis

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Practical Questions about Item Response Models in Large-Scale Assessment Programs.

Legg, Sue M.; Algina, James – 1986

This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…

Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores

An Exploratory Study of the Applicability of Item Response Theory Methods to the Graduate Management Admission Test.

Kingston, Neal; And Others – 1985

A necessary prerequisite to the operational use of item response theory (IRT) in any testing program is the investigation of the feasibility of such an approach. This report presents the results of such research for the Graduate Management Admission Test (GMAT). Despite the fact that GMAT data appear to violate a basic assumption of the…

Descriptors: College Entrance Examinations, Computer Software, Correlation, Equated Scores

Generating Parallel Test Forms for Minimum Competency Exams.

Nassif, Paula M.; And Others – 1979

A procedure which employs a method of item substitution based on item difficulty is recommended for developing parallel criterion referenced test forms. This procedure is currently being used in the Florida functional literacy testing program and the Georgia teacher certification testing program. Reasons for developing parallel test forms involve…

Descriptors: Criterion Referenced Tests, Difficulty Level, Equated Scores, Functional Literacy
