ERIC - Search Results

Showing all 9 results

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012

Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…

Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory

An Exploration of the Adequacy of the Rasch Model For the Problem of Vertical Equating

Peer reviewed

Slinde, Jefferey A.; Linn, Robert L. – Journal of Educational Measurement, 1978

Use of the Rasch model for vertical equating of tests is discussed. Although use of the model is promising, empirical results raise questions about the adequacy of the Rasch model. Latent trait models with more parameters may be necessary. (JKS)

Descriptors: Achievement Tests, Difficulty Level, Equated Scores, Higher Education

Using Multiple DIF Statistics with the Same Items Appearing in Different Test Forms.

Kubiak, Anna T.; Cowell, William R. – 1990

A procedure used to average several Mantel-Haenszel delta difference values for an item is described and evaluated. The differential item functioning (DIF) procedure used by the Educational Testing Service (ETS) is based on the Mantel-Haenszel statistical technique for studying matched groups. It is standard procedure at ETS to analyze test items…

Descriptors: Difficulty Level, Elementary Secondary Education, Equated Scores, Item Bias

Cautionary Observations on Reliability and Equating of Forms in High Stakes Performance Assessment: The Problem of Granularity.

Cope, Ronald T. – 1995

This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…

Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests

Practical Questions about Item Response Models in Large-Scale Assessment Programs.

Legg, Sue M.; Algina, James – 1986

This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…

Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores

Generating Parallel Test Forms for Minimum Competency Exams.

Nassif, Paula M.; And Others – 1979

A procedure which employs a method of item substitution based on item difficulty is recommended for developing parallel criterion referenced test forms. This procedure is currently being used in the Florida functional literacy testing program and the Georgia teacher certification testing program. Reasons for developing parallel test forms involve…

Descriptors: Criterion Referenced Tests, Difficulty Level, Equated Scores, Functional Literacy

How Minimal is Minimal?

Bauer, Ernest A.; And Others – 1979

The reading portion of the Michigan Educational Assessment Program (MEAP) was equated to the reading comprehension subtest of the Comprehensive Tests of Basic Skills (CTBS) using the Rasch Model. Both tests were administered to 366 low achieving fourth grade students. MEAP was treated as both a 95-item test and a 19-item (number of objectives…

Descriptors: Academic Standards, Criterion Referenced Tests, Difficulty Level, Educational Objectives