ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Difficulty Level	11
Equated Scores	11
Test Length	11
Test Items	6
Item Response Theory	5
Sample Size	5
Comparative Analysis	3
Simulation	3
Test Format	3
True Scores	3
Adults	2
Correlation	2
Latent Trait Theory	2
Statistical Analysis	2
Test Bias	2
Test Construction	2
Test Reliability	2
Accuracy	1
Achievement Tests	1
Adult Basic Education	1
Aptitude Tests	1
Basic Skills	1
Cognitive Ability	1
Cognitive Tests	1
Constructed Response	1
More ▼

Source

ProQuest LLC	2
Applied Measurement in…	1
ETS Research Report Series	1
Educational and Psychological…	1
Journal of Applied Measurement	1

Publication Type

Reports - Research	6
Journal Articles	4
Dissertations/Theses -…	2
Reports - Evaluative	2
Guides - Non-Classroom	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Audience

Researchers

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

Otis Lennon School Ability…

What Works Clearinghouse Rating

Showing all 11 results Save | Export

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Direct link

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

Subscore Equating and Profile Reporting

Peer reviewed

Direct link

Lim, Euijin; Lee, Won-Chan – Applied Measurement in Education, 2020

The purpose of this study is to address the necessity of subscore equating and to evaluate the performance of various equating methods for subtests. Assuming the random groups design and number-correct scoring, this paper analyzed real data and simulated data with four study factors including test dimensionality, subtest length, form difference in…

Descriptors: Equated Scores, Test Length, Test Format, Difficulty Level

Linking Composite Scores: Effects of Anchor Test Length and Content Representativeness. Research Report. ETS RR-16-36

Peer reviewed
PDF on ERIC

Download full text

Lin, Peng; Dorans, Neil; Weeks, Jonathan – ETS Research Report Series, 2016

The nonequivalent groups with anchor test (NEAT) design is frequently used in test score equating or linking. One important assumption of the NEAT design is that the anchor test is a miniversion of the 2 tests to be equated/linked. When the content of the 2 tests is different, it is not possible for the anchor test to be adequately representative…

Descriptors: Equated Scores, Test Length, Test Content, Difficulty Level

Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

Direct link

Lee, Eunjung – ProQuest LLC, 2013

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

Descriptors: Equated Scores, Tests, Comparative Analysis, Item Response Theory

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

An Investigation of Factors Affecting Test Equating in Latent Trait Theory.

Peer reviewed

Sunathong, Surintorn; Schumacker, Randall E.; Beyerlein, Michael M. – Journal of Applied Measurement, 2000

Studied five factors that can affect the equating of scores from two tests onto a common score scale through the simulation and equating of 4,860 item data sets. Findings indicate three statistically significant two-way interactions for common item length and test length, item difficulty standard deviation and item distribution type, and item…

Descriptors: Difficulty Level, Equated Scores, Interaction, Item Response Theory

Characteristics of Anchor Tests.

Download full text

Chang, S. Tai; Bashaw, W. L. – 1984

The purpose of this study was twofold: to investigate to what extent characteristics of anchor tests may affect precision of item calibration, and to estimate to what extent precision of item calibration may be affected by removal of persons whose response patterns deviate from those normally expected from the Rasch one-parameter logistic model.…

Descriptors: Aptitude Tests, Difficulty Level, Equated Scores, Junior High Schools

On Examinee Choice in Educational Testing. GRE Board Professional Report No. 91-17P.

Download full text

Wainer, Howard; Thissen, David – 1994

When an examination consists in whole or part of constructed response test items, it is common practice to allow the examinee to choose a subset of the constructed response questions from a larger pool. It is sometimes argued that, if choice were not allowed, the limitations on domain coverage forced by the small number of items might unfairly…

Descriptors: Constructed Response, Difficulty Level, Educational Testing, Equated Scores

Tailoring Tests to Educational Levels.

Download full text

de Jong, John H. A. L. – 1984

The Netherlands' secondary education system is highly differentiated, with four different school types for four scholastic ability levels. Final examinations must accommodate these four levels, and require a test-independent definition of the intended final ability levels as well as a sample-free evaluation of the range of ability levels at which…

Descriptors: Difficulty Level, Efficiency, Equated Scores, Foreign Countries

Methods for Equating Mental Tests. Interim Report for Period March 1982-October 1984.

Download full text

Gialluca, Kathleen A.; And Others – 1984

In this study, simulated and actual Air Force test data were used to compare the different procedures for equating mental tests: conventional (equipercentile and linear), Item Response Theory (IRT), and strong true-score theory (STST); data collection designs used were single-group, equivalent-groups, and anchor test. Equating transformations were…

Descriptors: Adults, Cognitive Ability, Cognitive Tests, Comparative Analysis

Manual for the USES Basic Occupational Literacy Test. Section 2: Development.

PDF pending restoration

Manpower Administration (DOL), Washington, DC. – 1972

The Basic Occupational Literacy Test (BOLT) was developed as an achievement test of basic skills in reading and arithmetic, for educationally disadvantaged adults. The objective was to develop a test appropriate for this population with regard to content, format, instructions, timing, norms, and difficulty level. A major issue, the use of grade…

Descriptors: Achievement Tests, Adult Basic Education, Adults, Basic Skills

Bashaw, W. L.	1
Beyerlein, Michael M.	1
Chang, S. Tai	1
Debelak, Rudolf	1
Dorans, Neil	1
Fellinghauer, Carolina	1
Gialluca, Kathleen A.	1
Lee, Eunjung	1
Lee, Won-Chan	1
Lim, Euijin	1
Lin, Peng	1
Schumacker, Randall E.	1
Strobl, Carolin	1
Sunathong, Surintorn	1
Sunnassee, Devdass	1
Thissen, David	1
Wainer, Howard	1
Weeks, Jonathan	1
de Jong, John H. A. L.	1
More ▼