ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	21

Descriptor

Comparative Analysis	37
Equated Scores	37
Test Format	37
Test Items	16
Item Response Theory	13
Difficulty Level	9
Statistical Analysis	9
Simulation	8
College Entrance Examinations	7
Error of Measurement	7
Mathematics Tests	6
Multiple Choice Tests	6
Sample Size	6
Scores	6
Test Construction	6
Computer Assisted Testing	5
Correlation	5
High School Students	5
High Schools	5
Raw Scores	5
Foreign Countries	4
Gender Differences	4
Item Analysis	4
Language Tests	4
Models	4
More ▼

Source

ETS Research Report Series	7
ProQuest LLC	6
Educational Measurement:…	3
Journal of Educational…	2
ACT, Inc.	1
College Entrance Examination…	1
Discover Education	1
Educational Sciences: Theory…	1
Language Testing	1
Pearson	1
TEFLIN Journal: A publication…	1
More ▼

Publication Type

Reports - Research	18
Journal Articles	16
Reports - Evaluative	8
Speeches/Meeting Papers	7
Dissertations/Theses -…	6
Reports - Descriptive	4
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Secondary Education	1
Grade 12	1
High Schools	1
Secondary Education	1

Audience

Location

Canada	1
Indonesia	1
Israel	1
Nigeria	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	4
ACT Assessment	3
Advanced Placement…	2
Graduate Record Examinations	2
English Proficiency Test	1
Graduate Management Admission…	1
Praxis Series	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Impact of Multidimensionality on Unidimensional IRT Linking and Equating Methods

Direct link

Uk Hyun Cho – ProQuest LLC, 2024

The present study investigates the influence of multidimensionality on linking and equating in a unidimensional IRT. Two hypothetical multidimensional scenarios are explored under a nonequivalent group common-item equating design. The first scenario examines test forms designed to measure multiple constructs, while the second scenario examines a…

Descriptors: Item Response Theory, Classification, Correlation, Test Format

Test Score Equating of Multiple-Choice Mathematics Items: Techniques from Characteristic Curve of Modern Psychometric Theory

Peer reviewed

Direct link

Musa Adekunle Ayanwale – Discover Education, 2023

Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…

Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format

An Investigation of Differential Mode Effects When Comparing Paper and Online ACT Testing. ACT Research & Policy. Technical Brief

Download full text

Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020

In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…

Descriptors: Test Format, Reading Tests, Language Tests, English

The Equivalence of TOEP Forms

Peer reviewed

Direct link

Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019

This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC) and co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

Effect of Differential Item Functioning on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Kabasakal, Kübra Atalay; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2015

This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…

Descriptors: Test Bias, Equated Scores, Item Response Theory, Simulation

Assessing the Impact of Characteristics of the Test, Common-Items, and Examinees on the Preservation of Equity Properties in Mixed-Format Test Equating

Direct link

Wolf, Raffaela – ProQuest LLC, 2013

Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…

Descriptors: Testing, Item Response Theory, Equated Scores, Test Items

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Population Invariance of Vertical Scaling Results

Direct link

Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012

The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…

Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests

Evaluating Subpopulation Invariance of Linking Functions to Determine the Anchor Composition for a Mixed-Format Test. Research Report. ETS RR-09-36

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2009

We examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. We derived linking functions in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b) a mix of…

Descriptors: Test Format, Equated Scores, Test Items, Multiple Choice Tests

NCME 2007 Presidential Address: The Concordance Table--An Invitation to Misuse Test Scores

Peer reviewed

Direct link

Eignor, Daniel R. – Educational Measurement: Issues and Practice, 2008

This article discusses a particular type of concordance table and the potential for test score misuse that may result from employing such a table. The concordance that is discussed is typically created between scores on different, nonequatable versions of a test that share the same or close to the same test title. These concordance tables often…

Descriptors: Scores, Tables (Data), Comparative Analysis, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3

Kim, Sooyeon	4
Harris, Deborah J.	2
Kolen, Michael J.	2
Liu, Jinghua	2
Livingston, Samuel A.	2
Schaeffer, Gary A.	2
Wainer, Howard	2
Walker, Michael E.	2
Andrews, Benjamin James	1
Apino, Ezi	1
Binici, Salih	1
Eignor, Daniel R.	1
Fitzpatrick, Steven J.	1
Green, Bert F.	1
Haberman, Shelby	1
Hanson, Bradley A.	1
Ito, Kyoko	1
Kabasakal, Kübra Atalay	1
Kelecioglu, Hülya	1
Lawrence, Ida M.	1
Lee, Yi-Hsuan	1
Liao, Chi-Wen	1
Low, Albert C.	1
Luzzo, Darrell Anthony	1
More ▼