ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	53

Descriptor

Equated Scores	95
Simulation	95
Item Response Theory	45
Test Items	35
Comparative Analysis	27
Error of Measurement	25
Statistical Analysis	24
Sample Size	23
Difficulty Level	19
Sampling	17
Test Format	17
Evaluation Methods	13
Test Construction	12
True Scores	12
Estimation (Mathematics)	10
Models	10
Scores	10
Correlation	9
Methods	8
Accuracy	7
Scaling	7
Test Length	7
Computation	6
Item Analysis	6
Psychometrics	6
More ▼

Publication Type

Journal Articles	65
Reports - Research	51
Reports - Evaluative	30
Speeches/Meeting Papers	11
Dissertations/Theses -…	8
Reports - Descriptive	4
Numerical/Quantitative Data	3
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1

Education Level

Elementary Education	2
Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Florida	1
Singapore	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	4
ACT Assessment	2
SAT (College Admission Test)	2
Advanced Placement…	1
Armed Services Vocational…	1
Florida Comprehensive…	1
Iowa Tests of Basic Skills	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 95 results Save | Export

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Equating with Small and Unbalanced Samples

Peer reviewed

Direct link

Goodman, Joshua T.; Dallas, Andrew D.; Fan, Fen – Applied Measurement in Education, 2020

Recent research has suggested that re-setting the standard for each administration of a small sample examination, in addition to the high cost, does not adequately maintain similar performance expectations year after year. Small-sample equating methods have shown promise with samples between 20 and 30. For groups that have fewer than 20 students,…

Descriptors: Equated Scores, Sample Size, Sampling, Weighted Scores

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Download full text

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Investigating Test Equating Methods in Small Samples through Various Factors

Peer reviewed
PDF on ERIC

Download full text

Asiret, Semih; Sünbül, Seçil Ömür – Educational Sciences: Theory and Practice, 2016

In this study, equating methods for random group design using small samples through factors such as sample size, difference in difficulty between forms, and guessing parameter was aimed for comparison. Moreover, which method gives better results under which conditions was also investigated. In this study, 5,000 dichotomous simulated data…

Descriptors: Equated Scores, Sample Size, Difficulty Level, Guessing (Tests)

Section Preequating under the Equivalent Groups Design without IRT

Peer reviewed

Direct link

Guo, Hongwen; Puhan, Gautam – Journal of Educational Measurement, 2014

In this article, we introduce a section preequating (SPE) method (linear and nonlinear) under the randomly equivalent groups design. In this equating design, sections of Test X (a future new form) and another existing Test Y (an old form already on scale) are administered. The sections of Test X are equated to Test Y, after adjusting for the…

Descriptors: Equated Scores, Correlation, Simulation, Testing

Equating with Miditests Using IRT

Peer reviewed

Direct link

Fitzpatrick, Joseph; Skorupski, William P. – Journal of Educational Measurement, 2016

The equating performance of two internal anchor test structures--miditests and minitests--is studied for four IRT equating methods using simulated data. Originally proposed by Sinharay and Holland, miditests are anchors that have the same mean difficulty as the overall test but less variance in item difficulties. Four popular IRT equating methods…

Descriptors: Difficulty Level, Test Items, Comparative Analysis, Test Construction

A Stepwise Test Characteristic Curve Method to Detect Item Parameter Drift

Peer reviewed

Direct link

Guo, Rui; Zheng, Yi; Chang, Hua-Hua – Journal of Educational Measurement, 2015

An important assumption of item response theory is item parameter invariance. Sometimes, however, item parameters are not invariant across different test administrations due to factors other than sampling error; this phenomenon is termed item parameter drift. Several methods have been developed to detect drifted items. However, most of the…

Descriptors: Item Response Theory, Test Items, Evaluation Methods, Equated Scores

Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

Peer reviewed

Direct link

Andersson, Björn – Journal of Educational Measurement, 2016

In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…

Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests

Optimal Bandwidth Selection in Observed-Score Kernel Equating

Peer reviewed

Direct link

Häggström, Jenny; Wiberg, Marie – Journal of Educational Measurement, 2014

The selection of bandwidth in kernel equating is important because it has a direct impact on the equated test scores. The aim of this article is to examine the use of double smoothing when selecting bandwidths in kernel equating and to compare double smoothing with the commonly used penalty method. This comparison was made using both an equivalent…

Descriptors: Equated Scores, Data Analysis, Comparative Analysis, Simulation

Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches

Peer reviewed

Direct link

Kopf, Julia; Zeileis, Achim; Strobl, Carolin – Educational and Psychological Measurement, 2015

Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…

Descriptors: Test Items, Equated Scores, Test Bias, Item Response Theory

Adjoined Piecewise Linear Approximations (APLAs) for Equating: Accuracy Evaluations of a Postsmoothing Equating Method

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2013

The purpose of this study was to evaluate the use of adjoined and piecewise linear approximations (APLAs) of raw equipercentile equating functions as a postsmoothing equating method. APLAs are less familiar than other postsmoothing equating methods (i.e., cubic splines), but their use has been described in historical equating practices of…

Descriptors: Equated Scores, Accuracy, Simulation, Comparative Analysis

Psychometric Consequences of Subpopulation Item Parameter Drift

Peer reviewed

Direct link

Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing

Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

Peer reviewed

Direct link

Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores

Effect of Item Response Theory (IRT) Model Selection on Testlet-Based Test Equating. Research Report. ETS RR-14-19

Peer reviewed
PDF on ERIC

Download full text

Cao, Yi; Lu, Ru; Tao, Wei – ETS Research Report Series, 2014

The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…

Descriptors: Item Response Theory, Equated Scores, Test Items, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Journal of Educational…	14
ETS Research Report Series	11
Applied Psychological…	9
Applied Measurement in…	8
ProQuest LLC	8
Educational and Psychological…	4
Journal of Educational and…	4
Educational Sciences: Theory…	3
International Journal of…	2
Journal of Applied Measurement	2
Psychometrika	2
ACT, Inc.	1
American Institutes for…	1
College Board	1
Educational Testing Service	1
Journal of Experimental…	1
Measurement:…	1
Practical Assessment,…	1
Research Matters	1
Studies in Educational…	1
Teaching Statistics: An…	1
More ▼

Moses, Tim	8
Wang, Tianyou	5
Holland, Paul	4
Kolen, Michael J.	4
Cui, Zhongmin	3
Dorans, Neil J.	3
Hanson, Bradley A.	3
Lee, Won-Chan	3
Li, Yuan H.	3
Yen, Wendy M.	3
Antal, Judit	2
Eignor, Daniel R.	2
Goodman, Joshua T.	2
Harris, Deborah J.	2
He, Yong	2
Holland, Paul W.	2
Kelecioglu, Hülya	2
Lissitz, Robert W.	2
Liu, Jinghua	2
Sinharay, Sandip	2
Zhang, Wenmin	2
von Davier, Alina A.	2
Albano, Anthony	1
Andersson, Björn	1
More ▼