ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Source

Applied Measurement in…

Author

Lee, Won-Chan	2
Antal, Judit	1
Ban, Jae-Chun	1
Bolt, Daniel M.	1
Dallas, Andrew D.	1
DeMars, Christine	1
Eignor, Daniel R.	1
Fan, Fen	1
Fitzpatrick, Anne R.	1
Goodman, Joshua T.	1
Lee, Guemin	1
Melican, Gerald J.	1
Proctor, Thomas P.	1
Yen, Wendy M.	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	6
Reports - Evaluative	2
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Equating with Small and Unbalanced Samples

Peer reviewed

Direct link

Goodman, Joshua T.; Dallas, Andrew D.; Fan, Fen – Applied Measurement in Education, 2020

Recent research has suggested that re-setting the standard for each administration of a small sample examination, in addition to the high cost, does not adequately maintain similar performance expectations year after year. Small-sample equating methods have shown promise with samples between 20 and 30. For groups that have fewer than 20 students,…

Descriptors: Equated Scores, Sample Size, Sampling, Weighted Scores

Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

Peer reviewed

Direct link

Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores

The Effect of Anchor Test Construction on Scale Drift

Peer reviewed

Direct link

Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014

In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…

Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Incomplete Data and Item Parameter Estimates under JMLE and MML Estimates.

Peer reviewed

DeMars, Christine – Applied Measurement in Education, 2002

Simulated items from two test forms using joint maximum likelihood estimation (JMLE) and marginal maximum likelihood estimation (MML) in the vertical equating situation (using an anchor test) when data were nonrandomly missing. Under MML, when the different ability parameters of students were not taken into account, the item difficulty parameters…

Descriptors: Ability, Equated Scores, Estimation (Mathematics), Maximum Likelihood Statistics

Simulation Results of Effects on Linear and Curvilinear Observed- and True-Score Equating Procedures of Matching on a Fallible Criterion.

Peer reviewed

Eignor, Daniel R.; And Others – Applied Measurement in Education, 1990

Two independent replications of a sequence of simulations were conducted to aid in the diagnosis and interpretation of equating differences found between representative (random) and matched (nonrandom) samples for three commonly used conventional observed-score equating procedures and one item-response-theory-based equating procedure. (SLD)

Descriptors: Equated Scores, Item Response Theory, Sampling, Simulation

Evaluating the Effects of Multidimensionality on IRT True-Score Equating.

Peer reviewed

Bolt, Daniel M. – Applied Measurement in Education, 1999

Examined whether the item response theory (IRT) true-score equating method is more adversely affected by the presence of multidimensionality than two conventional equating methods, linear and equipercentile equating. Results of two simulation studies suggest that the IRT method performs as well as the conventional methods when the correlation…

Descriptors: Correlation, Equated Scores, Item Response Theory, Simulation

The Effects of Test Length and Sample Size on the Reliability and Equating of Tests Composed of Constructed-Response Items.

Peer reviewed

Fitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001

Examined the effects of test length and sample size on the alternate forms reliability and equating of simulated mathematics tests composed of constructed response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least 8 6-point…

Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability

Equated Scores	8
Simulation	8
Item Response Theory	5
Test Items	3
True Scores	3
Error of Measurement	2
Evaluation Criteria	2
Evaluation Methods	2
Sample Size	2
Sampling	2
Test Construction	2
Ability	1
Advanced Placement Programs	1
College Students	1
Comparative Analysis	1
Constructed Response	1
Correlation	1
Data Analysis	1
Design	1
Difficulty Level	1
Educational Assessment	1
Educational Testing	1
Estimation (Mathematics)	1
Evaluation Research	1
Matched Groups	1
More ▼