Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Difficulty Level | 7 |
Error of Measurement | 7 |
Test Items | 6 |
Item Response Theory | 5 |
Equated Scores | 3 |
Test Construction | 3 |
Achievement Tests | 2 |
Comparative Analysis | 2 |
Computation | 2 |
Computer Assisted Testing | 2 |
Sample Size | 2 |
Source
Applied Measurement in Education | 7 |
Publication Type
Journal Articles | 7 |
Reports - Research | 6 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Secondary Education | 2 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Assessments and Surveys
Program for International… | 1 |
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
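
The differential rapid-guessing setup the abstract describes is easier to picture with a concrete flagging rule. The Python sketch below is not the authors' procedure; it assumes a hypothetical normative response-time threshold (a response counts as a rapid guess when its time falls below 10% of the item's mean response time) and made-up response times for two subgroups.

    # Minimal sketch (not the authors' method): flag rapid guesses with a
    # normative response-time threshold and compare flag rates across subgroups.
    import numpy as np

    def rapid_guess_flags(resp_times, threshold_frac=0.10):
        """Flag responses whose time falls below threshold_frac of the item's mean RT."""
        thresholds = threshold_frac * resp_times.mean(axis=0)  # one cutoff per item
        return resp_times < thresholds                          # persons x items booleans

    rng = np.random.default_rng(0)
    rt_group_a = rng.lognormal(mean=3.0, sigma=0.5, size=(500, 20))  # hypothetical RTs
    rt_group_b = rng.lognormal(mean=2.6, sigma=0.7, size=(500, 20))  # faster, guess-prone group

    for label, rt in (("A", rt_group_a), ("B", rt_group_b)):
        print(label, rapid_guess_flags(rt).mean())  # proportion of responses flagged
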
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard three-parameter logistic (3PL) model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
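
For readers unfamiliar with the parameterization named in the abstract, the three-parameter logistic item response function is P(theta) = c + (1 - c) / (1 + exp(-a(theta - b))), with discrimination a, difficulty b, and pseudo-chance (lower asymptote) c. The sketch below simply evaluates that function with made-up values; it is an illustration, not code from the study.

    # Illustrative only: the standard 3PL item response function.
    import numpy as np

    def p_3pl(theta, a, b, c):
        """Probability of a correct response under the 3PL model."""
        return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

    theta = np.linspace(-3, 3, 7)             # person ability values
    print(p_3pl(theta, a=1.2, b=0.0, c=0.2))  # discrimination 1.2, difficulty 0, pseudo-chance .2
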
Antal, Judit; Proctor, Thomas P.; Melican, Gerald J. – Applied Measurement in Education, 2014
In common-item equating, the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical specifications frequently call for equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…
Descriptors: Test Items, Equated Scores, Difficulty Level, Item Response Theory
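
The "miniature form" requirement can be read operationally: the anchor (common-item) set should mirror the total test in content coverage and in the mean and spread of item difficulty. The sketch below checks only the statistical half of that guideline, with hypothetical item difficulties.

    # Hypothetical check: does the anchor set match the full form's difficulty
    # distribution (equal mean and spread), per the mini-test guideline?
    import numpy as np

    total_b  = np.array([-1.8, -1.2, -0.7, -0.3, 0.0, 0.2, 0.6, 1.0, 1.5, 2.1])  # all items
    anchor_b = np.array([-1.7, -0.6, 0.1, 0.9, 2.0])                             # common items

    print("mean  :", total_b.mean(), anchor_b.mean())
    print("spread:", total_b.std(ddof=1), anchor_b.std(ddof=1))
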
Michaelides, Michalis P.; Haertel, Edward H. – Applied Measurement in Education, 2014
The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…
Descriptors: Equated Scores, Test Items, Sampling, Statistical Inference
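
One common way to quantify the variability the abstract refers to is a bootstrap over examinees: resample each group, recompute the equating function, and take the standard deviation of the replicates. The sketch below does this for a deliberately simplified chained mean equating in a common-item design, with simulated scores standing in for real data.

    # Sketch (simulated data, simplified method): bootstrap standard error of a
    # chained mean equating constant, where the only resampled quantity is the
    # examinee sample in each group (common items themselves are treated as fixed).
    import numpy as np

    rng = np.random.default_rng(1)
    n = 300
    x_tot, x_anc = rng.normal(30, 6, n), rng.normal(12, 3, n)  # group taking form X
    y_tot, y_anc = rng.normal(32, 6, n), rng.normal(13, 3, n)  # group taking form Y

    def chained_mean_shift(xt, xa, yt, ya):
        """Constant added to an X score to place it on the Y scale (mean equating via the anchor)."""
        return (xa.mean() - xt.mean()) + (yt.mean() - ya.mean())

    boot = []
    for _ in range(1000):
        i = rng.integers(0, n, n)  # resample examinees who took form X
        j = rng.integers(0, n, n)  # resample examinees who took form Y
        boot.append(chained_mean_shift(x_tot[i], x_anc[i], y_tot[j], y_anc[j]))

    print("Bootstrap SE of the equating shift:", np.std(boot, ddof=1))
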
Sass, D. A.; Schmitt, T. A.; Walker, C. M. – Applied Measurement in Education, 2008
Item response theory (IRT) procedures have been used extensively to study normal latent trait distributions and have been shown to perform well; however, less is known concerning the performance of IRT with non-normal latent trait distributions. This study investigated the degree of latent trait estimation error under normal and non-normal…
Descriptors: Difficulty Level, Item Response Theory, Test Items, Computation
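
The comparison the abstract describes can be set up as a small simulation: draw latent traits from a normal and from a skewed distribution, generate item responses from the model, and then examine parameter recovery. The sketch below covers only the data-generation step, under an assumed 2PL model with made-up item parameters.

    # Sketch (assumed 2PL model and parameters): generate responses under a normal
    # and a skewed (non-normal) latent trait distribution.
    import numpy as np

    rng = np.random.default_rng(2)
    a = rng.uniform(0.8, 2.0, 20)   # item discriminations
    b = rng.normal(0.0, 1.0, 20)    # item difficulties

    def simulate(theta):
        p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))   # persons x items probabilities
        return (rng.random(p.shape) < p).astype(int)

    theta_normal = rng.normal(0, 1, 1000)
    theta_skewed = rng.chisquare(3, 1000)
    theta_skewed = (theta_skewed - theta_skewed.mean()) / theta_skewed.std()  # rescaled, still skewed

    print(simulate(theta_normal).mean(), simulate(theta_skewed).mean())  # overall proportions correct
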

Green, Donald Ross; And Others – Applied Measurement in Education, 1989
Potential benefits of item response theory for test construction are evaluated in light of the experience and evidence accumulated over nine years of applying a three-parameter model in the development of major achievement batteries. Topics addressed include error of measurement, test equating, item bias, and item difficulty. (TJH)
Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Equated Scores

Bergstrom, Betty A.; And Others – Applied Measurement in Education, 1992
Effects of altering test difficulty on examinee ability measures and test length in a computer adaptive test were studied for 225 medical technology students across three test-difficulty conditions. Results suggest that, with an item pool of sufficient depth and breadth, acceptable targeting to test difficulty is possible. (SLD)
Descriptors: Ability, Adaptive Testing, Change, College Students
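
Altering test difficulty in an adaptive test amounts to changing the targeting rule: rather than selecting the unused item whose difficulty is closest to the provisional ability estimate, items are chosen at a fixed offset above or below it. The sketch below shows that selection step only, for a hypothetical Rasch-type item pool; it is not the procedure used in the study.

    # Sketch (hypothetical item pool): difficulty-targeted item selection in a
    # computer adaptive test, with an offset that makes the test easier or harder.
    import numpy as np

    rng = np.random.default_rng(3)
    pool_b = rng.normal(0, 1, 200)   # Rasch difficulties of the item pool
    administered = set()

    def next_item(theta_hat, offset=0.0):
        """Pick the unused item whose difficulty is closest to theta_hat + offset."""
        target = theta_hat + offset
        candidates = [i for i in range(len(pool_b)) if i not in administered]
        best = min(candidates, key=lambda i: abs(pool_b[i] - target))
        administered.add(best)
        return best

    print(pool_b[next_item(theta_hat=0.5, offset=-1.0)])  # "easier" condition: target below current ability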