ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Test Items	7
Test Length	7
Item Response Theory	4
Simulation	4
Sample Size	3
Adaptive Testing	2
Computer Assisted Testing	2
Error of Measurement	2
Scores	2
Test Bias	2
Ability	1
Computation	1
Computer Simulation	1
Correlation	1
Decision Making	1
Differences	1
Efficiency	1
Equations (Mathematics)	1
Evaluation Methods	1
Evaluation Research	1
Hypothesis Testing	1
Item Analysis	1
Mastery Tests	1
Measurement	1
Models	1
More ▼

Source

International Journal of…

Publication Type

Journal Articles	7
Reports - Research	7

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

The Recovery of Correlation between Latent Abilities Using Compensatory and Noncompensatory Multidimensional IRT Models

Peer reviewed

Direct link

Fu, Yanyan; Strachan, Tyler; Ip, Edward H.; Willse, John T.; Chen, Shyh-Huei; Ackerman, Terry – International Journal of Testing, 2020

This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and…

Descriptors: Item Response Theory, Models, Test Items, Simulation

Dynamic Multistage Testing: A Highly Efficient and Regulated Adaptive Testing Method

Peer reviewed

Direct link

Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019

This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Test Length and Decision Quality in Personnel Selection: When Is Short Too Short?

Peer reviewed

Direct link

Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012

Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…

Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement

Computerized Adaptive Testing with the Zinnes and Griggs Pairwise Preference Ideal Point Model

Peer reviewed

Direct link

Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011

This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…

Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis

A Range-Null Hypothesis Approach for Testing DIF under the Rasch Model

Peer reviewed

Direct link

Wells, Craig S.; Cohen, Allan S.; Patton, Jeffrey – International Journal of Testing, 2009

A primary concern with testing differential item functioning (DIF) using a traditional point-null hypothesis is that a statistically significant result does not imply that the magnitude of DIF is of practical interest. Similarly, for a given sample size, a non-significant result does not allow the researcher to conclude the item is free of DIF. To…

Descriptors: Test Bias, Test Items, Statistical Analysis, Hypothesis Testing

Sequential Computerized Mastery Tests--Three Simulation Studies

Peer reviewed

Direct link

Wiberg, Marie – International Journal of Testing, 2006

A simulation study of a sequential computerized mastery test is carried out with items modeled with the 3 parameter logistic item response theory model. The examinees' responses are either identically distributed, not identically distributed, or not identically distributed together with estimation errors in the item characteristics. The…

Descriptors: Test Length, Computer Simulation, Mastery Tests, Item Response Theory

Ackerman, Terry	1
Chen, Shyh-Huei	1
Chernyshenko, Oleksandr S.	1
Cohen, Allan S.	1
Emons, Wilco H. M.	1
Fu, Yanyan	1
Ip, Edward H.	1
Kruyen, Peter M.	1
Lee, Yi-Hsuan	1
Luo, Xiao	1
Patton, Jeffrey	1
Sijtsma, Klaas	1
Stark, Stephen	1
Strachan, Tyler	1
Wang, Xinrui	1
Wells, Craig S.	1
Wiberg, Marie	1
Willse, John T.	1
Zhang, Jinming	1
More ▼