Showing 1 to 15 of 41 results
Peer reviewed
Haeju Lee; Kyung Yong Kim – Journal of Educational Measurement, 2025
When no prior information about differential item functioning (DIF) exists for the items in a test, either a rank-based or an iterative purification procedure may be preferred. Rank-based purification selects anchor items based on a preliminary DIF test. For the preliminary DIF test, likelihood ratio test (LRT) based approaches (e.g.,…
Descriptors: Test Items, Equated Scores, Test Bias, Accuracy
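The rank-based purification idea above can be sketched in a few lines. This is a hedged illustration, not the authors' procedure: it uses the absolute group difference in proportion-correct as a stand-in preliminary DIF statistic (a real analysis would use an LRT- or Mantel-Haenszel-based test), ranks items by it, and keeps the least-DIF items as anchors.

```python
# Illustrative rank-based anchor selection for DIF purification.
# Stand-in DIF statistic: absolute difference in proportion-correct
# between the reference and focal groups (not an LRT).

def rank_based_anchors(responses_ref, responses_foc, n_anchors):
    """responses_*: list of lists, examinees x items, 0/1 scores."""
    n_items = len(responses_ref[0])

    def pcorrect(resp, j):
        return sum(r[j] for r in resp) / len(resp)

    # Preliminary DIF statistic per item
    dif = [abs(pcorrect(responses_ref, j) - pcorrect(responses_foc, j))
           for j in range(n_items)]
    # Rank items by DIF and keep the n_anchors least-DIF items as anchors
    ranked = sorted(range(n_items), key=lambda j: dif[j])
    return sorted(ranked[:n_anchors])
```

Items flagged by the preliminary test are excluded from the anchor set before the main DIF analysis is run against the anchors.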
Peer reviewed
Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025
This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…
Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items
Peer reviewed
Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023
When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…
Descriptors: Test Construction, Automation, Models, Test Items
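The two-parameter lognormal model referenced above specifies ln T_ij ~ Normal(beta_i - tau_j, 1/alpha_i^2), where beta_i is the item's time intensity, alpha_i its time discrimination, and tau_j the examinee's speed. A hedged sketch of the log-density and the expected response time (the quantity an ATA speededness constraint would bound):

```python
import math

# van der Linden's two-parameter lognormal response-time model:
# ln T ~ Normal(beta - tau, 1/alpha^2).

def lnrt_logpdf(t, alpha, beta, tau):
    # Log-density of observing response time t on this item
    z = alpha * (math.log(t) - (beta - tau))
    return (math.log(alpha) - math.log(t)
            - 0.5 * math.log(2 * math.pi) - 0.5 * z * z)

def lnrt_expected_time(alpha, beta, tau):
    # Mean of a lognormal with mu = beta - tau, sigma = 1/alpha
    return math.exp(beta - tau + 1.0 / (2 * alpha * alpha))
```

In the ATA setting, sums of `lnrt_expected_time` over candidate items become linear constraints in the mixed integer program.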
Peer reviewed
Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024
Multidimensional achievement tests have recently been gaining importance in educational and psychological measurement. For example, multidimensional diagnostic tests can help students determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…
Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction
Peer reviewed
Kasli, Murat; Zopluoglu, Cengiz; Toton, Sarah L. – Journal of Educational Measurement, 2023
Response times (RTs) have recently attracted a significant amount of attention in the literature as they may provide meaningful information about item preknowledge. In this study, a new model, the Deterministic Gated Lognormal Response Time (DG-LNRT) model, is proposed to identify examinees with item preknowledge using RTs. The proposed model was…
Descriptors: Reaction Time, Test Items, Models, Familiarity
Peer reviewed
He, Yinhong; Qi, Yuanyuan – Journal of Educational Measurement, 2023
In multidimensional computerized adaptive testing (MCAT), item selection strategies are generally constructed based on responses, and they do not consider the response times required by items. This study constructed two new criteria (referred to as DT-inc and DT) for MCAT item selection by utilizing information from response times. The new designs…
Descriptors: Reaction Time, Adaptive Testing, Computer Assisted Testing, Test Items
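One common way to fold response times into MCAT/CAT item selection, shown here only as a hedged illustration (not the paper's DT or DT-inc criteria), is to pick the item maximizing Fisher information per expected second:

```python
import math

# Time-aware item selection sketch: maximize 2PL Fisher information
# at the current theta divided by the item's expected response time.

def info_2pl(theta, a, b):
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_item(theta, items):
    """items: list of dicts with keys 'a', 'b', 'exp_time' (seconds)."""
    return max(range(len(items)),
               key=lambda k: info_2pl(theta, items[k]["a"], items[k]["b"])
                             / items[k]["exp_time"])
```

Under this rule, of two equally informative items the faster one is administered, shortening the test at little cost in precision.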
Peer reviewed
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024
This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…
Descriptors: Questionnaires, Test Items, Item Response Theory, Models
Peer reviewed
Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
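A generalized objective function for item selection typically scores each candidate as a weighted combination of criteria. The sketch below is an assumption-laden illustration, not the authors' algorithm: it combines 2PL Fisher information with an exposure penalty, with both weights chosen arbitrarily.

```python
import math

# Weighted-objective CAT item selection sketch:
# score = w_info * information(theta) - w_expo * exposure_rate.

def p_2pl(theta, a, b):
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def objective(theta, item, w_info=1.0, w_expo=0.5):
    p = p_2pl(theta, item["a"], item["b"])
    info = item["a"] ** 2 * p * (1 - p)
    return w_info * info - w_expo * item["exposure_rate"]

def select(theta, pool, administered):
    # Choose the best-scoring item not yet given to this examinee
    cand = [k for k in range(len(pool)) if k not in administered]
    return max(cand, key=lambda k: objective(theta, pool[k]))
```

Additional terms (content balance, time constraints) can be added to `objective` without changing the selection loop, which is the appeal of the generalized formulation.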
Peer reviewed
Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023
In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l_z and l*_z person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…
Descriptors: Test Items, Scores, Goodness of Fit, Statistics
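The baseline l_z statistic that the paper extends is the standardized log-likelihood of a response pattern given model probabilities at the examinee's theta (Drasgow, Levine, and Williams). A minimal sketch, with probabilities assumed to come from a fitted IRT model:

```python
import math

# l_z person-fit statistic: standardize the response-pattern
# log-likelihood by its model-implied mean and variance.
# Large negative values flag aberrant (poorly fitting) patterns.

def lz(scores, probs):
    l0 = sum(u * math.log(p) + (1 - u) * math.log(1 - p)
             for u, p in zip(scores, probs))
    mean = sum(p * math.log(p) + (1 - p) * math.log(1 - p) for p in probs)
    var = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2 for p in probs)
    return (l0 - mean) / math.sqrt(var)
```

An examinee who misses easy items while answering hard ones correctly gets a much lower l_z than one whose pattern follows the model.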
Peer reviewed
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies induced among items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Peer reviewed
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
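A single maximum exposure rate, as used by the restricted method mentioned above, amounts to an eligibility screen: an item can be administered only while its observed exposure rate stays below r_max. A hedged sketch of that screen (the real restricted method updates eligibility within the selection algorithm):

```python
# Exposure-control eligibility screen with one maximum exposure rate.
# An item is eligible while administrations / tests_delivered < r_max.

def eligible_items(admin_counts, tests_delivered, r_max):
    if tests_delivered == 0:
        return list(range(len(admin_counts)))
    return [j for j, c in enumerate(admin_counts)
            if c / tests_delivered < r_max]
```

Item selection then runs only over the eligible subset, trading a little measurement precision for a flatter exposure distribution.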
Peer reviewed
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose serious threats to psychological measurement. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and the total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Peer reviewed
Gregory M. Hurtz; Regi Mucino – Journal of Educational Measurement, 2024
The Lognormal Response Time (LNRT) model measures the speed of test-takers relative to the normative time demands of items on a test. The resulting speed parameters and model residuals are often analyzed for evidence of anomalous test-taking behavior associated with fast and poorly fitting response time patterns. Extending this model, we…
Descriptors: Student Reaction, Reaction Time, Response Style (Tests), Test Items
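Residual-based screening under the LNRT model works on standardized residuals z = alpha * (ln t - (beta - tau)): large negative values mean the response was far faster than the model predicts. The sketch below is illustrative (the -2 cut is an arbitrary assumption, not the authors' rule):

```python
import math

# Flag anomalously fast responses via standardized lognormal residuals.

def flag_fast(times, alphas, betas, tau, z_cut=-2.0):
    """times, alphas, betas: per-item values; tau: examinee speed."""
    flags = []
    for t, a, b in zip(times, alphas, betas):
        z = a * (math.log(t) - (b - tau))
        flags.append(z < z_cut)  # True = suspiciously fast
    return flags
```

Patterns of fast, poorly fitting responses across many items are what the paper treats as evidence of anomalous test-taking behavior.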
Peer reviewed
Pan, Yiqin; Wollack, James A. – Journal of Educational Measurement, 2021
As technology has improved, item preknowledge has become a common concern in the test security field. The present study proposes an unsupervised-learning-based approach to detecting compromised items. The approach consists of three steps: (1) classify responses of each examinee as either…
Descriptors: Test Items, Cheating, Artificial Intelligence, Identification
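The flavor of such an unsupervised screen can be sketched as follows. This is a loose illustration of the aggregation step only, not the paper's method: given a matrix marking which responses were classified as suspicious in step (1), flag items whose suspicious-response rate is an outlier relative to the pool.

```python
# Aggregate per-response suspicion flags into per-item flags:
# an item is flagged when its suspicious-response rate is more than
# z_cut pooled standard deviations above the pool mean.
from statistics import mean, pstdev

def flag_items(susp_matrix, z_cut=2.0):
    """susp_matrix: examinees x items, True where response was suspicious."""
    n = len(susp_matrix)
    rates = [sum(row[j] for row in susp_matrix) / n
             for j in range(len(susp_matrix[0]))]
    mu, sd = mean(rates), pstdev(rates)
    return [j for j, r in enumerate(rates) if sd > 0 and (r - mu) / sd > z_cut]
```

No labeled set of known-compromised items is needed, which is what makes the approach unsupervised.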
Peer reviewed
Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct θ, such as cognitive ability in a content domain. Estimates of θ, also called IRT scores or θ̂, can be computed using estimators based on the likelihood function, such as maximum likelihood…
Descriptors: Scores, Item Response Theory, Test Items, Test Format
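Likelihood-based IRT scoring as described above can be illustrated with a minimal 2PL maximum-likelihood scorer. This sketch uses a grid search for clarity; a production scorer would use Newton-Raphson and handle all-correct/all-incorrect patterns, whose ML estimates are unbounded.

```python
import math

# Maximum-likelihood theta estimation under the 2PL model.

def loglik(theta, scores, items):
    """scores: 0/1 responses; items: list of (a, b) parameter pairs."""
    ll = 0.0
    for u, (a, b) in zip(scores, items):
        p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
        ll += u * math.log(p) + (1 - u) * math.log(1 - p)
    return ll

def ml_theta(scores, items, lo=-4.0, hi=4.0, step=0.01):
    # Grid search for the theta maximizing the log-likelihood
    grid = [lo + k * step for k in range(int((hi - lo) / step) + 1)]
    return max(grid, key=lambda t: loglik(t, scores, items))
```

Answering more items correctly moves the estimate upward, as expected of any sensible θ̂.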