ERIC - Search Results

Publication Date

In 2025	0
Since 2024	5

Descriptor

Comparative Testing	5
Item Response Theory	5
Test Construction	3
Guessing (Tests)	2
Test Bias	2
Test Reliability	2
Test Validity	2
Ability Identification	1
Adaptive Testing	1
Algorithms	1
Business English	1
Career Choice	1
Career Exploration	1
College Faculty	1
College Students	1
Computer Assisted Testing	1
Construct Validity	1
Difficulty Level	1
Error of Measurement	1
Foreign Countries	1
International Assessment	1
Item Banks	1
Markov Processes	1
Mathematical Models	1
Multiple Choice Tests	1
More ▼

Source

Educational Measurement:…	1
Journal of Educational and…	1
Practical Assessment,…	1
ProQuest LLC	1
Society for Research on…	1

Author

Agus Santoso	1
Gulzhaina K. Kassymova	1
Heri Retnawati	1
Ibnu Rafi	1
Jiayi Deng	1
Jimmy de la Torre	1
Jinran Wu	1
Luping Niu	1
Munaya Nikma Rosyada	1
Peter F. Halpin	1
Seung W. Choi	1
Timbul Pardede	1
Wim J. van der Linden	1
Xu Wenxin	1
Xuelan Qiu	1
You-Gan Wang	1
More ▼

Publication Type

Reports - Research	4
Journal Articles	3
Dissertations/Theses -…	1

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Location

Indonesia

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Linking Errors Introduced by Rapid Guessing Responses When Employing Multigroup Concurrent IRT Scaling

Direct link

Jiayi Deng – ProQuest LLC, 2024

Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…

Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement

Do Reported Treatment Effects Generalize to Other Measures of the Same Construct: A Specification Test

Peer reviewed

Direct link

Peter F. Halpin – Society for Research on Educational Effectiveness, 2024

Background: Meta-analyses of educational interventions have consistently documented the importance of methodological factors related to the choice of outcome measures. In particular, when interventions are evaluated using measures developed by researchers involved with the intervention or its evaluation, the effect sizes tend to be larger than…

Descriptors: College Students, College Faculty, STEM Education, Item Response Theory

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

From Investigating the Alignment of a Priori Item Characteristics Based on the CTT and Four-Parameter Logistic (4-PL) IRT Models to Further Exploring the Comparability of the Two Models

Peer reviewed
PDF on ERIC

Download full text

Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024

The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…

Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction

Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation

Peer reviewed

Direct link

Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024

Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…

Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment