Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement
Peter F. Halpin – Society for Research on Educational Effectiveness, 2024
Background: Meta-analyses of educational interventions have consistently documented the importance of methodological factors related to the choice of outcome measures. In particular, when interventions are evaluated using measures developed by researchers involved with the intervention or its evaluation, the effect sizes tend to be larger than…
Descriptors: College Students, College Faculty, STEM Education, Item Response Theory
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment