Publication Date
In 2025 | 41 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 515 |
Since 2016 (last 10 years) | 1133 |
Since 2006 (last 20 years) | 1667 |
Descriptor
Foreign Countries | 2122 |
Test Items | 2122 |
Test Construction | 533 |
Difficulty Level | 396 |
Item Response Theory | 393 |
Test Validity | 382 |
Item Analysis | 380 |
Achievement Tests | 362 |
Test Reliability | 332 |
Language Tests | 325 |
Multiple Choice Tests | 323 |
More ▼ |
Source
Author
Baghaei, Purya | 11 |
Bulut, Okan | 10 |
van der Linden, Wim J. | 9 |
Goldhammer, Frank | 8 |
Meijer, Rob R. | 8 |
Ercikan, Kadriye | 7 |
Janssen, Rianne | 7 |
Kelderman, Henk | 7 |
Robitzsch, Alexander | 7 |
Wang, Wen-Chung | 7 |
Bramley, Tom | 6 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 64 |
Students | 59 |
Teachers | 58 |
Researchers | 22 |
Policymakers | 6 |
Administrators | 4 |
Community | 1 |
Location
Canada | 220 |
Turkey | 219 |
Australia | 148 |
Germany | 112 |
China | 81 |
Taiwan | 75 |
United States | 74 |
Indonesia | 73 |
United Kingdom | 70 |
Netherlands | 63 |
Japan | 62 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Race to the Top | 1 |
United Nations Convention on… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kuan-Yu Jin; Yi-Jhen Wu; Ming Ming Chiu – Measurement: Interdisciplinary Research and Perspectives, 2025
Many education tests and psychological surveys elicit respondent views of similar constructs across scenarios (e.g., story followed by multiple choice questions) by repeating common statements across scales (one-statement-multiple-scale, OSMS). However, a respondent's earlier responses to the common statement can affect later responses to it…
Descriptors: Administrator Surveys, Teacher Surveys, Responses, Test Items
Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025
In this study, the extent to wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that semantically reversed, explains %21.5 more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…
Descriptors: Test Items, Factor Structure, Test Reliability, Semantics
Christoph Ableitinger; Christian Dorner – International Journal of Mathematical Education in Science and Technology, 2025
The number of complaints university lecturers make about a lack of knowledge, especially first-year students' procedural knowledge, has increased recently. Due to missing adequate empirical evidence, a survey of procedural knowledge among students of Austrian high schools in their final year was conducted. For this purpose, test items for…
Descriptors: Knowledge Level, Cognitive Processes, High School Seniors, Foreign Countries
Chan Zhang; Shuaiying Cao; Minglei Wang; Jiangyan Wang; Lirui He – Field Methods, 2025
Previous research on grid questions has mostly focused on their comparability with the item-by-item method and the use of shading to help respondents navigate through a grid. This study extends prior work by examining whether lexical similarity among grid items affects how respondents answer the questions in an experiment where we manipulated…
Descriptors: Foreign Countries, Surveys, Test Construction, Design
Patrik Havan; Michal Kohút; Peter Halama – International Journal of Testing, 2025
Acquiescence is the tendency of participants to shift their responses to agreement. Lechner et al. (2019) introduced the following mechanisms of acquiescence: social deference and cognitive processing. We added their interaction into a theoretical framework. The sample consists of 557 participants. We found significant medium strong relationship…
Descriptors: Cognitive Processes, Attention, Difficulty Level, Reflection
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses the information from the number of correct answers and wrong answers, which becomes the basis of calculating the expected values of response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
Haokun Liu – International Journal of Multilingualism, 2025
Globally, countries or regions across from east to west like Hong Kong, Macao, Taiwan, Singapore, the United Kingdom, and the United States have incorporated language item questions in their censuses. The assessment of such design advantages and disadvantages is crucial for academic investigation. Despite ongoing discussions, there is a noticeable…
Descriptors: Language Usage, Demography, Surveys, Questionnaires
Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025
This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…
Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics
Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
Kofi Nkonkonya Mpuangnan – Review of Education, 2024
Assessment practices play a crucial role in fostering student learning and guiding instructional decision-making. The ability to construct effective test items is of utmost importance in evaluating student learning and shaping instructional strategies. This study aims to investigate the skills of Ghanaian basic schoolteachers in test item…
Descriptors: Test Items, Test Construction, Student Evaluation, Foreign Countries
Xu, Yufeng; Liu, Huinan; Chen, Bo; Huang, Sihui; Zhong, Chongyu – Chemistry Education Research and Practice, 2023
Scientific methods have received widespread attention in recent years. Based on the analytical framework derived from Brandon's matrix consisting of four categories of scientific methods, this paper aims to conduct a content analysis to examine how the diversity of scientific methods is represented in college entrance chemistry examination papers…
Descriptors: College Entrance Examinations, Chemistry, Scientific Methodology, Test Items
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Ayfer Sayin; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
Developments in the field of education have significantly affected test development processes, and computer-based test applications have been started in many institutions. In our country, research on the application of measurement and evaluation tools in the computer environment for use with distance education is gaining momentum. A large pool of…
Descriptors: Turkish, Literature, Test Items, Item Banks
Lars Andersson Hult; Anders Persson – Journal of Social Science Education, 2025
Purpose: This article's purpose is to examine the manifestations of the evolving modern society and what we now identify as civics or other contemporary social issues in the final examination questions from 1914 to 1937 at four teacher education institutions in Uppsala, Falun, Lund, and Landskrona. Design/methodology/approach: The method can be…
Descriptors: Civics, Tests, Preservice Teacher Education, Test Items
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)