Publication Date
In 2025 | 172 |
Since 2024 | 669 |
Since 2021 (last 5 years) | 2217 |
Since 2016 (last 10 years) | 4144 |
Since 2006 (last 20 years) | 6674 |
Descriptor
Test Construction | 16492 |
Test Validity | 5710 |
Test Reliability | 4241 |
Foreign Countries | 3558 |
Test Items | 2673 |
Higher Education | 1960 |
Evaluation Methods | 1850 |
Factor Analysis | 1849 |
Psychometrics | 1710 |
Elementary Secondary Education | 1699 |
Student Evaluation | 1572 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 643 |
Teachers | 450 |
Researchers | 436 |
Administrators | 124 |
Policymakers | 68 |
Students | 66 |
Counselors | 25 |
Parents | 24 |
Community | 10 |
Support Staff | 5 |
Media Staff | 3 |
More ▼ |
Location
Turkey | 575 |
Australia | 334 |
Canada | 251 |
China | 165 |
United States | 142 |
Indonesia | 135 |
United Kingdom | 128 |
Germany | 112 |
California | 107 |
Taiwan | 107 |
United Kingdom (England) | 105 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 2 |
Rik Iping; Thed N. van Leeuwen; Ed Noyons; Alex Burdorf; Irene M. J. Mathijssen; Johannes P. T. M. van Leeuwen; Adrian M. Cohen – Research Evaluation, 2025
This paper describes the development of a bibliometric strength, potential and risk analysis tool, and its applications for research strategy and evaluation. We focus specifically on the motivation, organizational strategic needs, the development and evaluation of the tool. Furthermore, we highlight the co-creation process of the tool and discuss…
Descriptors: Risk Assessment, Test Construction, Bibliometrics, Research Tools
Simen Hjellvik; Steven Mallam; Marte Fannelø Giskeødegård; Salman Nazir – Technology, Knowledge and Learning, 2024
Computer-based simulation is utilised across various educational fields, employing diverse technologies to facilitate practical understanding of content and the acquisition of skills that can help close the gap between theory and practice. The possibility of providing scenarios that resemble on-the-job tasks, enables instructors to both train and…
Descriptors: Computer Simulation, Competence, Evaluation Methods, Test Construction
Lily Tomlin; Andy Smidt; Elise Bogart – International Journal of Language & Communication Disorders, 2024
Background: Assessment tools that assess pragmatic skills in adults with a mild-severe traumatic brain injury (TBI) are hard to access, not person-centred and have a high risk of clinician bias. The Pragmatics Profile is an informant report tool that was originally designed to assess pragmatic skills in people with a developmental disability.…
Descriptors: Head Injuries, Brain, Adults, Test Construction
Yanyan Fu – Educational Measurement: Issues and Practice, 2024
The template-based automated item-generation (TAIG) approach that involves template creation, item generation, item selection, field-testing, and evaluation has more steps than the traditional item development method. Consequentially, there is more margin for error in this process, and any template errors can be cascaded to the generated items.…
Descriptors: Error Correction, Automation, Test Items, Test Construction
Steven Langsford; Zebo Xu; Zhenguang G. Cai – Reading and Writing: An Interdisciplinary Journal, 2025
In the digital age, handwriting literacy has declined to a worrying degree, especially in non-alphabetic writing systems. In particular, Chinese (and also Japanese) handwriters have suffered from character amnesia ([Chinese characters omitted]), where people cannot correctly produce a character though they can recognize it. Though character…
Descriptors: Test Construction, Handwriting, Memory, Adults
Montserrat Yepes-Baldó; Marina Romeo; Núria Codina; Gemma Pallarés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Given the significant gap in tailored assessment tools, this research seeks to adapt the Self-concept (Form 5-AF5) questionnaire for young students with intellectual disabilities, employing an inclusive approach. Method: Twenty-three disability experts initially assessed questionnaire suitability, leading to revisions for clarity.…
Descriptors: Test Construction, Self Concept, Questionnaires, Students with Disabilities
Séverin Lions; María Paz Blanco; Pablo Dartnell; Carlos Monsalve; Gabriel Ortega; Julie Lemarié – Applied Measurement in Education, 2024
Multiple-choice items are universally used in formal education. Since they should assess learning, not test-wiseness or guesswork, they must be constructed following the highest possible standards. Hundreds of item-writing guides have provided guidelines to help test developers adopt appropriate strategies to define the distribution and sequence…
Descriptors: Test Construction, Multiple Choice Tests, Guidelines, Test Items
Junjun Chen – Educational Management Administration & Leadership, 2025
Leading a school during the uncertainties of challenges, changes, and crises requires school principals to respond and react effectively, cohesively and proactively using resilience. Rather than using discrete contracts or dimensions to measure principal resilience, this paper tended to develop and validate a multidimensional instrument of…
Descriptors: Principals, Resilience (Psychology), Test Construction, Test Validity
Alaa Eldin A. Ayoub; Muneera R. Ghablan; Eid G. Abo Hamza; Ahmed M. Abdulla Alabbasi – European Journal of STEM Education, 2025
This study describes the development of the science, technology, engineering, and mathematics (STEM) Scale, intended to assess parental attitudes toward school programs designed to deliver STEM, and evaluates its psychometric properties. The study group included 400 parents of students (138 males and 262 females) enrolled in STEM programs…
Descriptors: STEM Education, Test Construction, Parent Attitudes, Psychometrics
Becker, Benjamin; Weirich, Sebastian; Goldhammer, Frank; Debeer, Dries – Journal of Educational Measurement, 2023
When designing or modifying a test, an important challenge is controlling its speededness. To achieve this, van der Linden (2011a, 2011b) proposed using a lognormal response time model, more specifically the two-parameter lognormal model, and automated test assembly (ATA) via mixed integer linear programming. However, this approach has a severe…
Descriptors: Test Construction, Automation, Models, Test Items
Miguel A. García-Pérez – Educational and Psychological Measurement, 2024
A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…
Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis
Barry B. Gelston – ProQuest LLC, 2024
The purpose of this study was originally to create an operational definition of the "appearance of competence" to design valid questions for educational professionals supporting twice-exceptional (2e) learners to create a testing instrument. Through the methodological process of grounded theory, a replacement research question emerged as…
Descriptors: Definitions, Competence, Academically Gifted, Models
Po-Chun Huang; Ying-Hong Chan; Ching-Yu Yang; Hung-Yuan Chen; Yao-Chung Fan – IEEE Transactions on Learning Technologies, 2024
Question generation (QG) task plays a crucial role in adaptive learning. While significant QG performance advancements are reported, the existing QG studies are still far from practical usage. One point that needs strengthening is to consider the generation of question group, which remains untouched. For forming a question group, intrafactors…
Descriptors: Automation, Test Items, Computer Assisted Testing, Test Construction
Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024
Multidimensional achievement tests are recently gaining more importance in educational and psychological measurements. For example, multidimensional diagnostic tests can help students to determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…
Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction
Leifeng Xiao; Kit-Tai Hau; Melissa Dan Wang – Educational Measurement: Issues and Practice, 2024
Short scales are time-efficient for participants and cost-effective in research. However, researchers often mistakenly expect short scales to have the same reliability as long ones without considering the effect of scale length. We argue that applying a universal benchmark for alpha is problematic as the impact of low-quality items is greater on…
Descriptors: Measurement, Benchmarking, Item Sampling, Sample Size