Publication Date
| In 2026 | 0 |
| Since 2025 | 2142 |
| Since 2022 (last 5 years) | 12652 |
| Since 2017 (last 10 years) | 33777 |
| Since 2007 (last 20 years) | 68268 |
Descriptor
| Foreign Countries | 30502 |
| Test Validity | 21718 |
| Scores | 18245 |
| Academic Achievement | 16904 |
| Test Construction | 16724 |
| Test Reliability | 15006 |
| Achievement Tests | 14836 |
| Standardized Tests | 14707 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13033 |
| Language Tests | 12545 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5033 |
| Teachers | 3390 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2813 |
| Australia | 2425 |
| Canada | 2269 |
| California | 1851 |
| United States | 1725 |
| Texas | 1613 |
| China | 1577 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1120 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Ercan Doganay; Cemal Aküzüm – International Journal of Contemporary Educational Research, 2025
The aim of this study is to develop a valid and reliable measurement tool to measuring teachers' leader-member exchange behaviors. The study group of the research consists of 396 teachers working in secondary schools in the central districts of Eskisehir, Odunpazari and Tepebasi, in the 2018-2019 academic year. The construct validity of the scale…
Descriptors: Teacher Behavior, Test Construction, Test Validity, Test Reliability
Hanna Palmér; Camilla Björklund – Early Years: An International Journal of Research and Development, 2025
There is a growing consensus in research that children's numerical competence starts to develop at a very early age. However, there are few tools for screening the development of early numerical competence and thereby making this development researchable. One obstacle in designing such tools is that verbal utterances cannot be used as the primary…
Descriptors: Toddlers, Preschool Education, Numeracy, Screening Tests
Semere Kiros Bitew; Amir Hadifar; Lucas Sterckx; Johannes Deleu; Chris Develder; Thomas Demeester – IEEE Transactions on Learning Technologies, 2024
Multiple-choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, owing to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Test Construction, Test Items
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Ebru Dogruöz; Hülya Kelecioglu – International Journal of Assessment Tools in Education, 2024
In this research, multistage adaptive tests (MST) were compared according to sample size, panel pattern and module length for top-down and bottom-up test assembly methods. Within the scope of the research, data from PISA 2015 were used and simulation studies were conducted according to the parameters estimated from these data. Analysis results for…
Descriptors: Adaptive Testing, Test Construction, Foreign Countries, Achievement Tests
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
María Vallejo-Valdivielso; Pilar de Castro-Manglano; Cristina Vidal-Adroher; Azucena Díez-Suárez; Cesar A. Soutullo – Journal of Attention Disorders, 2024
Objective: To develop a short version of the Spanish 18-item ADHD-Rating Scale IV.es (sADHD-RS-IV.es) to be used as a potential screening tool in pediatric population. Methods: We recruited 652 subjects, ages 6 to 18 (mean ± SD = 11.14 ± 3.27): 518 patients with ADHD (per DSM-IV criteria); and 134 healthy controls. We performed a stepwise logistic…
Descriptors: Rating Scales, Attention Deficit Hyperactivity Disorder, Screening Tests, Children
Rachel Bowns; Jordan E. Loeffelman; Douglas Steinley; Kenneth J. Sher – Journal of American College Health, 2024
Objective: To develop a shortened form of the Young Adult Alcohol Problems Screening Test[superscript 1] (YAAPST; original length = 27 items) using a novel combinatorial approach. Participants: 489 college freshmen, half of whom were above average risk for alcohol use disorder based upon family history, attending a large, Midwestern University…
Descriptors: Test Construction, Screening Tests, Young Adults, Drinking
Melissa Whatley; Dominique Foster; Stephen Paul – Journal of Studies in International Education, 2024
The purpose of this study was to develop a measurement instrument that scholars and practitioners in international education can use as a means of exploring whether and how individuals who come into contact with international education programs develop a greater sense of cultural humility. Specifically, the study described here outlines the four…
Descriptors: Foreign Students, Cultural Awareness, Consciousness Raising, Test Construction
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Emily A. Holt; Jessica Duke; Ryan Dunk; Krystal Hinerman – Environmental Education Research, 2024
Student understanding of climate change is an active and growing area of research, but little research has documented undergraduate students' knowledge about the biotic impacts of climate change. Here, we address this literature gap by presenting the Inventory of Biotic Climate Literacy (IBCL), a concept inventory developed to assess undergraduate…
Descriptors: Climate, Undergraduate Students, Knowledge Level, Test Construction
Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024
The purpose of this systematic review was to survey available measures of creativity for pk12 students for assessments characteristics and reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…
Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity
Fulya Merve Kos; Murat Bektas; Dijle Ayar – Psychology in the Schools, 2025
This study was conducted to develop a measurement tool to achieve epilepsy self-management in teachers and examine its Turkish psychometric properties. This descriptive, comparative, correlational, and methodological study was conducted between May and August 2022 with 346 teachers between the ages of 24 and 67 working in public schools selected…
Descriptors: Test Construction, Psychometrics, Self Management, Epilepsy
Bomna Ko; Phillip Ward; Han Joo Lee; Yaohui He; Kelsey Higginson; Insook Kim – International Journal of Kinesiology in Higher Education, 2025
Developing valid and reliable instruments to assess common content knowledge (CCK) is a prerequisite for determining and improving the content knowledge of preservice teachers (PSTs) and teachers. We report on the development and psychometric analysis of an instrument for assessing PSTs' gymnastics CCK for secondary physical education teaching…
Descriptors: Athletics, Foreign Countries, Preservice Teachers, Test Validity

Peer reviewed
Direct link
