Federica Ferretti; Alessandro Gambini; Camilla Spagnolo – European Journal of Science and Mathematics Education, 2024
As highlighted in the literature, one of the main difficulties in mathematics is the management of different semiotic representations. This difficulty occurs in verticals throughout schooling and is often an obstacle to the proper learning process of mathematics. The present study aims to investigate the different facets of these difficulties with…
Descriptors: Semiotics, Mathematics Education, Mathematics Tests, Test Items
Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024
The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…
Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software
Okim Kang; Xun Yan; Maria Kostromitina; Ron Thomson; Talia Isaacs – Language Testing, 2024
This study aimed to answer an ongoing validity question related to the use of nonstandard English accents in international tests of English proficiency and associated issues of test fairness. More specifically, we examined (1) the extent to which different or shared English accents had an impact on listeners' performances on the Duolingo listening…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Nonstandard Dialects
Janhavi Mallaiah; Olajide Williams; John P. Allegrante – Health Education & Behavior, 2024
Community health workers (CHWs) are increasingly being required to perform complex health care activities, especially in community cardiovascular disease and stroke prevention. However, currently, there are no psychometrically validated instruments for assessing CHW competencies in these roles. This article describes the development and validation…
Descriptors: Community Health Services, Health Personnel, Test Construction, Test Validity
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Kelley Nelson-Strouts – ProQuest LLC, 2024
The current study used principles of high-quality measurement design including domain specification, item generation, expert review, and piloting to develop two versions of a morphological awareness measure to be administered to 63 primarily English-speaking second grade students: the "Static Morphological Awareness Assessment (StMA)"…
Descriptors: Morphology (Languages), Metalinguistics, Grade 2, Elementary School Students
Niclas Larson – Journal of the International Society for Teacher Education, 2024
This paper reports on a revision of the assessment model from the first mathematics course for pre-service teachers (PSTs) aiming for grades 5-10, at a Norwegian university. The weight of the final written exam was reduced and a new, mastery-based testing model, with weekly small tests, was introduced. Results from this study show that the PSTs…
Descriptors: High Stakes Tests, Test Length, Mathematics Tests, Preservice Teachers
Ahmet Berk Ustun; Fatma Gizem Karaoglan-Yilmaz; Ramazan Yilmaz; Mehmet Ceylan; Orhan Uzun – Education and Information Technologies, 2024
The primary aim of the study is to develop an augmented reality (AR) acceptance scale within the framework of the unified theory of acceptance and use of technology (UTAUT) model to measure individuals' acceptance and use of AR technology. The study was performed with a total of 546 university students with three participant groups in the…
Descriptors: Test Construction, Computer Simulation, College Students, Test Reliability
Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024
Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…
Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Madeline C. Schmitt – Grantee Submission, 2024
Several researchers have called for schools to interpret universal screening results using posterior probabilities. Following this recommendation could require schools to move away from direct-route, single-measure screening unless base rates of risk fall within a narrow range. In this descriptive study, we investigated two questions surrounding…
Descriptors: Reading Skills, Mathematics Skills, Screening Tests, Test Results
Under the Weather? The Effects of Temperature on Student Test Performance. EdWorkingPaper No. 24-910
Deven Carlson; Adam Shepardson – Annenberg Institute for School Reform at Brown University, 2024
As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a…
Descriptors: Weather, Climate, Heat, Academic Achievement
Rajeshwari Panigrahi; Khaliq Lubza Nihar; Neha Singh – Higher Learning Research Communications, 2024
Objective: This study aimed to develop and test a scale for measuring the quality of blended learning models in higher education. Methods: This research adopts a sequential mixed-method approach to construct a new measurement scale. The first phase consisted of the inductive approach to identify the items, followed by exploratory factor analysis.…
Descriptors: Blended Learning, Educational Quality, Higher Education, Test Construction
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Emine Aytekin; Hazal Sedef Erik; M. Betül Yilmaz – Education and Information Technologies, 2025
In the context of lifelong learning, it is crucial for adult educators and trainers to understand and learn adult learning theories in adult education settings. Furthermore, flexibility and adaptability to the ever-changing learning environment in a technological context, as well as strategies for making real-world connections with different…
Descriptors: Adult Education, Adult Educators, Technology Uses in Education, Teacher Education

