Publication Date
In 2025 | 8 |
Descriptor
Source
Communique | 2 |
Educational Measurement:… | 1 |
International Journal of… | 1 |
Journal of Educational… | 1 |
Phi Delta Kappan | 1 |
Research Matters | 1 |
South African Journal of… | 1 |
Author
K. Kawena Begay | 2 |
Miranda Kucera | 2 |
Bryan R. Drost | 1 |
Char Shryock | 1 |
Corin D. Mathews | 1 |
Guher Gorgun | 1 |
Ki Lynn Cole | 1 |
Kylie Gorney | 1 |
Mark D. Reckase | 1 |
Okan Bulut | 1 |
Santi Lestari | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 5 |
Reports - Descriptive | 3 |
Education Level
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025
In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Santi Lestari – Research Matters, 2025
The ability to draw visual representations such as diagrams and graphs is considered fundamental to science learning. Science exams therefore often include questions which require students to draw a visual representation, or to augment a partially provided one. The design features of such questions (e.g., layout of diagrams, amount of answer…
Descriptors: Science Education, Secondary Education, Visual Aids, Foreign Countries
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Bryan R. Drost; Char Shryock – Phi Delta Kappan, 2025
Creating assessment questions aligned to standards is a time-consuming task for teachers, but large language models such as ChatGPT can help. Bryan Drost & Char Shryock describe a three-step process for using ChatGPT to create assessments: 1) Ask ChatGPT to break standards into measurable targets. 2) Determine how much time to spend on each…
Descriptors: Artificial Intelligence, Computer Software, Technology Integration, Teaching Methods
Miranda Kucera; K. Kawena Begay – Communique, 2025
While the field advocates for a diversified and comprehensive professional role (National Association of School Psychologists, 2020), school psychologists have long spent most of their time in assessment-related activities (Farmer et al., 2021), averaging about eight cognitive evaluations monthly (Benson et al., 2020). Assessment practices have…
Descriptors: Equal Education, Student Evaluation, Evaluation Methods, Standardized Tests
Miranda Kucera; K. Kawena Begay – Communique, 2025
In Part 1 of this series, the authors briefly reviewed some challenges inherent in using standardized tools with students who are not well represented in norming data. To help readers clearly conceptualize the framework steps, the authors present two case studies that showcase how a nonstandardized approach to assessment can be individualized to…
Descriptors: Equal Education, Student Evaluation, Evaluation Methods, Standardized Tests
Corin D. Mathews – South African Journal of Childhood Education, 2025
Background: Base-ten thinking (BTT) -- children's ability to reason in tens and ones is a crucial measure of Foundation Phase learners' mathematical performance in South Africa. Aim: The study looks at the six learners using BTT to solve additive tasks through two different assessments. Setting: Six purposely selected Grade 3 learners in…
Descriptors: Evaluation Methods, Task Analysis, High Achievement, Low Achievement