Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Daniel Murphy; Sarah Quesen; Matthew Brunetti; Quintin Love – Educational Measurement: Issues and Practice, 2024
Categorical growth models describe examinee growth in terms of performance-level category transitions, which implies that some percentage of examinees will be misclassified. This paper introduces a new procedure for estimating the classification accuracy of categorical growth models, based on Rudner's classification accuracy index for item…
Descriptors: Classification, Growth Models, Accuracy, Performance Based Assessment
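As a rough illustration of the kind of index the abstract above builds on, the sketch below computes a Rudner-style expected classification accuracy for a single test score. It is not the paper's new procedure for categorical growth models. It assumes the usual formulation, in which each examinee's true score is treated as normally distributed around the estimate with standard deviation equal to the standard error. The function names, cut scores, and data are hypothetical.

```python
import math


def normal_cdf(x: float, mu: float, sigma: float) -> float:
    """Cumulative probability of a Normal(mu, sigma) distribution at x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def rudner_accuracy(estimates, std_errors, cut_scores):
    """Average probability that each true score lies in the observed category.

    Assumption (not from the source): true score ~ Normal(estimate, SE).
    """
    # Category boundaries: (-inf, c1), [c1, c2), ..., [ck, +inf)
    bounds = [float("-inf"), *sorted(cut_scores), float("inf")]
    total = 0.0
    for est, se in zip(estimates, std_errors):
        for lower, upper in zip(bounds, bounds[1:]):
            if lower <= est < upper:
                # Probability mass of the true-score distribution that
                # falls inside the category the estimate was assigned to.
                total += normal_cdf(upper, est, se) - normal_cdf(lower, est, se)
                break
    return total / len(estimates)


# Hypothetical example: three examinees, two cut scores (three categories).
accuracy = rudner_accuracy([-1.5, 0.0, 1.2], [0.4, 0.3, 0.5], [-1.0, 1.0])
print(round(accuracy, 3))
```

Estimates that sit close to a cut score contribute probabilities near 0.5, which is why misclassification concentrates around category boundaries.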
Terry A. Ackerman; Deborah L. Bandalos; Derek C. Briggs; Howard T. Everson; Andrew D. Ho; Susan M. Lottridge; Matthew J. Madison; Sandip Sinharay; Michael C. Rodriguez; Michael Russell; Alina A. Davier; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2024
This article presents the consensus of a National Council on Measurement in Education Presidential Task Force on Foundational Competencies in Educational Measurement. Foundational competencies are those that support future development of additional professional and disciplinary competencies. The authors develop a framework for foundational…
Descriptors: Educational Assessment, Competence, Skill Development, Communication Skills
Katherine E. Castellano; Daniel F. McCaffrey; Joseph A. Martineau – Educational Measurement: Issues and Practice, 2025
Growth-to-standard models evaluate student growth against the growth needed to reach a future standard or target of interest, such as proficiency. A common growth-to-standard model involves comparing the popular Student Growth Percentile (SGP) to Adequate Growth Percentiles (AGPs). AGPs follow from an involved process based on fitting a series of…
Descriptors: Student Evaluation, Growth Models, Student Educational Objectives, Educational Indicators
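The SGP-to-AGP comparison described in the abstract above can be sketched in a few lines. This is only an illustration of the comparison step, assuming the common convention that growth counts as adequate when a student's SGP meets or exceeds that student's AGP. It does not reproduce how AGPs are estimated, and the student IDs and values are made up.

```python
def made_adequate_growth(sgp: int, agp: int) -> bool:
    """True when observed growth (SGP) meets or exceeds the growth needed (AGP)."""
    return sgp >= agp


def percent_on_track(records: dict[str, tuple[int, int]]) -> float:
    """Share of students (0-100) whose SGP is at least their AGP."""
    flags = [made_adequate_growth(sgp, agp) for sgp, agp in records.values()]
    return 100.0 * sum(flags) / len(flags)


# Hypothetical data: student id -> (SGP, AGP), both on a 1-99 percentile scale.
students = {
    "s01": (62, 55),  # grew more than needed
    "s02": (40, 71),  # grew less than needed
    "s03": (50, 50),  # grew exactly as much as needed
    "s04": (35, 20),  # low growth, but still enough to reach the target
}

print(percent_on_track(students))
```

The last student shows why the two percentiles are reported together: a below-median SGP can still be adequate when the target is already close.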
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025
Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…
Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement