Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 3 |
Descriptor
Evaluation Methods | 4 |
Reliability | 4 |
Validity | 3 |
Measurement | 2 |
Achievement Tests | 1 |
Artificial Intelligence | 1 |
Computer Security | 1 |
Computer Software | 1 |
Data Analysis | 1 |
Educational Change | 1 |
Educational Research | 1 |
More ▼ |
Source
Educational Measurement:… | 4 |
Author
Alina A. von Davier | 1 |
Deborah J. Harris | 1 |
Ji, Xuejun Ryan | 1 |
Jiangang Hao | 1 |
Matthias von Davier | 1 |
Nichols, Paul D. | 1 |
Parkes, Jay | 1 |
Smith, Philip L. | 1 |
Susan Lottridge | 1 |
Victoria Yaneva | 1 |
Wu, Amery D. | 1 |
More ▼ |
Publication Type
Journal Articles | 4 |
Reports - Descriptive | 2 |
Opinion Papers | 1 |
Reports - Research | 1 |
Education Level
Elementary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Progress in International… | 1 |
What Works Clearinghouse Rating
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Parkes, Jay – Educational Measurement: Issues and Practice, 2007
Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…
Descriptors: Validity, Reliability, Evaluation Methods, Measurement

Nichols, Paul D.; Smith, Philip L. – Educational Measurement: Issues and Practice, 1998
This essay argues that reliability should be reconceptualized in a way that reflects the importance of the theoretical expectations of the test specialist and the learning and problem solving of the test takers. It is time to characterize clearly the substantive theoretical framework supporting reliability studies and the technical evaluation of…
Descriptors: Data Analysis, Educational Research, Educational Theories, Evaluation Methods