ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	3

Descriptor

Evaluation Methods	4
Reliability	4
Validity	3
Measurement	2
Achievement Tests	1
Artificial Intelligence	1
Computer Security	1
Computer Software	1
Data Analysis	1
Educational Change	1
Educational Research	1
Educational Theories	1
Efficiency	1
Ethics	1
Evidence	1
Faculty Development	1
Foreign Countries	1
Grade 4	1
International Assessment	1
Learning	1
Models	1
Problem Solving	1
Reading Achievement	1
Reading Tests	1
Test Interpretation	1
More ▼

Source

Educational Measurement:…

Author

Alina A. von Davier	1
Deborah J. Harris	1
Ji, Xuejun Ryan	1
Jiangang Hao	1
Matthias von Davier	1
Nichols, Paul D.	1
Parkes, Jay	1
Smith, Philip L.	1
Susan Lottridge	1
Victoria Yaneva	1
Wu, Amery D.	1
More ▼

Publication Type

Journal Articles	4
Reports - Descriptive	2
Opinion Papers	1
Reports - Research	1

Education Level

Elementary Education	1
Grade 4	1
Intermediate Grades	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Progress in International…

What Works Clearinghouse Rating

Showing all 4 results Save | Export

Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI

Peer reviewed

Direct link

Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024

The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…

Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Direct link

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

Reliability as Argument

Peer reviewed

Direct link

Parkes, Jay – Educational Measurement: Issues and Practice, 2007

Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…

Descriptors: Validity, Reliability, Evaluation Methods, Measurement

Contextualizing the Interpretation of Reliability Data.

Peer reviewed

Nichols, Paul D.; Smith, Philip L. – Educational Measurement: Issues and Practice, 1998

This essay argues that reliability should be reconceptualized in a way that reflects the importance of the theoretical expectations of the test specialist and the learning and problem solving of the test takers. It is time to characterize clearly the substantive theoretical framework supporting reliability studies and the technical evaluation of…

Descriptors: Data Analysis, Educational Research, Educational Theories, Evaluation Methods