Publication Date
In 2025 | 34 |
Since 2024 | 127 |
Since 2021 (last 5 years) | 347 |
Since 2016 (last 10 years) | 661 |
Since 2006 (last 20 years) | 1804 |
Descriptor
Evaluation Methods | 3945 |
Test Validity | 2067 |
Validity | 1463 |
Test Reliability | 987 |
Student Evaluation | 798 |
Foreign Countries | 628 |
Test Construction | 551 |
Reliability | 523 |
Higher Education | 450 |
Measurement Techniques | 417 |
Elementary Secondary Education | 414 |
More ▼ |
Source
Author
Fuchs, Lynn S. | 12 |
Baker, Eva L. | 11 |
Cronin, John | 11 |
Marsh, Herbert W. | 11 |
Amrein-Beardsley, Audrey | 9 |
Linn, Robert L. | 9 |
Sireci, Stephen G. | 9 |
Raykov, Tenko | 8 |
Deno, Stanley L. | 7 |
Epstein, Michael H. | 7 |
Matson, Johnny L. | 7 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 193 |
Practitioners | 121 |
Teachers | 45 |
Administrators | 31 |
Policymakers | 27 |
Students | 15 |
Counselors | 7 |
Media Staff | 4 |
Community | 3 |
Support Staff | 3 |
Parents | 2 |
More ▼ |
Location
Australia | 66 |
United Kingdom | 56 |
Canada | 47 |
California | 32 |
Netherlands | 30 |
United States | 30 |
United Kingdom (England) | 26 |
Germany | 23 |
Turkey | 22 |
Taiwan | 21 |
China | 20 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Tudor Cristea; Chris Snijders; Uwe Matzat; Ad Kleingeld – Education and Information Technologies, 2024
Self-regulated learning has seen a large increase in research interest due to its importance for online learning of higher education students. Several ways to measure self-regulated learning have been suggested. However, most measurements are either obtrusive, necessitating time and effort from students and potentially influencing the learning…
Descriptors: Learning Processes, Self Management, Evaluation Methods, Task Analysis
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Kylie Anglin – AERA Open, 2024
Given the rapid adoption of machine learning methods by education researchers, and the growing acknowledgment of their inherent risks, there is an urgent need for tailored methodological guidance on how to improve and evaluate the validity of inferences drawn from these methods. Drawing on an integrative literature review and extending a…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Fouché, Ilse – Applied Linguistics, 2023
This article, located in the discipline of academic literacy studies, draws upon the fields of critical realism, design research, and evaluation studies. It reports on the validation of a flexible evaluation design for assessing the impact of academic literacy interventions. The design was validated in two ways. Firstly, through a process of…
Descriptors: Foreign Countries, Intervention, Literacy Education, Feedback (Response)
Di Rezze, Briano; Gentles, Stephen James; Hidecker, Mary Jo Cooley; Zwaigenbaum, Lonnie; Rosenbaum, Peter; Duku, Eric; Georgiades, Stelios; Roncadin, Caroline; Fang, Hanna; Tajik-Parvinchi, Diana; Viveiros, Helena – Journal of Autism and Developmental Disorders, 2022
The Autism Classification System of Functioning: Social Communication (ACSF) describes social communication functioning levels. First developed for preschoolers with ASD, this study tests an expanded age range (2-to-18 years). The ACFS rates the child's typical and best (i.e., capacity) performance. Qualitative methods tested parent and clinician…
Descriptors: Content Validity, Reliability, Autism Spectrum Disorders, Classification
Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022
Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…
Descriptors: Validity, Performance, Standards, Multivariate Analysis
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Wan Fazwani Wan Mat; Lim Hooi Lian – Journal of Education and Learning (EduLearn), 2025
This bibliometric article examines the current state of publication in the field of classroom assessment, exploring the productivity and influence of countries, institutions, and authors. A search query of on the Scopus database using the term "classroom assessment" or "classroom-based assessment" or "assessment for…
Descriptors: Alternative Assessment, Student Evaluation, Bibliometrics, Formative Evaluation
Yingchen Wang – SAGE Open, 2024
Surveys are typical for student evaluation of teaching (SET). Survey research consistently confirms the negative impacts of careless responses on research validity, including low data quality and invalid research inferences. SET literature seldom addresses if careless responses are present and how to improve. To improve evaluation practices and…
Descriptors: Student Evaluation of Teacher Performance, Responses, Validity, Data Use
Scott H. Yamamoto – Journal of Psychoeducational Assessment, 2024
This was the first study in which a psychometrically validated STEM measure, the "Student STEM" (S-STEM), was studied for HSSWD. This study also represented the first time a psychometrically validated STEM measure, the "Student STEM" (S-STEM), was studied for HSSWD. Data were collected from 229 HSSWD in a western state and…
Descriptors: Psychometrics, STEM Education, Student Attitudes, High School Students
Karisse A. Callender; Abdulkadir Haktanir – British Journal of Guidance & Counselling, 2024
We developed the Counsellor Personal Wellness and Professional Wellbeing (CPW) assessment based on the integration of the Well-Being theory and the Indivisible Self Model of Wellness (IS-WEL). Our participants included 326 counsellors, counsellor educators, and counsellors-in-training in the United States. The participants primarily identified as…
Descriptors: Wellness, Counselors, Measures (Individuals), Counselor Educators
Soubhik Barari; Eric Newsom; Ji Eun Park; Susan M. Paddock – NORC at the University of Chicago, 2024
Prospective students and their families use college rankings to navigate their higher education options. Rising tuition and fees have made the college decision more fraught. Recently, the major college ranking providers have revised their methodologies to reflect costs and other considerations. These revisions raise important questions about the…
Descriptors: Construct Validity, Evaluation Methods, Educational Quality, Student Costs
Fu Chen; Ying Cui; Alina Lutsyk-King; Yizhu Gao; Xiaoxiao Liu; Maria Cutumisu; Jacqueline P. Leighton – Education and Information Technologies, 2024
Post-secondary data literacy education is critical to students' academic and career success. However, the literature has not adequately addressed the conceptualization and assessment of data literacy for post-secondary students. In this study, we introduced a novel digital performance-based assessment for teaching and evaluating post-secondary…
Descriptors: Performance Based Assessment, College Students, Information Literacy, Evaluation Methods
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques