Showing 1 to 15 of 79 results
Peer reviewed
Cho, Minji; Castleman, Ann Marie; Umans, Haley; Mwirigi, Mike Osiemo – American Journal of Evaluation, 2023
Evaluation scholars have committed decades of work to the development of evaluator competencies. The 2018 American Evaluation Association (AEA) Evaluator Competencies may be useful for evaluators to identify their strengths and weaknesses to improve their practice; however, a few empirically validated self-assessment tools based on the…
Descriptors: Evaluators, Competence, Test Construction, Test Validity
Peer reviewed
Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024
In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…
Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy
Vitello, Sylvia; Crisp, Victoria; Ireland, Jo – Research Matters, 2023
Assessment materials must be checked for errors before they are presented to candidates. Any errors have the potential to reduce validity. For example, in the most extreme cases, an error may turn an otherwise well-designed exam question into one that is impossible to answer. In Cambridge University Press & Assessment, assessment materials are…
Descriptors: Check Lists, Test Validity, Error Correction, Test Construction
Peer reviewed
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Peer reviewed
Li, Jiuliang; Wang, Qian – Asian-Pacific Journal of Second and Foreign Language Education, 2021
Summary writing is essential for academic success, and has attracted renewed interest in academic research and large-scale language test. However, less attention has been paid to the development and evaluation of the scoring scales of summary writing. This study reports on the validation of a summary rubric that represented an approach to scale…
Descriptors: Validity, Rating Scales, Writing Skills, Writing Evaluation
Peer reviewed
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
Peer reviewed
Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024
Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…
Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques
Peer reviewed
Castillo Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis – International Journal of Educational Methodology, 2021
The self-report and think-aloud approaches are the two dominant methodologies to measure metacognition. This is problematic, since they generate respondent and confirmation biases, respectively. The Meta-Performance Test is an innovative battery, which evaluates metacognition based on the respondent's performance, mitigating the aforementioned…
Descriptors: Metacognition, Measurement Techniques, Reading Comprehension, Arithmetic
Peer reviewed
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Peer reviewed
Cahyono, Sulistio Mukti; Kartawagiran, Badrun; Mahmudah, Fitri Nur – European Journal of Educational Research, 2021
Teachers who can adapt and be ready for all changes will also be able to provide a balance to increase the competence of vocational high school students. This is also not denied when teachers become assessors in student competency tests. The objectives of this study were to produce an instrument for the readiness of teachers as assessors; to…
Descriptors: Readiness, Vocational Education Teachers, Vocational High Schools, High School Students
Peer reviewed
Limgomolvilas, Sasithorn; Wudthayagorn, Jirada – rEFLections, 2022
Although the ethnography of speaking is one of the approaches used to analyze discourse (Schiffrin, 1994; Cameron, 2012), its benefits and uses can be applied in the field of language assessment when designing a drug-dispensing, classroom-based test task. In this article, pharmacy specialists and students functioning as members of the…
Descriptors: Ethnography, Teaching Methods, Discourse Analysis, Language Tests
Peer reviewed
Jølle, Lennart; Skar, Gustaf B. – Scandinavian Journal of Educational Research, 2020
This paper reports findings from a project called "The National Panel of Raters" (NPR) that took place within a writing test programme in Norway (2010-2016). A recent research project found individual differences between the raters in the NPR. This paper reports results from an explorative follow up-study where 63 NPR members were…
Descriptors: Foreign Countries, Validity, Scoring, Program Descriptions
Ballard, Laura – ProQuest LLC, 2017
Rater scoring has an impact on writing test reliability and validity. Thus, there has been a continued call for researchers to investigate issues related to rating (Crusan, 2015). Investigating the scoring process and understanding how raters arrive at particular scores are critical "because the score is ultimately what will be used in making…
Descriptors: Evaluators, Schemata (Cognition), Eye Movements, Scoring Rubrics
Peer reviewed
Tam, Cheung On – International Journal of Art & Design Education, 2018
This article reports on the development and validation of a rubric for assessing students' written responses to artworks. Since the implementation of the Hong Kong New Senior Secondary Curriculum in 2009, art educators have seen responding to artworks as increasingly important. In this context, the Art Criticism Assessment Rubric (ACAR) was…
Descriptors: Foreign Countries, Art Education, Art Appreciation, Student Evaluation
Peer reviewed
Sharma Mittal, Ruhi; Nagar, Seema; Sharma, Mourvi; Dwivedi, Utkarsh; Dey, Prasenjit; Kokku, Ravi – International Educational Data Mining Society, 2018
As education gets increasingly digitized, and intelligent tutoring systems gain commercial prominence, scalable assessment generation mechanisms become a critical requirement for enabling increased learning outcomes. Assessments provide a way to measure learners' level of understanding and difficulty, and personalize their learning. There have…
Descriptors: Vocabulary Development, Language Tests, Semantics, Associative Learning