ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	29

Descriptor

Evaluators	79
Test Construction	79
Scoring	24
Evaluation Methods	21
Interrater Reliability	18
Foreign Countries	17
Test Items	17
Elementary Secondary Education	14
Test Reliability	13
Test Validity	13
Language Tests	12
Writing Evaluation	10
Evaluation Criteria	9
Program Evaluation	9
Second Language Learning	9
Testing Programs	9
Comparative Analysis	8
Educational Assessment	8
Higher Education	8
Student Evaluation	8
Data Collection	7
English (Second Language)	7
Measurement Techniques	7
Performance Based Assessment	7
Scores	7
More ▼

Publication Type

Reports - Research	42
Journal Articles	37
Speeches/Meeting Papers	20
Reports - Evaluative	19
Reports - Descriptive	11
Tests/Questionnaires	11
Dissertations/Theses -…	3
Guides - General	2
Guides - Non-Classroom	2
Opinion Papers	2
Collected Works - Serials	1
Historical Materials	1
Information Analyses	1
Numerical/Quantitative Data	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	6
Postsecondary Education	4
Elementary Education	3
Secondary Education	3
Elementary Secondary Education	2
Adult Education	1
Early Childhood Education	1
Grade 10	1
Grade 3	1
Grade 6	1
Grade 8	1
High Schools	1
More ▼

Audience

Researchers	5
Practitioners	1
Teachers	1

Location

Australia	3
United States	3
California	2
Florida	2
Hong Kong	2
Israel	2
Alabama	1
Canada	1
China	1
Europe	1
Indonesia	1
Kyrgyzstan	1
Mexico (Oaxaca)	1
Netherlands	1
New York	1
Nigeria	1
Norway	1
Ohio	1
Thailand	1
Turkey	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	3
Alabama High School…	1
International English…	1
National Teacher Examinations	1
Praxis Series	1
Program for International…	1
Test of English as a Foreign…	1
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 79 results Save | Export

Measuring Evaluator Competencies: Developing and Validating the Evaluator Competencies Assessment Tool

Peer reviewed

Direct link

Cho, Minji; Castleman, Ann Marie; Umans, Haley; Mwirigi, Mike Osiemo – American Journal of Evaluation, 2023

Evaluation scholars have committed decades of work to the development of evaluator competencies. The 2018 American Evaluation Association (AEA) Evaluator Competencies may be useful for evaluators to identify their strengths and weaknesses to improve their practice; however, a few empirically validated self-assessment tools based on the…

Descriptors: Evaluators, Competence, Test Construction, Test Validity

Language Testers and Their Place in the Policy Web

Peer reviewed

Direct link

Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024

In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…

Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy

An Example of Redeveloping Checklists to Support Assessors Who Check Draft Exam Papers for Errors

Download full text

Vitello, Sylvia; Crisp, Victoria; Ireland, Jo – Research Matters, 2023

Assessment materials must be checked for errors before they are presented to candidates. Any errors have the potential to reduce validity. For example, in the most extreme cases, an error may turn an otherwise well-designed exam question into one that is impossible to answer. In Cambridge University Press & Assessment, assessment materials are…

Descriptors: Check Lists, Test Validity, Error Correction, Test Construction

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Direct link

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Development and Validation of a Rating Scale for Summarization as an Integrated Task

Peer reviewed

Direct link

Li, Jiuliang; Wang, Qian – Asian-Pacific Journal of Second and Foreign Language Education, 2021

Summary writing is essential for academic success, and has attracted renewed interest in academic research and large-scale language test. However, less attention has been paid to the development and evaluation of the scoring scales of summary writing. This study reports on the validation of a summary rubric that represented an approach to scale…

Descriptors: Validity, Rating Scales, Writing Skills, Writing Evaluation

Using Rasch Analysis to Examine Raters' Expertise Turkish Teacher Candidates' Competency Levels in Writing Different Types of Test Items

Peer reviewed
PDF on ERIC

Download full text

Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022

The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…

Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Direct link

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Presenting the Meta-Performance Test, a Metacognitive Battery Based on Performance

Peer reviewed
PDF on ERIC

Download full text

Castillo Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis – International Journal of Educational Methodology, 2021

The self-report and think-aloud approaches are the two dominant methodologies to measure metacognition. This is problematic, since they generate respondent and confirmation biases, respectively. The Meta-Performance Test is an innovative battery, which evaluates metacognition based on the respondent's performance, mitigating the aforementioned…

Descriptors: Metacognition, Measurement Techniques, Reading Comprehension, Arithmetic

Rater Certification Tests: A Psychometric Approach

Peer reviewed

Direct link

Attali, Yigal – Educational Measurement: Issues and Practice, 2019

Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…

Descriptors: Evaluators, Certification, High Stakes Tests, Scoring

Construct Exploration of Teacher Readiness as an Assessor of Vocational High School Competency Test

Peer reviewed
PDF on ERIC

Download full text

Cahyono, Sulistio Mukti; Kartawagiran, Badrun; Mahmudah, Fitri Nur – European Journal of Educational Research, 2021

Teachers who can adapt and be ready for all changes will also be able to provide a balance to increase the competence of vocational high school students. This is also not denied when teachers become assessors in student competency tests. The objectives of this study were to produce an instrument for the readiness of teachers as assessors; to…

Descriptors: Readiness, Vocational Education Teachers, Vocational High Schools, High School Students

Designing a Drug-Dispensing Test Task Using the SPEAKING Grid

Peer reviewed
PDF on ERIC

Download full text

Limgomolvilas, Sasithorn; Wudthayagorn, Jirada – rEFLections, 2022

Although the ethnography of speaking is one of the approaches used to analyze discourse (Schiffrin, 1994; Cameron, 2012), its benefits and uses can be applied in the field of language assessment when designing a drug-dispensing, classroom-based test task. In this article, pharmacy specialists and students functioning as members of the…

Descriptors: Ethnography, Teaching Methods, Discourse Analysis, Language Tests

"Digging for Gold" or "Sticking to the Criteria": Teachers' Rationales When Serving as Professional Raters

Peer reviewed

Direct link

Jølle, Lennart; Skar, Gustaf B. – Scandinavian Journal of Educational Research, 2020

This paper reports findings from a project called "The National Panel of Raters" (NPR) that took place within a writing test programme in Norway (2010-2016). A recent research project found individual differences between the raters in the NPR. This paper reports results from an explorative follow up-study where 63 NPR members were…

Descriptors: Foreign Countries, Validity, Scoring, Program Descriptions

The Effects of Primacy on Rater Cognition: An Eye-Tracking Study

Direct link

Ballard, Laura – ProQuest LLC, 2017

Rater scoring has an impact on writing test reliability and validity. Thus, there has been a continued call for researchers to investigate issues related to rating (Crusan, 2015). Investigating the scoring process and understanding how raters arrive at particular scores are critical "because the score is ultimately what will be used in making…

Descriptors: Evaluators, Schemata (Cognition), Eye Movements, Scoring Rubrics

Evaluating Students' Performance in Responding to Art: The Development and Validation of an Art Criticism Assessment Rubric

Peer reviewed

Direct link

Tam, Cheung On – International Journal of Art & Design Education, 2018

This article reports on the development and validation of a rubric for assessing students' written responses to artworks. Since the implementation of the Hong Kong New Senior Secondary Curriculum in 2009, art educators have seen responding to artworks as increasingly important. In this context, the Art Criticism Assessment Rubric (ACAR) was…

Descriptors: Foreign Countries, Art Education, Art Appreciation, Student Evaluation

Using a Common Sense Knowledge Base to Auto Generate Multi-Dimensional Vocabulary Assessments

Peer reviewed
PDF on ERIC

Download full text

Sharma Mittal, Ruhi; Nagar, Seema; Sharma, Mourvi; Dwivedi, Utkarsh; Dey, Prasenjit; Kokku, Ravi – International Educational Data Mining Society, 2018

As education gets increasingly digitized, and intelligent tutoring systems gain commercial prominence, scalable assessment generation mechanisms become a critical requirement for enabling increased learning outcomes. Assessments provide a way to measure learners' level of understanding and difficulty, and personalize their learning. There have…

Descriptors: Vocabulary Development, Language Tests, Semantics, Associative Learning

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Applied Measurement in…	5
American Journal of Evaluation	3
Language Testing	3
ProQuest LLC	3
Australian Review of Applied…	2
Educational Measurement:…	2
Asian-Pacific Journal of…	1
Assessment Update	1
Bill & Melinda Gates…	1
Educational Assessment	1
European Journal of…	1
Evaluation Review	1
Florida Journal of…	1
Grantee Submission	1
International Educational…	1
International Journal of Art…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Measurement	1
Journal of Consulting and…	1
Journal of MultiDisciplinary…	1
Language Assessment Quarterly	1
Language and Education	1
Language and Intercultural…	1
More ▼

Myford, Carol M.	3
Friedman, Charles B.	2
Li, Jiuliang	2
Linacre, John M.	2
Linn, Robert L.	2
Akpe, C. S.	1
Albert Weideman	1
Alexander Kah	1
Angoff, William H.	1
Attali, Yigal	1
Baker, Eva L.	1
Ballard, Laura	1
Bart Deygers	1
Bolton, Dale L.	1
Boser, Judith A.	1
Braverman, Marc T., Ed.	1
Brickell, Henry M.	1
Bridgeford, Nancy J., Comp.	1
Bronson, William H.	1
Brossell, Gordon, Hoetker,…	1
Brown, Anne	1
Brull, Harry	1
Cahyono, Sulistio Mukti	1
Carifio, James	1
More ▼