Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 40 |
Since 2006 (last 20 years) | 62 |
Descriptor
Comparative Analysis | 80 |
Foreign Countries | 43 |
Reliability | 36 |
Interrater Reliability | 23 |
Test Reliability | 23 |
Validity | 21 |
Correlation | 20 |
Questionnaires | 18 |
Scores | 18 |
Statistical Analysis | 18 |
English (Second Language) | 16 |
More ▼ |
Source
Author
Publication Type
Tests/Questionnaires | 80 |
Reports - Research | 72 |
Journal Articles | 64 |
Speeches/Meeting Papers | 8 |
Reports - Evaluative | 5 |
Numerical/Quantitative Data | 3 |
Information Analyses | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Location
Australia | 4 |
Iran | 4 |
New Jersey | 3 |
Saudi Arabia | 3 |
District of Columbia | 2 |
Finland | 2 |
Illinois | 2 |
Indonesia | 2 |
Japan | 2 |
Malaysia | 2 |
Netherlands | 2 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024
Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…
Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Hunter, Seth B. – Journal of Education Human Resources, 2023
Teacher performance scores inform education leaders' management of teacher human resources. However, prior research has implied that different interpretations of performance criteria between teachers and their evaluators suppress teacher development. Although research has examined teacher perceptions of performance scores and compared teacher…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Self Evaluation (Individuals), Interrater Reliability
Purwanto; Hidayah, Niswatul; Wagistina, Satti – International Journal of Educational Methodology, 2023
Learning geography in Indonesia philosophically aims to develop spatial literacy. Students must improve spatial literacy to form reasoning skills and apply spatial concepts in real life. Applying Gersmehl's spatial learning can improve students' spatial literacy through syntax arranged based on spatial aspects. The use of google earth helps…
Descriptors: Spatial Ability, Natural Disasters, Geography Instruction, Teaching Methods
Colvin, Kimberly F.; Gorgun, Guher – Practical Assessment, Research & Evaluation, 2020
This study compares a scale, the Rosenberg Self-Esteem Scale, that was administered with four response categories to versions of the same scale that were administered with six and eight response categories. Respondents were randomly assigned to take one of the three versions of RSES. A rating scale utility analysis was conducted on all three…
Descriptors: Psychometrics, Measures (Individuals), Self Concept Measures, Self Esteem
Azman Ong, Mohd Hanafi; Mohd Yasin, Norazlina; Ibrahim, Nur Syafikah – Asian Association of Open Universities Journal, 2022
Purpose: Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and research of validated instruments that could effectively measure online learning response behavior is limited. Thus, in this study, a new instrument was designed…
Descriptors: Online Courses, Student Surveys, Student Attitudes, Factor Analysis
Re-Imagining Narrative Writing and Assessment: A Post-NAPLAN Craft-Based Rubric for Creative Writing
Michael D. Carey; Shelley Davidow; Paul Williams – Australian Journal of Language and Literacy, 2022
According to creative writing pedagogies academic Susanne Gannon ("English in Australia, 54"(2), 43-56, 2019), and the Federal government-commissioned NAPLAN review (McGaw et al., 2020), NAPLAN has restricted how writing is taught in secondary schools. A NAPLAN-influenced structural approach to teaching writing has subsumed the…
Descriptors: Scoring Rubrics, Creative Writing, Writing Evaluation, National Competency Tests
Manzano, Dexter L. – International Journal of Language Testing, 2022
The increasing popularity of self-assessment prompted several scholars to investigate its effectiveness and accuracy in relation to teacher assessment. However, most of these studies focused only on the consistency estimate perspective. Thus, the current study investigated the interrater reliability between self- and teacher assessment of…
Descriptors: Oral Language, Self Evaluation (Individuals), College Students, Interrater Reliability
Bronkhorst, Hugo; Roorda, Gerrit; Suhre, Cor; Goedhart, Martin – Research in Mathematics Education, 2022
Logical reasoning as part of critical thinking is becoming more and more important to prepare students for their future life in society, work, and study. This article presents the results of a quasi-experimental study with a pre-test-post-test control group design focusing on the effective use of formalisations to support logical reasoning. The…
Descriptors: Mathematics Instruction, Teaching Methods, Logical Thinking, Critical Thinking
Marshall, Neil; Shaw, Kirsten; Hunter, Jodie; Jones, Ian – New Zealand Journal of Educational Studies, 2020
There is growing interest in using comparative judgement to assess student work as an alternative to traditional marking. Comparative judgement requires no rubrics and is instead grounded in experts making pairwise judgements about the relative 'quality' of students' work according to a high level criterion. The resulting decision data are fitted…
Descriptors: Comparative Analysis, Decision Making, Student Evaluation, Evaluation Methods
Romeo, Marina; Yepes-Baldó, Montserrat; González, Vicenta; Burset, Silvia; Martín, Carolina; Bosch, Emma – International Journal of Instruction, 2022
The assessment process in higher education considers four aspects: assessment agents, procedure, content, and scoring. In this study, we delve into the who. We analyze the role of transversal competence assessment agents in the framework of professional internships in university master's degree programs, comparing the suitability of their…
Descriptors: Internship Programs, Higher Education, Evaluators, Masters Programs
Farwell, Tricia M.; Alligood, Leon; Fitzgerald, Sharon; Blake, Ken – Journalism and Mass Communication Educator, 2016
This article introduces an objective grammar and math assessment and evaluates the assessment's outcome and reliability when fielded among eighty-one students in media writing courses. In addition, the article proposes a rubric for grading straight news leads and compares the rubric's reliability with the reliability of rating straight news leads…
Descriptors: Journalism, Journalism Education, Introductory Courses, Reliability
Jeong, Heejeong – Language Testing in Asia, 2019
In writing assessment, finding a valid, reliable, and efficient scale is critical. Appropriate scales, increase rater reliability, and can also save time and money. This exploratory study compared the effects of a binary scale and an analytic scale across teacher raters and expert raters. The purpose of the study is to find out how different scale…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Essa, Eman Bani; Alattari, Aref – Research in Educational Administration & Leadership, 2019
This study aimed at identifying patterns of the followership styles and their relation to the leadership styles of academic leaders as perceived by faculty members in public and private universities in northern Jordan. The researchers used the descriptive correlation approach. The Kelley's scale was adopted for the followership styles, and…
Descriptors: Leadership Styles, College Faculty, State Universities, Private Colleges
Linlin, Cao – English Language Teaching, 2020
Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…
Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning