Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 9 |
| Since 2017 (last 10 years) | 25 |
| Since 2007 (last 20 years) | 89 |
Descriptor
| Construct Validity | 150 |
| Evaluation Methods | 150 |
| Test Validity | 66 |
| Test Reliability | 51 |
| Test Construction | 43 |
| Foreign Countries | 37 |
| Factor Analysis | 28 |
| Measurement Techniques | 24 |
| Psychometrics | 22 |
| Student Evaluation | 20 |
| Questionnaires | 18 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 6 |
| Practitioners | 2 |
| Teachers | 1 |
Location
| United Kingdom | 5 |
| Taiwan | 4 |
| Australia | 3 |
| China | 2 |
| Indonesia | 2 |
| Iran | 2 |
| South Africa | 2 |
| Arizona | 1 |
| California | 1 |
| Canada (Calgary) | 1 |
| Canada (Montreal) | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Mohammad Hmoud; Hadeel Swaity; Eman Anjass; Eva María Aguaded-Ramírez – Electronic Journal of e-Learning, 2024
This research aimed to develop and validate a rubric to assess Artificial Intelligence (AI) chatbots' effectiveness in accomplishing tasks, particularly within educational contexts. Given the rapidly growing integration of AI in various sectors, including education, a systematic and robust tool for evaluating AI chatbot performance is essential.…
Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction
Anani Sarab, Mohammad Reza; Rahmani, Simindokht – International Journal of Language Testing, 2023
Language testing and assessment have grown in popularity and gained significance in the last few decades, and there is a rising need for assessment literate stakeholders in the field of language education. As teachers play a major role in assessing students, there is a need to make sure they have the right level of assessment knowledge and skills…
Descriptors: Language Tests, Literacy, Second Language Learning, Factor Analysis
Renzulli, Joseph; Beghetto, Ronald; Brandon, Laurel; Karwowski, Maciej – Gifted Education International, 2022
This article describes the development of an instrument for examining schools as institutions where teaching practices and school structures provide opportunities and support for student imagination, creativity, and innovation, as well as initial comparisons using the instrument, using a sample of n = 5020 students and n = 268 teachers (n = 161…
Descriptors: Test Construction, Imagination, Creativity, Innovation
Corinna Jaschek; Julia von Thienen; Kim-Pascal Borchart; Christoph Meinel – Creativity Research Journal, 2023
The automation of creativity measurement is a promising avenue of development, given that classic creativity assessments face challenges such as resource-intensive expert judgments, subjective creativity ratings, and biases in people's self-reports. In this paper, we present a construct validation study for CollaboUse, a test developed to deliver…
Descriptors: Automation, Creativity Tests, Cooperation, Construct Validity
Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022
Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Kriswantoro; Kartowagiran, Badrun; Rohaeti, Eli – European Journal of Educational Research, 2021
Every school should be able to equip students to have the ability to integrate the knowledge gained with real life in responding to global challenges. Assessment of learning outcomes in the form of cognitive and skill aspects must go hand in hand. This study aims to produce: (1) a critical thinking model integrated with the science process, (2)…
Descriptors: Critical Thinking, Student Evaluation, Evaluation Methods, Science Process Skills
Nicole D. Martin; Stephanie N. Baker; Madeline Haynes; Jayce R. Warner – Computer Science Education, 2024
Background and Context: As computer science (CS) education expands and the need for well-prepared CS teachers grows, understanding what motivates teachers to teach CS can help address challenges to recruiting, preparing, and retaining teachers. Objective: The goal of this work was to develop and validate a scale that measures teachers' motivation…
Descriptors: Computer Science Education, Teacher Motivation, Measurement Techniques, Construct Validity
Jaime Barratt; Dean Dudley; Michalis Stylianou; George Thomas; Kai Wheeler; John Cairney – Measurement in Physical Education and Exercise Science, 2025
This study evaluated the reliability and construct validity of the 51-item Effective Early Childhood Physical Literacy Pedagogue self-report instrument (ECE-PLP) measuring Early Childhood Educators' (ECEs) perceived physical literacy (PPL) capabilities, knowledge, and practices for effectively promoting PL in young children. 494 ECEs completed the…
Descriptors: Foreign Countries, Physical Activity Level, Multiple Literacies, Construct Validity
Lee, Walter C.; Hall, Janice L.; Godwin, Allison; Knight, David B.; Verdín, Dina – Journal of Engineering Education, 2022
Background: Supporting undergraduate students in science, technology, engineering, and mathematics (STEM) has been a persistent need. However, assessing the impact of support efforts can prove challenging as it is difficult to operationalize student support and subsequently monitor the combined impacts of the various supports to which students…
Descriptors: Undergraduate Students, Engineering Education, Student Attitudes, STEM Education
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017
We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…
Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods
Pennell, Adam – ProQuest LLC, 2019
This dissertation consists of three studies which examined multidimensional balance in youth (= 21 years; Individuals with Disabilities Education Act, 2004) with visual impairments (VIs) using the Brief-Balance Evaluation Systems Test (Brief-BESTest). These studies have the potential to inform (adapted) physical education curricula and…
Descriptors: Psychomotor Skills, Youth, Visual Impairments, Human Posture
Bailes, Lauren P.; Nandakumar, Ratna – International Journal of Education Policy and Leadership, 2020
High-quality measurement tools are critical to school improvement efforts. Education researchers frequently employ surveys in order to assess a host of variables associated with school improvement. This article asserts that Rasch modeling techniques enhance the quality of a measurement tool because they comprise elements of both qualitative and…
Descriptors: Surveys, Evaluation Methods, Item Response Theory, Administrator Role
Subando, Joko; Kartowagiran, Badrun; Munadi, Sudji – International Journal of Evaluation and Research in Education, 2021
The purpose of this research was to develop a curriculum design evaluation instrument in strengthening Al-Irsyad ideology. The research activity began with a literature review on the curriculum then continues with the development of the instrument. The results of the development of the instrument items were validated by 11 experts and tested on a…
Descriptors: Foreign Countries, Curriculum Design, Curriculum Evaluation, Evaluation Methods

Peer reviewed
Direct link
