Harald A. Mieg; Katrin E. Klieme; Emma Barker; Jane Bryan; Caroline Gibson; Susanne Haberstroh; Femi Odebiyi; Frano P. Rismondo; Brigitte Römmer-Nossek; Janina Thiem; Erika Unterpertinger – Education and Information Technologies, 2024
This article presents a ten-item short scale for measuring digital competence. The scale is based on the Digital Competence Framework for Citizens, DigComp2.1 (Carretero et al., 2017). For our surveys, we used five items from the DigCompSat study (Clifford et al., 2020) and created five new ones to address the competence areas defined by…
Descriptors: Digital Literacy, Competence, Student Evaluation, Undergraduate Students
Christine M. White; Christopher Schatschneider – Contemporary School Psychology, 2024
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice teachers' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Bloomian HOTS is one of several types of HOTS. To help preservice teachers create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Gagan Shergill – Communique, 2025
Although school psychologists often comment on examinee motivation in their reports, systematic evaluation of effort is not common practice. Empirical assessment of performance effort provides critical evidence for the validity of evaluations and will likely lead to more valid assessments, recommendations, and placements. This article focuses on…
Descriptors: Testing, Student Behavior, Student Motivation, Student Evaluation
Christine M. White; Christopher Schatschneider – Grantee Submission, 2023
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Daniel R. Wissinger; Adrea J. Truckenmiller; Amber E. Konek; Stephen Ciullo – Reading & Writing Quarterly, 2024
The purpose of this meta-analysis was to evaluate the potential of two silent reading fluency measures as indicators of reading competence. Specifically, we analyzed score differences between the "Test of Silent Contextual Reading Fluency" (TOSCRF), the "Test of Silent Word Reading Fluency" (TOSWRF), and other standardized…
Descriptors: Silent Reading, Reading Fluency, Reading Tests, Test Validity
Matt Homer – Advances in Health Sciences Education, 2024
Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…
Descriptors: Examiners, Scoring, Validity, Cutting Scores
Mohsen Vahdani; Lorcan Cronin; Najmeh Rezasoltani – Journal of Teaching in Physical Education, 2024
Purpose: The purpose of this research was to develop and assess the psychometric properties of the Persian version of the Life Skills Scale for Physical Education (P-LSSPE). Method: During Study 1, which included four translators, eight physical education experts, and 45 physical education students, the LSSPE was translated and adapted into…
Descriptors: Psychometrics, Translation, Indo European Languages, Personal Autonomy
Chitra Sabapathy – Shanlax International Journal of Education, 2024
Background: Mid-semester evaluations are gaining traction as a means to gather evaluation data for formative purposes. However, it is not clear if course coordinators who conduct these evaluations are adequately equipped with evaluative knowledge and skills to guide them through their evaluative processes. Objectives: This study is a…
Descriptors: Evaluation Methods, Instructor Coordinators, Tutors, College Students
Moreno, Lorena; Briñol, Pablo; Petty, Richard E. – Metacognition and Learning, 2022
The present research examined the role of metacognitive confidence in understanding to what extent people's valenced thoughts guide their performance in academic settings. First, students were asked to engage in positive or negative thinking about exams in their major area of study (Study 1) or about themselves (Studies 2 and 3). The valence of…
Descriptors: Metacognition, Self Esteem, Academic Achievement, Student Evaluation
Seyda Aydin-Karaca; Sule Kilinç – Acta Educationis Generalis, 2024
Intellectual giftedness is an important student characteristic that teachers need to take into consideration when designing education programs and providing educational support to these students. Effective nomination and identification are the basis for further education. In nominating gifted students for special educational programs, teachers…
Descriptors: Foreign Countries, Teachers, Academically Gifted, Test Validity
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Scott J. Peters; Matthew C. Makel; Lindsay Ellis Lee; Tamra Stambaugh; Matthew T. McBee; D. Betsy McCoach; Kiana R. Johnson – Gifted Child Today, 2024
Universal screening is one of the most-common topics and well-accepted best practices within the field of gifted and talented education. There appears to be little disagreement that universally screening all students as part of a gifted and talented identification process results in fewer missed students. But surprisingly, there is little guidance…
Descriptors: Academically Gifted, Talent Identification, Screening Tests, Test Validity
Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024
Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…
Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability
Lovett, Benjamin J.; Spenceley, Laura M.; Schaberg, Theresa M.; Best, Haley – Psychology in the Schools, 2023
For psychoeducational evaluations to generate useful data, students must put forth sufficient effort on diagnostic tests, and they and their caregivers and teachers must respond honestly and carefully when asked about symptoms. These features are collectively known as response validity, a concept widely discussed in neuropsychological assessment…
Descriptors: Psychoeducational Methods, Diagnostic Tests, Responses, Validity