Publication Date
In 2025 | 13 |
Since 2024 | 68 |
Since 2021 (last 5 years) | 220 |
Since 2016 (last 10 years) | 435 |
Since 2006 (last 20 years) | 692 |
Descriptor
Student Evaluation | 1215 |
Test Validity | 715 |
Test Reliability | 381 |
Evaluation Methods | 369 |
Foreign Countries | 356 |
Validity | 343 |
Test Construction | 210 |
Reliability | 167 |
Scores | 146 |
Academic Achievement | 143 |
Higher Education | 141 |
More ▼ |
Source
Author
Tindal, Gerald | 10 |
Doherty, Austin | 5 |
Mentkowski, Marcia | 5 |
Pollock, Steven J. | 5 |
Deno, Stanley L. | 4 |
Greenan, James P. | 4 |
McDermott, Paul A. | 4 |
Nakamura, Yuji | 4 |
Smith, Douglas K. | 4 |
Abedi, Jamal | 3 |
Bogo, Marion | 3 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 46 |
Practitioners | 29 |
Teachers | 12 |
Administrators | 9 |
Policymakers | 5 |
Parents | 1 |
Location
Australia | 28 |
Indonesia | 26 |
Canada | 25 |
Turkey | 23 |
China | 21 |
Florida | 18 |
United Kingdom (England) | 18 |
Germany | 14 |
United Kingdom | 14 |
Hong Kong | 11 |
Netherlands | 11 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 5 |
Individuals with Disabilities… | 3 |
Elementary and Secondary… | 2 |
Elementary and Secondary… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Harald A. Mieg; Katrin E. Klieme; Emma Barker; Jane Bryan; Caroline Gibson; Susanne Haberstroh; Femi Odebiyi; Frano P. Rismondo; Brigitte Römmer-Nossek; Janina Thiem; Erika Unterpertinger – Education and Information Technologies, 2024
This article presents a ten-item short scale for measuring digital competence. The scale is based on the Digital Competence Framework for Citizens, DigComp2.1 (Carretero et al., 2017). For our surveys, we used five items from the DigCompSat study (Clifford et al., 2020) and created five new ones to address the competence areas defined by…
Descriptors: Digital Literacy, Competence, Student Evaluation, Undergraduate Students
Christine M. White; Christopher Schatschneider – Contemporary School Psychology, 2024
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Christine M. White; Christopher Schatschneider – Grantee Submission, 2023
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Daniel R. Wissinger; Adrea J. Truckenmiller; Amber E. Konek; Stephen Ciullo – Reading & Writing Quarterly, 2024
The purpose of this meta-analysis was to evaluate the potential of two silent reading fluency measures as indicators of reading competence. Specifically, we analyzed score differences between the "Test of Silent Contextual Reading Fluency" (TOSCRF), the "Test of Silent Word Reading Fluency" (TOSWRF), and other standardized…
Descriptors: Silent Reading, Reading Fluency, Reading Tests, Test Validity
Matt Homer – Advances in Health Sciences Education, 2024
Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…
Descriptors: Examiners, Scoring, Validity, Cutting Scores
Mohsen Vahdani; Lorcan Cronin; Najmeh Rezasoltani – Journal of Teaching in Physical Education, 2024
Purpose: The purpose of this research was to develop and assess the psychometric properties of the Persian version of the Life Skills Scale for Physical Education (P-LSSPE). Method: During Study 1, which included four translators, eight physical education experts, and 45 physical education students, the LSSPE was translated and adapted into…
Descriptors: Psychometrics, Translation, Indo European Languages, Personal Autonomy
Chitra Sabapathy – Shanlax International Journal of Education, 2024
Background: Mid-semester evaluations are gaining traction as a means to gather evaluation data for formative purposes. However, it is not clear if course coordinators who conduct these evaluations are adequately equipped with evaluative knowledge and skills to guide them through their evaluative processes. Objectives: This study is a…
Descriptors: Evaluation Methods, Instructor Coordinators, Tutors, College Students
Moreno, Lorena; Briñol, Pablo; Petty, Richard E. – Metacognition and Learning, 2022
The present research examined the role of metacognitive confidence in understanding to what extent people's valenced thoughts guide their performance in academic settings. First, students were asked to engage in positive or negative thinking about exams in their major area of study (Study 1) or about themselves (Studies 2 and 3). The valence of…
Descriptors: Metacognition, Self Esteem, Academic Achievement, Student Evaluation
Seyda Aydin-Karaca; Sule Kilinç – Acta Educationis Generalis, 2024
Intellectual giftedness is an important student characteristic that teachers need to take into consideration when designing education programs and providing educational support to these students. Effective nomination and identification are the basis for further education. In nominating gifted students for special educational programs, teachers…
Descriptors: Foreign Countries, Teachers, Academically Gifted, Test Validity
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024
Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…
Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability
Lovett, Benjamin J.; Spenceley, Laura M.; Schaberg, Theresa M.; Best, Haley – Psychology in the Schools, 2023
For psychoeducational evaluations to generate useful data, students must put forth sufficient effort on diagnostic tests, and they and their caregivers and teachers must respond honestly and carefully when asked about symptoms. These features are collectively known as response validity, a concept widely discussed in neuropsychological assessment…
Descriptors: Psychoeducational Methods, Diagnostic Tests, Responses, Validity
Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022
Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…
Descriptors: Validity, Performance, Standards, Multivariate Analysis
Wan Fazwani Wan Mat; Lim Hooi Lian – Journal of Education and Learning (EduLearn), 2025
This bibliometric article examines the current state of publication in the field of classroom assessment, exploring the productivity and influence of countries, institutions, and authors. A search query of on the Scopus database using the term "classroom assessment" or "classroom-based assessment" or "assessment for…
Descriptors: Alternative Assessment, Student Evaluation, Bibliometrics, Formative Evaluation