Publication Date
In 2025 | 8 |
Since 2024 | 39 |
Since 2021 (last 5 years) | 115 |
Since 2016 (last 10 years) | 256 |
Since 2006 (last 20 years) | 416 |
Descriptor
Student Evaluation | 962 |
Test Reliability | 962 |
Test Validity | 700 |
Evaluation Methods | 335 |
Test Construction | 239 |
Foreign Countries | 188 |
Elementary Secondary Education | 159 |
Higher Education | 120 |
Psychometrics | 101 |
Academic Achievement | 100 |
Standardized Tests | 94 |
Author
Greenan, James P. | 8 |
Tindal, Gerald | 7 |
Deno, Stanley L. | 4 |
Fuchs, Lynn S. | 4 |
Popham, W. James | 4 |
Ysseldyke, James E. | 4 |
Alonzo, Julie | 3 |
Anderson, Daniel | 3 |
Baker, Eva L. | 3 |
Bracey, Gerald W. | 3 |
Epstein, Michael H. | 3 |
Audience
Practitioners | 74 |
Researchers | 47 |
Teachers | 46 |
Administrators | 26 |
Policymakers | 8 |
Students | 7 |
Parents | 4 |
Support Staff | 4 |
Community | 3 |
Counselors | 1 |
Location
Australia | 22 |
United Kingdom | 20 |
Turkey | 18 |
Canada | 15 |
Indonesia | 12 |
United Kingdom (England) | 11 |
United States | 10 |
China | 8 |
Florida | 8 |
New York | 8 |
Germany | 7 |
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Scott J. Peters; Matthew C. Makel; Lindsay Ellis Lee; Tamra Stambaugh; Matthew T. McBee; D. Betsy McCoach; Kiana R. Johnson – Gifted Child Today, 2024
Universal screening is one of the most-common topics and well-accepted best practices within the field of gifted and talented education. There appears to be little disagreement that universally screening all students as part of a gifted and talented identification process results in fewer missed students. But surprisingly, there is little guidance…
Descriptors: Academically Gifted, Talent Identification, Screening Tests, Test Validity
Chunhua Liu; Panwang Yang – European Journal of Education, 2024
Student satisfaction in online live classes is considered an important criterion to evaluate the effectiveness of this instructional system. This study aims to develop a performance evaluation index to measure the satisfaction of students who have mastered Chinese language and literature through online live classes. Guided by survey techniques and…
Descriptors: Student Satisfaction, Online Courses, Performance Based Assessment, Chinese
Erlis Çela; Alban Tufa; Melsena Danglli – Journal of Media Literacy Education, 2025
Media and Information Literacy (MIL) is a topic discussed by many authors in recent years. Several authors have developed measurement scales and validated them mainly among students and teachers who have media and information literacy knowledge. Although no evidence is tracked for any research conducted, MIL has been an unexplored field in the…
Descriptors: College Students, Media Literacy, Information Literacy, Student Evaluation
Sievers, Matt; Reemts, Connor; Dickinson, Katherine J.; Mukerji, Joya; Beltran, Ismael Barreras; Theobald, Elli J.; Velasco, Vicente; Freeman, Scott – Biochemistry and Molecular Biology Education, 2023
Researchers have called for undergraduate courses to update teaching frameworks based on the Modern Synthesis with insights from molecular biology, by stressing the molecular underpinnings of variation and adaptation. To support this goal, we developed a modified version of the widely used Assessing Conceptual Reasoning of Natural Selection…
Descriptors: Student Evaluation, Knowledge Level, Molecular Biology, Evolution
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024
The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…
Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity
Power, Jason Richard; Tanner, David – European Journal of Engineering Education, 2023
Self and peer assessments have been identified as effective strategies to develop a deeper understanding of complex concepts, enhance meta-cognitive capacity, and support learner self-efficacy. This study examines data related to peer and self-assessment exercises completed within a university engineering programme (n=61). Data related to…
Descriptors: Peer Evaluation, Self Evaluation (Individuals), Feedback (Response), Engineering Education
Emily L. Coderre – College Teaching, 2024
Psychometrics is the field of designing tests and assessments to measure certain psychological concepts. It is chiefly concerned with two fundamental properties: reliability and validity. These properties are often influenced by confounding variables: other things that can influence performance but are not what you are trying to measure. Here, I…
Descriptors: Teaching Methods, Psychometrics, Test Construction, Test Reliability
Harald A. Mieg; Katrin E. Klieme; Emma Barker; Jane Bryan; Caroline Gibson; Susanne Haberstroh; Femi Odebiyi; Frano P. Rismondo; Brigitte Römmer-Nossek; Janina Thiem; Erika Unterpertinger – Education and Information Technologies, 2024
This article presents a ten-item short scale for measuring digital competence. The scale is based on the Digital Competence Framework for Citizens, DigComp2.1 (Carretero et al., 2017). For our surveys, we used five items from the DigCompSat study (Clifford et al., 2020) and created five new ones to address the competence areas defined by…
Descriptors: Digital Literacy, Competence, Student Evaluation, Undergraduate Students
Katherine E. Castellano; Daniel F. McCaffrey; Joseph A. Martineau – Educational Measurement: Issues and Practice, 2025
Growth-to-standard models evaluate student growth against the growth needed to reach a future standard or target of interest, such as proficiency. A common growth-to-standard model involves comparing the popular Student Growth Percentile (SGP) to Adequate Growth Percentiles (AGPs). AGPs follow from an involved process based on fitting a series of…
Descriptors: Student Evaluation, Growth Models, Student Educational Objectives, Educational Indicators
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management
Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023
Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…
Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education
Delia Leuenberger; Elisabeth Moser Opitz; Noemi Gloor – Journal of Numerical Cognition, 2024
Computation competence (CC) in simple addition and subtraction using non-counting (NC) strategies is an important learning objective in Grade 1 mathematics but many children, especially low achievers in mathematics, struggle to acquire these skills. To provide these students with the support they need, it is important to have valid and reliable…
Descriptors: Computation, Mathematics Skills, Addition, Subtraction