Publication Date
In 2025 | 11 |
Since 2024 | 39 |
Since 2021 (last 5 years) | 87 |
Since 2016 (last 10 years) | 176 |
Since 2006 (last 20 years) | 423 |
Descriptor
Evaluation Methods | 946 |
Test Reliability | 946 |
Test Validity | 946 |
Student Evaluation | 249 |
Test Construction | 224 |
Foreign Countries | 158 |
Psychometrics | 129 |
Measurement Techniques | 119 |
Higher Education | 118 |
Elementary Secondary Education | 106 |
Evaluation Criteria | 84 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 53 |
Researchers | 49 |
Teachers | 20 |
Administrators | 16 |
Policymakers | 8 |
Students | 3 |
Support Staff | 3 |
Counselors | 2 |
Parents | 1 |
Location
Australia | 18 |
United Kingdom | 14 |
Canada | 11 |
Turkey | 11 |
California | 10 |
Netherlands | 7 |
China | 6 |
Florida | 6 |
Taiwan | 6 |
Germany | 5 |
Illinois | 5 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Paul Alexander Siegel – ProQuest LLC, 2024
While multimodality and multiliteracies has been a concept for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concept within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…
Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability
Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025
Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…
Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity
Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025
The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skils, valid and reliable test instruments are required. Knowledge of…
Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques
Katie L. McDermott – ProQuest LLC, 2024
Nursing education programs are faced with urgent demands to transition to competency-based education (CBE) to address the limitations of the nursing workforce. The AACN (2021) has developed the Essentials, or the core competencies for graduating entry- and advanced-level nurses to inform CBE. A concept analysis of Foundational Competence was…
Descriptors: Job Skills, Employment Qualifications, Nurses, Nursing Education
Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024
The purpose of this systematic review was to survey available measures of creativity for pk12 students for assessments characteristics and reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…
Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity
Saltos-Rivas, Rafael; Novoa-Hernández, Pavel; Serrano Rodríguez, Rocío – SAGE Open, 2022
Evaluating digital competencies has become a topic of growing interest in recent years. Although several reviews and studies have summarized the main elements of progress and shortcomings in this area, some issues are yet to be explored. Very little information is available about the ways of ensuring the validity and reliability of the instrument…
Descriptors: Test Reliability, Test Validity, Evaluation Methods, Technological Literacy
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Kim, Mi Song – International Journal of Technology and Design Education, 2022
Teacher design work has gained increasing attention by re-conceptualizing teachers as designers rather than curriculum deliverers. However, assessing teacher design work can be challenging given that there are very few research tools to assess teacher design knowledge (TDK) competencies. To fill that gap, this study proposes a survey that assesses…
Descriptors: Design, Teacher Characteristics, Teacher Competencies, Teacher Evaluation
Mattar, João; Ramos, Daniela Karine; Lucas, Margarida Rocha – Education and Information Technologies, 2022
The purpose of this article is to compare digital competence assessment instruments based on DigComp related frameworks. The study aims to answer four questions: (a) What types of instruments based on these frameworks are available? (b) How were these instruments created from these frameworks? (c) What procedures were used to guarantee the…
Descriptors: Evaluation Methods, Literature Reviews, Test Construction, Competence
Scott F. Marion, Editor; James W. Pellegrino, Editor; Amy I. Berman, Editor – National Academy of Education, 2024
High-quality assessments are crucial to many aspects of the educational process. They can help policymakers monitor long-term educational trends, assist state educational agencies (SEAs) and local educational agencies (LEAs) in allocating resources and professional development opportunities, provide insights to teachers about how well students…
Descriptors: Educational Assessment, Educational Policy, Equal Education, Test Validity
Marianne Berg Halvorsen; Arvid Nikolai Kildahl; Sabine Kaiser; Brynhildur Axelsdottir; Michael G. Aman; Sissel Berge Helverschou – Journal of Autism and Developmental Disorders, 2025
In recent years, there has been a proliferation of instruments for assessing mental health (MH) among autistic people. This study aimed to review the psychometric properties of broadband instruments used to assess MH problems among autistic people. In accordance with the PRISMA guidelines (PROSPERO: CRD42022316571) we searched the APA PsycINFO via…
Descriptors: Psychometrics, Mental Health, Clinical Diagnosis, Evaluation Methods
Simon Massey – International Journal of Social Research Methodology, 2024
The UK-based article develops a quantitative method for measuring 8-9-year-old children's Gender Ability Beliefs through drawings, assessing the reliability and validity of the measure and its association with respondents' self-reported gender. The measure, originally used in the US by Beilock et al. (2010), required respondents to draw two…
Descriptors: Children, Sex, Childrens Attitudes, Gender Differences
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment