Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…
Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability
Mojtaba Elhami Athar; Randall T. Salekin; Mahdi Hassanabadi; Parnian Rezaei; Golnoush Fakhr; Elham Zamani – Child & Youth Care Forum, 2025
The Proposed Specifiers for Conduct Disorder (PSCD) assesses psychopathy components of grandiose-manipulative (GM), callous-unemotional (CU), daring-impulsive (DI), and conduct disorder (CD). Research on PSCD is still in its infancy, and further research is necessary to examine its psychometric properties. We investigated the correlations between…
Descriptors: Preadolescents, Adolescents, Psychopathology, Behavior Disorders
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques
Nicole D. Martin; Stephanie N. Baker; Madeline Haynes; Jayce R. Warner – Computer Science Education, 2024
Background and Context: As computer science (CS) education expands and the need for well-prepared CS teachers grows, understanding what motivates teachers to teach CS can help address challenges to recruiting, preparing, and retaining teachers. Objective: The goal of this work was to develop and validate a scale that measures teachers' motivation…
Descriptors: Computer Science Education, Teacher Motivation, Measurement Techniques, Construct Validity
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale, high-data-volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Sung, Jihyun – Education and Information Technologies, 2022
Computational thinking (CT) in young children has recently gained attention. This study verified the applicability of the Korean version of the Bebras cards and TACTIC-KIBO in measuring CT among young children in South Korea. A total of 450 children responded to the Bebras cards, TACTIC-KIBO, and Early Numeracy tasks that were used for the…
Descriptors: Foreign Countries, Computation, Thinking Skills, Young Children
Uzun, N. Bilge; Aktas, Mehtap; Akay, Cenk – Journal of Educational Technology, 2023
The challenges experienced in measurement and evaluation during the distance education process among student and instructor groups are discussed in the study. A qualitative meta-synthesis method is used in this research. Twenty studies were included in the meta-synthesis. The challenges experienced by the instructors are program utilization,…
Descriptors: Measurement Techniques, Evaluation Methods, Distance Education, Literature Reviews
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024
Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…
Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits
Konstantin Vinokic; Lukas Begrich; Mareike Kunter; Susanne Kuger – Frontline Learning Research, 2024
Thin slices ratings (i.e., ratings based on first impressions) have yielded intriguingly accurate results in various domains. Among others, researchers have applied the thin slices technique to assess instructional quality, showing that teacher-student interactions can be reliably inferred from just very short snippets of classroom instruction. The…
Descriptors: Teacher Effectiveness, Teacher Student Relationship, Foreign Countries, Classroom Observation Techniques
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Halfon, Ester; Biton, Yaniv – International Journal of Education in Mathematics, Science and Technology, 2022
As part of efforts to improve the quality of mathematics teaching and evaluation, we examined the focus of math teachers' considerations in evaluating students' achievements, as well as the links between these focuses, regarding differences between students and the validity and reliability of assessment methods and examinations. Based on the…
Descriptors: Mathematics Teachers, Mathematics Instruction, Teacher Attitudes, Student Evaluation
Margulieux, Lauren; Ketenci, Tuba Ayer; Decker, Adrienne – Computer Science Education, 2019
Background and context: The variables that researchers measure and how they measure them are central in any area of research, including computing education. Which research questions can be asked and how they are answered depends on measurement. Objective: To summarize the commonly used variables and measurements in computing education and to…
Descriptors: Measurement Techniques, Standards, Evaluation Methods, Computer Science Education