Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 50 |
Descriptor
Benchmarking | 53 |
Validity | 53 |
Reliability | 16 |
Evaluation Methods | 15 |
Foreign Countries | 14 |
Models | 10 |
Student Evaluation | 10 |
Comparative Analysis | 8 |
Educational Policy | 8 |
Psychometrics | 8 |
Academic Achievement | 7 |
More ▼ |
Source
Author
Camara, Wayne | 2 |
Dietel, Ronald | 2 |
Herman, Joan L. | 2 |
Lastrapes, Renée E. | 2 |
Mooney, Paul | 2 |
Osmundson, Ellen | 2 |
Zumbo, Bruno D. | 2 |
Abolfazl Asudeh | 1 |
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Arnab, Sylvester | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 1 |
Location
Canada | 3 |
Louisiana | 2 |
Netherlands | 2 |
New Jersey | 2 |
United Kingdom | 2 |
Afghanistan | 1 |
Arkansas | 1 |
Bhutan | 1 |
Burkina Faso | 1 |
Chile | 1 |
Czech Republic | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Hadis Anahideh; Nazanin Nezami; Abolfazl Asudeh – Grantee Submission, 2025
It is of critical importance to be aware of the historical discrimination embedded in the data and to consider a fairness measure to reduce bias throughout the predictive modeling pipeline. Given various notions of fairness defined in the literature, investigating the correlation and interaction among metrics is vital for addressing unfairness.…
Descriptors: Correlation, Measurement Techniques, Guidelines, Semantics
Atilla Ergin; Yelkin Diker Coskun – International Journal on Social and Education Sciences, 2024
This study aims to develop a scale to measure the design thinking process and to evaluate the reliability and validity of this scale. It fills this gap by introducing a 36-item scale specifically designed to measure design thinking abilities across the five key stages of the design thinking process: empathize, define, ideate, prototype, and test,…
Descriptors: Design, Thinking Skills, Likert Scales, Empathy
Olvera Astivia, Oscar L.; Zumbo, Bruno D. – Measurement: Interdisciplinary Research and Perspectives, 2019
Methods to generate random correlation matrices have been proposed in the literature, but very few instances exist where these correlation matrices are structured or where the statistical properties of the algorithms are known. By relying on the tetrad relation discovered by Spearman and the properties of the beta distribution, an algorithm is…
Descriptors: Correlation, Psychometrics, Benchmarking, Evaluation Methods
Gerald Tindal; Joseph F. T. Nese – Behavioral Research and Teaching, 2024
We present two types of validity evidence to support inferences and decisions about use of easyCBMs in relation to state testing programs. The first type involves the use of Benchmarks in reading to use in making predictions of performance on the Smarter Balanced (SB) test. These predictions can be made both well in advance (several months) or…
Descriptors: Classification, Accuracy, Validity, Criteria
Bournot-Trites, Monique; Friesen, Lucas; Ruest, Carl; Zumbo, Bruno D. – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2020
To ensure quality of education, a language framework should be the foundation on which second language curricula are developed. In 2010, the Council of Ministers of Education, Canada (CMEC), as suggested by Vandergrift (2006a, 2006b), recommended the use of the Common European Framework of Reference (CEFR) in the K-12 Canadian school context and…
Descriptors: Guidelines, Second Language Learning, Second Language Instruction, Rating Scales
Tang, Hui-Wen Vivian; Lee, Lynne – SAGE Open, 2021
The study was designed as a linked two-phase investigation, aiming to psychometrically develop and validate a Chinese version of the "Organizational Climate Diagnostic Instrument for Junior High Schools" (OCDI-JH) for use in Taiwan. Through extensive literature reviews, the complex phenomena of school climate were decomposed into a…
Descriptors: Organizational Climate, Validity, Measures (Individuals), Psychometrics
Carlisle, Sylvia – PRIMUS, 2020
Specifications grading is a version of mastery grading distinguished by giving students clear specifications that their work must meet, and grading most things pass/fail based on those specifications. Mastery grading systems can get quite elaborate, with hierarchies of objectives and various systems for rewriting and retesting. In this article I…
Descriptors: Grading, Standards, Mathematics Instruction, Calculus
Mooney, Paul; Lastrapes, Renée E. – Assessment for Effective Intervention, 2019
The purpose of the research was to replicate commonality analysis for two measures: critical content monitoring and sentence verification technique. Participants were 967 fourth-, fifth-, and sixth-grade students across seven public primary schools in a southeastern U.S. district. The predictor variables were administered as benchmarks 3 times in…
Descriptors: Validity, Elementary School Students, Science Tests, Reading Comprehension
Ogut, Burhan; Bohrnstedt, George; Broer, Markus – American Institutes for Research, 2021
Ensuring that students are ready for college when they graduate from high school has important implications for students, educators, education policymakers, and other stakeholders. This study focuses on an examination of the relationship between the National Assessment of Educational Progress (NAEP) grade 12 mathematics assessment and college…
Descriptors: National Competency Tests, Longitudinal Studies, High School Students, Mathematics Tests
Choiriyah, Siti; Kumaidi; Kartowagiran, Badrun – Journal of Social Studies Education Research, 2018
The purpose of this study is to examine aspects of internal quality assurance to evaluate Indonesian Islamic universities, develop Delta Internal Quality Assurance (DIQA), and provide empirical evidence for using DIQA as a standard model of evaluation. It is a research and development (R&D) endeavor in the context of the input, process, and…
Descriptors: Educational Quality, Quality Assurance, Islam, Institutional Evaluation
Floyd, Natosha N. – ProQuest LLC, 2016
The purpose of this study was to examine the psychometric properties of the Michigan School Libraries for the 21st Century Measurement Benchmarks (SL21). The instrument consists of 19 items with three subscales: Building the 21st Century Learning Environment Subscale, Teaching for 21st Century Learning Subscale, and Leading the Way to 21st Century…
Descriptors: School Libraries, Benchmarking, Psychometrics, Reliability
Mooney, Paul; Lastrapes, Renée E. – Assessment for Effective Intervention, 2016
The amount of research evaluating the technical merits of general outcome measures of science and social studies achievement is growing. This study targeted criterion validity for critical content monitoring. Questions addressed the concurrent criterion validity of alternate presentation formats of critical content monitoring and the measure's…
Descriptors: Outcome Measures, Academic Discourse, Benchmarking, Social Studies
Vargas-Madriz, Luis Francisco; Nocente, Norma; Best-Bertwistle, Rebecca; Forgie, Sarah – Canadian Journal of Higher Education, 2019
Student Evaluations of Teaching (SET) have been the most consistently administered tool, and they are still extensively used in higher education institutions to assess teaching effectiveness. The purpose of this study was to explore how SET are used by administrators in the teaching evaluation process at a large, research-intensive Canadian…
Descriptors: Foreign Countries, Student Evaluation of Teacher Performance, Teacher Effectiveness, Administrator Attitudes
Harsch, Claudia; Kanistra, Voula Paraskevi – Language Assessment Quarterly, 2020
We report on a standard-setting project in which the Item-Descriptor-Matching Method (IDM) and a complementary benchmarking approach were employed to align a suite of English language proficiency exams to the "Common European Framework of Reference" (CEFR), with a particular focus on the integrated and independent writing exams. Judges'…
Descriptors: Standard Setting, Guidelines, Rating Scales, Definitions
Arnab, Sylvester; Clarke, Samantha – British Journal of Educational Technology, 2017
The application of game-based learning adds play into educational and instructional contexts. Even though there is a lack of standard methodologies or formulaic frameworks to better inform game-based intervention development, there exist scientific and empirical studies that can serve as benchmarks for establishing scientific validity in terms of…
Descriptors: Interdisciplinary Approach, Guidelines, Sex Education, Intervention