Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 69 |
Descriptor
Evaluation Methods | 92 |
Evaluation Research | 92 |
Test Validity | 92 |
Test Reliability | 45 |
Measurement Techniques | 19 |
Evaluation Problems | 18 |
Student Evaluation | 18 |
Test Construction | 18 |
Psychometrics | 17 |
Educational Assessment | 16 |
Foreign Countries | 15 |
More ▼ |
Source
Author
Zimmer, Ron | 2 |
Abbott, Robert D. | 1 |
Alexiou, Jon J. | 1 |
Ali Panahi | 1 |
Anderson, Andrew | 1 |
Archwamety, Teara | 1 |
Arthur, Michael W. | 1 |
Audin, Kerry | 1 |
Baartman, Liesbeth K. J. | 1 |
Bagby, R. Michael | 1 |
Baker, Eva L. | 1 |
More ▼ |
Publication Type
Education Level
Location
Australia | 4 |
United Kingdom | 4 |
California | 2 |
Canada | 1 |
Europe | 1 |
Florida | 1 |
Japan | 1 |
Kenya | 1 |
Massachusetts | 1 |
Minnesota | 1 |
New Mexico | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Wan Fazwani Wan Mat; Lim Hooi Lian – Journal of Education and Learning (EduLearn), 2025
This bibliometric article examines the current state of publication in the field of classroom assessment, exploring the productivity and influence of countries, institutions, and authors. A search query of on the Scopus database using the term "classroom assessment" or "classroom-based assessment" or "assessment for…
Descriptors: Alternative Assessment, Student Evaluation, Bibliometrics, Formative Evaluation
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Developing a High Performance Digital Education Ecosystem: Institutional Self-Assessment Instruments
Volungeviciene, Airina; Brown, Mark; Greenspon, Rasa; Gaebel, Michael; Morrisroe, Alison – European University Association, 2021
Digitally enhanced learning and teaching is widely used across the European Higher Education Area, with general acceptance growing over the years and institutions widely acknowledging the benefits it brings to the student experience. The strategic focus being placed on digitally enhanced learning and teaching has increased, undoubtedly accelerated…
Descriptors: Educational Technology, Technology Uses in Education, Program Evaluation, Self Evaluation (Groups)
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50-years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Phelps, Richard P. – Pioneer Institute for Public Policy Research, 2016
The Thomas B. Fordham Institute has released a report, "Evaluating the Content and Quality of Next Generation Assessments," ostensibly an evaluative comparison of four testing programs, the Common Core derived SBAC and PARCC, ACT's Aspire, and the Commonwealth of Massachusetts' MCAS. Of course, anyone familiar with Fordham's past work…
Descriptors: Evaluation Methods, Tests, Evaluation Research, Standardized Tests
Fives, Allyn; Canavan, John; Dolan, Pat – European Early Childhood Education Research Journal, 2017
There is significant controversy over what counts as evidence in the evaluation of social interventions. It is increasingly common to use methodological criteria to rank evidence types in a hierarchy, with Randomised Controlled Trials (RCTs) at or near the highest level. Because of numerous challenges to a hierarchical approach, this article…
Descriptors: Evaluation Methods, Evaluation Research, Randomized Controlled Trials, Ethics
Robbins, Joy; Firth, Amanda; Evans, Maria – Practitioner Research in Higher Education, 2018
Work based assessment (WBA) is a common but contentious practice increasingly used to grade university students on professional degrees. A key issue in WBA is the potentially low assessment literacy of the assessors, which can lead to a host of unintended results, including grade inflation. We identified grade inflation in the WBA of the clinical…
Descriptors: Grade Inflation, Weighted Scores, Evaluation Methods, Evaluation Research
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
Zimmer, Ron; Engberg, John – Journal of School Choice, 2016
School choice programs continue to be controversial, spurring a number of researchers into evaluating them. When possible, researchers evaluate the effect of attending a school of choice using randomized designs to eliminate possible selection bias. Randomized designs are often thought of as the gold standard for research, but many circumstances…
Descriptors: Inferences, School Choice, Educational Vouchers, Charter Schools
Grissom, Jason A., Ed.; Youngs, Peter, Ed. – Teachers College Press, 2015
This is the first book to gather and address what we have learned about the impacts and challenges of data-intensive teacher evaluation systems--a defining characteristic of the current education policy landscape. Expert researchers and practitioners speak to what we know (and what remains to be known) about evaluation measures themselves, the…
Descriptors: Teacher Evaluation, Evaluation Methods, Evaluation Research, Test Validity
Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015
The main problem of the educational evaluation validity is that it just copies the conceptual framework system of validity from educational measurement to its own conceptual system. The validity conceptual system that fits the need of theory and practice of educational evaluation has not been established yet. According to the inherent attributive…
Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship
Fives, Helenrose; Barnes, Nicole; Dacey, Charity; Gillis, Anna – Teacher Educator, 2016
We conducted a content analysis of 27 assessment textbooks to determine how assessment planning was framed in texts for preservice teachers. We identified eight assessment planning themes: alignment, assessment purpose and types, reliability and validity, writing goals and objectives, planning specific assessments, unpacking, overall assessment…
Descriptors: Student Evaluation, Lesson Plans, Knowledge Base for Teaching, Textbook Evaluation
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Farid, Alem – Electronic Journal of e-Learning, 2014
Although there are tools to assess student's readiness in an "online learning context," little is known about the "psychometric" properties of the tools used or not. A systematic review of 5107 published and unpublished papers identified in a literature search on student online readiness assessment tools between 1990 and…
Descriptors: Online Courses, Electronic Learning, Learning Readiness, Psychometrics