Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 147 |
Descriptor
Evaluation Research | 204 |
Test Validity | 204 |
Evaluation Methods | 92 |
Test Reliability | 86 |
Psychometrics | 49 |
Test Construction | 41 |
Measures (Individuals) | 37 |
Foreign Countries | 29 |
Student Evaluation | 28 |
Evaluation Problems | 24 |
Measurement Techniques | 24 |
Author
Bagby, R. Michael | 2 |
Canivez, Gary L. | 2 |
Hopwood, Christopher J. | 2 |
Konold, Timothy R. | 2 |
Mislevy, Robert J. | 2 |
Sireci, Stephen G. | 2 |
Young, John W. | 2 |
Zimmer, Ron | 2 |
Abbott, Robert D. | 1 |
Adams, Thomas | 1 |
Alexiou, Jon J. | 1 |
Location
Australia | 6 |
United Kingdom | 5 |
California | 4 |
Belgium | 2 |
Indiana | 2 |
New Mexico | 2 |
Brazil | 1 |
Canada | 1 |
China | 1 |
Europe | 1 |
Florida | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 2 |
Safe and Drug Free Schools… | 1 |
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Wan Fazwani Wan Mat; Lim Hooi Lian – Journal of Education and Learning (EduLearn), 2025
This bibliometric article examines the current state of publication in the field of classroom assessment, exploring the productivity and influence of countries, institutions, and authors. A search query on the Scopus database using the term "classroom assessment" or "classroom-based assessment" or "assessment for…
Descriptors: Alternative Assessment, Student Evaluation, Bibliometrics, Formative Evaluation
Kathleen D. Dyer; Dermot Donnelly-Hermosillo – Research in Higher Education, 2024
This study aimed to demonstrate how one university worked to overcome some of the measurement problems associated with legacy student rating instruments through the creation and investigation of a new student rating instrument based on the most current scholarship on teaching and learning. Measurement problems with legacy instruments include…
Descriptors: Case Studies, Universities, Teacher Student Relationship, Student Evaluation of Teacher Performance
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Thippayacharoen, Thanakrit; Hoofd, Chonlatee; Pala, Napat; Sameephet, Banchakarn; Satthamnuwong, Bhirawit – LEARN Journal: Language Education and Acquisition Research Network, 2023
Research on English Medium Instruction (EMI) is rapidly increasing and well-documented worldwide; however, recent studies in EMI have placed less emphasis on assessments in EMI classrooms. Indeed, assessment plays a significant role in informing teaching and learning competencies, but what to assess and how to assess are questions which have been…
Descriptors: Language of Instruction, English (Second Language), Second Language Learning, Second Language Instruction
Developing a High Performance Digital Education Ecosystem: Institutional Self-Assessment Instruments
Volungeviciene, Airina; Brown, Mark; Greenspon, Rasa; Gaebel, Michael; Morrisroe, Alison – European University Association, 2021
Digitally enhanced learning and teaching is widely used across the European Higher Education Area, with general acceptance growing over the years and institutions widely acknowledging the benefits it brings to the student experience. The strategic focus being placed on digitally enhanced learning and teaching has increased, undoubtedly accelerated…
Descriptors: Educational Technology, Technology Uses in Education, Program Evaluation, Self Evaluation (Groups)
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50 years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Phelps, Richard P. – Pioneer Institute for Public Policy Research, 2016
The Thomas B. Fordham Institute has released a report, "Evaluating the Content and Quality of Next Generation Assessments," ostensibly an evaluative comparison of four testing programs, the Common Core derived SBAC and PARCC, ACT's Aspire, and the Commonwealth of Massachusetts' MCAS. Of course, anyone familiar with Fordham's past work…
Descriptors: Evaluation Methods, Tests, Evaluation Research, Standardized Tests
Fives, Allyn; Canavan, John; Dolan, Pat – European Early Childhood Education Research Journal, 2017
There is significant controversy over what counts as evidence in the evaluation of social interventions. It is increasingly common to use methodological criteria to rank evidence types in a hierarchy, with Randomised Controlled Trials (RCTs) at or near the highest level. Because of numerous challenges to a hierarchical approach, this article…
Descriptors: Evaluation Methods, Evaluation Research, Randomized Controlled Trials, Ethics
Bergsmann, Evelyn; Klug, Julia; Burger, Christoph; Först, Nora; Spiel, Christiane – Assessment & Evaluation in Higher Education, 2018
There is a lively discussion on how to evaluate competence-based higher education in both evaluation and competence research. The instruments used are often limited to course evaluation or specific competences, taking a rather narrow perspective. Furthermore, the instruments often comprise predetermined competences that cannot be adapted to higher…
Descriptors: Questionnaires, Minimum Competency Testing, Screening Tests, Higher Education
Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020
This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…
Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles
Robbins, Joy; Firth, Amanda; Evans, Maria – Practitioner Research in Higher Education, 2018
Work based assessment (WBA) is a common but contentious practice increasingly used to grade university students on professional degrees. A key issue in WBA is the potentially low assessment literacy of the assessors, which can lead to a host of unintended results, including grade inflation. We identified grade inflation in the WBA of the clinical…
Descriptors: Grade Inflation, Weighted Scores, Evaluation Methods, Evaluation Research
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation