Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 18 |
Since 2006 (last 20 years) | 124 |
Descriptor
Source
Author
Bagby, R. Michael | 2 |
Canivez, Gary L. | 2 |
Hopwood, Christopher J. | 2 |
Konold, Timothy R. | 2 |
Mislevy, Robert J. | 2 |
Sireci, Stephen G. | 2 |
Zimmer, Ron | 2 |
Abbott, Robert D. | 1 |
Adams, Thomas | 1 |
Ali Panahi | 1 |
Anderson, Stephen A. | 1 |
More ▼ |
Publication Type
Journal Articles | 168 |
Reports - Evaluative | 71 |
Reports - Research | 67 |
Information Analyses | 17 |
Opinion Papers | 14 |
Reports - Descriptive | 11 |
Reports - General | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Practitioners | 2 |
Administrators | 1 |
Researchers | 1 |
Location
United Kingdom | 5 |
Australia | 4 |
Belgium | 2 |
Brazil | 1 |
California | 1 |
Canada | 1 |
China | 1 |
Illinois | 1 |
Indiana | 1 |
Iowa | 1 |
Italy | 1 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Wan Fazwani Wan Mat; Lim Hooi Lian – Journal of Education and Learning (EduLearn), 2025
This bibliometric article examines the current state of publication in the field of classroom assessment, exploring the productivity and influence of countries, institutions, and authors. A search query of on the Scopus database using the term "classroom assessment" or "classroom-based assessment" or "assessment for…
Descriptors: Alternative Assessment, Student Evaluation, Bibliometrics, Formative Evaluation
Kathleen D. Dyer; Dermot Donnelly-Hermosillo – Research in Higher Education, 2024
This study aimed to demonstrate how one university worked to overcome some of the measurement problems associated with legacy student rating instruments through the creation and investigation of a new student rating instrument based on the most current scholarship on teaching and learning. Measurement problems with legacy instruments include…
Descriptors: Case Studies, Universities, Teacher Student Relationship, Student Evaluation of Teacher Performance
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Thippayacharoen, Thanakrit; Hoofd, Chonlatee; Pala, Napat; Sameephet, Banchakarn; Satthamnuwong, Bhirawit – LEARN Journal: Language Education and Acquisition Research Network, 2023
Research on English Medium Instruction (EMI) is rapidly increasing and well-documented worldwide; however, recent studies in EMI have given less emphasis on assessments in EMI classrooms. Indeed, assessment plays a significant role in informing teaching and learning competencies, but what to assess and how to assess are questions which have been…
Descriptors: Language of Instruction, English (Second Language), Second Language Learning, Second Language Instruction
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50-years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Fives, Allyn; Canavan, John; Dolan, Pat – European Early Childhood Education Research Journal, 2017
There is significant controversy over what counts as evidence in the evaluation of social interventions. It is increasingly common to use methodological criteria to rank evidence types in a hierarchy, with Randomised Controlled Trials (RCTs) at or near the highest level. Because of numerous challenges to a hierarchical approach, this article…
Descriptors: Evaluation Methods, Evaluation Research, Randomized Controlled Trials, Ethics
Bergsmann, Evelyn; Klug, Julia; Burger, Christoph; Först, Nora; Spiel, Christiane – Assessment & Evaluation in Higher Education, 2018
There is a lively discussion on how to evaluate competence-based higher education in both evaluation and competence research. The instruments used are often limited to course evaluation or specific competences, taking a rather narrow perspective. Furthermore, the instruments often comprise predetermined competences that cannot be adapted to higher…
Descriptors: Questionnaires, Minimum Competency Testing, Screening Tests, Higher Education
Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020
This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…
Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles
Robbins, Joy; Firth, Amanda; Evans, Maria – Practitioner Research in Higher Education, 2018
Work based assessment (WBA) is a common but contentious practice increasingly used to grade university students on professional degrees. A key issue in WBA is the potentially low assessment literacy of the assessors, which can lead to a host of unintended results, including grade inflation. We identified grade inflation in the WBA of the clinical…
Descriptors: Grade Inflation, Weighted Scores, Evaluation Methods, Evaluation Research
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Fendler, Lynn – Ethics and Education, 2016
In educational research that calls itself empirical, the relationship between validity and reliability is that of trade-off: the stronger the bases for validity, the weaker the bases for reliability (and vice versa). Validity and reliability are widely regarded as basic criteria for evaluating research; however, there are ethical implications of…
Descriptors: Educational Research, Ethics, Test Validity, Test Reliability
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
Zimmer, Ron; Engberg, John – Journal of School Choice, 2016
School choice programs continue to be controversial, spurring a number of researchers into evaluating them. When possible, researchers evaluate the effect of attending a school of choice using randomized designs to eliminate possible selection bias. Randomized designs are often thought of as the gold standard for research, but many circumstances…
Descriptors: Inferences, School Choice, Educational Vouchers, Charter Schools