Zhaoyu Yang; Ping Wang – Language Testing in Asia, 2025
This study offers a bibliometric overview of English language assessment research from 1992 to 2024. It aims to uncover the current state, research trends, and future directions of the field. A total of 927 articles published in Web of Science (WoS) were analyzed using the VOSviewer bibliometric software tool. Based on the sample of 927…
Descriptors: Bibliometrics, English (Second Language), Second Language Learning, Periodicals
Johnson, Martin; Shaw, Stuart – Journal of Further and Higher Education, 2019
With the introduction of a new initiative in a teaching and learning environment there is an ethical responsibility to consider whether the impact of the introduction has met its intended goals, and whether it has harmed those who are influenced by it. Technology and infrastructure developments have encouraged a continued growth in the development…
Descriptors: Computer Assisted Testing, Testing Problems, Evaluation Research, High Stakes Tests
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50 years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Banks, Kathleen – Practical Assessment, Research & Evaluation, 2015
This article introduces practitioners and researchers to the topic of missing data in the context of differential item functioning (DIF), reviews the current literature on the issue, discusses implications of the review, and offers suggestions for future research. A total of nine studies were reviewed. All of these studies determined what effect…
Descriptors: Test Bias, Data, Literature Reviews, Evaluation Research
Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020
This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…
Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles
Cheng, Liying; Sun, Youyi; Ma, Jia – Language Teaching, 2015
No area of language assessment research in the past 20 years has received a greater increase in attention than washback research. Beginning with the seminal work of Alderson & Wall (Alderson & Wall 1993; Wall & Alderson 1993), an evolving body of empirical washback studies has been conducted worldwide, especially in countries where…
Descriptors: Guidelines, Testing Problems, Second Language Learning, Language Tests
Green, Anthony – International Journal of English Studies, 2013
This paper reviews the progress made in washback studies over the quarter century since Hughes (1989) placed it at the centre of his textbook "Testing for Language Teachers." Research into washback and the development of models of washback are described and an agenda is suggested for test developers wishing to build washback into…
Descriptors: Language Tests, Testing, Testing Problems, Models
Sturgis, Chris – International Association for K-12 Online Learning, 2014
This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students to excel in academics and to gain the skills that are needed to be successful in college, the community, and the workplace. In order to make the…
Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2011
The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method against the oral examination (OE) method. MCQs are widely used and their importance seems likely to grow, due to their inherent suitability for electronic assessment. However, MCQs are influenced by the tendency of examinees to guess…
Descriptors: Grades (Scholastic), Scoring, Multiple Choice Tests, Test Format
de La Torre, Jimmy; Karelitz, Tzur M. – Journal of Educational Measurement, 2009
Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM-based data with a linear attribute structure. The study utilizes a procedure…
Descriptors: Simulation, Item Response Theory, Psychometrics, Evaluation Methods
Harmon, Oskar R.; Lambrinos, James; Buffolino, Judy – Online Journal of Distance Learning Administration, 2010
Many consider online courses to be an inferior alternative to traditional face-to-face (f2f) courses because exam cheating is thought to occur more often in online courses. This study examines how the assessment design in online courses contributes to this perception. Following a literature review, the assessment design in a sample of online…
Descriptors: Electronic Learning, Student Attitudes, Cheating, Online Courses
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size