Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 24 |
Descriptor
Source
Author
Ahmed, Ayesha | 1 |
Baker, Eva L. | 1 |
Barelds, Anna | 1 |
Bejar, Isaac I. | 1 |
Blieck, Yves | 1 |
Craig, Jaime | 1 |
Crumbley, D. Larry | 1 |
DePryck, Koen | 1 |
Diamond, Karen | 1 |
Druskyte, Ruta | 1 |
Dunbar, Stephen B. | 1 |
More ▼ |
Publication Type
Journal Articles | 35 |
Reports - Research | 19 |
Reports - Evaluative | 9 |
Reports - Descriptive | 7 |
Translations | 2 |
Guides - General | 1 |
Information Analyses | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 10 |
Postsecondary Education | 7 |
Adult Education | 2 |
Early Childhood Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Kindergarten | 1 |
Preschool Education | 1 |
Secondary Education | 1 |
Audience
Location
Australia | 1 |
Austria | 1 |
China | 1 |
China (Beijing) | 1 |
Ethiopia | 1 |
Lithuania | 1 |
Netherlands | 1 |
Norway | 1 |
Oregon | 1 |
Slovenia | 1 |
Thailand | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Infant Toddler Environment… | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Filio Constantinou – Practical Assessment, Research & Evaluation, 2024
This study investigated task contextualization as a means of assessing students' ability to apply their subject knowledge to new situations. Through analyzing 527 Functional Mathematics examination questions that claim to assess students' application skills, it developed a set of principles for embedding questions in context: deep…
Descriptors: Foreign Countries, Secondary School Mathematics, Secondary School Students, Context Effect
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Blieck, Yves; Kauwenberghs, Kurt; Zhu, Chang; Struyven, Katrien; Pynoo, Bram; DePryck, Koen – Journal of Computer Assisted Learning, 2019
Online and blended learning (OBL) is valued, but it also offers challenges. Literature indicates that OBL can enhance access to education and increase flexibility for students. However, the reported dropout rates indicate that student participation in OBL programmes is a concern. Scientifically valid knowledge about how factors that help students…
Descriptors: Online Courses, Blended Learning, Educational Technology, Technology Uses in Education
Fan, Jason; Jin, Yan – Asia Pacific Journal of Education, 2020
Despite the increasing discussions on quality and professionalism in the field of language assessment, limited empirical research is currently available on whether language testing practice conforms to the best practice model prescribed in professional standards. Situated in the context of Chinese higher education, this study examined how English…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Student Placement
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
Luecht, Richard M. – Journal of Applied Testing Technology, 2013
Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…
Descriptors: Engineering, Test Construction, Test Items, Models
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Wei, Youhua; Low, Albert – ETS Research Report Series, 2017
In most large-scale programs of tests that aid in making high-stakes decisions, such as the "TOEIC"® family of products and service, it is not unusual for a significant portion of test takers to retake the test at multiple times.The study reported here used multilevel growth modeling to explore the score change patterns of nearly 20,000…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Norris, Deborah J.; Guss, Shannon – Journal of Research in Childhood Education, 2016
Quality Rating Improvement Systems (QRIS) frequently include the Infant-Toddler Environment Rating Scale-Revised (ITERS-R) as part of rating and improving child care quality. However, studies utilizing the ITERS-R consistently report low quality, especially for basic caregiving items. This research examined whether the low scores reflected the…
Descriptors: Child Care, Educational Quality, Measurement Techniques, Test Reliability
Goldwater, Paul M.; Fogarty, Timothy J. – Behaviour & Information Technology, 2012
As accounting education transitions to more distance-learning formats, the integrity of student evaluation continues to serve as an obstacle to adoption. Greater technological possibilities will be opposed if faculty members believe that testing is compromised. This article investigates whether students taking exams remotely (and under no…
Descriptors: Student Evaluation, Accounting, Testing, Distance Education
McVilly, K.; Webber, L.; Paris, M.; Sharp, G. – Journal of Intellectual Disability Research, 2013
Background: Having an objective means of evaluating the quality of behaviour support plans (BSPs) could assist service providers and statutory authorities to monitor and improve the quality of support provided to people with intellectual disability (ID) who exhibit challenging behaviour. The Behaviour Support Plan Quality Evaluation Guide II…
Descriptors: Foreign Countries, Behavior Problems, Behavior Modification, Adults
Pižorn, Karmen; Moe, Eli – Center for Educational Policy Studies Journal, 2012
This article is a validation study of two national large-scale tests that measure the language proficiency of 11/12 year-old English learners in Norway and Slovenia. Following the example of Alderson and Banerjee (2008), the authors of the article have employed the EALTA guidelines for good practice to validate the tests, and to formulate major…
Descriptors: English (Second Language), Second Language Learning, Guidelines, Test Construction
Barelds, Anna; van de Goor, Ien; van Heck, Guus; Schols, Jos – Journal of Applied Research in Intellectual Disabilities, 2011
Background: Care and service trajectories for people with intellectual disabilities are routes within the health care delivery system that consist of all the steps that people with intellectual disability and their families have to take in order to realize needed care and services. In contrast to the growing body of system-orientated knowledge…
Descriptors: Disabilities, Mental Retardation, Focus Groups, Interviews
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance