Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 17 |
Since 2006 (last 20 years) | 45 |
Descriptor
Quality Control | 79 |
Test Construction | 79 |
Test Validity | 28 |
Foreign Countries | 24 |
Evaluation Methods | 22 |
Test Reliability | 22 |
Test Items | 20 |
Testing | 16 |
Higher Education | 14 |
Scoring | 14 |
Computer Assisted Testing | 12 |
More ▼ |
Source
Author
Dorans, Neil J. | 2 |
Faurer, Judson C. | 2 |
Liu, Jinghua | 2 |
Luecht, Richard M. | 2 |
Martin, Michael O., Ed. | 2 |
Abiagam, Bridget | 1 |
Ahmed, Ayesha | 1 |
Anderson, Lorin W. | 1 |
Anu Kajamaa | 1 |
Arendasy, Martin | 1 |
Arslan, Sezen | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 14 |
Postsecondary Education | 11 |
Elementary Education | 7 |
Grade 4 | 7 |
Elementary Secondary Education | 6 |
Early Childhood Education | 5 |
Intermediate Grades | 5 |
Secondary Education | 5 |
Grade 3 | 4 |
Grade 5 | 4 |
Grade 6 | 4 |
More ▼ |
Audience
Researchers | 2 |
Practitioners | 1 |
Teachers | 1 |
Location
United Kingdom | 4 |
Italy | 3 |
Japan | 3 |
Russia | 3 |
Australia | 2 |
Canada | 2 |
Chile | 2 |
China | 2 |
Norway | 2 |
Spain | 2 |
United Kingdom (England) | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Trends in International… | 2 |
International Association for… | 1 |
National Longitudinal Study… | 1 |
Program for International… | 1 |
Progress in International… | 1 |
What Works Clearinghouse Rating
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Filio Constantinou – Practical Assessment, Research & Evaluation, 2024
This study investigated task contextualization as a means of assessing students' ability to apply their subject knowledge to new situations. Through analyzing 527 Functional Mathematics examination questions that claim to assess students' application skills, it developed a set of principles for embedding questions in context: deep…
Descriptors: Foreign Countries, Secondary School Mathematics, Secondary School Students, Context Effect
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
OECD Publishing, 2025
The OECD's Survey on Social and Emotional Skills (SSES) 2023 represents the largest global initiative to gather comparable data on the development of social and emotional skills among 10- and 15-year-old students. In the 2023 cycle of SSES, 16 sites implemented an assessment of students' social and emotional skills and collected contextual…
Descriptors: Social Development, Emotional Development, Interpersonal Competence, Surveys
Zubanova, Svetlana; Bodrova, Tatyana; Kruchkovich, Sofia – Journal of Educational Psychology - Propositos y Representaciones, 2020
Testing is a modern high-quality method of knowledge check. Informatization which began in the late XX-early XXI century contributed to the growth of various tests. However, the inclusion of tests in the educational process is at a slower pace. This is largely due to the lack of a methodological basis for test development. It is proved that the…
Descriptors: Testing, Educational Quality, Educational Indicators, Test Construction
Fan, Jason; Jin, Yan – Asia Pacific Journal of Education, 2020
Despite the increasing discussions on quality and professionalism in the field of language assessment, limited empirical research is currently available on whether language testing practice conforms to the best practice model prescribed in professional standards. Situated in the context of Chinese higher education, this study examined how English…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Student Placement
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
Sridharan, Bhavani; Leitch, Shona; Watty, Kim – Quality in Higher Education, 2015
This conceptual framework proposes a multi-level, multi-dimensional course alignment model to implement a contextualised constructive alignment of rubric design that authentically evidences and assesses learning outcomes. By embedding quality control mechanisms at each level for each dimension, this model facilitates the development of an aligned…
Descriptors: Alignment (Education), Quality Control, Scoring Rubrics, Higher Education
Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013
We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…
Descriptors: Tests, Test Construction, Test Format, Change
Kavakli, Nurdan; Arslan, Sezen – Online Submission, 2017
Within the scope of educational testing and assessment, setting standards and creating guidelines as a code of practice provide more prolific and sustainable outcomes. In this sense, internationally accepted and regionally accredited principles are suggested for standardization in language testing and assessment practices. Herein, ILTA guidelines…
Descriptors: Foreign Countries, Second Language Instruction, English (Second Language), Language Tests
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Luecht, Richard M. – Journal of Applied Testing Technology, 2013
Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…
Descriptors: Engineering, Test Construction, Test Items, Models
Artamonova, Ekaterina V.; Aytuganova, Jhanna I.; Grigoryeva, Elena V. – International Journal of Environmental and Science Education, 2016
The relevance of the investigated problem is caused by the objective necessity of construction of the Russian examination practice, taking into account the leading trends in the education system development, where student-activity approach advocates the dominant, and insufficient development of this issue in both the theoretical and methodical…
Descriptors: Foreign Countries, Vocational Education, Tests, Testing Problems
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Partnership for Assessment of Readiness for College and Careers, 2019
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium designed to create next-generation assessments that, compared to traditional K-12 assessments, more accurately measure student progress toward college and career readiness. The PARCC assessments are aligned to the Common Core State Standards…
Descriptors: College Readiness, Career Readiness, Common Core State Standards, Language Arts