Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 17 |
Descriptor
Quality Control | 17 |
Test Construction | 17 |
Test Validity | 13 |
Test Reliability | 11 |
Testing | 10 |
Foreign Countries | 9 |
Scoring | 9 |
Automation | 7 |
Scaling | 7 |
Student Characteristics | 7 |
Test Bias | 7 |
More ▼ |
Source
Author
Arslan, Sezen | 1 |
Artamonova, Ekaterina V. | 1 |
Aytuganova, Jhanna I. | 1 |
Bodrova, Tatyana | 1 |
Fan, Jason | 1 |
Filio Constantinou | 1 |
Firdissa J. Aga | 1 |
Grigoryeva, Elena V. | 1 |
Guher Gorgun | 1 |
Jin, Yan | 1 |
Kavakli, Nurdan | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 7 |
Numerical/Quantitative Data | 6 |
Reports - Descriptive | 4 |
Reports - Evaluative | 4 |
Books | 1 |
Collected Works - General | 1 |
Guides - General | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 5 |
Grade 4 | 5 |
Intermediate Grades | 5 |
Secondary Education | 5 |
Early Childhood Education | 4 |
Grade 3 | 4 |
Grade 5 | 4 |
Grade 6 | 4 |
Grade 7 | 4 |
Grade 9 | 4 |
High Schools | 4 |
More ▼ |
Audience
Location
China | 2 |
Russia | 2 |
Brazil | 1 |
Bulgaria | 1 |
Chile | 1 |
Colombia (Bogota) | 1 |
Ethiopia | 1 |
Finland (Helsinki) | 1 |
India | 1 |
Indonesia | 1 |
Italy | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
International Association for… | 1 |
Progress in International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Filio Constantinou – Practical Assessment, Research & Evaluation, 2024
This study investigated task contextualization as a means of assessing students' ability to apply their subject knowledge to new situations. Through analyzing 527 Functional Mathematics examination questions that claim to assess students' application skills, it developed a set of principles for embedding questions in context: deep…
Descriptors: Foreign Countries, Secondary School Mathematics, Secondary School Students, Context Effect
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
OECD Publishing, 2025
The OECD's Survey on Social and Emotional Skills (SSES) 2023 represents the largest global initiative to gather comparable data on the development of social and emotional skills among 10- and 15-year-old students. In the 2023 cycle of SSES, 16 sites implemented an assessment of students' social and emotional skills and collected contextual…
Descriptors: Social Development, Emotional Development, Interpersonal Competence, Surveys
Zubanova, Svetlana; Bodrova, Tatyana; Kruchkovich, Sofia – Journal of Educational Psychology - Propositos y Representaciones, 2020
Testing is a modern high-quality method of knowledge check. Informatization which began in the late XX-early XXI century contributed to the growth of various tests. However, the inclusion of tests in the educational process is at a slower pace. This is largely due to the lack of a methodological basis for test development. It is proved that the…
Descriptors: Testing, Educational Quality, Educational Indicators, Test Construction
Fan, Jason; Jin, Yan – Asia Pacific Journal of Education, 2020
Despite the increasing discussions on quality and professionalism in the field of language assessment, limited empirical research is currently available on whether language testing practice conforms to the best practice model prescribed in professional standards. Situated in the context of Chinese higher education, this study examined how English…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Student Placement
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
Kavakli, Nurdan; Arslan, Sezen – Online Submission, 2017
Within the scope of educational testing and assessment, setting standards and creating guidelines as a code of practice provide more prolific and sustainable outcomes. In this sense, internationally accepted and regionally accredited principles are suggested for standardization in language testing and assessment practices. Herein, ILTA guidelines…
Descriptors: Foreign Countries, Second Language Instruction, English (Second Language), Language Tests
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Artamonova, Ekaterina V.; Aytuganova, Jhanna I.; Grigoryeva, Elena V. – International Journal of Environmental and Science Education, 2016
The relevance of the investigated problem is caused by the objective necessity of construction of the Russian examination practice, taking into account the leading trends in the education system development, where student-activity approach advocates the dominant, and insufficient development of this issue in both the theoretical and methodical…
Descriptors: Foreign Countries, Vocational Education, Tests, Testing Problems
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Partnership for Assessment of Readiness for College and Careers, 2019
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium designed to create next-generation assessments that, compared to traditional K-12 assessments, more accurately measure student progress toward college and career readiness. The PARCC assessments are aligned to the Common Core State Standards…
Descriptors: College Readiness, Career Readiness, Common Core State Standards, Language Arts
Partnership for Assessment of Readiness for College and Careers, 2018
The purpose of this technical report is to describe the third operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) assessments in the 2016-2017 academic year. PARCC is a state-led consortium creating next-generation assessments that, compared to traditional K-12 assessments, more accurately…
Descriptors: College Readiness, Career Readiness, Common Core State Standards, Language Arts
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
Previous Page | Next Page »
Pages: 1 | 2