Showing 1 to 15 of 40 results
Peer reviewed
Direct link
Zesch, Torsten; Horbach, Andrea; Zehner, Fabian – Educational Measurement: Issues and Practice, 2023
In this article, we systematize the factors influencing performance and feasibility of automatic content scoring methods for short text responses. We argue that performance (i.e., how well an automatic system agrees with human judgments) mainly depends on the linguistic variance seen in the responses and that this variance is indirectly influenced…
Descriptors: Influences, Academic Achievement, Feasibility Studies, Automation
Peer reviewed
PDF on ERIC: Download full text
McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022
This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…
Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing
Peer reviewed
Direct link
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Peer reviewed
Direct link
Glazer, Nancy; Wolfe, Edward W. – Applied Measurement in Education, 2020
This introductory article describes how constructed response scoring is carried out, particularly the rater monitoring processes, and illustrates three potential designs for conducting rater monitoring in an operational scoring project. The introduction also presents a framework for interpreting research conducted by those who study the constructed…
Descriptors: Scoring, Test Format, Responses, Predictor Variables
Peer reviewed
Direct link
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Peer reviewed
Direct link
Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022
A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…
Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items
Peer reviewed
PDF on ERIC: Download full text
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Partnership for Assessment of Readiness for College and Careers, 2015
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a group of states working together to develop a modern assessment that replaces previous state standardized tests. It provides better information for teachers and parents to identify where a student needs help or is excelling, so they are able to enhance instruction to…
Descriptors: Literacy, Language Arts, Scoring Formulas, Scoring
Peer reviewed
PDF on ERIC: Download full text
Sharakhimov, Shoaziz; Nurmukhamedov, Ulugbek – English Teaching Forum, 2021
Vocabulary learning is an incremental process. Vocabulary knowledge, especially for second-language learners, may develop across a lifetime. Teachers with experience in providing feedback on their students' vocabulary use in writing or speech might have noticed that it is sometimes difficult to pinpoint one aspect of word knowledge. The reason is…
Descriptors: Vocabulary Development, Second Language Learning, Second Language Instruction, English (Second Language)
Mullis, Ina V. S., Ed.; Martin, Michael O., Ed.; von Davier, Matthias, Ed. – International Association for the Evaluation of Educational Achievement, 2021
TIMSS (Trends in International Mathematics and Science Study) is a long-standing international assessment of mathematics and science at the fourth and eighth grades that has been collecting trend data every four years since 1995. About 70 countries use TIMSS trend data for monitoring the effectiveness of their education systems in a global…
Descriptors: Achievement Tests, International Assessment, Science Achievement, Mathematics Achievement
Peer reviewed
Direct link
Carr, Nathan T.; Xi, Xiaoming – Language Assessment Quarterly, 2010
This article examines how the use of automated scoring procedures for short-answer reading tasks can affect the constructs being assessed. In particular, it highlights ways in which the development of scoring algorithms intended to apply the criteria used by human raters can lead test developers to reexamine and even refine the constructs they…
Descriptors: Scoring, Automation, Reading Tests, Test Format
Peer reviewed
Direct link
Olinghouse, Natalie G.; Colwell, Ryan P. – Intervention in School and Clinic, 2013
This article provides recommendations for teachers to better prepare 3rd through 12th grade students with learning disabilities for large-scale writing assessments. The variation across large-scale writing assessments and the multiple needs of struggling writers indicate the need for test preparation to be embedded within a comprehensive,…
Descriptors: Learning Disabilities, Elementary Secondary Education, Writing Evaluation, Test Wiseness
Peer reviewed
Direct link
Solheim, Oddny Judith; Skaftun, Atle – Assessment in Education: Principles, Policy & Practice, 2009
During the last three decades, the constructed response format has gradually gained entry into large-scale assessments of reading comprehension. In its 1991 Reading Literacy Study, the International Association for the Evaluation of Educational Achievement (IEA) included constructed response items on an exploratory basis. Ten years later, in…
Descriptors: Reading Comprehension, Literacy, Reading Tests, Responses
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments are defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Jin, Yan – Journal of Pan-Pacific Association of Applied Linguistics, 2011
The College English Test (CET) is an English language test designed for educational purposes, administered on a very large scale, and used for making high-stakes decisions. This paper discusses the key issues facing the CET during the course of its development in the past two decades. It argues that the most fundamental and critical concerns of…
Descriptors: High Stakes Tests, Language Tests, Measures (Individuals), Graduates