ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	16

Descriptor

Scoring	34
Test Format	34
Higher Education	12
Test Items	12
Computer Assisted Testing	10
Test Construction	10
Testing Programs	10
State Programs	8
Test Interpretation	8
Educational Testing	7
Essay Tests	7
Objective Tests	7
Student Placement	7
Test Norms	7
Test Reliability	7
Writing Evaluation	7
College Freshmen	6
Equivalency Tests	6
Language Tests	6
Testing	6
Writing Research	6
Computer Software	5
Foreign Countries	5
Automation	4
Evaluation Methods	4
More ▼

Publication Type

Reports - Descriptive	34
Journal Articles	18
Reports - Research	6
Guides - Classroom - Teacher	2
Speeches/Meeting Papers	2
Collected Works - General	1
Guides - Non-Classroom	1
Tests/Questionnaires	1

Education Level

Higher Education	4
Elementary Secondary Education	3
Postsecondary Education	3
Elementary Education	1
Grade 12	1
Grade 4	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Teachers	4
Practitioners	3
Administrators	1
Researchers	1
Students	1

Location

California	7
Canada	2
China	1
Malawi	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	2
Armed Services Vocational…	1
Graduate Record Examinations	1
National Assessment of…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 34 results Save | Export

To Score or Not to Score: Factors Influencing Performance and Feasibility of Automatic Content Scoring of Text Responses

Peer reviewed

Direct link

Zesch, Torsten; Horbach, Andrea; Zehner, Fabian – Educational Measurement: Issues and Practice, 2023

In this article, we systematize the factors influencing performance and feasibility of automatic content scoring methods for short text responses. We argue that performance (i.e., how well an automatic system agrees with human judgments) mainly depends on the linguistic variance seen in the responses and that this variance is indirectly influenced…

Descriptors: Influences, Academic Achievement, Feasibility Studies, Automation

Best Practices for Constructed-Response Scoring. Research Report. ETS RR-22-17

Peer reviewed
PDF on ERIC

Download full text

McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022

This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…

Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Direct link

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

Understanding and Interpreting Human Scoring

Peer reviewed

Direct link

Glazer, Nancy; Wolfe, Edward W. – Applied Measurement in Education, 2020

This introductory article describes how constructed response scoring is carried out, particularly the rater monitoring processes and illustrates three potential designs for conducting rater monitoring in an operational scoring project. The introduction also presents a framework for interpreting research conducted by those who study the constructed…

Descriptors: Scoring, Test Format, Responses, Predictor Variables

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing

Peer reviewed

Direct link

Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022

A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…

Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Download full text

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Assessing Learners' Productive Vocabulary Knowledge: Formats and Considerations

Peer reviewed
PDF on ERIC

Download full text

Sharakhimov, Shoaziz; Nurmukhamedov, Ulugbek – English Teaching Forum, 2021

Vocabulary learning is an incremental process. Vocabulary knowledge, especially for second-language learners, may develop across a lifetime. Teachers with experience in providing feedback on their students' vocabulary use in writing or speech might have noticed that it is sometimes difficult to pinpoint one aspect of word knowledge. The reason is…

Descriptors: Vocabulary Development, Second Language Learning, Second Language Instruction, English (Second Language)

TIMSS 2023 Assessment Frameworks

Download full text

Mullis, Ina V. S., Ed.; Martin, Michael O., Ed.; von Davier, Matthias, Ed. – International Association for the Evaluation of Educational Achievement, 2021

TIMSS (Trends in International Mathematics and Science Study) is a long-standing international assessment of mathematics and science at the fourth and eighth grades that has been collecting trend data every four years since 1995. About 70 countries use TIMSS trend data for monitoring the effectiveness of their education systems in a global…

Descriptors: Achievement Tests, International Assessment, Science Achievement, Mathematics Achievement

Guide to English Language Arts/Literacy Released Items: Understanding Scoring. 2015

Download full text

Partnership for Assessment of Readiness for College and Careers, 2015

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a group of states working together to develop a modern assessment that replaces previous state standardized tests. It provides better information for teachers and parents to identify where a student needs help, or is excelling, so they are able to enhance instruction to…

Descriptors: Literacy, Language Arts, Scoring Formulas, Scoring

Automated Scoring of Short-Answer Reading Items: Implications for Constructs

Peer reviewed

Direct link

Carr, Nathan T.; Xi, Xiaoming – Language Assessment Quarterly, 2010

This article examines how the use of automated scoring procedures for short-answer reading tasks can affect the constructs being assessed. In particular, it highlights ways in which the development of scoring algorithms intended to apply the criteria used by human raters can lead test developers to reexamine and even refine the constructs they…

Descriptors: Scoring, Automation, Reading Tests, Test Format

Testing and Data Integrity in the Administration of Statewide Student Assessment Programs

Download full text

National Council on Measurement in Education, 2012

Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…

Descriptors: State Programs, Integrity, Testing, Test Preparation

Fundamental Concerns in High-Stakes Language Testing: The Case of the College English Test

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jin, Yan – Journal of Pan-Pacific Association of Applied Linguistics, 2011

The College English Test (CET) is an English language test designed for educational purposes, administered on a very large scale, and used for making high-stakes decisions. This paper discusses the key issues facing the CET during the course of its development in the past two decades. It argues that the most fundamental and critical concerns of…

Descriptors: High Stakes Tests, Language Tests, Measures (Individuals), Graduates

Reading Assessment and Item Specifications for the 2009 National Assessment of Educational Progress

Download full text

National Assessment Governing Board, 2009

As the ongoing national indicator of what American students know and can do, the National Assessment of Educational Progress (NAEP) in Reading regularly collects achievement information on representative samples of students in grades 4, 8, and 12. The information that NAEP provides about student achievement helps the public, educators, and…

Descriptors: National Competency Tests, Reading Tests, Test Items, Test Format

Advanced Placement: More than a Test.

Peer reviewed

Colwell, Richard – Music Educators Journal, 1990

Encourages music teachers to work with students interested in advanced placement (AP) music courses. Discusses the logistics and advantages of placing students in these courses. Describes the Advanced Placement Listening and Literature and the Advanced Placement Theory courses and examinations. Outlines the examination scoring method and looks at…

Descriptors: Acceleration (Education), Advanced Placement Programs, Advanced Students, Educational Attainment

Previous Page | Next Page »

Pages: 1 | 2 | 3

Applied Measurement in…	1
Assessment & Evaluation in…	1
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Measurement:…	1
English Teaching Forum	1
Evaluation and the Health…	1
Foreign Language Annals	1
International Association for…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Pan-Pacific…	1
Journal of Technology,…	1
Language Assessment Quarterly	1
Language Testing	1
Music Educators Journal	1
National Assessment Governing…	1
National Council on…	1
Partnership for Assessment of…	1
Practical Assessment,…	1
Teaching of Psychology	1
Technological Horizons in…	1
More ▼

White, Edward M.	6
Alexander, Diane	1
Anderson, Paul S.	1
Baldwin, Peter	1
Becker, Benjamin	1
Carr, Nathan T.	1
Casabianca, Jodi M.	1
Chakwera, Elias	1
Clariana, Roy B.	1
Clauser, Brian E.	1
Colwell, Richard	1
Curran, Linda T.	1
Debeer, Dries	1
Frink, Helen H.	1
Gifford, Bernard	1
Glazer, Nancy	1
Han, Chao	1
Holmes, Susan E.	1
Horbach, Andrea	1
Jin, Yan	1
Jordan, Linda A.	1
Kalat, James W.	1
Khembo, Dafter	1
Kingsbury, G. Gage	1
More ▼