ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	14
Since 2007 (last 20 years)	23

Descriptor

Interrater Reliability	43
Test Format	43
Language Tests	12
Test Items	12
Test Reliability	12
Scoring	10
Test Validity	10
Computer Assisted Testing	9
Foreign Countries	9
Test Construction	9
Higher Education	8
English (Second Language)	7
Interviews	7
Second Language Learning	7
Essay Tests	6
Evaluators	6
Language Proficiency	6
Rating Scales	6
Scores	6
Second Language Instruction	6
Testing	6
Undergraduate Students	6
Comparative Analysis	5
Construct Validity	5
Evaluation Methods	5
More ▼

Publication Type

Journal Articles	28
Reports - Research	24
Reports - Evaluative	10
Speeches/Meeting Papers	7
Reports - Descriptive	4
Tests/Questionnaires	3
Dissertations/Theses -…	2
Guides - Non-Classroom	2
Collected Works - Serials	1
Information Analyses	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	12
Postsecondary Education	11
Elementary Education	3
High Schools	2
Secondary Education	2
Early Childhood Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Administrators	1
Practitioners	1
Researchers	1
Teachers	1

Location

Turkey	2
Brazil	1
China	1
Colombia	1
Germany	1
India	1
Iran	1
Japan	1
Jordan	1
Louisiana	1
Mexico	1
Netherlands	1
New York	1
South Korea	1
Switzerland	1
United Kingdom	1
United Kingdom (Scotland)	1
United States	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1

Assessments and Surveys

Test of English as a Foreign…	3
ACT Assessment	1
General Educational…	1
National Household Education…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 43 results Save | Export

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Direct link

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers, a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

Detecting Rater Bias in Mixed-Format Assessments

Peer reviewed

Direct link

Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024

Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…

Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses

Best Practices for Constructed-Response Scoring. Research Report. ETS RR-22-17

Peer reviewed
PDF on ERIC

Download full text

McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022

This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…

Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing

The Development of a Test to Explore the Students' Mental Models and External Representation Patterns of Hanging Objects

Peer reviewed
PDF on ERIC

Download full text

Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021

This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…

Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water

Automated Essay Scoring Effect on Test Equating Errors in Mixed-Format Test

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to…

Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format

Computer-Based and Paper-and-Pencil Tests: A Study in Calculus for STEM Majors

Peer reviewed

Direct link

Smolinsky, Lawrence; Marx, Brian D.; Olafsson, Gestur; Ma, Yanxia A. – Journal of Educational Computing Research, 2020

Computer-based testing is an expanding use of technology offering advantages to teachers and students. We studied Calculus II classes for science, technology, engineering, and mathematics majors using different testing modes. Three sections with 324 students employed: paper-and-pencil testing, computer-based testing, and both. Computer tests gave…

Descriptors: Test Format, Computer Assisted Testing, Paper (Material), Calculus

Measuring Multidimensional Science Learning: Item Design, Scoring, and Psychometric Considerations

Direct link

Castle, Courtney – ProQuest LLC, 2018

The Next Generation Science Standards propose a multidimensional model of science learning, comprised of Core Disciplinary Ideas, Science and Engineering Practices, and Crosscutting Concepts (NGSS Lead States, 2013). Accordingly, there is a need for student assessment aligned with the new standards. Creating assessments that validly and reliably…

Descriptors: Science Education, Student Evaluation, Science Tests, Test Construction

Developing and Validating a Computerized Oral Proficiency Test of English as a Foreign Language (COPTEFL)

Peer reviewed
PDF on ERIC

Download full text

Isler, Cemre; Aydin, Belgin – International Journal of Assessment Tools in Education, 2021

This study is about the development and validation process of the Computerized Oral Proficiency Test of English as a Foreign Language (COPTEFL). The test aims at assessing the speaking proficiency levels of students in Anadolu University School of Foreign Languages (AUSFL). For this purpose, three monologic tasks were developed based on the Global…

Descriptors: Test Construction, Construct Validity, Interrater Reliability, Scores

Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items

Peer reviewed

Direct link

Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017

Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…

Descriptors: Automation, Scoring, Comparative Analysis, Test Items

Student Epistemological Framing on Paper-Based Assessments

Peer reviewed

Direct link

Shar, Kelli; Russ, Rosemary S.; Laverty, James T. – Physical Review Physics Education Research, 2020

Assessments are usually thought of as ways for instructors to get information from students. In this work, we flip this perspective and explore how assessments communicate information to students. Specifically, we consider how assessments may provide information about what faculty and/or researchers think it means to know and do physics, i.e.,…

Descriptors: Epistemology, Science Instruction, Physics, Science Tests

Sign Language Learning and Assessment in German Switzerland: Exploring the Potential of Vocabulary Size Tests for Swiss German Sign Language

Peer reviewed
PDF on ERIC

Download full text

Haug, Tobias; Ebling, Sarah; Braem, Penny Boyes; Tissi, Katja; Sidler-Miserez, Sandra – Language Education & Assessment, 2019

In German Switzerland the learning and assessment of Swiss German Sign Language ("Deutschschweizerische Gebärdensprache," DSGS) takes place in different contexts, for example, in tertiary education or in continuous education courses. By way of the still ongoing implementation of the Common European Framework of Reference for DSGS,…

Descriptors: German, Sign Language, Language Tests, Test Items

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

WordBytes: Exploring an Intermediate Constraint Format for Rapid Classification of Student Answers on Constructed Response Assessments

Peer reviewed
PDF on ERIC

Download full text

Kim, Kerry J.; Meir, Eli; Pope, Denise S.; Wendel, Daniel – Journal of Educational Data Mining, 2017

Computerized classification of student answers offers the possibility of instant feedback and improved learning. Open response (OR) questions provide greater insight into student thinking and understanding than more constrained multiple choice (MC) questions, but development of automated classifiers is more difficult, often requiring training a…

Descriptors: Classification, Computer Assisted Testing, Multiple Choice Tests, Test Format

Rater Perceptions of Bias Using the Multiple Mini-Interview Format: A Qualitative Study

Peer reviewed
PDF on ERIC

Download full text

Alweis, Richard L.; Fitzpatrick, Caroline; Donato, Anthony A. – Journal of Education and Training Studies, 2015

Introduction: The Multiple Mini-Interview (MMI) format appears to mitigate individual rater biases. However, the format itself may introduce structural systematic bias, favoring extroverted personality types. This study aimed to gain a better understanding of these biases from the perspective of the interviewer. Methods: A sample of MMI…

Descriptors: Interviews, Interrater Reliability, Qualitative Research, Semi Structured Interviews

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Download full text

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Previous Page | Next Page »

Pages: 1 | 2 | 3

Applied Measurement in…	2
ETS Research Report Series	2
Educational and Psychological…	2
International Journal of…	2
Journal of Educational…	2
ALT-J: Research in Learning…	1
Adult Basic Education and…	1
Annual Review of Applied…	1
Assessing Writing	1
Assessment & Evaluation in…	1
Canadian Modern Language…	1
Educational Assessment	1
English Language Teaching	1
Journal of Communication…	1
Journal of Education and…	1
Journal of Education for…	1
Journal of Educational Data…	1
Language Assessment Quarterly	1
Language Education &…	1
Measurement:…	1
New York State Education…	1
Online Submission	1
Pegem Journal of Education…	1
Perceptual and Motor Skills	1
Physical Review Physics…	1
More ▼

Lunz, Mary E.	2
Ahmadi, Alireza	1
Alderson, J. Charles	1
Almond, Patricia	1
Alweis, Richard L.	1
Aydin, Belgin	1
Bagherkazemi, Marzieh	1
Birjandi, Parviz	1
Boldt, R. F.	1
Boyer, Michelle	1
Braem, Penny Boyes	1
Burk, John	1
Casabianca, Jodi M.	1
Castle, Courtney	1
Chalhoub-Deville, Micheline	1
Chang, Lei	1
Clariana, Roy B.	1
Curren, Randall R.	1
Davis, Larry	1
Dogan, Nuri	1
Donato, Anthony A.	1
Downing, Steven M.	1
Ebling, Sarah	1
Edward Paul Getman	1
More ▼