Showing 1 to 15 of 92 results
Peer reviewed
Matt Homer – Advances in Health Sciences Education, 2024
Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…
Descriptors: Examiners, Scoring, Validity, Cutting Scores
Peer reviewed
Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025
Recent developments in the use of large language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…
Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy
Peer reviewed
Xin Qiao; Akihito Kamata; Cornelis Potgieter – Grantee Submission, 2024
Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring.…
Descriptors: Oral Reading, Reading Fluency, Error Patterns, Scoring
Peer reviewed
Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high-quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Jessica Stinson – ProQuest LLC, 2024
Intelligence tests have been used in the United States since the early 1900s for assessing soldiers during World War I (Kaufman & Harrison, 2008; White & Hall, 1980). Presently, cognitive assessments are used in school, civil service, military, clinical, and industry settings (White & Hall, 1980). Although the results of these…
Descriptors: Graduate Students, Masters Programs, Doctoral Programs, Comparative Analysis
Atehortua, Laura – ProQuest LLC, 2022
Intelligence tests are used in a variety of settings such as schools, clinics, and courts to assess the intellectual capacity of individuals of all ages. Intelligence tests are used to make high-stakes decisions such as special education placement, employment, eligibility for social security services, and determination of the death penalty.…
Descriptors: Adults, Intelligence Tests, Children, Error of Measurement
Peer reviewed
Li, Liang-Yi; Huang, Wen-Lung – Educational Technology & Society, 2023
With increasing bandwidth, videos have gradually come to be used as submissions for online peer assessment activities. However, their transient nature imposes a high cognitive load on students, particularly low-ability students. Therefore, reviewers' ability is a key factor that may affect the reviewing process and performance in an online video peer…
Descriptors: Peer Evaluation, Undergraduate Students, Video Technology, Evaluation Methods
Peer reviewed
Almusharraf, Norah; Alotaibi, Hind – Technology, Knowledge and Learning, 2023
Evaluating written texts is believed to be a time-consuming process that can lack consistency and objectivity. Automated essay scoring (AES) can provide solutions to some of the limitations of human scoring. This research aimed to evaluate the performance of one AES system, Grammarly, in comparison to human raters. Both approaches' performances…
Descriptors: Writing Evaluation, Writing Tests, Essay Tests, Essays
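
Comparisons between automated and human essay scores, like the one in the entry above, are commonly summarized with quadratic weighted kappa (QWK). The sketch below is illustrative only: the score arrays are invented, and QWK is a standard choice of agreement statistic rather than necessarily the one used in this study.

```python
# Illustrative sketch: quadratic weighted kappa (QWK) for AES vs. human raters.
# The score arrays are fabricated examples, not data from the study above.
from sklearn.metrics import cohen_kappa_score

human   = [4, 3, 5, 2, 4, 3, 5, 1]   # hypothetical human rater scores (1-5 rubric)
machine = [4, 3, 4, 2, 5, 3, 5, 2]   # hypothetical AES scores for the same essays

qwk = cohen_kappa_score(human, machine, weights="quadratic")
print(f"Quadratic weighted kappa: {qwk:.3f}")  # 1.0 = perfect agreement
```

QWK penalizes large disagreements more heavily than adjacent ones, which suits ordinal rubric scales.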
Peer reviewed
Lockwood, Adam B.; Klatka, Kelsey; Freeman, Kelli; Farmer, Ryan L.; Benson, Nicholas – Journal of Psychoeducational Assessment, 2023
Sixty-three Woodcock-Johnson IV Tests of Achievement protocols, administered by 26 school psychology trainees, were examined to determine the frequency of examiner errors. Errors were noted on all protocols and ranged from 8 to 150 per administration. Critical (e.g., start, stop, and calculation) errors were noted on roughly 97% of protocols.…
Descriptors: Achievement Tests, School Psychology, Counselor Training, Trainees
Botarleanu, Robert-Mihai; Dascalu, Mihai; Allen, Laura K.; Crossley, Scott Andrew; McNamara, Danielle S. – Grantee Submission, 2022
Automated scoring of student language is a complex task that requires systems to emulate complex and multi-faceted human evaluation criteria. Summary scoring brings an additional layer of complexity to automated scoring because it involves two texts of differing lengths that must be compared. In this study, we present our approach to automate…
Descriptors: Automation, Scoring, Documentation, Likert Scales
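
The core difficulty named in the entry above, comparing two texts of differing lengths, can be made concrete with a simple bag-of-words baseline. This sketch is a generic illustration, not the authors' system; the example texts and the TF-IDF/cosine choice are assumptions.

```python
# Illustrative sketch: a naive length-insensitive baseline for summary scoring.
# Cosine similarity of TF-IDF vectors compares a long source with a short summary.
# The texts are invented; this is not the approach presented in the study above.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

source  = "Photosynthesis converts light energy into chemical energy stored in glucose."
summary = "Plants turn light into stored chemical energy."

tfidf = TfidfVectorizer().fit_transform([source, summary])
score = cosine_similarity(tfidf[0], tfidf[1])[0, 0]
print(f"Similarity-based summary score: {score:.2f}")
```

Cosine similarity is length-normalized, which sidesteps the raw length mismatch, but it ignores word order and meaning; that gap is what learned scoring models aim to close.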
Abdalla, Widad – ProQuest LLC, 2019
Trend scoring is often used in large-scale assessments to monitor for rater drift when the same constructed response items are administered in multiple test administrations. In trend scoring, a set of responses from Time "A" are rescored by raters at Time "B." The purpose of this study is to examine the ability of…
Descriptors: Scoring, Interrater Reliability, Test Items, Error Patterns
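
The trend-scoring design described above, in which Time "A" responses are rescored at Time "B", supports simple drift diagnostics. A minimal sketch with fabricated scores; exact agreement and mean score change are illustrative statistics, not necessarily those examined in the dissertation.

```python
# Illustrative sketch: a minimal rater-drift check in the spirit of trend scoring.
# Scores are fabricated for illustration only.
time_a = [3, 2, 4, 3, 1, 4, 2, 3]  # original operational scores
time_b = [3, 2, 3, 3, 1, 4, 2, 2]  # later rescores of the same responses

exact = sum(a == b for a, b in zip(time_a, time_b)) / len(time_a)
drift = sum(b - a for a, b in zip(time_a, time_b)) / len(time_a)
print(f"Exact agreement: {exact:.2f}")   # share of identical rescores
print(f"Mean drift:      {drift:+.2f}")  # negative = raters became harsher
```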
Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019
Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.
Descriptors: Scores, Measurement, Test Reliability, Error Patterns
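
One standard way to demonstrate, rather than assume, the reliability described above is an internal-consistency index. A minimal sketch computing Cronbach's alpha on a fabricated item-response matrix; alpha is an assumed choice here, since the brief does not name a specific index.

```python
# Illustrative sketch: Cronbach's alpha as evidence of score reliability.
# The item-response matrix is fabricated: rows are examinees, columns are items.
import numpy as np

scores = np.array([[3, 4, 3, 4],
                   [2, 2, 3, 2],
                   [4, 4, 5, 4],
                   [1, 2, 1, 2],
                   [3, 3, 4, 3]])

k = scores.shape[1]                          # number of items
item_var = scores.var(axis=0, ddof=1).sum()  # sum of per-item variances
total_var = scores.sum(axis=1).var(ddof=1)   # variance of total scores
alpha = (k / (k - 1)) * (1 - item_var / total_var)
print(f"Cronbach's alpha: {alpha:.3f}")
```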
Peer reviewed
Baral, Sami; Botelho, Anthony F.; Erickson, John A.; Benachamardi, Priyanka; Heffernan, Neil T. – International Educational Data Mining Society, 2021
Open-ended questions in mathematics are commonly used by teachers to monitor and assess students' deeper conceptual understanding of content. Student answers to these types of questions often exhibit a combination of language, drawn diagrams and tables, and mathematical formulas and expressions that supply teachers with insight into the processes…
Descriptors: Scoring, Automation, Mathematics Tests, Student Evaluation
Peer reviewed
Corcoran, Stephanie – Contemporary School Psychology, 2022
With iPad-mediated cognitive assessment gaining popularity in school districts and the need for alternative modes of training and instruction during the COVID-19 pandemic, school psychology training programs will need to adapt to effectively train their students to be competent in administering, scoring, and interpreting cognitive…
Descriptors: School Psychologists, Professional Education, Job Skills, Cognitive Tests
Peer reviewed
Lockwood, Adam B.; Sealander, Karen; Gross, Thomas J.; Lanterman, Christopher – Journal of Psychoeducational Assessment, 2020
Achievement tests are used to make high-stakes (e.g., special education placement) decisions, and previous research on norm-referenced assessment suggests that errors are ubiquitous. In our study of 42 teacher trainees, utilizing five of the six core subtests of the Kaufman Test of Educational Achievement, Third Edition (KTEA-3), we found that…
Descriptors: Achievement Tests, Preservice Teachers, Testing, Scoring