Publication Date
In 2025 | 1 |
Since 2024 | 7 |
Descriptor
Source
ProQuest LLC | 2 |
Advances in Health Sciences… | 1 |
American Annals of the Deaf | 1 |
Educational Assessment | 1 |
Grantee Submission | 1 |
Journal of Educational… | 1 |
Author
Akihito Kamata | 1 |
Alex J. Mechaber | 1 |
Brian E. Clauser | 1 |
Cornelis Potgieter | 1 |
Jessica Stinson | 1 |
Kai North | 1 |
Kimberly Wolbers | 1 |
Le An Ha | 1 |
Mark White | 1 |
Matt Homer | 1 |
Matt Ronfeldt | 1 |
More ▼ |
Publication Type
Reports - Research | 5 |
Journal Articles | 4 |
Dissertations/Theses -… | 2 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 10 | 1 |
Grade 11 | 1 |
Grade 9 | 1 |
High Schools | 1 |
Junior High Schools | 1 |
Kindergarten | 1 |
More ▼ |
Audience
Location
China | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Wechsler Adult Intelligence… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Matt Homer – Advances in Health Sciences Education, 2024
Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…
Descriptors: Examiners, Scoring, Validity, Cutting Scores
Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025
Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…
Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy
Xin Qiao; Akihito Kamata; Cornelis Potgieter – Grantee Submission, 2024
Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring.…
Descriptors: Oral Reading, Reading Fluency, Error Patterns, Scoring
Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Jessica Stinson – ProQuest LLC, 2024
Intelligence tests have been used in the United States since the early 1900s for assessing soldiers during World War I (Kaufman & Harrison, 2008; White & Hall, 1980). Presently, cognitive assessments are used in school, civil service, military, clinical, and industry settings (White & Hall, 1980). Although the results of these…
Descriptors: Graduate Students, Masters Programs, Doctoral Programs, Comparative Analysis
Yi Gui – ProQuest LLC, 2024
This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…
Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring
Yachong Cui; Rachel Saulsburry; Kimberly Wolbers – American Annals of the Deaf, 2024
Limited access to spoken and signed language is a worldwide phenomenon affecting deaf children. Language delay caused by impeded language acquisition has negative cascading effects on deaf children's learning and development. In the event of stymied language development, deaf students exhibit highly errored writing and commit errors unseen in the…
Descriptors: Deafness, Written Language, Writing Evaluation, North Americans