ERIC - Search Results

Publication Date

In 2025	28
Since 2024	115
Since 2021 (last 5 years)	514
Since 2016 (last 10 years)	1207
Since 2006 (last 20 years)	2540

Descriptor

Scoring	5100
Test Construction	812
Foreign Countries	761
Test Validity	673
Test Reliability	671
Evaluation Methods	660
Scores	634
Test Items	625
Student Evaluation	621
Testing	549
Comparative Analysis	533
Computer Assisted Testing	506
Higher Education	497
Elementary Secondary Education	496
Writing Evaluation	450
Language Tests	430
Correlation	406
English (Second Language)	403
Second Language Learning	391
Academic Achievement	365
Statistical Analysis	365
Elementary School Students	363
Educational Assessment	353
Test Interpretation	351
Achievement Tests	345

Author

Attali, Yigal	22
McNamara, Danielle S.	21
Bennett, Randy Elliot	19
Wolfe, Edward W.	17
Hambleton, Ronald K.	15
Wainer, Howard	15
Bejar, Isaac I.	14
Williamson, David M.	14
Allen, Laura K.	13
Livingston, Samuel A.	13
Crossley, Scott A.	12
Plake, Barbara S.	12
White, Sheida	12
Baker, Eva L.	11
Bridgeman, Brent	11
Shavelson, Richard J.	11
Shermis, Mark D.	11
Sireci, Stephen G.	11
Xi, Xiaoming	11
Dimitrov, Dimiter M.	10
Dorans, Neil J.	10
Liu, Ou Lydia	10
Yen, Wendy M.	10
Clauser, Brian E.	9

Education Level

Higher Education	654
Postsecondary Education	478
Elementary Education	432
Secondary Education	372
Elementary Secondary Education	257
Middle Schools	231
High Schools	182
Junior High Schools	165
Early Childhood Education	160
Grade 4	157
Intermediate Grades	146
Grade 8	145
Grade 5	112
Grade 6	105
Grade 3	104
Primary Education	101
Grade 7	92
Grade 10	59
Preschool Education	58
Kindergarten	54
Grade 2	39
Grade 11	34
Grade 1	31
Grade 9	30
Adult Education	29

Audience

Practitioners	248
Teachers	220
Researchers	114
Administrators	76
Policymakers	27
Counselors	23
Students	20
Parents	18
Community	7
Support Staff	2

Location

Australia	70
Canada	70
United States	67
China	66
New York	66
California	55
Florida	53
Turkey	48
United Kingdom	46
Japan	41
Netherlands	33
Pennsylvania	31
United Kingdom (England)	30
Iran	24
South Carolina	24
Taiwan	23
Germany	22
Texas	22
South Korea	20
Arizona	19
India	19
Hong Kong	18
North Carolina	18
Tennessee	18
Kentucky	17

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	4
Does not meet standards	5

Descriptor: Scoring

Showing 1 to 15 of 5,100 results

Linking Errors Introduced by Rapid Guessing Responses When Employing Multigroup Concurrent IRT Scaling

Jiayi Deng – ProQuest LLC, 2024

Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…

Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement

Statistically Guided Grading Judgements: Contextualisation or Contamination?

Peer reviewed

Louise Badham – Oxford Review of Education, 2025

Different sources of assessment evidence are reviewed during International Baccalaureate (IB) grade awarding to convert marks into grades and ensure fair results for students. Qualitative and quantitative evidence are analysed to determine grade boundaries, with statistical evidence weighed against examiner judgement and teachers' feedback on…

Descriptors: Advanced Placement Programs, Grading, Interrater Reliability, Evaluative Thinking

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Inter-Rater Reliability in Comprehensive Examination Scoring: The Case for Consistent and Collaborative Rater Training and Calibration

Saenz, David Arron – Online Submission, 2023

There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability as research indicates several factors including frequency and timing play crucial roles towards ensuring inter-rater reliability. Additionally, increasing amounts of research indicate possible links in…

Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Automated Short Answer Scoring Using an Ensemble of Neural Networks and Latent Semantic Analysis Classifiers

Peer reviewed

Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023

We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the performance of the engine and show that the engine achieves above-human-level performance on a…

Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics

Automatic Essay Scoring for Discussion Forum in Online Learning Based on Semantic and Keyword Similarities

Peer reviewed

Dhini, Bachriah Fatwa; Girsang, Abba Suganda; Sufandi, Unggul Utan; Kurniawati, Heny – Asian Association of Open Universities Journal, 2023

Purpose: The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the…

Descriptors: Computer Assisted Testing, Scoring, Writing Evaluation, Essays

Scoring Difficulty in Summary Writing Assessment: Toward the Reconstruction of Analytic Rubric

Peer reviewed

Makiko Kato – Journal of Education and Learning, 2025

This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…

Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)

Machine Learning and Hebrew NLP for Automated Assessment of Open-Ended Questions in Biology

Peer reviewed

Ariely, Moriah; Nazaretsky, Tanya; Alexandron, Giora – International Journal of Artificial Intelligence in Education, 2023

Machine learning algorithms that automatically score scientific explanations can be used to measure students' conceptual understanding, identify gaps in their reasoning, and provide them with timely and individualized feedback. This paper presents the results of a study that uses Hebrew NLP to automatically score student explanations in Biology…

Descriptors: Artificial Intelligence, Algorithms, Natural Language Processing, Hebrew

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

A Rubric for the Detection of Students in Crisis

Peer reviewed

Burkhardt, Amy; Lottridge, Susan; Woolf, Sherri – Educational Measurement: Issues and Practice, 2021

For some students, standardized tests serve as a conduit to disclose sensitive issues of harm or distress that may otherwise go unreported. By detecting this writing, known as "crisis papers," testing programs have a unique opportunity to assist in mitigating the risk of harm to these students. The use of machine learning to…

Descriptors: Scoring Rubrics, Identification, At Risk Students, Standardized Tests

Employing a Hierarchical Rater Models for Automated Scoring: Scope Review on the Application in Educational Assessment

Peer reviewed

Akif Avcu – Malaysian Online Journal of Educational Technology, 2025

This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) become operable for use in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…

Descriptors: Automation, Scoring, Models, Educational Assessment

Scoring Running Records: Complexities and Affordances

Peer reviewed

Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023

Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…

Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level

Using Think-Aloud Interviews to Examine a Clinically Oriented Performance Assessment Rubric

Peer reviewed

Roduta Roberts, Mary; Gotch, Chad M.; Cook, Megan; Werther, Karin; Chao, Iris C. I. – Measurement: Interdisciplinary Research and Perspectives, 2022

Performance-based assessment is a common approach to assess the development and acquisition of practice competencies among health professions students. Judgments related to the quality of performance are typically operationalized as ratings against success criteria specified within a rubric. The extent to which the rubric is understood,…

Descriptors: Protocol Analysis, Scoring Rubrics, Interviews, Performance Based Assessment

On the Limitations of Human-Computer Agreement in Automated Essay Scoring

Peer reviewed

Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021

Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) facilitates the scoring process to be faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring the score agreement with the human raters. However, we provide empirical evidence that…

Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring

Source

ProQuest LLC	123
Educational and Psychological…	122
Journal of Educational…	106
ETS Research Report Series	99
Grantee Submission	90
Language Testing	75
Educational Measurement:…	71
Applied Measurement in…	68
Journal of Psychoeducational…	67
Online Submission	56
Center on Education Policy	52
Applied Psychological…	50
Educational Assessment	33
New York State Education…	32
Psychology in the Schools	30
International Journal of…	28
Language Assessment Quarterly	28
Assessing Writing	27
Journal of Educational and…	27
Journal of Speech, Language,…	26
Psychometrika	24
Educational Testing Service	22
International Educational…	22
Assessment	19
Language, Speech, and Hearing…	19

Publication Type

Journal Articles	2717
Reports - Research	2424
Reports - Evaluative	882
Reports - Descriptive	634
Speeches/Meeting Papers	564
Tests/Questionnaires	360
Guides - Non-Classroom	309
Dissertations/Theses -…	123
Numerical/Quantitative Data	121
Information Analyses	111
Guides - Classroom - Teacher	101
Opinion Papers	99
Books	73
Guides - General	39
Collected Works - General	31
Reports - General	20
Book/Product Reviews	18
Collected Works - Proceedings	16
ERIC Publications	15
ERIC Digests in Full Text	14
Guides - Classroom - Learner	12
Reference Materials -…	12
Collected Works - Serials	11
Legal/Legislative/Regulatory…	8
Multilingual/Bilingual…	6

Laws, Policies, & Programs

No Child Left Behind Act 2001	85
Individuals with Disabilities…	10
Elementary and Secondary…	9
Elementary and Secondary…	5
Kentucky Education Reform Act…	5
Race to the Top	4
Education Consolidation…	3
Every Student Succeeds Act…	3
Comprehensive Education…	2
Individuals with Disabilities…	2
Americans with Disabilities…	1
Family Educational Rights and…	1
Health Insurance Portability…	1
Improving Americas Schools…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
National Defense Education Act	1
Rehabilitation Act 1973…	1

Assessments and Surveys

National Assessment of…	109
Test of English as a Foreign…	78
SAT (College Admission Test)	62
Graduate Record Examinations	54
Wechsler Intelligence Scale…	50
Advanced Placement…	21
ACT Assessment	20
Peabody Picture Vocabulary…	20
Program for International…	20
Woodcock Johnson Tests of…	18
Trends in International…	17
International English…	16
Wechsler Adult Intelligence…	16
Torrance Tests of Creative…	15
Kaufman Assessment Battery…	13
Wechsler Individual…	13
Bender Gestalt Test	11
General Educational…	11
Medical College Admission Test	11
Washington Assessment of…	11
College Level Academic Skills…	10
College Level Examination…	10
Iowa Tests of Basic Skills	9
Stanford Binet Intelligence…	9
Comprehensive Tests of Basic…	8