ERIC - Search Results

Publication Date

In 2025	11
Since 2024	34
Since 2021 (last 5 years)	156
Since 2016 (last 10 years)	369
Since 2006 (last 20 years)	723

Descriptor

Comparative Analysis	966
Scoring	536
Scoring Rubrics	365
Foreign Countries	272
Teaching Methods	176
Scores	155
English (Second Language)	151
Second Language Learning	149
Statistical Analysis	145
Correlation	129
Evaluation Methods	110
Writing Evaluation	107
Language Tests	103
Second Language Instruction	100
Test Items	96
College Students	95
Student Attitudes	94
Student Evaluation	93
Undergraduate Students	87
Computer Assisted Testing	82
Pretests Posttests	81
Essays	79
Higher Education	76
Reliability	73
Elementary School Students	71
More ▼

Education Level

Higher Education	268
Postsecondary Education	221
Secondary Education	118
Elementary Education	106
High Schools	63
Middle Schools	58
Elementary Secondary Education	42
Early Childhood Education	37
Junior High Schools	34
Grade 4	24
Intermediate Grades	22
Grade 7	20
Grade 5	19
Grade 8	19
Primary Education	19
Grade 6	17
Grade 3	16
Grade 10	14
Preschool Education	14
Adult Education	13
Grade 9	13
Kindergarten	12
Grade 11	11
Grade 2	9
Two Year Colleges	8
More ▼

Audience

Teachers	9
Practitioners	6
Researchers	6
Administrators	1
Students	1

Location

Australia	21
China	21
Turkey	19
Taiwan	15
Canada	12
Iran	12
Netherlands	12
California	11
Japan	11
United States	11
United Kingdom	10
Florida	8
Germany	8
Indonesia	8
Spain	8
Arizona	7
New York	7
Texas	7
United Kingdom (England)	7
South Korea	6
Tennessee	6
Connecticut	5
Egypt	5
Georgia	5
India	5
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	9
Elementary and Secondary…	2
Every Student Succeeds Act…	2
Bilingual Education Act 1968	1
Brown v Board of Education	1
Civil Rights Act 1964	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Equal Educational…	1
Individuals with Disabilities…	1
Lau v Nichols	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	2
Meets WWC Standards with or without Reservations	3
Does not meet standards	2

Showing 1 to 15 of 966 results Save | Export

Interpreting Scores on the Enhanced ACT: Guidance for K-12 and Higher Education Institutions. ACT State and Federal Policy

Download full text

James Riddlesperger – ACT Education Corp., 2025

ACT announced a series of enhancements designed to modernize the ACT test and offer students more choice and flexibility in demonstrating their readiness for life after high school. The enhancements provide students more flexibility by allowing them to choose whether to take the science assessment, thereby reducing the test length by up to…

Descriptors: College Entrance Examinations, Testing, Change, Test Length

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Wechsler Trickle-Down Errors: A Comparison between Master's Students and Doctoral Students

Direct link

Jessica Stinson – ProQuest LLC, 2024

Intelligence tests have been used in the United States since the early 1900s for assessing soldiers during World War I (Kaufman & Harrison, 2008; White & Hall, 1980). Presently, cognitive assessments are used in school, civil service, military, clinical, and industry settings (White & Hall, 1980). Although the results of these…

Descriptors: Graduate Students, Masters Programs, Doctoral Programs, Comparative Analysis

Interpretable Cognitive State Prediction via Temporal Fuzzy Cognitive Map

Peer reviewed

Direct link

Yuang Wei; Bo Jiang – IEEE Transactions on Learning Technologies, 2024

Understanding student cognitive states is essential for assessing human learning. The deep neural networks (DNN)-inspired cognitive state prediction method improved prediction performance significantly; however, the lack of explainability with DNNs and the unitary scoring approach fail to reveal the factors influencing human learning. Identifying…

Descriptors: Cognitive Mapping, Models, Prediction, Short Term Memory

Coherence-Based Automatic Short Answer Scoring Using Sentence Embedding

Peer reviewed

Direct link

Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024

Automatic essay scoring (AES) is an essential educational application in natural language processing. This automated process will alleviate the burden by increasing the reliability and consistency of the assessment. With the advances in text embedding libraries and neural network models, AES systems achieved good results in terms of accuracy.…

Descriptors: Scoring, Essays, Writing Evaluation, Memory

Grading the Graders: Comparing Generative AI and Human Assessment in Essay Evaluation

Peer reviewed

Direct link

Elizabeth L. Wetzler; Kenneth S. Cassidy; Margaret J. Jones; Chelsea R. Frazier; Nickalous A. Korbut; Chelsea M. Sims; Shari S. Bowen; Michael Wood – Teaching of Psychology, 2025

Background: Generative artificial intelligence (AI) represents a potentially powerful, time-saving tool for grading student essays. However, little is known about how AI-generated essay scores compare to human instructor scores. Objective: The purpose of this study was to compare the essay grading scores produced by AI with those of human…

Descriptors: Essays, Writing Evaluation, Scores, Evaluators

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

New Meridian Comparability Review Guidelines. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

Quality Testing Standards and Criteria for Comparability Claims. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

The Classification Accuracy and Consistency of Comparative Judgement of Writing Compared to Rubric-Based Teacher Assessment

Peer reviewed

Direct link

Pinot de Moira, Anne; Wheadon, Christopher; Christodoulou, Daisy – Research in Education, 2022

Writing is generally assessed internationally using rubric-based approaches, but there is a growing body of evidence to suggest that the reliability of such approaches is poor. In contrast, comparative judgement studies suggest that it is possible to assess open ended tasks such as writing with greater reliability. Many previous studies, however,…

Descriptors: Writing Evaluation, Classification, Accuracy, Scoring Rubrics

AI-Enabled Correction: A Professor's Journey

Peer reviewed

Direct link

Peter Daly; Emmanuelle Deglaire – Innovations in Education and Teaching International, 2025

AI-enabled assessment of student papers has the potential to provide both summative and formative feedback and reduce the time spent on grading. Using auto-ethnography, this study compares AI-enabled and human assessment of business student examination papers in a law module based on previously established rubrics. Examination papers were…

Descriptors: Artificial Intelligence, Computer Software, Technology Integration, College Faculty

An Investigation of the Comparability of Commission-Approved Teaching Performance Assessment Models. Final Report -- Volume II: Appendices. No. 120

Download full text

Sinclair, Andrea L., Ed.; Thacker, Arthur, Ed. – Human Resources Research Organization (HumRRO), 2019

These are the appendices for the technical report, "An Investigation of the Comparability of Commission-Approved Teaching Performance Assessment Models." California's Commission on Teacher Credentialing (Commission) requires all programs of preliminary multiple and single subject teacher preparation to use a Commission-approved Teaching…

Descriptors: Performance Based Assessment, Preservice Teachers, Models, Scoring Rubrics

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Direct link

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 65

ProQuest LLC	49
Journal of Educational…	24
Online Submission	23
Language Testing	22
Educational and Psychological…	20
ETS Research Report Series	16
Applied Measurement in…	15
Journal of Speech, Language,…	11
Language Assessment Quarterly	11
Grantee Submission	10
Assessment & Evaluation in…	9
Applied Psychological…	7
English Language Teaching	7
International Journal of…	7
Language, Speech, and Hearing…	7
Physical Review Physics…	7
Assessment in Education:…	6
Educational Measurement:…	6
Journal of Educational…	6
Journal of Educational and…	6
Physical Review Special…	6
Advances in Health Sciences…	5
CBE - Life Sciences Education	5
International Education…	5
Journal of Experimental…	5
More ▼

Attali, Yigal	6
Weiss, David J.	5
Linn, Marcia C.	4
Plake, Barbara S.	4
Singh, Chandralekha	4
Sinharay, Sandip	4
Wainer, Howard	4
Clauser, Brian E.	3
Donovan, Jenny	3
Hwang, Gwo-Jen	3
Kim, Sooyeon	3
Lennon, Melissa	3
Linda Bol	3
Linn, Robert L.	3
Liu, Ou Lydia	3
Livingston, Samuel A.	3
Martin, Michael O., Ed.	3
Sireci, Stephen G.	3
Zechner, Klaus	3
Ackermans, Kevin	2
Al-Salmani, Fatema	2
Alexander, Patricia A.	2
Allen, Melissa M.	2
Anderson, Paul S.	2
More ▼

Journal Articles	689
Reports - Research	669
Reports - Evaluative	152
Speeches/Meeting Papers	77
Tests/Questionnaires	77
Dissertations/Theses -…	51
Reports - Descriptive	41
Numerical/Quantitative Data	16
Collected Works - General	8
Books	7
Information Analyses	7
Guides - General	6
Guides - Non-Classroom	6
Opinion Papers	3
Guides - Classroom - Learner	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Dissertations/Theses -…	1
Translations	1
More ▼

National Assessment of…	22
Test of English as a Foreign…	19
Wechsler Intelligence Scale…	10
Peabody Picture Vocabulary…	8
SAT (College Admission Test)	8
International English…	7
Program for International…	7
Trends in International…	7
Graduate Record Examinations	6
ACT Assessment	5
Early Childhood Environment…	4
Dynamic Indicators of Basic…	3
Clinical Evaluation of…	2
College Board Achievement…	2
College and University…	2
Draw a Person Test	2
Flesch Kincaid Grade Level…	2
Gates MacGinitie Reading Tests	2
Graduate Management Admission…	2
Michigan Test of English…	2
National Assessment of Adult…	2
Nelson Denny Reading Tests	2
New York State Regents…	2
Praxis Series	2
Test of English for…	2
More ▼