ERIC - Search Results

Publication Date

In 2025	2
Since 2024	13
Since 2021 (last 5 years)	67
Since 2016 (last 10 years)	162
Since 2006 (last 20 years)	365

Descriptor

Comparative Analysis	533
Scoring	533
Foreign Countries	137
Scores	90
Correlation	85
English (Second Language)	80
Test Items	80
Second Language Learning	76
Statistical Analysis	73
Computer Assisted Testing	71
Language Tests	67
Evaluation Methods	61
Teaching Methods	53
Writing Evaluation	53
Item Response Theory	52
Test Construction	50
Accuracy	48
Higher Education	48
Essays	46
Reliability	46
Test Reliability	46
Test Validity	46
Models	45
College Students	44
Elementary School Students	44
More ▼

Publication Type

Journal Articles	363
Reports - Research	348
Reports - Evaluative	102
Speeches/Meeting Papers	47
Tests/Questionnaires	33
Dissertations/Theses -…	25
Reports - Descriptive	20
Numerical/Quantitative Data	12
Collected Works - General	7
Books	6
Guides - General	6
Information Analyses	6
Guides - Non-Classroom	4
Guides - Classroom - Learner	2
Opinion Papers	2
Collected Works - Proceedings	1
Dissertations/Theses -…	1
More ▼

Education Level

Higher Education	89
Postsecondary Education	68
Elementary Education	54
Secondary Education	50
Early Childhood Education	27
Elementary Secondary Education	23
High Schools	20
Middle Schools	17
Primary Education	13
Grade 4	11
Junior High Schools	11
Kindergarten	11
Grade 5	10
Grade 3	9
Grade 6	9
Preschool Education	9
Grade 10	8
Grade 11	8
Grade 2	8
Intermediate Grades	8
Grade 8	7
Grade 7	6
Grade 9	5
Grade 1	4
Grade 12	4
More ▼

Audience

Practitioners	4
Researchers	4
Teachers	4

Location

China	17
Australia	13
Netherlands	9
Taiwan	8
United States	8
Canada	7
Japan	7
United Kingdom	7
Germany	6
Turkey	6
United Kingdom (England)	6
Iran	5
Arizona	4
Florida	4
New York	4
Tennessee	4
California	3
Connecticut	3
Europe	3
Hong Kong	3
India	3
New Hampshire	3
Singapore	3
South Korea	3
Sweden	3
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…	2
No Child Left Behind Act 2001	2

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 533 results Save | Export

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Wechsler Trickle-Down Errors: A Comparison between Master's Students and Doctoral Students

Direct link

Jessica Stinson – ProQuest LLC, 2024

Intelligence tests have been used in the United States since the early 1900s for assessing soldiers during World War I (Kaufman & Harrison, 2008; White & Hall, 1980). Presently, cognitive assessments are used in school, civil service, military, clinical, and industry settings (White & Hall, 1980). Although the results of these…

Descriptors: Graduate Students, Masters Programs, Doctoral Programs, Comparative Analysis

Interpretable Cognitive State Prediction via Temporal Fuzzy Cognitive Map

Peer reviewed

Direct link

Yuang Wei; Bo Jiang – IEEE Transactions on Learning Technologies, 2024

Understanding student cognitive states is essential for assessing human learning. The deep neural networks (DNN)-inspired cognitive state prediction method improved prediction performance significantly; however, the lack of explainability with DNNs and the unitary scoring approach fail to reveal the factors influencing human learning. Identifying…

Descriptors: Cognitive Mapping, Models, Prediction, Short Term Memory

Coherence-Based Automatic Short Answer Scoring Using Sentence Embedding

Peer reviewed

Direct link

Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024

Automatic essay scoring (AES) is an essential educational application in natural language processing. This automated process will alleviate the burden by increasing the reliability and consistency of the assessment. With the advances in text embedding libraries and neural network models, AES systems achieved good results in terms of accuracy.…

Descriptors: Scoring, Essays, Writing Evaluation, Memory

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Direct link

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Examining the Effect of Assessment Construct Characteristics on Machine Learning Scoring of Scientific Argumentation

Peer reviewed

Direct link

Kevin C. Haudek; Xiaoming Zhai – International Journal of Artificial Intelligence in Education, 2024

Argumentation, a key scientific practice presented in the "Framework for K-12 Science Education," requires students to construct and critique arguments, but timely evaluation of arguments in large-scale classrooms is challenging. Recent work has shown the potential of automated scoring systems for open response assessments, leveraging…

Descriptors: Accuracy, Persuasive Discourse, Artificial Intelligence, Learning Management Systems

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Direct link

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Rater Connections and the Detection of Bias in Performance Assessment

Peer reviewed

Direct link

Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2022

In many performance assessments, one or two raters from the complete rater pool scores each performance, resulting in a sparse rating design, where there are limited observations of each rater relative to the complete sample of students. Although sparse rating designs can be constructed to facilitate estimation of student achievement, the…

Descriptors: Evaluators, Bias, Identification, Performance Based Assessment

Comparative Judgement for Evaluating Young Learners' EFL Writing Performances: Reliability and Teacher Perceptions of Holistic and Dimension-Based Judgements

Peer reviewed

Direct link

Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025

Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

Standard Processes. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 36

ProQuest LLC	25
Language Testing	18
ETS Research Report Series	15
Journal of Educational…	15
Educational and Psychological…	13
Journal of Speech, Language,…	11
Applied Measurement in…	10
Language Assessment Quarterly	8
Grantee Submission	6
Journal of Educational and…	6
Applied Psychological…	5
Language, Speech, and Hearing…	5
Online Submission	5
Assessment in Education:…	4
English Language Teaching	4
Ministerial Council on…	4
New Meridian Corporation	4
Reading and Writing: An…	4
Computers & Education	3
Elementary School Journal	3
International Association for…	3
International Educational…	3
International Journal of…	3
International Journal of…	3
Journal of Applied School…	3
More ▼

Attali, Yigal	6
Sinharay, Sandip	4
Wainer, Howard	4
Clauser, Brian E.	3
Donovan, Jenny	3
Kim, Sooyeon	3
Lennon, Melissa	3
Linn, Marcia C.	3
Martin, Michael O., Ed.	3
Sireci, Stephen G.	3
Weiss, David J.	3
Zechner, Klaus	3
Allen, Melissa M.	2
Anderson, Paul S.	2
Baldwin, Peter	2
Barkaoui, Khaled	2
Bejar, Isaac I.	2
Bernstein, Jared	2
Berry, Jessica R.	2
Bertling, Maria	2
Brindle, Mary	2
Cho, Sun-Joo	2
Chuang, Chi-ching	2
Clariana, Roy B.	2
More ▼

National Assessment of…	16
Test of English as a Foreign…	16
Peabody Picture Vocabulary…	8
Program for International…	7
Trends in International…	7
Wechsler Intelligence Scale…	7
SAT (College Admission Test)	6
Graduate Record Examinations	5
Early Childhood Environment…	4
International English…	3
ACT Assessment	2
Clinical Evaluation of…	2
College Board Achievement…	2
College and University…	2
Draw a Person Test	2
Dynamic Indicators of Basic…	2
Flesch Kincaid Grade Level…	2
Michigan Test of English…	2
National Assessment of Adult…	2
Nelson Denny Reading Tests	2
New York State Regents…	2
Woodcock Johnson Tests of…	2
Advanced Placement…	1
Beery Developmental Test of…	1
Beginning Postsecondary…	1
More ▼