ERIC - Search Results

Showing 1 to 15 of 81 results

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Item Response Theory and Modeling with Stata

Peer reviewed

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Standard Processes. Version 6.17.2020

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Aggregating Polytomous DIF Results over Multiple Test Administrations

Peer reviewed

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018

In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…

Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics

Implementing Confidence Assessment in Low-Stakes, Formative Mathematics Assessments

Peer reviewed

Foster, Colin – International Journal of Science and Mathematics Education, 2022

Confidence assessment (CA) involves students stating alongside each of their answers a confidence rating (e.g. 0 low to 10 high) to express how certain they are that their answer is correct. Each student's score is calculated as the sum of the confidence ratings on the items that they answered correctly, minus the sum of the confidence ratings on…

Descriptors: Mathematics Tests, Mathematics Education, Secondary School Students, Meta Analysis

A Comparative Analysis of the "Early Childhood Environment Rating Scale--Revised" and "Early Childhood Environment Rating Scale, Third Edition"

Peer reviewed

Neitzel, Jennifer; Early, Diane; Sideris, John; LaForrett, Doré; Abel, Michael B.; Soli, Margaret; Davidson, Dawn L.; Haboush-Deloye, Amanda; Hestenes, Linda L.; Jenson, Denise; Johnson, Cindy; Kalas, Jennifer; Mamrak, Angela; Masterson, Marie L.; Mims, Sharon U.; Oya, Patti; Philson, Bobbi; Showalter, Megan; Warner-Richter, Mallory; Kortright Wood, Jill – Journal of Early Childhood Research, 2019

The Early Childhood Environment Rating Scales, including the "Early Childhood Environment Rating Scale--Revised" (Harms et al., 2005) and the "Early Childhood Environment Rating Scale, Third Edition" (Harms et al., 2015) are the most widely used observational assessments in early childhood learning environments. The most recent…

Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Scoring

New Meridian Comparability Review Guidelines. Version 6.17.2020

New Meridian Corporation, 2020

Descriptors: Testing, Standards, Comparative Analysis, Guidelines
