Showing 1 to 15 of 85 results
Peer reviewed
Direct link
Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023
Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…
Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level
Peer reviewed
Direct link
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
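The effort-moderated (EM) scoring idea referenced in the Rios and Deng entry can be sketched in a few lines. This is a generic illustration of the unidimensional EM procedure (rapid-guessing responses, flagged by response time, are treated as not administered), not the authors' simulation code; the thresholds and data below are invented for demonstration.

```python
import numpy as np

# Sketch of effort-moderated (EM) scoring: responses whose response
# time falls below an item-specific threshold are flagged as rapid
# guesses and excluded, so they contribute nothing to the EM score.
# Thresholds and data are illustrative assumptions.
rng = np.random.default_rng(0)

n_items = 10
responses = rng.integers(0, 2, size=n_items)   # 0/1 scored responses
resp_times = rng.uniform(1, 30, size=n_items)  # seconds per item
thresholds = np.full(n_items, 5.0)             # RT threshold per item

effortful = resp_times >= thresholds           # solution behavior
# EM raw score: proportion correct among effortful responses only
em_score = responses[effortful].mean() if effortful.any() else float("nan")
naive_score = responses.mean()                 # ignores rapid guessing

print(f"effortful items: {effortful.sum()}/{n_items}")
print(f"naive proportion correct: {naive_score:.2f}")
print(f"effort-moderated score: {em_score:.2f}")
```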
Peer reviewed
Direct link
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021
This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study with a 3 × 3 design crossing two conditions: group ability at three levels and test difficulty at three levels. The results for…
Descriptors: Equated Scores, Scoring, Test Items, Accuracy
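The NEAT design named in the Dimitrov and Atanasov entry can be illustrated with ordinary chained linear equating. The sketch below shows the design in general, not the latent D-scoring (DSM-L) procedure the article studies, and all summary statistics are assumed.

```python
# Chained linear equating under the NEAT design: group 1 takes form X
# plus anchor V; group 2 takes form Y plus anchor V. Form X scores are
# linked to the anchor scale in group 1, then the anchor scale is
# linked to form Y in group 2.

def linear_link(x_mean, x_sd, y_mean, y_sd):
    """Return a function mapping scores on scale x to scale y."""
    return lambda s: y_mean + (y_sd / x_sd) * (s - x_mean)

# Illustrative summary statistics (assumed, not from the article)
x_mean, x_sd = 22.0, 5.0     # form X, group 1
v1_mean, v1_sd = 10.0, 2.5   # anchor V, group 1
v2_mean, v2_sd = 11.0, 2.8   # anchor V, group 2
y_mean, y_sd = 25.0, 5.5     # form Y, group 2

x_to_v = linear_link(x_mean, x_sd, v1_mean, v1_sd)  # X -> anchor (group 1)
v_to_y = linear_link(v2_mean, v2_sd, y_mean, y_sd)  # anchor -> Y (group 2)

x_score = 28
print(f"form X score {x_score} ~ form Y score {v_to_y(x_to_v(x_score)):.1f}")
```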
Peer reviewed
Direct link
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone for interpreting PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Peer reviewed
Direct link
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
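One common way to quantify the floor effects discussed by Gustafsson and Barakat is to count examinees scoring at or below the level expected from blind guessing on a multiple-choice test. The minimal sketch below runs on simulated data; the test length, number of options, and score distribution are assumptions, not figures from the article.

```python
import numpy as np

# Floor-effect check: compare raw scores with the score expected from
# blind guessing. A large share of examinees at or below the guessing
# level means the test cannot differentiate the lowest performers.
rng = np.random.default_rng(1)
n_items, n_options = 40, 4
chance_score = n_items / n_options   # expected score from pure guessing

scores = rng.binomial(n_items, 0.32, size=5000)  # simulated raw scores

at_or_below = np.mean(scores <= chance_score)
print(f"guessing level: {chance_score:.0f} of {n_items}")
print(f"share of examinees at or below chance: {at_or_below:.1%}")
```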
Peer reviewed
PDF on ERIC
Qiwei He – International Journal of Assessment Tools in Education, 2023
Collaborative problem solving (CPS) is inherently an interactive, conjoint, dual-strand process that considers how a student reasons about a problem as well as how s/he interacts with others to regulate social processes and exchange information (OECD, 2013). Measuring CPS skills presents a challenge for obtaining consistent, accurate, and reliable…
Descriptors: Cooperative Learning, Problem Solving, Test Items, International Assessment
Peer reviewed
Direct link
Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022
The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…
Descriptors: Scoring, Test Items, Test Format, Raw Scores
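The Betts et al. entry notes that several methods exist for computing raw scores on multiple-response items. A brief sketch of three common rules follows; this is an illustrative taxonomy, not necessarily the set of methods the article compares, and the item and response are hypothetical.

```python
# Three common raw-score rules for a multiple-response item:
# all-or-nothing, partial credit per correctly classified option, and
# partial credit with a penalty for incorrect selections (floored at 0).

def score_all_or_nothing(key: set, response: set) -> float:
    return 1.0 if response == key else 0.0

def score_partial(key: set, response: set, n_options: int) -> float:
    # credit for every option classified correctly (selected or not)
    correct = len(key & response) + (n_options - len(key | response))
    return correct / n_options

def score_penalty(key: set, response: set) -> float:
    # +1 per correct selection, -1 per incorrect selection, floor at 0
    raw = len(key & response) - len(response - key)
    return max(raw, 0) / len(key)

key = {"A", "C", "D"}        # hypothetical 5-option item, options A..E
response = {"A", "C", "E"}

print(score_all_or_nothing(key, response))  # 0.0
print(score_partial(key, response, 5))      # 3/5 = 0.6
print(score_penalty(key, response))         # (2 - 1)/3 = 0.33
```

As the next entry (Stoltenberg, 2024) investigates, the choice among such rules can change both individual scores and aggregate results.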
Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests
Peer reviewed
PDF on ERIC
Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023
The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…
Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability
Gregory J. Crowther; Usha Sankar; Leena S. Knight; Deborah L. Myers; Kevin T. Patton; Lekelia D. Jenkins; Thomas A. Knight – Journal of Microbiology & Biology Education, 2023
The biology education literature includes compelling assertions that unfamiliar problems are especially useful for revealing students' true understanding of biology. However, there is only limited evidence that such novel problems have different cognitive requirements than more familiar problems. Here, we sought additional evidence by using…
Descriptors: Science Instruction, Artificial Intelligence, Scoring, Molecular Structure
Peer reviewed
PDF on ERIC
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2020
Mokken models have recently become a preferred method among researchers from different fields in studies of nonparametric item response theory (NIRT). Despite the increasing application of these models, some features of this type of modelling need further study and explanation. Invariant item ordering (IIO) is one of these areas, which the…
Descriptors: Item Response Theory, Test Items, Nonparametric Statistics, Scoring
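The invariant item ordering (IIO) property examined by Dirlik can be checked in its manifest form by comparing item popularities within rest-score groups: IIO requires the overall ordering of the items to hold in every group. The sketch below is a simplified adjacent-pair check on simulated data, not a substitute for dedicated software such as the R package mokken; the data and sample sizes are assumptions.

```python
import numpy as np

# Manifest IIO check in the spirit of Mokken scaling: order items by
# overall popularity, then verify the same ordering holds within every
# rest-score group (rest score excludes the item pair being compared).
rng = np.random.default_rng(2)
n_persons, n_items = 1000, 5
probs = np.array([0.8, 0.65, 0.5, 0.35, 0.2])   # decreasing popularity
X = (rng.random((n_persons, n_items)) < probs).astype(int)

order = np.argsort(-X.mean(axis=0))             # overall item ordering

violations = 0
for pos in range(n_items - 1):
    i, j = order[pos], order[pos + 1]           # adjacent item pair
    rest = X.sum(axis=1) - X[:, i] - X[:, j]    # rest score without pair
    for r in np.unique(rest):
        grp = rest == r
        # IIO requires P(X_i = 1) >= P(X_j = 1) in every rest-score group
        if X[grp, i].mean() < X[grp, j].mean():
            violations += 1

print(f"adjacent-pair IIO violations across rest-score groups: {violations}")
```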
Peer reviewed
PDF on ERIC
Setiawan, Risky – European Journal of Educational Research, 2019
The purposes of this research are: 1) to compare two test equating methods, the Haebara and the Stocking-Lord method; 2) to describe the characteristics of each equating method using the Windows-based IRTEQ program. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination…
Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items
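The Haebara and Stocking-Lord methods compared by Setiawan are both characteristic-curve linking criteria. The sketch below states the two loss functions for the 2PL model and minimizes them numerically; the item parameters and quadrature points are illustrative, and IRTEQ itself offers many more options than this toy version.

```python
import numpy as np
from scipy.optimize import minimize

# Characteristic-curve linking for the 2PL model. New-form parameters
# (a, b) are placed on the old-form scale via a* = a / A, b* = A*b + B.
# Haebara matches item characteristic curves; Stocking-Lord matches the
# test characteristic curve. All parameter values are illustrative.

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-1.7 * a * (theta[:, None] - b)))

theta = np.linspace(-4, 4, 41)   # quadrature points
a_old = np.array([1.0, 1.2, 0.8]); b_old = np.array([-0.5, 0.0, 0.7])
a_new = np.array([0.9, 1.1, 0.7]); b_new = np.array([-0.8, -0.3, 0.4])

def haebara(AB):
    A, B = AB
    diff = p2pl(theta, a_old, b_old) - p2pl(theta, a_new / A, A * b_new + B)
    return np.sum(diff ** 2)   # summed over items and quadrature points

def stocking_lord(AB):
    A, B = AB
    tcc_old = p2pl(theta, a_old, b_old).sum(axis=1)
    tcc_new = p2pl(theta, a_new / A, A * b_new + B).sum(axis=1)
    return np.sum((tcc_old - tcc_new) ** 2)

for crit in (haebara, stocking_lord):
    res = minimize(crit, x0=[1.0, 0.0], method="Nelder-Mead")
    print(f"{crit.__name__}: A = {res.x[0]:.3f}, B = {res.x[1]:.3f}")
```

The design difference is visible in the loss functions: Haebara penalizes item-level curve discrepancies, while Stocking-Lord only penalizes their sum, so the two can disagree when item misfits cancel.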
Peer reviewed
PDF on ERIC
Carlson, James E. – ETS Research Report Series, 2017
In this paper, I consider a set of test items that are located in a multidimensional space, S_M, but lie along a curved line in S_M and can be scaled unidimensionally. Furthermore, I demonstrate a case in which the test items are administered across 6 levels, such as occurs in K-12 assessment across 6 grade…
Descriptors: Test Items, Item Response Theory, Difficulty Level, Scoring
Peer reviewed
Direct link
Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022
A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…
Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items
Peer reviewed
Direct link
Dood, Amber J.; Dood, John C.; Cruz-Ramírez de Arellano, Daniel; Fields, Kimberly B.; Raker, Jeffrey R. – Chemistry Education Research and Practice, 2020
Assessments that aim to evaluate student understanding of chemical reactions and reaction mechanisms should ask students to construct written or oral explanations of mechanistic representations; students can reproduce pictorial mechanism representations with minimal understanding of the meaning of the representations. Grading such assessments is…
Descriptors: Chemistry, Student Evaluation, Regression (Statistics), Logical Thinking
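The Dood et al. entry concerns automated scoring of students' written mechanistic explanations. The general featurize-and-classify approach can be sketched as below; this is a generic bag-of-words logistic regression, not the predictive model the article reports, and the tiny labeled set is invented for demonstration.

```python
# Automated scoring of short written explanations, sketched as a
# bag-of-words text classifier: vectorize the responses, fit a
# logistic regression on human-scored examples, then predict labels
# for new responses. Training data here are invented placeholders.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = [
    "the lone pair on the nucleophile attacks the electrophilic carbon",
    "electrons move from the pi bond to the proton",
    "the arrow goes from A to B",
    "curved arrow points at the product",
]
labels = [1, 1, 0, 0]   # 1 = mechanistic reasoning present, 0 = absent

model = make_pipeline(CountVectorizer(ngram_range=(1, 2)),
                      LogisticRegression())
model.fit(texts, labels)

print(model.predict(["the nucleophile donates its lone pair to the carbon"]))
```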