ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	71

Descriptor

Test Items	83
Foreign Countries	31
Test Construction	30
Item Analysis	29
Mathematics Tests	26
Item Response Theory	24
Evaluation Methods	22
Educational Assessment	20
Elementary Secondary Education	20
Achievement Tests	18
Student Evaluation	18
Test Validity	17
Mathematics Achievement	16
Computer Assisted Testing	15
Measures (Individuals)	15
Science Tests	15
Comparative Analysis	14
Test Bias	14
Academic Standards	13
Benchmarking	13
Educational Testing	13
Test Format	13
Data Analysis	12
Scoring	12
Item Banks	11

Publication Type

Reports - Evaluative	83
Journal Articles	55
Numerical/Quantitative Data	19
Tests/Questionnaires	4
Information Analyses	3
Books	1
Collected Works - Proceedings	1
Guides - General	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	83
Elementary Education	26
Secondary Education	14
Grade 8	11
Grade 5	10
Grade 4	9
Grade 6	7
Grade 7	6
Middle Schools	6
Grade 3	5
Higher Education	5
Grade 2	4
Postsecondary Education	4
Grade 10	3
High Schools	3
Intermediate Grades	3
Junior High Schools	3
Adult Education	2
Grade 9	2
Kindergarten	2
Grade 1	1
Grade 12	1
Primary Education	1

Audience

Policymakers	1
Practitioners	1
Teachers	1

Location

Oregon	8
Australia	5
United States	4
New York	3
Asia	2
California	2
Canada	2
Massachusetts	2
Taiwan	2
Texas	2
United Kingdom	2
Washington	2
Alabama	1
Arkansas	1
China	1
Finland	1
Florida	1
Germany	1
Idaho	1
India	1
Japan	1
Malaysia	1
Nebraska	1
New Mexico	1
North Dakota	1

Laws, Policies, & Programs

Individuals with Disabilities…	9
No Child Left Behind Act 2001	4
Race to the Top	2
Elementary and Secondary…	1
Lau v Nichols	1

Assessments and Surveys

Program for International…	10
Trends in International…	7
National Assessment of…	4
SAT (College Admission Test)	2
Kaufman Test of Educational…	1
Massachusetts Comprehensive…	1
New York State Regents…	1
TerraNova Multiple Assessments	1
Washington Assessment of…	1

Showing 1 to 15 of 83 results

Marginalized Learners in International and Regional Test Data: The Extent of Floor Effects

Peer reviewed

Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023

International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…

Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries

Fused SDT/IRT Models for Mixed-Format Exams

Peer reviewed

Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024

A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…

Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models

A Cost-Benefit Analysis of Automatic Item Generation

Peer reviewed

Kosh, Audra E.; Simpson, Mary Ann; Bickel, Lisa; Kellogg, Mark; Sanford-Moore, Ellie – Educational Measurement: Issues and Practice, 2019

Automatic item generation (AIG)--a means of leveraging technology to create large quantities of items--requires a minimum number of items to offset the sizable upfront investment (i.e., model development and technology deployment) in order to achieve cost savings. In this cost-benefit analysis, we estimated the cost of each step of AIG and manual…

Descriptors: Cost Effectiveness, Automation, Test Items, Mathematics Tests

DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images

Peer reviewed

Sami Baral; Li Lucy; Ryan Knight; Alice Ng; Luca Soldaini; Neil T. Heffernan; Kyle Lo – Grantee Submission, 2024

In real-world settings, vision language models (VLMs) should robustly handle naturalistic, noisy visual content as well as domain-specific language and concepts. For example, K-12 educators using digital learning platforms may need to examine and provide feedback across many images of students' math work. To assess the potential of VLMs to support…

Descriptors: Visual Learning, Visual Perception, Natural Language Processing, Freehand Drawing

Exploring the Prevalence of Covariational Reasoning across Mathematics and Science Using TIMSS 2011 Assessment Items

Peer reviewed

Gantt, Allison L.; Paoletti, Teo; Corven, Julien – International Journal of Science and Mathematics Education, 2023

Covariational reasoning (or the coordination of two dynamically changing quantities) is central to secondary STEM subjects, but research has yet to fully explore its applicability to elementary and middle-grade levels within various STEM fields. To address this need, we selected a globally referenced STEM assessment--the Trends in International…

Descriptors: Incidence, Abstract Reasoning, Mathematics Education, Science Education

A Mixture IRTree Model for Extreme Response Style: Accounting for Response Process Uncertainty

Peer reviewed

Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021

This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…

Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items

A Pragmatic Future for NAEP: Containing Costs and Updating Technologies. Consensus Study Report

Peer reviewed

National Academies Press, 2022

The National Assessment of Educational Progress (NAEP) -- often called "The Nation's Report Card" -- is the largest nationally representative and continuing assessment of what students in public and private schools in the United States know and can do in various subjects and has provided policy makers and the public with invaluable…

Descriptors: Costs, Futures (of Society), National Competency Tests, Educational Trends

A Scoping Review of Empirical Research on Recent Computational Thinking Assessments

Peer reviewed

Cutumisu, Maria; Adams, Cathy; Lu, Chang – Journal of Science Education and Technology, 2019

Computational thinking (CT) is regarded as an essential twenty-first century competency and it is already embedded in K-12 curricula across the globe. However, research on assessing CT has lagged, with few assessments being implemented and validated. Moreover, there is a lack of systematic grouping of CT assessments. This scoping review examines…

Descriptors: Computation, Thinking Skills, 21st Century Skills, Elementary Secondary Education

Methods and Procedures: TIMSS 2019 Technical Report

Martin, Michael O., Ed.; von Davier, Matthias, Ed.; Mullis, Ina V. S., Ed. – International Association for the Evaluation of Educational Achievement, 2020

The chapters in this online volume comprise the TIMSS & PIRLS International Study Center's technical report of the methods and procedures used to develop, implement, and report the results of TIMSS 2019. There were various technical challenges because TIMSS 2019 was the initial phase of the transition to eTIMSS, with approximately half the…

Descriptors: Foreign Countries, Elementary Secondary Education, Achievement Tests, International Assessment

Test Review: Reynolds, C. R., Voress, J. V., Kamphaus, R. W. (2015), "Mathematics Fluency and Calculation Tests (MFaCTs) review." PRO-ED

Peer reviewed

Marbach, Joshua – Journal of Psychoeducational Assessment, 2017

The Mathematics Fluency and Calculation Tests (MFaCTs) are a series of measures designed to assess for arithmetic calculation skills and calculation fluency in children ages 6 through 18. There are five main purposes of the MFaCTs: (1) identifying students who are behind in basic math fact automaticity; (2) evaluating possible delays in arithmetic…

Descriptors: Mathematics Tests, Computation, Mathematics Skills, Arithmetic

A Synthesis of the Peer-Reviewed Differential Bundle Functioning Research

Peer reviewed

Banks, Kathleen – Educational Measurement: Issues and Practice, 2013

The purpose of this article was to present a synthesis of the peer-reviewed differential bundle functioning (DBF) research that has been conducted to date. A total of 16 studies were synthesized according to the following characteristics: tests used and learner groups, organizing principles used for developing bundles, DBF detection methods used,…

Descriptors: Test Bias, Research, Tests, Student Characteristics

How PARCC's False Rigor Stunts the Academic Growth of All Students. White Paper No. 135

McQuillan, Mark; Phelps, Richard P.; Stotsky, Sandra – Pioneer Institute for Public Policy Research, 2015

In July 2010, the Massachusetts Board of Elementary and Secondary Education (BESE) voted to adopt Common Core's standards in English language arts (ELA) and mathematics in place of the state's own standards in these two subjects. The vote was based largely on recommendations by Commissioner of Education Mitchell Chester and then Secretary of…

Descriptors: Reading Tests, Writing Tests, Achievement Tests, Common Core State Standards

Language Effects in International Testing: The Case of PISA 2006 Science Items

Peer reviewed

El Masri, Yasmine H.; Baird, Jo-Anne; Graesser, Art – Assessment in Education: Principles, Policy & Practice, 2016

We investigate the extent to which language versions (English, French and Arabic) of the same science test are comparable in terms of item difficulty and demands. We argue that language is an inextricable part of the scientific literacy construct, be it intended or not by the examiner. This argument has considerable implications on methodologies…

Descriptors: International Assessment, Difficulty Level, Test Items, Language Variation

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

The Interrelations of Features of Questions, Mark Schemes and Examinee Responses and Their Impact upon Marker Agreement

Peer reviewed

Black, Beth; Suto, Irenka; Bramley, Tom – Assessment in Education: Principles, Policy & Practice, 2011

In this paper we develop an evidence-based framework for considering many of the factors affecting marker agreement in GCSEs and A levels. A logical analysis of the demands of the marking task suggests a core grouping comprising: (i) question features; (ii) mark scheme features; and (iii) examinee response features. The framework synthesises…

Descriptors: Interrater Reliability, Grading, Scoring, High Stakes Tests

Source

Behavioral Research and…	8
Applied Measurement in…	4
Measurement:…	4
Ministerial Council on…	4
American Institutes for…	3
Educational Measurement:…	3
Educational and Psychological…	3
International Association for…	3
Journal of Educational…	3
Journal of Psychoeducational…	3
Journal of Science Education…	3
Online Submission	3
Achieve, Inc.	2
Assessment in Education:…	2
Canadian Journal of School…	2
Computers & Education	2
Educational Assessment	2
Educational Research and…	2
Measurement in Physical…	2
National Center for Education…	2
Scandinavian Journal of…	2
Australian Journal of…	1
Brookings Institution	1
Comparative Education Review	1
Educational Studies in…	1

Author

Tindal, Gerald	8
Alonzo, Julie	7
Lai, Cheng Fei	7
Donovan, Jenny	3
Hill, Heather C.	3
Lennon, Melissa	3
Avery, Marybell	2
Banks, Kathleen	2
Blunk, Merrie	2
Dyson, Ben	2
Fisette, Jennifer L.	2
Fox, Connie	2
Franck, Marian	2
Gierl, Mark J.	2
Goffney, Imani Masters	2
Graber, Kim C.	2
Hutton, Penny	2
Morrissey, Noni	2
O'Connor, Gayl	2
Park, Youngsik	2
Placek, Judith H.	2
Raynes, De	2
Rink, Judy	2
Wu, Margaret	2
Zhu, Weimo	2