ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	21
Since 2016 (last 10 years)	52
Since 2006 (last 20 years)	79

Descriptor

Achievement Tests	107
Comparative Analysis	107
Test Items	107
Foreign Countries	51
Mathematics Tests	33
Item Analysis	31
International Assessment	30
Difficulty Level	29
Item Response Theory	25
Secondary School Students	23
Test Construction	22
Academic Achievement	21
Science Achievement	21
Science Tests	21
Elementary Secondary Education	20
Reading Tests	19
Mathematics Achievement	18
Scores	18
Test Format	15
Elementary School Students	14
Correlation	13
Test Bias	13
Evaluation Methods	11
Multiple Choice Tests	11
Standardized Tests	11
More ▼

Publication Type

Reports - Research	83
Journal Articles	60
Reports - Evaluative	11
Speeches/Meeting Papers	11
Dissertations/Theses -…	5
Numerical/Quantitative Data	5
Information Analyses	3
Reports - Descriptive	3
Tests/Questionnaires	3
Books	2
Collected Works - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Education Level

Secondary Education	34
Elementary Secondary Education	20
Elementary Education	18
Middle Schools	13
Grade 8	10
Junior High Schools	10
High Schools	9
Grade 4	8
Higher Education	8
Grade 3	7
Grade 7	7
Intermediate Grades	7
Postsecondary Education	7
Early Childhood Education	6
Grade 9	6
Primary Education	6
Grade 5	3
Grade 10	2
Grade 11	1
Grade 12	1
Grade 6	1
More ▼

Audience

Researchers

Location

Turkey	7
Canada	5
Germany	3
Massachusetts	3
Ohio	3
United Kingdom (England)	3
United States	3
Australia	2
Hong Kong	2
Japan	2
Singapore	2
Arkansas	1
Botswana	1
Chile	1
China (Shanghai)	1
Colorado	1
Colorado (Boulder)	1
District of Columbia	1
Georgia Republic	1
Germany (Berlin)	1
Greece	1
Illinois	1
India	1
Indonesia	1
Kansas	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

What Works Clearinghouse Rating

Showing 1 to 15 of 107 results Save | Export

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Direct link

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Establishing Statistical Significance for Comparisons Using Pattern-Based Items: Change at Scale

Peer reviewed
PDF on ERIC

Download full text

Walter M. Stroup; Anthony Petrosino; Corey Brady; Karen Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

Tests of statistical significance often play a decisive role in establishing the empirical warrant of evidence-based research in education. The results from pattern-based assessment items, as introduced in this paper, are categorical and multimodal and do not immediately support the use of measures of central tendency as typically related to…

Descriptors: Statistical Significance, Comparative Analysis, Research Methodology, Evaluation Methods

Assessing, Accommodating, and Guiding English Learners: A Collection of Studies

Direct link

Stephanie B. Moore – ProQuest LLC, 2024

This three-manuscript dissertation attempts to answer the question: "How does students' English language proficiency (ELP) inform the availability, structure, and use of English language accommodations and intervention to support the academic achievement of English learner (EL) students?" The question is addressed using three independent…

Descriptors: English Language Learners, Language Proficiency, English (Second Language), Second Language Learning

Comparison of Disengagement Levels and the Impact of Disengagement on Item Parameters between PISA 2015 and PISA 2018 in the United States

Peer reviewed

Direct link

Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023

Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…

Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Complexity Analysis of Integrated Science Test Item Global Competence on Environmental Sustainability Content

Peer reviewed
PDF on ERIC

Download full text

Lasminawati, Endang; Jumadi; Wilujeng, Insih; Firmanshah, Muhammad Imam – International Society for Technology, Education, and Science, 2022

This qualitative descriptive analysis aims to interpret the complexity of the science items that are integrated with environmental sustainability content. This research was conducted on science test items in the 2013 Curriculum student science textbooks released by the Ministry of Education and Culture of the Republic of Indonesia. This study…

Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment

Beyond Group Comparisons: Accounting for Intersectional Sources of Bias in International Survey Measures

Peer reviewed

Direct link

Rujun Xu; James Soland – International Journal of Testing, 2024

International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…

Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries

Fairness and Comparability in Achievement Motivation Items: A Differential Item Functioning Analysis

Peer reviewed

Direct link

Bialo, Jacquelyn A.; Li, Hongli – Journal of Psychoeducational Assessment, 2022

Achievement motivation is a well-documented predictor of a variety of positive student outcomes. However, given observed group differences in motivation and related outcomes, motivation instruments should be checked for comparable item and scale functioning. Therefore, the purpose of this study was to evaluate measurement scale comparability and…

Descriptors: Student Motivation, Academic Achievement, Item Analysis, Gender Differences

Latent Class Approach to Detect Differential Item Functioning: PISA 2015 Science Sample

Peer reviewed
PDF on ERIC

Download full text

Uyar, Seyma – Eurasian Journal of Educational Research, 2020

Purpose: This study aimed to compare the performance of latent class differential item functioning (DIF) approach and IRT based DIF methods using manifest grouping. With this study, it was thought to draw attention to carry out latent class DIF studies in Turkey. The purpose of this study was to examine DIF in PISA 2015 science data set. Research…

Descriptors: Item Response Theory, Foreign Countries, Cross Cultural Studies, Item Analysis

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

The Unit Testlet Dilemma: PISA Sample

Peer reviewed
PDF on ERIC

Download full text

Ayan, Cansu; Baris Pekmezci, Fulya – International Journal of Assessment Tools in Education, 2021

Testlets have advantages such as making it possible to measure higher-order thinking skills and saving time, which are accepted in the literature. For this reason, they have often been preferred in many implementations from in-class assessments to large-scale assessments. Because of increased usage of testlets, the following questions are…

Descriptors: Foreign Countries, International Assessment, Secondary School Students, Achievement Tests

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Comparison of Difficulty Indices Calculated for Open-Ended Items According to Classical Test Theory and Many Facet Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Ilhan, Mustafa; Guler, Nese – Eurasian Journal of Educational Research, 2018

Purpose: This study aimed to compare difficulty indices calculated for open-ended items in accordance with the classical test theory (CTT) and the Many-Facet Rasch Model (MFRM). Although theoretical differences between CTT and MFRM occupy much space in the literature, the number of studies empirically comparing the two theories is quite limited.…

Descriptors: Difficulty Level, Test Items, Test Theory, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

ProQuest LLC	5
Educational and Psychological…	4
Assessment & Evaluation in…	3
Educational Research and…	3
Eurasian Journal of…	3
International Journal of…	3
Journal of Psychoeducational…	3
Large-scale Assessments in…	3
Partnership for Assessment of…	3
Applied Measurement in…	2
Grantee Submission	2
International Journal of…	2
Journal of Educational…	2
Journal of Educational and…	2
National Center for Education…	2
School Science and Mathematics	2
ACT, Inc.	1
African Journal of Research…	1
Applied Psychological…	1
Assessment for Effective…	1
Assessment in Education:…	1
Australasian Journal of…	1
Biochemistry and Molecular…	1
Cambridge Assessment	1
College Board	1
More ▼

Donlon, Thomas F.	2
Ercikan, Kadriye	2
Goldhammer, Frank	2
Ilhan, Mustafa	2
Kroehne, Ulf	2
Lüdtke, Oliver	2
Nelson, Gena	2
Robitzsch, Alexander	2
Steedle, Jeffrey	2
von Davier, Matthias	2
Abulela, Mohammed A. A.	1
Ahmed, Tamim	1
Akar, Cüneyt	1
Aktas, Elif	1
Albanese, Mark A.	1
Ali, Usama	1
Alpayar, Cagla	1
Anagnostopoulou, Kyriaki	1
Anthony Petrosino	1
Armani Talwar	1
Ayan, Cansu	1
Bacon, Tina P.	1
Baris Pekmezci, Fulya	1
Barry, Carol	1
More ▼

Program for International…	23
Trends in International…	16
Progress in International…	5
California Achievement Tests	4
Iowa Tests of Basic Skills	4
Measures of Academic Progress	3
National Assessment of…	3
ACT Assessment	2
Comprehensive Tests of Basic…	2
International Association for…	2
Metropolitan Achievement Tests	2
SAT (College Admission Test)	2
Sequential Tests of…	2
Stanford Achievement Tests	2
Wide Range Achievement Test	2
Advanced Placement…	1
Bender Gestalt Test	1
California Test of Mental…	1
Child Behavior Checklist	1
Draw a Person Test	1
Iowa Tests of Educational…	1
Kaufman Assessment Battery…	1
Massachusetts Comprehensive…	1
Raven Progressive Matrices	1
Stanford Binet Intelligence…	1
More ▼