Publication Date
In 2025 | 25 |
Since 2024 | 67 |
Since 2021 (last 5 years) | 276 |
Since 2016 (last 10 years) | 688 |
Descriptor
Source
Author
Erberber, Ebru | 10 |
Zhang, Jijun | 8 |
Braeken, Johan | 7 |
Ferraro, David | 7 |
Stearns, Pat | 7 |
Wendt, Heike | 7 |
Strietholt, Rolf | 6 |
Wang, Ke | 6 |
Barmer, Amy | 5 |
Choi, Kyong Mi | 5 |
Dilig, Rita | 5 |
More ▼ |
Publication Type
Education Level
Location
Turkey | 84 |
United States | 83 |
Singapore | 76 |
South Korea | 62 |
Australia | 55 |
Japan | 49 |
Hong Kong | 47 |
South Africa | 41 |
Sweden | 38 |
Norway | 35 |
Taiwan | 32 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 3 |
No Child Left Behind Act 2001 | 3 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Every Student Succeeds Act… | 1 |
Improving Americas Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Liqun Yin; Ummugul Bezirhan; Matthias von Davier – International Electronic Journal of Elementary Education, 2025
This paper introduces an approach that uses latent class analysis to identify cut scores (LCA-CS) and categorize respondents based on context scales derived from largescale assessments like PIRLS, TIMSS, and NAEP. Context scales use Likert scale items to measure latent constructs of interest and classify respondents into meaningful ordered…
Descriptors: Multivariate Analysis, Cutting Scores, Achievement Tests, Foreign Countries
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Ersoy Öz; Okan Bulut; Zuhal Fatma Cellat; Hülya Yürekli – Education and Information Technologies, 2025
Predicting student performance in international large-scale assessments (ILSAs) is crucial for understanding educational outcomes on a global scale. ILSAs, such as the Program for International Student Assessment and the Trends in International Mathematics and Science Study, serve as vital tools for policymakers, educators, and researchers to…
Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment
H. Cigdem Bulut; Okan Bulut; Ashley Clelland – Field Methods, 2025
In this study, we explored psychometric network analysis (PNA) as an alternative method for identifying item wording effects in self-report instruments. We examined the functioning of negatively worded items in the network structures of two math-related scales from the 2019 Trends in International Mathematics and Science Study (TIMSS); Students…
Descriptors: Psychometrics, Network Analysis, Identification, Test Items
Dihao Leng; Ummugul Bezirhan; Lale Khorramdel; Bethany Fishbein; Matthias von Davier – Educational Measurement: Issues and Practice, 2024
This study capitalizes on response and process data from the computer-based TIMSS 2019 Problem Solving and Inquiry tasks to investigate gender differences in test-taking behaviors and their association with mathematics achievement at the eighth grade. Specifically, a recently proposed hierarchical speed-accuracy-revisits (SAR) model was adapted to…
Descriptors: Gender Differences, Test Wiseness, Achievement Tests, Mathematics Tests
Henry Isaiah Braun; Matthias von Davier; Jihang Chen – Large-scale Assessments in Education, 2025
International large-scale assessments (ILSA) are an important source of information for education policymakers across the globe. Despite sponsors' warnings, when results are published, media attention focuses on country rankings and changes in scores. Score changes are evaluated using a two-sided z-statistic, with statistical significance declared…
Descriptors: Educational Trends, Elementary Secondary Education, Foreign Countries, Achievement Tests
Musa Sadak – European Educational Research Journal, 2025
This study focused on the relationships between teacher characteristics and students' mathematics achievement in EU countries, including Hungary, Italy, Lithuania, Malta, Slovenia, Sweden, and Turkey, which are the only EU countries participated in TIMSS 2015 at the eighth-grade level. The data consisted of the sample of 31,969 eighth-grade…
Descriptors: Teacher Characteristics, Predictor Variables, Mathematics Achievement, Foreign Countries
Shelby J. Haberman; Sabine Meinck; Ann-Kristin Koop – Large-scale Assessments in Education, 2024
This paper extends existing work on teacher weighting in student-centered surveys by looking into aspects of practical implementation of deriving and using weights for teacher-centered analysis in the Trends in International Mathematics and Science Study (TIMSS) and the Progress in International Reading Literacy Study (PIRLS). The formal…
Descriptors: Elementary Secondary Education, Foreign Countries, Achievement Tests, Mathematics Achievement
Gulnar Ozyildirim; Engin Karadag – Psychology in the Schools, 2024
Focusing on the variables that can affect both academic achievement and the well-being of students has been crucial for their development, making schools effective and designing educational policy as well as curriculum. The study has aimed to investigate the effect of peer bullying on academic achievement and to determine moderators in the…
Descriptors: Bullying, Academic Achievement, Meta Analysis, Test Results
Ken Ardon – Pioneer Institute for Public Policy Research, 2024
This paper reviews overall student performance as well as the performance of student subgroups on the assessment system developed in response to the Massachusetts Education Reform Act of 1993 (MERA), the Massachusetts Comprehensive Assessment System (MCAS). Comparing students in Massachusetts to students in the rest of the United States or against…
Descriptors: Accuracy, Test Reliability, Elementary Secondary Education, Achievement Tests
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
Yi-Hsin Chen – Applied Measurement in Education, 2024
This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…
Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment
Saskia van Laar; Johan Braeken – International Journal of Testing, 2024
This study examined the impact of two questionnaire characteristics, scale position and questionnaire length, on the prevalence of random responders in the TIMSS 2015 eighth-grade student questionnaire. While there was no support for an absolute effect of questionnaire length, we did find a positive effect for scale position, with an increase of…
Descriptors: Middle School Students, Grade 8, Questionnaires, Test Length
Daniel Kasper; Katrin Schulz-Heidorf; Knut Schwippert – Sociological Methods & Research, 2024
In this article, we extend Liao's test for across-group comparisons of the fixed effects from the generalized linear model to the fixed and random effects of the generalized linear mixed model (GLMM). Using as our basis the Wald statistic, we developed an asymptotic test statistic for across-group comparisons of these effects. The test can be…
Descriptors: Models, Achievement Tests, Foreign Countries, International Assessment