Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 25 |
Since 2016 (last 10 years) | 65 |
Since 2006 (last 20 years) | 95 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 59 |
Secondary Education | 41 |
Elementary Education | 38 |
Grade 8 | 36 |
Middle Schools | 32 |
Junior High Schools | 30 |
Grade 4 | 26 |
Intermediate Grades | 22 |
High Schools | 10 |
Grade 9 | 7 |
Grade 3 | 3 |
More ▼ |
Audience
Researchers | 2 |
Policymakers | 1 |
Practitioners | 1 |
Location
Turkey | 13 |
United States | 12 |
Singapore | 7 |
Taiwan | 6 |
Germany | 5 |
Canada | 4 |
South Africa | 4 |
South Korea | 4 |
Finland | 3 |
Hong Kong | 3 |
Massachusetts | 3 |
More ▼ |
Laws, Policies, & Programs
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
H. Cigdem Bulut; Okan Bulut; Ashley Clelland – Field Methods, 2025
In this study, we explored psychometric network analysis (PNA) as an alternative method for identifying item wording effects in self-report instruments. We examined the functioning of negatively worded items in the network structures of two math-related scales from the 2019 Trends in International Mathematics and Science Study (TIMSS); Students…
Descriptors: Psychometrics, Network Analysis, Identification, Test Items
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Saatcioglu, Fatima Munevver; Sen, Sedat – International Journal of Testing, 2023
In this study, we illustrated an application of the confirmatory mixture IRT model for multidimensional tests. We aimed to examine the differences in student performance by domains with a confirmatory mixture IRT modeling approach. A three-dimensional and three-class model was analyzed by assuming content domains as dimensions and cognitive…
Descriptors: Item Response Theory, Foreign Countries, Elementary Secondary Education, Achievement Tests
Julien Corven; Teo Paoletti; Allison L. Gantt – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
We previously (Gantt et al., 2023; Paoletti et al., 2021) identified items from the publicly released TIMSS 2011 assessments that had potential for students to employ covariational reasoning as a solution strategy. In this report, we explore the extent to which fourth-grade students' performance on such items in mathematics differed among 26…
Descriptors: Achievement Tests, Foreign Countries, Mathematics Achievement, Mathematics Tests
Maurice M. W. Cheng – Assessment Matters, 2023
Much of the focus of international comparative studies of students' achievement has been on New Zealand students' falling standards. Using the most recent findings from the Trends in International Mathematics and Science Study (TIMSS), and, in particular, Year 9 New Zealand students' performance in specific items, this article suggests that there…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Science Tests
Oyar, Esra; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2021
The aim of this study is to examine whether or not the positive and negative items in the Mathematical Self-Confidence Scale employed in TIMSS 2015 lead to wording effect. While examining whether the expression effect is present or not, analyzes were conducted both on the general sample and on a separate sample for female and male students. To…
Descriptors: Foreign Countries, International Assessment, Self Concept Measures, Mathematics
Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022
Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…
Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Gübes, Nese; Uyar, Seyma – International Journal of Progressive Education, 2020
This study aims to compare the performance of different small sample equating methods in the presence and absence of differential item functioning (DIF) in common items. In this research, Tucker linear equating, Levine linear equating, unsmoothed and pre-smoothed (C=4) chained equipercentile equating, and simplified circle arc equating methods…
Descriptors: Test Bias, Equated Scores, Test Items, Methods
Gantt, Allison L.; Paoletti, Teo; Corven, Julien – International Journal of Science and Mathematics Education, 2023
Covariational reasoning (or the coordination of two dynamically changing quantities) is central to secondary STEM subjects, but research has yet to fully explore its applicability to elementary and middle-grade levels within various STEM fields. To address this need, we selected a globally referenced STEM assessment--the Trends in International…
Descriptors: Incidence, Abstract Reasoning, Mathematics Education, Science Education
Esin Yilmaz Kogar; Sumeyra Soysal – International Journal of Assessment Tools in Education, 2023
In this paper, it is aimed to evaluate different aspects of students' response time to items in the mathematics test and their test effort as an indicator of test motivation with the help of some variables at the item and student levels. The data consists of 4th-grade Singapore and Turkish students participating in the TIMSS 2019. Response time…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Mathematics Achievement
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Chen, Yi-Hsin – Journal of Psychoeducational Assessment, 2022
The quality of diagnostic profiles and probability assignment depends on the validity of the proposed attributes and Q-matrix. The rule-space method (RSM), one of diagnostic classification models, provides the quality indices of diagnostic profiles, such as the classification rate and the squared Mahalanobis distance. The study aims to further…
Descriptors: Profiles, Probability, Classification, Construct Validity
Ma, Wenchao; de la Torre, Jimmy – Journal of Educational and Behavioral Statistics, 2019
Solving a constructed-response item usually requires successfully performing a sequence of tasks. Each task could involve different attributes, and those required attributes may be "condensed" in various ways to produce the responses. The sequential generalized deterministic input noisy "and" gate model is a general cognitive…
Descriptors: Test Items, Cognitive Measurement, Models, Hypothesis Testing