Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 68 |
Since 2006 (last 20 years) | 105 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Location
Singapore | 16 |
South Korea | 12 |
Japan | 11 |
Turkey | 11 |
Hong Kong | 9 |
United States | 9 |
Finland | 7 |
Taiwan | 7 |
Australia | 6 |
Germany | 5 |
United Kingdom (England) | 5 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Yanan Feng – ProQuest LLC, 2021
This dissertation aims to investigate the effect size measures of differential item functioning (DIF) detection in the context of cognitive diagnostic models (CDMs). A variety of DIF detection techniques have been developed in the context of CDMs. However, most of the DIF detection procedures focus on the null hypothesis significance test. Few…
Descriptors: Effect Size, Item Response Theory, Cognitive Measurement, Models
Ulrich, Monika – ProQuest LLC, 2023
The National Council of Teachers of Mathematics (NCTM), has made an effort to increase the use of technology and the use of calculators into the classroom and curriculum. As a result, many studies and articles have been written on the subject of calculator use in the classroom. A review of over 600 studies revealed that it is not curricula that…
Descriptors: Calculators, Educational Technology, Technology Uses in Education, Mathematics Education
Suk, Youmi; Kim, Jee-Seon; Kang, Hyunseung – Journal of Educational and Behavioral Statistics, 2021
There has been increasing interest in exploring heterogeneous treatment effects using machine learning (ML) methods such as causal forests, Bayesian additive regression trees, and targeted maximum likelihood estimation. However, there is little work on applying these methods to estimate treatment effects in latent classes defined by…
Descriptors: Artificial Intelligence, Statistical Analysis, Statistical Inference, Classification
Akin Arikan, Cigdem – Eurasian Journal of Educational Research, 2019
Problem Statement: Equating can be defined as a statistical process that allows modifying the differences between test forms with similar content and difficulty so that the scores obtained from these forms can be used interchangeably. In the literature, there are many equating methods, one of which is Kernel equating. Trends in International…
Descriptors: Equated Scores, Foreign Countries, Achievement Tests, International Assessment
Wang, Jianjun; Ma, Xin – Athens Journal of Education, 2019
This rejoinder keeps the original focus on statistical computing pertaining to the correlation of student achievement between mathematics and science from the Trend in Mathematics and Science Study (TIMSS). Albeit the availability of student performance data in TIMSS and the emphasis of the inter-subject connection in the Next Generation Science…
Descriptors: Scores, Correlation, Achievement Tests, Elementary Secondary Education
Yalcin, Seher – Eurasian Journal of Educational Research, 2018
Purpose: Studies in the literature have generally demonstrated that the causes of differential item functioning (DIF) are complex and not directly related to defined groups. The purpose of this study is to determine the DIF according to the mixture item response theory (MixIRT) model, based on the latent group approach, as well as the…
Descriptors: Item Response Theory, Test Items, Test Bias, Error of Measurement
Traynor, Anne – Educational Assessment, 2017
Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…
Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum
Wang, Yan; Kim, Eun Sook; Dedrick, Robert F.; Ferron, John M.; Tan, Tony – Educational and Psychological Measurement, 2018
Wording effects associated with positively and negatively worded items have been found in many scales. Such effects may threaten construct validity and introduce systematic bias in the interpretation of results. A variety of models have been applied to address wording effects, such as the correlated uniqueness model and the correlated traits and…
Descriptors: Test Items, Test Format, Correlation, Construct Validity
Braun, Henry; von Davier, Matthias – Large-scale Assessments in Education, 2017
Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT…
Descriptors: Scores, Test Use, Measurement, Psychometrics
Walkington, Candace; Clinton, Virginia; Shivraj, Pooja – American Educational Research Journal, 2018
The link between reading and mathematics achievement is well known, and an important question is whether readability factors in mathematics problems are differentially impacting student groups. Using 20 years of data from the National Assessment of Educational Progress and the Trends in International Mathematics and Science Study, we examine how…
Descriptors: Readability, Word Problems (Mathematics), Mathematics Instruction, Problem Solving
Casey, Beth M.; Lombardi, Caitlin McPherran; Pollock, Amanda; Fineman, Bonnie; Pezaris, Elizabeth – Journal of Cognition and Development, 2017
This study investigated longitudinal pathways leading from early spatial skills in first-grade girls to their fifth-grade analytical math reasoning abilities (N = 138). First-grade assessments included spatial skills, verbal skills, addition/subtraction skills, and frequency of choice of a decomposition or retrieval strategy on the…
Descriptors: Females, Arithmetic, Mathematics Instruction, Predictor Variables
Li, Hongli; Qin, Qi; Lei, Pui-Wa – Educational Assessment, 2017
In recent years, students' test scores have been used to evaluate teachers' performance. The assumption underlying this practice is that students' test performance reflects teachers' instruction. However, this assumption is generally not empirically tested. In this study, we examine the effect of teachers' instruction on test performance at the…
Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Mathematics Achievement
Daus, Stephan; Braeken, Johan – Large-scale Assessments in Education, 2018
Background: Fair comparisons of educational systems in large-scale assessments can be made only if the differences in curricula have little impact on the outcomes. This study investigated the sensitivity of science achievement rankings to varying degrees of curriculum implementation in the Trends in International Mathematics and Science Study…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, International Assessment
George, Ann Cathrice; Robitzsch, Alexander – Applied Measurement in Education, 2018
This article presents a new perspective on measuring gender differences in the large-scale assessment study Trends in International Science Study (TIMSS). The suggested empirical model is directly based on the theoretical competence model of the domain mathematics and thus includes the interaction between content and cognitive sub-competencies.…
Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests
Traynor, Anne – Applied Measurement in Education, 2017
It has long been argued that U.S. states' differential performance on nationwide assessments may reflect differences in students' opportunity to learn the tested content that is primarily due to variation in curricular content standards, rather than in instructional quality or educational investment. To quantify the effect of differences in…
Descriptors: Test Items, Difficulty Level, State Standards, Academic Standards