Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024
As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…
Descriptors: Psychometrics, Ethics, Decision Making, Algorithms
Sung-eun Baek; Christine Myung-hee Ahn – Journal of Psychoeducational Assessment, 2025
The purpose of this study was to evaluate the reliability and validity of the Korean Behavior Assessment System for Children 3rd Edition Teacher Rating Scales--Child Form (K·BASC-3 TRS-C). We used the generalized partial credit model based on item response theory (IRT) to examine the internal validity of the scale's items and the latent trait in a…
Descriptors: Reliability, Teacher Attitudes, Elementary Secondary Education, Asians
Serap Buyukkidik – International Journal of Assessment Tools in Education, 2023
In the current study, differential item functioning (DIF) detection using real data was conducted with the application of "Mantel-Haenszel (MH)", "Simultaneous item bias test (SIBTEST)", "Lord's chi-square", and "Raju's area" methods, both when item purification was carried out and when item purification was…
Descriptors: Language Tests, Test Items, Item Analysis, Gender Differences
Saatcioglu, Fatima Munevver; Sen, Sedat – International Journal of Testing, 2023
In this study, we illustrated an application of the confirmatory mixture IRT model for multidimensional tests. We aimed to examine the differences in student performance by domains with a confirmatory mixture IRT modeling approach. A three-dimensional and three-class model was analyzed by assuming content domains as dimensions and cognitive…
Descriptors: Item Response Theory, Foreign Countries, Elementary Secondary Education, Achievement Tests
Julien Corven; Teo Paoletti; Allison L. Gantt – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
We previously (Gantt et al., 2023; Paoletti et al., 2021) identified items from the publicly released TIMSS 2011 assessments that had potential for students to employ covariational reasoning as a solution strategy. In this report, we explore the extent to which fourth-grade students' performance on such items in mathematics differed among 26…
Descriptors: Achievement Tests, Foreign Countries, Mathematics Achievement, Mathematics Tests
Ferrara, Steve; Steedle, Jeffrey T.; Frantz, Roger S. – Applied Measurement in Education, 2022
Item difficulty modeling studies involve (a) hypothesizing item features, or item response demands, that are likely to predict item difficulty with some degree of accuracy; and (b) entering the features as independent variables into a regression equation or other statistical model to predict difficulty. In this review, we report findings from 13…
Descriptors: Reading Comprehension, Reading Tests, Test Items, Item Response Theory
Chin, Sze Looi; Choy, Ban Heng; Leong, Yew Hoong – Mathematics Education Research Journal, 2022
This article presents a case study on a secondary mathematics teacher, Mary (pseudonym), and her design of a set of instructional tasks in the context of proportional reasoning. In keeping with the way Singapore teachers generally conceive of instructional planning, we investigated the connections between four comparison tasks she designed through…
Descriptors: Instructional Design, Teaching Methods, Mathematics Instruction, Case Studies
Bernhardt, Amery E. – ProQuest LLC, 2022
This quantitative correlational study dives into the heart of understanding the significance of model fidelity for implementing school threat assessment teams. The target population was instructional staff and threat assessment team members from schools in Dutchess, Putnam, and Westchester Counties in New York State that use the Comprehensive…
Descriptors: Evaluation Methods, Educational Environment, Correlation, Fidelity
Kritika Thapa – ProQuest LLC, 2023
Measurement invariance is crucial for making valid comparisons across different groups (Kline, 2016; Vandenberg, 2002). To address the challenges associated with invariance testing such as large sample size requirements, the complexity of the model, etc., applied researchers have incorporated parcels. Parcels have been shown to alleviate skewness,…
Descriptors: Elementary Secondary Education, Achievement Tests, Foreign Countries, International Assessment
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Chen, Yi-Hsin – Journal of Psychoeducational Assessment, 2022
The quality of diagnostic profiles and probability assignment depends on the validity of the proposed attributes and Q-matrix. The rule-space method (RSM), one of the diagnostic classification models, provides quality indices for diagnostic profiles, such as the classification rate and the squared Mahalanobis distance. The study aims to further…
Descriptors: Profiles, Probability, Classification, Construct Validity
Lin, Jing-Wen; Yu, Ruan-Ching – Asia Pacific Journal of Education, 2022
Modelling ability is one of the essential elements of the latest educational reforms, and Trends in International Mathematics and Science Study (TIMSS) is a curriculum-based assessment which allows educational systems worldwide to inspect the curricular influences. The aims of this study were to examine the role of modelling ability in the…
Descriptors: Grade 8, Educational Change, Cross Cultural Studies, Test Items
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We compare the classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Achieve, Inc., 2019
Assessment is a key lever for educational improvement. Assessments can be used to monitor, signal, and influence science teaching and learning -- provided that they are of high quality, reflect the rigor and intent of academic standards, and elicit meaningful student performances. Since the release of "A Framework for K-12 Science…
Descriptors: Difficulty Level, Evaluation Criteria, Cognitive Processes, Test Items
Alhadi, Moosa A. A.; Zhang, Dake; Wang, Ting; Maher, Carolyn A. – North American Chapter of the International Group for the Psychology of Mathematics Education, 2022
This research synthesizes studies that used a Digitalized Interactive Component (DIC) to assess K-12 student mathematics performance during Computer-based-Assessments (CBAs) in mathematics. A systematic search identified ten studies that categorized existing DICs according to the tools that provided language assistance to students and tools that…
Descriptors: Computer Assisted Testing, Mathematics Tests, English Language Learners, Geometry