Publication Date
In 2025 | 3 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 13 |
Since 2016 (last 10 years) | 66 |
Since 2006 (last 20 years) | 95 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Education | 39 |
Secondary Education | 37 |
Grade 3 | 26 |
Grade 4 | 22 |
Grade 5 | 22 |
Middle Schools | 22 |
Primary Education | 21 |
Early Childhood Education | 20 |
Grade 8 | 19 |
Grade 6 | 18 |
Grade 7 | 18 |
More ▼ |
Audience
Parents | 1 |
Policymakers | 1 |
Location
Florida | 11 |
Turkey | 6 |
Texas | 4 |
Wisconsin | 4 |
Hong Kong | 3 |
South Korea | 3 |
Taiwan | 3 |
Australia | 2 |
Belgium | 2 |
Canada | 2 |
Illinois | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024
Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…
Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)
Ulitzsch, Esther; Domingue, Benjamin W.; Kapoor, Radhika; Kanopka, Klint; Rios, Joseph A. – Educational Measurement: Issues and Practice, 2023
Common response-time-based approaches for non-effortful response behavior (NRB) in educational achievement tests filter responses that are associated with response times below some threshold. These approaches are, however, limited in that they require a binary decision on whether a response is classified as stemming from NRB; thus ignoring…
Descriptors: Reaction Time, Responses, Behavior, Achievement Tests
Umut Atasever; Francis L. Huang; Leslie Rutkowski – Large-scale Assessments in Education, 2025
When analyzing large-scale assessments (LSAs) that use complex sampling designs, it is important to account for probability sampling using weights. However, the use of these weights in multilevel models has been widely debated, particularly regarding their application at different levels of the model. Yet, no consensus has been reached on the best…
Descriptors: Mathematics Tests, International Assessment, Elementary Secondary Education, Foreign Countries
Schonberg, Christina – Online Submission, 2023
IXL is an end-to-end teaching and learning solution that engages learners in grades Pre-K through 12 with a comprehensive curriculum and a first-of-its-kind assessment suite. A core component of IXL's assessment suite is the IXL Diagnostic, an interim assessment designed by a team of educators and mathematicians that uses Item Response Theory…
Descriptors: Academic Achievement, Achievement Tests, Computer Uses in Education, Elementary School Students
Liqun Yin; Ummugul Bezirhan; Matthias von Davier – International Electronic Journal of Elementary Education, 2025
This paper introduces an approach that uses latent class analysis to identify cut scores (LCA-CS) and categorize respondents based on context scales derived from largescale assessments like PIRLS, TIMSS, and NAEP. Context scales use Likert scale items to measure latent constructs of interest and classify respondents into meaningful ordered…
Descriptors: Multivariate Analysis, Cutting Scores, Achievement Tests, Foreign Countries
Lyu, Weicong; Kim, Jee-Seon; Suk, Youmi – Journal of Educational and Behavioral Statistics, 2023
This article presents a latent class model for multilevel data to identify latent subgroups and estimate heterogeneous treatment effects. Unlike sequential approaches that partition data first and then estimate average treatment effects (ATEs) within classes, we employ a Bayesian procedure to jointly estimate mixing probability, selection, and…
Descriptors: Hierarchical Linear Modeling, Bayesian Statistics, Causal Models, Statistical Inference
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021
In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…
Descriptors: Perception, Bias, Theories, Test Items
Klingbeil, David A.; Van Norman, Ethan R.; Nelson, Peter M. – Assessment for Effective Intervention, 2021
This direct replication study compared the use of dichotomized likelihood ratios and interval likelihood ratios, derived using a prior sample of students, for predicting math risk in middle school. Data from the prior year state test and the Measures of Academic Progress were analyzed to evaluate differences in the efficiency and diagnostic…
Descriptors: Achievement Tests, Grade 6, Grade 7, At Risk Students
Can, Ömer Sinan; Isleyen, Tevfik – African Educational Research Journal, 2020
The purpose of this study is to investigate the effect of probability teaching with the argumentation approach on the academic achievement of pre-service mathematics teachers and the permanence of probability knowledge. Quantitative research method was adopted in the study and quasi-experimental design was used. The study group consisted of 44…
Descriptors: Preservice Teachers, Mathematics Teachers, Probability, Mathematics Instruction
Chen, Yi-Hsin – Journal of Psychoeducational Assessment, 2022
The quality of diagnostic profiles and probability assignment depends on the validity of the proposed attributes and Q-matrix. The rule-space method (RSM), one of diagnostic classification models, provides the quality indices of diagnostic profiles, such as the classification rate and the squared Mahalanobis distance. The study aims to further…
Descriptors: Profiles, Probability, Classification, Construct Validity
Carly Oddleifson; Stephen Kilgus; David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Ishan N. Vengurlekar – Grantee Submission, 2025
The purpose of this study was to conduct a conceptual replication of Pendergast et al.'s (2018) study that examined the diagnostic accuracy of a nomogram procedure, also known as a naive Bayesian approach. The specific naive Bayesian approach combined academic and social-emotional and behavioral (SEB) screening data to predict student performance…
Descriptors: Bayesian Statistics, Accuracy, Social Emotional Learning, Diagnostic Tests
Benson, Nicholas F.; Beaujean, A. Alexander; Donohue, Ashley; Ward, Emily – Journal of Psychoeducational Assessment, 2018
W scores are used in a number of commercially available tests. Due to their complex nature, it can be hard for applied researchers and practitioners to understand them or even acquire information about them beyond what is provided in technical manuals. In this article, we provide information regarding the background and derivation of W scores that…
Descriptors: Scores, Item Response Theory, Achievement Tests, Cognitive Ability
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Center for Research and Reform in Education, 2020
This brief provides a summary of the validity of Istation ISIP Early Reading scores in predicting students' performance levels on the Idaho Standards Achievement Test (ISAT) in English language arts (ELA). This correlational study analyzed how well third grade students' winter performance on the ISIP Early Reading test predicted their spring…
Descriptors: Reading Programs, Early Reading, Reading Tests, Scores
Hong, Jeehye; Kim, Hyunjung; Hong, Hun-Gi – Asia-Pacific Science Education, 2022
This study explored science-related variables that have an impact on the prediction of science achievement groups by applying the educational data mining (EDM) method of the random forest analysis to extract factors associated with students categorized in three different achievement groups (high, moderate, and low) in the Korean data from the 2015…
Descriptors: Science Achievement, Prediction, Teaching Methods, Science Teachers