Publication Date
In 2025 | 1 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 18 |
Since 2016 (last 10 years) | 35 |
Since 2006 (last 20 years) | 67 |
Descriptor
Achievement Tests | 102 |
Simulation | 102 |
Foreign Countries | 42 |
Item Response Theory | 34 |
International Assessment | 29 |
Test Items | 27 |
Secondary School Students | 22 |
Statistical Analysis | 21 |
Comparative Analysis | 20 |
Science Achievement | 17 |
Science Tests | 17 |
More ▼ |
Source
Author
Rutkowski, Leslie | 5 |
Reckase, Mark D. | 3 |
Rutkowski, David | 3 |
Choi, Seung W. | 2 |
Chun Wang | 2 |
David Kaplan | 2 |
DeMars, Christine E. | 2 |
Kim, Dong-In | 2 |
Liaw, Yuan-Ling | 2 |
Luk, HingKwan | 2 |
Lüdtke, Oliver | 2 |
More ▼ |
Publication Type
Education Level
Secondary Education | 29 |
Elementary Secondary Education | 14 |
Elementary Education | 6 |
Higher Education | 6 |
Junior High Schools | 6 |
Middle Schools | 6 |
Postsecondary Education | 6 |
Grade 4 | 5 |
Grade 7 | 5 |
High Schools | 5 |
Grade 8 | 4 |
More ▼ |
Audience
Researchers | 2 |
Practitioners | 1 |
Teachers | 1 |
Location
Turkey | 4 |
Iran | 3 |
Canada | 2 |
Germany | 2 |
Norway | 2 |
Philippines | 2 |
Armenia | 1 |
Austria | 1 |
California | 1 |
Florida | 1 |
Malaysia | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Julia Mang; Helmut Küchenhoff; Sabine Meinck – Large-scale Assessments in Education, 2024
Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the "Programme for International Student Assessment" (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Andersen, Nico; Zehner, Fabian; Goldhammer, Frank – Journal of Computer Assisted Learning, 2023
Background: In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim: Our semi-supervised coding method eco (exploring…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025
The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables
Kyle T. Turner; George Engelhard Jr. – Journal of Experimental Education, 2024
The purpose of this study is to demonstrate clustering methods within a functional data analysis (FDA) framework for identifying subgroups of individuals that may be exhibiting categories of misfit. Person response functions (PRFs) estimated within a FDA framework (FDA-PRFs) provide graphical displays that can aid in the identification of persons…
Descriptors: Data Analysis, Multivariate Analysis, Individual Characteristics, Behavior
Kaplan, David; Chen, Jianshen; Lyu, Weicong; Yavuz, Sinan – Large-scale Assessments in Education, 2023
The purpose of this paper is to extend and evaluate methods of "Bayesian historical borrowing" applied to longitudinal data with a focus on parameter recovery and predictive performance. Bayesian historical borrowing allows researchers to utilize information from previous data sources and to adjust the extent of borrowing based on the…
Descriptors: Bayesian Statistics, Longitudinal Studies, Children, Surveys
David Kaplan; Jianshen Chen; Weicong Lyu; Sinan Yavuz – Grantee Submission, 2023
The purpose of this paper is to extend and evaluate methods of "Bayesian historical borrowing" applied to longitudinal data with a focus on parameter recovery and predictive performance. Bayesian historical borrowing allows researchers to utilize information from previous data sources and to adjust the extent of borrowing based on the…
Descriptors: Bayesian Statistics, Longitudinal Studies, Children, Surveys
Zhou, Hao; Ma, Xin – Sociological Methods & Research, 2023
Hierarchical linear modeling (HLM) is often used to estimate the effects of socioeconomic status (SES) on academic achievement at different levels of an educational system. However, if a prior academic achievement measure is missing in a HLM model, biased estimates may occur on the effects of student SES and school SES. Phantom effects describe…
Descriptors: Simulation, Hierarchical Linear Modeling, Socioeconomic Status, Institutional Characteristics
Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021
Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…
Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling
Lundgren, Erik – Journal of Educational Data Mining, 2022
Response process data have the potential to provide a rich description of test-takers' thinking processes. However, retrieving insights from these data presents a challenge for educational assessments and educational data mining as they are complex and not well annotated. The present study addresses this challenge by developing a computational…
Descriptors: Problem Solving, Classification, Accuracy, Foreign Countries
Giada Spaccapanico Proietti; Mariagiulia Matteucci; Stefania Mignani; Bernard P. Veldkamp – Journal of Educational and Behavioral Statistics, 2024
Classical automated test assembly (ATA) methods assume fixed and known coefficients for the constraints and the objective function. This hypothesis is not true for the estimates of item response theory parameters, which are crucial elements in test assembly classical models. To account for uncertainty in ATA, we propose a chance-constrained…
Descriptors: Automation, Computer Assisted Testing, Ambiguity (Context), Item Response Theory
Reina Karen M. Celestino-Salcedo; Jr. Sotero O. Malayao Jr.; Monera A. Salic-Hairulla; Ellen J. Castro; Ivy Claire V. Mordeno – Journal of Education and Learning (EduLearn), 2024
The challenge of creating reliable technology-based resources for science learning is a perennial challenge in Philippine education, with limited learning materials accessible to all learners. This study is about the development of a videocast embedded with physics education technology (PhET) simulation that served as supplementary learning…
Descriptors: Physics, Science Education, Motion, Scientific Concepts
Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023
One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…
Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests
Fujimoto, Ken A. – Journal of Educational Measurement, 2020
Multilevel bifactor item response theory (IRT) models are commonly used to account for features of the data that are related to the sampling and measurement processes used to gather those data. These models conventionally make assumptions about the portions of the data structure that represent these features. Unfortunately, when data violate these…
Descriptors: Bayesian Statistics, Item Response Theory, Achievement Tests, Secondary School Students
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing
Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023
Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…
Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy