ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	20
Since 2006 (last 20 years)	27

Descriptor

Simulation	27
Foreign Countries	22
Achievement Tests	21
International Assessment	20
Secondary School Students	20
Test Items	12
Item Response Theory	11
Evaluation Methods	10
Models	7
Accuracy	6
Data Analysis	6
Science Achievement	6
Bayesian Statistics	5
Item Analysis	5
Academic Achievement	4
Comparative Analysis	4
Computation	4
Computer Software	4
Error of Measurement	4
Hierarchical Linear Modeling	4
Mathematics	4
Mathematics Achievement	4
Mathematics Tests	4
Responses	4
Statistical Analysis	4
More ▼

Source

Large-scale Assessments in…	5
Journal of Educational…	4
Journal of Educational and…	3
Grantee Submission	2
International Journal of…	2
Applied Measurement in…	1
Center for American Progress	1
Educational Measurement:…	1
Educational and Psychological…	1
Harvard Education Press	1
International Educational…	1
Journal of Computer Assisted…	1
Journal of Educational Data…	1
Journal of Research in…	1
ProQuest LLC	1
Sociological Methods &…	1
More ▼

Publication Type

Journal Articles	22
Reports - Research	19
Reports - Descriptive	3
Reports - Evaluative	3
Collected Works - Proceedings	1
Collected Works - Serial	1
Dissertations/Theses -…	1

Education Level

Secondary Education	21
Elementary Secondary Education	2
Elementary Education	1
Grade 6	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1

Audience

Location

Finland	1
France	1
Germany	1
Massachusetts	1
Norway	1
Oregon	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	27
Trends in International…	3
Early Childhood Longitudinal…	2
National Assessment of…	2
Law School Admission Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Evaluating German PISA Stratification Designs: A Simulation Study

Peer reviewed

Direct link

Julia Mang; Helmut Küchenhoff; Sabine Meinck – Large-scale Assessments in Education, 2024

Stratification is an important design feature of many studies using complex sampling designs and it is often used in large-scale assessment (LSA) studies, such as the "Programme for International Student Assessment" (PISA), for two main reasons. First, stratification variables that achieve a high between and low within strata variance…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Semi-Automatic Coding of Open-Ended Text Responses in Large-Scale Assessments

Peer reviewed

Direct link

Andersen, Nico; Zehner, Fabian; Goldhammer, Frank – Journal of Computer Assisted Learning, 2023

Background: In the context of large-scale educational assessments, the effort required to code open-ended text responses is considerably more expensive and time-consuming than the evaluation of multiple-choice responses because it requires trained personnel and long manual coding sessions. Aim: Our semi-supervised coding method eco (exploring…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Predictive Performance of Bayesian Stacking in Multilevel Education Data

Peer reviewed

Direct link

Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025

The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…

Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables

Bayesian Historical Borrowing with Longitudinal Large-Scale Assessments

Peer reviewed

Direct link

Kaplan, David; Chen, Jianshen; Lyu, Weicong; Yavuz, Sinan – Large-scale Assessments in Education, 2023

The purpose of this paper is to extend and evaluate methods of "Bayesian historical borrowing" applied to longitudinal data with a focus on parameter recovery and predictive performance. Bayesian historical borrowing allows researchers to utilize information from previous data sources and to adjust the extent of borrowing based on the…

Descriptors: Bayesian Statistics, Longitudinal Studies, Children, Surveys

Bayesian Historical Borrowing with Longitudinal Large-Scale Assessments

Peer reviewed
PDF on ERIC

Download full text

Direct link

David Kaplan; Jianshen Chen; Weicong Lyu; Sinan Yavuz – Grantee Submission, 2023

Descriptors: Bayesian Statistics, Longitudinal Studies, Children, Surveys

A Partial Simulation Study of Phantom Effects in Multilevel Analysis of School Effects: The Case of School Socioeconomic Composition

Peer reviewed

Direct link

Zhou, Hao; Ma, Xin – Sociological Methods & Research, 2023

Hierarchical linear modeling (HLM) is often used to estimate the effects of socioeconomic status (SES) on academic achievement at different levels of an educational system. However, if a prior academic achievement measure is missing in a HLM model, biased estimates may occur on the effects of student SES and school SES. Phantom effects describe…

Descriptors: Simulation, Hierarchical Linear Modeling, Socioeconomic Status, Institutional Characteristics

Sampling Weights in Multilevel Modelling: An Investigation Using PISA Sampling Structures

Peer reviewed

Direct link

Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021

Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…

Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling

Latent Program Modeling: Inferring Latent Problem-Solving Strategies from a PISA Problem-Solving Task

Peer reviewed
PDF on ERIC

Download full text

Lundgren, Erik – Journal of Educational Data Mining, 2022

Response process data have the potential to provide a rich description of test-takers' thinking processes. However, retrieving insights from these data presents a challenge for educational assessments and educational data mining as they are complex and not well annotated. The present study addresses this challenge by developing a computational…

Descriptors: Problem Solving, Classification, Accuracy, Foreign Countries

Comparing Different Trend Estimation Approaches in Country Means and Standard Deviations in International Large-Scale Assessment Studies

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023

One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…

Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests

A More Flexible Bayesian Multilevel Bifactor Item Response Theory Model

Peer reviewed

Direct link

Fujimoto, Ken A. – Journal of Educational Measurement, 2020

Multilevel bifactor item response theory (IRT) models are commonly used to account for features of the data that are related to the sampling and measurement processes used to gather those data. These models conventionally make assumptions about the portions of the data structure that represent these features. Unfortunately, when data violate these…

Descriptors: Bayesian Statistics, Item Response Theory, Achievement Tests, Secondary School Students

A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023

Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…

Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Direct link

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

On the Treatment of Missing Data in Background Questionnaires in Educational Large-Scale Assessments: An Evaluation of Different Procedures

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…

Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference

Identifying Patterns of Students' Performance on Simulated Inquiry Tasks Using PISA 2015 Log-File Data

Peer reviewed

Direct link

Teig, Nani; Scherer, Ronny; Kjaernsli, Marit – Journal of Research in Science Teaching, 2020

Previous research has demonstrated the potential of examining log-file data from computer-based assessments to understand student interactions with complex inquiry tasks. Rather than solely providing information about what has been achieved or the accuracy of student responses ("product data"), students' log files offer additional…

Descriptors: Science Process Skills, Thinking Skills, Inquiry, Simulation

Previous Page | Next Page »

Pages: 1 | 2

Rutkowski, David	3
Rutkowski, Leslie	3
David Kaplan	2
Liaw, Yuan-Ling	2
Lüdtke, Oliver	2
Robitzsch, Alexander	2
Abulela, Mohammed A. A.	1
Adams, Raymond J.	1
Andersen, Nico	1
Berezner, Alla	1
Bolsinova, Maria	1
Cai, Li	1
Chauncey, Caroline T., Ed.	1
Chen, Jianshen	1
Chun Wang	1
De Boeck, Paul	1
Debeer, Dries	1
Fujimoto, Ken A.	1
Goldhammer, Frank	1
Grund, Simon	1
Haag, Nicole	1
Helmut Küchenhoff	1
Janssen, Rianne	1
Jianshen Chen	1
Jing Lu	1
More ▼