Showing 1 to 15 of 28 results
Peer reviewed
Chalmers, R. Philip; Zheng, Guoguo – Applied Measurement in Education, 2023
This article presents generalizations of SIBTEST and crossing-SIBTEST statistics for differential item functioning (DIF) investigations involving more than two groups. After reviewing the original two-group setup for these statistics, a set of multigroup generalizations that support contrast matrices for joint tests of DIF are presented. To…
Descriptors: Test Bias, Test Items, Item Response Theory, Error of Measurement
Peer reviewed
Van Lissa, Caspar J.; van Erp, Sara; Clapper, Eli-Boaz – Research Synthesis Methods, 2023
When meta-analyzing heterogeneous bodies of literature, meta-regression can be used to account for potentially relevant between-studies differences. A key challenge is that the number of candidate moderators is often high relative to the number of studies. This introduces risks of overfitting, spurious results, and model non-convergence. To…
Descriptors: Bayesian Statistics, Regression (Statistics), Maximum Likelihood Statistics, Meta Analysis
Peer reviewed
Eser, Mehmet Taha; Asku, Gökhan – Pegem Journal of Education and Instruction, 2021
The main aim of reliability generalization is to investigate the variability in reliability estimates and to characterize the sources of this variability. As part of the research, a reliability generalization study was carried out on the Beck Depression Inventory-II to investigate potential factors…
Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Error of Measurement
Peer reviewed
Abdulaziz Alshahrani – AILA Review, 2023
The aim of this paper was to evaluate gender differences in the language used in United Nations (UN) General Assembly debates by one male and one female representative each from India, China, the USA, and Indonesia. The critical discourse analysis (CDA) framework of van Dijk (2015) was used along with the 25 discursive devices in this framework.…
Descriptors: Discourse Analysis, Gender Differences, International Organizations, Language Usage
Peer reviewed
Soysal, Sümeyra – Participatory Educational Research, 2023
Applying a measurement instrument developed in a specific country to other countries raises a critical question, especially in cross-cultural studies. Confirmatory factor analysis (CFA) is the most widely used method for examining the cross-cultural applicability of measurement tools. Although CFA is a sophisticated…
Descriptors: Generalization, Cross Cultural Studies, Measurement Techniques, Factor Analysis
Peer reviewed
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Peer reviewed
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Peer reviewed
Brown, Ted; Peres, Lisa – Journal of Occupational Therapy, Schools & Early Intervention, 2018
The "Motor-Free Visual Perception Test-fourth edition" (MVPT-4) is a revised version of the "Motor-Free Visual Perception Test-third edition." The MVPT-4 is used to assess the visual-perceptual ability of individuals aged 4.0 through 80+ years via a series of visual-perceptual tasks that do not require a motor response. Test…
Descriptors: Visual Perception, Vision Tests, Test Validity, Culture Fair Tests
Peer reviewed
Hong, Hwanhee; Chu, Haitao; Zhang, Jing; Carlin, Bradley P. – Research Synthesis Methods, 2016
Bayesian statistical approaches to mixed treatment comparisons (MTCs) are becoming more popular because of their flexibility and interpretability. Many randomized clinical trials report multiple outcomes with possible inherent correlations. Moreover, MTC data are typically sparse (although richer than standard meta-analysis, comparing only two…
Descriptors: Bayesian Statistics, Meta Analysis, Outcomes of Treatment, Comparative Analysis
Peer reviewed
Warne, Russell T. – Journal of Advanced Academics, 2011
Reliability generalization (RG) is a meta-analysis that combines and synthesizes reliability coefficients from different studies to ascertain the average observed reliability across studies. An RG study was conducted on previously reported data from 16 samples of the Overexcitability Questionnaire--Two (OEQII) with a combined "N" of 5,275.…
Descriptors: Measures (Individuals), Error of Measurement, Psychometrics, Generalization
Peer reviewed
Leue, Anja; Lange, Sebastian – Assessment, 2011
The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has gained remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…
Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior
Peer reviewed
Martin, Nancy K.; Sass, Daniel A.; Schmitt, Thomas A. – Teaching and Teacher Education: An International Journal of Research and Studies, 2012
The models presented here posit a complex relationship between efficacy in student engagement and intent-to-leave that is mediated by in-class variables of instructional management, student behavior stressors, aspects of burnout, and job satisfaction. Using data collected from 631 teachers, analyses provided support for the two models that…
Descriptors: Learner Engagement, Teacher Effectiveness, Student Behavior, Job Satisfaction
Peer reviewed
Rupp, Andre A.; Gushta, Matthew; Mislevy, Robert J.; Shaffer, David Williamson – Journal of Technology, Learning, and Assessment, 2010
We are currently at an exciting juncture in developing effective means for assessing so-called 21st-century skills in an innovative yet reliable fashion. One of these avenues leads through the world of "epistemic games" (Shaffer, 2006a), which are games designed to give learners the rich experience of professional practica within a discipline.…
Descriptors: Research Methodology, Educational Research, Evaluation Methods, Educational Games
Peer reviewed
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Peer reviewed
Emons, Wilco H. M. – Applied Psychological Measurement, 2008
Person-fit methods are used to uncover atypical test performance as reflected in the pattern of scores on individual items in a test. Unlike parametric person-fit statistics, nonparametric person-fit statistics do not require fitting a parametric test theory model. This study investigates the effectiveness of generalizations of nonparametric…
Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Goodness of Fit