Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 19 |
Descriptor
Error of Measurement | 28 |
Generalization | 28 |
Meta Analysis | 10 |
Reliability | 10 |
Scores | 10 |
Item Response Theory | 7 |
Models | 5 |
Simulation | 5 |
Comparative Analysis | 4 |
Evaluation Methods | 4 |
Gender Differences | 4 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 28 |
Reports - Research | 19 |
Reports - Evaluative | 8 |
Information Analyses | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Adult Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
United States | 3 |
China | 1 |
Costa Rica | 1 |
Finland | 1 |
India | 1 |
Indonesia | 1 |
Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Beck Depression Inventory | 2 |
Bem Sex Role Inventory | 1 |
Learning Style Inventory | 1 |
Mathematics Anxiety Rating… | 1 |
Myers Briggs Type Indicator | 1 |
Program for International… | 1 |
Teacher Efficacy Scale | 1 |
What Works Clearinghouse Rating
Chalmers, R. Philip; Zheng, Guoguo – Applied Measurement in Education, 2023
This article presents generalizations of SIBTEST and crossing-SIBTEST statistics for differential item functioning (DIF) investigations involving more than two groups. After reviewing the original two-group setup for these statistics, a set of multigroup generalizations that support contrast matrices for joint tests of DIF are presented. To…
Descriptors: Test Bias, Test Items, Item Response Theory, Error of Measurement
Van Lissa, Caspar J.; van Erp, Sara; Clapper, Eli-Boaz – Research Synthesis Methods, 2023
When meta-analyzing heterogeneous bodies of literature, meta-regression can be used to account for potentially relevant between-studies differences. A key challenge is that the number of candidate moderators is often high relative to the number of studies. This introduces risks of overfitting, spurious results, and model non-convergence. To…
Descriptors: Bayesian Statistics, Regression (Statistics), Maximum Likelihood Statistics, Meta Analysis
Eser, Mehmet Taha; Asku, Gökhan – Pegem Journal of Education and Instruction, 2021
The main aim of achieving with the reliability generalization is to investigate the variability related to the reliability estimates and to try to characterize the sources of this variability. As part of the research, a reliability generalization study was carried out on the basis of Beck Depression Inventory-II to investigate potential factors…
Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Error of Measurement
Abdulaziz Alshahrani – AILA Review, 2023
The aim of this paper was to evaluate gender differences in the language used in United Nations (UN) General Assembly debates by one male and one female representative each from India, China, the USA, and Indonesia. The critical discourse analysis (CDA) framework of van Dijk (2015) was used along with the 25 discursive devices in this framework.…
Descriptors: Discourse Analysis, Gender Differences, International Organizations, Language Usage
Soysal, Sümeyra – Participatory Educational Research, 2023
Applying a measurement instrument developed in a specific country to other countries raise a critical and important question of interest in especially cross-cultural studies. Confirmatory factor analysis (CFA) is the most preferred and used method to examine the cross-cultural applicability of measurement tools. Although CFA is a sophisticated…
Descriptors: Generalization, Cross Cultural Studies, Measurement Techniques, Factor Analysis
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Brown, Ted; Peres, Lisa – Journal of Occupational Therapy, Schools & Early Intervention, 2018
The "Motor-Free Visual Perception Test-fourth edition" (MVPT-4) is a revised version of the "Motor-Free Visual Perception Test-third edition." The MVPT-4 is used to assess the visual-perceptual ability of individuals aged 4.0 through 80+ years via a series of visual-perceptual tasks that do not require a motor response. Test…
Descriptors: Visual Perception, Vision Tests, Test Validity, Culture Fair Tests
Hong, Hwanhee; Chu, Haitao; Zhang, Jing; Carlin, Bradley P. – Research Synthesis Methods, 2016
Bayesian statistical approaches to mixed treatment comparisons (MTCs) are becoming more popular because of their flexibility and interpretability. Many randomized clinical trials report multiple outcomes with possible inherent correlations. Moreover, MTC data are typically sparse (although richer than standard meta-analysis, comparing only two…
Descriptors: Bayesian Statistics, Meta Analysis, Outcomes of Treatment, Comparative Analysis
Warne, Russell T. – Journal of Advanced Academics, 2011
Reliability generalization (RG) is a meta-analysis that combines and synthesizes reliability coefficients from different studies to ascertain the average observed reliability across studies. An RG study was conducted on previously reported data from 16 samples of the Overexcitability Questionnaire--Two (OEQII) with a combined "N" of 5,275.…
Descriptors: Measures (Individuals), Error of Measurement, Psychometrics, Generalization
Leue, Anja; Lange, Sebastian – Assessment, 2011
The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…
Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior
Martin, Nancy K.; Sass, Daniel A.; Schmitt, Thomas A. – Teaching and Teacher Education: An International Journal of Research and Studies, 2012
The models presented here posit a complex relationship between efficacy in student engagement and intent-to-leave that is mediated by in-class variables of instructional management, student behavior stressors, aspects of burnout, and job satisfaction. Using data collected from 631 teachers, analyses provided support for the two models that…
Descriptors: Learner Engagement, Teacher Effectiveness, Student Behavior, Job Satisfaction
Rupp, Andre A.; Gushta, Matthew; Mislevy, Robert J.; Shaffer, David Williamson – Journal of Technology, Learning, and Assessment, 2010
We are currently at an exciting juncture in developing effective means for assessing so-called 21st-century skills in an innovative yet reliable fashion. One of these avenues leads through the world of "epistemic games" (Shaffer, 2006a), which are games designed to give learners the rich experience of professional practica within a discipline.…
Descriptors: Research Methodology, Educational Research, Evaluation Methods, Educational Games
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Emons, Wilco H. M. – Applied Psychological Measurement, 2008
Person-fit methods are used to uncover atypical test performance as reflected in the pattern of scores on individual items in a test. Unlike parametric person-fit statistics, nonparametric person-fit statistics do not require fitting a parametric test theory model. This study investigates the effectiveness of generalizations of nonparametric…
Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Goodness of Fit
Previous Page | Next Page »
Pages: 1 | 2