ERIC - Search Results

Showing 1 to 15 of 29 results

An R Package for Optimizing the Composite Reliability in Multivariate Nested Designs

Peer reviewed

Joyce M. W. Moonen-van Loon; Jeroen Donkers – Practical Assessment, Research & Evaluation, 2025

The reliability of assessment tools is critical for accurately monitoring student performance in various educational contexts. When multiple assessments are combined to form an overall evaluation, each assessment serves as a data point contributing to the student's performance within a broader educational framework. Determining composite…

Descriptors: Programming Languages, Reliability, Evaluation Methods, Student Evaluation

Multi-Group Generalizations of SIBTEST and Crossing-SIBTEST

Peer reviewed

Chalmers, R. Philip; Zheng, Guoguo – Applied Measurement in Education, 2023

This article presents generalizations of SIBTEST and crossing-SIBTEST statistics for differential item functioning (DIF) investigations involving more than two groups. After reviewing the original two-group setup for these statistics, a set of multigroup generalizations that support contrast matrices for joint tests of DIF are presented. To…

Descriptors: Test Bias, Test Items, Item Response Theory, Error of Measurement

Selecting Relevant Moderators with Bayesian Regularized Meta-Regression

Peer reviewed

Van Lissa, Caspar J.; van Erp, Sara; Clapper, Eli-Boaz – Research Synthesis Methods, 2023

When meta-analyzing heterogeneous bodies of literature, meta-regression can be used to account for potentially relevant between-studies differences. A key challenge is that the number of candidate moderators is often high relative to the number of studies. This introduces risks of overfitting, spurious results, and model non-convergence. To…

Descriptors: Bayesian Statistics, Regression (Statistics), Maximum Likelihood Statistics, Meta Analysis

Beck Depression Inventory-II: A Study for Meta Analytical Reliability Generalization

Peer reviewed

Eser, Mehmet Taha; Asku, Gökhan – Pegem Journal of Education and Instruction, 2021

The main aim of reliability generalization is to investigate the variability in reliability estimates and to characterize the sources of this variability. As part of the research, a reliability generalization study was carried out on the basis of Beck Depression Inventory-II to investigate potential factors…

Descriptors: Depression (Psychology), Measures (Individuals), Test Reliability, Error of Measurement

Discourse Analysis of Male and Female Representatives of Selected Countries at the United Nations General Debates

Peer reviewed

Abdulaziz Alshahrani – AILA Review, 2023

The aim of this paper was to evaluate gender differences in the language used in United Nations (UN) General Assembly debates by one male and one female representative each from India, China, the USA, and Indonesia. The critical discourse analysis (CDA) framework of van Dijk (2015) was used along with the 25 discursive devices in this framework.…

Descriptors: Discourse Analysis, Gender Differences, International Organizations, Language Usage

Examining Cross-Cultural Applicability via Generalizability Theory

Peer reviewed

Soysal, Sümeyra – Participatory Educational Research, 2023

Applying a measurement instrument developed in a specific country to other countries raises a critical question, especially in cross-cultural studies. Confirmatory factor analysis (CFA) is the most preferred and used method to examine the cross-cultural applicability of measurement tools. Although CFA is a sophisticated…

Descriptors: Generalization, Cross Cultural Studies, Measurement Techniques, Factor Analysis

Examining the Precision of Cut Scores within a Generalizability Theory Framework: A Closer Look at the Item Effect

Peer reviewed

Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020

An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…

Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

A Critical Review of the "Motor-Free Visual Perception Test-Fourth Edition" (MVPT-4)

Peer reviewed

Brown, Ted; Peres, Lisa – Journal of Occupational Therapy, Schools & Early Intervention, 2018

The "Motor-Free Visual Perception Test-fourth edition" (MVPT-4) is a revised version of the "Motor-Free Visual Perception Test-third edition." The MVPT-4 is used to assess the visual-perceptual ability of individuals aged 4.0 through 80+ years via a series of visual-perceptual tasks that do not require a motor response. Test…

Descriptors: Visual Perception, Vision Tests, Test Validity, Culture Fair Tests

A Bayesian Missing Data Framework for Generalized Multiple Outcome Mixed Treatment Comparisons

Peer reviewed

Hong, Hwanhee; Chu, Haitao; Zhang, Jing; Carlin, Bradley P. – Research Synthesis Methods, 2016

Bayesian statistical approaches to mixed treatment comparisons (MTCs) are becoming more popular because of their flexibility and interpretability. Many randomized clinical trials report multiple outcomes with possible inherent correlations. Moreover, MTC data are typically sparse (although richer than standard meta-analysis, comparing only two…

Descriptors: Bayesian Statistics, Meta Analysis, Outcomes of Treatment, Comparative Analysis

A Reliability Generalization of the Overexcitability Questionnaire--Two

Peer reviewed

Warne, Russell T. – Journal of Advanced Academics, 2011

Reliability generalization (RG) is a meta-analysis that combines and synthesizes reliability coefficients from different studies to ascertain the average observed reliability across studies. An RG study was conducted on previously reported data from 16 samples of the Overexcitability Questionnaire--Two (OEQII) with a combined "N" of 5,275.…

Descriptors: Measures (Individuals), Error of Measurement, Psychometrics, Generalization

Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule

Peer reviewed

Leue, Anja; Lange, Sebastian – Assessment, 2011

The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has gained remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…

Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior

Teacher Efficacy in Student Engagement, Instructional Management, Student Stressors, and Burnout: A Theoretical Model Using In-Class Variables to Predict Teachers' Intent-to-Leave

Peer reviewed

Martin, Nancy K.; Sass, Daniel A.; Schmitt, Thomas A. – Teaching and Teacher Education: An International Journal of Research and Studies, 2012

The models presented here posit a complex relationship between efficacy in student engagement and intent-to-leave that is mediated by in-class variables of instructional management, student behavior stressors, aspects of burnout, and job satisfaction. Using data collected from 631 teachers, analyses provided support for the two models that…

Descriptors: Learner Engagement, Teacher Effectiveness, Student Behavior, Job Satisfaction

Evidence-Centered Design of Epistemic Games: Measurement Principles for Complex Learning Environments

Peer reviewed

Rupp, Andre A.; Gushta, Matthew; Mislevy, Robert J.; Shaffer, David Williamson – Journal of Technology, Learning, and Assessment, 2010

We are currently at an exciting juncture in developing effective means for assessing so-called 21st-century skills in an innovative yet reliable fashion. One of these avenues leads through the world of "epistemic games" (Shaffer, 2006a), which are games designed to give learners the rich experience of professional practica within a discipline.…

Descriptors: Research Methodology, Educational Research, Evaluation Methods, Educational Games

A Generalizability Theory Approach to Standard Error Estimates for Bookmark Standard Settings

Peer reviewed

Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008

The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…

Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
