Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 19 |
Descriptor
Generalization | 54 |
Reliability | 35 |
Scores | 33 |
Meta Analysis | 25 |
Error of Measurement | 9 |
Measures (Individuals) | 9 |
Psychometrics | 9 |
Item Response Theory | 8 |
Evaluation Methods | 6 |
Models | 6 |
Adults | 4 |
More ▼ |
Source
Educational and Psychological… | 54 |
Author
Publication Type
Journal Articles | 54 |
Reports - Research | 30 |
Reports - Evaluative | 16 |
Reports - Descriptive | 5 |
Speeches/Meeting Papers | 2 |
Book/Product Reviews | 1 |
Information Analyses | 1 |
Education Level
Higher Education | 2 |
Elementary Secondary Education | 1 |
Audience
Researchers | 2 |
Location
India | 1 |
United Kingdom | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yongze Xu – Educational and Psychological Measurement, 2024
The questionnaire method has always been an important research method in psychology. The increasing prevalence of multidimensional trait measures in psychological research has led researchers to use longer questionnaires. However, questionnaires that are too long will inevitably reduce the quality of the completed questionnaires and the efficiency…
Descriptors: Item Response Theory, Questionnaires, Generalization, Simulation
Bzdok, Danilo; Varoquaux, Gaël; Thirion, Bertrand – Educational and Psychological Measurement, 2017
Brain-imaging technology has boosted the quantification of neurobiological phenomena underlying human mental operations and their disturbances. Since its inception, drawing inference on neurophysiological effects hinged on classical statistical methods, especially, the general linear model. The tens of thousands of variables per brain scan were…
Descriptors: Neurosciences, Brain, Diagnostic Tests, Statistical Inference
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021
This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…
Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items
Wheeler, Denna L.; Vassar, Matt; Worley, Jody A.; Barnes, Laura L. B. – Educational and Psychological Measurement, 2011
The purpose of this study was to synthesize internal consistency reliability for the subscale scores on the Maslach Burnout Inventory (MBI). The authors addressed three research questions: (a) What is the mean subscale score reliability for the MBI across studies? (b) What factors are associated with observed variance in MBI subscale score…
Descriptors: Burnout, Reliability, Measures (Individuals), Meta Analysis
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
Romano, Jeanine L.; Kromrey, Jeffrey D. – Educational and Psychological Measurement, 2009
This study was conducted to evaluate alternative analysis strategies for the meta-analysis method of reliability generalization when the reliability estimates are not statistically independent. Five approaches to dealing with the violation of independence were implemented: ignoring the violation and treating each observation as independent,…
Descriptors: Reliability, Generalization, Meta Analysis, Correlation
Kulas, John T.; Thompson, Richard C.; Anderson, Michael G. – Educational and Psychological Measurement, 2011
The California Psychological Inventory's Dominance scale was investigated for inconsistencies in item-trait associations across four samples (one American normative and three culturally dissociated manager groupings). The Kim, Cohen, and Park procedure was used, enabling simultaneous multigroup comparison in addition to the traditional…
Descriptors: Personality Traits, Measures (Individuals), Correlation, Prediction
Fidalgo, Angel M.; Madeira, Jaqueline M. – Educational and Psychological Measurement, 2008
Mantel-Haenszel methods comprise a highly flexible methodology for assessing the degree of association between two categorical variables, whether they are nominal or ordinal, while controlling for other variables. The versatility of Mantel-Haenszel analytical approaches has made them very popular in the assessment of the differential functioning…
Descriptors: Test Bias, Statistical Analysis, Generalization, Evaluation Research
Beretvas, S. Natasha; Suizzo, Marie-Anne; Durham, Jennifer A.; Yarnell, Lisa M. – Educational and Psychological Measurement, 2008
The most commonly used measures of locus of control are Rotter's Internality-Externality Scale (I-E) and Nowicki and Strickland's Internality-Externality Scale (NSIE). A reliability generalization study is conducted to explore variability in I-E and NSIE score reliability. Studies are coded for aspects of the scales used (number of response…
Descriptors: Locus of Control, Age, Reliability, Measures (Individuals)
Howell, Ryan T.; Shields, Alan L. – Educational and Psychological Measurement, 2008
Meta-analytic reliability generalizations (RGs) are limited by the scarcity of reliability reporting in primary articles, and currently, RG investigators lack a method to quantify the impact of such nonreporting. This article introduces a stepwise procedure to address this challenge. First, the authors introduce a formula that allows researchers…
Descriptors: Reliability, Meta Analysis, Generalization, Evaluation Methods
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Mason, Corinne; Allam, Reynald; Brannick, Michael T. – Educational and Psychological Measurement, 2007
Reliability generalization studies have provided estimates of the mean reliability coefficients and examined factors that explain the variability in the reliability estimates across studies for many different tests and measures. Different authors have used different data analyses to do such meta-analyses, and little research has addressed whether…
Descriptors: Reliability, Monte Carlo Methods, Meta Analysis, Generalization
Scherbaum, Charles A.; Goldstein, Harold W. – Educational and Psychological Measurement, 2008
Recent research examining racial differences on standardized cognitive tests has focused on the impact of test item difficulty. Studies using data from the SAT and GRE have reported a correlation between item difficulty and differential item functioning (DIF) such that minority test takers are less likely than majority test takers to respond…
Descriptors: Race, Test Items, Standardized Tests, Cognitive Tests
Rexrode, Kathryn R.; Petersen, Suni; O'Toole, Siobhan – Educational and Psychological Measurement, 2008
For more than 20 years, the Ways of Coping Scale (WOCS) has been used extensively to measure coping. Yet beyond the original psychometric data, few studies have reexamined its properties utilizing the enormous body of research generated on the WOCS. Reliability has been assumed to be consistent as an attribute of the test. This study used…
Descriptors: Evaluation Research, Test Reliability, Coping, Measures (Individuals)