Nazari, Sanaz; Leite, Walter L.; Huggins-Manley, A. Corinne – Educational and Psychological Measurement, 2023
Social desirability bias (SDB) has been a major concern in educational and psychological assessments when measuring latent variables because it has the potential to introduce measurement error and bias in assessments. Person-fit indices can detect bias in the form of misfitted response vectors. The objective of this study was to compare the…
Descriptors: Social Desirability, Bias, Indexes, Goodness of Fit
Nazari, Sanaz; Leite, Walter L.; Huggins-Manley, A. Corinne – Educational and Psychological Measurement, 2024
Social desirability bias (SDB) is a common threat to the validity of conclusions from responses to a scale or survey. There is a wide range of person-fit statistics in the literature that can be employed to detect SDB. In addition, machine learning classifiers, such as logistic regression and random forest, have the potential to distinguish…
Descriptors: Social Desirability, Bias, Artificial Intelligence, Identification
Soland, James – Educational and Psychological Measurement, 2022
Considerable thought is often put into designing randomized control trials (RCTs). From power analyses and complex sampling designs implemented preintervention to nuanced quasi-experimental models used to estimate treatment effects postintervention, RCT design can be quite complicated. Yet when psychological constructs measured using survey scales…
Descriptors: Item Response Theory, Surveys, Scoring, Randomized Controlled Trials
Wind, Stefanie A.; Guo, Wenjing – Educational and Psychological Measurement, 2019
Rater effects, or raters' tendencies to assign ratings to performances that are different from the ratings that the performances warranted, are well documented in rater-mediated assessments across a variety of disciplines. In many real-data studies of rater effects, researchers have reported that raters exhibit more than one effect, such as a…
Descriptors: Evaluators, Bias, Scoring, Data Collection
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Audette, Lillian M.; Hammond, Marie S.; Rochester, Natalie K. – Educational and Psychological Measurement, 2020
Longitudinal studies are commonly used in the social and behavioral sciences to answer a wide variety of research questions. Longitudinal researchers often collect data anonymously from participants when studying sensitive topics to ensure that accurate information is provided. One difficulty gathering longitudinal anonymous data is that of…
Descriptors: Research Methodology, Longitudinal Studies, Research Design, Social Science Research
Bürkner, Paul-Christian; Schulte, Niklas; Holling, Heinz – Educational and Psychological Measurement, 2019
Forced-choice questionnaires have been proposed to avoid common response biases typically associated with rating scale questionnaires. To overcome ipsativity issues of trait scores obtained from classical scoring approaches of forced-choice items, advanced methods from item response theory (IRT) such as the Thurstonian IRT model have been…
Descriptors: Item Response Theory, Measurement Techniques, Questionnaires, Rating Scales
Devlieger, Ines; Mayer, Axel; Rosseel, Yves – Educational and Psychological Measurement, 2016
In this article, an overview is given of four methods to perform factor score regression (FSR), namely regression FSR, Bartlett FSR, the bias avoiding method of Skrondal and Laake, and the bias correcting method of Croon. The bias correcting method is extended to include a reliable standard error. The four methods are compared with each other and…
Descriptors: Regression (Statistics), Comparative Analysis, Structural Equation Models, Monte Carlo Methods
Lai, Emily R.; Wolfe, Edward W.; Vickers, Daisy – Educational and Psychological Measurement, 2015
This report summarizes an empirical study that addresses two related topics within the context of writing assessment--illusory halo and how much unique information is provided by multiple analytic scores. Specifically, we address the issue of whether unique information is provided by analytic scores assigned to student writing, beyond what is…
Descriptors: Writing Tests, Scores, Bias, Holistic Approach
Kam, Chester Chun Seng; Zhou, Mingming – Educational and Psychological Measurement, 2015
Previous research has found the effects of acquiescence to be generally consistent across item "aggregates" within a single survey (i.e., essential tau-equivalence), but it is unknown whether this phenomenon is consistent at the "individual item" level. This article evaluated the often assumed but inadequately tested…
Descriptors: Test Items, Surveys, Criteria, Correlation
Paulhus, Delroy L.; Dubois, Patrick J. – Educational and Psychological Measurement, 2014
The overclaiming technique is a novel assessment procedure that uses signal detection analysis to generate indices of knowledge accuracy (OC-accuracy) and self-enhancement (OC-bias). The technique has previously shown robustness over varied knowledge domains as well as low reactivity across administration contexts. Here we compared the OC-accuracy…
Descriptors: Educational Assessment, Knowledge Level, Accuracy, Cognitive Ability
Balogh, Jennifer; Bernstein, Jared; Cheng, Jian; Van Moere, Alistair; Townshend, Brent; Suzuki, Masanori – Educational and Psychological Measurement, 2012
A two-part experiment is presented that validates a new measurement tool for scoring oral reading ability. Data collected by the U.S. government in a large-scale literacy assessment of adults were analyzed by a system called VersaReader that uses automatic speech recognition and speech processing technologies to score oral reading fluency. In the…
Descriptors: Reading Fluency, Measures (Individuals), Scoring, Reading Ability
Romano, Jeanine L.; Kromrey, Jeffrey D. – Educational and Psychological Measurement, 2009
This study was conducted to evaluate alternative analysis strategies for the meta-analysis method of reliability generalization when the reliability estimates are not statistically independent. Five approaches to dealing with the violation of independence were implemented: ignoring the violation and treating each observation as independent,…
Descriptors: Reliability, Generalization, Meta Analysis, Correlation
Nilsson, Johanna E.; Marszalek, Jacob M.; Linnemeyer, Rachel M.; Bahner, Angela D.; Misialek, Leah Hanson – Educational and Psychological Measurement, 2011
This article describes the development and the initial psychometric evaluation of the Social Issues Advocacy Scale in two studies. In the first study, an exploratory factor analysis (n = 278) revealed a four-factor scale, accounting for 71.4% of the variance, measuring different aspects of social issue advocacy: Political and Social Advocacy,…
Descriptors: Social Problems, Life Satisfaction, Test Validity, Measures (Individuals)

Bruvold, William H. – Educational and Psychological Measurement, 1975
Judges holding divergent attitudes rated two sets of statements regarding uses of water reclaimed from sewage. Results showed a close linear relationship between item scale values obtained from positive and negative attitudinal groups, and a somewhat reduced rating range for judges holding unfavorable personal attitudes toward reuse. (Author/RC)
Descriptors: Attitudes, Bias, Intervals, Measurement