NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 106 to 120 of 3,295 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Nicolas Pichot; Boris Forthmann; Eric Bonetto; Thomas Arciszewski; Nathalie Bonnardel; Sara Jaubert; Jean B. Pavani – Journal of Creative Behavior, 2024
The term "creative" is commonly used in everyday language and in academic discourse to discuss the nature of artistic and innovative productions. This usage inherently implies the existence of a variable of creativity that allows different creative works to be compared. The standard definition of creativity asserts that a production must…
Descriptors: Creativity, Test Construction, Test Validity, Productive Thinking
Peer reviewed Peer reviewed
Direct linkDirect link
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel McNeish; Melissa G. Wolf – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Despite the popularity of traditional fit index cutoffs like RMSEA [less than or equal to] 0.06 and CFI [greater than or equal to] 0.95, several studies have noted issues with overgeneralizing traditional cutoffs. Computational methods have been proposed to avoid overgeneralization by deriving cutoffs specifically tailored to the characteristics…
Descriptors: Structural Equation Models, Cutting Scores, Generalizability Theory, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Hyunjung Lee; Heining Cham – Educational and Psychological Measurement, 2024
Determining the number of factors in exploratory factor analysis (EFA) is crucial because it affects the rest of the analysis and the conclusions of the study. Researchers have developed various methods for deciding the number of factors to retain in EFA, but this remains one of the most difficult decisions in the EFA. The purpose of this study is…
Descriptors: Factor Structure, Factor Analysis, Monte Carlo Methods, Goodness of Fit
Peer reviewed Peer reviewed
Direct linkDirect link
Suyoung Kim; Sooyong Lee; Jiwon Kim; Tiffany A. Whittaker – Structural Equation Modeling: A Multidisciplinary Journal, 2024
This study aims to address a gap in the social and behavioral sciences literature concerning interaction effects between latent factors in multiple-group analysis. By comparing two approaches for estimating latent interactions within multiple-group analysis frameworks using simulation studies and empirical data, we assess their relative merits.…
Descriptors: Social Science Research, Behavioral Sciences, Structural Equation Models, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Rebekka Kupffer; Susanne Frick; Eunike Wetzel – Educational and Psychological Measurement, 2024
The multidimensional forced-choice (MFC) format is an alternative to rating scales in which participants rank items according to how well the items describe them. Currently, little is known about how to detect careless responding in MFC data. The aim of this study was to adapt a number of indices used for rating scales to the MFC format and…
Descriptors: Measurement Techniques, Alternative Assessment, Rating Scales, Questionnaires
Peer reviewed Peer reviewed
Direct linkDirect link
Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024
Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…
Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024
The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…
Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Julian F. Lohmann; Steffen Zitzmann; Martin Hecht – Structural Equation Modeling: A Multidisciplinary Journal, 2024
The recently proposed "continuous-time latent curve model with structured residuals" (CT-LCM-SR) addresses several challenges associated with longitudinal data analysis in the behavioral sciences. First, it provides information about process trends and dynamics. Second, using the continuous-time framework, the CT-LCM-SR can handle…
Descriptors: Time Management, Behavioral Science Research, Predictive Validity, Predictor Variables
Peer reviewed Peer reviewed
Direct linkDirect link
Yuejin Zhou; Wenwu Wang; Tao Hu; Tiejun Tong; Zhonghua Liu – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Causal mediation analysis is a popular approach for investigating whether the effect of an exposure on an outcome is through a mediator to better understand the underlying causal mechanism. In recent literature, mediation analysis with multiple mediators has been proposed for continuous and dichotomous outcomes. In contrast, methods for mediation…
Descriptors: Regression (Statistics), Causal Models, Evaluation Methods, Vignettes
Peer reviewed Peer reviewed
Direct linkDirect link
Emily M. Stump; Mark Hughes; N. G. Holmes; Gina Passante – Physical Review Physics Education Research, 2024
Previous research on student thinking about experimental measurement and uncertainty has primarily focused on students' procedural reasoning: Given some data, what should students calculate or do next? This approach, however, cannot tell us what beliefs or conceptual understanding leads to students' procedural decisions. To explore this…
Descriptors: College Students, Mechanics (Physics), Calculus, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Louise Badham – Oxford Review of Education, 2025
Different sources of assessment evidence are reviewed during International Baccalaureate (IB) grade awarding to convert marks into grades and ensure fair results for students. Qualitative and quantitative evidence are analysed to determine grade boundaries, with statistical evidence weighed against examiner judgement and teachers' feedback on…
Descriptors: Advanced Placement Programs, Grading, Interrater Reliability, Evaluative Thinking
Pages: 1  |  ...  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  12  |  ...  |  220