Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Chenglu Li; Wanli Xing; Walter Leite – Interactive Learning Environments, 2024
As instruction shifts away from traditional approaches, online learning has grown in popularity in K-12 and higher education. Artificial intelligence (AI) and learning analytics methods such as machine learning have been used by educational scholars to support online learners on a large scale. However, the fairness of AI prediction in educational…
Descriptors: Artificial Intelligence, Prediction, Mathematics Achievement, Algorithms
Alisa Remizova; Maksim Rudnev; Eldad Davidov – Sociological Methods & Research, 2024
Individual religiosity measures are used by researchers to describe and compare individuals and societies. However, the cross-cultural comparability of the measures has often been questioned but rarely empirically tested. In the current study, we examined the cross-national measurement invariance properties of generalized individual religiosity in…
Descriptors: Religious Factors, Surveys, Cross Cultural Studies, Social Values
Chunhua Cao; Benjamin Lugu; Jujia Li – Structural Equation Modeling: A Multidisciplinary Journal, 2024
This study examined the false positive (FP) rates and sensitivity of Bayesian fit indices to structural misspecification in Bayesian structural equation modeling. The impact of measurement quality, sample size, model size, the magnitude of misspecified path effect, and the choice or prior on the performance of the fit indices was also…
Descriptors: Structural Equation Models, Bayesian Statistics, Measurement, Error of Measurement
Xiaohui Luo; Yueqin Hu – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Intensive longitudinal data has been widely used to examine reciprocal or causal relations between variables. However, these variables may not be temporally aligned. This study examined the consequences and solutions of the problem of temporal misalignment in intensive longitudinal data based on dynamic structural equation models. First the impact…
Descriptors: Structural Equation Models, Longitudinal Studies, Data Analysis, Causal Models
Jackson, Kayla – ProQuest LLC, 2023
Prior research highlights the benefits of multimode surveys and best practices for item-by-item (IBI) and matrix-type survey items. Some researchers have explored whether mode differences for online and paper surveys persist for these survey item types. However, no studies discuss measurement invariance when both item types and online modes are…
Descriptors: Test Items, Surveys, Error of Measurement, Item Response Theory
Penaloza, Roberto V.; Berends, Mark – Sociological Methods & Research, 2022
To measure "treatment" effects, social science researchers typically rely on nonexperimental data. In education, school and teacher effects on students are often measured through value-added models (VAMs) that are not fully understood. We propose a framework that relates to the education production function in its most flexible form and…
Descriptors: Data, Value Added Models, Error of Measurement, Correlation
Levin, Joel R.; Ferron, John M.; Gafurov, Boris S. – Journal of Education for Students Placed at Risk, 2022
The present simulation study examined the statistical properties (namely, Type I error and statistical power) of various novel randomized single-case multiple-baseline designs and associated randomized-test analyses for comparing the A- to B-phase immediate abrupt outcome changes in two independent intervention conditions. It was found that with…
Descriptors: Statistical Analysis, Error of Measurement, Intervention, Program Effectiveness
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Ryan Derickson – ProQuest LLC, 2022
Item Response Theory (IRT) models are a popular analytic method for self report data. We show how traditional IRT models can be vulnerable to specific kinds of asymmetric measurement error (AME) in self-report data, because the models spread the error to all estimates -- even those of items that do not contribute error. We quantify the impact of…
Descriptors: Item Response Theory, Measurement Techniques, Error of Measurement, Models
John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024
International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024
The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…
Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
Nicolas Pichot; Boris Forthmann; Eric Bonetto; Thomas Arciszewski; Nathalie Bonnardel; Sara Jaubert; Jean B. Pavani – Journal of Creative Behavior, 2024
The term "creative" is commonly used in everyday language and in academic discourse to discuss the nature of artistic and innovative productions. This usage inherently implies the existence of a variable of creativity that allows different creative works to be compared. The standard definition of creativity asserts that a production must…
Descriptors: Creativity, Test Construction, Test Validity, Productive Thinking
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Daniel McNeish; Melissa G. Wolf – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Despite the popularity of traditional fit index cutoffs like RMSEA [less than or equal to] 0.06 and CFI [greater than or equal to] 0.95, several studies have noted issues with overgeneralizing traditional cutoffs. Computational methods have been proposed to avoid overgeneralization by deriving cutoffs specifically tailored to the characteristics…
Descriptors: Structural Equation Models, Cutting Scores, Generalizability Theory, Error of Measurement

Peer reviewed
Direct link
