Publication Date
In 2025 | 12 |
Since 2024 | 30 |
Since 2021 (last 5 years) | 80 |
Since 2016 (last 10 years) | 218 |
Since 2006 (last 20 years) | 361 |
Descriptor
Test Validity | 606 |
Models | 372 |
Test Reliability | 262 |
Test Construction | 170 |
Foreign Countries | 168 |
Factor Analysis | 155 |
Structural Equation Models | 129 |
Psychometrics | 99 |
Statistical Analysis | 91 |
Test Items | 80 |
Factor Structure | 77 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
Australia | 15 |
China | 15 |
Turkey | 15 |
Malaysia | 14 |
Germany | 9 |
Indonesia | 9 |
Taiwan | 8 |
Canada | 7 |
Spain | 7 |
Netherlands | 6 |
Italy | 5 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Funda Ugurlu; Filiz Evran Acar – Journal of Pedagogical Research, 2025
The aim of this study is to develop a valid and reliable measurement tool to identify teachers' tendencies towards professional development models. In line with the purpose of scale development, a survey model was preferred. The scale was designed to be applicable to teachers from various disciplines currently working in any institution…
Descriptors: Measures (Individuals), Test Reliability, Test Validity, Faculty Development
Keke Lai – Structural Equation Modeling: A Multidisciplinary Journal, 2024
When a researcher proposes an SEM model to explain the dynamics among some latent variables, the real question in model evaluation is the fit of the model's structural part. A composite index that lumps the fit of the structural part and measurement part does not directly address that question. The need for more attention to structural-level fit…
Descriptors: Goodness of Fit, Structural Equation Models, Statistics, Statistical Distributions
Timothy Teo; Fang Huang; Jinbo He – Interactive Learning Environments, 2024
Given the lack of cultural consideration of studies on digital natives, this study reports on a large-scale validation of the Digital Native Assessment Scale (DNAS) among university students from three regions of Greater China: Chinese mainland, Macau, and Taiwan, to examine measurement invariance and latent mean differences in the four constructs…
Descriptors: Foreign Countries, Digital Literacy, Structural Equation Models, College Students
Benjamin R. Shear; Derek C. Briggs – Asia Pacific Education Review, 2024
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four…
Descriptors: Measurement Techniques, Inferences, COVID-19, Pandemics
Sergio Dominguez-Lara; Mario A. Trógolo; Rodrigo Moreta-Herrera; Diego Vaca-Quintana; Manuel Fernández-Arata; Ana Paredes-Proaño – Journal of Psychoeducational Assessment, 2025
Academic engagement plays a crucial role in students' learning and performance. One of the most popular measures for assessing this construct is the Utrecht Work Engagement Scale for Students (UWES-S), which is based on a tridimensional conceptualization consisting of dedication, vigor, and absorption. However, prior research on its factor…
Descriptors: Learner Engagement, College Students, Foreign Countries, Factor Analysis
Hale Hancer; Suna Tokgoz-Yilmaz – International Journal of Language & Communication Disorders, 2025
Background: Secondary behaviours, which encompass reactions developed due to an individual's fear and stress about stuttering, have the potential to exacerbate the condition. Therefore, self-evaluation of secondary behaviours is significant in the multidimensional approach for people who stutter (PWS). Aim: To determine the validity and…
Descriptors: Stuttering, Causal Models, Influences, Behavior Rating Scales
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Sean N. Weeks; Tyler L. Renshaw; Allysia A. Rainey; Aubrey Hiatt – Journal of Emotional and Behavioral Disorders, 2024
Internalizing and externalizing problems are common targets for school mental health screening. Prior research supports the interpretation of scores from the Youth Internalizing Problems Screener (YIPS) and the Youth Externalizing Problems Screener (YEPS), which were developed separately yet intended as companion measures. We extended previous…
Descriptors: Adolescents, Screening Tests, Behavior Problems, Mental Health
K. K. Mashood; Arvind Kumar; Anwesh Mazumdar – International Journal of Mathematical Education in Science and Technology, 2024
Working with approximations is a common practice in physics. This paper presents an exploratory study of student understanding of some of the elementary aspects of approximations encountered in college physics. For this purpose, a questionnaire (a set of 14 multiple-choice questions) was developed to probe how students dealt with these aspects in…
Descriptors: Physics, Science Instruction, Mathematical Models, Undergraduate Students
Joanna Williamson – Cambridge University Press & Assessment, 2023
There is a lot of interest in providing detailed reports to schools indicating which skills pupils have mastered and which still need development -- and, more broadly, the knowledge, skills and understanding that pupils have acquired and not yet acquired. Cognitive diagnostic assessment is an approach designed to provide this kind of insight.…
Descriptors: Intelligence Tests, Diagnostic Tests, Test Construction, Mastery Learning
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Subarkah, Edi; Kartowagiran, Badrun; Sumarno; Hamdi, Syukrul; Rahim, Abdul – International Journal of Educational Methodology, 2022
This research aims to develop the product of the life skill education program (LSEP) which is accurate, credible, and effective. This research used the Plomp model. The model covers the input, process, output, outcome and consists of instrument, scoring guidance, and good or bad criteria. The instruments used in the model are the questionnaire,…
Descriptors: Daily Living Skills, Questionnaires, Observation, Test Validity