Showing 1 to 15 of 635 results
Peer reviewed
Funda Ugurlu; Filiz Evran Acar – Journal of Pedagogical Research, 2025
The aim of this study is to develop a valid and reliable measurement tool to identify teachers' tendencies towards professional development models. In line with the purpose of scale development, a survey model was preferred. The scale was designed to be applicable to teachers from various disciplines currently working in any institution…
Descriptors: Measures (Individuals), Test Reliability, Test Validity, Faculty Development
Peer reviewed
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Peer reviewed
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Peer reviewed
Sean N. Weeks; Tyler L. Renshaw; Allysia A. Rainey; Aubrey Hiatt – Journal of Emotional and Behavioral Disorders, 2024
Internalizing and externalizing problems are common targets for school mental health screening. Prior research supports the interpretation of scores from the Youth Internalizing Problems Screener (YIPS) and the Youth Externalizing Problems Screener (YEPS), which were developed separately yet intended as companion measures. We extended previous…
Descriptors: Adolescents, Screening Tests, Behavior Problems, Mental Health
Peer reviewed
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Peer reviewed
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Joanna Williamson – Cambridge University Press & Assessment, 2023
There is a lot of interest in providing detailed reports to schools indicating which skills pupils have mastered and which still need development -- and, more broadly, the knowledge, skills and understanding that pupils have acquired and not yet acquired. Cognitive diagnostic assessment is an approach designed to provide this kind of insight.…
Descriptors: Intelligence Tests, Diagnostic Tests, Test Construction, Mastery Learning
Peer reviewed
Subarkah, Edi; Kartowagiran, Badrun; Sumarno; Hamdi, Syukrul; Rahim, Abdul – International Journal of Educational Methodology, 2022
This research aims to develop the product of the life skill education program (LSEP) which is accurate, credible, and effective. This research used the Plomp model. The model covers the input, process, output, outcome and consists of instrument, scoring guidance, and good or bad criteria. The instruments used in the model are the questionnaire,…
Descriptors: Daily Living Skills, Questionnaires, Observation, Test Validity
Peer reviewed
Peter Elsborg; Paulina S. Melby; Mette Kurtzhals; Helene Kirkegaard; Johannes Carl; Steffen Rask; Peter Bentsen; Glen Nielsen – Measurement in Physical Education and Exercise Science, 2024
This study aimed to develop and test MyPL, a questionnaire that measures self-reported physical literacy (PL) among children and adolescents. First, the item pool was developed and adapted, and face validity was tested with cognitive interviewing. Then, factor structures were identified through multidimensional scaling and exploratory factor…
Descriptors: Questionnaires, Children, Early Adolescents, Models
Peer reviewed
Sternberg, Robert J. – Journal of Creative Behavior, 2020
Creativity testing as it is now done is often based on a defective assumption that different kinds of creativity can be compressed into a single unidimensional scale. There is no reason to believe that the different kinds of creativity represent, simply, different amounts of a single unidimensional construct. The article shows how three different…
Descriptors: Creativity Tests, Test Validity, Misconceptions, Models
Peer reviewed
Yan Jin; Jason Fan – Language Assessment Quarterly, 2023
In language assessment, AI technology has been incorporated in task design, assessment delivery, automated scoring of performance-based tasks, score reporting, and provision of feedback. AI technology is also used for collecting and analyzing performance data in language assessment validation. Research has been conducted to investigate the…
Descriptors: Language Tests, Artificial Intelligence, Computer Assisted Testing, Test Format
Peer reviewed
Xie, Chen; Song, Pingping; Hu, Huimin – Asia-Pacific Education Researcher, 2021
Echoing research interests in recent concepts and models of teacher leadership, the focus of existing scales needs to be updated, and their quality needs to be improved. This study summarizes common ground of influential definitions, models, and frameworks for teacher leadership, proposes a six-factor model (association, professional learning,…
Descriptors: Teacher Leadership, Test Construction, Test Validity, Measures (Individuals)
Peer reviewed
Aykan, Ahmet; Elkonca, Fuat – Shanlax International Journal of Education, 2022
This study aimed to develop the "Lesson Study Model Perception Scale" (LSMPS) and to determine the psychometric properties of the scale. The study was designed as a survey model. A survey model is a research approach that aims to describe a situation as it is. When all the findings obtained to determine the psychometric properties of the…
Descriptors: Communities of Practice, Models, Test Construction, Test Validity
Peer reviewed
Beheshti, Shima; Safa, Mohammad Ahmadi – Iranian Journal of Language Teaching Research, 2023
The indefinite nature of test fairness and different interpretations and definitions of the concept have stirred a lot of controversy over the years, necessitating the reconceptualization of the concept. On this basis, this study aimed to explore the empirical validity of Kunnan's (2008) Test Fairness Framework (TFF) and revisit the established…
Descriptors: Test Bias, Equal Education, Grounded Theory, Test Construction
Ge, Yuan – ProQuest LLC, 2022
My dissertation research explored responder behaviors (e.g., demonstrating response styles, carelessness, and possessing misconceptions) that compromise psychometric quality and impact the interpretation and use of assessment results. Identifying these behaviors can help researchers understand and minimize their potentially construct-irrelevant…
Descriptors: Test Wiseness, Response Style (Tests), Item Response Theory, Psychometrics