Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Sarah K. Anderson; Sevda Ozsezer-Kurnuc; Pinky Jain – British Journal of Educational Studies, 2024
This paper reports on a systematic literature review to understand better methodologies and data collection tools used to judge student teaching effectiveness, ways in which validity and reliability are considered, the processes involved in assessing new teaching effectiveness within teacher education programmes, and how evaluation and results are…
Descriptors: Literature Reviews, Content Analysis, Student Teachers, Teacher Effectiveness
Samuel J. Howarth; Erinn McCreath Frangakis; Steven Hirsch; Diana De Carvalho – Measurement in Physical Education and Exercise Science, 2024
The flexion relaxation ratio (FRR) of the lumbar extensor muscles is often assessed in experimental and clinical studies. This study evaluated within- and between-session test--retest reliability and measurement error for different FRR formulations. Participants completed two identical data collection sessions 1-week apart. Spine flexion and…
Descriptors: Exercise Physiology, Human Body, Pretests Posttests, Error of Measurement
Xijuan Zhang; Hao Wu – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A full structural equation model (SEM) typically consists of both a measurement model (describing relationships between latent variables and observed scale items) and a structural model (describing relationships among latent variables). However, often researchers are primarily interested in testing hypotheses related to the structural model while…
Descriptors: Structural Equation Models, Goodness of Fit, Robustness (Statistics), Factor Structure
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Lientje Maas; Matthew J. Madison; Matthieu J. S. Brinkhuis – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that yield probabilistic classifications of respondents according to a set of discrete latent variables. The current study examines the recently introduced one-parameter log-linear cognitive diagnosis model (1-PLCDM), which has increased interpretability compared with general DCMs due…
Descriptors: Clinical Diagnosis, Classification, Models, Psychometrics
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
Yuhei Kodani; Kazuki Sekine; Yasuhiro Tanaka; Shinsuke Nagami; Katsuya Nakamura; Shinya Fukunaga; Hikaru Nakamura – International Journal of Language & Communication Disorders, 2024
Background: The Scenario Test is recognised for its effectiveness in assessing the interactive aspects of functional communication in people with post-stroke aphasia (PWA). Aims: To develop a Japanese version of the Scenario Test (Scenario Test-JP) and assess its reliability and validity. Methods & Procedures: Among 66 participants, we…
Descriptors: Foreign Countries, Aphasia, Communication Disorders, Translation
Chunhua Liu; Panwang Yang – European Journal of Education, 2024
Student satisfaction in online live classes is considered an important criterion to evaluate the effectiveness of this instructional system. This study aims to develop a performance evaluation index to measure the satisfaction of students who have mastered Chinese language and literature through online live classes. Guided by survey techniques and…
Descriptors: Student Satisfaction, Online Courses, Performance Based Assessment, Chinese
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Chun Sing Maxwell Ho; Jiafang Lu – Journal of Professional Capital and Community, 2024
Purpose: This study aims to develop and validate a scale to measure Teacher Entrepreneurial Behavior (TEB), which encapsulates the behaviors teachers employ to identify and amplify innovation in schools. TEB are catalysts for innovation, navigating their peers through risks and building trust, which empowers the collective to transcend structural…
Descriptors: Teacher Behavior, Entrepreneurship, Innovation, Measures (Individuals)
Abdulkadir Haktanir; M. Furkan Kurnaz; Zeynep Simsir Gökalp – Measurement and Evaluation in Counseling and Development, 2024
Objective: Brief Self-Control Scale (BSCS) is the most widely used instrument to assess self-control. The purpose of this reliability generalization meta-analysis was to examine the degree to which consistency reliability coefficients for scores on the BSCS generalize across age groups and languages. Method: We included studies using the BSCS and…
Descriptors: Self Control, Measures (Individuals), Meta Analysis, Test Reliability
Agostino Cioffi; Silvia Galano; Raffaella Passeggia; Italo Testa – Physical Review Physics Education Research, 2024
The assessment of test anxiety has received increasing attention in educational research due to the potential negative effects of anxiety on student performance. Traditionally, test anxiety scales have been developed for mathematics, but few studies have focused on physics. In this study, we validated two test anxiety scales for undergraduate…
Descriptors: Test Anxiety, Test Validity, Undergraduate Students, Physics
Muhammed Tayyib Kadak; Nihal Serdengeçti; Meryem Seçen Yazici; Tuncay Sandikçi; Aybike Aydin; Zehra Koyuncu; Yavuz Meral; Abas Hasimoglu; Yasin Çaliskan; Gizem Bayraktar; Elif Can Öztürk; Mehmet Enes Gökler; Roula Choueiri; Mahmut Cem Tarakçioglu – Autism: The International Journal of Research and Practice, 2024
This study aims to investigate the validation of the Rapid Interactive Screening Test for Autism in Toddlers (RITA-T) in Turkish toddlers between 18 and 36 months of age. Children aged 18-36 months were referred to the department of child psychiatry for concerns of autism spectrum disorder, language disorder, developmental delay, and typically…
Descriptors: Foreign Countries, Turkish, Screening Tests, Autism Spectrum Disorders
Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024
Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…
Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

Peer reviewed
Direct link
