Publication Date
In 2025 | 2 |
Since 2024 | 11 |
Since 2021 (last 5 years) | 38 |
Since 2016 (last 10 years) | 66 |
Since 2006 (last 20 years) | 142 |
Descriptor
Source
Author
Publication Type
Education Level
Location
Canada | 8 |
Turkey | 8 |
Australia | 7 |
China | 4 |
United Kingdom | 4 |
United Kingdom (England) | 4 |
Illinois | 3 |
Iran | 3 |
Ohio | 3 |
United Kingdom (Great Britain) | 3 |
California | 2 |
More ▼ |
Laws, Policies, & Programs
Bilingual Education Act 1968 | 2 |
Elementary and Secondary… | 2 |
No Child Left Behind Act 2001 | 2 |
Elementary and Secondary… | 1 |
Head Start | 1 |
Occupational Safety and… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Jeff Coon; Paulina N. Silva; Alexander Etz; Barbara W. Sarnecka – Journal of Cognition and Development, 2025
Bayesian methods offer many advantages when applied to psychological research, yet they may seem esoteric to researchers who are accustomed to traditional methods. This paper aims to lower the barrier of entry for developmental psychologists who are interested in using Bayesian methods. We provide worked examples of how to analyze common study…
Descriptors: Developmental Psychology, Bayesian Statistics, Research Methodology, Psychological Studies
Wesley A. Sims; Rondy Yu; Danielle Zahn – Contemporary School Psychology, 2024
While disruptions to typical education, special education, and psycho-educational service delivery practices in response to the COVID-19 pandemic have dissipated, their impact magnified educational systems' overreliance on evaluations to determine eligibility for special education and related services. Given that the potential for future…
Descriptors: Special Education, COVID-19, Pandemics, Student Evaluation
Austin M. Shin; Ayaan M. Kazerouni – ACM Transactions on Computing Education, 2024
Background and Context: Students' programming projects are often assessed on the basis of their tests as well as their implementations, most commonly using test adequacy criteria like branch coverage, or, in some cases, mutation analysis. As a result, students are implicitly encouraged to use these tools during their development process (i.e., so…
Descriptors: Feedback (Response), Programming, Student Projects, Computer Software
Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023
Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…
Descriptors: Testing, Computation, Classification, Accuracy
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
Daniel McNeish – Grantee Submission, 2023
Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…
Descriptors: Models, Testing, Indexes, Factor Analysis
Walker, Joshua D.; Robinson, Daniel H. – Journal of Experimental Education, 2023
Two-stage testing (TST) involves individual testing followed by taking the same test in teams. Previously, Vogler and Robinson ("The Journal of Experimental Education," 84(4), 787-803, 2016) found that TST facilitated individual performance. The present study addressed methodological limitations in the Vogler and Robinson study in two…
Descriptors: Testing, Undergraduate Students, Test Wiseness, Repetition
Edward C. Bell – Discover Education, 2023
Pharmacy calculations is a course that can be challenging and is often associated with student anxiety about assessments and grades. This study was conducted to determine if student anxiety would be reduced in pharmacy calculations using self-paced, multiple-attempt assessments. Self-paced, multiple-attempt assessments were presented to students…
Descriptors: Pharmacy, Anxiety, Computation, Grades (Scholastic)
Inga Fokken; Ilka Staub; Tobias Vogt – Journal of Teaching in Physical Education, 2024
Purpose: This study aims to investigate how physical education teachers analyze their students' swimming skills. Particular attention is given to information gathering within the diagnostic process. Methods: Data were collected from a quantitative online survey of German physical education teachers from primary and secondary schools (n = 551).…
Descriptors: Physical Education Teachers, Aquatic Sports, Psychomotor Skills, Student Evaluation
Samsa, Gregory – Journal of Curriculum and Teaching, 2021
Objective: Our master's program in biostatistics requires a qualifying examination (QE). A curriculum review led us to question whether to replace a closed-book format with an open-book one. Our goal was to improve the QE. Methods: This is a case study and commentary, where we describe the evolution of the QE, both in its goals and its content.…
Descriptors: Testing, Cooperative Learning, Evaluation Methods, Test Format
Wagner, Inga; Loesche, Philipp; Bißantz, Steven – European Journal of Psychology of Education, 2022
The German school system employs centrally organized performance assessments (some of which are called "VERA") as a way of promoting lesson development. In recent years, several German federal states introduced a computer-based performance testing system which will replace the paper-pencil testing system in the future. Scores from…
Descriptors: Foreign Countries, Computer Assisted Testing, Testing, Evaluation Methods
Simon Vurayai – Educational Practice and Theory, 2024
This study employed the Systematic Review (SR) methodology to examine the content and reasons for resisting the implementation of Continuous Assessment Learning Activities (CALA) in Zimbabwean Secondary schools. The Overcoming Resistance to Change (ORC) model was exploited as the analytical lenses. The study found that factors such as education,…
Descriptors: Foreign Countries, Secondary Education, Program Implementation, Resistance to Change
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis