NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 322 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jeff Coon; Paulina N. Silva; Alexander Etz; Barbara W. Sarnecka – Journal of Cognition and Development, 2025
Bayesian methods offer many advantages when applied to psychological research, yet they may seem esoteric to researchers who are accustomed to traditional methods. This paper aims to lower the barrier of entry for developmental psychologists who are interested in using Bayesian methods. We provide worked examples of how to analyze common study…
Descriptors: Developmental Psychology, Bayesian Statistics, Research Methodology, Psychological Studies
Peer reviewed Peer reviewed
Direct linkDirect link
Wesley A. Sims; Rondy Yu; Danielle Zahn – Contemporary School Psychology, 2024
While disruptions to typical education, special education, and psycho-educational service delivery practices in response to the COVID-19 pandemic have dissipated, their impact magnified educational systems' overreliance on evaluations to determine eligibility for special education and related services. Given that the potential for future…
Descriptors: Special Education, COVID-19, Pandemics, Student Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Austin M. Shin; Ayaan M. Kazerouni – ACM Transactions on Computing Education, 2024
Background and Context: Students' programming projects are often assessed on the basis of their tests as well as their implementations, most commonly using test adequacy criteria like branch coverage, or, in some cases, mutation analysis. As a result, students are implicitly encouraged to use these tools during their development process (i.e., so…
Descriptors: Feedback (Response), Programming, Student Projects, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023
Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…
Descriptors: Testing, Computation, Classification, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel McNeish – Grantee Submission, 2023
Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…
Descriptors: Models, Testing, Indexes, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Joshua D.; Robinson, Daniel H. – Journal of Experimental Education, 2023
Two-stage testing (TST) involves individual testing followed by taking the same test in teams. Previously, Vogler and Robinson ("The Journal of Experimental Education," 84(4), 787-803, 2016) found that TST facilitated individual performance. The present study addressed methodological limitations in the Vogler and Robinson study in two…
Descriptors: Testing, Undergraduate Students, Test Wiseness, Repetition
Peer reviewed Peer reviewed
Direct linkDirect link
Edward C. Bell – Discover Education, 2023
Pharmacy calculations is a course that can be challenging and is often associated with student anxiety about assessments and grades. This study was conducted to determine if student anxiety would be reduced in pharmacy calculations using self-paced, multiple-attempt assessments. Self-paced, multiple-attempt assessments were presented to students…
Descriptors: Pharmacy, Anxiety, Computation, Grades (Scholastic)
Peer reviewed Peer reviewed
Direct linkDirect link
Inga Fokken; Ilka Staub; Tobias Vogt – Journal of Teaching in Physical Education, 2024
Purpose: This study aims to investigate how physical education teachers analyze their students' swimming skills. Particular attention is given to information gathering within the diagnostic process. Methods: Data were collected from a quantitative online survey of German physical education teachers from primary and secondary schools (n = 551).…
Descriptors: Physical Education Teachers, Aquatic Sports, Psychomotor Skills, Student Evaluation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Samsa, Gregory – Journal of Curriculum and Teaching, 2021
Objective: Our master's program in biostatistics requires a qualifying examination (QE). A curriculum review led us to question whether to replace a closed-book format with an open-book one. Our goal was to improve the QE. Methods: This is a case study and commentary, where we describe the evolution of the QE, both in its goals and its content.…
Descriptors: Testing, Cooperative Learning, Evaluation Methods, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Wagner, Inga; Loesche, Philipp; Bißantz, Steven – European Journal of Psychology of Education, 2022
The German school system employs centrally organized performance assessments (some of which are called "VERA") as a way of promoting lesson development. In recent years, several German federal states introduced a computer-based performance testing system which will replace the paper-pencil testing system in the future. Scores from…
Descriptors: Foreign Countries, Computer Assisted Testing, Testing, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Simon Vurayai – Educational Practice and Theory, 2024
This study employed the Systematic Review (SR) methodology to examine the content and reasons for resisting the implementation of Continuous Assessment Learning Activities (CALA) in Zimbabwean Secondary schools. The Overcoming Resistance to Change (ORC) model was exploited as the analytical lenses. The study found that factors such as education,…
Descriptors: Foreign Countries, Secondary Education, Program Implementation, Resistance to Change
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  22