Showing 1 to 15 of 168 results
Peer reviewed
Su, Kun; Henson, Robert A. – Journal of Educational and Behavioral Statistics, 2023
This article provides a process to carefully evaluate the suitability of a content domain for which diagnostic classification models (DCMs) could be applicable and then optimized steps for constructing a test blueprint for applying DCMs and a real-life example illustrating this process. The content domains were carefully evaluated using a set of…
Descriptors: Classification, Models, Science Tests, Physics
Peer reviewed
Zhan, Peida; Jiao, Hong; Liao, Dandan; Li, Feiming – Journal of Educational and Behavioral Statistics, 2019
Providing diagnostic feedback about growth is crucial to formative decisions such as targeted remedial instructions or interventions. This article proposed a longitudinal higher-order diagnostic classification modeling approach for measuring growth. The new modeling approach is able to provide quantitative values of overall and individual growth…
Descriptors: Classification, Growth Models, Educational Diagnosis, Models
Peer reviewed
Anam Aslam; Sagheer Ahamd; Hans-Stefan Siller; Abida Nasreen – Asia-Pacific Science Education, 2024
Science education is crucial for fostering knowledge across academic disciplines. Past efforts to enhance science achievement at the elementary level have explored various instructional strategies. Among these, the Understanding by Design (UbD) model has shown notable potential in improving science achievement outcomes compared to traditional…
Descriptors: Science Achievement, Science Instruction, Grade 5, Elementary School Students
Peer reviewed
Andrew M. Olney – Grantee Submission, 2023
Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully-controlled…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms
Peer reviewed
Tawil, Muh.; Said, Muhammad Amin; Suryansari, Kemala – International Journal of Education and Practice, 2023
This study aimed to examine authentic models of science assessment in assessing the competence of senior high school students who met the criteria of validity, practicality, and effectiveness. The methodology applied was an evaluation and developmental research. Research was conducted in senior high school in accordance with the needs of the…
Descriptors: Foreign Countries, Performance Based Assessment, Student Evaluation, Science Achievement
Peer reviewed
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Peer reviewed
Gunes Keskin Cevik; Hikmet Surmeli – Journal of Baltic Science Education, 2024
Learning about the Earth through geoscience education is important in order to make informed decisions about the future of the Earth. The aim of this study is to examine students' conceptual development on geoscience subjects based on inquiry-based learning. 7th grade students participated in this study. The researcher prepared lesson plans and…
Descriptors: Earth Science, Science Tests, Heuristics, Scientific Concepts
Peer reviewed
Emily K. Toutkoushian; Kihyun Ryoo – Measurement: Interdisciplinary Research and Perspectives, 2024
The Next Generation Science Standards (NGSS) delineate three interrelated dimensions that describe what students should know and how they should engage in science learning. These present significant challenges for assessment because traditional assessments may not be able to capture the ways in which students engage with content. Science…
Descriptors: Middle School Students, Academic Standards, Science Education, Learner Engagement
Peer reviewed
Hansen, John; Stewart, John – Physical Review Physics Education Research, 2021
This work is the fourth of a series of papers applying multidimensional item response theory (MIRT) to widely used physics conceptual assessments. This study applies MIRT analysis using both exploratory and confirmatory methods to the Brief Electricity and Magnetism Assessment (BEMA) to explore the assessment's structure and to determine a…
Descriptors: Item Response Theory, Science Tests, Energy, Magnets
Peer reviewed
Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023
Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…
Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation
Peer reviewed
Daniel Kasper; Katrin Schulz-Heidorf; Knut Schwippert – Sociological Methods & Research, 2024
In this article, we extend Liao's test for across-group comparisons of the fixed effects from the generalized linear model to the fixed and random effects of the generalized linear mixed model (GLMM). Using as our basis the Wald statistic, we developed an asymptotic test statistic for across-group comparisons of these effects. The test can be…
Descriptors: Models, Achievement Tests, Foreign Countries, International Assessment
Peer reviewed
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Peer reviewed
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021
In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…
Descriptors: Perception, Bias, Theories, Test Items
Peer reviewed
McConnell, Sarah E. A.; Mooney, Christopher J. – Anatomical Sciences Education, 2021
Knowledge of embryology is foundational for understanding normal anatomy and birth defects, yet, embryology is a notoriously difficult subject for medical students. Embryonic lateral folding in particular is one of the most challenging concepts in embryology. Highly effective teaching methods that promote active engagement with dynamic,…
Descriptors: Teaching Methods, Medical Education, Anatomy, Congenital Impairments
Peer reviewed
Yamaguchi, Kazuhiro – Journal of Educational and Behavioral Statistics, 2023
Understanding whether or not different types of students master various attributes can aid future learning remediation. In this study, two-level diagnostic classification models (DCMs) were developed to represent the probabilistic relationship between external latent classes and attribute mastery patterns. Furthermore, variational Bayesian (VB)…
Descriptors: Bayesian Statistics, Classification, Statistical Inference, Sampling