NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 627 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Gregory Chernov – Evaluation Review, 2025
Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…
Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure
Peer reviewed Peer reviewed
Direct linkDirect link
Geraci, Lisa; Kurpad, Nayantara; Tirso, Robert; Gray, Kathryn N.; Wang, Yan – Metacognition and Learning, 2023
Students often make incorrect predictions about their exam performance, with the lowest-performing students showing the greatest inaccuracies in their predictions. The reasons why low-performing students make inaccurate predictions are not fully understood. In two studies, we tested the hypothesis that low-performing students erroneously predict…
Descriptors: Prediction, Tests, Scores, Low Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Laird, Robert D. – Developmental Psychology, 2020
Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Sam Trejo – Grantee Submission, 2024
Birth weight is a robust predictor of valued life course outcomes, emphasizing the importance of prenatal development. But does birth weight act as a proxy for environmental conditions in utero, or do biological processes surrounding birth weight themselves play a role in healthy development? To answer this question, we leverage variation in birth…
Descriptors: Body Weight, Prenatal Influences, Genetics, Hypothesis Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020
The use of differences scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinue the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019
According to Wollack and Schoenig (2018), score differencing is one of six types of statistical methods used to detect test fraud. In this paper, we suggested the use of Bayes factors (e.g., Kass & Raftery, 1995) for score differencing. A simulation study shows that the suggested approach performs slightly better than an existing frequentist…
Descriptors: Cheating, Deception, Statistical Analysis, Bayesian Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Al-Hilawani, Yasser A. – Educational Studies, 2018
The purpose of this study was to examine the relationship between metacognition as measured in real-life situations and IQ scores as reflected by performance on the Raven Standard Progressive Matrices Scale. It is also intended in this study to report on whether or not there were significant differences in performance on the metacognitive…
Descriptors: Intelligence Quotient, Metacognition, Correlation, Tests
Peer reviewed Peer reviewed
Direct linkDirect link
An, Chen; Braun, Henry; Walsh, Mary E. – Educational Measurement: Issues and Practice, 2018
Making causal inferences from a quasi-experiment is difficult. Sensitivity analysis approaches to address hidden selection bias thus have gained popularity. This study serves as an introduction to a simple but practical form of sensitivity analysis using Monte Carlo simulation procedures. We examine estimated treatment effects for a school-based…
Descriptors: Statistical Inference, Intervention, Program Effectiveness, Quasiexperimental Design
Peer reviewed Peer reviewed
Direct linkDirect link
Chang, Todd P.; Schrager, Sheree M.; Rake, Alyssa J.; Chan, Michael W.; Pham, Phung K.; Christman, Grant – Advances in Health Sciences Education, 2017
Multimedia in assessing clinical decision-making skills (CDMS) has been poorly studied, particularly in comparison to traditional text-based assessments. The literature suggests multimedia is more difficult for trainees. We hypothesize that pediatric residents score lower in diagnostic skill when clinical vignettes use multimedia rather than text…
Descriptors: Medical Students, Pediatrics, Multimedia Materials, Clinical Diagnosis
Peer reviewed Peer reviewed
Direct linkDirect link
Sangwin, Christopher J.; Jones, Ian – Educational Studies in Mathematics, 2017
In this paper we report the results of an experiment designed to test the hypothesis that when faced with a question involving the inverse direction of a reversible mathematical process, students solve a multiple-choice version by verifying the answers presented to them by the direct method, not by undertaking the actual inverse calculation.…
Descriptors: Mathematics Achievement, Mathematics Tests, Multiple Choice Tests, Computer Assisted Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Jeffry White – Journal of Educational Research and Practice, 2024
Violations of normality and homogeneity are common in educational data. When this occurs, the use of parametric statistics may be inappropriate. A generalized form of nonparametric analyses based on the Puri and Sen L statistic provides an alternative approach. Using a chi-square distribution, this technique is easy to apply and has significant…
Descriptors: Nonparametric Statistics, Learning Analytics, Evaluation Methods, Guidance
Peer reviewed Peer reviewed
Direct linkDirect link
Davis, Laurie Laughlin; Kong, Xiaojing; McBride, Yuanyuan; Morrison, Kristin M. – Applied Measurement in Education, 2017
The definition of what it means to take a test online continues to evolve with the inclusion of a broader range of item types and a wide array of devices used by students to access test content. To assure the validity and reliability of test scores for all students, device comparability research should be conducted to evaluate the impact of…
Descriptors: Educational Technology, Technology Uses in Education, High School Students, Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Al-Hadabi, Abddulsalam; Al-soudi, Mabrook Saleh Ali – Educational Research and Reviews, 2020
Preparing pre-service science teachers (PSSTs) with the scientific research skills (SRSs) is an ultimate aim of PSSTs' programs. This study aimed to explore PSSTs' understanding level of SRHs (SRHUL). To this end, an action research (AR) was adopted using a pre-post-test design. In doing so, a multiple choice test which consists of 15 items was…
Descriptors: Scientific Research, Research Skills, Science Process Skills, Hypothesis Testing
Ayodele, Alicia Nicole – ProQuest LLC, 2017
Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…
Descriptors: Statistical Analysis, Test Bias, Test Items, Scores
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  42