ERIC - Search Results

Showing 1 to 15 of 29 results

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

Estimating Statistical Power When Making Adjustments for Multiple Tests

Peer reviewed

Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016

In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…

Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods

Sample Size Calculations for Precise Interval Estimation of the Eta-Squared Effect Size

Peer reviewed

Shieh, Gwowen – Journal of Experimental Education, 2015

Analysis of variance is one of the most frequently used statistical analyses in the behavioral, educational, and social sciences, and special attention has been paid to the selection and use of an appropriate effect size measure of association in analysis of variance. This article presents the sample size procedures for precise interval estimation…

Descriptors: Statistical Analysis, Sample Size, Computation, Effect Size

Assessing Retest Effects at the Individual Level: A General IRT-Based Approach

Peer reviewed

Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2015

Test-retest studies for assessing stability and change are widely used in different domains and allow improved or additional individual estimates of interest to be obtained. However, if these estimates are to be validly interpreted the responses given at Time-2 must be free of retest effects, and the fulfilment of this assumption must be…

Descriptors: Item Response Theory, Evaluation Methods, Responses, Testing

Optimizing Partial Credit Algorithms to Predict Student Performance

Ostrow, Korinn; Donnelly, Chistopher; Heffernan, Neil – International Educational Data Mining Society, 2015

As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…

Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits

Use of Item Parceling in Structural Equation Modeling with Missing Data

Orcan, Fatih – ProQuest LLC, 2013

Parceling is referred to as a procedure for computing sums or average scores across multiple items. Parcels instead of individual items are then used as indicators of latent factors in the structural equation modeling analysis (Bandalos 2002, 2008; Little et al., 2002; Yang, Nay, & Hoyle, 2010). Item parceling may be applied to alleviate some…

Descriptors: Structural Equation Models, Evaluation Methods, Simulation, Sample Size

Comparing Performance of Methods to Deal with Differential Attrition in Lottery Based Evaluations

Peer reviewed

Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey – Society for Research on Educational Effectiveness, 2016

The purpose of this study is to examine the performance of different methods (inverse probability weighting and estimation of informative bounds) to control for differential attrition by comparing the results of these methods using two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…

Descriptors: Data Analysis, Student Attrition, Evaluation Methods, Evaluation Research

Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

Peer reviewed

Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015

When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

Descriptors: Competence, Tests, Evaluation Methods, Adults

Mixture Factor Analysis for Approximating a Nonnormally Distributed Continuous Latent Factor with Continuous and Dichotomous Observed Variables

Peer reviewed

Wall, Melanie M.; Guo, Jia; Amemiya, Yasuo – Multivariate Behavioral Research, 2012

Mixture factor analysis is examined as a means of flexibly estimating nonnormally distributed continuous latent factors in the presence of both continuous and dichotomous observed variables. A simulation study compares mixture factor analysis with normal maximum likelihood (ML) latent factor modeling. Different results emerge for continuous versus…

Descriptors: Sample Size, Simulation, Form Classes (Languages), Diseases

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic C[subscript 2] for ordinal IRT models. The construction of the new statistic lies formally between the M[subscript 2] statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*[subscript 2] statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

Many Tests of Significance: New Methods for Controlling Type I Errors

Peer reviewed

Keselman, H. J.; Miller, Charles W.; Holland, Burt – Psychological Methods, 2011

There have been many discussions of how Type I errors should be controlled when many hypotheses are tested (e.g., all possible comparisons of means, correlations, proportions, the coefficients in hierarchical models, etc.). By and large, researchers have adopted familywise (FWER) control, though this practice certainly is not universal. Familywise…

Descriptors: Validity, Statistical Significance, Probability, Computation

Smaller Is Better (when Sampling from the Crowd within): Low Memory-Span Individuals Benefit More from Multiple Opportunities for Estimation

Peer reviewed

Hourihan, Kathleen L.; Benjamin, Aaron S. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2010

Recently, Vul and Pashler (2008) demonstrated that the average of 2 responses from a single subject to general knowledge questions was more accurate than either single estimate. Importantly, this reveals that each guess contributes unique evidence relevant to the decision, contrary to views that eschew probabilistic representations of the…

Descriptors: Memory, Task Analysis, Cognitive Processes, Undergraduate Students

A Note on Item-Restscore Association in Rasch Models

Peer reviewed

Kreiner, Svend – Applied Psychological Measurement, 2011

To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…

Descriptors: Item Analysis, Correlation, Item Response Theory, Models

Technical Adequacy of Response to Intervention Decisions

Peer reviewed

VanDerHeyden, Amanda M. – Exceptional Children, 2011

Perhaps the greatest value of response to intervention (RTI) as a decision framework is that it brings attention to variables (e.g., mastery of prerequisite skills, frequency of instructional corrective feedback, reinforcement schedules for correct responding) that if changed might make a meaningful difference for students (e.g., child rate of…

Descriptors: Feedback (Response), Intervention, Classification, Response to Intervention

Killeen's Probability of Replication and Predictive Probabilities: How to Compute, Use, and Interpret Them

Peer reviewed

Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010

P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…

Descriptors: Research Methodology, Guidelines, Probability, Computation
