ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	19

Descriptor

Computation	25
Evaluation Methods	25
Evaluation Research	25
Simulation	9
Data Analysis	7
Item Response Theory	6
Research Methodology	6
Statistical Analysis	5
Comparative Analysis	4
Correlation	4
Error Patterns	4
Factor Analysis	4
Models	4
Robustness (Statistics)	4
Sample Size	4
Scores	4
Structural Equation Models	4
Academic Achievement	3
Educational Research	3
Effect Size	3
Error of Measurement	3
Intervals	3
Intervention	3
Predictor Variables	3
Pretests Posttests	3
More ▼

Source

Psychological Methods	4
Educational and Psychological…	3
Applied Psychological…	2
Structural Equation Modeling	2
American Journal of Evaluation	1
Educational Researcher	1
Evaluation Review	1
International Education…	1
Journal of Economic Education	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
National Center for Education…	1
ProQuest LLC	1
Psychometrika	1
School Effectiveness and…	1
Science & Education	1
Society for Research on…	1
Structural Equation Modeling:…	1
More ▼

Publication Type

Journal Articles	22
Reports - Research	12
Reports - Evaluative	7
Reports - Descriptive	5
Information Analyses	2
Dissertations/Theses -…	1

Education Level

Higher Education	2
Adult Education	1
Elementary Secondary Education	1

Audience

Researchers

Location

Oregon

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

The Nonuse, Misuse, and Proper Use of Pilot Studies in Experimental Evaluation Research

Peer reviewed
PDF on ERIC

Download full text

Direct link

Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017

This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…

Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments

Comparing Performance of Methods to Deal with Differential Attrition in Lottery Based Evaluations

Peer reviewed
PDF on ERIC

Download full text

Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey – Society for Research on Educational Effectiveness, 2016

The purpose of this study is to study the performance of different methods (inverse probability weighting and estimation of informative bounds) to control for differential attrition by comparing the results of different methods using two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…

Descriptors: Data Analysis, Student Attrition, Evaluation Methods, Evaluation Research

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Direct link

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

Improving IRT Parameter Estimates with Small Sample Sizes: Evaluating the Efficacy of a New Data Augmentation Technique

Direct link

Foley, Brett Patrick – ProQuest LLC, 2010

The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using…

Descriptors: Test Length, Sample Size, Simulation, Item Response Theory

When Can Categorical Variables Be Treated as Continuous? A Comparison of Robust Continuous and Categorical SEM Estimation Methods under Suboptimal Conditions

Peer reviewed

Direct link

Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012

A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…

Descriptors: Factor Analysis, Computation, Simulation, Sample Size

Understanding Variation in Treatment Effects in Education Impact Evaluations: An Overview of Quantitative Methods. NCEE 2014-4017

Peer reviewed
PDF on ERIC

Download full text

Schochet, Peter Z.; Puma, Mike; Deke, John – National Center for Education Evaluation and Regional Assistance, 2014

This report summarizes the complex research literature on quantitative methods for assessing how impacts of educational interventions on instructional practices and student learning differ across students, educators, and schools. It also provides technical guidance about the use and interpretation of these methods. The research topics addressed…

Descriptors: Statistical Analysis, Evaluation Methods, Educational Research, Intervention

Accuracy of the Parallel Analysis Procedure with Polychoric Correlations

Peer reviewed

Direct link

Cho, Sun-Joo; Li, Feiming; Bandalos, Deborah – Educational and Psychological Measurement, 2009

The purpose of this study was to investigate the application of the parallel analysis (PA) method for choosing the number of factors in component analysis for situations in which data are dichotomous or ordinal. Although polychoric correlations are sometimes used as input for component analyses, the random data matrices generated for use in PA…

Descriptors: Correlation, Evaluation Methods, Data Analysis, Matrices

Estimation of IRT Graded Response Models: Limited versus Full Information Methods

Peer reviewed

Direct link

Forero, Carlos G.; Maydeu-Olivares, Alberto – Psychological Methods, 2009

The performance of parameter estimates and standard errors in estimating F. Samejima's graded response model was examined across 324 conditions. Full information maximum likelihood (FIML) was compared with a 3-stage estimator for categorical item factor analysis (CIFA) when the unweighted least squares method was used in CIFA's third stage. CIFA…

Descriptors: Factor Analysis, Least Squares Statistics, Computation, Item Response Theory

A Robust Bayesian Approach for Structural Equation Models with Missing Data

Peer reviewed

Direct link

Lee, Sik-Yum; Xia, Ye-Mao – Psychometrika, 2008

In this paper, normal/independent distributions, including but not limited to the multivariate t distribution, the multivariate contaminated distribution, and the multivariate slash distribution, are used to develop a robust Bayesian approach for analyzing structural equation models with complete or missing data. In the context of a nonlinear…

Descriptors: Structural Equation Models, Bayesian Statistics, Evaluation Methods, Evaluation Research

Evaluating Teaching in Higher Education

Peer reviewed

Direct link

Weinberg, Bruce A.; Hashimoto, Masanori; Fleisher, Belton M. – Journal of Economic Education, 2009

The authors develop an original measure of learning in higher education, based on grades in subsequent courses. Using this measure of learning, they show that student evaluations are positively related to current grades but unrelated to learning once current grades are controlled. They offer evidence that the weak relationship between learning and…

Descriptors: Higher Education, Student Evaluation, Grades (Scholastic), Evaluation Methods

Multilevel Design Efficiency in Educational Effectiveness Research

Peer reviewed

Direct link

Cools, Wilfried; De Fraine, Bieke; Van den Noortgate, Wim; Onghena, Patrick – School Effectiveness and School Improvement, 2009

In educational effectiveness research, multilevel data analyses are often used because research units (most frequently, pupils or teachers) are studied that are nested in groups (schools and classes). This hierarchical data structure complicates designing the study because the structure has to be taken into account when approximating the accuracy…

Descriptors: Effective Schools Research, Program Effectiveness, School Effectiveness, Simulation

A Critique of Raju and Oshima's Prophecy Formulas for Assessing the Reliability of Item Response Theory-Based Ability Estimates

Peer reviewed

Direct link

Wang, Wen-Chung – Applied Psychological Measurement, 2008

Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…

Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods

Replication and Pedagogy in the History of Psychology VI: Egon Brunswik on Perception and Explicit Reasoning

Peer reviewed

Direct link

Athy, Jeremy; Friedrich, Jeff; Delany, Eileen – Science & Education, 2008

Egon Brunswik (1903-1955) first made an interesting distinction between perception and explicit reasoning, arguing that perception included quick estimates of an object's size, nearly always resulting in good approximations in uncertain environments, whereas explicit reasoning, while better at achieving exact estimates, could often fail by wide…

Descriptors: Psychology, Logical Thinking, Perception, Psychological Studies

Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression

Peer reviewed

Direct link

Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008

For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…

Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)

Previous Page | Next Page »

Pages: 1 | 2

Raykov, Tenko	2
Stuart, Elizabeth A.	2
Anderson, Kaitlin	1
Athy, Jeremy	1
Bandalos, Deborah	1
Bardhoshi, Gerta	1
Bauer, Daniel J.	1
Bergeron, Jennifer M.	1
Blitstein, Jonathan L.	1
Brosseau-Liard, Patricia E.	1
Chan, Daniel W.-L.	1
Chan, Wai	1
Cheung, Mike W. L.	1
Cho, Sun-Joo	1
Cools, Wilfried	1
Curran, Patrick J.	1
Darmawan, I Gusti Ngurah	1
De Fraine, Bieke	1
Deke, John	1
Delany, Eileen	1
Erford, Bradley T.	1
Fleisher, Belton M.	1
Foley, Brett Patrick	1
Forero, Carlos G.	1
Friedrich, Jeff	1
More ▼