Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 19 |
Descriptor
Computation | 25 |
Evaluation Methods | 25 |
Evaluation Research | 25 |
Simulation | 9 |
Data Analysis | 7 |
Item Response Theory | 6 |
Research Methodology | 6 |
Statistical Analysis | 5 |
Comparative Analysis | 4 |
Correlation | 4 |
Error Patterns | 4 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 22 |
Reports - Research | 12 |
Reports - Evaluative | 7 |
Reports - Descriptive | 5 |
Information Analyses | 2 |
Dissertations/Theses -… | 1 |
Education Level
Higher Education | 2 |
Adult Education | 1 |
Elementary Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Oregon | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017
This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…
Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments
Zamarro, Gema; Anderson, Kaitlin; Steele, Jennifer; Miller, Trey – Society for Research on Educational Effectiveness, 2016
The purpose of this study is to study the performance of different methods (inverse probability weighting and estimation of informative bounds) to control for differential attrition by comparing the results of different methods using two datasets: an original dataset from Portland Public Schools (PPS) subject to high rates of differential…
Descriptors: Data Analysis, Student Attrition, Evaluation Methods, Evaluation Research
Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012
This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…
Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring
Foley, Brett Patrick – ProQuest LLC, 2010
The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using…
Descriptors: Test Length, Sample Size, Simulation, Item Response Theory
Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012
A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…
Descriptors: Factor Analysis, Computation, Simulation, Sample Size
Schochet, Peter Z.; Puma, Mike; Deke, John – National Center for Education Evaluation and Regional Assistance, 2014
This report summarizes the complex research literature on quantitative methods for assessing how impacts of educational interventions on instructional practices and student learning differ across students, educators, and schools. It also provides technical guidance about the use and interpretation of these methods. The research topics addressed…
Descriptors: Statistical Analysis, Evaluation Methods, Educational Research, Intervention
Cho, Sun-Joo; Li, Feiming; Bandalos, Deborah – Educational and Psychological Measurement, 2009
The purpose of this study was to investigate the application of the parallel analysis (PA) method for choosing the number of factors in component analysis for situations in which data are dichotomous or ordinal. Although polychoric correlations are sometimes used as input for component analyses, the random data matrices generated for use in PA…
Descriptors: Correlation, Evaluation Methods, Data Analysis, Matrices
Forero, Carlos G.; Maydeu-Olivares, Alberto – Psychological Methods, 2009
The performance of parameter estimates and standard errors in estimating F. Samejima's graded response model was examined across 324 conditions. Full information maximum likelihood (FIML) was compared with a 3-stage estimator for categorical item factor analysis (CIFA) when the unweighted least squares method was used in CIFA's third stage. CIFA…
Descriptors: Factor Analysis, Least Squares Statistics, Computation, Item Response Theory
Lee, Sik-Yum; Xia, Ye-Mao – Psychometrika, 2008
In this paper, normal/independent distributions, including but not limited to the multivariate t distribution, the multivariate contaminated distribution, and the multivariate slash distribution, are used to develop a robust Bayesian approach for analyzing structural equation models with complete or missing data. In the context of a nonlinear…
Descriptors: Structural Equation Models, Bayesian Statistics, Evaluation Methods, Evaluation Research
Weinberg, Bruce A.; Hashimoto, Masanori; Fleisher, Belton M. – Journal of Economic Education, 2009
The authors develop an original measure of learning in higher education, based on grades in subsequent courses. Using this measure of learning, they show that student evaluations are positively related to current grades but unrelated to learning once current grades are controlled. They offer evidence that the weak relationship between learning and…
Descriptors: Higher Education, Student Evaluation, Grades (Scholastic), Evaluation Methods
Cools, Wilfried; De Fraine, Bieke; Van den Noortgate, Wim; Onghena, Patrick – School Effectiveness and School Improvement, 2009
In educational effectiveness research, multilevel data analyses are often used because research units (most frequently, pupils or teachers) are studied that are nested in groups (schools and classes). This hierarchical data structure complicates designing the study because the structure has to be taken into account when approximating the accuracy…
Descriptors: Effective Schools Research, Program Effectiveness, School Effectiveness, Simulation
Wang, Wen-Chung – Applied Psychological Measurement, 2008
Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…
Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods
Athy, Jeremy; Friedrich, Jeff; Delany, Eileen – Science & Education, 2008
Egon Brunswik (1903-1955) first made an interesting distinction between perception and explicit reasoning, arguing that perception included quick estimates of an object's size, nearly always resulting in good approximations in uncertain environments, whereas explicit reasoning, while better at achieving exact estimates, could often fail by wide…
Descriptors: Psychology, Logical Thinking, Perception, Psychological Studies
Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…
Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)
Previous Page | Next Page ยป
Pages: 1 | 2