Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 20 |
Descriptor
Error of Measurement | 28 |
Evaluation Methods | 28 |
Evaluation Research | 28 |
Evaluation Criteria | 6 |
Evaluation Problems | 6 |
Measurement Techniques | 6 |
Models | 5 |
Structural Equation Models | 5 |
Test Reliability | 5 |
Academic Achievement | 4 |
Educational Policy | 4 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 23 |
Reports - Research | 13 |
Reports - Descriptive | 10 |
Reports - Evaluative | 4 |
Dissertations/Theses -… | 1 |
Information Analyses | 1 |
Opinion Papers | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 6 |
Higher Education | 6 |
Adult Education | 4 |
Postsecondary Education | 3 |
High Schools | 2 |
Secondary Education | 1 |
Audience
Location
California | 1 |
Illinois | 1 |
Iran | 1 |
Maine | 1 |
Michigan | 1 |
Nevada | 1 |
New Hampshire | 1 |
Ohio | 1 |
Oklahoma | 1 |
Oregon | 1 |
Rhode Island | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Praxis Series | 1 |
What Works Clearinghouse Rating
Lotfi Simon Kerzabi – ProQuest LLC, 2021
Monte Carlo methods are an accepted methodology in regards to generation critical values for a Maximum test. The same methods are also applicable to the evaluation of the robustness of the new created test. A table of critical values was created, and the robustness of the new maximum test was evaluated for five different distributions. Robustness…
Descriptors: Data, Monte Carlo Methods, Testing, Evaluation Research
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017
This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…
Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
Robinson, Lauren; Dudensing, Rebekka; Granovsky, Nancy L. – Journal of Extension, 2016
Program evaluation often suffers due to time constraints, imperfect instruments, incomplete data, and the need to report standardized metrics. This article about the evaluation process for the Wi$eUp financial education program showcases the difficulties inherent in evaluation and suggests best practices for assessing program effectiveness. We…
Descriptors: Evaluation Methods, Evaluation Research, Error of Measurement, Money Management
Barakat, Bilal Fouad – International Journal of Educational Development, 2012
The number of years a child of school-entry age can expect to remain in school is of great interest both as a measure of individual human capital and of the performance of an education system. An approximate indicator of this concept is the sum of age-specific enrolment rates. The relatively low data demands of this indicator that are feasible to…
Descriptors: Human Capital, Measurement Techniques, Simulation, Evaluation Methods
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012
Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…
Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies
Milanowski, Anthony T. – Online Submission, 2011
After decades of disinterest, evaluation of the performance of elementary and secondary teachers in the United States has become an important educational policy issue. As U.S. states and districts have tried to upgrade their evaluation processes, one of the models that has been increasingly used is the Framework for Teaching. This paper summarizes…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Observation
Sturgis, Chris – International Association for K-12 Online Learning, 2014
This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students to excel in academics and to gain the skills that are needed to be successful in college, the community, and the workplace. In order to make the…
Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research
Reichardt, Charles S. – Multivariate Behavioral Research, 2011
Maxwell, Cole, and Mitchell (2011) demonstrated that simple structural equation models, when used with cross-sectional data, generally produce biased estimates of meditated effects. I extend those results by showing how simple structural equation models can produce biased estimates of meditated effects when used even with longitudinal data. Even…
Descriptors: Structural Equation Models, Statistical Data, Longitudinal Studies, Error of Measurement
Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012
A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…
Descriptors: Factor Analysis, Computation, Simulation, Sample Size
Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012
Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…
Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests
Turner, Gill; Gibbs, Graham – Assessment & Evaluation in Higher Education, 2010
There is considerable variation between male and female Bachelor degree performance at Oxford and Cambridge (Oxbridge) where male students attain more First and Third Class degrees and female students attain more Second Class degrees. Various hypotheses have been put forward to explain this phenomenon including the possibility that the distinctive…
Descriptors: Gender Differences, Questionnaires, Evaluation Methods, Evaluation Research
Teasley, C.E. Wynn; Hornyak, Martin – American Journal of Business Education, 2010
The 2009 college football season is here, but there has been a continuing controversy swirling over how the Football Bowl Subdivision (FBS) selects its national champion. College football uses a multi-criterion decision matrix (MCDM) evaluation technique to determine which two teams will play for the national championship. We analyzed the BCS…
Descriptors: Business Administration, Business Administration Education, Team Sports, College Athletics
McKenzie, Robert G. – Learning Disability Quarterly, 2009
The assessment procedures within Response to Intervention (RTI) models have begun to supplant the use of traditional, discrepancy-based frameworks for identifying students with specific learning disabilities (SLD). Many RTI proponents applaud this shift because of perceived shortcomings in utilizing discrepancy as an indicator of SLD. However,…
Descriptors: Intervention, Learning Disabilities, Error of Measurement, Psychometrics
Previous Page | Next Page ยป
Pages: 1 | 2