ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	20

Descriptor

Error of Measurement	28
Evaluation Methods	28
Evaluation Research	28
Evaluation Criteria	6
Evaluation Problems	6
Measurement Techniques	6
Models	5
Structural Equation Models	5
Test Reliability	5
Academic Achievement	4
Educational Policy	4
Interrater Reliability	4
Item Response Theory	4
Simulation	4
Statistical Analysis	4
Change Strategies	3
Computation	3
Educational Assessment	3
Goodness of Fit	3
Item Analysis	3
Measurement	3
Program Effectiveness	3
Psychometrics	3
Research Methodology	3
Sample Size	3

Publication Type

Journal Articles	23
Reports - Research	13
Reports - Descriptive	10
Reports - Evaluative	4
Dissertations/Theses -…	1
Information Analyses	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	6
Higher Education	6
Adult Education	4
Postsecondary Education	3
High Schools	2
Secondary Education	1

Location

California	1
Illinois	1
Iran	1
Maine	1
Michigan	1
Nevada	1
New Hampshire	1
Ohio	1
Oklahoma	1
Oregon	1
Rhode Island	1
Texas	1
United Kingdom	1

Assessments and Surveys

Praxis Series

Showing 1 to 15 of 28 results

A Maximum Test of Three Non-Parametric Two-Sample Procedures for Ordinal Data

Lotfi Simon Kerzabi – ProQuest LLC, 2021

Monte Carlo methods are an accepted methodology for generating critical values for a maximum test. The same methods are also applicable to evaluating the robustness of the newly created test. A table of critical values was created, and the robustness of the new maximum test was evaluated for five different distributions. Robustness…

Descriptors: Data, Monte Carlo Methods, Testing, Evaluation Research

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

The Nonuse, Misuse, and Proper Use of Pilot Studies in Experimental Evaluation Research

Peer reviewed

Westlund, Erik; Stuart, Elizabeth A. – American Journal of Evaluation, 2017

This article discusses the nonuse, misuse, and proper use of pilot studies in experimental evaluation research. The authors first show that there is little theoretical, practical, or empirical guidance available to researchers who seek to incorporate pilot studies into experimental evaluation research designs. The authors then discuss how pilot…

Descriptors: Use Studies, Pilot Projects, Evaluation Research, Experiments

Maintaining Equivalent Cut Scores for Small Sample Test Forms

Peer reviewed

Dwyer, Andrew C. – Journal of Educational Measurement, 2016

This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards

Evaluation Strategies in Financial Education: Evaluation with Imperfect Instruments

Peer reviewed

Robinson, Lauren; Dudensing, Rebekka; Granovsky, Nancy L. – Journal of Extension, 2016

Program evaluation often suffers due to time constraints, imperfect instruments, incomplete data, and the need to report standardized metrics. This article about the evaluation process for the Wi$eUp financial education program showcases the difficulties inherent in evaluation and suggests best practices for assessing program effectiveness. We…

Descriptors: Evaluation Methods, Evaluation Research, Error of Measurement, Money Management

A Second Look at "School-Life Expectancy"

Peer reviewed

Barakat, Bilal Fouad – International Journal of Educational Development, 2012

The number of years a child of school-entry age can expect to remain in school is of great interest both as a measure of individual human capital and of the performance of an education system. An approximate indicator of this concept is the sum of age-specific enrolment rates. The relatively low data demands of this indicator that are feasible to…

Descriptors: Human Capital, Measurement Techniques, Simulation, Evaluation Methods

A Revision of School Effectiveness Analysis

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012

Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…

Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies

Validity Research on Teacher Evaluation Systems Based on the Framework for Teaching

Milanowski, Anthony T. – Online Submission, 2011

After decades of disinterest, evaluation of the performance of elementary and secondary teachers in the United States has become an important educational policy issue. As U.S. states and districts have tried to upgrade their evaluation processes, one of the models that has been increasingly used is the Framework for Teaching. This paper summarizes…

Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Observation

Progress and Proficiency: Redesigning Grading for Competency Education. CompetencyWorks Issue Brief

Sturgis, Chris – International Association for K-12 Online Learning, 2014

This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students to excel in academics and to gain the skills that are needed to be successful in college, the community, and the workplace. In order to make the…

Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research

Commentary: Are Three Waves of Data Sufficient for Assessing Mediation?

Peer reviewed

Reichardt, Charles S. – Multivariate Behavioral Research, 2011

Maxwell, Cole, and Mitchell (2011) demonstrated that simple structural equation models, when used with cross-sectional data, generally produce biased estimates of mediated effects. I extend those results by showing how simple structural equation models can produce biased estimates of mediated effects when used even with longitudinal data. Even…

Descriptors: Structural Equation Models, Statistical Data, Longitudinal Studies, Error of Measurement

When Can Categorical Variables Be Treated as Continuous? A Comparison of Robust Continuous and Categorical SEM Estimation Methods under Suboptimal Conditions

Peer reviewed

Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012

A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…

Descriptors: Factor Analysis, Computation, Simulation, Sample Size

Generalizability of Student Writing across Multiple Tasks: A Challenge for Authentic Assessment

Peer reviewed

Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012

Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…

Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests

Are Assessment Environments Gendered? An Analysis of the Learning Responses of Male and Female Students to Different Assessment Environments

Peer reviewed

Turner, Gill; Gibbs, Graham – Assessment & Evaluation in Higher Education, 2010

There is considerable variation between male and female Bachelor degree performance at Oxford and Cambridge (Oxbridge) where male students attain more First and Third Class degrees and female students attain more Second Class degrees. Various hypotheses have been put forward to explain this phenomenon including the possibility that the distinctive…

Descriptors: Gender Differences, Questionnaires, Evaluation Methods, Evaluation Research

BCS or Just BS: How College Football Could Crown the Wrong National Champion? Just Do the Math--Correctly!

Peer reviewed

Teasley, C.E. Wynn; Hornyak, Martin – American Journal of Business Education, 2010

The 2009 college football season is here, but there has been a continuing controversy swirling over how the Football Bowl Subdivision (FBS) selects its national champion. College football uses a multi-criterion decision matrix (MCDM) evaluation technique to determine which two teams will play for the national championship. We analyzed the BCS…

Descriptors: Business Administration, Business Administration Education, Team Sports, College Athletics

Obscuring Vital Distinctions: The Oversimplification of Learning Disabilities within RTI

Peer reviewed

McKenzie, Robert G. – Learning Disability Quarterly, 2009

The assessment procedures within Response to Intervention (RTI) models have begun to supplant the use of traditional, discrepancy-based frameworks for identifying students with specific learning disabilities (SLD). Many RTI proponents applaud this shift because of perceived shortcomings in utilizing discrepancy as an indicator of SLD. However,…

Descriptors: Intervention, Learning Disabilities, Error of Measurement, Psychometrics

Source

Structural Equation Modeling	5
Journal of Educational…	2
American Journal of Business…	1
American Journal of Evaluation	1
Assessment	1
Assessment & Evaluation in…	1
Educational Measurement:…	1
Educational Policy…	1
Educational and Psychological…	1
International Association for…	1
International Journal of…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Extension	1
Learning Disability Quarterly	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
Online Submission	1
ProQuest LLC	1
Psychological Methods	1
Quality of Higher Education	1
RAND Corporation	1
Research & Practice in…	1

Author

Ankenmann, Robert D.	1
Barakat, Bilal Fouad	1
Bardhoshi, Gerta	1
Brosseau-Liard, Patricia E.	1
Conley, David T.	1
Dolan, Conor V.	1
Dudensing, Rebekka	1
Dudgeon, Paul	1
Dwyer, Andrew C.	1
Erford, Bradley T.	1
Fitzgerald, Robert	1
Foster, Jeff L.	1
Gibbs, Graham	1
Granovsky, Nancy L.	1
Hamilton, Laura S.	1
Hathcoat, John D.	1
Hau, Kit-Tai	1
Hornyak, Martin	1
Hox, Joop	1
Koretz, Daniel M.	1
Leark, Robert A.	1
Lensvelt-Mulders, Gerty	1
Lockwood, J. R.	1
Longford, Nicholas T.	1
Lotfi Simon Kerzabi	1