ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	13

Descriptor

Educational Testing	27
Evaluation Methods	27
Statistical Analysis	27
Program Evaluation	8
Educational Assessment	6
Student Evaluation	6
Academic Achievement	5
Correlation	5
Educational Policy	5
Measurement Techniques	5
Research Methodology	5
Test Interpretation	5
Scores	4
Test Reliability	4
Test Results	4
Tests	4
Achievement Gains	3
Comparative Analysis	3
Data Collection	3
Educational Objectives	3
Educational Research	3
Elementary Education	3
Evaluation Criteria	3
Foreign Countries	3
Measurement	3
More ▼

Source

International Journal of…	2
Regional Educational…	2
Research Papers in Education	2
Applied Measurement in…	1
Applied Psychological…	1
ETS Research Report Series	1
Economics of Education Review	1
Education Finance and Policy	1
Journal of Educational…	1
Journal of International…	1
National Academies Press	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	7
Reports - Descriptive	5
Reports - Evaluative	5
Information Analyses	4
Speeches/Meeting Papers	2
Books	1
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Opinion Papers	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Elementary Secondary Education	8
Elementary Education	3
High Schools	3
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Kindergarten	1
Secondary Education	1

Audience

Location

United Kingdom	2
Iran	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

ACT Assessment	2
Dynamic Indicators of Basic…	2
Iowa Tests of Basic Skills	2
Preliminary Scholastic…	2
Stanford Achievement Tests	2

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Direct link

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States are increasingly interested in including measures of student achievement growth, or "value- added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot …

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

Using Alternative Student Growth Measures for Evaluating Teacher Performance: What the Literature Says. Summary. REL 2013-002

Peer reviewed
PDF on ERIC

Download full text

Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013

States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of course tests. The evidence suggests that these methods can reliably distinguish…

Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing

The Quality vs. the Quantity of Schooling: What Drives Economic Growth?

Peer reviewed

Direct link

Breton, Theodore R. – Economics of Education Review, 2011

This paper challenges Hanushek and Woessmann's (2008) contention that the quality and not the quantity of schooling determines a nation's rate of economic growth. I first show that their statistical analysis is flawed. I then show that when a nation's average test scores and average schooling attainment are included in a national income model,…

Descriptors: Economic Progress, Income, Statistical Significance, Educational Quality

Would Accountability Based on Teacher Value Added Be Smart Policy? An Examination of the Statistical Properties and Policy Alternatives

Peer reviewed

Direct link

Harris, Douglas N. – Education Finance and Policy, 2009

Annual student testing may make it possible to measure the contributions to student achievement made by individual teachers. But would these "teacher value-added" measures help to improve student achievement? I consider the statistical validity, purposes, and costs of teacher value-added policies. Many of the key assumptions of teacher value added…

Descriptors: Credentials, Educational Testing, Educational Policy, Policy Analysis

Getting Value out of Value-Added: Report of a Workshop

Direct link

Braun, Henry, Ed.; Chudowsky, Naomi, Ed.; Koenig, Judith, Ed. – National Academies Press, 2010

Value-added methods refer to efforts to estimate the relative contributions of specific teachers, schools, or programs to student test performance. In recent years, these methods have attracted considerable attention because of their potential applicability for educational accountability, teacher pay-for-performance systems, school and teacher…

Descriptors: Accountability, Teacher Improvement, Workshops, Program Evaluation

Understanding Comparability of Examination Standards

Peer reviewed

Direct link

Coe, Robert – Research Papers in Education, 2010

Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…

Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Direct link

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

The Effect of Tennessee's Prekindergarten Programs on Young Children's School Readiness Skills: A Regression Discontinuity Design

Direct link

Coburn, Jamie Lynn – ProQuest LLC, 2009

This study sought to explore the relationship between attendance in public prekindergarten programs and school readiness skills using regression discontinuity methodology. A sample of 179 students entering prekindergarten and 67 students entering kindergarten who had completed prekindergarten the previous year was collected with parental consent…

Descriptors: School Readiness, Preschool Education, Statistical Analysis, Geographic Regions

Contrasting Conceptions of Comparability

Peer reviewed

Direct link

Newton, Paul E. – Research Papers in Education, 2010

Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…

Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests

The Importance of Examining Teacher and Learner's Attitudes and Understanding Learning Needs in the Twenty First Century

Peer reviewed

Direct link

Jabbarifar, Taghi; Elhambakhsh, ELham – Journal of International Education Research, 2012

An indispensable part of any curriculum design in an educational setting is the analysis of the needs of the learners involved in the context. The needs can be addressed from different perspectives. Among them, the learners' needs in terms of their perceptions toward what constitute learning/teaching and testing processes are of prominent values.…

Descriptors: Foreign Countries, Mixed Methods Research, Observation, Instructional Design

Louis Guttman's Contributions to Classical Test Theory

Peer reviewed

Direct link

Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005

This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…

Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing

A Glossary of Measurement Terms Used in Title I Evaluation.

Download full text

Fortna, Richard O. – 1981

Measurement terms used in Title I evaluation are contained in this glossary. Several types of measurement techniques are identified and defined. Other measurement terms which are defined include those relating to validity, reliability, statistical analysis, test interpretation, and program effectiveness. (DWH)

Descriptors: Educational Testing, Evaluation Methods, Glossaries, Program Evaluation

Previous Page | Next Page »

Pages: 1 | 2

Booker, Kevin	2
Bruch, Julie	2
Gill, Brian	2
And Others.	1
Armstrong, Ronald D.	1
Ball, Samuel	1
Braun, Henry, Ed.	1
Breton, Theodore R.	1
Chissom, Brad S.	1
Chudowsky, Naomi, Ed.	1
Coburn, Jamie Lynn	1
Coe, Robert	1
DeMars, Christine E.	1
Doherty, William J.	1
Elhambakhsh, ELham	1
Everson, Howard T.	1
Fortna, Richard O.	1
Haberman, Shelby	1
Harris, Douglas N.	1
Hoepfner, Ralph	1
Huberty, Carl J.	1
Jabbarifar, Taghi	1
Kandaswamy, Subramaniam	1
Kim, Sooyeon	1
More ▼