ERIC - Search Results

Showing all 13 results

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

Topczewski, Anna Marie – ProQuest LLC, 2013

Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

Descriptors: Item Response Theory, Scaling, Scores, Student Development

How Unstable Are "School Effects" Assessed by a Value-Added Technique?

Peer reviewed

Gorad, Stephen; Hordosy, Rita; Siddiqui, Nadia – International Education Studies, 2013

This paper re-considers the widespread use of value-added approaches to estimate school "effects", and shows the results to be very unstable over time. The paper uses as an example the contextualised value-added scores of all secondary schools in England. The study asks how many schools with at least 99% of their pupils included in the…

Descriptors: Foreign Countries, Outcomes of Education, Secondary Education, Educational Testing

How Stable Are Value-Added Estimates across Years, Subjects and Student Groups? What We Know Series: Value-Added Methods and Applications. Knowledge Brief 3

Loeb, Susanna; Candelaria, Christopher A. – Carnegie Foundation for the Advancement of Teaching, 2012

Value-added models measure teacher performance by the test score gains of their students, adjusted for a variety of factors such as the performance of students when they enter the class. The measures are based on desired student outcomes such as math and reading scores, but they have a number of potential drawbacks. One of them is the…

Descriptors: Academic Achievement, Teacher Effectiveness, Scores, Peer Influence

Measuring Test Measurement Error: A General Approach

Peer reviewed

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added assessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Estimating the Impacts of Educational Interventions Using State Tests or Study-Administered Tests. NCEE 2012-4016

Peer reviewed

Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011

This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…

Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement

When Can Subscores Have Value?

Peer reviewed

Haberman, Shelby J. – Journal of Educational and Behavioral Statistics, 2008

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Testing Programs, Regression (Statistics), Scores, Student Evaluation

Using Value-Added Measures of Teacher Quality. Brief 9

Hanushek, Eric A.; Rivkin, Steven G. – National Center for Analysis of Longitudinal Data in Education Research, 2010

Extensive education research on the contribution of teachers to student achievement produces two generally accepted results. First, teacher quality varies substantially as measured by the value added to student achievement or future academic attainment or earnings. Second, variables often used to determine entry into the profession and…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

Performance Assessments with Microworlds and Their Difficulty

Peer reviewed

Kluge, Annette – Applied Psychological Measurement, 2008

The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…

Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

Reliability. ERIC Digest.

Rudner, Lawrence M.; Schafer, William D. – 2001

This digest discusses sources of error in testing, several approaches to estimating reliability, and several ways to increase test reliability. Reliability has been defined in different ways by different authors, but the best way to look at reliability may be the extent to which measurements resulting from a test are characteristics of those being…

Descriptors: Educational Testing, Error of Measurement, Reliability, Scores

Classical Test Theory in Historical Perspective.

Peer reviewed

Traub, Ross E. – Educational Measurement: Issues and Practice, 1997

Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)

Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics

Measuring Effect Sizes: The Effect of Measurement Error. Working Paper 19

Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008

Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications
