ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	4

Source

Applied Measurement in…

Publication Type

Journal Articles	15
Reports - Evaluative	9
Reports - Research	5
Information Analyses	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	2
Grade 2	1
Grade 3	1

Audience

Location

Canada	1
Colorado	1
Vermont	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	2
SAT (College Admission Test)	2

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

Peer reviewed

Direct link

Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016

The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Direct link

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Are Multiple Measures Meaningful?: Lessons from a Statewide Performance Assessment.

Peer reviewed

Goldberg, Gail Lynn; Roswell, Barbara Sherr – Applied Measurement in Education, 2001

To determine the factors that contribute to or compromise the effectiveness of multiscored items, this study combined analysis of statewide score data from the 1996 Maryland School Performance Assessment Program tests with systematic analyses of 60 activities providing measures of writing, language usage, or both, and one or more content areas.…

Descriptors: Performance Based Assessment, Scores, State Programs, Testing Programs

Estimation of the All Tests Pass Rate When No Examinee Took All of the Tests

Peer reviewed

Direct link

Miller, G. Edward; Yoes, Michael E.; Twing, Jon S. – Applied Measurement in Education, 2004

Two models are presented in this article for estimating the proportion of students who would pass all of three or more content area tests given that none have actually been tested in more than two of the content areas. The first model allows one to estimate the proportion of students who would pass all of three or more content area tests from the…

Descriptors: Scores, Standardized Tests, Student Evaluation, Testing Programs

Linking Statewide Tests to the National Assessment of Educational Progress: Accuracy of Combining Test Results across States.

Peer reviewed

Ercikan, Kadriye – Applied Measurement in Education, 1997

Linking scores from the National Assessment of Educational Progress (NAEP) to statewide test results was studied. Results based on an equipercentile procedure suggest that such a link does not provide precise information. Information from a linking study should be limited to rough estimates of students in each NAEP achievement level. (SLD)

Descriptors: Equated Scores, Estimation (Mathematics), National Surveys, State Programs

Student Test Score Reports and Interpretive Guides: Review of Current Practices and Suggestions for Future Research

Peer reviewed

Direct link

Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004

A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…

Descriptors: Test Results, Student Evaluation, Scores, Testing Programs

Stability of School-Level Scores from Large-Scale Student Assessment.

Peer reviewed

Sicoly, Fiore – Applied Measurement in Education, 2002

Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)

Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics

Comments on Holland and Wainer's "Sources of Uncertainty Often Ignored in Adjusting State Mean SAT Scores for Differential Participation Rates: The Rules of the Game."

Peer reviewed

Edwards, Don; Cummings, Cynthia B. – Applied Measurement in Education, 1990

An evolved form of the Edwards and Beckworth (1989) model for probability selection for Scholastic Achievement Test takers using truncated normal distributions is presented. It is shown that the arguments of Holland and Wainer are not sufficient to reject this model. (SLD)

Descriptors: College Entrance Examinations, Models, Participation, Probability

Aligning Policy and Methodology to Achieve Consistent Across-Grade Performance Standards

Peer reviewed

Direct link

Lewis, Daniel M.; Haug, Carolyn A. – Applied Measurement in Education, 2005

In a coherent educational assessment system, the relation between the cut scores across the grades of a content area should reflect the goals of the educational system moderated by the state of the standards within that system. In this article, we discuss the need for consistent cut scores across the grades of a content area; we present and…

Descriptors: Federal Legislation, Testing Programs, Educational Assessment, Standard Setting

Vertically Articulated Performance Standards: Logic, Procedures, and Likely Classification Accuracy

Peer reviewed

Direct link

Ferrara, Steve; Johnson, Eugene; Chen, Wen-Hung – Applied Measurement in Education, 2005

Psychometricians continue to develop and evaluate methods for linking test scores, both horizontally and vertically. This article describes a social moderation process for articulating (i.e., linking) performance standards across grade levels for an operational state assessment program. The researchers used generated data to evaluate the likely…

Descriptors: Grade 2, Grade 3, Scores, Error of Measurement

Sources of Uncertainty Often Ignored in Adjusting State Mean SAT Scores for Differential Participation Rates: The Rules of the Game.

Peer reviewed

Holland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990

Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…

Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias

The Reliability of Mathematics Portfolio Scores: Lessons from the Vermont Experience.

Peer reviewed

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995

Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)

Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Testing Programs	15
Scores	8
State Programs	7
Equated Scores	5
Evaluation Methods	4
Test Reliability	4
Test Results	4
Educational Assessment	3
Item Response Theory	3
Models	3
Student Evaluation	3
Academic Achievement	2
College Entrance Examinations	2
Cutting Scores	2
Educational Testing	2
Elementary Secondary Education	2
Error of Measurement	2
Estimation (Mathematics)	2
Foreign Countries	2
Mathematics Tests	2
Performance Based Assessment	2
Probability	2
Scoring	2
Standardized Tests	2
State Standards	2
More ▼

Chen, Wen-Hung	1
Colvin, Kimberly F.	1
Cook, Robert J.	1
Cummings, Cynthia B.	1
Edwards, Don	1
Ercikan, Kadriye	1
Ferrara, Steve	1
Goldberg, Gail Lynn	1
Goodman, Dean P.	1
Haberman, Shelby	1
Hambleton, Ronald K.	1
Haug, Carolyn A.	1
Holland, Paul W.	1
Johnson, Eugene	1
Keller, Lisa A.	1
Keller, Robert	1
Kim, Sooyeon	1
Kiplinger, Vonda L.	1
Klein, Stephen P.	1
Lewis, Daniel M.	1
Linn, Robert L.	1
Miller, G. Edward	1
Phillips, Gary W.	1
Puhan, Gautam	1
More ▼