ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	7

Descriptor

Scores	24
Test Reliability	24
Testing Programs	24
Test Validity	14
State Programs	12
Achievement Tests	9
Educational Assessment	7
Elementary Secondary Education	7
Standardized Tests	6
Test Construction	6
Academic Achievement	5
Performance Based Assessment	5
Portfolios (Background…	5
Test Results	5
Evaluation Methods	4
Psychometrics	4
Scaling	4
Scoring	4
Student Evaluation	4
Test Use	4
Comparative Analysis	3
Elementary Education	3
Item Analysis	3
Mathematics	3
Multiple Choice Tests	3
More ▼

Source

Applied Measurement in…	2
Educational Measurement:…	1
Educational and Psychological…	1
GED Testing Service	1
NJEA Review	1
National Center for Education…	1
New York State Education…	1
Regional Educational…	1
Review of Research in…	1
TESOL Journal	1

Publication Type

Reports - Evaluative	10
Reports - Research	9
Journal Articles	6
Speeches/Meeting Papers	4
Numerical/Quantitative Data	2
Reports - Descriptive	2
Guides - General	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
High Schools	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 12	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
High School Equivalency…	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Researchers

Location

Vermont	4
Canada	2
Alaska	1
California	1
New York (Albany)	1
New York (Buffalo)	1
New York (New York)	1
New York (Rochester)	1
New York (Syracuse)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
SAT (College Admission Test)	2
California Achievement Tests	1
Comprehensive Tests of Basic…	1
General Educational…	1
Iowa Tests of Basic Skills	1
Metropolitan Achievement Tests	1
North Carolina End of Course…	1
SRA Achievement Series	1
Sequential Tests of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

New York State Alternate Assessment Technical Report, 2014-15

Download full text

New York State Education Department, 2015

This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…

Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Generalizability Theory as Evidence of Concerns about Fairness in Large-Scale ESL Writing Assessments

Peer reviewed

Direct link

Huang, Jinyan – TESOL Journal, 2011

Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…

Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

Whether and How to Use State Tests to Measure Student Achievement in a Multi-State Randomized Experiment: An Empirical Assessment Based on Four Recent Evaluations. NCEE 2012-4015

Peer reviewed
PDF on ERIC

Download full text

Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011

This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…

Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness

Technical Manual: 2002 Series GED Tests

Download full text

Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009

This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…

Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability

The Predictive Validity of Selected Benchmark Assessments Used in the Mid-Atlantic Region. Issues & Answers. REL 2007-No. 017

Peer reviewed
PDF on ERIC

Download full text

Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007

This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…

Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness

The Effects of Functional Level Testing on Five New Standardized Reading Achievement Tests.

Download full text

Easton, John Q.; Washington, Elois D. – 1982

The effects of students taking different levels of the same standardized achievement test were assessed by administering two levels of the same test to each student. The functional level of the test was taken by all students. The second level of testing was randomly assigned at the adjacent higher or lower level of the test. Functional level…

Descriptors: Elementary Education, Pilot Projects, Reading Achievement, Scores

Portfolio Assessment: A Theoretical Estimate of Score Reliability.

Peer reviewed

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995

An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)

Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

A Tale of Testing in Two Cities

McKenna, Bernard H. – NJEA Review, 1976

Article presented a true story of how two cities ran testing programs and the lessons that can be learned from their failures. (Editor/RK)

Descriptors: Learning Processes, Scores, Standardized Tests, Student Attitudes

Sources of Uncertainty Often Ignored in Adjusting State Mean SAT Scores for Differential Participation Rates: The Rules of the Game.

Peer reviewed

Holland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990

Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…

Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias

The Reliability of Mathematics Portfolio Scores: Lessons from the Vermont Experience.

Peer reviewed

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995

Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)

Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Alaska Instructional Diagnostic System, 1978 Pilot Test Results: Technical Report.

Download full text

Northwest Regional Educational Lab., Portland, OR. – 1978

Key findings of a pilot study of the Alaska Instructional Diagnostic System (AIDS) are summarized. The AIDS pilot test served to verify the appropriateness of the skills survey as well as the validity and reliability of the items. The AIDS testing system includes three components: (1) upper level skills surveys (grades 3-8); (2) lower level skill…

Descriptors: Achievement Tests, Diagnostic Tests, Educational Assessment, Educational Objectives

Performance-Based Assessment: Questions and Answers.

Download full text

Seyfarth, John T. – 1993

Performance based assessment refers to tasks that require students to construct responses or take actions to demonstrate specific knowledge or skills. Performance assessment tasks appear in a variety of formats, but they focus on higher order skills and are nonroutine, and sometimes loosely structured, in nature. A number of concerns have been…

Descriptors: Accountability, Comparative Analysis, Educational Assessment, Educational Change

The Reliability of Vermont Portfolio Scores in the 1992-93 School Year. Interim Report. RAND Reprints Series.

Download full text

Koretz, Daniel; And Others – 1994

The 1992-93 school year saw the second statewide implementation of the Vermont portfolio-assessment program, and RAND continued its ongoing evaluation of the program's implementation, effects, and data quality. While the first year's study found evidence of the impact of the assessment program and low reliability of portfolio scoring, this year's…

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Mathematics

Previous Page | Next Page »

Pages: 1 | 2

Koretz, Daniel	3
Allen, Nancy L.	1
Anderson, Lorin W.	1
Bennett, Randy Elliot	1
Brown, Richard S.	1
Carvajal, Jorge	1
Coughlin, Ed	1
Easton, John Q.	1
Ebel, Robert L.	1
Ezzelle, Carol	1
Holland, Paul W.	1
Huang, Jinyan	1
Isham, Steven P.	1
Kettler, Ryan J.	1
Klein, Stephen P.	1
Mandeville, Garrett K.	1
McKenna, Bernard H.	1
Reckase, Mark D.	1
Setzer, J. Carl	1
Seyfarth, John T.	1
Skorupski, William P.	1
Somers, Marie-Andree	1
Wainer, Howard	1
Washington, Elois D.	1
More ▼