ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

State Programs	56
Test Reliability	56
Testing Programs	56
Test Validity	29
Educational Assessment	24
Elementary Secondary Education	24
Scoring	20
Achievement Tests	18
Test Construction	18
Performance Based Assessment	12
Scores	12
Writing Evaluation	11
Academic Achievement	9
Essay Tests	9
Test Interpretation	9
Test Results	9
Elementary Education	8
Portfolios (Background…	8
Test Format	8
Test Use	8
Basic Skills	7
Educational Testing	7
Equivalency Tests	7
Higher Education	7
Objective Tests	7
More ▼

Source

Applied Measurement in…	4
Educational and Psychological…	2
Northwest Education	1
School Administrator	1

Publication Type

Reports - Research	27
Reports - Evaluative	18
Reports - Descriptive	14
Speeches/Meeting Papers	9
Journal Articles	8
Numerical/Quantitative Data	4
Guides - Non-Classroom	3
Tests/Questionnaires	3
Collected Works - Proceedings	1
Guides - General	1
Information Analyses	1
Opinion Papers	1
More ▼

Education Level

Audience

Practitioners	3
Researchers	3
Policymakers	2

Location

California	8
Vermont	5
Arizona	2
Canada	2
Florida	2
Georgia	2
Kentucky	2
New Jersey	2
New York	2
South Carolina	2
Alaska	1
Colorado	1
Louisiana	1
Maine	1
Michigan	1
North Carolina	1
Pennsylvania	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	3
National Assessment of…	2
New Jersey High School…	2
Pennsylvania Educational…	2
North Carolina End of Course…	1
SAT (College Admission Test)	1
SRA Achievement Series	1
Teacher Performance…	1
Texas Essential Knowledge and…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 56 results Save | Export

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

An Evaluation of a Multiple Matrix Sampling Procedure for a State Assessment Program.

Download full text

Kohr, Richard L. – 1976

Pennsylvania's Educational Quality Assessment Program provides each participating school with a building level report in which state percentiles are a prominent part. Multiple matrix sampling was being considered as a technique to reduce testing time. However, there was great concern that the error associated with estimating the school mean might…

Descriptors: Educational Assessment, Elementary Secondary Education, Item Sampling, Measurement Techniques

The Registered Holistic Scoring Method for Scoring Student Essays. New Jersey's Statewide Testing System High School Proficiency Test.

Bloom, Diane S. – 1985

The Registered Holistic Scoring Method, which has been used for one year to score the ninth grade writing test of the New Jersey High School Proficiency Test, is described. Registered Holistic Scoring was developed from the previous holistic approach in order to provide more reliable scoring guidelines year after year. Two trained evaluators…

Descriptors: Essay Tests, Evaluation Criteria, Grade 9, Holistic Evaluation

Sources of Uncertainty Often Ignored in Adjusting State Mean SAT Scores for Differential Participation Rates: The Rules of the Game.

Peer reviewed

Holland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990

Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…

Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias

Overview of the Most Difficult Technical Issues on the VNT.

Download full text

Skaggs, Gary; Bourque, Mary Lyn – 1998

Political and legislative pressures have posed a number of measurement issues and challenges to the development of sound, valid voluntary national tests (VNTs). This paper focuses on what appear to be the most difficult technical issues related to the VNT proposed by President Clinton in 1997. Technical issues refer to psychometric issues, as…

Descriptors: Academic Achievement, Achievement Tests, Classification, Difficulty Level

The Reliability of Mathematics Portfolio Scores: Lessons from the Vermont Experience.

Peer reviewed

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995

Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)

Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Alaska Instructional Diagnostic System, 1978 Pilot Test Results: Technical Report.

Download full text

Northwest Regional Educational Lab., Portland, OR. – 1978

Key findings of a pilot study of the Alaska Instructional Diagnostic System (AIDS) are summarized. The AIDS pilot test served to verify the appropriateness of the skills survey as well as the validity and reliability of the items. The AIDS testing system includes three components: (1) upper level skills surveys (grades 3-8); (2) lower level skill…

Descriptors: Achievement Tests, Diagnostic Tests, Educational Assessment, Educational Objectives

The Objective-Referenced Measure in Mathematics for Delaware Grade Four Students. Final Report.

Download full text

Delaware State Dept. of Public Instruction, Dover. Div. of Research, Planning, and Evaluation. – 1977

Part I of this report attempts to describe the system that was developed for local educational agencies by the Delaware State Department of Public Instruction to support classroom and curricular improvement in mathematics through the administration of an objective-referenced test in mathematics to grade four students. This system includes the…

Descriptors: Criterion Referenced Tests, Diagnostic Tests, Educational Assessment, Educational Objectives

High Stakes: Innovation Meets Backlash As States Struggle with Large-Scale Assessment.

Caudell, Lee Sherman – Northwest Education, 1996

Most states have expanded their statewide testing programs to include alternative educational assessments, and two (Kentucky and Maine) have completely abandoned the multiple-choice format. However, over half of states designing alternative assessments are encountering major difficulties related to the high cost of performance-based assessments,…

Descriptors: Accountability, Alternative Assessment, Costs, Educational Assessment

Applying the APA/AERA/NCME "Standards": Evidence for the Validity and Reliability of Three Statewide Teaching Assessment Instruments.

Download full text

Rothenberg, Lori; Hessling, Peter A. – 1990

The statewide teaching performance assessment instruments being used in Georgia, North Carolina, and Florida were examined. Forty-one reliability and validity studies regarding the instruments in use in each state were collected from state departments and universities. Georgia uses the Georgia Teacher Performance Assessment Instrument. North…

Descriptors: Construct Validity, Educational Assessment, Elementary Secondary Education, Meta Analysis

Development of the State Speaking Assessment Instrument: Reliability and Bias Study, Summary Report. Basic Skills Improvement Policy.

Massachusetts State Dept. of Education, Boston. Bureau of Research and Assessment. – 1982

Since the approval of the Basic Skills Improvement Policy in 1978, the Massachusetts Department of Education has been developing tests and alternative forms for the assessment of student achievement in five basic skills content areas: reading, writing, mathematics, listening, and speaking. Because of the lack of previous research on which to draw…

Descriptors: Achievement Tests, Basic Skills, Elementary Secondary Education, Policy Formation

Interpreting the Reliability and Validity of the Michigan Educational Assessment Program. Fact Finding on the Michigan Educational Assessment Program.

Download full text

Burns, Matthew – 1998

The psychometric properties of the testing tools of the Michigan Educational Assessment Program (MEAP), the state standardized testing program, are examined. Reliability studies have indicated that the scores from the MEAP, ranging from 0.654 to 0.949, are generally acceptable. The State Department of Education offered supporting evidence for the…

Descriptors: Academic Achievement, Achievement Tests, Criterion Referenced Tests, Elementary Secondary Education

Technical Digest for the Academic Year 1997-1998. Texas Student Assessment Program.

Texas Education Agency, Austin. – 1998

This digest is designed to provide information to Texas testing coordinators, other educators, and interested citizens about the development procedures and technical attributes of the state-mandated criterion-referenced assessment program. The chapters are: (1) "Background"; (2) "Test Development"; (3) "Test…

Descriptors: Alternative Assessment, Criterion Referenced Tests, Elementary Secondary Education, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

White, Edward M.	6
Koretz, Daniel	4
Bloom, Diane S.	2
Falk, Beverly	2
Kohr, Richard L., Comp.	2
Anderson, Lorin W.	1
Bourque, Mary Lyn	1
Burns, Matthew	1
Busbee, Cyril B.	1
Carvajal, Jorge	1
Caudell, Lee Sherman	1
Cromack, Theodore R.	1
Gearhart, Maryl	1
Hazelton, Alexander	1
Herman, Joan L.	1
Hessling, Peter A.	1
Hill, Richard	1
Hines, Constance V.	1
Holland, Paul W.	1
Kahl, Stuart R.	1
Kiplinger, Vonda L.	1
Klein, Stephen P.	1
Kohr, Richard L.	1
Lewis, Anne C.	1
More ▼