ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Comparative Analysis	9
Test Reliability	9
Testing Programs	9
Test Validity	5
Educational Assessment	4
Elementary Secondary Education	3
Equated Scores	3
Scores	3
Standardized Tests	3
State Programs	3
Student Evaluation	3
Test Construction	3
Academic Achievement	2
Accountability	2
Achievement Tests	2
Correlation	2
Cost Effectiveness	2
Costs	2
Measures (Individuals)	2
National Competency Tests	2
Performance Based Assessment	2
Psychometrics	2
Statistical Analysis	2
Thinking Skills	2
Adaptive Testing	1
More ▼

Source

Applied Measurement in…	1
Australian Educational…	1
ETS Research Report Series	1
Educational and Psychological…	1
Journal of Applied Testing…	1
National Center for Education…	1

Author

Carvajal, Jorge	1
Haberman, Shelby	1
Hacker, Jacob	1
Hathaway, Walter	1
Heldsinger, Sandra	1
Humphry, Stephen	1
Kim, Sooyeon	1
Kiplinger, Vonda L.	1
Linn, Robert L.	1
Luecht, Richard M.	1
Pollack, Judith M.	1
Seyfarth, John T.	1
Skorupski, William P.	1
Somers, Marie-Andree	1
Wong, Edmond	1
Zhu, Pei	1
von Davier, Alina A.	1
More ▼

Publication Type

Reports - Evaluative	8
Journal Articles	5
Reports - Research	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education

Audience

Location

Australia	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

Using the Method of Pairwise Comparison to Obtain Reliable Teacher Assessments

Peer reviewed
PDF on ERIC

Download full text

Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010

Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…

Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)

Whether and How to Use State Tests to Measure Student Achievement in a Multi-State Randomized Experiment: An Empirical Assessment Based on Four Recent Evaluations. NCEE 2012-4015

Peer reviewed
PDF on ERIC

Download full text

Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011

This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…

Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness

An Alternative to Equating with Small Samples in the Non-Equivalent Groups Anchor Test Design. Research Report. ETS RR-06-27

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006

This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…

Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Performance-Based Assessment: Questions and Answers.

Download full text

Seyfarth, John T. – 1993

Performance based assessment refers to tasks that require students to construct responses or take actions to demonstrate specific knowledge or skills. Performance assessment tasks appear in a variety of formats, but they focus on higher order skills and are nonroutine, and sometimes loosely structured, in nature. A number of concerns have been…

Descriptors: Accountability, Comparative Analysis, Educational Assessment, Educational Change

Some Issues in Free Response Testing.

Pollack, Judith M. – 1990

This paper summarizes an investigation of applications and issues in free response (FR) testing during 1989. It draws on ideas from the results of the National Educational Longitudinal Study 1988 (NELS:88) field test, a seminar series at the Educational Testing Service (ETS), working papers prepared for several FR testing applications, and…

Descriptors: Comparative Analysis, Costs, Educational Assessment, Elementary Secondary Education

Toward Extended Assessment: The Big Picture.

Download full text

Hacker, Jacob; Hathaway, Walter – 1991

Testing and assessment that are "more authentic" (performance-based or alternative) represent the most pressing issue in education today. Some of the major criticisms leveled at standardized testing are examined, and the advantages and disadvantages of more authentic assessment are reviewed. A general direction for integrating traditional and…

Descriptors: Comparative Analysis, Cost Effectiveness, Educational Assessment, Educational Trends

Some Useful Cost-Benefit Criteria for Evaluating Computer-Based Test Delivery Models and Systems

Peer reviewed

Direct link

Luecht, Richard M. – Journal of Applied Testing Technology, 2005

Computer-based testing (CBT) is typically implemented using one of three general test delivery models: (1) multiple fixed testing (MFT); (2) computer-adaptive testing (CAT); or (3) multistage testing (MSTs). This article reviews some of the real cost drivers associated with CBT implementation--focusing on item production costs, the costs…

Descriptors: Adaptive Testing, Computer Assisted Testing, Quality Control, Costs