ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	12

Source

Behavioral Research and…	4
National Center for Education…	2
Assessing Writing	1
Australian Educational…	1
Canadian Journal of School…	1
IAP - Information Age…	1
International Journal of…	1
Journal of Applied Testing…	1
Regional Educational…	1
Review of Educational Research	1

Publication Type

Reports - Evaluative	8
Journal Articles	6
Numerical/Quantitative Data	4
Reports - Research	3
Reports - Descriptive	2
Books	1
Collected Works - General	1
Guides - General	1
Information Analyses	1

Education Level

Elementary Secondary Education	14
Elementary Education	5
Higher Education	2
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1

Audience

Policymakers	1
Teachers	1

Location

Australia	1
New Zealand	1
United States	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Worldwide Test Reviewing at the Beginning of the Twenty-First Century

Peer reviewed

Direct link

Geisinger, Kurt F. – International Journal of Testing, 2012

This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…

Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

Download full text

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

Download full text

Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 6, Grade 3, Curriculum Based Assessment, Educational Testing

Administration and Scoring Errors of Graduate Students Learning the WISC-IV: Issues and Controversies

Peer reviewed

Direct link

Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012

A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…

Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring

Using the Method of Pairwise Comparison to Obtain Reliable Teacher Assessments

Peer reviewed
PDF on ERIC

Download full text

Heldsinger, Sandra; Humphry, Stephen – Australian Educational Researcher, 2010

Demands for accountability have seen the implementation of large scale testing programs in Australia and internationally. There is, however, a growing body of evidence to show that externally imposed testing programs do not have a sustained impact on student achievement. It has been argued that teacher assessment is more effective in raising…

Descriptors: Testing Programs, Testing, Academic Achievement, Measures (Individuals)

Extended Time Testing Accommodations for Students with Disabilities: Answers to Five Fundamental Questions

Peer reviewed

Direct link

Lovett, Benjamin J. – Review of Educational Research, 2010

Extended time is one of the most common testing accommodations provided to students with disabilities. It is also controversial; critics of extended time accommodations argue that extended time is used too readily, without concern for how it changes the skills measured by tests, leading to scores that cannot be compared fairly with those of other…

Descriptors: Testing Accommodations, Academic Accommodations (Disabilities), Literature Reviews, Meta Analysis

Whether and How to Use State Tests to Measure Student Achievement in a Multi-State Randomized Experiment: An Empirical Assessment Based on Four Recent Evaluations. NCEE 2012-4015

Peer reviewed
PDF on ERIC

Download full text

Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011

This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…

Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness

Assessing Students in the Margin: Challenges, Strategies, and Techniques

Direct link

Russell, Michael; Kavanaugh, Maureen – IAP - Information Age Publishing, Inc., 2011

The importance of student assessment, particularly for summative purposes, has increased greatly over the past thirty years. At the same time, emphasis on including all students in assessment programs has also increased. Assessment programs, whether they are large-scale, district-based, or teacher developed, have traditionally attempted to assess…

Descriptors: Testing Accommodations, Testing Programs, Educational Assessment, Adaptive Testing

Using State Tests in Education Experiments: A Discussion of the Issues. NCEE 2009-013

Peer reviewed
PDF on ERIC

Download full text

May, Henry; Perez-Johnson, Irma; Haimson, Joshua; Sattar, Samina; Gleason, Phil – National Center for Education Evaluation and Regional Assistance, 2009

Securing data on students' academic achievement is typically one of the most important and costly aspects of conducting education experiments. As state assessment programs have become practically universal and more uniform in terms of grades and subjects tested, the relative appeal of using state tests as a source of study outcome measures has…

Descriptors: Testing Programs, Academic Achievement, Researchers, Educational Research

The Predictive Validity of Selected Benchmark Assessments Used in the Mid-Atlantic Region. Issues & Answers. REL 2007-No. 017

Peer reviewed
PDF on ERIC

Download full text

Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007

This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…

Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness

Accuracy in the Scoring of Writing: Studies of Reliability and Validity Using a New Zealand Writing Assessment System

Peer reviewed

Direct link

Brown, Gavin T. L.; Glasswell, Kath; Harland, Don – Assessing Writing, 2004

Accuracy in the scoring of writing is critical if standardized tasks are to be used in a national assessment scheme. Three approaches to establishing accuracy (i.e., consensus, consistency, and measurement) exist and commonly large-scale assessment programs of primary school writing demonstrate adjacent agreement consensus rates of between 80% and…

Descriptors: Writing Evaluation, Student Evaluation, Educational Assessment, Writing Tests

Some Useful Cost-Benefit Criteria for Evaluating Computer-Based Test Delivery Models and Systems

Peer reviewed

Direct link

Luecht, Richard M. – Journal of Applied Testing Technology, 2005

Computer-based testing (CBT) is typically implemented using one of three general test delivery models: (1) multiple fixed testing (MFT); (2) computer-adaptive testing (CAT); or (3) multistage testing (MSTs). This article reviews some of the real cost drivers associated with CBT implementation--focusing on item production costs, the costs…

Descriptors: Adaptive Testing, Computer Assisted Testing, Quality Control, Costs

Test Reliability	14
Testing Programs	14
Test Validity	8
Statistical Analysis	6
Student Evaluation	6
Test Construction	6
Educational Testing	5
Evaluation Research	5
Academic Achievement	4
Curriculum Based Assessment	4
Item Response Theory	4
Multiple Choice Tests	4
Program Effectiveness	4
Reading Comprehension	4
Reading Tests	4
Screening Tests	4
Comparative Analysis	3
Evaluation Methods	3
Foreign Countries	3
Measures (Individuals)	3
Psychometrics	3
Standardized Tests	3
State Standards	3
Accountability	2
Achievement Tests	2
More ▼

Alonzo, Julie	4
Irvin, P. Shawn	4
Lai, Cheng-Fei	4
Park, Bitnara Jasmine	4
Tindal, Gerald	4
Barford, Sean W.	1
Brown, Gavin T. L.	1
Brown, Richard S.	1
Coughlin, Ed	1
Dombrowski, Stefan C.	1
Geisinger, Kurt F.	1
Glasswell, Kath	1
Gleason, Phil	1
Haimson, Joshua	1
Harland, Don	1
Heldsinger, Sandra	1
Humphry, Stephen	1
Janzen, Troy M.	1
Kavanaugh, Maureen	1
Krawchuk, Lindsey L.	1
Lovett, Benjamin J.	1
Luecht, Richard M.	1
May, Henry	1
Mrazik, Martin	1
Perez-Johnson, Irma	1
More ▼