ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	6

Descriptor

Scores	27
Test Items	27
Testing Programs	27
State Programs	12
Elementary Secondary Education	9
Test Results	9
Test Construction	8
Test Use	7
Academic Achievement	6
Achievement Tests	5
Elementary School Students	5
Item Analysis	5
Mathematics Achievement	5
Mathematics Tests	5
Comparative Analysis	4
Educational Assessment	4
Foreign Countries	4
Multiple Choice Tests	4
Test Validity	4
Achievement Gains	3
Comparative Testing	3
Computer Assisted Testing	3
Computer Software	3
Correlation	3
Equated Scores	3

Source

ETS Research Report Series	2
Annenberg Institute for…	1
Educational Measurement:…	1
National Center for Education…	1
Online Submission	1
ProQuest LLC	1

Publication Type

Reports - Research	12
Speeches/Meeting Papers	7
Reports - Evaluative	6
Reports - Descriptive	5
Numerical/Quantitative Data	4
Guides - Non-Classroom	3
Journal Articles	3
Tests/Questionnaires	2
Dissertations/Theses -…	1
Dissertations/Theses -…	1

Education Level

Grade 8	2
Elementary Secondary Education	1
Grade 4	1
Grade 6	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners	4
Teachers	2
Administrators	1
Researchers	1

Location

Australia	2
Delaware	2
Canada (Edmonton)	1
Florida	1
Kentucky	1
New Jersey	1
Oregon	1
Tennessee	1

Laws, Policies, & Programs

Comprehensive Education…

Assessments and Surveys

National Assessment of…	2
California Achievement Tests	1
Delaware Student Testing…	1
Florida State Student…	1
Gates MacGinitie Reading Tests	1
Graduate Management Admission…	1
National Teacher Examinations	1
New Jersey High School…	1
North Carolina End of Course…	1
Praxis Series	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Trends in International…	1

Showing 1 to 15 of 27 results

How Can Released State Test Items Support Interim Assessment Purposes in an Educational Crisis? EdWorkingPaper No. 20-292

Klugman, Emma M.; Ho, Andrew D. – Annenberg Institute for School Reform at Brown University, 2020

State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable estimation of student score distributions and achievement…

Descriptors: Test Items, State Programs, Testing Programs, Scores

How Can Released State Test Items Support Interim Assessment Purposes in an Educational Crisis?

Peer reviewed

Klugman, Emma M.; Ho, Andrew D. – Educational Measurement: Issues and Practice, 2020

Descriptors: Testing Programs, State Programs, Test Items, Scores

Constructed-Response DIF Evaluations for Mixed-Format Tests. Research Report. ETS RR-13-33

Peer reviewed
PDF on ERIC

Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013

In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis

Relationship between Air Traffic Selection and Training (AT-SAT) Battery Test Scores and Composite Scores in the Initial en Route Air Traffic Control Qualification Training Course at the Federal Aviation Administration (FAA) Academy

Kelley, Ronald Scott – ProQuest LLC, 2012

Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…

Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

2011 NAEP-TIMSS Linking Study: Linking Methodologies and Their Evaluations. NCES 2013-469

Peer reviewed
PDF on ERIC

National Center for Education Statistics, 2013

The 2011 NAEP-TIMSS linking study conducted by the National Center for Education Statistics (NCES) was designed to predict Trends in International Mathematics and Science Study (TIMSS) scores for the U.S. states that participated in 2011 National Assessment of Educational Progress (NAEP) mathematics and science assessment of eighth-grade students.…

Descriptors: Grade 8, Research Methodology, Research Design, Trend Analysis

Delaware Student Testing Program: A Score Results Guide for Educators.

Delaware State Dept. of Education, Dover. Assessment and Accountability Branch. – 2003

This guide contains materials to help Delaware educators understand and use reports from the Delaware Student Testing Program (DSTP). The DSTP tests are tied to the Delaware content standards that define the knowledge and skills required for students to progress beyond high school. In spring 2003, the DSTP reading, writing, and mathematics tests…

Descriptors: Elementary Secondary Education, Scores, State Programs, Teachers

The Validity of Gains in Scores on the Kentucky Instructional Results Information System (KIRIS).

Koretz, Daniel M.; Barron, Sheila I. – 1998

Large gains in scores have been observed over the first years of the Kentucky Instructional Results Information System (KIRIS) program. The extent to which these gains in scores indicate that student learning improved was evaluated. Previous studies have suggested that KIRIS score gains might be appreciably inflated, something that might result…

Descriptors: Achievement Gains, Elementary Secondary Education, Scores, State Programs

Using Test Results to Improve Instruction.

Bassler, Otto C.; Caulkins, Thomas G. – 1984

A model for summarizing test scores and using them to modify instructional programs is presented. The proposed model consists of two types of summaries of the data gathered through standardized tests. The first summary contains individual and single class results. Information in a "Class Item Response Record" chart provides individual student…

Descriptors: Elementary Secondary Education, Instructional Improvement, Models, Scores

A Comparison of Three Linear Polytomous Scoring Methods.

Crehan, Kevin D.; Haladyna, Thomas M. – 1994

More attention is currently being paid to the distractors of a multiple-choice test item (Thissen, Steinberg, and Fitzpatrick, 1989). A systematic relationship exists between the keyed response and distractors in multiple-choice items (Levine and Drasgow, 1983). New scoring methods have been introduced, computer programs developed, and research…

Descriptors: Comparative Analysis, Computer Assisted Testing, Distractors (Tests), Models

The Effect of Item Sequence on Bar Examination Scores.

Klein, Stephen P.; Bolus, Roger – 1983

A solution to reduce the likelihood of one examinee copying another's answers on large scale tests that require all examinees to answer the same set of questions is to use multiple test forms that differ in terms of item ordering. This study was conducted to determine whether varying the sequence in which blocks of items were presented to…

Descriptors: Adults, Cheating, Cost Effectiveness, Item Analysis

The Assignment of Raters to Items: Controlling for Rater Effects.

Sykes, Robert C.; Heidorn, Mark; Lee, Guemin – 1999

A study was conducted to evaluate the effect of different modes (modalities) of assigning raters to test items. The impact on total constructed response (c.r.) score, and subsequently on total test score, of assigning a single versus multiple raters to an examination reading of a student's set of c.r. responses was evaluated for several mixed-item…

Descriptors: Constructed Response, Elementary School Students, Elementary Secondary Education, Evaluators

Problem Choice by Test Takers: Implications for Comparability and Construct Validity. CSE Technical Report 485.

Linn, Robert L.; Betebenner, Damian W.; Wheeler, Kerry S. – 1998

For assessments that present problems that require extended responses and substantial amounts of time, there is often a desire to allow students to choose which problem they will respond to among two or more options. Student choice of problem may allow students a better opportunity to demonstrate what they know and are able to do. On the other…

Descriptors: Comparative Analysis, Construct Validity, Constructed Response, Grade 10

Writing Education Interpretive Report. Michigan Educational Assessment Program, 1982-83.

Michigan State Board of Education, Lansing. Michigan Educational Assessment Program. – 1983

A statewide sample testing of writing skills in the fourth, seventh, and tenth grades was conducted by the Michigan Educational Assessment Program in the fall of 1982. This report, which shares the writing assessment information (historical, descriptive, and interpretive) for educational decision-making, was prepared in the hope of increasing…

Descriptors: Academic Achievement, Criterion Referenced Tests, Educational Assessment, Elementary Secondary Education

The Effect of Year-to-Year Rater Variation on IRT Linking

Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg – Online Submission, 2005

Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…

Descriptors: Test Items, Writing Tests, Reading Tests, Measures (Individuals)

Author

Ho, Andrew D.	2
Klugman, Emma M.	2
Allen, Nancy L.	1
Arter, Judith A.	1
Barron, Sheila I.	1
Bassler, Otto C.	1
Betebenner, Damian W.	1
Bolus, Roger	1
Bowman, Harry L.	1
Breyer, F. Jay	1
Caulkins, Thomas G.	1
Clarke, S. C. T.	1
Crehan, Kevin D.	1
Deng, Weiling	1
Dorans, Neil J.	1
Estes, Gary D.	1
Friedman, Greg	1
Haladyna, Thomas M.	1
Heidorn, Mark	1
Huang, Zheng Sen	1
Isham, Steven P.	1
Kelley, Ronald Scott	1
Kingston, Neal	1
Klein, Stephen P.	1