NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)2
Since 2006 (last 20 years)6
Laws, Policies, & Programs
Comprehensive Education…1
What Works Clearinghouse Rating
Showing 1 to 15 of 27 results Save | Export
Klugman, Emma M.; Ho, Andrew D. – Annenberg Institute for School Reform at Brown University, 2020
State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable estimation of student score distributions and achievement…
Descriptors: Test Items, State Programs, Testing Programs, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Klugman, Emma M.; Ho, Andrew D. – Educational Measurement: Issues and Practice, 2020
State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable estimation of student score distributions and achievement…
Descriptors: Testing Programs, State Programs, Test Items, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Kelley, Ronald Scott – ProQuest LLC, 2012
Scope and Method of Study: This study focused on the development and use of the AT-SAT test battery and the Initial En Route Qualification training course for the selection, training, and evaluation of air traffic controller candidates. The Pearson product moment correlation coefficient was used to measure the linear relationship between the…
Descriptors: Traffic Safety, Scores, Equated Scores, Multiple Regression Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
National Center for Education Statistics, 2013
The 2011 NAEP-TIMSS linking study conducted by the National Center for Education Statistics (NCES) was designed to predict Trends in International Mathematics and Science Study (TIMSS) scores for the U.S. states that participated in 2011 National Assessment of Educational Progress (NAEP) mathematics and science assessment of eighth-grade students.…
Descriptors: Grade 8, Research Methodology, Research Design, Trend Analysis
Delaware State Dept. of Education, Dover. Assessment and Accountability Branch. – 2003
This guide contains materials to help Delaware educators understand and use reports from the Delaware Student Testing Program (DSTP). The DSTP tests are tied to the Delaware content standards that define the knowledge and skills required for students to progress beyond high school. In spring 2003, the DSTP reading, writing, and mathematics tests…
Descriptors: Elementary Secondary Education, Scores, State Programs, Teachers
Koretz, Daniel M.; Barron, Sheila I. – 1998
Large gains in scores have been observed over the first years of the Kentucky Instructional Results Information System (KIRIS) program. The extent to which these gains in scores indicate that student learning improved was evaluated. Previous studies have suggested that KIRIS score gains might be appreciably inflated, something that might result…
Descriptors: Achievement Gains, Elementary Secondary Education, Scores, State Programs
Bassler, Otto C.; Caulkins, Thomas G. – 1984
A model for summarizing test scores and using them to modify instructional programs is presented. The proposed model consists of two types of summaries of the data gathered through standardized tests. The first summary contains individual and single class results. Information in a "Class Item Response Record" chart provides individual student…
Descriptors: Elementary Secondary Education, Instructional Improvement, Models, Scores
Crehan, Kevin D.; Haladyna, Thomas M. – 1994
More attention is currently being paid to the distractors of a multiple-choice test item (Thissen, Steinberg, and Fitzpatrick, 1989). A systematic relationship exists between the keyed response and distractors in multiple-choice items (Levine and Drasgow, 1983). New scoring methods have been introduced, computer programs developed, and research…
Descriptors: Comparative Analysis, Computer Assisted Testing, Distractors (Tests), Models
Klein, Stephen P.; Bolus, Roger – 1983
A solution to reduce the likelihood of one examinee copying another's answers on large scale tests that require all examinees to answer the same set of questions is to use multiple test forms that differ in terms of item ordering. This study was conducted to determine whether varying the sequence in which blocks of items were presented to…
Descriptors: Adults, Cheating, Cost Effectiveness, Item Analysis
Sykes, Robert C.; Heidorn, Mark; Lee, Guemin – 1999
A study was conducted to evaluate the effect of different modes (modalities) of assigning raters to test items. The impact on total constructed response (c.r.) score, and subsequently on total test score, of assigning a single versus multiple raters to an examination reading of a student's set of c.r. responses was evaluated for several mixed-item…
Descriptors: Constructed Response, Elementary School Students, Elementary Secondary Education, Evaluators
Linn, Robert L.; Betebenner, Damian W.; Wheeler, Kerry S. – 1998
For assessments that present problems that require extended responses and substantial amounts of time, there is often a desire to allow students to choose which problem they will respond to among two or more options. Student choice of problem may allow students a better opportunity to demonstrate what they know and are able to do. On the other…
Descriptors: Comparative Analysis, Construct Validity, Constructed Response, Grade 10
Michigan State Board of Education, Lansing. Michigan Educational Assessment Program. – 1983
A statewide sample testing of writing skills in the fourth, seventh, and tenth grades was conducted by the Michigan Educational Assessment Program in the fall of 1982. This report, which shares the writing assessment information (historical, descriptive, and interpretive) for educational decision-making, was prepared in the hope of increasing…
Descriptors: Academic Achievement, Criterion Referenced Tests, Educational Assessment, Elementary Secondary Education
Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg – Online Submission, 2005
Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…
Descriptors: Test Items, Writing Tests, Reading Tests, Measures (Individuals)
Previous Page | Next Page »
Pages: 1  |  2