Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 17 |
Descriptor
Scores | 67 |
Testing Programs | 67 |
Test Validity | 51 |
State Programs | 23 |
Elementary Secondary Education | 19 |
Achievement Tests | 17 |
Test Use | 15 |
Standardized Tests | 14 |
Test Reliability | 14 |
Test Results | 13 |
Validity | 13 |
More ▼ |
Source
Author
Bowman, Harry L. | 3 |
Alonzo, Julie | 2 |
Anderson, Daniel | 2 |
Kupermintz, Haggai | 2 |
Tindal, Gerald | 2 |
Abdulla, Abdulbaset | 1 |
Allen, Nancy L. | 1 |
Armstrong, Bill | 1 |
Arter, Judith A. | 1 |
Barron, Sheila I. | 1 |
Bay, Luz G. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 6 |
Grade 3 | 4 |
Grade 4 | 4 |
Grade 5 | 4 |
Grade 6 | 4 |
Grade 7 | 4 |
Grade 8 | 4 |
Secondary Education | 3 |
Elementary Education | 2 |
High Schools | 2 |
Higher Education | 2 |
More ▼ |
Location
Alaska | 2 |
Indiana | 2 |
Kentucky | 2 |
Oregon | 2 |
Tennessee | 2 |
Australia | 1 |
California | 1 |
Canada | 1 |
Canada (Edmonton) | 1 |
Japan | 1 |
Louisiana | 1 |
More ▼ |
Laws, Policies, & Programs
Comprehensive Education… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Rudner, Lawrence M. – Graduate Management Admission Council, 2012
In order to remain relevant and useful, testing programs must periodically update their tests to match shifts in student populations and school curricula. One might think that because the publishers are offering a product, the responsibility for updates rests entirely on them. Publishers conduct studies to assure new content is appropriate. But…
Descriptors: College Entrance Examinations, Graduate Study, Business Administration Education, Test Validity
New York State Education Department, 2015
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…
Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5
Creagh, Sue – TESOL in Context, 2014
Teachers are now experiencing the age of quantitative test-driven assessment, in which there is little weight accorded to teacher-based judgement about student progress. In the Australian context, the NAPLaN test has become a driving force in school and teacher accountability. The language of NAPLaN is one of bands and numerical scores and…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Student Evaluation
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
In this technical report, we document the results of a cross-validation study designed to identify optimal cut-scores for the use of the easyCBM[R] mathematics test in the state of Washington. A large sample, randomly split into two groups of roughly equal size, was used for this study. Students' performance classification on the Washington state…
Descriptors: Testing Programs, Mathematics Tests, Prediction, Measurement Techniques
Park, Bitnara Jasmine; Irvin, P. Shawn; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical report presents results from a cross-validation study designed to identify optimal cut scores when using easyCBM[R] reading tests in Oregon. The cross-validation study analyzes data from the 2009-2010 academic year for easyCBM[R] reading measures. A sample of approximately 2,000 students per grade, randomly split into two groups of…
Descriptors: Testing Programs, Reading Tests, Prediction, Measurement Techniques
Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011
This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…
Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness
Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007
This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…
Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness
Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009
This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…
Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability
Wang, Shudong; Jiao, Hong – Educational and Psychological Measurement, 2009
In practice, vertical scales have been continually used to measure students' achievement progress across several grade levels and have been considered very challenging psychometric procedures. Recently, such practices have been drawing many criticisms. The major criticisms focus on dimensionality and construct equivalence of the latent trait or…
Descriptors: Reading Comprehension, Elementary Secondary Education, Measures (Individuals), Psychometrics
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Li, Yuan H.; Tompkins, Leroy J. – International Journal of Testing, 2004
The primary objective of this study was to examine the construct validity for the 2 multiple-content testing programs-the multiple-choice Comprehensive Tests of Basic Skills (CTBS/5) together with the performance-based Maryland School Performance Assessment Program (MSPAP)-by evaluating the true-score longitudinal associations among…
Descriptors: Testing Programs, Structural Equation Models, Performance Based Assessment, Multitrait Multimethod Techniques