Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Author
Koretz, Daniel | 2 |
Pomplun, Mark | 2 |
Roth, Rodney | 2 |
Airasian, Peter W. | 1 |
Baldwin, Janet | 1 |
Barno, Trina Adler | 1 |
Bebell, Damian | 1 |
Behuniak, Peter | 1 |
Burns, Matthew | 1 |
Carey, Neil B. | 1 |
Carvajal, Jorge | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 37 |
Journal Articles | 14 |
Speeches/Meeting Papers | 7 |
Numerical/Quantitative Data | 4 |
Reports - Research | 2 |
Collected Works - Proceedings | 1 |
Reports - Descriptive | 1 |
Education Level
Elementary Secondary Education | 4 |
Elementary Education | 1 |
High Schools | 1 |
Secondary Education | 1 |
Location
Vermont | 3 |
Arkansas | 2 |
California | 2 |
Florida | 2 |
Massachusetts | 2 |
Alabama | 1 |
Alaska | 1 |
Arizona | 1 |
Idaho | 1 |
Illinois | 1 |
Kansas | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Safe and Drug Free Schools… | 1 |
Assessments and Surveys
Florida State Student… | 2 |
Alabama High School… | 1 |
National Assessment of… | 1 |
National Teacher Examinations | 1 |
New Jersey High School… | 1 |
Preschool Language Scale | 1 |
What Works Clearinghouse Rating
Olinghouse, Natalie G.; Zheng, Jinjie; Morlock, Larissa – Reading & Writing Quarterly, 2012
This study evaluated large-scale state writing assessments for the inclusion of motivational characteristics in the writing task and written prompt. We identified 6 motivational variables from the authentic activity literature: time allocation, audience specification, audience intimacy, definition of task, allowance for multiple perspectives, and…
Descriptors: Writing Evaluation, Writing Tests, Writing Achievement, Audiences
Goldstein, Jessica; Behuniak, Peter – Assessment for Effective Intervention, 2011
State-level testing programs continue to grow, and the challenge of validation does not wane. Although more than a decade has passed since the 1999 Joint Standards for Educational and Psychological Testing set out a call for the organization of validity evidence into validity arguments, practical examples of such arguments are not readily…
Descriptors: Testing Programs, State Programs, Alternative Assessment, Test Validity
Gold, Abby; Barno, Trina Adler; Sherman, Shelley; Lovett, Kathleen; Hurtado, G. Ali – Journal of Extension, 2013
Systematic evaluation is an essential tool for understanding program effectiveness. This article describes the pilot test of a statewide evaluation tool for the Supplemental Nutrition Assistance Program-Education (SNAP-Ed). A computer algorithm helped Community Nutrition Educators (CNEs) build surveys specific to their varied educational settings…
Descriptors: State Programs, Program Evaluation, Program Effectiveness, Evaluation Methods
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility
Roth, Rodney – 1980
In 1979-80, the Arkansas minimum competency tests were administered to a sample of 5,000 students in grades 3, 6, and 8. To determine how well test objectives matched the curriculum, their teachers estimated how many of the four items per objective a randomly selected student would answer correctly. Because chi square test comparisons of teacher…
Descriptors: Elementary Education, Minimum Competency Testing, Models, Probability

Pomplun, Mark – Applied Measurement in Education, 1997
A method to investigate consequential evidence of validity for a state assessment developed to change teacher instructional practices is presented. Survey responses from over 1,000 Kansas teachers were used to construct a path model that allowed effects of the state assessment to be studied at building and teacher levels. (SLD)
Descriptors: Educational Assessment, Educational Change, Instructional Effectiveness, Path Analysis
Masonis, Edward J. – 1987
Security procedures for the New Jersey High School Proficiency Test (HSPT) are discussed and evaluated. All New Jersey high school students are required to pass the HSPT, which was administered for the first time in 1984. Generally, security plans are designed to limit access to test questions prior to test administration and to prevent…
Descriptors: Cheating, Confidentiality, High Schools, Planning

Haney, Walt; Fowler, Clarke; Wheelock, Anne; Bebell, Damian; Malec, Nicole – Education Policy Analysis Archives, 1999
Using data from state and academic reports, an independent committee of researchers has evaluated the Massachusetts Teacher Tests. Scores are found to be highly unreliable, and the tests are found to contain questionable content. Suspending use of the tests is recommended. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation

Koretz, Daniel; Stecher, Brian; Klein, Stephen; McCaffrey, Daniel – Educational Measurement: Issues and Practice, 1994
Reports on an ongoing evaluation of the Vermont portfolio assessment program. Indicates that the positive news about the instructional effects of the assessment program are in contrast with the empirical findings about the quality of the data the program has yielded. (SLD)
Descriptors: Accountability, Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment
Maestri, Melissa Amy – History Teacher, 2006
The research for this study was undertaken to analyze the New York State 11th grade United States History Regents exams through conducting a content analysis of the types of multiple-choice questions asked in Part I of the tests with a particular emphasis on the variety of questions asked regarding women and race. Because these tests stand at the…
Descriptors: Grade 11, United States History, Multicultural Education, Content Analysis
Mead, Nancy A. – 1980
Focusing on the problems of assessing the speaking skills of secondary school students, this paper provides one example of how those problems were addressed in the Massachusetts speaking assessment. The paper identifies four requirements for measures of speaking skills: (1) feasibility, (2) reliability, (3) validity, and (4) freedom from bias. The…
Descriptors: Educational Assessment, Evaluation Criteria, Evaluation Methods, Measurement Techniques

Pecheone, Raymond L.; Carey, Neil B. – Journal of Personnel Evaluation in Education, 1990
The Connecticut Teacher Assessment Center Project has, since 1986, been developing a semistructured interview in the area of mathematics to evaluate beginning teacher competence. The strategy for validation of the project's performance tests, Connecticut's reform initiatives, and implications of systematic validity for traditional psychometric…
Descriptors: Beginning Teachers, Higher Education, Interviews, Licensing Examinations (Professions)
Northwest Regional Educational Lab., Portland, OR. – 1978
Key findings of a pilot study of the Alaska Instructional Diagnostic System (AIDS) are summarized. The AIDS pilot test served to verify the appropriateness of the skills survey as well as the validity and reliability of the items. The AIDS testing system includes three components: (1) upper level skills surveys (grades 3-8); (2) lower level skill…
Descriptors: Achievement Tests, Diagnostic Tests, Educational Assessment, Educational Objectives

Airasian, Peter W. – Educational Evaluation and Policy Analysis, 1988
High-stakes state-mandated testing programs are discussed, illustrating that proposed educational innovations are adopted because of their power as symbols of value orientations in the wider culture. In such programs, tests represent order and control, focus on important outcomes, and symbolize basic moral values. (SLD)
Descriptors: College Entrance Examinations, Cultural Influences, Educational Change, Educational Improvement