Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 2 |
Early Childhood Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Students | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Northwest Evaluation Association, 2016
Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…
Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores
Hawkinson, Laura E.; Quick, Heather E.; Muenchow, Susan; Anthony, Jennifer; Weinberg, Emily; Holod, Aleksandra; Parrish, Deborah; Meakin, John; Lee, Dong Hoon; Tarrant, Kate; Cannon, Jill S.; Zellman, Gail L.; Karoly, Lynn A. – American Institutes for Research, 2015
The first step in the Validity and Reliability Study summarizes the history and purpose of California's quality rating and improvement system (QRIS), reviews findings from other QRIS evaluation studies, and describes the approach to validating the system in California. The majority of this report focuses on providing context for the California…
Descriptors: Competition, Early Childhood Education, Achievement Rating, State Standards
Hixson, Nate; Rhudy, Vaughn – West Virginia Department of Education, 2013
Student responses to the West Virginia Educational Standards Test (WESTEST) 2 Online Writing Assessment are scored by a computer-scoring engine. The scoring method is not widely understood among educators, and there exists a misperception that it is not comparable to hand scoring. To address these issues, the West Virginia Department of Education…
Descriptors: Scoring Formulas, Scoring Rubrics, Interrater Reliability, Test Scoring Machines
Harris, Milton E.; Tiedemann-Fuller, Meghan – Journal of Psychoeducational Assessment, 2010
A table is provided giving observed difference frequencies for caregiver versus teacher ratings of children on the Child Behavior Checklist and Teacher's Report Form Internalizing, Externalizing, and Total Problems scales per the original normative samples. The table permits accurate evaluation of the empirical rarity of specific cross-informant…
Descriptors: Check Lists, Performance Based Assessment, Test Validity, Child Behavior
Nathanson, Lori; Cole, Rachel; Kemple, James J.; Lent, Jessica; McCormick, Meghan; Segeritz, Micha – Online Submission, 2013
The New York City Department of Education's (DOE) annual survey of parents, students, and teachers is the largest of its kind in the United States. The DOE relies on the survey to identify schools' strengths and to target areas for improvement. School Survey scores, along with attendance, are also the only non-academic indicators used in the DOE's…
Descriptors: Validity, Urban Schools, Institutional Characteristics, School Surveys
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency

Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979
Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
Lee, Guemin; Frisbie, David A. – 1997
Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…
Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Scores
Subkoviak, Michael J. – 1985
Current methods of obtaining reliability coefficients for mastery tests are laborious from a practitioner's perspective. Some methods require two test administrations; while others require access to computer facilities and/or advanced measurement and statistical procedures. This report provides tables from which practitioners can read such…
Descriptors: Estimation (Mathematics), Mastery Tests, Statistical Studies, Tables (Data)
Hendrickson, Amy B. – 2001
The purpose of the study was to compare reliability estimates for a test composed of stimulus-dependent testlets as derived from item scores, testlet scores, and under the univariate generalizability theory and multivariate generalizability theory designs, as well as to determine the influence of the number of testlets and the number of items per…
Descriptors: Comparative Analysis, Reliability, Scores, Standardized Tests
Maryland State Dept. of Education, Baltimore. – 1998
Maryland School Performance Assessment Program (MSPAP) assessments are criterion-referenced performance tests designed, developed, and implemented by the Maryland State Department of Education in collaboration with classroom teachers and other Maryland educators. MSPAP is the major strategy for implementing Maryland's educational reform…
Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring
Measurement Inc., Durham, NC. – 1999
Maryland School Performance Assessment Program (MSPAP) assessments are criterion-referenced performance tests designed, developed, and implemented by the Maryland State Department of Education in collaboration with classroom teachers and other Maryland educators. MSPAP is the major strategy for implementing Maryland's educational reform…
Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring
Maryland State Dept. of Education, Baltimore. – 2000
Maryland School Performance Assessment Program (MSPAP) assessments are criterion-referenced performance tests designed, developed, and implemented by the Maryland State Department of Education in collaboration with classroom teachers and other Maryland educators. MSPAP is the major strategy for implementing Maryland's educational reform…
Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring
Resnick, Lauren; And Others – 1993
The New Standards Project (NSP) is an effort to create a state- and district-based assessment and professional development system to serve as a catalyst for major educational reform. As part of a professional development strategy tied to assessment, 114 teachers, curriculum supervisors, and assessment directors, representing 23 states and…
Descriptors: Academic Standards, Educational Assessment, Educational Change, Elementary Secondary Education
Thompson, Bruce; Arnau, Randolph C. – 1998
The Personal Preferences Self-Description Questionnaire (PPSDQ) (B. Thompson) was developed to measure personal preferences with regard to Jungian psychological types. Instruments in this area are among the most popular measures used in education and psychology; the measures are used in matching teaching and learning styles, in individual…
Descriptors: Cognitive Style, College Students, Higher Education, Personality Assessment