ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Tables (Data)	45
Reliability	22
Test Reliability	20
Test Construction	14
Elementary Secondary Education	12
Comparative Analysis	10
Scoring	10
Test Validity	9
Data Collection	8
Research Methodology	8
Evaluation Methods	7
Mathematics Tests	7
Validity	7
Elementary Education	6
Higher Education	6
Interrater Reliability	6
National Surveys	6
Questionnaires	6
Scores	6
Test Items	6
Testing Programs	6
Academic Achievement	5
Equated Scores	5
Item Analysis	5
Standardized Tests	5
More ▼

Source

American Institutes for…	1
College Student Experiences…	1
College and University	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Northwest Evaluation…	1
Online Submission	1
Perceptual and Motor Skills	1
West Virginia Department of…	1

Publication Type

Numerical/Quantitative Data	45
Reports - Research	21
Reports - Evaluative	13
Speeches/Meeting Papers	6
Journal Articles	4
Reports - Descriptive	4
Tests/Questionnaires	4
Collected Works - General	3
Books	1
Guides - Classroom - Learner	1
Guides - Non-Classroom	1
More ▼

Education Level

Elementary Secondary Education	2
Early Childhood Education	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers	1
Students	1

Location

California	1
Illinois	1
Israel	1
New York	1
South Carolina	1
West Virginia	1

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…	3
Trends in International…	3
California Achievement Tests	2
Metropolitan Achievement Tests	2
National Household Education…	2
SRA Achievement Series	2
Bayley Scales of Infant…	1
Child Behavior Checklist	1
College Student Experiences…	1
Gates MacGinitie Reading Tests	1
Group Embedded Figures Test	1
Iowa Tests of Basic Skills	1
National Longitudinal Survey…	1
Schools and Staffing Survey…	1
Sequential Tests of…	1
Stanford Achievement Tests	1
Wechsler Adult Intelligence…	1
Woodcock Johnson Psycho…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

Linking the ACT ASPIRE Assessments to NWEA MAP Assessments

Download full text

Northwest Evaluation Association, 2016

Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…

Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores

Independent Evaluation of California's Race to the Top: Early Learning Challenge Quality Rating and Improvement System. Half-Term Report

Download full text

Hawkinson, Laura E.; Quick, Heather E.; Muenchow, Susan; Anthony, Jennifer; Weinberg, Emily; Holod, Aleksandra; Parrish, Deborah; Meakin, John; Lee, Dong Hoon; Tarrant, Kate; Cannon, Jill S.; Zellman, Gail L.; Karoly, Lynn A. – American Institutes for Research, 2015

The first step in the Validity and Reliability Study summarizes the history and purpose of California's quality rating and improvement system (QRIS), reviews findings from other QRIS evaluation studies, and describes the approach to validating the system in California. The majority of this report focuses on providing context for the California…

Descriptors: Competition, Early Childhood Education, Achievement Rating, State Standards

Findings from the 2012 West Virginia Online Writing Scoring Comparability Study

Download full text

Hixson, Nate; Rhudy, Vaughn – West Virginia Department of Education, 2013

Student responses to the West Virginia Educational Standards Test (WESTEST) 2 Online Writing Assessment are scored by a computer-scoring engine. The scoring method is not widely understood among educators, and there exists a misperception that it is not comparable to hand scoring. To address these issues, the West Virginia Department of Education…

Descriptors: Scoring Formulas, Scoring Rubrics, Interrater Reliability, Test Scoring Machines

Frequencies of T-Score Differences between Achenbach Child Behavior Checklist and Teacher's Report Form Summary Scales

Peer reviewed

Direct link

Harris, Milton E.; Tiedemann-Fuller, Meghan – Journal of Psychoeducational Assessment, 2010

A table is provided giving observed difference frequencies for caregiver versus teacher ratings of children on the Child Behavior Checklist and Teacher's Report Form Internalizing, Externalizing, and Total Problems scales per the original normative samples. The table permits accurate evaluation of the empirical rarity of specific cross-informant…

Descriptors: Check Lists, Performance Based Assessment, Test Validity, Child Behavior

New York City School Survey 2008-2010: Assessing the Reliability and Validity of a Progress Report Measure. Technical Report

Download full text

Nathanson, Lori; Cole, Rachel; Kemple, James J.; Lent, Jessica; McCormick, Meghan; Segeritz, Micha – Online Submission, 2013

The New York City Department of Education's (DOE) annual survey of parents, students, and teachers is the largest of its kind in the United States. The DOE relies on the survey to identify schools' strengths and to target areas for improvement. School Survey scores, along with attendance, are also the only non-academic indicators used in the DOE's…

Descriptors: Validity, Urban Schools, Institutional Characteristics, School Surveys

Reliability of the Test of Spoken English Revisited. Research Reports, Report 40.

Download full text

Boldt, R. F. – 1992

The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…

Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency

Assessing the Reliability of Tests Used to Make Pass/Fail Decisions.

Peer reviewed

Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979

Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)

Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement

A Generalizability Approach To Evaluating the Reliability of Testlet-Based Test Scores.

Download full text

Lee, Guemin; Frisbie, David A. – 1997

Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…

Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Scores

Tables of Reliability Coefficients for Mastery Tests.

Download full text

Subkoviak, Michael J. – 1985

Current methods of obtaining reliability coefficients for mastery tests are laborious from a practitioner's perspective. Some methods require two test administrations; while others require access to computer facilities and/or advanced measurement and statistical procedures. This report provides tables from which practitioners can read such…

Descriptors: Estimation (Mathematics), Mastery Tests, Statistical Studies, Tables (Data)

Reliability of Scores from Tests Composed of Testlets: A Comparison of Methods.

Download full text

Hendrickson, Amy B. – 2001

The purpose of the study was to compare reliability estimates for a test composed of stimulus-dependent testlets as derived from item scores, testlet scores, and under the univariate generalizability theory and multivariate generalizability theory designs, as well as to determine the influence of the number of testlets and the number of items per…

Descriptors: Comparative Analysis, Reliability, Scores, Standardized Tests

Maryland School Performance Assessment Program (MSPAP), 1997. Technical Report.

Download full text

Maryland State Dept. of Education, Baltimore. – 1998

Maryland School Performance Assessment Program (MSPAP) assessments are criterion-referenced performance tests designed, developed, and implemented by the Maryland State Department of Education in collaboration with classroom teachers and other Maryland educators. MSPAP is the major strategy for implementing Maryland's educational reform…

Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring

Maryland School Performance Assessment Program (MSPAP), 1998. Technical Report.

Download full text

Measurement Inc., Durham, NC. – 1999

Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring

Maryland School Performance Assessment Program (MSPAP), 1999. Technical Report.

Download full text

Maryland State Dept. of Education, Baltimore. – 2000

Descriptors: Elementary Secondary Education, Program Implementation, Reliability, Scoring

Report on Performance Standards in Mathematics and English: Results from the New Standards Project, Big Sky Scoring Conference. Project 2.3: Complex Performance Assessments: Expanding the Scope and Approaches to Assessment.

Download full text

Resnick, Lauren; And Others – 1993

The New Standards Project (NSP) is an effort to create a state- and district-based assessment and professional development system to serve as a catalyst for major educational reform. As part of a professional development strategy tied to assessment, 114 teachers, curriculum supervisors, and assessment directors, representing 23 states and…

Descriptors: Academic Standards, Educational Assessment, Educational Change, Elementary Secondary Education

Stability and Internal Consistency Reliability of Personal Preferences Self-Description Questionnaire (PPSDQ) Scores.

Download full text

Thompson, Bruce; Arnau, Randolph C. – 1998

The Personal Preferences Self-Description Questionnaire (PPSDQ) (B. Thompson) was developed to measure personal preferences with regard to Jungian psychological types. Instruments in this area are among the most popular measures used in education and psychology; the measures are used in matching teaching and learning styles, in individual…

Descriptors: Cognitive Style, College Students, Higher Education, Personality Assessment

Previous Page | Next Page »

Pages: 1 | 2 | 3

Brick, J. Michael	2
Martin, Michael O., Ed.	2
Thompson, Bruce	2
Anthony, Jennifer	1
Arnau, Randolph C.	1
Atkinson, Leslie	1
Bashaw, W. L.	1
Beaton, Albert E.	1
Belcher, Marcia J.	1
Benor, Dan E.	1
Bergman, Rebecca	1
Bianchini, John C.	1
Boldt, R. F.	1
Canner, Jane	1
Cannon, Jill S.	1
Cole, Rachel	1
Dorans, Neil J.	1
Dugoni, Bernard	1
Frisbie, David A.	1
Gonyea, Robert M.	1
Goodman, Marvin	1
Gray, H. Dean	1
Halpin, Glennelle	1
Harris, Lynn J.	1
More ▼