ERIC - Search Results

Showing all 14 results

The Sensitivity of Teacher Value-Added Scores to the Use of Fall or Spring Test Scores

Peer reviewed

Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020

Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…

Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making

Combining Scores in Multiple-Criteria Assessment Systems: The Impact of Combination Rule

Peer reviewed

McBee, Matthew T.; Peters, Scott J.; Waterman, Craig – Gifted Child Quarterly, 2014

Best practice in gifted and talented identification procedures involves making decisions on the basis of multiple measures. However, very little research has investigated the impact of different methods of combining multiple measures. This article examines the consequences of the conjunctive ("and"), disjunctive/complementary…

Descriptors: Best Practices, Ability Identification, Academically Gifted, Correlation

Multiple Linking in Equating and Random Scale Drift. Research Report. ETS RR-11-46

Peer reviewed

Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011

Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…

Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs

Large-Scale Academic Achievement Testing of Deaf and Hard-of-Hearing Students: Past, Present, and Future

Peer reviewed

Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012

The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…

Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement

Consistency of Standard Setting in an Augmented State Testing System

Peer reviewed

Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008

In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…

Descriptors: Testing Programs, State Programs, Standard Setting, Reliability

Defending a State Graduation Test: "GI Forum v. Texas Education Agency." Measurement Perspectives from an External Evaluator.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 2000

Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…

Descriptors: Curriculum, Psychometrics, Reliability, Standards

Stability of School-Level Scores from Large-Scale Student Assessment.

Peer reviewed

Sicoly, Fiore – Applied Measurement in Education, 2002

Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)

Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics

An Investigation of Difference Scores for a Grade-Level Testing Program.

Peer reviewed

Yin, Ping; Brennan, Robert L. – International Journal of Testing, 2002

Studied longitudinal changes in performance at both the student and school district level in major content areas of a widely used norm-referenced grade-level testing program. Used data from grades 3 to 4 and from 7 to 8 of the Iowa Tests of Basic Skills (in Iowa). Reports descriptive statistics and empirical norms and reliability estimates for…

Descriptors: Achievement Tests, Elementary Education, Elementary School Students, Longitudinal Studies

Conditional Standard Errors of Measurement for Scale Scores.

Peer reviewed

Kolen, Michael J.; And Others – Journal of Educational Measurement, 1992

A procedure is described for estimating the reliability and conditional standard errors of measurement of scale scores incorporating the discrete transformation of raw scores to scale scores. The method is illustrated using a strong true score model, and practical applications are described. (SLD)

Descriptors: College Entrance Examinations, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)

Performance Puzzles.

Peer reviewed

Resnick, Lauren B. – American Journal of Education, 1994

Explores issues involved in using assessments to define standards and encourage efforts to meet them and compares the European examination system with the American testing system. Also considered are issues of the definition of learning domains in ways that do not encourage narrowly focused training on specific assessment items. (SLD)

Descriptors: Academic Achievement, Comparative Analysis, Definitions, Educational Assessment

The Myth of the Texas Miracle in Education.

Peer reviewed

Haney, Walt – Education Policy Analysis Archives, 2000

Summarizes the recent history of education reform and statewide testing in Texas. Suggests that analyses comparing Texas Assessment of Academic Skills (TAAS) reading, writing, and mathematics scores with one another and relevant high school grades raise doubts about the reliability and validity of TAAS scores. Discusses these problems. (SLD)

Descriptors: Academic Achievement, Achievement Gains, Educational Change, Grade Point Average

Generalizability of Large-Scale Performance Assessments in Science: Promises and Problems.

Peer reviewed

Gao, Xiaohong; And Others – Applied Measurement in Education, 1994

This study provides empirical evidence about the sampling variability and generalizability (reliability) of a statewide performance assessment for grade six. Results for 600 students at individual and school levels indicate that task-sampling variability was the major source of measurement error. Rater-sampling variability was negligible. (SLD)

Descriptors: Achievement Tests, Educational Assessment, Elementary School Students, Error of Measurement

Reasoning about Evidence in Portfolios: Cognitive Foundations for Valid and Reliable Assessment.

Peer reviewed

Heller, Joan I.; Sheingold, Karen; Myford, Carol M. – Educational Assessment, 1998

Analyses of 10 raters' reasoning during think-aloud interviews provided evidence to support a model of the fundamental processes involved in rating standards-based, nonprescriptive portfolios. This process model provides a framework within which to conceptualize sound rater reasoning and to identify reasoning that distorts the meaning of scores.…

Descriptors: Elementary Secondary Education, Evaluators, Interviews, Performance Based Assessment

Reliability and Decision Consistency: An Analysis of Writing Mode at Two Times on a Statewide Test.

Peer reviewed

Hollenbeck, Keith; Tindal, Gerald; Almond, Patricia – Educational Assessment, 1999

Studied the amount of measurement error in a state's performance-based writing task as it relates to high-stakes decision reproducibility. Using 175 eighth-grade writing samples, the study finds moderate correlations between the two raters' scores, with significant differences for the rates for the handwritten, but not the typed, essays. (SLD)

Descriptors: Decision Making, Error of Measurement, Essay Tests, Grade 8
