Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Scores | 14 |
Test Theory | 14 |
Reliability | 5 |
Test Reliability | 4 |
Computation | 3 |
Psychometrics | 3 |
Test Validity | 3 |
Achievement Tests | 2 |
Comparative Analysis | 2 |
Educational Assessment | 2 |
Elementary Secondary Education | 2 |
More ▼ |
Source
Author
Allalouf, Avi | 1 |
Baird, Jo-Anne | 1 |
Beddow, Peter A. | 1 |
Black, Paul | 1 |
Cizek, Gregory J. | 1 |
Clemens, Nathan H. | 1 |
Crocker, Linda | 1 |
Culpepper, Steven Andrew | 1 |
Davis, John L. | 1 |
Dawson, Thomas E. | 1 |
Frisbie, David A. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 14 |
Journal Articles | 9 |
Speeches/Meeting Papers | 3 |
Guides - Non-Classroom | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Middle Schools | 1 |
Audience
Policymakers | 1 |
Practitioners | 1 |
Location
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Allalouf, Avi – International Journal of Testing, 2014
The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
Descriptors: Quality Control, Scoring, Test Theory, Scores
Culpepper, Steven Andrew – Applied Psychological Measurement, 2013
A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement
Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013
Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…
Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory
Parker, Richard I.; Vannest, Kimberly J.; Davis, John L.; Clemens, Nathan H. – Journal of Special Education, 2012
Within a response to intervention model, educators increasingly use progress monitoring (PM) to support medium- to high-stakes decisions for individual students. For PM to serve these more demanding decisions requires more careful consideration of measurement error. That error should be calculated within a fixed linear regression model rather than…
Descriptors: Measurement, Computation, Response to Intervention, Regression (Statistics)
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Henson, Robin K. – 2000
Because reliability is a function of scores, and not tests per se, it is inaccurate to hold that a given test will yield scores with the same reliability across samples. Therefore, score reliability should always be reported and interpreted in both measurement and substantive studies. In an effort to facilitate this outcome, this paper is intended…
Descriptors: Reliability, Scores, Test Results, Test Theory
Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…
Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001
Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)
Descriptors: Models, Probability, Reliability, Scores
Helms, LuAnn Sherbeck – 1999
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Descriptors: Effect Size, Meta Analysis, Reliability, Scores
Dawson, Thomas E. – 1997
The basic processes in univariate statistics involve partitioning the sum of squares into two components: explained and within. This paper explains that the same partitioning occurs in measurement analyses, i.e., splitting the sum of squares into reliable and unreliable components. In addition, it is shown how the three types of error inherent in…
Descriptors: Estimation (Mathematics), Measurement Techniques, Scores, Statistical Analysis
Reeve, Charlie L.; Lam, Holly – Intelligence, 2005
The simple practice effects commonly observed when retaking general cognitive ability tests present a potential paradox. If observed score changes reflect real changes in g, we must revisit our understanding of its stability. Conversely, if observed score changes reflect something other than a true change in the underlying latent construct, this…
Descriptors: Psychometrics, Cognitive Ability, Cognitive Measurement, Test Theory
Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006
The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures are explored. Classic publications associated with validity, reliability, and score interpretation…
Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability
Jacobson, Linda – American School Board Journal, 1996
Education standards are left to the discretion of individual states. However, efforts to help states and local school districts define world-class standards are intensifying. The U.S. Department of Education, the National Education Goals Panel, and New Standards, a partnership of 17 states and 6 school districts, are among those involved. (MLF)
Descriptors: Academic Standards, Benchmarking, Comparative Analysis, Educational Assessment
Vermont State Dept. of Education, Montpelier. Div. of Federal Assistance. – 1981
Because testing, in many different forms, currently plays such an important role in education, Elementary Secondary Education Act, Title I, the Division of Federal Assistance in the Vermont State Department of Education, prepared this brochure to present a general introduction to terms and phrases commonly used in testing and to highlight some of…
Descriptors: Achievement Tests, Criterion Referenced Tests, Diagnostic Tests, Elementary Secondary Education