Topczewski, Anna Marie – ProQuest LLC, 2013
Developmental score scales represent student performance along a continuum: as students learn more, they move higher along it. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method for creating developmental score scales. Research has shown that UIRT vertical scaling methods can be…
Descriptors: Item Response Theory, Scaling, Scores, Student Development
Popham, W. James – Educational Leadership, 2009
If a person were to ask an educator to identify the two most important attributes of an education test, the response most certainly would be "validity and reliability." These two tightly wedded concepts have become icons in the field of education assessment. As far as validity is concerned, the term doesn't refer to the accuracy of a test. Rather,…
Descriptors: Educational Testing, Educational Assessment, Student Evaluation, Test Reliability
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984
This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)
Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing

Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models

Demetrulias, Diana Mayer – Journal of Teacher Education, 1980
Even in an educational tests and measurements classroom, it is possible to stimulate student creativity and divergent thinking. Examples of projects and test constructions are given to provide food for thought for the creative teacher. (JN)
Descriptors: Creative Thinking, Creativity Tests, Divergent Thinking, Educational Testing
Patience, Wayne M.; Reckase, Mark D. – 1979
Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing