DeMars, Christine – Applied Measurement in Education, 2015
In generalizability theory studies in large-scale testing contexts, sometimes a facet is very sparsely crossed with the object of measurement. For example, when assessments are scored by human raters, it may not be practical to have every rater score all students. Sometimes the scoring is systematically designed such that the raters are…
Descriptors: Educational Assessment, Measurement, Data, Generalizability Theory
Kim, YoungKoung; Hendrickson, Amy; Patel, Priyank; Melican, Gerald; Sweeney, Kevin – College Board, 2013
The purpose of this report is to describe the procedure for revising the ReadiStep™ score scale using the field trial data, and to provide technical information about the development of the new ReadiStep scale score. In doing so, this report briefly introduces the three assessments--ReadiStep, PSAT/NMSQT®, and SAT®--in the College Board Pathway…
Descriptors: College Entrance Examinations, Educational Assessment, High School Students, Scores
Schafer, William D.; Coverdale, Bradley J.; Luxenberg, Harlan; Jin, Ying – Practical Assessment, Research & Evaluation, 2011
There are relatively few examples of quantitative approaches to quality control in educational assessment and accountability contexts. Among the several techniques that are used in other fields, Shewhart charts have been found in a few instances to be applicable in educational settings. This paper describes Shewhart charts and gives examples of how…
Descriptors: Charts, Quality Control, Educational Assessment, Statistical Analysis
Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 2010
The focus of this presidential address is on the contemporary treatment of reliability and validity in educational assessment. Highlights on reliability are provided under the classical true-score model using tools from latent trait modeling to clarify important assumptions and procedures for reliability estimation. In addition to reliability,…
Descriptors: Educational Assessment, Validity, Item Response Theory, Reliability
Raudenbush, Stephen W.; Sadoff, Sally – Journal of Research on Educational Effectiveness, 2008
A dramatic shift in research priorities has recently produced a large number of ambitious randomized trials in K-12 education. In most cases, the aim is to improve student academic learning by improving classroom instruction. Embedded in these studies are theories about how the quality of classroom instruction must improve if these interventions are to…
Descriptors: Elementary Secondary Education, Error of Measurement, Statistical Inference, Program Evaluation
Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010
The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…
Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment
Molleman, Gerard R. M.; Peters, Louk W. H.; Hosman, Clemens M. H.; Kok, Gerjo J.; Oosterveld, Paul – Health Education Research, 2006
Preffi 2.0 is an evidence-based Dutch quality assessment instrument for health promotion interventions. It is mainly intended for both planning and assessing one's own projects but can also be used to assess other people's projects (external use). This article reports a study on the reliability of Preffi as an external quality assessment…
Descriptors: Expertise, Evidence, Generalizability Theory, Health Promotion
Flanagan, Kristin Denton; McPhee, Cameron – National Center for Education Statistics, 2009
Using data from the final two rounds of the Early Childhood Longitudinal Study, Birth Cohort (ECLS-B), a longitudinal study begun in 2001, this First Look provides a snapshot of the demographic characteristics, reading and mathematics knowledge, fine motor skills, school characteristics, and before- and after-school care arrangements of the cohort…
Descriptors: Child Development, Kindergarten, Longitudinal Studies, Cohort Analysis

Ruiz-Primo, Maria Araceli; And Others – Journal of Educational Measurement, 1993
The stability of scores on 2 types of performance assessments, an observed hands-on investigation and a notebook surrogate, was investigated for 29 sixth graders on 2 occasions. Results indicate that student performance and procedures changed and that generalizability across occasions was moderate. Implications for assessment are discussed. (SLD)
Descriptors: Educational Assessment, Elementary School Students, Error of Measurement, Generalizability Theory

Gao, Xiaohong; And Others – Applied Measurement in Education, 1994
This study provides empirical evidence about the sampling variability and generalizability (reliability) of a statewide performance assessment for grade six. Results for 600 students at individual and school levels indicate that task-sampling variability was the major source of measurement error. Rater-sampling variability was negligible. (SLD)
Descriptors: Achievement Tests, Educational Assessment, Elementary School Students, Error of Measurement