Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 34 |
Descriptor
Scores | 40 |
Academic Achievement | 16 |
Achievement Gains | 15 |
Teacher Effectiveness | 14 |
Teacher Evaluation | 10 |
Models | 9 |
Standardized Tests | 9 |
Longitudinal Studies | 8 |
Middle School Teachers | 8 |
Evaluation Methods | 7 |
Incentives | 7 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 23 |
Reports - Research | 23 |
Reports - Evaluative | 9 |
Reports - Descriptive | 6 |
Numerical/Quantitative Data | 2 |
Speeches/Meeting Papers | 2 |
Opinion Papers | 1 |
Education Level
Middle Schools | 12 |
Elementary Secondary Education | 8 |
Elementary Education | 7 |
Secondary Education | 6 |
Junior High Schools | 5 |
Grade 6 | 3 |
Grade 5 | 2 |
Grade 7 | 2 |
Grade 8 | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
More ▼ |
Audience
Policymakers | 1 |
Practitioners | 1 |
Researchers | 1 |
Teachers | 1 |
Location
Florida | 3 |
Texas | 3 |
Tennessee (Nashville) | 2 |
New York | 1 |
Tennessee | 1 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Texas Assessment of Academic… | 2 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Ercikan, Kadriye; McCaffrey, Daniel F. – Journal of Educational Measurement, 2022
Artificial-intelligence-based automated scoring is often an afterthought and is considered after assessments have been developed, resulting in nonoptimal possibility of implementing automated scoring solutions. In this article, we provide a review of Artificial intelligence (AI)-based methodologies for scoring in educational assessments. We then…
Descriptors: Artificial Intelligence, Automation, Scores, Educational Assessment
Johnson, Matthew S.; Liu, Xiang; McCaffrey, Daniel F. – Journal of Educational Measurement, 2022
With the increasing use of automated scores in operational testing settings comes the need to understand the ways in which they can yield biased and unfair results. In this paper, we provide a brief survey of some of the ways in which the predictive methods used in automated scoring can lead to biased, and thus unfair automated scores. After…
Descriptors: Psychometrics, Measurement Techniques, Bias, Automation
Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020
Testing programs are often interested in using a student growth measure. This article presents analytic derivations of the accuracy of common student growth measures on both the raw scale of the test and the percentile rank scale in terms of the proportional reduction in mean squared error and the squared correlation between the estimator and…
Descriptors: Student Evaluation, Accuracy, Testing, Student Development
Yao, Lili; Haberman, Shelby; McCaffrey, Daniel F.; Lockwood, J. R. – ETS Research Report Series, 2020
Minimum discriminant information adjustment (MDIA), an approach to weighting samples to conform to known population information, provides a generalization of raking and poststratification. In the case of simple random sampling with replacement with uniform sampling weights, large-sample properties are available for MDIA estimates of population…
Descriptors: Discriminant Analysis, Sampling, Sample Size, Scores
Liu, Shuangshuang; Bell, Courtney A.; Jones, Nathan D.; McCaffrey, Daniel F. – Educational Assessment, Evaluation and Accountability, 2019
Researchers and practitioners sometimes presume that using a previously "validated" instrument will produce "valid" scores; however, contemporary views of validity suggest that there are many reasons this assumption can be faulty. In order to demonstrate just some of the problems with this view, and to support comparisons of…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Test Validity, Scores
Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020
The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…
Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods
Lockwood, J. R.; Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational and Behavioral Statistics, 2022
Many states and school districts in the United States use standardized test scores to compute annual measures of student achievement progress and then use school-level averages of these growth measures for various reporting and diagnostic purposes. These aggregate growth measures can vary consequentially from year to year for the same school,…
Descriptors: Accuracy, Prediction, Programming Languages, Standardized Tests
Mikeska, Jamie N.; Holtzman, Steven; McCaffrey, Daniel F.; Liu, Shuangshuang; Shattuck, Tamara – Science Education, 2019
Despite the prevalent use of observational measures in teacher evaluation systems, research has only recently begun to take into account how aspects of the instructional environment and lesson sampling design may interact with teachers' scores on these measures. Instead, one assumption guiding current evaluation systems is that the variation in…
Descriptors: Classroom Observation Techniques, Evaluation Methods, Lesson Observation Criteria, Science Teachers
McCaffrey, Daniel F.; Oliveri, Maria Elena; Holtzman, Steven – ETS Research Report Series, 2018
Scores from noncognitive measures are increasingly valued for their utility in helping to inform postsecondary admissions decisions. However, their use has presented challenges because of faking, response biases, or subjectivity, which standardized third-party evaluations (TPEs) can help minimize. Analysts and researchers using TPEs, however, need…
Descriptors: Generalizability Theory, Scores, College Admission, Admission Criteria
McCaffrey, Daniel F.; Castellano, Katherine E.; Lockwood, J. R. – Educational Measurement: Issues and Practice, 2015
Student growth percentiles (SGPs) express students' current observed scores as percentile ranks in the distribution of scores among students with the same prior-year scores. A common concern about SGPs at the student level, and mean or median SGPs (MGPs) at the aggregate level, is potential bias due to test measurement error (ME). Shang,…
Descriptors: Error of Measurement, Accuracy, Achievement Gains, Students
Goldhaber, Dan; Harris Douglas N.; Loeb, Susanna; McCaffrey, Daniel F.; Raudenbush, Stephen W. – Carnegie Foundation for the Advancement of Teaching, 2015
It is common knowledge that teacher quality is a key in-school factor affecting student achievement. While the quality of teaching clearly matters for how much students learn, this quality is challenging to measure. Evaluating teacher quality based on the level of their students' end-of-year test scores has been one method of assessing…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Evaluation Methods, Measurement Techniques
Casabianca, Jodi M.; Lockwood, J. R.; McCaffrey, Daniel F. – Educational and Psychological Measurement, 2015
Observations and ratings of classroom teaching and interactions collected over time are susceptible to trends in both the quality of instruction and rater behavior. These trends have potential implications for inferences about teaching and for study design. We use scores on the Classroom Assessment Scoring System-Secondary (CLASS-S) protocol from…
Descriptors: Scores, Middle School Teachers, Teacher Effectiveness, Teacher Evaluation
McCaffrey, Daniel F.; Yuan, Kun; Savitsky, Terrance D.; Lockwood, J. R.; Edelen, Maria O. – Educational Measurement: Issues and Practice, 2015
We examine the factor structure of scores from the CLASS-S protocol obtained from observations of middle school classroom teaching. Factor analysis has been used to support both interpretations of scores from classroom observation protocols, like CLASS-S, and the theories about teaching that underlie them. However, classroom observations contain…
Descriptors: Factor Structure, Multivariate Analysis, Scores, Factor Analysis
Liu, Ou Lydia; Liu, Huili; Roohr, Katrina Crotts; McCaffrey, Daniel F. – Journal of Educational Measurement, 2016
Learning outcomes assessment has been widely used by higher education institutions both nationally and internationally. One of its popular uses is to document learning gains of students. Prior studies have recognized the potential imbalance between freshmen and seniors in terms of their background characteristics and their prior academic…
Descriptors: College Outcomes Assessment, Achievement Gains, College Freshmen, College Seniors
Lockwood, J. R.; McCaffrey, Daniel F. – Journal of Educational and Behavioral Statistics, 2014
A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…
Descriptors: Error of Measurement, Scores, Statistical Analysis, Computation