Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Evaluation Methods | 9 |
Probability | 9 |
Research Methodology | 4 |
Evaluation Problems | 3 |
Measurement Techniques | 3 |
Statistical Inference | 3 |
Validity | 3 |
Bayesian Statistics | 2 |
Classification | 2 |
Effect Size | 2 |
Experiments | 2 |
More ▼ |
Source
Measurement:… | 2 |
Psychological Methods | 2 |
Harvard Educational Review | 1 |
Journal of Experimental… | 1 |
Journal of Marriage and the… | 1 |
Psychological Review | 1 |
Studies in Educational… | 1 |
Author
Anderson, Edward R. | 1 |
Callister Everson, Kimberlee | 1 |
Cui, Ying | 1 |
Cumming, Geoff | 1 |
Dana, Jason | 1 |
Davis-Stober, Clintin P. | 1 |
Deal, James E. | 1 |
Feinauer, Erika | 1 |
Frank, Till D. | 1 |
Gierl, Mark J. | 1 |
Guo, Ying | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Opinion Papers | 9 |
Reports - Descriptive | 2 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Callister Everson, Kimberlee; Feinauer, Erika; Sudweeks, Richard R. – Harvard Educational Review, 2013
In this article, the authors provide a methodological critique of the current standard of value-added modeling forwarded in educational policy contexts as a means of measuring teacher effectiveness. Conventional value-added estimates of teacher quality are attempts to determine to what degree a teacher would theoretically contribute, on average,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Accountability
Regenwetter, Michel; Dana, Jason; Davis-Stober, Clintin P.; Guo, Ying – Psychological Review, 2011
Birnbaum raised important challenges to testing transitivity. We summarize why an approach based on counting response patterns does not solve these challenges. Foremost, we show why parsimonious tests of transitivity require at least 5 choice alternatives. While the approach of Regenwetter, Dana, and Davis-Stober achieves high power with modest…
Descriptors: Testing, Item Response Theory, Responses, Evaluation Methods
Killeen, Peter R. – Psychological Methods, 2010
Lecoutre, Lecoutre, and Poitevineau (2010) have provided sophisticated grounding for "p[subscript rep]." Computing it precisely appears, fortunately, no more difficult than doing so approximately. Their analysis will help move predictive inference into the mainstream. Iverson, Wagenmakers, and Lee (2010) have also validated…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Design, Research Methodology
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author points out few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…
Descriptors: Test Items, Probability, Models, Diagnostic Tests
Cumming, Geoff – Psychological Methods, 2010
This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity
Muller, Hermann; Frank, Till D.; Sternad, Dagmar – Journal of Experimental Psychology: Human Perception and Performance, 2007
In their comment on the tolerance-noise covariation (TNC) method for decomposing variability by H. Muller and D. Sternad (2003, 2004b), J. B. J. Smeets and S. Louw show that covariation (C), as defined within the TNC method, is not invariant with respect to coordinate transformations and contend that it is, therefore, meaningless. Although the…
Descriptors: Statistical Analysis, Psychomotor Skills, Skill Development, Criticism
Gierl, Mark J.; Cui, Ying – Measurement: Interdisciplinary Research and Perspectives, 2008
One promising application of diagnostic classification models (DCM) is in the area of cognitive diagnostic assessment in education. However, the successful application of DCM in educational testing will likely come with a price--and this price may be in the form of new test development procedures and practices required to yield data that satisfy…
Descriptors: Educational Testing, Classification, Psychometrics, Test Construction

Mann, Lester; Kenowitz, Leonard A. – Studies in Educational Evaluation, 1985
Evaluations of special education interventions are different from research on medical interventions. Educational evaluations would be more useful if they applied an actuarial, or probability approach. Information should be collected over time and using various measures, thus increasing the available data. (GDC)
Descriptors: Elementary Secondary Education, Evaluation Methods, Evaluation Utilization, Higher Education

Deal, James E.; Anderson, Edward R. – Journal of Marriage and the Family, 1995
Presentation of quantitative research on the family often suffers from a tendency to interpret findings on a statistical rather than substantive basis. Advocates the use of data analysis that lends itself to an intuitive understanding of the nature of the findings, the strength of the association, and the import of the result. (JPS)
Descriptors: Data Analysis, Effect Size, Evaluation Methods, Goodness of Fit