Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Dekle, Dawn J.; Leung, Denis H. Y.; Zhu, Min – Psychological Methods, 2008
Across many areas of psychology, concordance is commonly used to measure the (intragroup) agreement in ranking a number of items by a group of judges. Sometimes, however, the judges come from multiple groups, and in those situations, the interest is to measure the concordance between groups, under the assumption that there is some within-group…
Descriptors: Item Response Theory, Statistical Analysis, Psychological Studies, Evaluators
Eid, Michael; Nussbeck, Fridtjof W.; Geiser, Christian; Cole, David A.; Gollwitzer, Mario; Lischetzke, Tanja – Psychological Methods, 2008
The question as to which structural equation model should be selected when multitrait-multimethod (MTMM) data are analyzed is of interest to many researchers. In the past, attempts to find a well-fitting model have often been data-driven and highly arbitrary. In the present article, the authors argue that the measurement design (type of methods…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Statistical Analysis, Error of Measurement
Hertzog, Christopher; von Oertzen, Timo; Ghisletta, Paolo; Lindenberger, Ulman – Structural Equation Modeling: A Multidisciplinary Journal, 2008
We evaluated the statistical power of single-indicator latent growth curve models to detect individual differences in change (variances of latent slopes) as a function of sample size, number of longitudinal measurement occasions, and growth curve reliability. We recommend the 2 degree-of-freedom generalized test assessing loss of fit when both…
Descriptors: Sample Size, Error of Measurement, Individual Differences, Statistical Analysis
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
Webber, Douglas A. – Cornell Higher Education Research Institute, 2011
Using detailed individual-level data from public universities in the state of Ohio, I estimate the effect of various institutional expenditures on the probability of graduating from college. Using a competing risks regression framework, I find differential impacts of expenditure categories across student characteristics. I estimate that student…
Descriptors: Public Colleges, Educational Finance, Cost Effectiveness, College Administration
Peer reviewedSilverstein, A. B. – Journal of Clinical Psychology, 1985
A formula is presented for the standard error of estimate of Deviation Quotients (DQs). The formula is shown to perform well when used with data on short forms of two of Wechsler's scales. (Author/JAC)
Descriptors: Error of Measurement
Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010
In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…
Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques
Lipscomb, Stephen; Teh, Bing-ru; Gill, Brian; Chiang, Hanley; Owens, Antoniya – Mathematica Policy Research, Inc., 2010
This report summarizes research findings and implementation practices for teacher and principal value-added models (VAMs), as a first step in the Team Pennsylvania Foundation's (Team PA) pilot project to inform the development of a full, statewide model evaluation system. We have selected 21 studies that represent key issues and findings in the…
Descriptors: Pilot Projects, Outcomes of Education, Principals, Models
Rosch, David M.; Schwartz, Leslie M. – Journal of Leadership Education, 2009
As more institutions of higher education engage in the practice of leadership education, the effective assessment of these efforts lags behind due to a variety of factors. Without an intentional assessment plan, leadership educators are liable to make one or more of several common errors in assessing their programs and activities. This article…
Descriptors: Leadership Training, Administrator Education, College Outcomes Assessment, Program Evaluation
Kherif, Ferath; Josse, Goulven; Seghier, Mohamed L.; Price, Cathy J. – Journal of Cognitive Neuroscience, 2009
The aim of this study was to find the most prominent source of intersubject variability in neuronal activation for reading familiar words aloud. To this end, we collected functional imaging data from a large sample of subjects (n = 76) with different demographic characteristics such as handedness, sex, and age, while reading. The…
Descriptors: Handedness, Semantics, Reading Strategies, Error of Measurement
Huitema, Bradley E.; McKean, Joseph W. – Educational and Psychological Measurement, 2007
Regression models used in the analysis of interrupted time-series designs assume statistically independent errors. Four methods of evaluating this assumption are the Durbin-Watson (D-W), Huitema-McKean (H-M), Box-Pierce (B-P), and Ljung-Box (L-B) tests. These tests were compared with respect to Type I error and power under a wide variety of error…
Descriptors: Regression (Statistics), Evaluation Methods, Error of Measurement, Comparative Analysis
Jamshidian, M.; Khatoonabadi, M. – International Journal of Mathematical Education in Science and Technology, 2007
Almost all introductory and intermediate level statistics textbooks include the topic of confidence interval for the population mean. Almost all these texts introduce the median as a robust measure of central tendency. Only a few of these books, however, cover inference on the population median and in particular confidence interval for the median.…
Descriptors: Intervals, Simulation, Computation, Error of Measurement
Rice, Jennifer King – National Education Policy Center, 2012
Schools and school systems throughout the nation are increasingly experimenting with using various instructional technologies to improve productivity and decrease costs, but evidence on both the effectiveness and the costs of education technology is limited. A recent report published by the Thomas B. Fordham Institute sets out to describe "the…
Descriptors: Evidence, Electronic Learning, Distance Education, Online Courses
Bradley, Kelly D.; Royal, Kenneth D.; Bradley, James W. – Journal of College Teaching & Learning, 2008
The reliability and validity of course evaluations in higher education is often assumed. The typical Likert-type surveys utilized when students' evaluate the course and instructor often overlook measurement issues, or deal with them in an ineffective manner. Given the importance that is placed on higher education course evaluations, with results…
Descriptors: Higher Education, Course Evaluation, Reliability, Validity
Zhang, Bo; Stone, Clement A. – Educational and Psychological Measurement, 2008
This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Descriptors: Monte Carlo Methods, Sampling, Goodness of Fit, Evaluation Methods

Direct link
