Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |

Raykov, Tenko – Applied Psychological Measurement, 1998
Proposes a method for obtaining standard errors and confidence intervals of composite reliability coefficients based on bootstrap methods and using a structural-equation-modeling framework for estimating the composite reliability of congeneric measures (T. Raykov, 1997). Demonstrates the approach with simulated data. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Reliability, Simulation

Yuan, Ke-Hai; Bentler, Peter M. – Psychometrika, 2000
Studied whether the standard z-statistic that evaluates whether a factor loading is statistically necessary is correctly applied in such situations and more generally when the variables being analyzed are arbitrarily rescaled. An example illustrates that neither the factor loading estimates nor the standard error estimates possess scale…
Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Models, Maximum Likelihood Statistics

Dustmann, Christian; van Soest, Arthur – Industrial and Labor Relations Review, 2002
Analysis of panel data on immigrants to Germany 1984-94 focused on the relationship of language proficiency and productivity. Results show how time-varying measurement errors can lead to downward bias on the effect of fluency on earnings. Language proficiency is thus far more important than studies have suggested. (Contains 30 references.) (SK)
Descriptors: Error of Measurement, Foreign Countries, Immigrants, Language Proficiency

Liou, Michelle; Cheng, Philip E.; Johnson, Eugene G. – Applied Psychological Measurement, 1997
Derived simplified equations to compute the standard error of the frequency estimation method for equating score distributions that are continuized using a uniform or Gaussian kernel function. Results from two empirical studies indicate that these equations work reasonably well for moderate size samples. (SLD)
Descriptors: Computation, Equated Scores, Error of Measurement, Estimation (Mathematics)
Olsson, Henrik; Wennerholm, Pia; Lyxzen, Urban – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2004
J. P. Minda and J. D. Smith (2001) showed that a prototype model outperforms an exemplar model, especially in larger categories or categories that contained more complex stimuli. R. M. Nosofsky and S. R. Zaki (2002) showed that an exemplar model with a response-scaling mechanism outperforms a prototype model. The authors of the current study…
Descriptors: Error of Measurement, Stimuli, Models, Classification
Kowalchuk, Rhonda K.; Keselman, H. J.; Algina, James; Wolfinger, Russell D. – Educational and Psychological Measurement, 2004
One approach to the analysis of repeated measures data allows researchers to model the covariance structure of their data rather than presume a certain structure, as is the case with conventional univariate and multivariate test statistics. This mixed-model approach, available through SAS PROC MIXED, was compared to a Welch-James type statistic.…
Descriptors: Interaction, Sample Size, Statistical Analysis, Evaluation Methods
McDonald, Roderick P. – Structural Equation Modeling, 2004
Improper structures arising from the estimation of parameters in structural equation models (SEMs) are commonly an indication that the model is incorrectly specified. The use of boundary solutions cannot in general be recommended. Partly on the basis of theory given by Van Driel, and partly by example, suggestions are made for using the data as…
Descriptors: Structural Equation Models, Evaluation Methods, Error of Measurement, Evaluation Research
Becker, Gilbert – Psychological Methods, 2000
This article introduces a procedure for estimating reliability in which equivalent halves of a given test are systematically created and then administered a few days apart so that transient error can be included in the error calculus. The procedure not only estimates complete reliability (taking into account both specific-factor error and…
Descriptors: Reliability, Computation, Error of Measurement, College Students
Bock, R. Darrell; Brennan, Robert L.; Muraki, Eiji – Applied Psychological Measurement, 2002
In assessment programs where scores are reported for individual examinees, it is desirable to have responses to performance exercises graded by more than one rater. If more than one item on each test form is so graded, it is also desirable that different raters grade the responses of any one examinee. This gives rise to sampling designs in which…
Descriptors: Generalizability Theory, Test Items, Item Response Theory, Error of Measurement
Hoyt, William T. – Journal of Counseling Psychology, 2002
Rater bias has long been considered a source of error in observer ratings but has been ignored by process researchers using participant ratings. In particular, rater variance, or differences in generalized favorable or unfavorable perceptions of others, represents a neglected source of error in studies using participant ratings. The author…
Descriptors: Psychotherapy, Generalizability Theory, Research Methodology, Error of Measurement
Horng, Eileen Lai; Klasik, Daniel; Loeb, Susanna – National Center for Analysis of Longitudinal Data in Education Research, 2009
School principals have complex jobs. To better understand the work lives of principals, this study uses observational time-use data for all high school principals in Miami-Dade County Public Schools. This paper examines the relationship between the time principals spent on different types of activities and school outcomes including student…
Descriptors: School Effectiveness, Principals, High Schools, Time Management
Branch, Gregory; Hanushek, Eric; Rivkin, Steven – National Center for Analysis of Longitudinal Data in Education Research, 2009
Much has been written about the importance of school leadership, but there is surprisingly little systematic evidence on this topic. This paper presents preliminary estimates of key elements of the market for school principals, employing rich panel data on principals from Texas State. The consideration of teacher movements across schools suggests…
Descriptors: Principals, Administrator Effectiveness, Occupational Mobility, Administrator Attitudes
Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010
The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…
Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment
Li, Deping; Oranje, Andreas – ETS Research Report Series, 2007
Two versions of a general method for approximating standard error of regression effect estimates within an IRT-based latent regression model are compared. The general method is based on Binder's (1983) approach, accounting for complex samples and finite populations by Taylor series linearization. In contrast, the current National Assessment of…
Descriptors: Error of Measurement, Regression (Statistics), Trend Analysis, National Competency Tests
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models