Peer reviewed: Camilli, Gregory – Journal of Educational Statistics, 1988
The phenomenon of scale shrinkage is examined. Focus is on the pattern of decreasing variances in item response theory scale scores from fall to spring within a grade. It is demonstrated that questions concerning population distributions of true ability can be addressed with empirical Bayes techniques. (TJH)
Descriptors: Academic Ability, Achievement Tests, Bayesian Statistics, Difficulty Level
Peer reviewed: Reddy, Srinivas K. – Educational and Psychological Measurement, 1992
Implications of ignoring correlated error on parameter estimates in some simple structural equation models are examined. It is shown analytically and empirically through simulation that ignoring positive between-construct correlated error overestimates the structural parameter linking the two constructs. Effects become more pronounced with…
Descriptors: Correlation, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Peer reviewed: Rivkin, Stephen G. – Economics of Education Review, 2001
Conclusions about peer-group influences on academic achievement often depend on the estimation method used to account for endogeneity of school and neighborhood choice. Using High School and Beyond Longitudinal Survey data, this paper shows that aggregation does not reduce the specification error in estimating peer-group benefits. (Contains 25…
Descriptors: Academic Achievement, Elementary Secondary Education, Employment, Error of Measurement
Peer reviewed: Sabatelli, Ronald M.; Bartle, Suzanne E. – Journal of Marriage and the Family, 1995
Presents a multidimensional conceptualization of family functioning that is embedded within a family systems framework. Discusses operational issues pertaining to the assessment of family functioning when conceived of as a complex and multidimensional construct, and explores measurement strategies and analytical approaches. (JPS)
Descriptors: Content Validity, Error of Measurement, Evaluation Methods, Family (Sociological Unit)
Ferrara, Steve; Johnson, Eugene; Chen, Wen-Hung – Applied Measurement in Education, 2005
Psychometricians continue to develop and evaluate methods for linking test scores, both horizontally and vertically. This article describes a social moderation process for articulating (i.e., linking) performance standards across grade levels for an operational state assessment program. The researchers used generated data to evaluate the likely…
Descriptors: Grade 2, Grade 3, Scores, Error of Measurement
Song, Xin-Yuan; Lee, Sik-Yum – Multivariate Behavioral Research, 2005
In this article, a maximum likelihood approach is developed to analyze structural equation models with dichotomous variables that are common in behavioral, psychological and social research. To assess nonlinear causal effects among the latent variables, the structural equation in the model is defined by a nonlinear function. The basic idea of the…
Descriptors: Structural Equation Models, Simulation, Computation, Error of Measurement
Dirkzwager, Arie – International Journal of Testing, 2003
The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…
Descriptors: Psychometrics, Probability, Models, Measurement
Umbach, Paul D. – New Directions for Institutional Research, 2004
This chapter summarizes the most recent literature on the best practices of Web survey implementation and offers practical advice for researchers. (Contains 1 table.)
Descriptors: Response Rates (Questionnaires), Educational Researchers, Surveys, Internet
Graham, James M. – Educational and Psychological Measurement, 2006
Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…
Descriptors: Models, Test Theory, Reliability, Structural Equation Models
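For readers unfamiliar with the statistic discussed in the record above, the following is a minimal sketch of how coefficient alpha is conventionally computed from a respondents-by-items score matrix. It is not taken from the article; the function name `cronbach_alpha` and the toy data are illustrative assumptions.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a list of respondents, each a list of k item scores.

    Illustrative helper (hypothetical name), using sample variances throughout.
    """
    k = len(scores[0])
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*scores))
    item_variances = [variance(col) for col in items]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data (made up): three items that rise and fall together across respondents.
consistent = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
# Toy data (made up): two items that are uncorrelated across respondents.
unrelated = [[1, 1], [1, 2], [2, 1], [2, 2]]

alpha_consistent = cronbach_alpha(consistent)
alpha_unrelated = cronbach_alpha(unrelated)
```

With perfectly consistent items alpha reaches its maximum, and with uncorrelated items it falls to zero. How far alpha understates reliability when essential tau-equivalence is violated is the article's subject and is not modelled here.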
Solano-Flores, Guillermo – Teachers College Record, 2006
This article examines the intersection of psychometrics and sociolinguistics in the testing of English language learners (ELLs); it discusses language, dialect, and register as sources of measurement error. Research findings show that the dialect of the language in which students are tested (e.g., local or standard English) is as important as…
Descriptors: Second Language Learning, Test Construction, Sociolinguistics, Psychometrics
Reeve, Charlie L.; Meyer, Rustin D.; Bonaccio, Silvia – Intelligence, 2006
The relationship between intelligence and personality has been of scientific interest for over 100 years. However, most contemporary estimates of these relationships are limited because they do not separate the variance due to general and narrow cognitive abilities. This study demonstrates that this methodological oversight can distort estimates…
Descriptors: Intelligence, Personality, Correlation, Cognitive Ability
Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006
A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…
Descriptors: Cheating, Test Items, Simulation, Statistical Analysis
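As background to the record above, here is a minimal sketch of plain Cohen's kappa computed on the answer strings of two examinees. It shows only the agreement statistic, not the authors' statistical test or its null distribution; the function name `cohens_kappa` and the answer strings are illustrative assumptions.

```python
from collections import Counter


def cohens_kappa(answers_a, answers_b):
    """Chance-corrected agreement between two equal-length answer sequences.

    Illustrative helper (hypothetical name). Undefined, and raises
    ZeroDivisionError, when expected agreement equals 1.
    """
    n = len(answers_a)
    # Observed proportion of items on which the two examinees chose the same option.
    p_observed = sum(a == b for a, b in zip(answers_a, answers_b)) / n
    # Agreement expected by chance from each examinee's marginal option frequencies.
    counts_a, counts_b = Counter(answers_a), Counter(answers_b)
    p_expected = sum(counts_a[opt] * counts_b[opt] for opt in counts_a) / n ** 2
    return (p_observed - p_expected) / (1 - p_expected)


# Made-up answer strings: identical responses versus chance-level agreement.
kappa_identical = cohens_kappa("ABCD", "ABCD")
kappa_chance = cohens_kappa("AABB", "ABAB")
```

Kappa equals 1 for identical answer strings and 0 when observed agreement matches what the marginal frequencies predict by chance.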
Gardner, John; Cowan, Pamela – Assessment in Education Principles Policy and Practice, 2005
This paper sets out the findings from a large-scale analysis of the Northern Ireland Transfer Procedure Tests, used to select pupils for grammar schools. As it was not possible to get completed test scripts from government agencies, over 3000 practice scripts were completed in simulated conditions and were analysed to establish whether the tests…
Descriptors: Foreign Countries, Educational Testing, Error of Measurement, Test Use
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
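The relationship the record above starts from can be sketched with the standard classical test theory formula, SEM = SD × √(1 − reliability). The code below is an illustration under that textbook definition, not the article's proposed measure; the function names and the numeric values are assumptions.

```python
import math


def standard_error_of_measurement(score_sd, reliability):
    """Classical SEM: score standard deviation times sqrt(1 - reliability)."""
    return score_sd * math.sqrt(1.0 - reliability)


def confidence_limits(observed_score, score_sd, reliability, z=1.96):
    """Approximate confidence limits for an observed score (z=1.96 for about 95%)."""
    sem = standard_error_of_measurement(score_sd, reliability)
    return observed_score - z * sem, observed_score + z * sem


# Made-up values: a test with score SD 10 and reliability 0.91.
sem = standard_error_of_measurement(10.0, 0.91)
lower, upper = confidence_limits(60.0, 10.0, 0.91)

# Same SEM but a narrower spread of attainment (SD 5) implies lower reliability,
# which is the dependence on examinee spread that the abstract points to.
reliability_narrow = 1.0 - (sem / 5.0) ** 2
```

In this example the SEM is 3 score points. Holding that SEM fixed while halving the score spread lowers the implied reliability from 0.91 to 0.64.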
Wilson, Mark; Allen, Diane D.; Li, Jun Corser – Health Education Research, 2006
This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME "Standards for Educational and…
Descriptors: Health Education, Self Efficacy, Health Behavior, Measures (Individuals)

