Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |

Opdenakker, Marie-Christine; van Damme, Jan – School Effectiveness and School Improvement, 2000
Explores effects of ignoring one or more levels of variation in hierarchical linear regression analysis, using a model with four hierarchical levels. Ignoring the top or intermediate levels influences fixed coefficients, variance components, and their corresponding standard error and can lead to different research conclusions. (Contains 16…
Descriptors: Effective Schools Research, Elementary Secondary Education, Error of Measurement, Regression (Statistics)

Ferrando, Pere J.; Lorenzo, Urbano – Educational and Psychological Measurement, 1998
A program for obtaining ability estimates and their standard errors under a variety of psychometric models is documented. The general models considered are (1) classical test theory; (2) item factor analysis for continuous censored responses; and (3) unidimensional and multidimensional item response theory graded response models. (SLD)
Descriptors: Ability, Error of Measurement, Estimation (Mathematics), Factor Analysis
Rutledge, Michael L. – Bioscene, 2001
This activity makes students a part of an investigation that determines the frequency of a particular plant variety in a simulated population. Provides an opportunity for students to observe the inherent variability of estimates, observe the relationship between sample size and sampling error, and consider aspects of research design. (Author/SAH)
Descriptors: Biology, Botany, Error of Measurement, Higher Education

Axelrod, Bradley N.; And Others – Psychological Assessment, 1996
The calculations of D. Schretlen, R. H. B. Benedict, and J. H. Bobholz for the reliabilities of a short form of the Wechsler Adult Intelligence Scale--Revised (WAIS-R) (1994) consistently overestimated the values. More accurate values are provided for the WAIS--R and a seven-subtest short form. (SLD)
Descriptors: Error Correction, Error of Measurement, Estimation (Mathematics), Intelligence Tests
Wang, Wen-Chung; Chyi-In, Wu – Educational and Psychological Measurement, 2004
Because of the requirement of reporting effect sizes and in the interest of measurement of change within the item response theory framework, their combination becomes a new issue. In the present study, repeated measures are decomposed as an initial ability and one or more modifiabilities (gain score) using a multidimensional Rasch model. The…
Descriptors: Simulation, Effect Size, Item Response Theory, Meta Analysis
Dudgeon, Paul – Structural Equation Modeling, 2004
This article considers the implications for other noncentrality parameter-based statistics from Steiger's (1998) multiple sample adjustment to the root mean square error of approximation (RMSEA) measure. When a structural equation model is fitted simultaneously in more than 1 sample, it is shown that the calculation of the noncentrality parameter…
Descriptors: Statistical Analysis, Monte Carlo Methods, Structural Equation Models, Error of Measurement
Helms, Janet E.; Jernigan, Maryam; Mascher, Jackquelyn – American Psychologist, 2005
The primary purpose of this article was to offer a methodological critique in support of arguments that racial categories should be replaced as explanatory constructs in psychological research and theory. To accomplish this goal, the authors (a) summarized arguments for why racial categories should be replaced; (b) used principles of the…
Descriptors: Race, Psychology, Scientific Methodology, Psychological Studies
Holroyd, Clay B.; Yeung, Nick; Coles, Michael G. H.; Cohen, Jonathan D. – Journal of Experimental Psychology: General, 2005
The concept of error detection plays a central role in theories of executive control. In this article, the authors present a mechanism that can rapidly detect errors in speeded response time tasks. This error monitor assigns values to the output of cognitive processes involved in stimulus categorization and response generation and detects errors…
Descriptors: Reaction Time, Cognitive Processes, Error of Measurement, Conceptual Tempo
Zimmerman, Donald W.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2005
Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…
Descriptors: Statistical Analysis, Psychological Testing, Raw Scores, Evaluation Methods
Yin, Ping – Educational and Psychological Measurement, 2005
The main purpose of this study is to examine the content structure of the Multistate Bar Examination (MBE) using the "table of specifications" model from the perspective of multivariate generalizability theory. Specifically, using MBE data collected over different years (six administrations: three from the February test and three from July test),…
Descriptors: Correlation, Generalizability Theory, Statistical Analysis, Multivariate Analysis
Kristjansson, Elizabeth; Aylesworth, Richard; Mcdowell, Ian; Zumbo, Bruno D. – Educational and Psychological Measurement, 2005
Item bias is a major threat to measurement validity. Methods for detecting differential item functioning (DIF) are now commonly used to identify potentially biased items. DIF detection methods for dichotomous items are well developed, but those for ordinal items are less well developed. In this article, the authors compare four methods for…
Descriptors: Discriminant Analysis, Test Bias, Multivariate Analysis, Regression (Statistics)
Graham, James M. – Educational and Psychological Measurement, 2006
Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…
Descriptors: Models, Test Theory, Reliability, Structural Equation Models
Solano-Flores, Guillermo – Teachers College Record, 2006
This article examines the intersection of psychometrics and sociolinguists in the testing of English language learners (ELLs); it discusses language, dialect, and register as sources of measurement error. Research findings show that the dialect of the language in which students are tested (e.g., local or standard English) is as important as…
Descriptors: Second Language Learning, Test Construction, Sociolinguistics, Psychometrics
Reeve, Charlie L.; Meyer, Rustin D.; Bonaccio, Silvia – Intelligence, 2006
The relationship between intelligence and personality has been of scientific interest for over 100 years. However, most contemporary estimates of these relationships are limited because they do not separate the variance due to general and narrow cognitive abilities. This study demonstrates that this methodological oversight can distort estimates…
Descriptors: Intelligence, Personality, Correlation, Cognitive Ability
Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006
A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…
Descriptors: Cheating, Test Items, Simulation, Statistical Analysis