Song, Xin-Yuan; Lee, Sik-Yum – Multivariate Behavioral Research, 2005
In this article, a maximum likelihood approach is developed to analyze structural equation models with dichotomous variables that are common in behavioral, psychological and social research. To assess nonlinear causal effects among the latent variables, the structural equation in the model is defined by a nonlinear function. The basic idea of the…
Descriptors: Structural Equation Models, Simulation, Computation, Error of Measurement
Dirkzwager, Arie – International Journal of Testing, 2003
The crux in psychometrics is how to estimate the probability that a respondent answers an item correctly on one occasion out of many. Under the current testing paradigm this probability is estimated using all kinds of statistical techniques and mathematical modeling. Multiple evaluation is a new testing paradigm using the person's own personal…
Descriptors: Psychometrics, Probability, Models, Measurement
Umbach, Paul D. – New Directions for Institutional Research, 2004
This chapter summarizes the most recent literature on the best practices of Web survey implementation and offers practical advice for researchers. (Contains 1 table.)
Descriptors: Response Rates (Questionnaires), Educational Researchers, Surveys, Internet
Gardner, John; Cowan, Pamela – Assessment in Education Principles Policy and Practice, 2005
This paper sets out the findings from a large-scale analysis of the Northern Ireland Transfer Procedure Tests, used to select pupils for grammar schools. As it was not possible to get completed test scripts from government agencies, over 3000 practice scripts were completed in simulated conditions and were analysed to establish whether the tests…
Descriptors: Foreign Countries, Educational Testing, Error of Measurement, Test Use
Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004
The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…
Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items
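The relationship this abstract alludes to can be illustrated with the classical test theory formula SEM = SD × sqrt(1 − reliability). The sketch below is not taken from Burton's article; the function names and the example numbers are assumptions chosen for illustration. It shows how the standard error of measurement yields confidence limits for a score, and why a reliability coefficient changes with the spread of examinee scores even when the measurement error stays the same.

```python
import math


def standard_error_of_measurement(score_sd, reliability):
    """Classical test theory SEM: SD * sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return score_sd * math.sqrt(1.0 - reliability)


def confidence_limits(observed_score, sem, z=1.96):
    """Approximate limits around an observed score (z=1.96 for about 95%)."""
    half_width = z * sem
    return observed_score - half_width, observed_score + half_width


def reliability_from_sem(sem, score_sd):
    """Invert the SEM formula: reliability = 1 - SEM^2 / SD^2."""
    return 1.0 - (sem ** 2) / (score_sd ** 2)


# Hypothetical test: score SD of 10 points and reliability of 0.84.
sem = standard_error_of_measurement(score_sd=10.0, reliability=0.84)
low, high = confidence_limits(observed_score=60.0, sem=sem)

# Same measurement error, but a more homogeneous group (SD of 5 points):
# the reliability coefficient drops although the test itself is unchanged.
narrow_group_reliability = reliability_from_sem(sem, score_sd=5.0)
```

With these assumed numbers the SEM is about 4 points, and the same SEM in a group with half the score spread corresponds to a much lower reliability coefficient, which is the dependence on examinee spread that the abstract mentions.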
Vermunt, Jeroen K. – Multivariate Behavioral Research, 2005
A well-established approach to modeling clustered data introduces random effects in the model of interest. Mixed-effects logistic regression models can be used to predict discrete outcome variables when observations are correlated. An extension of the mixed-effects logistic regression model is presented in which the dependent variable is a latent…
Descriptors: Predictor Variables, Correlation, Maximum Likelihood Statistics, Error of Measurement
Hopwood, Christopher J.; Richard, David C. S. – Assessment, 2005
Research on the Wechsler Adult Intelligence Scale-Revised and Wechsler Adult Intelligence Scale-Third Edition (WAIS-III) suggests that practicing clinical psychologists and graduate students make item-level scoring errors that affect IQ, index, and subtest scores. Studies have been limited in that Full-Scale IQ (FSIQ) and examiner administration,…
Descriptors: Scoring, Psychologists, Intelligence Quotient, Graduate Students
Chance Favors the Prepared Mind: Mathematics and Science Indicators for Comparing States and Nations
Phillips, Gary W. – American Institutes for Research, 2007
This report provides international benchmarks to help states see how students are doing in math and science within an international context. It shows how state-by-state results from the National Assessment of Educational Progress (NAEP) can be linked with nation-by-nation results from the Trends in International Mathematics and Science Study…
Descriptors: Mathematics Achievement, Academic Achievement, Numeracy, National Competency Tests
Saperstein, Aliya – Social Forces, 2006
Social constructivist theories of race suggest no two measures of race will capture the same information, but the degree of "error" this creates for quantitative research on inequality is unclear. Using unique data from the General Social Survey, I find observed and self-reported measures of race yield substantively different results when used to…
Descriptors: Race, Correlation, Income, Educational Attainment
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores
Castle, Nicholas G. – Gerontologist, 2006
Purpose: In this study the levels of staff turnover reported in the nursing home literature (1990-2003) are reviewed, as well as the definitions of turnover used in these prior studies. With the use of primary data collected from 354 facilities, the study addresses the various degrees of bias that result, depending on how staff turnover is defined…
Descriptors: Nursing Homes, Health Services, Error of Measurement, Data Collection
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2007
In this technical report, the authors describe the development of alternate forms of three types of early literacy measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fourth grade. They begin with a brief overview of the two conceptual frameworks underlying…
Descriptors: Emergent Literacy, Measures (Individuals), Naming, Alphabets
Zwick, Rebecca; And Others – 1993
Simulated data were used to investigate the performance of modified versions of the Mantel-Haenszel and standardization methods of differential item functioning (DIF) analysis in computer-adaptive tests (CATs). Each "examinee" received 25 items out of a 75-item pool. A three-parameter logistic item response model was assumed, and…
Descriptors: Adaptive Testing, Computer Assisted Testing, Correlation, Error of Measurement
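For readers unfamiliar with the Mantel-Haenszel method named in this abstract, the following is a minimal sketch of the standard (unmodified) procedure, not the modified CAT versions the report investigates. It assumes examinees have already been grouped into matched ability strata, each summarized as a 2×2 table of reference/focal group by correct/incorrect. The function names and the counts are hypothetical.

```python
import math


def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio across matched strata.

    Each stratum is a tuple:
    (ref_correct, ref_incorrect, focal_correct, focal_incorrect).
    """
    numerator = 0.0
    denominator = 0.0
    for ref_correct, ref_incorrect, focal_correct, focal_incorrect in strata:
        n = ref_correct + ref_incorrect + focal_correct + focal_incorrect
        if n == 0:
            continue  # an empty stratum contributes nothing
        numerator += ref_correct * focal_incorrect / n
        denominator += ref_incorrect * focal_correct / n
    if denominator == 0:
        raise ValueError("odds ratio undefined: denominator is zero")
    return numerator / denominator


def mh_d_dif(alpha):
    """Delta-scale transformation of the odds ratio: -2.35 * ln(alpha)."""
    return -2.35 * math.log(alpha)


# Hypothetical counts for one item in two ability strata.
example_strata = [
    (40, 10, 30, 20),
    (30, 20, 20, 30),
]
alpha = mantel_haenszel_odds_ratio(example_strata)
d_dif = mh_d_dif(alpha)
```

An odds ratio of 1 (delta value of 0) indicates no DIF for the item. In this sketch, values above 1 (negative delta) mean the item favors the reference group after matching on ability.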
Olson, Jeffery E. – 1992
Often, all of the variables in a model are latent, random, or subject to measurement error, or there is not an obvious dependent variable. When any of these conditions exist, an appropriate method for estimating the linear relationships among the variables is Least Principal Components Analysis. Least Principal Components are robust, consistent,…
Descriptors: Error of Measurement, Factor Analysis, Goodness of Fit, Mathematical Models
Longford, Nicholas T. – 1993
A model-based approach to rater reliability for essays read by multiple readers is presented. Variation of rater severity (between-rater variation) and rater inconsistency (within-rater variation) is considered in the presence of between-examinee variation. An additive variance component model is posited and the method of moments for its…
Descriptors: Educational Diagnosis, Error of Measurement, Essays, Estimation (Mathematics)