Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Fan, Weihua; Hancock, Gregory R. – Educational and Psychological Measurement, 2006
In the common two-step structural equation modeling process, modifications are routinely made to the measurement portion of the model prior to assessing structural relations. The effect of such measurement model modifications on the structural parameter estimates, however, is not well known and is the subject of the current investigation. For a…
Descriptors: Error of Measurement, Evaluation Methods, Monte Carlo Methods, Sample Size
Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006
Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…
Descriptors: Probability, Intervals, Guidelines, Computer Simulation
Vehrs, Pat R.; George, James D.; Fellingham, Gilbert W.; Plowman, Sharon A.; Dustman-Allen, Kymberli – Measurement in Physical Education and Exercise Science, 2007
This study was designed to develop a single-stage submaximal treadmill jogging (TMJ) test to predict VO[subscript 2]max in fit adults. Participants (N = 400; men = 250 and women = 150), ages 18 to 40 years, successfully completed a maximal graded exercise test (GXT) at 1 of 3 laboratories to determine VO[subscript 2]max. The TMJ test was completed…
Descriptors: Metabolism, Body Composition, Physical Activities, Physical Fitness
Herzog, Serge – New Directions for Institutional Research, 2008
Among the varied analytical challenges institutional researchers face, examining faculty pay may be one of the most vexing. Although the literature on faculty compensation analysis dates back to the 1970s (Loeb and Ferber, 1971; Gordon, Morton, and Braden, 1974; Scott, 1977; Braskamp and Johnson, 1978; McLaughlin, Smart, and Montgomery, 1978),…
Descriptors: Teacher Salaries, Land Grant Universities, Compensation (Remuneration), Workers Compensation
Attali, Yigal – ETS Research Report Series, 2007
Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…
Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring
Jiang, Ying Hong; And Others – 1997
As performance-based assessments have gained wider use, there are increasing concerns about their dependability. This study is a synthesis of existing studies regarding the reliability or generalizability of performance assessments. The meta-analysis involves summarizing, examining, and evaluating research findings. Articles on the dependability…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Judges
Nevitt, Jonathan; Tam, Hak P. – 1997
This study investigates parameter estimation under the simple linear regression model for situations in which the underlying assumptions of ordinary least squares estimation are untenable. Classical nonparametric estimation methods are directly compared against some robust estimation methods for conditions in which varying degrees of outliers are…
Descriptors: Comparative Analysis, Computer Simulation, Error of Measurement, Estimation (Mathematics)
Go, Imelda C.; Woodruff, David J. – 1996
In previous works, D. J. Woodruff derived expressions for three different conditional test score variances: (1) the conditional standard error of prediction (CSEP); (2) the conditional standard error of measurement in prediction (CSEMP); and (3) the conditional standard error of estimation (CSEE). He also presented step-up formulas that require…
Descriptors: College Entrance Examinations, Error of Measurement, Estimation (Mathematics), High School Students
PDF pending restorationHanson, Bradley A.; And Others – 1994
This paper compares various methods of smoothed equipercentile equating and linear equating in the random groups equating design. Three presmoothing methods (based on the beta binomial model, four-parameter beta binomial model and a log-linear model) are compared to postsmoothing using cubic splines, linear equating and unsmoothed equipercentile…
Descriptors: Comparative Analysis, Equated Scores, Error of Measurement, Estimation (Mathematics)
Tang, Huixing – 1994
A method is presented for the simultaneous analysis of differential item functioning (DIF) in multi-factor situations. The method is unique in that it combines item response theory (IRT) and analysis of variance (ANOVA), takes a simultaneous approach to multifactor DIF analysis, and is capable of capturing interaction and controlling for possible…
Descriptors: Ability, Analysis of Variance, Difficulty Level, Error of Measurement
Kwak, Nohoon; Davenport, Ernest C., Jr.; Davison, Mark L. – 1998
The purposes of this study were to introduce the iterative purification procedure and to compare this with the two-step purification procedure, to compare false positive error rates and the power of five observed score approaches and to identify factors affecting power and false positive rates in each method. This study used 2,400 data sets that…
Descriptors: Ability, Comparative Analysis, Error of Measurement, Estimation (Mathematics)
Parshall, Cynthia G.; Kromrey, Jeffrey D.; Chason, Walter M.; Yi, Qing – 1997
Accuracy of item parameter estimates is a critical concern for any application of item response theory (IRT). However, the necessary sample sizes are often difficult to obtain in practice, particularly for the more complex models. A promising avenue of research concerns modified item response models. This study both replicates and improves on an…
Descriptors: Ability, Error of Measurement, Estimation (Mathematics), Item Response Theory
Fox, Jean-Paul; Glas, Cees A. W. – 1998
A two-level regression model is imposed on the ability parameters in an item response theory (IRT) model. The advantage of using latent rather than observed scores as dependent variables of a multilevel model is that this offers the possibility of separating the influence of item difficulty and ability level and modeling response variation and…
Descriptors: Ability, Bayesian Statistics, Difficulty Level, Error of Measurement
McCallister, Corliss – 1991
The Pearson product moment (P.M.) correlation ("r") and four of its most widely used variations--the phi, the rho, the biserial, and the point-biserial coefficients--are reviewed. Using small data sets between one and nine, the conditions under which the various forms are restricted in power and robustness are explored. Seven sample data…
Descriptors: Comparative Analysis, Correlation, Educational Research, Error of Measurement
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…
Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level

Peer reviewed
Direct link
