Peer reviewed. Huynh, Huynh – Psychometrika, 1976
A test is administered at the end of a unit. On the basis of test performance, mastery status is awarded or withheld. An optimum decision rule is formulated in terms of degree of success in a referral task. (HG)
Descriptors: Cutting Scores, Norm Referenced Tests, Pass Fail Grading, Test Results
Peer reviewed. Allison, Paul A. – Psychometrika, 1976
A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)
Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
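For readers unfamiliar with the formula named in this abstract, the following is a minimal sketch of the standard Spearman-Brown prediction, not a reproduction of the proof in the article. The function name `spearman_brown` and its parameter names are illustrative assumptions; the only point carried over from the abstract is that the length multiple may be any positive real number, not just an integer.

```python
def spearman_brown(reliability: float, length_factor: float) -> float:
    """Predicted reliability when test length is multiplied by length_factor.

    reliability   -- reliability of the original test (between 0 and 1)
    length_factor -- any positive real multiple of the original length
                     (2.0 doubles the test, 0.5 halves it)
    """
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return (length_factor * reliability) / (
        1.0 + (length_factor - 1.0) * reliability
    )


if __name__ == "__main__":
    # Doubling a test whose reliability is 0.60
    print(round(spearman_brown(0.60, 2.0), 3))  # 0.75
    # Halving a test whose reliability is 0.75 reverses the step above
    print(round(spearman_brown(0.75, 0.5), 3))  # 0.6
```

Applying a factor and then its reciprocal returns the starting reliability, which is a quick sanity check on any implementation.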
Hendrickson, Amy B.; Kolen, Michael J. – 2001
This study compared various equating models and procedures for a sample of data from the Medical College Admission Test (MCAT), considering how item response theory (IRT) equating results compare with classical equipercentile results and how the results based on use of various IRT models, observed score versus true score, direct versus linked…
Descriptors: Equated Scores, Higher Education, Item Response Theory, Models
Lang, William Steve – Online Submission, 2005
This paper reports the analysis of the results from a pilot effort to create and use a battery of instruments based on INTASC principles indicators of teacher dispositions. The original conception of the battery was designed on the taxonomy of increasing levels of inference. This means that the intent to measure included multiple instruments in…
Descriptors: Cognitive Tests, True Scores, Teacher Certification, Pilot Projects
Peer reviewed. Gleason, Terry C.; Staelin, Richard – Psychometrika, 1973
In this paper a method is proposed whereby an investigator may improve the metric qualities of questionnaire and similar kinds of data. (Author)
Descriptors: Data Collection, Measurement, Monte Carlo Methods, Psychometrics
Peer reviewed. Jackson, Paul H. – Psychometrika, 1973
This paper deals with the situation where scores on a number of parallel tests are obtained for each of a set of persons, and these persons are assumed to constitute, in so far as their scores for the tests are concerned, a random sample from some population of interest. (Author)
Descriptors: Analysis of Variance, Bayesian Statistics, Measurement, Models
Peer reviewed. Ebel, Robert L. – Educational and Psychological Measurement, 1972
The author supports the credibility of two propositions: (1) the true component of a score is proportional to the number of equivalent elements that contribute to it; and (2) the error component of a score is proportional to the square root of the number of equivalent elements that contribute to it. (Author/MB)
Descriptors: Error of Measurement, Item Analysis, Mathematical Applications, Scores
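As a worked illustration of why these two propositions matter, the sketch below shows how they lead to the familiar lengthening formula for reliability. It assumes "proportional to" refers to the standard deviation of each component, and the symbols $\sigma_T^2$, $\sigma_E^2$, $\rho_1$, and $\rho_n$ are introduced here for illustration; none of this notation is taken from the article.

```latex
% Assumption: one element has true-score variance \sigma_T^2 and error variance \sigma_E^2.
% Proposition (1): the true component scales with n,        so its variance scales with n^2.
% Proposition (2): the error component scales with \sqrt{n}, so its variance scales with n.
\begin{align*}
\rho_1 &= \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2}, \\
\rho_n &= \frac{n^2 \sigma_T^2}{n^2 \sigma_T^2 + n \sigma_E^2}
        = \frac{n \sigma_T^2}{n \sigma_T^2 + \sigma_E^2}
        = \frac{n \rho_1}{1 + (n - 1)\rho_1}.
\end{align*}
```

The last step divides numerator and denominator by $\sigma_T^2 + \sigma_E^2$ and uses $\sigma_E^2 / (\sigma_T^2 + \sigma_E^2) = 1 - \rho_1$.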
Peer reviewed. Carter, Walter H., Jr. – Educational and Psychological Measurement, 1971
Descriptors: Classification, Error Patterns, Grading, Guessing (Tests)
Stanley, Julian C. – Educational and Psychological Measurement, 1970
It is shown that all obtained scores must meet the requirements for classical test-score theory with respect to definitions of true scores and errors of measurement if that frame of reference is to yield valid variance errors of measurement. (DG)
Descriptors: Measurement Techniques, Scores, Scoring, Statistical Analysis
Peer reviewed. Charter, Richard A.; Feldt, Leonard S. – Measurement and Evaluation in Counseling and Development, 2002
Presented is a detailed description of two true score confidence interval approaches, their use, interpretation, and a philosophical conflict that arises in many applied instances. (Contains 27 references.) (Author)
Descriptors: Error of Measurement, Psychometrics, Research Methodology, Statistical Analysis
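The abstract does not name the two approaches. As a hedged sketch, the code below assumes they are the pairing commonly contrasted in classical test theory: an interval centered on the obtained score using the standard error of measurement, and an interval centered on the regression-estimated true score using the standard error of estimation. The function name, parameter names, and example values are hypothetical and are not drawn from the article.

```python
import math


def true_score_intervals(observed, mean, sd, reliability, z=1.96):
    """Return two candidate intervals for an examinee's true score.

    Assumed approaches (not confirmed by the abstract):
      1. obtained score +/- z * SEM, where SEM = sd * sqrt(1 - r)
      2. estimated true score +/- z * SEE, where the estimate is
         mean + r * (observed - mean) and SEE = sd * sqrt(r * (1 - r))
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")

    # Approach 1: centered on the obtained score
    sem = sd * math.sqrt(1.0 - reliability)
    obtained_centered = (observed - z * sem, observed + z * sem)

    # Approach 2: centered on the regression-estimated true score
    estimated_true = mean + reliability * (observed - mean)
    see = sd * math.sqrt(reliability * (1.0 - reliability))
    estimate_centered = (estimated_true - z * see, estimated_true + z * see)

    return obtained_centered, estimate_centered


if __name__ == "__main__":
    # Hypothetical scale: mean 100, SD 15, reliability 0.91, obtained score 120
    first, second = true_score_intervals(120, 100, 15, 0.91)
    print("obtained-score centered:", first)
    print("estimated-true-score centered:", second)
```

The second interval is pulled toward the group mean and is slightly narrower, so the two approaches can support different conclusions about the same examinee.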
Peer reviewed. Sabers, Darrell L.; And Others – Journal of Special Education, 1988
The paper considers the appropriate and inappropriate use of estimated true scores for normative comparisons and concludes that for normative interpretations of individual scores and class averages, the use of estimated true scores or average estimated true scores is not recommended. (Author/DB)
Descriptors: Elementary Secondary Education, Scores, Standardized Tests, Test Interpretation
Peer reviewed. Lund, Thorleif – Scandinavian Journal of Educational Research, 1993
Based on the division of attained change for each treated individual into causal and noncausal change, the product-moment correlation between causal change and initial level is studied and compared with the correlation between attained change and initial level. Relevant formulas for true scores at population level are presented. (SLD)
Descriptors: Causal Models, Change, Correlation, Measurement Techniques
Peer reviewed. Hanson, Bradley A. – Applied Psychological Measurement, 1991
Log-linear model bivariate smoothing and a bivariate smoothing model based on the four-parameter beta binomial model were compared for usefulness in frequency estimation common-item equipercentile equating using two datasets. The performance of smoothed equipercentile methods was also compared to that of linear methods of common-item equating.…
Descriptors: Comparative Analysis, Equated Scores, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed. Hanson, Bradley A. – Journal of Educational Statistics, 1991
The formula developed by R. Levine (1955) for equating unequally reliable tests is described. The formula can be interpreted as a method of moments estimate of an equating function that results in first order equity of the equated test score under a classical congeneric model. (TJH)
Descriptors: Equated Scores, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Perlman, Michal; Zellman, Gail L.; Le, Vi-Nhuan – Early Childhood Research Quarterly, 2004
The psychometric properties of the revised Early Childhood Environment Rating Scale (ECERS-R) were examined using 202 Colorado child care centers. A factor analysis revealed that the ECERS-R does not measure seven distinct aspects of quality, as asserted by the developers of the ECERS-R, but instead measures one global aspect of quality. This…
Descriptors: Psychometrics, Credentials, Child Care Centers, Rating Scales

