Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Correlation | 32 |
| Test Interpretation | 32 |
| Test Reliability | 26 |
| Statistical Analysis | 12 |
| Scores | 9 |
| Item Analysis | 7 |
| Factor Analysis | 6 |
| Test Validity | 6 |
| Error of Measurement | 5 |
| Measurement Techniques | 5 |
| Psychometrics | 5 |
Publication Type
| Reports - Research | 13 |
| Journal Articles | 11 |
| Speeches/Meeting Papers | 4 |
| Reports - Evaluative | 2 |
| Guides - Classroom - Teacher | 1 |
| Reports - Descriptive | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Secondary Education | 2 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Grade 3 | 1 |
| Grade 5 | 1 |
| Grade 7 | 1 |
| Higher Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Postsecondary Education | 1 |
Audience
| Researchers | 1 |
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Rindermann, Heiner; Baumeister, Antonia E. E. – International Journal of Testing, 2015
Scholastic tests regard cognitive abilities to be domain-specific competences. However, high correlations between competences indicate either high task similarity or a dependence on common factors. The present rating study examined the validity of 12 Programme for International Student Assessment (PISA) and Third or Trends in International…
Descriptors: Test Validity, Test Interpretation, Competence, Reading Tests
Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014
In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…
Descriptors: Generalizability Theory, Measurement, Reliability, Correlation
Moses, Tim – ETS Research Report Series, 2013
The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…
Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Martinez, Jose Felipe; Stecher, Brian; Borko, Hilda – Educational Assessment, 2009
In this study we use data from the Early Childhood Longitudinal Survey third- and fifth-grade samples to investigate teacher judgments of student achievement, the extent to which they offer a similar picture of student mathematics achievement compared to standardized test scores, and whether classroom assessment practices moderate the relationship…
Descriptors: Mathematics Achievement, Standardized Tests, Grade 5, Student Evaluation
Kane, Michael T.; Brennan, Robert L. – 1977
A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…
Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Stafford, Richard E. – Journal of Educational Measurement, 1971 (peer reviewed)
Descriptors: Correlation, Statistical Analysis, Test Interpretation, Test Reliability
Test Service Bulletin, 1952
Some aspects of test reliability are discussed. Topics covered are: (1) how high should a reliability coefficient be?; (2) two factors affecting the interpretation of reliability coefficients--range of talent and interval between testings; (3) some common misconceptions--reliability of speed tests, part vs. total reliability, reliability for what…
Descriptors: Bulletins, Correlation, Scores, Statistical Analysis
Schulman, Robert S. – Psychometrika, 1978 (peer reviewed)
Ordinal measurement is the rank ordering of individuals in a population. For ordinal measurement, the concept of an individual propensity distribution is his or her true score. Estimation of the distribution, as well as other aspects of it, is discussed. (Author/JKS)
Descriptors: Correlation, Measurement, Nonparametric Statistics, Probability
Werts, C. E.; And Others – Educational and Psychological Measurement, 1978 (peer reviewed)
A procedure for estimating the reliability of a factorially complex composite is considered. An application of its use with Scholastic Aptitude Test data is provided. (Author/JKS)
Descriptors: Correlation, Factor Analysis, Mathematical Models, Matrices
Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984 (peer reviewed)
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores
Noble, Gilbert H. – Educational and Psychological Measurement, 1977 (peer reviewed)
A computer program providing comprehensive test and item analysis is presented. The program, written in Fortran and emphasizing ease of use, completes its work in a single run. It integrates various statistical techniques for analyzing individual items and the overall test, and it generates a variety of standard scores. (Author/JKS)
Descriptors: Computer Programs, Correlation, Equated Scores, Item Analysis
Piersel, Wayne C.; Santos, Lande – Perceptual and Motor Skills, 1982 (peer reviewed)
Comparison of the Goodenough-Harris and McCarthy scoring procedures for 60 kindergarten children's drawings yielded substantial agreement between the two scoring systems. The streamlined McCarthy scoring system should be used when large numbers of children are being evaluated within short periods of time. (Author)
Descriptors: Comparative Analysis, Correlation, Diagnostic Tests, Kindergarten