ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	6

Descriptor

Correlation	32
Test Interpretation	32
Test Reliability	26
Statistical Analysis	12
Scores	9
Item Analysis	7
Factor Analysis	6
Test Validity	6
Error of Measurement	5
Measurement Techniques	5
Psychometrics	5
Reliability	5
Achievement Tests	4
Comparative Analysis	4
Intelligence Tests	4
Scoring	4
Test Construction	4
Test Results	4
Academic Achievement	3
Adolescents	3
Aptitude Tests	3
Criterion Referenced Tests	3
Intelligence Quotient	3
Interrater Reliability	3
Mathematical Models	3
More ▼

Source

Educational and Psychological…	5
Psychometrika	2
Contemporary Educational…	1
Creativity Research Journal	1
ETS Research Report Series	1
Educational Assessment	1
International Journal of…	1
J Educ Meas	1
Journal of Early Adolescence	1
Journal of Educational…	1
Perceptual and Motor Skills	1
Psychology in the Schools	1
Test Service Bulletin	1
More ▼

Publication Type

Reports - Research	13
Journal Articles	11
Speeches/Meeting Papers	4
Reports - Evaluative	2
Guides - Classroom - Teacher	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 5	1
Grade 7	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1

Audience

Researchers

Location

Canada	1
Germany	1
Michigan	1

Laws, Policies, & Programs

Assessments and Surveys

McCarthy Scales of Childrens…	2
ACT Assessment	1
Early Childhood Longitudinal…	1
Goodenough Harris Drawing Test	1
Iowa Tests of Basic Skills	1
Metropolitan Achievement Tests	1
Program for International…	1
Trends in International…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Direct link

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Validating the Interpretations of PISA and TIMSS Tasks: A Rating Study

Peer reviewed

Direct link

Rindermann, Heiner; Baumeister, Antonia E. E. – International Journal of Testing, 2015

Scholastic tests regard cognitive abilities to be domain-specific competences. However, high correlations between competences indicate either high task similarity or a dependence on common factors. The present rating study examined the validity of 12 Programme for International Student Assessment (PISA) and Third or Trends in International…

Descriptors: Test Validity, Test Interpretation, Competence, Reading Tests

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim – ETS Research Report Series, 2013

The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)

Is What You See What You Really Get? Comparison of Scoring Techniques in the Assessment of Real-World Divergent Thinking

Peer reviewed

Direct link

Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014

In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…

Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods

Classroom Assessment Practices, Teacher Judgments, and Student Achievement in Mathematics: Evidence from the ECLS

Peer reviewed

Direct link

Martinez, Jose Felipe; Stecher, Brian; Borko, Hilda – Educational Assessment, 2009

In this study we use data from the Early Childhood Longitudinal Survey third- and fifth-grade samples to investigate teacher judgments of student achievement, the extent to which they offer a similar picture of student mathematics achievement compared to standardized test scores, and whether classroom assessment practices moderate the relationship…

Descriptors: Mathematics Achievement, Standardized Tests, Grade 5, Student Evaluation

Agreement Coefficients as Indices of Dependability for Domain-Referenced Tests. ACT Technical Bulletin No. 28.

Download full text

Kane, Michael T.; Brennan, Robert L. – 1977

A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…

Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores

Essay Reliability: Form and Meaning.

Download full text

Shale, Doug – 1986

This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…

Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests

The Speededness Quotient: A New Descriptive Statistic for Tests

Peer reviewed

Stafford, Richard E. – Journal of Educational Measurement, 1971

Descriptors: Correlation, Statistical Analysis, Test Interpretation, Test Reliability

Reliability and Confidence.

Download full text

Test Service Bulletin, 1952

Some aspects of test reliability are discussed. Topics covered are: (1) how high should a reliability coefficient be?; (2) two factors affecting the interpretation of reliability coefficients--range of talent and interval between testings; (3) some common misconceptions--reliability of speed tests, part vs. total reliability, reliability for what…

Descriptors: Bulletins, Correlation, Scores, Statistical Analysis

Individual Distributions Under Ordinal Measurement

Peer reviewed

Schulman, Robert S. – Psychometrika, 1978

Ordinal measurement is the rank ordering of individuals in a population. For ordinal measurement, the concept of an individual propensity distribution is his or her true score. Estimation of, as well as other aspects of the distribution, are discussed. (Author/JKS)

Descriptors: Correlation, Measurement, Nonparametric Statistics, Probability

A General Method of Estimating the Reliability of a Composite.

Peer reviewed

Werts, C. E.; And Others – Educational and Psychological Measurement, 1978

A procedure for estimating the reliability of a factorially complex composite is considered. An application of its use with Scholastic Aptitude Test data is provided. (Author/JKS)

Descriptors: Correlation, Factor Analysis, Mathematical Models, Matrices

Regression Effects on Part Scores Based on Whole-Score Selected Samples.

Peer reviewed

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984

Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)

Descriptors: Correlation, Intelligence Tests, Profiles, Scores

GUIDCOUN: A Comprehensive FORTRAN IV Computer Program for Generating Item and Test Analyses as Well as a Complete Standard Scores Distribution

Peer reviewed

Noble, Gilbert H. – Educational and Psychological Measurement, 1977

A computer program providing comprehensive test and item analysis is presented. Completing its performance on one run, the program, written in Fortran and emphasizing ease of use, integrates various statistical techniques for analyzing individual items and the overall test, in addition to generating a variety of standard scores. (Author/JKS)

Descriptors: Computer Programs, Correlation, Equated Scores, Item Analysis

Comparison of McCarthy and Goodenough-Harris Scoring Systems for Kindergarten Children's Human Figure Drawings.

Peer reviewed

Piersel, Wayne C.; Santos, Lande – Perceptual and Motor Skills, 1982

Comparison of the Goodenough-Harris and McCarthy scoring procedures for 60 kindergarten children's drawings yielded substantial agreement between the two scoring systems. The streamlined McCarthy scoring system should be utilized when large numbers of children are being evaluated with short periods of time. (Author)

Descriptors: Comparative Analysis, Correlation, Diagnostic Tests, Kindergarten

Previous Page | Next Page »

Pages: 1 | 2 | 3

Brennan, Robert L.	2
Baumeister, Antonia E. E.	1
Bayuk, Robert J.	1
Beck, Michael	1
Bethscheider, Janine K.	1
Borko, Hilda	1
Caroline M. Böhm	1
Chan, James Y.	1
Fan, Xitao	1
Harrington, Robert G.	1
Hogan, Thomas P.	1
Hopkins, Kenneth D.	1
Jennings, Valerie	1
Joreskog, K. G.	1
Kane, Michael F.	1
Kane, Michael T.	1
Katz, Martin R.	1
Koos, Eugenia M.	1
Lambrecht, Judith J.	1
Linn, Robert L.	1
Martinez, Jose Felipe	1
Mathis, Harry Ray	1
Moses, Tim	1
Nevo, Barukh	1
More ▼