ERIC - Search Results

Showing 1 to 15 of 21 results

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of the difficulty and discriminating values of binary items, and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby the test reliability, as per definition, using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Measurement Error Correction Formula for Cluster-Level Group Differences in Cluster Randomized and Observational Studies

Peer reviewed

Cho, Sun-Joo; Preacher, Kristopher J. – Educational and Psychological Measurement, 2016

Multilevel modeling (MLM) is frequently used to detect cluster-level group differences in cluster randomized trials and observational studies. Group differences on the outcomes (posttest scores) are detected by controlling for the covariate (pretest scores) as a proxy variable for unobserved factors that predict future attributes. The pretest and…

Descriptors: Error of Measurement, Error Correction, Multivariate Analysis, Hierarchical Linear Modeling

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Teaching Introductory Measurement: Suggestions for What to Include and How to Motivate Students

Peer reviewed

Bandalos, Deborah L.; Kopp, Jason P. – Educational Measurement: Issues and Practice, 2012

In this article, we discuss the importance of measurement literacy and some issues encountered in teaching introductory measurement courses. We present results from a survey of introductory measurement instructors, including information about the topics included in such courses and the amount of time spent on each. Topics that were included by the…

Descriptors: Class Activities, Motivation Techniques, Item Analysis, Test Theory

Validity and the Consequences of Test Interpretation and Use

Peer reviewed

Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011

The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…

Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation

Shadow Education: Theory, Analysis and Future Directions--A Rejoinder

Peer reviewed

Buchmann, Claudia; Condron, Dennis J.; Roscigno, Vincent J. – Social Forces, 2010

The authors welcome and appreciate the comments of Eric Grodsky and Sigal Alon on their article "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment." In their comments, Grodsky takes issue with several important theoretical and methodological aspects of their article and Alon highlights key processes…

Descriptors: Race, Educational Mobility, Test Preparation, College Entrance Examinations

Learning in the Shadows and in the Light of Day: A Commentary on "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment"

Peer reviewed

Grodsky, Eric – Social Forces, 2010

Buchmann, Condron and Roscigno argue in their article, "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment," that the activities in which students engage to prepare for college entrance exams are forms of shadow education, a means by which more advantaged parents seek to pass their privileged status along…

Descriptors: Enrollment, Criticism, Research Problems, Test Preparation

Reliability of Total Test Scores When Considered as Ordinal Measurements

Peer reviewed

Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

Descriptors: True Scores, Test Theory, Test Reliability, Scores
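
For readers unfamiliar with the coefficient the abstract above builds on, the following is a minimal sketch of Kendall's tau-a itself: concordant minus discordant pairs, divided by the total number of pairs. It is not the ordinal reliability measure developed in the article, which the truncated abstract does not specify; the function name `kendall_tau_a` and the sample data are illustrative assumptions.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / (n * (n - 1) / 2).

    Tied pairs count as neither concordant nor discordant, and the
    denominator is NOT adjusted for ties (that adjustment is tau-b).
    Illustrative helper only; not the article's reliability measure.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    n = len(x)
    if n < 2:
        raise ValueError("need at least two observations")

    concordant = 0
    discordant = 0
    # Compare every unordered pair of examinees once.
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1

    return (concordant - discordant) / (n * (n - 1) / 2)


if __name__ == "__main__":
    # Hypothetical total scores of five examinees on two test forms.
    form_a = [12, 15, 9, 20, 17]
    form_b = [11, 16, 10, 19, 15]
    print(kendall_tau_a(form_a, form_b))
```

The pairwise loop is quadratic in the number of examinees, which is acceptable for the small-population setting the abstract mentions.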

Is Reliability Obsolete? A Commentary on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Collins, Linda M. – Applied Psychological Measurement, 1996

The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)

Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Basic Concepts in Classical Test Theory: Relating Variance Partitioning in Substantive Analyses to the Same Process in Measurement Analyses.

Dawson, Thomas E. – 1997

The basic processes in univariate statistics involve partitioning the sum of squares into two components: explained and within. This paper explains that the same partitioning occurs in measurement analyses, i.e., splitting the sum of squares into reliable and unreliable components. In addition, it is shown how the three types of error inherent in…

Descriptors: Estimation (Mathematics), Measurement Techniques, Scores, Statistical Analysis

Classical Test Theory and Item Response Theory: Analytical and Empirical Comparisons.

Hwang, Dae-Yeop – 2002

This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislevy and D. Bock, 1997). The example was a 15-item test with a sample size of 600…

Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions

The Psychometric Paradox of Practice Effects Due to Retesting: Measurement Invariance and Stable Ability Estimates in the Face of Observed Score Changes

Peer reviewed

Reeve, Charlie L.; Lam, Holly – Intelligence, 2005

The simple practice effects commonly observed when retaking general cognitive ability tests present a potential paradox. If observed score changes reflect real changes in g, we must revisit our understanding of its stability. Conversely, if observed score changes reflect something other than a true change in the underlying latent construct, this…

Descriptors: Psychometrics, Cognitive Ability, Cognitive Measurement, Test Theory

On Interpreting Test Scores as Social Indicators: Statistical Considerations.

Peer reviewed

Spencer, Bruce D. – Journal of Educational Measurement, 1983

Because test scores are ordinal, not cardinal, attributes, the average test score is often a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…

Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis

Linear Dependence of Gain Scores on Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement
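
The dependence described in the abstract above can be illustrated with the standard classical test theory formula for the reliability of a difference score, assuming measurement errors on the two occasions are uncorrelated. This is a sketch of that textbook formula, not necessarily the exact expression used in the article; the function and parameter names are illustrative assumptions.

```python
def gain_score_reliability(sd_pre, sd_post, rel_pre, rel_post, r_pre_post):
    """Classical reliability of the gain D = post - pre.

    Assumes uncorrelated measurement errors on the two occasions:

        rel_D = (sd_pre^2 * rel_pre + sd_post^2 * rel_post
                 - 2 * r * sd_pre * sd_post)
                / (sd_pre^2 + sd_post^2 - 2 * r * sd_pre * sd_post)
    """
    covariance = r_pre_post * sd_pre * sd_post
    # True-score variance of the gain (numerator).
    true_var = sd_pre ** 2 * rel_pre + sd_post ** 2 * rel_post - 2 * covariance
    # Observed-score variance of the gain (denominator).
    observed_var = sd_pre ** 2 + sd_post ** 2 - 2 * covariance
    if observed_var <= 0:
        raise ValueError("observed gain variance must be positive")
    return true_var / observed_var


if __name__ == "__main__":
    # Hypothetical values: equal SDs, then a larger posttest SD.
    print(gain_score_reliability(1.0, 1.0, 0.8, 0.8, 0.5))
    print(gain_score_reliability(1.0, 2.0, 0.8, 0.8, 0.5))
```

With equal standard deviations, component reliabilities of 0.8, and a pretest-posttest correlation of 0.5, the gain reliability is about 0.6; doubling the posttest standard deviation raises it to about 0.67, consistent with the point that low reliability is likely under common conditions but not inherent.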

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement
