ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	8

Descriptor

Scaling	37
Test Theory	37
Measurement Techniques	13
Latent Trait Theory	11
Mathematical Models	11
Equated Scores	10
Psychometrics	10
Test Construction	10
Test Items	8
Item Response Theory	7
Achievement Tests	6
Comparative Analysis	6
Educational Testing	6
Scores	6
Statistical Studies	6
Test Reliability	6
Educational Assessment	5
Evaluation Methods	5
Foreign Countries	5
Item Analysis	5
Test Interpretation	5
Testing Problems	5
Correlation	4
Difficulty Level	4
Research Methodology	4
More ▼

Source

Psychometrika	5
Applied Psychological…	2
Educational Measurement:…	2
Educational and Psychological…	2
Measurement:…	2
ACT, Inc.	1
Applied Measurement in…	1
Evaluation in Education: An…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Psychological Bulletin	1
More ▼

Publication Type

Reports - Research	22
Journal Articles	18
Speeches/Meeting Papers	9
Opinion Papers	6
Reports - Evaluative	6
Reports - Descriptive	3
Collected Works - General	2
Books	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Secondary Education	3
Higher Education	3
Elementary Education	2
Junior High Schools	2
Middle Schools	2
Postsecondary Education	2
Secondary Education	2
Early Childhood Education	1
Grade 2	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Primary Education	1
More ▼

Audience

Researchers

Location

Australia	1
Colorado	1
Florida	1
New York	1
North Carolina	1
Tennessee	1
Texas	1
United Kingdom	1
United Kingdom (England)	1
United Kingdom (Wales)	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	3
Comprehensive Tests of Basic…	2
ACT Assessment	1
Advanced Placement…	1
Armed Services Vocational…	1
Piers Harris Childrens Self…	1
Tennessee Self Concept Scale	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities

Peer reviewed
PDF on ERIC

Download full text

Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015

For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…

Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

A Family of Association Cofficients for Metric Scales.

Peer reviewed

Zegers, Frits E.; ten Berge, Jos M. F. – Psychometrika, 1985

Four types of metric scales are distinguished: absolute, ratio, difference, and interval. A general coefficient of association for two variables of the same scale type is developed which reduces to specific coefficients of association for each scale type. (NSF)

Descriptors: Correlation, Mathematical Models, Scaling, Test Theory

Axiomatic Foundations of a Three-Set Guttman Simplex Model with Applicability to Longitudinal Data.

Peer reviewed

Collins, Linda M.; Cliff, Norman – Psychometrika, 1985

The axioms of a three-set Guttman simplex model are presented and the effects of relaxing the axioms for one of the three sets are examined. This model can be used to define longitudinal developmental scales. (NSF)

Descriptors: Mathematical Models, Measurement Techniques, Scaling, Test Construction

Maximally Reliable Composites for Unidimensional Measures.

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 1980

Reliability maximizing weights are related to theoretically specified true score scaling weights to show a constant relationship that is invariant under separate linear tranformations on each variable in the system. Test theoretic relations should be derived for the most general model available and not for unnecessarily constrained models.…

Descriptors: Mathematical Formulas, Scaling, Test Reliability, Test Theory

Measurement Scales and Standard Systems in Psychology.

Aftanas, Marion S. – 1984

Most discussions of measurement theory are focused on "scales" of measurement, but it is not clear whether reference is made to the mechanisms of measurement or the metric information derived from measurement. This emphasis on scales in measurement theory has not always provided a meaningful or fruitful description of measurement activities in…

Descriptors: Measurement, Measurement Techniques, Measures (Individuals), Psychological Studies

A General Framework for Using Latent Class Analysis to Test Hierarchical and Nonhierarchical Learning Models.

Peer reviewed

Rindskopf, David – Psychometrika, 1983

Various models have been proposed for analyzing dichotomous test or questionnaire items which were constructed to reflect an assumed underlying structure (e.g., hierarchical). This paper shows that many such models are special cases of latent class analysis and discusses a currently available computer program to analyze them. (Author/JKS)

Descriptors: Computer Programs, Item Analysis, Mathematical Models, Measurement Techniques

Problems, Perspectives, and Practical Issues in Equating.

Peer reviewed

Brennan, Robert L.; And Others – Applied Psychological Measurement, 1988

Seven papers on technical and practical issues in equating are presented. Problems related to the use of conventional and item response theory equating methods, using pre- and post-smoothing to increase equipercentile equating's precision, and linear equating models for common-item nonequivalent-population design are discussed. (SLD)

Descriptors: Equated Scores, Latent Trait Theory, Research Problems, Scaling

Multidimensional Scaling versus Components Analysis of Test Intercorrelations.

Peer reviewed

Davison, Mark L. – Psychological Bulletin, 1985

Considers the relationship between coordinate estimates in components analysis and multidimensional scaling. Reports three small Monte Carlo studies comparing nonmetric scaling solutions to components analysis. Results are related to other methodological issues surrounding research on the general ability factor, response tendencies in…

Descriptors: Ability, Monte Carlo Methods, Personnel Evaluation, Scaling

Previous Page | Next Page »

Pages: 1 | 2 | 3

Brennan, Robert L.	2
Eignor, Daniel R.	2
Price, Gary G.	2
Aftanas, Marion S.	1
Almehrizi, Rashid S.	1
Batchelder, William H.	1
Beretvas, S. Natasha	1
Choppin, Bruce	1
Cliff, Norman	1
Coffman, William E.	1
Collins, Linda M.	1
Conger, Anthony J.	1
Cook, Linda L.	1
Cresswell, Mike	1
Cui, Zhongmin	1
Davison, Mark L.	1
Elosua, Paula	1
Fang, Yu	1
Fitzpatrick, Steven J.	1
Foong, Yoke-Yeen	1
France, Stephen L.	1
Haley, Kathleen	1
Hambleton, Ronald K.	1
Harnisch, Delwyn L.	1
More ▼