Publication Date
| Date range | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 26 |
| Since 2007 (last 20 years) | 90 |
Descriptor
| Descriptor | Records |
| --- | --- |
| True Scores | 416 |
| Error of Measurement | 121 |
| Test Reliability | 110 |
| Statistical Analysis | 107 |
| Mathematical Models | 97 |
| Item Response Theory | 87 |
| Correlation | 76 |
| Equated Scores | 76 |
| Reliability | 64 |
| Test Theory | 52 |
| Test Items | 51 |
Audience
| Audience | Records |
| --- | --- |
| Researchers | 12 |
| Practitioners | 2 |
| Administrators | 1 |
| Teachers | 1 |
Location
| Location | Records |
| --- | --- |
| Australia | 1 |
| Canada | 1 |
| China | 1 |
| Colorado | 1 |
| Illinois | 1 |
| Israel | 1 |
| New York | 1 |
| Oregon | 1 |
| Taiwan | 1 |
| Texas | 1 |
| United Kingdom (England) | 1 |
Laws, Policies, & Programs
| Law, Policy, or Program | Records |
| --- | --- |
| Elementary and Secondary… | 1 |
Ree, Malcolm James – 1978
Item characteristic curve (ICC) theory describes the relationship between the ability of individuals and the probability of their answering a test question correctly; it is useful in estimating test scores, equating the scores of various tests, and scoring responses during adaptive testing. A simulation study of the effectiveness of the following…
Descriptors: Ability, Comparative Analysis, Computer Programs, Item Analysis
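
The logistic ICC models recurring in these records have a standard closed form: the probability of a correct response is a lower asymptote c plus a logistic rise governed by discrimination a and difficulty b. A minimal sketch of the three-parameter version (the parameter values are illustrative, not taken from Ree's simulation):

```python
import numpy as np

def icc_3pl(theta, a, b, c, D=1.7):
    """Three-parameter logistic item characteristic curve.
    theta: ability; a: discrimination; b: difficulty;
    c: pseudo-guessing lower asymptote;
    D = 1.7 scales the logistic to approximate the normal ogive."""
    return c + (1.0 - c) / (1.0 + np.exp(-D * a * (theta - b)))

# Probability that an average-ability examinee (theta = 0) answers a
# moderately difficult item correctly (illustrative parameters).
print(icc_3pl(theta=0.0, a=1.2, b=0.5, c=0.2))
```

The one- and two-parameter logistic models compared in the Douglass study below are the special cases c = 0 (2PL) and, additionally, a held constant across items (1PL).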
Douglass, James B. – 1981
Methods and results relevant to the introduction of item characteristic curve (ICC) models into classroom achievement testing are provided. The overall objective was to compare several common ICC models for item calibration and test equating in a classroom examination system. Parameters for the one-, two- and three-parameter logistic ICC models…
Descriptors: Academic Achievement, Comparative Analysis, Difficulty Level, Equated Scores
deGruijter, Dato N. M. – 1980
The setting of standards involves subjective value judgments. The inherent arbitrariness of specific standards has been severely criticized by Glass. His antagonists agree that standard setting is a judgmental task but they have pointed out that arbitrariness in the positive sense of serious judgmental decisions is unavoidable. Further, small…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Kolen, Michael J. – 1980
Results from equipercentile, linear, and latent trait equating of the vocabulary and quantitative thinking tests of the Iowa Tests of Educational Development were compared. The study entailed both the equating of forms (of similar difficulty) and the equating of levels (of differing difficulty). The goal was to equate seventh edition tests to…
Descriptors: Achievement Tests, Difficulty Level, Equated Scores, Guessing (Tests)
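
The two non-IRT methods Kolen compares have compact textbook forms: linear equating matches the first two moments of the score distributions, while equipercentile equating maps each score to the score on the other form with the same percentile rank. A minimal sketch on hypothetical random-groups samples (no smoothing, which operational equipercentile equating would normally add):

```python
import numpy as np

def linear_equate(x_scores, y_scores, x):
    """Linear equating: match means and standard deviations,
    y = mu_Y + (sd_Y / sd_X) * (x - mu_X)."""
    mu_x, sd_x = np.mean(x_scores), np.std(x_scores)
    mu_y, sd_y = np.mean(y_scores), np.std(y_scores)
    return mu_y + (sd_y / sd_x) * (x - mu_x)

def equipercentile_equate(x_scores, y_scores, x):
    """Equipercentile equating: map x to the form-Y score with the
    same percentile rank (unsmoothed in this sketch)."""
    p = np.mean(np.asarray(x_scores) <= x)   # percentile rank of x on form X
    return np.quantile(y_scores, p)          # matching quantile on form Y

rng = np.random.default_rng(0)
x_scores = rng.binomial(40, 0.60, size=500)  # hypothetical form X scores
y_scores = rng.binomial(40, 0.55, size=500)  # hypothetical, harder form Y
print(linear_equate(x_scores, y_scores, x=25))
print(equipercentile_equate(x_scores, y_scores, x=25))
```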
Gleser, Leon Jay – 1971
An attempt is made to indicate why the concept of "true score" naturally leads to the belief that test validity must increase with an increase in test and/or average item reliability, and why this is correct for the classical single-factor model first introduced by Spearman. The statistical model used by Loevinger is introduced to…
Descriptors: Factor Analysis, Item Analysis, Mathematical Models, Measurement Techniques
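
The classical mechanism behind that belief is Spearman's attenuation relation: the observed-score correlation is the true-score correlation shrunk by the square roots of the two reliabilities, so raising either reliability raises the attainable validity. In standard classical test theory notation (a textbook identity, not a formula quoted from Gleser's paper):

```latex
% Correction for attenuation: observed correlation equals the
% true-score correlation damped by the two reliabilities.
\rho_{XY} = \rho_{T_X T_Y}\,\sqrt{\rho_{XX'}\,\rho_{YY'}}
\quad\Longrightarrow\quad
|\rho_{XY}| \le \sqrt{\rho_{XX'}\,\rho_{YY'}}
```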
Kristof, Walter – 1971
We concern ourselves with the hypothesis that two variables have a perfect disattenuated correlation, hence measure the same trait except for errors of measurement. This hypothesis is equivalent to saying, within the adopted model, that true scores of two psychological tests satisfy a linear relation. Statistical tests of this hypothesis are…
Descriptors: Analysis of Covariance, Comparative Analysis, Correlation, Error of Measurement
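
Kristof's hypothesis of a perfect disattenuated correlation amounts to the true-score correlation in the identity above equaling 1. The disattenuated correlation itself is the observed correlation divided by the square root of the product of the two reliabilities; a minimal sketch with illustrative inputs:

```python
import math

def disattenuated_correlation(r_xy, r_xx, r_yy):
    """Correct an observed correlation for measurement error:
    rho(T_X, T_Y) = r_XY / sqrt(r_XX' * r_YY')."""
    return r_xy / math.sqrt(r_xx * r_yy)

# Illustrative values: observed correlation .72, reliabilities .81 and .64.
# A result of 1.0 is consistent with the two tests measuring the same trait.
print(disattenuated_correlation(0.72, 0.81, 0.64))  # -> 1.0
```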
Yen, Wendy M. – Journal of Educational Measurement, 1984 (peer reviewed)
A procedure for obtaining maximum likelihood trait estimates from number-correct (NC) scores for the three-parameter logistic model is presented. It produces an NC score to trait estimate conversion table. Analyses in the estimated true score metric confirm the conclusions made in the trait metric. (Author/DWH)
Descriptors: Achievement Tests, Error of Measurement, Estimation (Mathematics), Latent Trait Theory
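
One common device for building such a conversion table is to invert the test characteristic curve, the sum of the item ICCs, so that each number-correct score maps to the ability at which the expected number-correct score equals it. The sketch below uses that device with made-up item parameters; it illustrates the idea of an NC-to-trait table rather than reproducing Yen's exact maximum likelihood procedure:

```python
import numpy as np
from scipy.optimize import brentq

def p3pl(theta, a, b, c, D=1.7):
    return c + (1 - c) / (1 + np.exp(-D * a * (theta - b)))

def tcc(theta, items):
    """Test characteristic curve: expected number-correct score."""
    return sum(p3pl(theta, *item) for item in items)

def nc_to_theta(nc, items, lo=-6.0, hi=6.0):
    """Invert the TCC to map a number-correct score to a trait estimate."""
    return brentq(lambda t: tcc(t, items) - nc, lo, hi)

# Hypothetical (a, b, c) parameters for a five-item test.
items = [(1.0, -1.0, 0.20), (1.2, -0.5, 0.20), (0.8, 0.0, 0.25),
         (1.5, 0.5, 0.15), (1.1, 1.0, 0.20)]
# Conversion table for NC scores between the chance floor and the ceiling.
for nc in (2, 3, 4):
    print(nc, round(nc_to_theta(nc, items), 3))
```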
Feldt, Leonard S.; Spray, Judith A. – Research Quarterly for Exercise and Sport, 1983 (peer reviewed)
The reliabilities of two types of measurement plans were compared across six hypothetical distributions of true scores or abilities. The measurement plans were: (1) fixed-length, where the number of trials for all examinees is set in advance; and (2) trials-to-criterion, where examinees must keep trying until they complete a given number of trials…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Higher Education, Measurement Techniques
De Champlain, Andre F. – 1995
The dimensionality of one form of the Law School Admission Test (LSAT) was assessed with respect to three ethnic groups of test takers. Whether differences in the ability composite have any noticeable impact on item response theory (IRT) true score equating results for these subgroups (African Americans, Hispanic Americans, and Whites) was also…
Descriptors: Ability, Blacks, Equated Scores, Ethnic Groups
Livingston, Samuel A. – Journal of Educational Measurement, 1972 (peer reviewed)
A reliability coefficient for criterion-referenced tests is developed from the assumptions of classical test theory. The coefficient is based on deviations of scores from the criterion score, rather than from the mean. (Author/CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Mathematical Applications, Norm Referenced Tests
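
Livingston's coefficient replaces deviations from the mean with deviations from the criterion (cut) score C: with classical reliability r, observed-score variance s², and mean m, it is K² = (r·s² + (m − C)²) / (s² + (m − C)²). A minimal sketch with illustrative values:

```python
def livingston_k2(reliability, var_x, mean_x, criterion):
    """Livingston's criterion-referenced reliability coefficient:
    K^2 = (r * s^2 + (m - C)^2) / (s^2 + (m - C)^2).
    Reduces to the classical coefficient when the criterion equals the mean."""
    d2 = (mean_x - criterion) ** 2
    return (reliability * var_x + d2) / (var_x + d2)

# Illustrative: classical reliability .80, variance 25, mean 70, cut score 60.
print(livingston_k2(0.80, 25.0, 70.0, 60.0))  # (20 + 100) / (25 + 100) = 0.96
```

Harris's comment below reads the size of K² through the familiar dependence of reliability coefficients on the range of talent: moving the criterion away from the mean inflates the coefficient much as widening the score range would.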
Harris, Chester W. – Journal of Educational Measurement, 1972 (peer reviewed)
An alternative interpretation of Livingston's reliability coefficient (see TM 500 487) is based on the notion of the relation of the size of the reliability coefficient to the range of talent. (Author/CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Mathematical Applications, Norm Referenced Tests
McDonald, Roderick P. – Alberta Journal of Educational Research, 2003
The concept of a behavior domain is a reasonable and essential foundation for psychometric work based on true score theory, the linear model of common factor analysis, and the nonlinear models of item response theory. Investigators applying these models to test data generally treat the true scores or factors or traits as abstractive psychological…
Descriptors: Factor Analysis, Error of Measurement, True Scores, Psychometrics
Nugent, William R. – Educational and Psychological Measurement, 2006
One of the most important effect sizes used in meta-analysis is the standardized mean difference (SMD). In this article, the conditions under which SMD effect sizes based on different measures of the same construct are directly comparable are investigated. The results show that SMD effect sizes from different measures of the same construct are…
Descriptors: Effect Size, Meta Analysis, True Scores, Error of Measurement
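
The comparability question turns on how measurement error inflates the standardizing denominator. In classical test theory the observed-score standard deviation is the true-score standard deviation divided by the square root of the reliability, so the observed SMD is the true SMD shrunk by that factor. A minimal sketch of this standard derivation (whether it matches Nugent's exact conditions is an assumption):

```python
import math

def observed_smd(true_smd, reliability):
    """Attenuation of a standardized mean difference by measurement error:
    sigma_X = sigma_T / sqrt(r), hence d_obs = d_true * sqrt(r)."""
    return true_smd * math.sqrt(reliability)

# The same true effect (d = 0.5) measured with instruments of different
# reliability yields observed SMDs that are not directly comparable.
print(observed_smd(0.5, 0.90))  # ~0.474
print(observed_smd(0.5, 0.60))  # ~0.387
```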
Zwick, Rebecca; And Others – 1994
A previous simulation study of methods for assessing differential item functioning (DIF) in computer-adaptive tests (CATs) showed that modified versions of the Mantel-Haenszel and standardization methods work well with CAT data. In that study, data were generated using the three-parameter logistic (3PL) model, and this same model was assumed in obtaining item…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Computer Simulation
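
The Mantel-Haenszel method mentioned here stratifies examinees by a matched score and pools 2×2 tables (reference vs. focal group by right vs. wrong) across strata. A minimal sketch of the common odds ratio and the ETS delta-metric statistic, with hypothetical counts:

```python
import math

def mantel_haenszel_dif(strata):
    """Each stratum: (A, B, C, D) = (reference right, reference wrong,
    focal right, focal wrong) at one matched-score level.
    Returns the MH common odds ratio and the ETS MH D-DIF statistic."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    alpha = num / den
    return alpha, -2.35 * math.log(alpha)  # negative values flag DIF against the focal group

# Hypothetical counts at three matched-score levels.
strata = [(30, 10, 25, 15), (40, 20, 30, 30), (20, 30, 12, 38)]
alpha, d_dif = mantel_haenszel_dif(strata)
print(round(alpha, 3), round(d_dif, 3))
```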
Reckase, Mark D.; And Others – 1985
Factor analysis is the traditional method for studying the dimensionality of test data. However, under common conditions, the factor analysis of tetrachoric correlations does not recover the underlying structure of dichotomous data. The purpose of this paper is to demonstrate that the factor analysis of tetrachoric correlations is unlikely to…
Descriptors: Correlation, Difficulty Level, Factor Analysis, Item Analysis

